The documentation is not very clear on the preferred method for scheduling daily execution of emr-etl-runner and storage-loader. Should we be using cron and snowplow-runner-and-loader.sh? There are also blog posts about using cron/make, and now there is a new data pipeline runner called Factotum (which unfortunately I can’t use at the moment because my ETL executables live on an Ubuntu EC2 instance since I had trouble on Linux).
Can someone recommend the most straightforward process currently? I simply need to run emr-etl-runner followed by storage-loader, although in the near future I hope to add SQL Runner to the flow as well.