For engineers   AWS batch pipeline


About the AWS batch pipeline category (1)
[SOLVED] S3DistCp is not deleting files on the first ETL step (5)
Empty S3 shredded logs after successful EmrEtlRunner job (6)
Exception in EMR step loading data into Redshift (9)
Intermittent EMR failure: Unable to find a region via the region provider chain (4)
Handling large volumes of duplicated event_ids (4)
Error in Raw S3 -> Raw HDFS Step (1)
Monitoring S3 Loader (2)
Shredded/bad-rows output directory already exists (18)
Storage loader code: where does it get data from shredded/good? (3)
Problems with Enrich / EMR process provisioning instances (3)
Monitoring Snowplow (4)
Transient rdbloader error: [Amazon](60000) Error setting/closing connection (4)
EMR Job - Failing with SSL handshake? (5)
Enrich problem: "Error writing row" (1)
Unique network_userid count mismatch (2)
EMR Additional Security Groups (2)
Incorrect IP Address in the batch pipeline (1)
Processing a big file in EMR or split it up? (3)
Executors lost and disconnecting in EMR (5)
./snowplow-emr-etl-runner: rule 24: exec: java: not found (5)
ETL RDB Loader Error (5)
Shredding fails with custom schema in eu-central-1 (6)
Repopulate a single table? (6)
java.nio.file.FileAlreadyExistsException: ./ip_geo (8)
Enable Ganglia on Snowplow EMR clusters (4)
Spark memory woes (2)
Having issues with config.yaml and Contract Violation (5)
EmrEtlRunner config.yml, cloudfront format (2)
Enriched good and bad buckets are empty in the enrich (8)