Hi Team, Thanks for your time to read this and any help would be much appreciated! I have set up a scala stream collector and it’s linked with SendGrid and Kinesis stream. I can get raw event data from SendGrid in S3. What I want to do next is loading them in Redshift. I have already created 11 ta…

SendGrid+Snowplow+AWS S3&Redshift

anton October 16, 2019, 9:06am 2

The next component you need is Stream Enrich that validates raw data and transforms it into a canonical TSV format. However this also isn’t the end if you also want to load data into Redshift because you’ll need RDB Loader and optionally EmrEtlRunner to orchestrate it.

What you want to have is so called Lambda Architecture, you can read more in its dedicated article:

Topic		Replies	Views
Shredding to Redshift in the Scala Collector Flow AWS batch pipeline (Legacy)	2	1898	September 24, 2017
Snowplow Kinesis to EmrEtl For engineers	4	1568	July 31, 2019
Scala Kinesis Enrich AWS real-time pipeline	9	2662	April 9, 2018
Enriched event stream into Redshift using Kinesis Firehose AWS real-time pipeline	7	5514	May 31, 2016
Is it possible to load data to Redshift after StreamEnricher? Storage targets	10	2595	September 12, 2018

SendGrid+Snowplow+AWS S3&Redshift

Related Topics