I’d like to use Snowplow in the following way:
apps sending events -> scala collector -> kinesis -> kinesis S3 sink -> S3
Raw events will be collected and stored on S3 for some time, until the analytics processing module is ready.
For now, I simply want to make sure those events can be read with Java and that they contain all the data needed.
My unarchived log file consists of the following sections
- Do I understand correctly that this file contains a list of events in Thrift format, wrapped in Elephant Bird?
- Is there any code available (Java/Scala) for reading events from S3 after they have been written by kinesis-s3?
- What is the recommended way to process S3 data written by kinesis-s3?
I’m not a Scala guy, so reading the kinesis-s3 sources hasn’t helped me solve this puzzle.