I am looking into snowplow currently. From what I understand from the documentation and the code repos, is that the following formats are used to encode an (enriched) event:
- query-string + json between client → collector
- thirft between collector → enrich
- tsv between enrich → storage / processors
- json between storage / processors →
i am wondering if my picture correct / complete.
I saw comments in the code that there is/was an intent to move to avro. I assume in order to reduce the number of formats, currently in use.
Is somewhere up to date information about that transition ?
I am wondering if this transition is still indented to be executed ?