Sending unstructured events + Schemas

ihor · July 9, 2019, 4:32pm

JSON Schema is indeed used to validate data. However, it is not correct to say “so that the storage process knows where/how to save the event data”. If you mean Redshift, JSONPaths files are what instruct Redshift how to load the self-describing events and contexts (which are in JSON format after being shredded, as opposed to the TSV format used for atomic data).
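To make the JSONPaths idea more concrete, here is a rough Python sketch that builds such a file for one shredded type. The `$.schema.*` and `$.hierarchy.*` entries follow the layout I would expect in shredded JSONs, but treat them as an assumption and check them against your own shredded output; `build_jsonpaths` and the `productId`/`quantity` fields are made up for illustration.

```python
import json

# Metadata paths assumed to be present in every shredded JSON
# (verify against your own shredded output before relying on them).
COMMON_PATHS = [
    "$.schema.vendor",
    "$.schema.name",
    "$.schema.format",
    "$.schema.version",
    "$.hierarchy.rootId",
    "$.hierarchy.rootTstamp",
    "$.hierarchy.refRoot",
    "$.hierarchy.refTree",
    "$.hierarchy.refParent",
]


def build_jsonpaths(data_fields):
    """Return a JSONPaths document: one path per target table column, in column order."""
    return {"jsonpaths": COMMON_PATHS + [f"$.data.{field}" for field in data_fields]}


# Hypothetical event with two properties.
doc = build_jsonpaths(["productId", "quantity"])
print(json.dumps(doc, indent=4))
```

Redshift's COPY maps the paths to table columns by position, so the order of the entries has to match the column order of the target table.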

The following post might clarify shredding and loading to Redshift further: Home · snowplow/snowplow Wiki · GitHub.

There are quite a few tutorials on this forum. Here are some of them, as well as a wiki post to start with.

Setting up an Iglu server is very easy when it comes to the static one (there are different types to choose from). It is just storage on the web where the files (JSON schemas) are accessed via standard HTTP. On AWS, for example, just place the files in the appropriate folders in the S3 bucket and make it accessible over the internet.
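As a minimal sketch of what "the appropriate folders" means, the snippet below derives the object key for a self-describing schema, assuming the usual static repo layout of `schemas/<vendor>/<name>/<format>/<version>`. The vendor `com.acme`, the event `add_to_basket`, its properties, and the helper `schema_path` are hypothetical.

```python
import json
from pathlib import PurePosixPath


def schema_path(vendor, name, version, fmt="jsonschema"):
    """Key under which a static repo is assumed to serve this schema."""
    return str(PurePosixPath("schemas") / vendor / name / fmt / version)


# Hypothetical self-describing JSON schema for a custom event.
schema = {
    "$schema": "http://iglucentral.com/schemas/com.snowplowanalytics.self-desc/schema/jsonschema/1-0-0#",
    "description": "Example event (illustration only)",
    "self": {
        "vendor": "com.acme",
        "name": "add_to_basket",
        "format": "jsonschema",
        "version": "1-0-0",
    },
    "type": "object",
    "properties": {
        "productId": {"type": "string"},
        "quantity": {"type": "integer", "minimum": 1},
    },
    "required": ["productId", "quantity"],
    "additionalProperties": False,
}

meta = schema["self"]
key = schema_path(meta["vendor"], meta["name"], meta["version"], meta["format"])
body = json.dumps(schema, indent=2)  # file content to upload under `key`
print(key)
```

The file has no extension: the version (`1-0-0`) is the file name itself, and the tracker would then refer to it as `iglu:com.acme/add_to_basket/jsonschema/1-0-0`.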

The links I have already pointed to could be a good starting point. As Colm mentioned, Snowplow Mini is a good tool for testing your implementation of JSON schemas.
