Open-sourced Redshift Extended tools

Good afternoon,

busuu has decided to open source their current suite of Redshift functions.

These extra functions include some really useful tools:

  1. Statistical functions to assess the significance and confidence of conversion differences between user cohorts in your Snowplow event data.
  2. Extra datetime functions that are clean, fail gracefully on dirty data and can make your plotting better.
  3. JSON functions that can help you get NoSQL-style functionality from your Redshift cluster.
  4. Data type validation and filtering, so that you only analyse clean data of the types you want.
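
To give a flavour of item 1, here is a minimal sketch of the kind of calculation a conversion significance function might perform: a pooled two-proportion z-test written in plain Python, the language Redshift scalar UDFs use. The function name `conversion_significance` and its parameters are hypothetical and chosen for illustration; they are not taken from the repo, whose actual functions and signatures may differ.

```python
import math


def conversion_significance(conv_a, total_a, conv_b, total_b):
    """Two-sided p-value for the difference in conversion rate between
    two cohorts, using a pooled two-proportion z-test.

    Hypothetical helper for illustration only. Returns None instead of
    raising when the inputs cannot be tested (empty cohort, zero variance),
    in the spirit of failing gracefully on dirty data.
    """
    if not total_a or not total_b or total_a <= 0 or total_b <= 0:
        return None

    # float() keeps the division correct under Python 2 as well as Python 3
    p_a = float(conv_a) / total_a
    p_b = float(conv_b) / total_b
    pooled = float(conv_a + conv_b) / (total_a + total_b)

    # Standard error of the difference under the null hypothesis p_a == p_b
    se = math.sqrt(pooled * (1.0 - pooled) * (1.0 / total_a + 1.0 / total_b))
    if se == 0:
        return None

    z = (p_b - p_a) / se
    # Two-sided tail probability of the standard normal distribution
    return math.erfc(abs(z) / math.sqrt(2.0))


if __name__ == "__main__":
    # Cohort A: 100 conversions from 1000 users; cohort B: 150 from 1000
    print(conversion_significance(100, 1000, 150, 1000))
```

A small p-value (for example below 0.05) suggests the difference in conversion between the two cohorts is unlikely to be due to chance alone.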

Feel free to add them to your own Redshift clusters, leave any comments or feedback in the repo's issues, or fork the repo and put in a PR for some new functions.

This is awesome - thanks so much for sharing @brucey31!

Interesting, thanks!