Migrating from Redshift to Snowflake

iain · June 6, 2018, 4:25pm

We’re looking at migrating our Redshift events and derived data across to Snowflake. What’s the best way to achieve this? Would we be best to copy the tables from Redshift, or to use the new Snowflake schema and copy the event data from the original enriched data stored in our archive folder on S3?

anton · June 7, 2018, 3:49am

Hi @iain,

At least with non-derived data I’d go with re-processing enriched events archive.

Enriched archive is our ultimate source of truth and different storage targets apply own appropriate transformations before loading, which means in Snowflake enriched data structured in a very different way. Thus it would take a lot of efforts (or probably even impossible) to transform data unloaded from Redshift into Snowflake.

iain · June 10, 2018, 11:04am

Thanks Anton.

Am I right in thinking that if you’re only using Snowflake, then shredding is no long a part of the pipeline, just enrichment?

anton · June 11, 2018, 7:26am

Hey @iain! Yes, you’re right about that. You can think about shredding as of “DB-specific transformation” I mentioned above. So, shredding is Redshift-specific transformation and Snowflake has its own performed by Snowflake transformer. In other words we’re just swapping RDB Shredder with Snowflake Transformer.

Topic		Replies	Views
Any plans to introduce RDB shredding mode to Snowflake? Snowflake	8	1012	May 4, 2022
How Snowplow data is structured in Snowflake Snowflake	5	4466	May 8, 2020
Storing the unstructured events in Redshift	9	1437	January 27, 2020
Can we use spectrum to query shredded data instead of enriched? For data modelers & consumers	1	2039	September 27, 2017
Normalize Atomic event dbt redshift Open source verison Enrichment	4	178	March 18, 2024

Migrating from Redshift to Snowflake

Related Topics