Reprocessing / Rerunning logs from IGLU server failure for unstructured events

would just rerunning data and having dupes in redshift and then using this tutorial work?
http://discourse.snowplow.io/t/de-deduplicating-events-in-hadoop-and-redshift-tutorial/248

i don’t think de-duplication in enrichment would work since we’ve never had it on and it only works for batch runs right? and doesn’t look at what’s in redshift vs what’s being run in ETL etc. we’d have to run all the logs since 6/14 as one big batch for it to work? and even then we’d still have dupes in redshift from previous good loads.