Shredding EMR spark config (IOException: All datanodes ... are bad)

No problem! And thanks a lot for taking the time to answer the questions.

Unfortunately, I still do not understand why the EMR job failed in the first place. Initially, we started with an empty Redshift cluster and an empty shredded archive. If I am not mistaken, the shredder should detect that it has to shred the entire enriched archive, and indeed it tried to do so. The issue is that the shredder tried to process the same enriched runs twice (or possibly more times) within the same EMR run.
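For context, my mental model of the incremental discovery is roughly the following. This is a hypothetical Python sketch, not the actual shredder code; the folder names and the diffing logic are my own assumptions about how enriched runs without a shredded counterpart get selected:

```python
# Hypothetical sketch: the shredder compares run folders in the enriched
# archive against those already present in the shredded archive and
# processes only the difference. Names and paths are illustrative.

def unprocessed_runs(enriched_runs, shredded_runs):
    """Return enriched run folders that have no shredded counterpart."""
    return sorted(set(enriched_runs) - set(shredded_runs))

# With an empty shredded archive, every enriched run is selected:
enriched = ["run=2021-01-01-00-00-00", "run=2021-01-02-00-00-00"]
shredded = []
print(unprocessed_runs(enriched, shredded))
# → ['run=2021-01-01-00-00-00', 'run=2021-01-02-00-00-00']
```

Under this model, starting from empty archives should simply select every run once, which is why the repeated processing surprises me.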

The new RDB Loader creates the manifest table automatically, and that is presumably how it caught those errors when it received the same SQS message twice. Still, the question remains: why is the shredder processing the same files more than once at all?
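The way I imagine the manifest catching duplicates is something like the sketch below. This is a minimal Python/SQLite illustration of the general idea (a uniqueness constraint on the folder rejecting a second load attempt); the table and column names are made up and are not the real RDB Loader schema:

```python
# Hypothetical sketch of manifest-based deduplication: before loading a
# folder, record it in a manifest table with a unique constraint, so a
# second SQS message for the same folder is rejected.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE manifest (base_folder TEXT PRIMARY KEY)")

def try_load(folder):
    """Return True if the folder is new; False if it was already loaded."""
    try:
        conn.execute("INSERT INTO manifest (base_folder) VALUES (?)", (folder,))
        conn.commit()
        return True
    except sqlite3.IntegrityError:
        return False  # duplicate message: folder is already in the manifest

print(try_load("run=2021-01-01-00-00-00"))  # → True (first message loads)
print(try_load("run=2021-01-01-00-00-00"))  # → False (duplicate rejected)
```

So the loader side makes sense to me; it is the producer side (the shredder emitting the same run twice) that I cannot explain.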

Is that caused by a bad Spark configuration? Could the same batch be assigned to multiple executors for some reason?