Intermittent EMR failure: Unable to find a region via the region provider chain


#1

EMR jobs are failing intermittently but increasingly with this error:
“”"
com.amazon.ws.emr.hadoop.fs.shaded.com.amazonaws.SdkClientException: Unable to find a region via the region provider chain. Must provide an explicit region in the builder or setup environment to supply a region.
“”"
(Plus lots more Java traceback blather.)

We’re seeing this once every week or two, out of running snowplow-emr-etl-runner six times a day.

Seems like it can happen at any stage.

Has anyone else run into this problem? Intermittents are the worst.


#2

If it’s intermittent this seems unlikely to be a Snowplow problem. Have you tried filing a ticket with AWS support to see if they are able to provide any more information?


#3

Have you tried filing a ticket with AWS

That’s our next move. Just thought I’d check here in case it’s a Known Issue.

Thanks.


#4

I haven’t seen this exact issue before but I’ve seen similar intermittent EMR failures (and in the past it’s been related to EMR internal errors).