Unique network_userid count mismatch


#1

There is a mismatch between the distinct network_userids in elasticsearch and redshift. The network_userid count in redshift (batch pipeline) is almost 70-80% higher than the distinct network_userids in elasticsearch (real time pipeline).

Am I missing out on something?


#2

Hi @ramandamodar,

Is your Elasticsearch able to handle data throughput? Does it have sufficient disk space?