I am aiming to speed up query performance in Redshift. One way I aim to do that is to do indexing. I wonder what column to pick to do indexing? I read around snowplow’s github issue page and one way to do this in Postgres is to use
dvce_created_tstamp to index.
Any more suggestion beside those?
EDIT: After further research about redshift, found out from this reading that there is no actual ‘indexing’ in redshift and that setting up
SORTKEY is what considered as setting index. We already have collector_tstamp and event_id used as key. I would like to edit my question as is there any more distkey/sortkey that can be used for indexing?