gluent / goe

GOE: a simple and flexible way to copy data from an Oracle Database to Google BigQuery.
Apache License 2.0
BigQuery offloads should consider seeding sort columns with source primary key #170

Open nj1973 opened 2 months ago

nj1973 commented 2 months ago

It is good practice to add source primary key columns as cluster columns in BigQuery.

If the user does not explicitly specify --sort-columns, we could add the primary key columns as a default.
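A minimal sketch of that defaulting rule, not taken from the GOE code base: the function name, its parameters and the constant are hypothetical. It assumes the primary key column names have already been read from the source database, and it caps the default at four columns because BigQuery allows at most four clustering columns per table.

```python
from typing import List, Optional, Sequence

# BigQuery permits at most four clustering columns on a table.
BIGQUERY_MAX_CLUSTER_COLUMNS = 4


def default_sort_columns(
    user_sort_columns: Optional[Sequence[str]],
    primary_key_columns: Sequence[str],
    max_columns: int = BIGQUERY_MAX_CLUSTER_COLUMNS,
) -> List[str]:
    """Return the sort (cluster) columns to use for an offload.

    Hypothetical helper: an explicit user choice always wins; otherwise
    fall back to the leading primary key columns, truncated to the limit.
    """
    if user_sort_columns:
        return list(user_sort_columns)
    return list(primary_key_columns[:max_columns])
```

With no explicit `--sort-columns` value, `default_sort_columns(None, ["ORDER_ID", "LINE_NO"])` would return both key columns, while any user-supplied list is passed through unchanged.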

While doing this we could think about secondary indexes and whether they give us a clue as to potentially important columns for clustering.
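One possible heuristic for the secondary index idea, offered only as a starting point for discussion: fill any clustering slots left after the primary key with the leading column of each secondary index. The function name and input shape (one list of column names per index, in index column order) are assumptions, not existing GOE structures.

```python
from typing import List, Sequence


def extend_with_index_columns(
    sort_columns: Sequence[str],
    secondary_indexes: Sequence[Sequence[str]],
    max_columns: int = 4,  # BigQuery's limit on clustering columns
) -> List[str]:
    """Append leading secondary index columns until the limit is reached.

    Hypothetical heuristic: only the first column of each index is
    considered, and columns that are already present are skipped.
    """
    result = list(sort_columns)
    for index_columns in secondary_indexes:
        if len(result) >= max_columns:
            break
        if not index_columns:
            continue
        leading = index_columns[0]
        if leading not in result:
            result.append(leading)
    return result
```

Whether leading index columns are actually good clustering candidates would need checking against real query patterns; the sketch only shows how candidates could be gathered.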