Open abhineet13 opened 7 years ago
I'm not sure there's much we can do, since we rely solely on spark-avro to write DataFrames as Avro files on GCS and then on the BigQuery load feature to import them. What's your bottleneck here? Have you made sure you have enough parallelism?
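To illustrate the parallelism point, here is a rough sketch, not a tested recipe. It assumes the input is gzip-compressed CSV. A single gzip file is not splittable, so Spark reads it in one task and, without a repartition, the Avro write would also run as one task. The bucket path, table name, and the partition count of 200 are placeholders, and the connector setup (billing project, staging bucket, credentials) is omitted.

```scala
import org.apache.spark.sql.SparkSession
import com.spotify.spark.bigquery._ // assumed: the connector that adds saveAsBigQueryTable

object CsvToBigQuery {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("csv-to-bq").getOrCreate()

    // Placeholder path. A single .gz file lands in one partition.
    val df = spark.read
      .option("header", "true")
      .csv("gs://my-bucket/input/data.csv.gz")

    // Check how many tasks the write stage would use as-is.
    println(s"partitions before: ${df.rdd.getNumPartitions}")

    // Spread the rows across executors so the Avro files are written in parallel.
    // 200 is an arbitrary starting point; tune it to the cluster size.
    val repartitioned = df.repartition(200)

    // Placeholder table spec in project:dataset.table form.
    repartitioned.saveAsBigQueryTable("my-project:my_dataset.my_table")
  }
}
```

The initial read of one gzip file stays single-threaded either way. Splitting the source into many smaller files before loading would probably help more than tuning the write.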
Hi,
I am trying to load a zipped CSV file from Google Cloud into BQ. The file is 100 GB, but the load is taking a long time.
Is there a way to tune the df.saveAsBigQueryTable command to speed up the load?