coiled / dask-bigquery

BSD 3-Clause "New" or "Revised" License
43 stars 13 forks source link

``read_gbq`` casts string columns to objects #85

Closed phofl closed 3 months ago

phofl commented 3 months ago

the big query reader casts string columns in the to_pandas call to object, which is bad for memory, Gil contention and performance. We should utilise the same strategy as for read_parquet over in dask to preserve the string dtype when the option is set.