Open TSienki opened 1 year ago
I used apache_beam 2.53.0 and this bug still exists.
It's a common use case to need the metadata of a BigQuery table in a query.
Not only INFORMATION_SCHEMA but also _partitiondate is required in many cases. For example,
select _partitiondate as partition_name, * from xxxx
The same query can be executed successfully using the BigQuery client library. The reason I want to do it with Apache Beam is that I want to convert complex SQL joins to PTransforms so that they can be executed in parallel to improve performance.
However, I have to work around it for now.
Hopefully it can be fixed as soon as possible.
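For anyone who needs the same workaround in the meantime, here is a minimal sketch of one possible approach: run the metadata query with the BigQuery client library from inside a Beam step instead of going through ReadFromBigQuery. The helper names (partitions_query, fetch_rows, query_on_worker, run_in_beam) and the selected columns are illustrative assumptions, not part of Beam or of the original report.

```python
from typing import Any, Dict, List


def partitions_query(project_id: str, dataset_id: str) -> str:
    """Build a standard SQL query over the dataset's INFORMATION_SCHEMA.PARTITIONS view."""
    return (
        "SELECT table_name, partition_id, total_rows "
        f"FROM `{project_id}.{dataset_id}.INFORMATION_SCHEMA.PARTITIONS`"
    )


def fetch_rows(client: Any, sql: str) -> List[Dict[str, Any]]:
    """Run sql with a google-cloud-bigquery style client and return rows as dicts."""
    return [dict(row.items()) for row in client.query(sql).result()]


def query_on_worker(sql: str) -> List[Dict[str, Any]]:
    """Hypothetical helper: create the client where the step runs, then query."""
    from google.cloud import bigquery  # imported lazily, only needed at run time

    return fetch_rows(bigquery.Client(), sql)


def run_in_beam(sql: str, pipeline_args=None) -> None:
    """Emit the query results as a PCollection of dicts and print them."""
    import apache_beam as beam  # imported lazily so the helpers work without Beam

    with beam.Pipeline(argv=pipeline_args) as p:
        (
            p
            | "Queries" >> beam.Create([sql])
            | "RunQuery" >> beam.FlatMap(query_on_worker)
            | "Print" >> beam.Map(print)
        )
```

Calling run_in_beam(partitions_query("my-project", "my_dataset")) would run the query as a single element, so the query itself is not parallelized; only the steps downstream of it are. On Dataflow, the helper functions would also need to be importable on the workers, for example by packaging them in a module.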
What happened?
Hello, I wanted to read partitions metadata from the BigQuery table project_id.dataset_id.INFORMATION_SCHEMA.PARTITIONS using ReadFromBigQuery. Unfortunately, this function raises an error:
The part of the code that causes the error:
I replaced my actual project id and dataset id with the tokens [project_id] and [dataset_id]. I've tested it with Beam versions 2.36.0 and 2.43.0 using the direct and Dataflow runners. I also tried running it with different values for arguments like method or use_standard_sql, but it doesn't help.
Do you know if it is possible to read from this table using ReadFromBigQuery?
Issue Priority
Priority: 2 (default / most bugs should be filed as P2)
Issue Components