Open jwzh222 opened 1 year ago
environment: windows 10 python 3.9.0 apache-beam 2.45.0
it will fail with both DirectRunner and DataflowRunner
.add-labels DirectRunner
Label DirectRunner cannot be managed because it does not exist in the repo. Please check your spelling.
Thanks for reporting this issue. It looks like indeed a bug that space is not considered here:
Use (s.strip() for s in field_and_type.split(':')) as L1534 should fix the problem
Would you interested in getting a fix?
What happened?
there is any issue in python SDK beam.io.WriteToBigQuery() when you add a space in schema, like schema="name: STRING", it will fail.
error message: "message": "Invalid value for type: STRING is not a valid value"
example code:
`import apache_beam as beam from apache_beam.options.pipeline_options import PipelineOptions
def run(): pipeline_args = [] pipeline_options = PipelineOptions(pipeline_args)
if name == 'main': pipeline_args = [ '--runner', 'DirectRunner', '--project', 'YOUR_PROJECT_ID', ]
Issue Priority
Priority: 3 (minor)
Issue Components