Open damccorm opened 2 years ago
Hey there! 👋 I'm new to this repository and eager to contribute! 🌟 Could you kindly suggest some entry point or files to look into?
Hey, saw you added this comment several places. I'd recommend focusing on a single issue at first (I answered the underlying question here - https://github.com/apache/beam/issues/20298#issuecomment-1547888993)
WriteToBigQuery
fails when using theFILE_LOADS
method in theBundleBasedDirectRunner
.The issue appears to be in
wait_for_bq_job
, where the function expectsjob_reference
to be an actual JobReference instance and not a string. However, theWaitForBQJobs
DoFn appears to be passing a string as the argument. I believe this is during the copy step, and I'm not calling this code directly (so unfortunately I can't just pass a TableReference instance myself).Here is a traceback:
Here is the
WriteToBigQuery
step that is failing (note that the callable passed fortable
returns a TableReference instance):Note that this issue does not occur when using the standard
DirectRunner
, nor does it occur when using theSTREAMING_INSERTS
method.Thanks! (And apologies if I left out any important information. This is the first issue I've opened here.)
Imported from Jira BEAM-12659. Original Jira may contain additional context. Reported by: milesmcc.