Closed bogdankostic closed 3 years ago
As stated here, I tried to generate the NQ table dataset by executing the following command:
python3 tapas/scripts/preprocess_nq.py \ --input_path="gs://natural_questions/v1.0" \ --output_path="gs://${GCP_BUCKET}/nq_tables" \ --runner_type="DATAFLOW" \ --gc_project="${GCP_PROJECT}" \ --gc_region="us-west1" \ --gc_job_name="create-intermediate" \ --gc_staging_location="gs://${GCP_BUCKET}/staging" \ --gc_temp_location="gs://${GCP_BUCKET}/tmp" \ --extra_packages=dist/tapas-table-parsing-0.0.1.dev0.tar.gz
After some time, this results in the following error:
NameError: name 'beam' is not defined [while running 'Parse']
This is easily solved by adding the option --save_main_session, so I would suggest changing this command to:
--save_main_session
python3 tapas/scripts/preprocess_nq.py \ --input_path="gs://natural_questions/v1.0" \ --output_path="gs://${GCP_BUCKET}/nq_tables" \ --runner_type="DATAFLOW" \ --gc_project="${GCP_PROJECT}" \ --gc_region="us-west1" \ --gc_job_name="create-intermediate" \ --gc_staging_location="gs://${GCP_BUCKET}/staging" \ --gc_temp_location="gs://${GCP_BUCKET}/tmp" \ --extra_packages=dist/tapas-table-parsing-0.0.1.dev0.tar.gz \ --save_main_session
Great point, we added the save_main_session precisely for this reason but we forgot to added the to the doc. Will add it for the next release
This has been updated, thanks a lot!
As stated here, I tried to generate the NQ table dataset by executing the following command:
After some time, this results in the following error:
This is easily solved by adding the option
--save_main_session
, so I would suggest changing this command to: