allenai / scitldr

https://scitldr.apps.allenai.org/
Apache License 2.0

Error Running make_datafiles.sh for AIC #10

Open anshoomehra opened 3 years ago

anshoomehra commented 3 years ago

Authors, great work! My organisation sees huge value in investing further in your research.

However, I am stalled by the error below when running make_datafiles.sh. Can you help?

It seems some library versions conflict and break the code, specifically fairseq?

Running:
cd SciTLDR-Data
export TASK=SciTLDR-AIC
chmod +x make_datafiles.sh
./make_datafiles.sh # BPE preprocess

Errors (the same traceback is printed for every pool worker, e.g. ForkPoolWorker-73260 to ForkPoolWorker-73263):

Process ForkPoolWorker-73260:
Traceback (most recent call last):
  File "/usr/lib/python3.6/multiprocessing/process.py", line 258, in _bootstrap
    self.run()
  File "/usr/lib/python3.6/multiprocessing/process.py", line 93, in run
    self._target(*self._args, **self._kwargs)
  File "/usr/lib/python3.6/multiprocessing/pool.py", line 103, in worker
    initializer(*initargs)
  File "/jupyter_mount/scitldr/SciTLDR-Data/multiprocessing_bpe_encoder.py", line 96, in initializer
    bpe = get_encoder(self.args.encoder_json, self.args.vocab_bpe)
  File "/usr/local/lib/python3.6/dist-packages/fairseq/data/encoders/gpt2_bpe_utils.py", line 132, in get_encoder
    with open(encoder_json_path, "r") as f:
FileNotFoundError: [Errno 2] No such file or directory: 'encoder.json'
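
The FileNotFoundError in the traceback shows the relative path 'encoder.json' being opened from a working directory that does not contain that file, which may point to a missing file rather than a fairseq version conflict. A minimal sketch for checking this before re-running the script follows. `missing_bpe_files` is a hypothetical helper, not part of scitldr or fairseq, and `vocab.bpe` is an assumed file name inferred from the `vocab_bpe` argument in the traceback.

```python
from pathlib import Path


def missing_bpe_files(workdir, names=("encoder.json", "vocab.bpe")):
    """Return the entries of `names` that are not regular files in `workdir`.

    The default names are assumptions: 'encoder.json' comes from the error
    message, 'vocab.bpe' is guessed from the `vocab_bpe` argument.
    """
    base = Path(workdir)
    return [name for name in names if not (base / name).is_file()]


if __name__ == "__main__":
    # Run this from the directory where make_datafiles.sh is executed,
    # because the encoder opens the files by relative path.
    cwd = Path.cwd()
    missing = missing_bpe_files(cwd)
    if missing:
        print("Missing in", cwd, "->", ", ".join(missing))
    else:
        print("All expected BPE files are present in", cwd)
```

If the check reports missing files, it may be worth confirming whether an earlier step of make_datafiles.sh that is supposed to provide them failed or ran in a different directory.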