Closed richarddwang closed 4 years ago
I think you can just rm
this directory and it should be good :)
@lhoestq - this seems to happen quite often (already the 2nd issue). Can we maybe delete this automatically?
Yes I have an idea of what's going on. I'm sure I can fix that
Hi, I rebase my local copy to fix-empty-cache-dir
, and try to run again python nlp-cli test datasets/bookcorpus --save_infos --all_configs
.
I got this,
Traceback (most recent call last):
File "nlp-cli", line 10, in <module>
from nlp.commands.run_beam import RunBeamCommand
File "/home/yisiang/nlp/src/nlp/commands/run_beam.py", line 6, in <module>
import apache_beam as beam
ModuleNotFoundError: No module named 'apache_beam'
And after I installed it. I got this
File "/home/yisiang/nlp/src/nlp/datasets/bookcorpus/aea0bd5142d26df645a8fce23d6110bb95ecb81772bb2a1f29012e329191962c/bookcorpus.py", line 88, in _split_generators
downloaded_path_or_paths = dl_manager.download_custom(_GDRIVE_FILE_ID, download_file_from_google_drive)
File "/home/yisiang/nlp/src/nlp/utils/download_manager.py", line 128, in download_custom
downloaded_path_or_paths = map_nested(url_to_downloaded_path, url_or_urls)
File "/home/yisiang/nlp/src/nlp/utils/py_utils.py", line 172, in map_nested
return function(data_struct)
File "/home/yisiang/nlp/src/nlp/utils/download_manager.py", line 126, in url_to_downloaded_path
return os.path.join(self._download_config.cache_dir, hash_url_to_filename(url))
File "/home/yisiang/miniconda3/envs/nlppr/lib/python3.7/posixpath.py", line 80, in join
a = os.fspath(a)
The problem is when I print self._download_config.cache_dir
using pdb, it is None
.
Did I miss something ? Or can you provide a workaround first so I can keep testing my script ?
I'll close this issue because I brings more reports in another issue #249 .
Hi, I am trying to create Toronto Book Corpus. #131
I ran
~/nlp % python nlp-cli test datasets/bookcorpus --save_infos --all_configs
but this doesn't createdataset_info.json
and try to use itbtw,
ls /home/yisiang/.cache/huggingface/datasets/book_corpus/plain_text/1.0.0/
show me nothing is in the directory.I have also pushed the script to my fork bookcorpus.py.