NVIDIA / DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
13.29k stars 3.18k forks source link

How can I download BookCorpus without container? #250

Closed yifding closed 4 years ago

yifding commented 4 years ago

Since I am using the remote LINUX machine without root and container. I am trying to download the BookCorpus data and find it doesn't work. How can i find the "/workspace/bookcorpus/download_files.py" or where should I find the data?

def download(self):
    bookscorpus_download_command = 'python3 /workspace/bookcorpus/download_files.py --list /workspace/bookcorpus/url_list.jsonl --out'
    bookscorpus_download_command += ' ' + self.save_path + '/bookscorpus'
    bookscorpus_download_command += ' --trash-bad-count'
    bookscorpus_download_process = subprocess.run(bookscorpus_download_command, shell=True, check=True)
swethmandava commented 4 years ago

37

/workspace/bookscorpus is cloned from this repository https://github.com/soskek/bookcorpus

yifding commented 4 years ago

Thank you so much