pytorch / text

Models, data loaders and abstractions for language processing, powered by PyTorch
https://pytorch.org/text
BSD 3-Clause "New" or "Revised" License
3.51k stars 810 forks source link

Multi30k.splits(exts=('.fr', '.de'), fields=(SRC, TRG)) still failing with [Errno 2] No such file or directory: '.data/multi30k/train.fr' #1101

Open amitkayal opened 3 years ago

amitkayal commented 3 years ago

🐛 Bug

Describe the bug Hi All,

This issue still seems to be happening for me in colab.

The following one train_data, valid_data, test_data = Multi30k.splits(exts=('.de', '.en'), fields=(SRC, TRG)) works while train_data, valid_data, test_data = Multi30k.splits(exts=('.fr', '.de'), fields=(SRC, TRG)) still failing.

Error Details Here is the backtrace..

FileNotFoundError Traceback (most recent call last)

in () ----> 1 train_data, valid_data, test_data = Multi30k.splits(exts=('.fr', '.de'), fields=(SRC, TRG)) 2 frames /usr/local/lib/python3.6/dist-packages/torchtext/datasets/translation.py in splits(cls, exts, fields, root, train, validation, test, **kwargs) 112 113 return super(Multi30k, cls).splits( --> 114 exts, fields, path, root, train, validation, test, **kwargs) 115 116 /usr/local/lib/python3.6/dist-packages/torchtext/datasets/translation.py in splits(cls, exts, fields, path, root, train, validation, test, **kwargs) 64 65 train_data = None if train is None else cls( ---> 66 os.path.join(path, train), exts, fields, **kwargs) 67 val_data = None if validation is None else cls( 68 os.path.join(path, validation), exts, fields, **kwargs) /usr/local/lib/python3.6/dist-packages/torchtext/datasets/translation.py in __init__(self, path, exts, fields, **kwargs) 32 33 examples = [] ---> 34 with io.open(src_path, mode='r', encoding='utf-8') as src_file, \ 35 io.open(trg_path, mode='r', encoding='utf-8') as trg_file: 36 for src_line, trg_line in zip(src_file, trg_file): FileNotFoundError: [Errno 2] No such file or directory: '.data/multi30k/train.fr' **Environment** Collecting environment information... PyTorch version: 1.7.0+cu101 Is debug build: True CUDA used to build PyTorch: 10.1 ROCM used to build PyTorch: N/A OS: Ubuntu 18.04.5 LTS (x86_64) GCC version: (Ubuntu 7.5.0-3ubuntu1~18.04) 7.5.0 Clang version: 6.0.0-1ubuntu2 (tags/RELEASE_600/final) CMake version: version 3.12.0 Python version: 3.6 (64-bit runtime) Is CUDA available: True CUDA runtime version: 10.1.243 GPU models and configuration: GPU 0: Tesla T4 Nvidia driver version: 418.67 cuDNN version: /usr/lib/x86_64-linux-gnu/libcudnn.so.7.6.5 HIP runtime version: N/A MIOpen runtime version: N/A Versions of relevant libraries: [pip3] numpy==1.18.5 [pip3] torch==1.7.0+cu101 [pip3] torchsummary==1.5.1 [pip3] torchtext==0.3.1 [pip3] torchvision==0.8.1+cu101 [conda] Could not collect torchtext version is 0.3.1
zhangguanheng66 commented 3 years ago

Could you put together a short code snippet so I could reproduce the error?