-
请问你是怎么解决“def build_vocabulary(spacy_de, spacy_en):
def tokenize_de(text):
return tokenize(text, spacy_de)
def tokenize_en(text):
return tokenize(text, spacy_en)
pr…
-
Add the Multi30K datasets for multilingual image--sentence retrieval evaluation. The evaluation data is available in English, French, Czech, and German. The sentence data can be found on Github at htt…
-
when using the following method to create data
train, val, test = Multi30k.splits(exts=('.de', '.en'), fields=(DE, EN))
I got the following error message
------------------------------
//anacond…
-
ㅁ 오류난 행 :
train_dataset, valid_dataset, test_dataset = Multi30k.splits(exts = (".de", ".en"), fields = (SRC, TRG))
ㅁ 오류코드 :
[ssl: certificate_verify_failed] certificate verify failed: hostname mi…
-
The link to Multi30K dataset at `http://www.quest.dcs.shef.ac.uk/wmt16_files_mmt/training.tar.gz` is broken: https://github.com/pytorch/text/blob/73bf4fa8cedc12d910ab76190e446bd2e47a8325/torchtext/dat…
-
I'm new to transformer recently and don't know how to get the dataset in this project.
Please help me to provide a linux script if you can.
-
PermissionError: [Errno 13] Permission denied: '.data\\multi30k\\train.de_core_news_sm-2.3.0'
how can I solve it
-
Traceback (most recent call last):
File "/media/amax/836e911f-c5c3-4c4b-91f2-41bb8f3f5cb6/DATA/EventGroup3/why/pytorch-transformer-main/train.py", line 6, in
from dataset import en_preprocess…
-
How do we get files like test_2016_flickr.lc.norm.tok.de ? I don't think the download_data script gets this. I also looked at the multi30k dataset (https://github.com/multi30k/dataset) , but can't s…
-
Hi, thank you very much for the excellent work!
However, I am facing an error when I try to train the network with `train_mmt.sh`:
```
Traceback (most recent call last): …