n-waves / multifit

The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning" https://arxiv.org/abs/1909.04761
MIT License
282 stars 56 forks source link

Missing File in CLS-DE.ipynb #73

Closed PGrothaus closed 4 years ago

PGrothaus commented 4 years ago

Hi,

first of all thanks for sharing your code, this is great work! I am interested in using your models and am playing around with the CLS-DE.ipynb notebook file that you provide to fine-tune on a dataset.

Everything works smoothly until I run into a missing file error:

> cls_dataset.load_clas_databunch(bs=exp.finetune_lm.bs).show_batch()

Running tokenization: 'lm-notst' ...

---------------------------------------------------------------------------

FileNotFoundError                         Traceback (most recent call last)

<ipython-input-13-51e69f217e1f> in <module>()
----> 1 cls_dataset.load_clas_databunch(bs=exp.finetune_lm.bs).show_batch()

11 frames

/usr/local/lib/python3.6/dist-packages/pandas/io/parsers.py in __init__(self, src, **kwds)
   1889         kwds["usecols"] = self.usecols
   1890 
-> 1891         self._reader = parsers.TextReader(src, **kwds)
   1892         self.unnamed_cols = self._reader.unnamed_cols
   1893 

pandas/_libs/parsers.pyx in pandas._libs.parsers.TextReader.__cinit__()

pandas/_libs/parsers.pyx in pandas._libs.parsers.TextReader._setup_parser_source()

FileNotFoundError: [Errno 2] File data/cls/de-music/de.train.csv does not exist: 'data/cls/de-music/de.train.csv'

Can you offer any help on where one can get that file, or, other steps that I may be missing?

Thank you!

PGrothaus commented 4 years ago

Just notices there is a closed issue for this. Sorry...