'DistilBertTokenizer' object has no attribute 'max_len'

AIPHES / emnlp19-moverscore

MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance

MIT License

192 stars 31 forks source link

'DistilBertTokenizer' object has no attribute 'max_len' #17

Closed mymusise closed 3 years ago

mymusise commented 3 years ago

Got an error when running the example:

from moverscore_v2 import plot_example
reference = 'they are now equipped with air conditioning and new toilets.'
translation = 'they have air conditioning and new toilets.'
plot_example(True, reference, translation)

I think maybe the API has changed, model_max_length now.

mymusise commented 3 years ago

fix in https://github.com/AIPHES/emnlp19-moverscore/pull/18

sarwart commented 3 years ago

I am currently using this on Google colab. Can you please be more specific how to solve this on colab? I am using the latest transformers. I got an error when I tried the mentioned example in the following line

"idf_dict_hyp = get_idf_dict(translations)"

Also, how can I specify a different mode like ClinicalBERT to compute the moverscore?

Tabinda

mymusise commented 3 years ago

@sarwart try my branch: !pip uninstall moverscore !pip install https://github.com/mymusise/emnlp19-moverscore/archive/master.zip

To use a different mode like ClinicalBERT, just define an env like this:

import os 
os.environ['MOVERSCORE_MODEL'] = "emilyalsentzer/Bio_ClinicalBERT"
# or 
# os.environ['MOVERSCORE_MODEL'] = "albert-base-v2"
# and so on
from moverscore_v2 import get_idf_dict
idf_dict_hyp = get_idf_dict(translations)

andyweizhao commented 3 years ago

@mymusise, thanks a lot! The PR is merged. Cheers :-)