Open MrAsimZahid opened 2 years ago
Try with transformers==4.15.0
, that's what I've got.
It worked perfectly. Thank you so much!
On Wed, Jun 8, 2022, 2:19 PM Attila Szász @.***> wrote:
Try with transformers==4.15.0, that's what I've got.
— Reply to this email directly, view it on GitHub https://github.com/aiforsec/CyNER/issues/4#issuecomment-1149672430, or unsubscribe https://github.com/notifications/unsubscribe-auth/AFTL2ZL2HODB2IO4XL2AHOTVOBQTZANCNFSM5YFSDRAA . You are receiving this because you authored the thread.Message ID: @.***>
Again, facing the same issue but in some other section of code. Could you please help with the resolution? @tilusnet
Traceback (most recent call last):
File "/home/yasir/Documents/Projects/Blog_Data_Extraction/prototype/test_CyNER/v2/CyNER/run.py", line 10, in <module>
model.train()
File "/home/yasir/Documents/Projects/Blog_Data_Extraction/prototype/test_CyNER/v2/CyNER/cyner/transformers_ner.py", line 52, in train
trainer.train(monitor_validation=True)
File "/home/yasir/Documents/Projects/Blog_Data_Extraction/prototype/test_CyNER/v2/CyNER/cyner/tner/model.py", line 292, in train
self.__setup_model_data(self.args.dataset, self.args.lower_case)
File "/home/yasir/Documents/Projects/Blog_Data_Extraction/prototype/test_CyNER/v2/CyNER/cyner/tner/model.py", line 155, in __setup_model_data
self.transforms = Transforms(self.args.transformers_model, cache_dir=self.cache_dir)
File "/home/yasir/Documents/Projects/Blog_Data_Extraction/prototype/test_CyNER/v2/CyNER/cyner/tner/tokenizer.py", line 38, in __init__
self.tokenizer = transformers.AutoTokenizer.from_pretrained(transformer_tokenizer, cache_dir=cache_dir)
File "/home/yasir/anaconda3/envs/blogsIntel/lib/python3.9/site-packages/transformers/models/auto/tokenization_auto.py", line 550, in from_pretrained
return tokenizer_class_fast.from_pretrained(pretrained_model_name_or_path, *inputs, **kwargs)
File "/home/yasir/anaconda3/envs/blogsIntel/lib/python3.9/site-packages/transformers/tokenization_utils_base.py", line 1747, in from_pretrained
return cls._from_pretrained(
File "/home/yasir/anaconda3/envs/blogsIntel/lib/python3.9/site-packages/transformers/tokenization_utils_base.py", line 1882, in _from_pretrained
tokenizer = cls(*init_inputs, **init_kwargs)
File "/home/yasir/anaconda3/envs/blogsIntel/lib/python3.9/site-packages/transformers/models/xlm_roberta/tokenization_xlm_roberta_fast.py", line 139, in __init__
super().__init__(
File "/home/yasir/anaconda3/envs/blogsIntel/lib/python3.9/site-packages/transformers/tokenization_utils_fast.py", line 108, in __init__
fast_tokenizer = TokenizerFast.from_file(fast_tokenizer_file)
Exception: EOF while parsing a string at line 1 column 8862550
From the
CyNER Demo.ipynb
I have tried to train the model but I get this error.Output
Unable to pinpoint where the problem is occurring. Could you help me with this? Thank you.