nlp-uoregon / trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Apache License 2.0
724 stars 99 forks source link

UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 223: character maps to <undefined> #66

Open MustafaCeyhan opened 1 year ago

MustafaCeyhan commented 1 year ago

I try to use Tpipeline for custom training for Arabic dependency parser. But arabic conllu files for training in the training_config is giving me the charmap error. How can i solve this problem.