Living-with-machines / DeezyMatch

A Flexible Deep Learning Approach to Fuzzy String Matching
https://living-with-machines.github.io/DeezyMatch/

Issues with dataset_path in dm_train functions #140

Open Limsy21 opened 1 year ago

Limsy21 commented 1 year ago

UnicodeDecodeError: 'charmap' codec can't decode byte 0x90 in position 1366: character maps to <undefined>

I can't resolve this error. I edited the source code in data_processing.py and saved it, but the changes don't seem to take effect: the behaviour is unchanged and the error persists.
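One possible reason an edit to a source file has no effect is that Python imports a different copy of the package (for example, one installed into site-packages) than the one that was edited. This is only a guess about what happened here. The sketch below uses a hypothetical helper, `module_source_path`, to print the file Python would actually load for a given module name. It is demonstrated on the standard-library `json` module so that it runs anywhere; the module name to check for DeezyMatch is left to the reader.

```python
import importlib.util


def module_source_path(module_name):
    """Return the file Python would load for module_name, or None if not found.

    Hypothetical helper for illustration; it is not part of DeezyMatch.
    """
    spec = importlib.util.find_spec(module_name)
    if spec is None:
        return None
    return spec.origin


# Demonstrated on a standard-library module so the example is self-contained.
json_path = module_source_path("json")
print(json_path)

# A top-level module that is not installed yields None.
missing_path = module_source_path("surely_not_an_installed_module_xyz")
print(missing_path)
```

If the printed path is not the file that was edited, the changes would need to be made in that file instead, or the package reinstalled from the edited source.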

I also tried a different dataset, one that is UTF-8 encoded, and hit the same issue.
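For anyone trying to reproduce this: a 'charmap' error on byte 0x90 is what Python raises when UTF-8 bytes are decoded as cp1252, which is a common default on Windows when a file is opened without an explicit `encoding`. The sketch below is a minimal reproduction under that assumption. It does not use DeezyMatch at all: `read_lines` is a hypothetical stand-in for whatever code reads the dataset, and the sample row is made up.

```python
import os
import tempfile


def read_lines(path, encoding=None):
    """Read a text file into a list of lines (hypothetical stand-in reader)."""
    with open(path, encoding=encoding) as handle:
        return handle.read().splitlines()


# Made-up sample row. "Đ" (U+0110) is encoded in UTF-8 as the bytes C4 90,
# and 0x90 has no mapping in cp1252.
sample = "Đakovo\tDjakovo\tTRUE\n"
with tempfile.NamedTemporaryFile("wb", suffix=".txt", delete=False) as tmp:
    tmp.write(sample.encode("utf-8"))
    path = tmp.name

# Decoding the UTF-8 file as cp1252 fails with a 'charmap' error.
decode_failed = False
try:
    read_lines(path, encoding="cp1252")
except UnicodeDecodeError as err:
    decode_failed = True
    print(err)

# Passing the encoding explicitly reads the same file without error.
rows = read_lines(path, encoding="utf-8")
print(rows)

os.remove(path)
```

If this is the cause, a dataset being saved as UTF-8 would not help on its own, because the failure depends on the encoding used when the file is read, not on how it was written.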