allenai / scispacy

A full spaCy pipeline and models for scientific/biomedical documents.
https://allenai.github.io/scispacy/
Apache License 2.0
1.72k stars 229 forks source link

encoding="utf-8" #439

Closed NetRxn closed 2 years ago

NetRxn commented 2 years ago

Thanks for the amazing library!

Hit the following error using export_umls_json.py with UMLS 2022AA: 'charmap' codec can't decode byte 0x81 in position X: character maps to

Resolved by adding the utf-8 encoding throughout umls_utils.

dakinggg commented 2 years ago

@jgroehm The black commit doesn't look right. It doesn't contain any reformatting, just a stray space added. Can you try again? Command should be black candidate_generation.py --line-length 88 and then add and commit that change.

NetRxn commented 2 years ago

Sorry for the false start there, new to black, must not have set up the VSCode extension correctly. CLI appears to have done the trick. Thank you!

dakinggg commented 2 years ago

Hey @jgroehm, sorry I forgot about this. Merging now, thanks!