dmis-lab / BioSyn

ACL'2020: Biomedical Entity Representations with Synonym Marginalization
https://arxiv.org/abs/2005.00239
MIT License

Request to add preprocessing scripts for TAC2017ADR sets and dictionaries. #1

Closed by tutubalinaev 4 years ago

tutubalinaev commented 4 years ago

Dear authors,

Could you please share the preprocessing scripts and MedDRA dictionaries for the TAC2017ADR dump? Thank you in advance!

mjeensung commented 4 years ago

Hello tutubalinaev,

We've just uploaded the preprocessing scripts for the TAC2017ADR dataset and the MedDRA dictionary. If you run into any problems, please let me know.

https://github.com/dmis-lab/BioSyn/tree/master/preprocess#how-to-pre-process-datasets-and-dictionaires

tutubalinaev commented 4 years ago

Dear @mjeensung, thank you for the scripts. I successfully preprocessed the TAC2017ADR dataset as well as the MedDRA 19.0 files.