HBAP-Dataminds / COVID-19

Central location for all things related to COVID-19 including Kaggle competitions and other work

NER creating false positives #3

Open seangrant82 opened 4 years ago

seangrant82 commented 4 years ago

When processing the abstract data, the NER from scispaCy produces false positives and misidentifies entities and their types.

Example of misidentified portions of text: (image not preserved)

We need to figure out how to clean or remove these entries so the analysis isn't skewed.
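
One possible starting point is a rule-based post-filter applied to the extracted entities before analysis. The sketch below is only an illustration, not something already in this repo: the function name `filter_entities`, the `stoplist` contents, the `min_chars` threshold, and the example labels are all assumptions. It takes `(text, label)` pairs, which in a spaCy pipeline could be built with `[(ent.text, ent.label_) for ent in doc.ents]`, and drops entries that are too short, contain no letters, appear in a hand-maintained stoplist, or carry a label we don't care about.

```python
import re
from typing import Iterable, List, Optional, Set, Tuple

Entity = Tuple[str, str]  # (entity text, entity label)


def filter_entities(
    entities: Iterable[Entity],
    allowed_labels: Optional[Set[str]] = None,
    stoplist: Optional[Set[str]] = None,
    min_chars: int = 3,
) -> List[Entity]:
    """Drop entities that look like NER false positives (heuristic, hypothetical rules)."""
    stoplist = stoplist or set()
    kept: List[Entity] = []
    for text, label in entities:
        norm = text.strip().lower()
        # Very short spans are rarely meaningful entities.
        if len(norm) < min_chars:
            continue
        # Spans with no letters at all (e.g. "2019", "95%") are dropped.
        if not re.search(r"[a-z]", norm):
            continue
        # Hand-maintained list of generic words the model tends to mislabel.
        if norm in stoplist:
            continue
        # Optionally keep only the entity types relevant to the analysis.
        if allowed_labels is not None and label not in allowed_labels:
            continue
        kept.append((text, label))
    return kept


# Made-up example input; the labels are illustrative only.
example = [
    ("SARS-CoV-2", "DISEASE"),
    ("study", "CHEMICAL"),
    ("2019", "CHEMICAL"),
    ("remdesivir", "CHEMICAL"),
    ("of", "DISEASE"),
]
cleaned = filter_entities(
    example,
    allowed_labels={"DISEASE", "CHEMICAL"},
    stoplist={"study"},
)
print(cleaned)
```

A filter like this only removes obvious noise. It will not fix entities that were assigned the wrong type, so the stoplist and thresholds would need tuning against a manually reviewed sample of the abstracts.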