allenai / scispacy

A full spaCy pipeline and models for scientific/biomedical documents.
https://allenai.github.io/scispacy/
Apache License 2.0
1.69k stars 226 forks source link

How to extract all species names in a document? #468

Closed bpersevere closed 1 year ago

bpersevere commented 1 year ago

Can we use scispaCy to extract all species names in a document? The species name here refers to the scientific name of the species, also known as binomal names or Linnean names. For example, Ganoderma lucidum.

nleguillarme commented 1 year ago

Okay, this is self-promotion, but you can look at TaxoNERD which relies heavily on scispacy and is specifically designed to extract taxonomic entities.

bpersevere commented 1 year ago

Thank you very much for your help.

andyjessen commented 1 year ago

Great work @nleguillarme!

dakinggg commented 1 year ago

Thanks for sharing @nleguillarme!