Open realmarcin opened 3 years ago
The data is from this paper: Exploring the functional composition of the human microbiome using a hand-curated microbial trait database https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-021-04216-2
The dataset itself is Additional File 1: https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-021-04216-2#MOESM1
To start we could perform NER with the same dictionaries as Madin et al - so NCBI Taxonomy, ENVO, ECOCORE, ChEBI.
There are additional numerical columns of interest here beyond what Madin et al provided:
OK, here is the drill:
See https://docs.google.com/document/d/1iEsLp9pDvjGjgWMSLArtNf6Jwan-wjMl6_viQGyTWG8/edit# for approach
The data is from this paper: Exploring the functional composition of the human microbiome using a hand-curated microbial trait database https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-021-04216-2
The dataset itself is Additional File 1: https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-021-04216-2#MOESM1
To start we could perform NER with the same dictionaries as Madin et al - so NCBI Taxonomy, ENVO, ECOCORE, ChEBI.
There are additional numerical columns of interest here beyond what Madin et al provided: