issues
search
bigscience-workshop
/
biomedical
Tools for curating biomedical training data for large-scale language modeling
447
stars
114
forks
source link
Create dataset loader for CafeteriaSA
#868
Open
mariosaenger
opened
1 year ago
mariosaenger
commented
1 year ago
Adding a Dataset
Name:
CafeteriaSA
Description:
Annotated corpus of 500 scientific abstracts from PubMed that consists of 6407 annotated food entities
Task:
NER,NED
Paper:
https://doi.org/10.1093/database/baac107
Data:
Zenodo-Link
License:
CC-4.0-International
Motivation:
Interesting entity type (food entries) for which only few corpora exist
Adding a Dataset