Q: Training / Ground Truth data?

DataSeer / dataseer-ml

DataSeer machine-learning service

Apache License 2.0

25 stars 2 forks source link

Q: Training / Ground Truth data? #21

Open tfmorris opened 1 year ago

tfmorris commented 1 year ago

There are a few references to a training set of, variously, 3000 or 4000 documents, but I'm not seeing them. Are they in a separate repo? Or some other place like Zenodo?

kermitt2 commented 1 year ago

Hi @tfmorris,

Normally it's 3000 annotated documents for this work. However this dataset was not released publicly.