kermitt2 / grobid

A machine learning software for extracting information from scholarly documents
https://grobid.readthedocs.io
Apache License 2.0
3.56k stars 456 forks source link

Centralise annotation guidelines for training data in the doc #209

Open kermitt2 opened 7 years ago

kermitt2 commented 7 years ago

We need to centralize all the annotation guidelines in a single place easy to manage and edit. Let's move all the annotation guidelines (now docx/pdf) under doc/ in markdown format so that it will be accessible from the documentation on readthedocs.io in a new section.

kermitt2 commented 7 years ago

work in progress with @jfix ...