As of now we have a data import pipeline for pecha.org database. The pipeline takes in markdown text and populate the database by parsing the markdown text. But most of the data we have are in STAM format. In order to import those text in markdown, we need a serialiser to convert the STAM files into markdown so that we can import the data via existing pipeline.
Outputs
OPFs which are in STAM format in Markdown format.
Inputs
OPFs in STAM format.
Timeline
Specify the expected delivery date for the project.
RFW0154: STAM to MD serialiser
Summary
We have opfs in STAM format. We want them to be in markdown format to import them in pecha.org db.
Key Concepts
STAM: Stand-off Text Annotation Model
Context
As of now we have a data import pipeline for pecha.org database. The pipeline takes in markdown text and populate the database by parsing the markdown text. But most of the data we have are in STAM format. In order to import those text in markdown, we need a serialiser to convert the STAM files into markdown so that we can import the data via existing pipeline.
Outputs
OPFs which are in STAM format in Markdown format.
Inputs
OPFs in STAM format.
Timeline
Specify the expected delivery date for the project.
References
Annotations found in OPFs can be refer in toolkit package.