The primary motivation for this project is to produce a dataset for OPUS, yet this is not one of the supported outputs.
Rather than throwing a CSV, XML, or JSON file for them to process further, it'd be nice if we could just understand the format and support it out of the box in the first place.
The primary motivation for this project is to produce a dataset for OPUS, yet this is not one of the supported outputs.
Rather than throwing a CSV, XML, or JSON file for them to process further, it'd be nice if we could just understand the format and support it out of the box in the first place.
Here is the format we should export to: https://opus.nlpl.eu/trac/wiki/DataFormats.html