OpenITI / i.cex

OpenITI Instantiation with texts split into 300w chunks, implemented with CEX (CTS Exchange).

CEX files #!ctsdata part does not have all the fields #1

Open ace280 opened 7 years ago

ace280 commented 7 years ago

As described in an issue filed against the CEX specification 3.0, the #!catalog example in that specification is misleading. This may be why your project's CEX files carry the same mistake.

I ran into this problem while working on the CITE microservices, in the code that parses the #!catalog part. I would recommend fixing the CEX files so that newly developed applications can parse them correctly.
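To make the problem easier to reproduce, here is a minimal sketch of a field-count check in Python. It is not the CITE microservices code. It assumes that blocks start with a `#!label` line, that fields within a row are separated by `#`, and that lines starting with `//` are comments. The function names and the sample URNs are hypothetical, and the expected number of fields is passed in by the caller rather than taken from the specification.

```python
def split_blocks(cex_text):
    """Group the non-empty lines of a CEX document by their '#!label' heading."""
    blocks = {}
    label = None
    for raw in cex_text.splitlines():
        line = raw.strip()
        # Assumption: blank lines and '//' comment lines carry no data.
        if not line or line.startswith("//"):
            continue
        if line.startswith("#!"):
            label = line[2:].strip()
            # Repeated blocks with the same label are merged.
            blocks.setdefault(label, [])
            continue
        if label is not None:
            blocks[label].append(line)
    return blocks


def rows_with_wrong_field_count(rows, expected, delimiter="#"):
    """Return (row_number, field_count) for every row that does not have
    exactly `expected` delimiter-separated fields."""
    problems = []
    for number, row in enumerate(rows, start=1):
        count = len(row.split(delimiter))
        if count != expected:
            problems.append((number, count))
    return problems


# Hypothetical sample: the second row is missing its text field.
sample = """#!ctsdata
urn:cts:demo:group.work.ed:1#first chunk of text
urn:cts:demo:group.work.ed:2
"""

blocks = split_blocks(sample)
problems = rows_with_wrong_field_count(blocks["ctsdata"], expected=2)
```

With this sample, `problems` contains one entry for the second row, which has a single field. A text passage that itself contains `#` would be miscounted by this naive split, so a real check would need to account for that.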

maximromanov commented 7 years ago

Thanks for your comment! I will look into this issue. However, CEX is not the main format for our data; it is an automatically generated (and regenerated) instantiation meant for packages that work with the CEX format, ToPan in particular: https://thomask81.github.io/ToPan/

ace280 commented 7 years ago

I see. How do you generate/regenerate the CEX instantiation?

maximromanov commented 7 years ago

With a script that runs over our main corpus, whose texts are in OpenITI mARkdown, our main format (https://alraqmiyyat.github.io/mARkdown/).