clarin-eric / ParlaMint

ParlaMint: Comparable Parliamentary Corpora
https://clarin-eric.github.io/ParlaMint/

README-XX.md #804

Open maartenpt opened 11 months ago

maartenpt commented 11 months ago

The language-specific README files (like README-BA.md) contain links, most crucially to the handle of the source. One would expect those to be actual links in the Markdown as well.

And since the README is basically an extensive description of the encoding process, should that not be included in the ParlaMint-BA.ana.xml file?

matyaskopp commented 11 months ago

Released READMEs should be fixed in https://github.com/clarin-eric/ParlaMint/blob/b27cbba669df722340a25d00dc3991390b5d91d7/Scripts/parlamint2distro.pl#L444
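For illustration only, here is a minimal Python sketch of the kind of fix being discussed: turning bare URLs in README text into Markdown autolinks. It is not the code in `parlamint2distro.pl` (which is Perl); the function name `linkify` and the example handle are hypothetical.

```python
import re

# Bare http(s) URLs that are not already inside a Markdown link or autolink,
# i.e. not directly preceded by "(", "<" or "[".
BARE_URL = re.compile(r'(?<![\(<\[])\bhttps?://[^\s<>()\[\]]+')


def linkify(text: str) -> str:
    """Wrap bare URLs in <...> so that Markdown renders them as links."""

    def repl(match: re.Match) -> str:
        url = match.group(0)
        trailing = ""
        # Keep sentence punctuation outside the link.
        while url and url[-1] in ".,;:!?":
            trailing = url[-1] + trailing
            url = url[:-1]
        return f"<{url}>{trailing}"

    return BARE_URL.sub(repl, text)


# Hypothetical handle, used only as an example input.
print(linkify("Source: http://hdl.handle.net/12345/678."))
```

URLs that are already wrapped, such as `<http://...>` or `[text](http://...)`, are left untouched, so running the function twice does not change the result.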

TomazErjavec commented 11 months ago

> Released READMEs should be fixed

OK, done; given that the corpora are being built now, some will not have this (unless I could re-run only the README generation, but this is not implemented yet).

> the README is basically an extensive description of the encoding process, should that not be included in the ParlaMint-BA.ana.xml file?

Indeed it should, but the "documentation" process was distinct from the encoding, incl. the teiHeader, alas. In fact, it would be great if the README files were bulked up with more data from the teiHeaders as well as auto-derived stats on the corpus. Something for the future again...
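As a rough sketch of what pulling data from a teiHeader into a README could look like, the following Python reads size figures from the header and renders them as a Markdown table. It assumes the header records sizes as `extent/measure` elements with `unit` and `quantity` attributes; the function names and the sample values are made up for illustration and are not taken from any ParlaMint corpus or script.

```python
import xml.etree.ElementTree as ET

# The TEI namespace; the "tei" prefix is only a local alias.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}


def header_stats(xml_text: str) -> dict:
    """Collect unit -> quantity from <extent>/<measure> in the teiHeader.

    Assumption: each <measure> carries 'unit' and an integer 'quantity'.
    """
    root = ET.fromstring(xml_text)
    stats = {}
    path = ".//tei:teiHeader//tei:extent/tei:measure"
    for measure in root.iterfind(path, TEI_NS):
        unit = measure.get("unit")
        quantity = measure.get("quantity")
        if unit and quantity:
            stats[unit] = int(quantity)
    return stats


def stats_to_markdown(stats: dict) -> str:
    """Render the collected figures as a small Markdown table."""
    lines = ["| Unit | Quantity |", "| --- | --- |"]
    lines += [f"| {unit} | {quantity} |" for unit, quantity in stats.items()]
    return "\n".join(lines)


# Made-up sample header, for demonstration only.
SAMPLE = (
    '<teiCorpus xmlns="http://www.tei-c.org/ns/1.0"><teiHeader><fileDesc>'
    '<extent><measure unit="speeches" quantity="10"/>'
    '<measure unit="words" quantity="2500"/></extent>'
    "</fileDesc></teiHeader></teiCorpus>"
)
print(stats_to_markdown(header_stats(SAMPLE)))
```

The standard-library parser does not resolve XInclude on its own, so this reads only what is literally present in the file it is given.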