clarin-eric / ParlaMint

ParlaMint: Comparable Parliamentary Corpora
https://clarin-eric.github.io/ParlaMint/

Version number of MTed corpora? #691

Closed TomazErjavec closed 1 year ago

TomazErjavec commented 1 year ago

A small matter, but still: what should be the version number of the MTed corpora?

matyaskopp commented 1 year ago

The linking to the source of the translation is more important than the proper versioning, so I vote for v3.0. We also have to synchronize our decision with @nljubesi and audio alignment.

nljubesi commented 1 year ago

Yes, I like 3.0 much better.

The audio alignment information will be minimal (we promised at least 50 hours per parliament), and it will quite likely not be complete (speeches will not always be covered in full), so the link to the corpora the alignment is based on will be much less crucial than is the case here.

TomazErjavec commented 1 year ago

OK, thanks both, 3.0 it is. For speech, I wasn't actually counting on it being integrated with the rest of the corpora, given that it doesn't cover the complete corpora (if I have this right). Anyway, we can revisit this question for 3.1.