clarin-eric / ParlaMint

ParlaMint: Comparable Parliamentary Corpora
https://clarin-eric.github.io/ParlaMint/
43 stars 53 forks source link

Do we insist on subtitle? #480

Closed matyaskopp closed 1 year ago

matyaskopp commented 1 year ago

@TomazErjavec, I have doubts if we insist on //titleStmt/title[@type="sub"]. I believe that we are insisting on that, but now I have doubts because it is a repeated error even in old-parters corpora.

The documentation is a bit fussy for me: https://clarin-eric.github.io/ParlaMint/#sec-titleStmt because it is not clear if the text about the titles describes just a sample or describes a general rule for all corpora.

TomazErjavec commented 1 year ago

Good point, and, no, the schema does not require it. I also have my doubts if we should: it is nice to have a subtitle, but not really necessary, i.e. nothing much depends on it. From the currently submitted full corpora, TR does not have it.

So, my suggestion would be to leave things as they are, and not require it. Do you agree?

Otherwise, I could change the Guidelines (add "must") and the schema. But the schema would become more fussy. And some corpora declared as valid could become non-valid, so I am scared to do that now, but we could postope it for 3.1.

matyaskopp commented 1 year ago

So, my suggestion would be to leave things as they are, and not require it. Do you agree?

ok, agree. So the only required is the main title (unique + country and en lang)

TomazErjavec commented 1 year ago

ok, agree. So the only required is the main title (unique + country and en lang)

Unique yes, but not necessarily both languages: