OpenText-org / original_annotation

XML files for linguistic annotation of the Greek New Testament
Creative Commons Attribution Share Alike 4.0 International
10 stars 3 forks source link

N.B. for copyright reasons inflected forms are missing #1

Open zaddok opened 3 years ago

zaddok commented 3 years ago

I was intrigued by this:

word form information - N.B. for copyright reasons inflected forms are missing

It is assumedly trivial to automate/script a process of converting (for example) lex="Παῦλος" to its NON num="sing" cas="nom" gen="mas" form isn't it?

christopherland commented 3 years ago

For the vast majority of tokens, yes. One could even automate accentuation to account for clitics, etc. There's even a good argument to be made against the legality of copyrighting an ancient text like the GNT.

We released this data as legacy data, and didn't want the hassle of copyright issues. Our entirely new annotation is forthcoming, and is based on the Nestle 1904 in order to avoid disputes regarding the openness of the base text.