Clear-Bible / macula-greek

Syntax trees, morphology, and linguistic annotations for the Greek Bible
Other
22 stars 6 forks source link

Add SDBG sense data #1

Open jonathanrobie opened 2 years ago

jonathanrobie commented 2 years ago

Need to add SDBG sense data from MARBLE - this is part of the Minimum Required to Declare Victory.

klosoter commented 2 years ago

I tried using @rkjtan's mapping file (Mappings.xlsx), but it does not contain the MARBLE ids (used in marble-lexicon/SDBG/, marble-mappings/UBSGNT5/ or macula-greek/sources/MARBLE/SDBG). It does contain something called SBLGNT_ID but that is not the same as the MARBLE ids.

Marble ids have a different format from the other ids, see for example:

<word id="04000100100002">
    <sense>SDBG:βίβλος:000002:Communication</sense>
</word>
<word id="04000100100004">
    <sense>SDBG:γένεσις:000001:Kinship Terms</sense>
    <sense>SDBG:γένεσις:000002:Communication</sense>
</word>

Also, the text that MARBLE uses in MARBLE does not align well with any of the Nestle Aland versions in the mapping file. We might need a new mapping. Am I missing something here?