pishoyg / coptic

This is a project that aims to make the Coptic language more learnable.
https://remnqymi.com/
GNU General Public License v3.0

[Crum] Improve Crum's Inflection-driven Rigor #43

Open pishoyg opened 4 months ago

pishoyg commented 4 months ago
  1. Implement normalization of the remaining annotations, namely `-` for prenominal forms, `=` for pronominal forms, `+` for qualitative forms, and the marker for "same as above".

    Just carry them on a separate field in lexical.structured_word, just like you did with attestations.

  2. Implement normalization for English-within-Coptic.

    Just carry them on a separate field in lexical.structured_word, just like you did with attestations.

  3. Tighten constants.ACCEPTED_UNKNOWN_CHARACTERS*. It should be possible to exercise more rigor once the extra normalization steps have been implemented.

  4. Detached types override / invalidate root types. Investigate.
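
A minimal sketch of what items 1 and 3 could look like, not the actual implementation. Everything here is hypothetical: the `StructuredWord` dataclass only stands in for `lexical.structured_word`, `parse_form` and `unknown_characters` are made-up names, and it assumes the markers appear as suffixes on the raw form. The "same as above" marker is left out.

```python
from __future__ import annotations

from dataclasses import dataclass
from enum import Enum


class Annotation(Enum):
    PRENOMINAL = "-"
    PRONOMINAL = "="
    QUALITATIVE = "+"


@dataclass(frozen=True)
class StructuredWord:
    # Hypothetical stand-in for lexical.structured_word.
    spelling: str
    # Annotations live in their own field, the same way attestations do.
    annotations: tuple[Annotation, ...] = ()


_MARKERS = {a.value: a for a in Annotation}


def parse_form(raw: str) -> StructuredWord:
    """Strip trailing annotation markers and record them separately.

    Assumption: markers are suffixes of the raw form.
    """
    text = raw.strip()
    found = []
    while text and text[-1] in _MARKERS:
        found.append(_MARKERS[text[-1]])
        text = text[:-1]
    found.reverse()  # restore the original left-to-right order
    return StructuredWord(spelling=text, annotations=tuple(found))


def _is_coptic(char: str) -> bool:
    code = ord(char)
    # Coptic block, plus the Coptic letters in the Greek and Coptic block.
    return 0x2C80 <= code <= 0x2CFF or 0x03E2 <= code <= 0x03EF


def unknown_characters(spelling: str) -> set[str]:
    """Return the characters that are not Coptic letters or symbols."""
    return {c for c in spelling if not _is_coptic(c)}
```

Once the markers are carried in their own field, whatever `unknown_characters` still reports on the spelling is a candidate for removal from the accepted list.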

Thought: The current state of Marcion data is imperfect. We will likely have to introduce new types (e.g. articled vs. non-articled nouns) in order to build an accurate inflection module. We might also have to populate the derivations data differently.

pishoyg commented 2 weeks ago

TODO: ~5. Normalize optional letters (a letter wrapped in a pair of parentheses, to indicate that it's optional).~ Edit: This is done already!
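
For reference, a rough sketch of one way to expand optional letters into every concrete spelling. This is not necessarily how the project does it. `expand_optional_letters` is a hypothetical name, and the sketch assumes each pair of parentheses wraps exactly one code point (so a letter carrying a combining mark would not match).

```python
import itertools
import re

# One optional letter: a single character wrapped in parentheses.
_OPTIONAL = re.compile(r"\((.)\)")


def expand_optional_letters(spelling: str) -> list[str]:
    """Return every spelling obtained by dropping or keeping each optional letter."""
    # With one capture group, re.split alternates fixed text and captured letters.
    parts = _OPTIONAL.split(spelling)
    fixed = parts[0::2]
    optional = parts[1::2]

    results = []
    for choice in itertools.product((False, True), repeat=len(optional)):
        out = [fixed[0]]
        for keep, letter, tail in zip(choice, optional, fixed[1:]):
            if keep:
                out.append(letter)
            out.append(tail)
        results.append("".join(out))
    return results
```

A form with n optional letters yields 2^n spellings, and a form with none comes back unchanged as a single-element list.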