UAlbertaALTLab / crk-db

Managing the Plains Cree dictionary database
https://itwewina.altlab.app/
GNU General Public License v3.0
0 stars 3 forks source link

Exclude metalanguage in AECD definitions from search-definitions #106

Open aarppe opened 1 year ago

aarppe commented 1 year ago

In addition, there's some AECD-specific encoding, such as Alt. or Var. for alternative or variant Cree word forms, (Plains) and (Northern) indicating dialect, that we wouldn't want to include in the search. For the 731 Alt. cases and 180 Var. cases, we'd probably want to encode the Cree word-forms appropriately (something for Daniel), perhaps marking them with <crk>...</crk>.

Originally posted by @aarppe in https://github.com/UAlbertaALTLab/crk-db/issues/101#issuecomment-1182659234

aarppe commented 1 year ago

When processing the AECD definitions, exclude metalanguage such as Alt, Var, (Plains) and (Northern) from the search-definitions (i.e. what words are included in indexing and searching the dictionary entries).