UAlbertaALTLab / crk-db

Managing the Plains Cree dictionary database
https://itwewina.altlab.app/
GNU General Public License v3.0

do not standardize SRO field (yet) #50

Closed by dwhieb 3 years ago

dwhieb commented 3 years ago

The original version of the SRO field, with no normalization, substitutions, or other processing, should be stored in the database, in addition to the standardized / normalized versions for itwêwina and the FST respectively.

The current convert-cw script processes the SRO field, so the original transcription is lost.

This PR removes all processing of the SRO field and defers standardizing / normalizing the SRO transcription until the CW data is imported / aggregated into the ALTLab database (a forthcoming PR).
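To illustrate the intended separation, here is a minimal sketch in Python of keeping the original transcription untouched while deriving a standardized form at import time. It is not the project's code: the `Entry` class, the field names, `standardize_sro`, `import_entry`, and the macron-to-circumflex rule are all hypothetical stand-ins for whatever standardization the import step ends up applying.

```python
import unicodedata
from dataclasses import dataclass

# Hypothetical rule for illustration only: map macron-marked long vowels
# (U+0101, U+0113, U+012B, U+014D) to their circumflex equivalents.
MACRON_TO_CIRCUMFLEX = str.maketrans(
    "\u0101\u0113\u012b\u014d",
    "\u00e2\u00ea\u00ee\u00f4",
)


@dataclass(frozen=True)
class Entry:
    """Hypothetical database entry that keeps both forms side by side."""
    sro_original: str      # transcription exactly as it appears in the source
    sro_standardized: str  # derived form, computed at import time


def standardize_sro(sro: str) -> str:
    """Return a standardized copy of the transcription (assumed rules)."""
    # Compose base letter + combining mark sequences into single code points
    # so the translation table below can match them.
    composed = unicodedata.normalize("NFC", sro)
    return composed.translate(MACRON_TO_CIRCUMFLEX)


def import_entry(raw_sro: str) -> Entry:
    """Build an entry without overwriting the original transcription."""
    return Entry(
        sro_original=raw_sro,
        sro_standardized=standardize_sro(raw_sro),
    )
```

With this shape, the conversion step only passes the raw SRO through, and any change to the standardization rules can be reapplied later because the original is still available.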

Partially addresses #44.