FrankensteinVariorum / fv-collation

first-stage collation processing in the Frankenstein Variorum Project. For post processing and Variorum development, see our GitHub organization: https://github.com/FrankensteinVariorum
https://frankensteinvariorum.github.io/fv-collation/
GNU Affero General Public License v3.0
9 stars 2 forks source link

Normalization List #28

Open ebeshero opened 6 years ago

ebeshero commented 6 years ago

List of oddly spelled, regularly-occurring words to normalize in collation

desciple (disciple) (add a representative zone @xml:id if you like here) thier (their) allmost (almost) somthing (something) percieve (perceive) percieved (perceived) unpercieved (unperceived) listend (listened) & (and)

ebeshero commented 6 years ago

Looking back on this issue several months later while processing experimental collations, I think the only thing really necessary to normalize here is the ampersand.