Closed nh36 closed 7 years ago
pre-ortho.zip The spreadsheet is in this attachment. I have tried to include only those substitutions that I think are safe to do on any dataset. The one that might be a bit dangerous is j > dʒ, but my thought is that it can always be undone if it causes problems. If you know for sure that a dataset is perfect iPA then you wouldn't apply this pre-orthoprofile anyhow.
I suppose we can dynamically load specific of these instances by applying some flags. E.g., if you write me things like "ignore line X", I can ignore this line in loading the profile.
It would be helpful to run some generic orthoprofile in order to save the annotator time in making an orthography profile. NH will make a spreadsheet to this effect and send it to JML.