ftyers / commonvoice-utils

Linguistic processing for Common Voice
GNU Affero General Public License v3.0
51 stars 14 forks source link

' is a valid character in Italian #34

Closed lucarinelli closed 1 year ago

lucarinelli commented 2 years ago

Hi! I think I found an error here for Italian, the ' is a valid character for Italian and in cvutils/data/it/validate.tsv#L6 it is replaced with blank, even if there seem to be other rules below in the same file converting other symbols to ' and later allowing '. Maybe I'm not seeing a reason to have that replacement rule there on line 6? Thanks for the useful tools!

ftyers commented 1 year ago

Thanks! Sorry this took so long to merge, I missed the notification!