Closed janPensa closed 2 years ago
@janPensa tested on https://commonvoice.allizom.org/sentence-collector, looks good to me. Would you agree?
@MichaelKohler It seems like https://commonvoice.allizom.org/sentence-collector uses the old validation script. It rejects sentences longer than 14 words, and doesn't reject invalid words that don't follow phonotactics.
Yeah, looks like something is off with that deployment. I'll deploy to production now then.
:tada: This PR is included in version 2.17.3 :tada:
The release is available on GitHub release
Your semantic-release bot :package::rocket:
Okay. I'll wait a bit and test on commonvoice.mozilla.org
@MichaelKohler I did a few different tests. Looks like everything works as intended now!
@janPensa should be deployed now :)
@janPensa hah, you were faster than me. Thanks for the verification and the hotfix!
Somehow
[BbCcDdFfGgHhQqRrVvXxYyZz\u00C0-\u02BF\u1E00-\u1EFF\uF1900-\uF19FF]
and[qQwWxXyYÀ-ćĊ-ěĞ-ģĞ-ģĦ-ijĶ-śŞ-ūŮ-\u02AF\u1E00-\u1EFFα-ωΑ-ΩЀ-ӿ]
match with all regular Latin letters as well, making the Sentence Collector reject all submissions.Changed to
[BbCcDdFfGgHhQqRrVvXxYyZzÀ-ʯḀ-ỿ]
and[qQwWxXyYÀ-ćĊ-ěĞ-ģĞ-ģĦ-ijĶ-śŞ-ūŮ-ʯḀ-ỿα-ωΑ-ΩЀ-ӿ]
, which should work well. (At least they do in Notepad++, which I found has the same behavior.)