common-voice / sentence-collector

Tool to collect and review sentences for Common Voice
https://commonvoice.mozilla.org/sentence-collector/
Mozilla Public License 2.0

Removal Request: Turkish #650

Closed HarikalarKutusu closed 1 year ago

HarikalarKutusu commented 1 year ago

In the last few days, 5k+ sentences were added from an Internet service provider's support chat history, but they were not corrected and do not conform to the quality standards. Fortunately, they contacted me, and we will be working together on these.

I checked the data. The sentences are mostly fine with respect to being public domain, but there are full proper names, sentences with abbreviations (ADSL, DSL, TT for Turkish Telekom, etc.), missing punctuation, and missing Turkish characters (for example, i used instead of ı, as we often do on mobile). So the result is not usable.
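To illustrate the kind of check I mean, here is a rough Python sketch that flags the three mechanical problems listed above (abbreviations, missing end punctuation, missing Turkish characters). It is only an assumption of how such a filter could look, not the Sentence Collector's actual validation; the function name `quality_issues` and the issue labels are made up for this example. Proper names cannot be detected this simply and are left out.

```python
import re

# Letters that exist in Turkish but are typically dropped on mobile keyboards.
TURKISH_ONLY = set("çğıöşüÇĞİÖŞÜ")

# Two or more consecutive capital letters, e.g. ADSL, DSL, TT.
ABBREVIATION = re.compile(r"\b[A-ZÇĞİÖŞÜ]{2,}\b")

# Accepted sentence-final punctuation (an assumption for this sketch).
END_PUNCT = (".", "?", "!", "…")


def quality_issues(sentence: str) -> list[str]:
    """Return a list of hypothetical issue labels for one sentence."""
    issues = []
    text = sentence.strip()
    if ABBREVIATION.search(text):
        issues.append("abbreviation")
    if not text.endswith(END_PUNCT):
        issues.append("no_end_punctuation")
    # Weak signal only: a correct short sentence may contain none of these letters.
    if not any(ch in TURKISH_ONLY for ch in text):
        issues.append("no_turkish_characters")
    return issues


if __name__ == "__main__":
    for s in ["ADSL modemim calismiyor", "Modemim çalışmıyor."]:
        print(s, "->", quality_issues(s))
```

The Turkish-character check is a heuristic that makes more sense over a whole batch than for a single sentence, so anything it flags would still need a human review.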

If these can be deleted, we will add corrected versions and also add other sentences to dilute them.

Thank you,
Bülent

Source: "From our company's database. This project is important to us: we have a target of close to 1,500 hours in Turkish, and staff will be putting in working hours on it, so we are adding sentences for this."

MichaelKohler commented 1 year ago

:tada: This issue has been resolved in version 2.19.0 :tada:

The release is available on GitHub release

Your semantic-release bot :package::rocket: