Tatoeba / tatoeba2

Tatoeba is a platform whose purpose is to create a collaborative and open dataset of sentences and their translations.
https://tatoeba.org
GNU Affero General Public License v3.0
714 stars 132 forks source link

Data validation #3073

Closed jiru closed 1 year ago

jiru commented 1 year ago

This PR introduces some data validation checks that should solve #2793 and #3002.

Basically this PR prevents users from doing this, among other things: image