Tatoeba / tatoeba2

Tatoeba is a platform whose purpose is to create a collaborative and open dataset of sentences and their translations.
https://tatoeba.org
GNU Affero General Public License v3.0
679 stars 131 forks source link

Data validation #3073

Closed jiru closed 10 months ago

jiru commented 11 months ago

This PR introduces some data validation checks that should solve #2793 and #3002.

Basically this PR prevents users from doing this, among other things: image