scribe-org / Scribe-Data

Wikidata, Wiktionary and Wikipedia language data extraction
GNU General Public License v3.0
30 stars 69 forks source link

Create Russian to all other languages translation process #77

Closed andrewtavis closed 8 months ago

andrewtavis commented 8 months ago

Terms

Description

The goal of this issue is to create a process whereby a single file is used to translate all words within Russian/translations/words_to_translate.json to all other Scribe languages. To achieve this we'll be using m2m100_418M, with the output being a JSON file that has a string and keyed values for each language. This can then be transferred to an SQLite database table with each string in an index corresponding to a column value for each language.

Of specific importance is trying to get a metric of the accuracy of the translation and doing a cutoff such that we're no longer including low quality translations in Scribe applications :)

Contribution

Happy to work on this or support someone with interest in working on it!

andrewtavis commented 8 months ago

Closed via #89 :) Thanks @shashank-iitbhu!