Strings marked for translation provide a ready-made corpus of bits of English text, ranging from individual words, through sentence fragments, statements, questions, all the way up to entire paragraphs. Translations provide the same thing in other languages. Longer strings are likely to have %s-style placeholders which could perhaps be filled by shorter fragments.
I feel like something could be assembled out of this. GNOME (for example) should easily have more than 50,000 translatable words…
Strings marked for translation provide a ready-made corpus of bits of English text, ranging from individual words, through sentence fragments, statements, questions, all the way up to entire paragraphs. Translations provide the same thing in other languages. Longer strings are likely to have
%s
-style placeholders which could perhaps be filled by shorter fragments.I feel like something could be assembled out of this. GNOME (for example) should easily have more than 50,000 translatable words…