Open soodoku opened 7 years ago
Okay, I will do. By quick check there is over 200k unique labels under "Top/World/..." that will be non-English.
But seems Google Translate is limit just 1,000 words/day?
Not sure if we have good alternatives. And it seems that Google pricing is reasonable: https://cloud.google.com/translate/v2/pricing
We can run through it one time.
Actually, Google Translate API has the following limit :- (it's not 1,000 words/day)
By splitting each level of the category and grouping them by the language, we can get the smaller unique list of words for each language. It's about 1.5M characters so probably free quota will enough to translate it all.
Sorry for my confusing, actually Google Translate API it's not free. But above number is quota to use this service per day per account.
Fortunately, Google give $300 credits for 60 days free trial on theirs Cloud services, so we can use this credits.
Lots of category labels are in a language other than English
For non-english, it appears one pattern is that language is in the path: Deutsch, Japanese etc.
Perhaps use google translate to translate it? One package we could use: https://pypi.python.org/pypi/translate
Final output will have an additional column -> cat_labels_english