fnielsen / ordia

Wikidata lexemes presentations
https://ordia.toolforge.org
Apache License 2.0
24 stars 13 forks source link

Feature request: Chinese support #132

Open Artoria2e5 opened 2 years ago

Artoria2e5 commented 2 years ago

Ordia currently does not support Chinese at all. Proper support will need #95, of course...

fnielsen commented 2 years ago

I have now implemented zh-Chinese support for the text-to-lexemes dropdown, see https://ordia.toolforge.org/text-to-lexemes?text-language=zh&text=%E6%B1%89%E8%AF%AD

This misses:

fnielsen commented 2 years ago

(Tokenization on Chinese can be done by manually adding spaces between "words".

Artoria2e5 commented 2 years ago

Given the large number of languages involved it might end up easier to give a text input for other langcodes. Wikidata entries for min involves a bunch of x-Q.... stuff that I don't think you want to deal with, for example...