BoboTiG / ebook-reader-dict

Finally decent dictionaries based on Wiktionary for your beloved eBook reader.
http://www.tiger-222.fr/?d=2020/04/17/22/14/21-un-dictionnaire-alternatif-et-complet-pour-votre-liseuse
MIT License
386 stars 21 forks source link

[RO] Many words are missing their inflection #2085

Open dragostis opened 1 month ago

dragostis commented 1 month ago

It doesn't seem to be caused by a missing template, but many words seem to be missing their inflection on Kobo. For example, looking up "pietre" does not find the correct root "piatră".

I haven't debugged this yet, but it definitely seems like some words are not missing their inflection, but most of them do.

Upvote & Fund

Fund with Polar

BoboTiG commented 1 month ago

I did not translate the page, but https://ro.wiktionary.org/wiki/pietre just does not exist. So, it is expected to not be found on the Kobo.

BoboTiG commented 1 month ago

The current process to add variants would be to create the https://ro.wiktionary.org/wiki/pietre page, and use a specific template to tell this is the inflexion of "piatră".

FTR we currently look at those variants: https://github.com/BoboTiG/ebook-reader-dict/blob/28becba87346764a58b0f64689408f4300df2fcf/wikidict/lang/ro/__init__.py#L49-L150

dragostis commented 1 month ago

Thanks for the reply!

I wonder if an alternative would be to use the declination tables (e.g. on the word's root page) just to generate cross-linking.

BoboTiG commented 1 month ago

I wonder if an alternative would be to use the declination tables (e.g. on the word's root page) just to generate cross-linking.

Actually that is a great idea! And going further, it is something doable for all locales.

That will require quite some work, at first sight. But I'm not against the idea, and might try something later on. If someone wants to give a try, I will love to review some code too!

dragostis commented 1 month ago

In the case of Romanian specifically, it seems like only en.wiktionary.org has complete inflection tables compared to the ro.wiktionary.org source that has partial ones.

Since I'm not familiar with the code, would you be willing to please leave some high-level approach to add this? I might take a stab at it tomorrow.