xxyzz / WordDumb

A calibre plugin that generates Kindle Word Wise and X-Ray files for KFX, AZW3, MOBI and EPUB eBook.
https://xxyzz.github.io/WordDumb/
GNU General Public License v3.0
386 stars 19 forks source link

Use other dictionaries instead of Wiktionary for word wise #51

Closed woaidangyang closed 2 years ago

woaidangyang commented 2 years ago

The newly added feature that adds word wise to epubs is great!

Since the quality of Wiktionary is relatively low sometimes, I'm wondering if it's possible to use some customized dictionaries for word wise. A good example is this plugin for Anki: https://github.com/sth2018/FastWordQuery

Another request is if WordDumb can support "fuzzy query", meaning that it would hit the word with different forms.

It might be too much to ask! Thank you so much!

xxyzz commented 2 years ago

Support of new dictionary file format may not be very hard but extract short glosses from a dictionary requires some work.

For Kindle Word Wise, I use LemmInflect to inflect each lemma: code

For EPUB, the forms data from Wiktionary are used.

woaidangyang commented 2 years ago

Got it. Thank you!

I agree with you that it might not be possible to extract short glosses from some of the dictionaries. But even without the glosses, it would already be a huge help if I can highlight the words that I want to learn in the books and have the word meanings available just a click away.

woaidangyang commented 2 years ago

I agree with you it might be a better idea to rely on Wiktionary instead of other commercial dictionaries. It's more sustainable and expandable.

xxyzz commented 2 years ago

Wiktionary has much more potential to explore. Once I figure out how to use https://github.com/tatuylonen/wiktextract to extract non-English Wiktionary dump files, it'll be possible to have glosses in many languages.