xxyzz / WordDumb

A calibre plugin that generates Kindle Word Wise and X-Ray files for KFX, AZW3, MOBI and EPUB eBook.
https://xxyzz.github.io/WordDumb/
GNU General Public License v3.0
357 stars 18 forks source link

Lack of vocabularies translation #188

Open Dieofu opened 5 months ago

Dieofu commented 5 months ago

Checkboxes

Describe the bug

word wise creating function works, however, somehow, only very limited vocabularies have word wise translations. When I check customize kindle word wise, I try typing some vocabularies in Filter Lemma. For example, "progressive", it only shows phrases 20240220203136; For vocabulary "fly", it only shows animal fly in translation, but no verb form in translation. 20240220203748; For vocabulary fry", the translation doesn't show up. 20240220203932 I would like to ask what I can do to correct translation database.

Operating System name and version

Windows 10

Python version

3.12

calibre version

7.5.1

WordDumb plugin version

3.31.0

Error message

No error message shown

Plugin settings and reproduce steps

20240220204402 20240220204440

Generated files, screenshots or videos

No response

xxyzz commented 5 months ago

The gloss data are obtained from the Chinese Wiktionary, you won't find some words because Wiktionary doesn't have the definition or the page has a terrible wikitext layout. For example, the fly page only has the none gloss, and the fry page doesn't use the standard layout.

You could deselect the "Use Wiktionary definition" to use the Kindle Word Wise db.

If you want to improve the Wiktionary data quality, you could edit the Wiktionary page and take a look of these packages:

Word Wise db files are created from the "Proficiency" project, and "wiktextract" is for extracting data from Wiktionary pages.

Dieofu commented 5 months ago

For word: progressive and fry. Both of them have Chinese Wiktionary translations. But, none of them is in the worddumb. Does it mean that both words have terrible wikitext layout?

When I deselect the "Use Wiktionary definition" to use the Kindle Word Wise db, it shows: 20240220203136 On kindle, Chinese translations won't match vocabularies . What I can do is using English to English wordwise. Do you have any idea what causes it? is something wrong with the Kindle Word Wise db?

Thanks

xxyzz commented 5 months ago

Both "progressive" and "fry" in Chinese Wiktionary are low quality pages: no part of speech title, and gloss text is not inside a list. So they can't be extracted or can't be used without pos.

When "Use Wiktionary definition" option is selected, a new word wise db will be uploaded to Kindle to replace the Chinese word wise db. If you want to use Kindle's Chinese word wise file, you could delete the file at /system/kll and Kindle will redownload the file.

Dieofu commented 5 months ago

/system/kll is this file in kindle? I don't see any system files on my kindle. 20240220221540

Would it cause the problem I had?

calibre, version 7.5.1 even if the device is connected, it says: Device not found: Definition data will be added when Kindle or Android(requires adb) device is connected.

xxyzz commented 5 months ago

It's a hidden folder. Are you using Scribe? The downloading word wise db file from kindle code might not work for Scribe, otherwise there might be a bug in the code.

Dieofu commented 5 months ago

It's a hidden folder. Are you using Scribe? The downloading word wise db file from kindle code might not work for Scribe, otherwise there might be a bug in the code.

I am using paperwhite 5. I found system file and just deleted kll files. Hope it could solve the problems

ImThePsycho commented 1 month ago

It's a hidden folder. Are you using Scribe? The downloading word wise db file from kindle code might not work for Scribe, otherwise there might be a bug in the code.

I am using paperwhite 5. I found system file and just deleted kll files. Hope it could solve the problems

Hi, have you fixed the problem?