janeczku / calibre-web

:books: Web app for browsing, reading and downloading eBooks stored in a Calibre database
GNU General Public License v3.0
12.2k stars 1.3k forks source link

Get Metadata: inaccurate results #1281

Open antanisblinda opened 4 years ago

antanisblinda commented 4 years ago

hi. I was hoping someone can help with this.

I'm running latest calibre and latest calibre-web. Calibre web is through the technosoft docker img.

The results obtained by the get-metadata feature are completely different between the 2 apps. Calibre finds almost always what I'm looking for, while calibre-web for the same entry retrieves 0 useful results. DEBUG level info in the logs is of no-help (does not log the query).

I understand the code between the 2 is different but I was hoping the "design" of the query and the sources would work similarly?

thank you

OzzieIsaacs commented 4 years ago

Please give an example for search queries, so I can check.

antanisblinda commented 4 years ago

thank you. so I have this book from Angelica Lefosse https://www.amazon.com/Interior-Design-Complete-design-furnish/dp/B086FWQB9T/ref=sr_1_3?dchild=1&keywords=angelica+lefosse&qid=1585658818&sr=8-3

the epub is setup with the same author and following title "Interior Design: Complete guide on how to design and furnish your home" I'm attaching the results i get. the log in the web interface does not show any info on metadata searches. thank you

Capture

antanisblinda commented 4 years ago

and if I force in the search query something like the name of the author i receive non-relevant results Capture

OzzieIsaacs commented 4 years ago

It's indeed a difficult task. You are right the results from Calibre-Web are very inaccurate. The cause: Google Book search doesn't output anything relevant, even if you use advanced search and use author and title seperate. (scrap in -> scrap out). I tried to search for the book in calibre with the title, author and id's set. For me it didn't output anything (where you say you get results). The Calibre metadata search is way better than Calibre-Web metadata search (it's doing several searches on google, bing and judging the result in the end) I also tried to search for the book on amazon.com and didn't find it! (If you search for the author you only find the kindle version but not the paper version). If I search for it with google I only find the book on "amazon.in".

Conclusion: No easy fix for now. Next step for Calibre-Web is a metadata search plugin system, to allow different metadata provider, but this won't help here.

viblo commented 3 years ago

I also noticed the same issue. So far I tested with two Chinese books, and neither of them could be identified with Calibre-Web when I uploaded them and then tried to fetch metadata. In case its of any help these are the details of the two cases:

Book1: Filename: 0-不平等的童年阶层种族和家庭生活6490113.azw3 Calibre find it as: Title: 不平等的童年:阶层、种族和家庭生活 Ids: https://www.worldcat.org/isbn/9787301270936 Cover: yep

Book2: Filename: 我是个妈妈我需要铂金包.mobi Calibre find it as: Title: 我是个妈妈,我需要铂金包 Ids: https://www.worldcat.org/isbn/9787508695488 Cover: yep

However, if I go into Edit Metadata (in Calibre) and try to search for them again it doesnt find anything, so only the inital import can find the data.

I was hoping to replace Calibre running on my laptop with Calibre-Web on my htpc, but now I have to rethink.. (except for this I think Calibre-Web looks great so far!). I understand that fixing this is a tricky case, but would be great it if it could be solved somehow!