suryakencana007 / comic-vine-scraper

Automatically exported from code.google.com/p/comic-vine-scraper

Option for always series matching when no volume #132

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
This is kind of a half-baked idea, if you have an alternate method go for it.

I recently came across a situation where I had multiple issues from different 
volumes of the same series selected to be scraped:
Avengers 1 (2010)
Avengers 2 (2010)
Avengers 142 (1975)

Due to how I had it sorted, Avengers 142 was first in the selection. I started 
scraping, matched the issue to the 1963 Avengers series and it scraped 
properly. However, because the other issues had no volume recorded to set them 
apart, the scraper treated them as the same series and gave them the wrong 
metadata.

For me this is a big problem because I have a script that takes files in my 
staging area, determines which ultimate directory they should end up in and if 
there's already a book there, only keeps the book with the greater size.  If I 
ran this with the wrong volume (1963 instead of 2010), I could lose books.

Even for others, this could be a pretty annoying issue, as a large number of 
books would need to be scrubbed and re-scraped.

The only thing I could think of to address this is an option to "Always prompt 
for Series match if no volume recorded". It's clunky, so if you have any other 
ideas, I'm all for them.
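A minimal sketch of how such an option might behave, to make the idea concrete. Everything here is hypothetical: the `Book` record, the `plan_series_prompts` function and the `prompt_if_no_volume` flag are illustrative names, not part of the scraper's actual code.

```python
from collections import namedtuple

# Hypothetical record for an unscraped comic; not the scraper's real data model.
Book = namedtuple("Book", ["series", "issue", "volume"])


def plan_series_prompts(books, prompt_if_no_volume=True):
    """Return (book, must_prompt) pairs in selection order.

    A book reuses an earlier series match only if its (series, volume) key
    was already seen and, when the option is on, a volume is actually recorded.
    """
    seen = set()
    plan = []
    for book in books:
        key = (book.series.lower(), book.volume)
        has_volume = book.volume is not None
        can_reuse = key in seen and (has_volume or not prompt_if_no_volume)
        plan.append((book, not can_reuse))
        seen.add(key)
    return plan


# The selection from this report, with no volume recorded on any file.
selection = [
    Book("Avengers", "142", None),
    Book("Avengers", "1", None),
    Book("Avengers", "2", None),
]
for book, must_prompt in plan_series_prompts(selection):
    print(book.series, book.issue, "prompt" if must_prompt else "reuse match")
```

With the option on, all three books get their own series prompt. With it off, only the first one does and the other two silently reuse its match, which is the behaviour described above.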

Original issue reported on code.google.com by bmen...@gmail.com on 2 Sep 2010 at 1:25

GoogleCodeExporter commented 9 years ago
I have been considering a new feature that would make the scraper be somewhat 
less 'automatic' when it comes to scraping all the other comics that it 
perceives to be in the same series:

http://code.google.com/p/comic-vine-scraper/issues/detail?id=131

Like, maybe it would tell you that there are 5 other comics in this series, 
would you like to scrape them all using the currently chosen series? That 
way if you knew how many comics you were expecting in that series, you could 
say no. Or maybe you'd just have an option to force the scraper to scrape 
comics one at a time; you could use that whenever you knew there were comics 
from two identically named series.
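Roughly, the two ideas could look like the sketch below. This is only an illustration under assumptions: `books_reusing_match`, the `one_at_a_time` flag and the dictionary layout are made-up names, and the real scraper would ask through its own dialog rather than a callable.

```python
def books_reusing_match(current, remaining, one_at_a_time=False, ask=input):
    """Return the books in `remaining` that should reuse `current`'s series match.

    `ask` is any callable that takes a question and returns the user's answer;
    it defaults to the built-in input() and is injectable for testing.
    """
    same_name = [b for b in remaining
                 if b["series"].lower() == current["series"].lower()]
    # In one-at-a-time mode nothing is ever scraped automatically.
    if one_at_a_time or not same_name:
        return []
    question = ("There are %d other comics named '%s'. "
                "Scrape them all using the chosen series? [y/N] "
                % (len(same_name), current["series"]))
    answer = ask(question)
    return same_name if answer.strip().lower() in ("y", "yes") else []


current = {"series": "Avengers", "issue": "142"}
remaining = [
    {"series": "Avengers", "issue": "1"},
    {"series": "Avengers", "issue": "2"},
    {"series": "Thor", "issue": "5"},
]
# Simulate a user who knows two Avengers issues is not what they expected.
declined = books_reusing_match(current, remaining, ask=lambda q: "n")
print(len(declined))
```

Answering "no" (or turning on the one-at-a-time flag) leaves every remaining comic to be matched individually.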

Other than that, I'm not really sure how to deal with the problem you 
experienced. To handle it automatically, there would need to be some way for 
the scraper to tell that the comic book files are in different series, even 
though they have the same series name. Since volume information (actually, 
most information) is not normally available in unscraped comics, there's not 
much to go on.

Original comment by cban...@gmail.com on 2 Sep 2010 at 11:59