cbanack / comic-vine-scraper

An add-on script for ComicRack that lets you copy details from Comic Vine into your comic books.
243 stars 47 forks source link

Scraping duplicates the same as previous #275

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
When scraping a misnumbered comic you have to manually browse through the 
comics to find the right number, or cancel the operation and renumber the 
comic(s) manually.
Well after you selected the right comic, CVS will scrape it correctly, but if 
you have a duplicate of the comic, CVS will treat the duplicate as the wrong 
number again.

My request is that CVS treats all issues of the same number/series/etc the same 
as the previous selection. 

Original issue reported on code.google.com by echerr...@yahoo.com on 9 Feb 2013 at 6:29

GoogleCodeExporter commented 9 years ago
Thank you for the suggestion.  Is this something that happens to you regularly? 
 (i.e you have duplicates of a comic, and they are both misnumbered?)

Original comment by cban...@gmail.com on 9 Feb 2013 at 6:43

GoogleCodeExporter commented 9 years ago
Yeah sure I guess. 
It's not like a "OMG none of my issues match CVDatabase's, I can't get this 
thing to work >_<".

I just have issues time to time that just don't match up with CV's. 

Example:
Shadow Comics and Heavy Metal use multiple volumes and renumberings while 
Comicvine lists them both as a single series, so Vol 6 #4 may be Vol 1 #68 at 
Comicvine. 
Then I may have one scan that's ctc and one noads, so only one of them would 
get Vol 1 #68 while the other gets Vol 1 #6.

It'd be too much to ask for CVS to also do this comparison for later issues 
like Vol 6 #5/6/7/etc, but I wouldn't think it'd be too much of a hassle for 
just the same series/volume/format/number

Original comment by echerr...@yahoo.com on 9 Feb 2013 at 9:06

GoogleCodeExporter commented 9 years ago
Ah, yes I see what you're talking about:

You have two comics that are labeled with the same series/number, but it is the 
wrong series/number.   So you manually correct one of them while scraping.  
You'd like it if the scraper remembered this correction and automatically 
applied it to the second comic as well.

The problem is, what happens if the series/number on both books is actually 
right for one of the books and wrong for the other?  Then the scraper might 
automatically scrape the correctly labelled book with the wrong series/number.

I'll continue to give it some thought.

In the meantime, I think there is a duplicate finder script out there that you 
could use to find and remove duplicates from your collection, if that would 
help...

Original comment by cban...@gmail.com on 10 Feb 2013 at 1:45

GoogleCodeExporter commented 9 years ago
If it's right for one (the one you see first) and wrong for any other (the ones 
in the background), you wouldn't be able to fix it no way, as there's no 
checkups for them.

Duplicate thingy doesn't help me, because I don't have any particular favorites 
that I want to keep. I have to manually go through my comics and display them 
side by side to see which scans I like better and which one has more or less of 
what I want than the other.

When scanning, only the earliest comic is shown to select for series. This 
comic (and its duplicates) are the ones I'm asking for.
ANY other comic in that series is going to scan the same, no matter my 
preference. I'm only asking for the first one, as it's the only reasonable one.

I'll also like to ask for a more automated process on "When several comics 
appear to be from the same, only confirm the series from the first one". I'd 
like for the series to be confirmed for these comics, but not automatically 
select the issue. So, a sub option to basically say "don't automatically select 
issue number"

And that'd tie-in to the duplicate thing.

Original comment by echerr...@yahoo.com on 11 Feb 2013 at 12:09

GoogleCodeExporter commented 9 years ago
>I'll also like to ask for a more automated process on "When several comics 
appear to be from the same, only confirm the series from the first one". I'd 
like for the series to be confirmed for these comics, but not automatically 
select the issue. So, a sub option to basically say "don't automatically select 
issue number"

You can get this effect by selecting all of the comics from that series in 
ComicRack and 'bulk' setting their issue number to something that doesn't 
exist, like "X" or "-5" or something.  Whenever the scraper can't match the 
issue number for a comic with the available issue numbers in the series you 
chose, it will ask you to pick the right issue number.

Original comment by cban...@gmail.com on 11 Feb 2013 at 12:25

GoogleCodeExporter commented 9 years ago
Nah, with that, I'd have to scroll down and manually select each issue. I'd 
want it to automagically select the numbers for me, just not okay them.

Original comment by echerr...@yahoo.com on 11 Feb 2013 at 9:46