CharlieCorner closed this 7 years ago
Nice! I've long suspected that some of the regexes have grown stale. This is something good (and working >.>) CI would catch - kicking off a run twice a week can easily prevent this.
By the way, commit 76f7ad4, also included in this Pull Request, fixes #89. I forgot to mention this in the original post.
This fixes #87
The current pattern we have for Mangafox is:
re_getImage = re.compile('"><img src="([^"]*)"')
But on the actual page, this is what the tag for the page image looks like; notice the newline between the closing > of the a tag and the < of the img tag:
We're now searching for img tags that have an id="image", which is what Mangafox uses to identify the page image on their website.
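For anyone who wants to see the difference, here is a minimal sketch of the idea. The old pattern is the one quoted above. The new pattern and the sample markup are my own assumptions for illustration (the names `new_re` and `sample` are hypothetical), not necessarily the exact regex in this Pull Request or the exact HTML Mangafox serves.

```python
import re

# Pattern quoted above: it needs '">' to be immediately followed by '<img',
# so any whitespace or newline in between makes it fail.
old_re = re.compile('"><img src="([^"]*)"')

# Hypothetical replacement: find an <img> tag that carries id="image"
# (checked with a lookahead, so attribute order does not matter) and
# capture its src value.
new_re = re.compile(
    r'<img\b(?=[^>]*\bid="image")[^>]*?\bsrc="([^"]*)"',
    re.IGNORECASE,
)

# Made-up markup that imitates the newline between the a tag and the img tag.
sample = (
    '<a href="2.html" onclick="return enlarge()">\n'
    '  <img src="http://example.com/001.jpg" id="image" alt="page 1"/></a>'
)

old_match = old_re.search(sample)   # no match because of the newline
new_match = new_re.search(sample)   # matches on the id attribute instead

print(old_match)
print(new_match.group(1))
```

Keying on the id rather than on what precedes the tag should survive whitespace changes in the page, though it will still break if Mangafox renames the id.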