Web crawler with very flexible crawling options. Can either use standalone or can be used with resque to perform clustered crawls.
226
stars
45
forks
source link
Fixed bug where url('xxx') directives in styles were not handled correctly. #2
Closed
rojotek closed 12 years ago
The root cause of the problem was that the regex for parsing url() directives in styles was only checking for double quotes, not single quotes.
To fix this I needed grouping and a complex regex, so I updated the ContentLinkParser to support a proc for the attribute lookup.
A series of three new specs have also been written.