xrma / crawler4j

Automatically exported from code.google.com/p/crawler4j

Resources referenced in css files #137

GoogleCodeExporter closed this issue 9 years ago

GoogleCodeExporter commented 9 years ago
I'd like the software to also return the links found inside CSS files, for example images that are referenced in the CSS:
.moviePanel { !background: url("/global/animated_spinner.gif") 50% 50% no-repeat; }
I need these images as well.

Currently, if I give it a CSS file, it doesn't recognize the URLs inside it as external links.

Original issue reported on code.google.com by boxereng...@gmail.com on 15 Mar 2012 at 10:44

GoogleCodeExporter commented 9 years ago
This is a specific case. You should parse the CSS content yourself and extract 
those URLs. Then you can call controller.addSeed() to add them.

-Yasser

Original comment by ganjisaffar@gmail.com on 16 Mar 2012 at 5:00
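
The extraction step suggested above could look roughly like the following. This is only a minimal sketch, not part of crawler4j: the class name CssUrlExtractor, the method extractUrls, and the example.com base URL are made up for illustration, and the regular expression is a simplification rather than a real CSS parser (it ignores comments, escaped characters, and @import rules written without url()). It uses only the Java standard library, finds url(...) references in a CSS string, and resolves them against the URL of the CSS file.

```java
import java.net.URI;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of crawler4j.
final class CssUrlExtractor {

    // Matches url(...) with optional single or double quotes around the target.
    // Simplification: does not handle CSS comments or escaped characters.
    private static final Pattern CSS_URL = Pattern.compile(
            "url\\(\\s*(['\"]?)([^'\")]+?)\\1\\s*\\)", Pattern.CASE_INSENSITIVE);

    // Returns the absolute URLs referenced via url(...) in the given CSS text.
    // cssFileUrl is the absolute URL the CSS was fetched from; relative
    // references are resolved against it.
    static List<String> extractUrls(String css, String cssFileUrl) {
        URI base = URI.create(cssFileUrl);
        List<String> urls = new ArrayList<>();
        Matcher m = CSS_URL.matcher(css);
        while (m.find()) {
            String ref = m.group(2).trim();
            // Inline data: URIs carry no separate resource to fetch.
            if (ref.isEmpty() || ref.startsWith("data:")) {
                continue;
            }
            try {
                urls.add(base.resolve(ref).toString());
            } catch (IllegalArgumentException e) {
                // Skip references that are not valid URIs.
            }
        }
        return urls;
    }

    public static void main(String[] args) {
        String css = ".moviePanel { !background: url(\"/global/animated_spinner.gif\") "
                + "50% 50% no-repeat; }";
        // Assumed location of the stylesheet, for illustration only.
        for (String url : extractUrls(css, "http://www.example.com/css/site.css")) {
            // Each result could then be passed to controller.addSeed(url),
            // as suggested in the comment above.
            System.out.println(url);
        }
    }
}
```

Resolving against the CSS file's own URL, rather than the URL of the page that linked it, matters because relative paths in a stylesheet are relative to the stylesheet.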