FriendsOfPHP / Goutte

Goutte, a simple PHP Web Scraper
MIT License
9.26k stars 1.01k forks source link

[Question][Is there anyway to to ignoring images and css when crawling] #338

Closed msaus closed 6 years ago

msaus commented 6 years ago

Hello there,

I found that it is pretty slow when there are lots of image on the web site.

Is there anyway to crawl only HTML code, not images itself.

Thanks.

stof commented 6 years ago

Goutte does not load images (unless your own code built on top of it does it).

msaus commented 6 years ago

I see..... Thanks for that. The reason why I am asking this question is that it seems when I crawl following URL, it is very very slow.... Therefore, I am asking this question. https://search.rakuten.co.jp/search/mall/baby/?p=1