fcavallarin / htcap

htcap is a web application scanner able to crawl single-page applications (SPAs) recursively by intercepting Ajax calls and DOM changes.
GNU General Public License v2.0

Proxy not used when crawling on localhost network #19

Closed by GuilloOme 7 years ago

GuilloOme commented 7 years ago

When launching a crawl, it seems that only the start URL and robots.txt are requested through the proxy (during the validation process).

Steps to reproduce:

Crawl finished, 3 pages analyzed in 0 minutes



- I got only 2 hits in the proxy log:
  - `http://…/index.html`
  - `http://…/robots.txt`

GuilloOme commented 7 years ago

After investigating, it seems that PhantomJS does not route requests through the proxy when the target is on the localhost network… I filed bug #14808 with them.

GuilloOme commented 7 years ago

It seems to be a behavior of Qt (the library used by PhantomJS); unfortunately, it is not "fixable"… (see this response)
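
Since the behavior cannot be fixed on the PhantomJS side, one option is to detect the situation before crawling and warn the user. Below is a minimal sketch, not part of htcap: `is_loopback_target` is a hypothetical helper, and it assumes that the affected targets are `localhost` names and loopback IP addresses (the thread does not list the exact hosts Qt excludes from proxying).

```python
import ipaddress
from urllib.parse import urlparse


def is_loopback_target(url):
    """Return True if the URL's host looks like a loopback address.

    Hypothetical helper: the assumption is that these are the targets
    for which the proxy may be bypassed during the crawl.
    """
    host = urlparse(url).hostname
    if host is None:
        return False
    host = host.lower()
    # Names such as "localhost" or "app.localhost"
    if host == "localhost" or host.endswith(".localhost"):
        return True
    try:
        # Covers 127.0.0.0/8 and ::1
        return ipaddress.ip_address(host).is_loopback
    except ValueError:
        # Not an IP literal (a regular host name): not treated as loopback
        return False


if __name__ == "__main__":
    for target in ("http://localhost:8000/index.html", "http://example.com/"):
        if is_loopback_target(target):
            print("warning: proxy may be bypassed for", target)
```

A crawler could call this on the start URL and suggest using a non-loopback host name or address for the target when it returns `True`; whether that actually makes the requests go through the proxy is not confirmed in this thread.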