medialab / hyphe

Websites crawler with built-in exploration and control web interface
http://hyphe.medialab.sciences-po.fr/demo/
GNU Affero General Public License v3.0
329 stars 59 forks source link

Handle crawls through cloudflare hosted website #364

Closed boogheta closed 2 years ago

boogheta commented 5 years ago

For instance www.conservativepapers.com

Maybe just wait for headless crawls to work to handle this

boogheta commented 2 years ago

looks like this website is properly crawled nowadays, closing until encountering a new cloudflare problem