VIDA-NYU / ache

ACHE is a web crawler for domain-specific search.
http://ache.readthedocs.io
Apache License 2.0
444 stars 135 forks source link

Enable Javascript using TorProxy #314

Closed hackrx closed 1 year ago

hackrx commented 1 year ago

Hello team, I am trying to crawl .onion sites and I am getting this in all the HTML files:

<body><noscript><strong>We're sorry but Blog doesn't work properly without JavaScript enabled. Please enable it to continue.</strong></noscript><div id="app"></div><script src="/js/chunk-vendors.a7f219e8.js"></script><script src="/js/app.9b916c85.js"></script></body>

Is there any way to allow javascript execution? I used this guide to setup ACHE: https://ache.readthedocs.io/en/latest/tutorial-crawling-tor.html

aecio commented 1 year ago

No, unfortunately, javascript execution is not yet supported.