Open gabrielgq opened 2 months ago
Hey Gabriel! I was having the same problem also, then I found out that the 0.9.3 updated include the addition of cloudscraper (see changelog). You can read the documentation of cloudscraper library here, it basically modifies requests to bypass Cloudflare. For using it in newspaper4k, you just have to install cloudscraper (pip install cloudscraper), as the code automatically uses it if installed.
Hope it helps!
Thanks, I added cloudscraper but sadly it still doesn't work for the site I mentioned. Did the sample URLs work for you?
CRHOY:
This is a Cloudflare issue so I don't know if this is the right place to post but if anyone can help I'd be vary thankful.
Some sample urls that I have tried
The exact code i used to test this articles/website
Site is protected by Cloudflare I tried more complex methods with readability and selenium, even used 12ft.io and http://txtify.it