Open 00abCoder opened 4 years ago
Changing line 250 of init.py to this solves the problem: challenge, ms = re.search( r"setTimeout(function\s(\s){\s(var " r"\ss,\st,\so,\sp,\sb,\sr,\se,\sa,\sk,\si,\sn,\sg,\sf.+?\r?\n[\s\S]+?a.value\s=.+?)\r?\n" r"(?:[^{<>]},\s*(\d{4,}))?", javascript, flags=re.S ).groups()
Great works , thank you so much. please Tell me, is it necessary to withstand a pause of 5 seconds between requests?
Seems it is not necessary, I run the following code and it's returning the same content on all of them:
import cfscrape
url = "https://techblog.willshouse.com/2012/01/03/most-common-user-agents"
scraper = cfscrape.create_scraper()
contents = []
for i in range(5):
content = scraper.get(url).content
contents.append(content)
is it necessary to withstand a pause of 5 seconds between requests?
that might depend on the site and how much you request
Changing line 250 of init.py to this solves the problem: challenge, ms = re.search( r"setTimeout(function\s(\s){\s*(var " r"\s_s,\s_t,\s_o,\s_p,\s_b,\s_r,\s_e,\s_a,\s_k,\s_i,\s_n,\s_g,\sf.+?\r?\n[\s\S]+?a.value\s=.+?)\r?\n" r"(?:[^{<>]},\s(\d{4,}))?", javascript, flags=re.S ).groups()
@00abCoder @Anorov Thanks a lot, it's useful, so I pull a request to master branch : https://github.com/Anorov/cloudflare-scrape/pull/360
Same problem again
ValueError: Unable to identify Cloudflare IUAM Javascript on website. Cloudflare may have changed their technique, or there may be a bug in the script.
challenge, ms = re.search( r"setTimeout(function\s*(\s*){\s*(var " r"\s_s,\s_t,\s_o,\s_p,\s_b,\s_r,\s_e,\s_a,\s_k,\s_i,\s_n,\s_g,\s_f.+?\r?\n[\s\S]+?a.value\s_=.+?)\r?\n" r"(?:[^{<>]},\s(\d{4,}))?", javascript, flags=re.S ).groups()
does not work any more
I'm facing the same problem. nothing seems to be working
This project is abandoned, and the lib had broken. See #406
Before creating an issue, first upgrade cfscrape with
pip install -U cfscrape
and see if you're still experiencing the problem. Please also confirm your Node version (node --version
ornodejs --version
) is version 10 or higher.Make sure the website you're having issues with is actually using anti-bot protection by Cloudflare and not a competitor like Imperva Incapsula or Sucuri. And if you're using an anonymizing proxy, a VPN, or Tor, Cloudflare often flags those IPs and may block you or present you with a captcha as a result.
Please confirm the following statements and check the boxes before creating an issue:
pip install -U cfscrape
Python version number
Run
python --version
and paste the output below:cfscrape version number
Run
pip show cfscrape
and paste the output below:Code snippet involved with the issue
Complete exception and traceback
(If the problem doesn't involve an exception being raised, leave this blank)
URL of the Cloudflare-protected page
https://techblog.willshouse.com/2012/01/03/most-common-user-agents
URL of Pastebin/Gist with HTML source of protected page
[LINK GOES HERE]