Anorov / cloudflare-scrape

A Python module to bypass Cloudflare's anti-bot page.
MIT License
3.33k stars 455 forks source link

cloudflare issue #456

Open muhammedalisahan opened 1 year ago

muhammedalisahan commented 1 year ago

Before creating an issue, first upgrade cfscrape with pip install -U cfscrape and see if you're still experiencing the problem. Please also confirm your Node version (node --version or nodejs --version) is version 10 or higher.

Make sure the website you're having issues with is actually using anti-bot protection by Cloudflare and not a competitor like Imperva Incapsula or Sucuri. And if you're using an anonymizing proxy, a VPN, or Tor, Cloudflare often flags those IPs and may block you or present you with a captcha as a result.

Please confirm the following statements and check the boxes before creating an issue:

Python version number

Run python --version and paste the output below:

Python 2.7.18

cfscrape version number

Run pip show cfscrape and paste the output below:

Name: cfscrape
Version: 2.1.1
Summary: A simple Python module to bypass Cloudflare's anti-bot page. See https://github.com/Anorov/cloudflare-scrape for more information.
Home-page: https://github.com/Anorov/cloudflare-scrape
Author: Anorov
Author-email: anorov.vorona@gmail.com
License: UNKNOWN
Location: /home/sshuser/.local/lib/python3.9/site-packages
Requires: requests
Required-by: 

Code snippet involved with the issue

        url = "https://www.investing.com/commodities/us-cotton-no.2"
        session = requests.Session()

        params = {
        "curr_id": 8851,
        "smlID": str(randint(1000000, 99999999)),
        "header": "US Cotton #2 Futures Historical Data",
        "interval_sec": "Daily".capitalize(),
        "sort_col": "date",
        "sort_ord": "DESC",
        "action": "historical_data",
    }
        head = {
            "User-Agent":"Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.5; en-US; rv:1.9.1b3) Gecko/20090305"
    " Firefox/3.1b3 GTB5",
            "X-Requested-With": "XMLHttpRequest",
            "Accept": "text/html",
            "Accept-Encoding": "gzip, deflate",
            "Connection": "keep-alive",
        }
        scrapers = cfscrape.create_scraper(
            sess=session,
            delay=10
        )
        print(scrapers.get(url,headers=head,data=params).content)

Complete exception and traceback

(If the problem doesn't involve an exception being raised, leave this blank)

URL of the Cloudflare-protected page

https://www.investing.com/commodities/us-cotton-no.2

URL of Pastebin/Gist with HTML source of protected page

https://dpaste.org/aiJg9

lord8266 commented 1 year ago

Well this project is dead, cloudfare is too op now