Open sunnypandas opened 7 months ago
如题所示: page = ChromiumPage() page.download_set.save_path(download_folder) page.get('https://projects.propublica.org/nonprofits/download-filing?path=05_2021_prefixes_13-13/132947386_202006_990_2021051818124406.pdf') 可以成功打开,但是: page = ChromiumPage() page.download_set.save_path(download_folder) page.download('https://projects.propublica.org/nonprofits/download-filing?path=05_2021_prefixes_13-13/132947386_202006_990_2021051818124406.pdf') 无法下载,报出了403错误
想咨询下download的实现逻辑是没有饶过cloudflare吗?
谢谢。
download()功能是用requests封装的,如果需要headers等参数,需要自己写进去。
您好,感谢回复,绕过CF的话需要什么样的headers呢,谢谢
如题所示: page = ChromiumPage() page.download_set.save_path(download_folder) page.get('https://projects.propublica.org/nonprofits/download-filing?path=05_2021_prefixes_13-13/132947386_202006_990_2021051818124406.pdf') 可以成功打开,但是: page = ChromiumPage() page.download_set.save_path(download_folder) page.download('https://projects.propublica.org/nonprofits/download-filing?path=05_2021_prefixes_13-13/132947386_202006_990_2021051818124406.pdf') 无法下载,报出了403错误
想咨询下download的实现逻辑是没有饶过cloudflare吗?
谢谢。