Closed ilgiaco closed 9 months ago
Ok, it seems they are filtering by User-Agent, so just setting it to a different value seems to work for me. At the end of IMDB.yml file I put this config:
driver:
headers:
- Key: User-Agent
Value: PostmanRuntime/7.32.3
hello,
i just tried the IMDB scraper and had the same error as @ilgiaco. as you mentioned i added your code into IMDB.xml
but now i get this error:
scrapeSinglePerformer[0]: input: scrapeSinglePerformer[0] must not be null
any ideas?
Hi @swoop124 I don't know why you got this error. Here you can find my entire IMDB.yml file (maybe just your typo, but it's not an xml file). You have to unzip because github doesn't let me upload yml files. If you get the same problem you may post the link you're trying to scrape from. IMDB.zip
Hi @swoop124 Here is the solution.
import requests
headers = {"User-Agent": "PostmanRuntime/7.32.3"}
driver = {"headers": [headers]}
res = requests.get('https://www.imdb.com/chart/top/', headers=headers)
print(res.status_code)
Imdb scraper not working. I get "http error 403:Forbidden" whenever I do both query and url scraping.
Stash version: v0.20.2 CommunityScrapers version: latest as of today (
047fe0c
)Other scraper are working well. Can someone confirm the issue?