Open chriscarrollsmith opened 10 months ago
Note that this is not an issue with extractus. Version 1 of the rss-fetch-action, which used isomorphic-fetch, also fails:
- name: Fetch RSS Feed
  uses: Promptly-Technologies-LLC/rss-fetch-action@v1
  with:
    feed_url: https://babafaqirchand.substack.com/feed
    file_path: ./src/components/ui/RssFeed.json
    remove_last_build_date: true
I have also tried a Windows runner rather than an Ubuntu runner, but still got the same Error 403.
Honestly, it seems like Substack may have specifically blocked GitHub Actions runners for some reason. I'm not sure why they would do this (maybe IP concerns about Substack content appearing on GitHub, or abusive high-frequency requests?) or how they would go about it (some kind of CORS/IP blocking?), but it's my current best guess.
Scraping Substack with extractus works on a home PC, but it does not work from a GitHub Actions runner. For reasons I don't fully understand, Substack began returning a 403 Forbidden error at 7 PM EST on January 15, 2023. Here is a reproducible example:
I have tried adding custom headers, but without success.
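For anyone who wants to repeat the custom-headers experiment, here is a rough sketch of what such a probe might look like. It is not the code I actually ran: the helper names `browserLikeHeaders` and `probeFeed`, the `FetchLike` type, and the specific header values are all assumptions made for illustration. The fetch function is passed in so the probe can be exercised without network access.

```typescript
// Minimal shape of a fetch-style function (hypothetical type, for illustration).
type FetchLike = (
  url: string,
  init: { headers: Record<string, string> }
) => Promise<{ status: number; ok: boolean }>;

// Browser-like request headers. These values are assumptions, not the
// headers from the original attempt.
function browserLikeHeaders(): Record<string, string> {
  return {
    "User-Agent":
      "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0 Safari/537.36",
    "Accept": "application/rss+xml, application/xml;q=0.9, */*;q=0.8",
    "Accept-Language": "en-US,en;q=0.9",
  };
}

// Request the feed with the headers above and return the HTTP status,
// so a 403 shows up explicitly in the workflow logs.
async function probeFeed(
  url: string,
  fetchImpl: FetchLike
): Promise<{ status: number; ok: boolean }> {
  const response = await fetchImpl(url, { headers: browserLikeHeaders() });
  return { status: response.status, ok: response.ok };
}
```

On Node 18 or later, the built-in `fetch` can be passed as `fetchImpl`, for example `probeFeed("https://babafaqirchand.substack.com/feed", fetch)`. If the block is IP-based, a probe like this would still be expected to report 403 from a runner regardless of the headers sent.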