-
Will this project be compatible with Manifest V3?
-
Not sure, but I am a first time user of this scraper. I ran into the issue "It appears that thegamesdb.net isn't up". Looking at the code, it appears that the scraper attempts to GET `http://thegamesd…
-
Just discovered this project via HN, super cool btw. I've always wanted something like this to exist.
I wonder if you already have enough examples of natural language -> query to fine tune a transf…
-
It seems that VSCO has been migrating their site to sit behind Cloudflare, as I've encountered various issues these past few weeks where Cloudflare was blocking content. Starting this past Friday, I n…
Tyen3 updated
3 weeks ago
-
Is it possible to get them original title
-
Hi! If I understand correctly, this code relies heavily on the Estonian language version of the site, since all the parsing stuff is hardcoded in estonian. Any plans for a universal parser, with optio…
-
I haven't checked the other scrapers as close, but the Amazon one seems like it has issues (aside from #51), which could potentially miss an item or be delayed. Here is what I am seeing. I put a sin…
-
Hi, would anyone be able to help me with this issue - cloudfare is detecting me. Thanks
-
A list of potential resources we might generate during the Sprint:
- [ ] workshop material on at-risk data collection and archiving
- [ ] collecting data stories (why does public data matter to yo…
-
I have instaleld the package, installed requirements in new VE, set configs and ran:
python substack_scraper.py --url https://substack.com/@vertox --directory F:/substack/vertox --premium
I get …