-
We need to scrape the ICRC website to retrieve key information about their operations. The goal is to extract details on:
1. **Countries with ICRC Presence**
2. **Countries with Key ICRC Operatio…
-
It'd be super useful being able to see freelancer experience, rates, etc to assess the market/competition as well as who to hire if an employer
-
-
Board reports [2024-0556](https://webapi.legistar.com/v1/metro/matters/?$filter=MatterFile%20eq%20%272024-0556%27) and [2024-0549](https://webapi.legistar.com/v1/metro/matters/?$filter=MatterFile%20eq…
-
-
For media scraping modules suggestion: turn off GIF.
GIFs are spam and usually useless and take memory a lot.
Put why not also: turn on/off images, turn on/off videos.
^ If user just wants vi…
-
Our scraper is designed to keep track of content from multiple Bitcoin-related sources, including the Bitcoin Stack Exchange. Currently, however, we haven’t received new data from the Bitcoin Stack Ex…
-
Issue is to track efforts to improve the web scraping pipeline.
- [ ] Implement Pycookie
- [ ] Implement checks for custom scraper integration (if URL matches a predefined list, use the scraper fo…
-
A potential customer reached out with a request to scrape the transcript from the video page.
The transcript appears after clicking on "Show Transcript button" within the description part of the vide…
-
"I'm using the crawl endpoint and one of the URLs it discovered is https://www.gamweb.com/assets/files/lsk.pdf, however, I get a "Invalid PDF structure" error when the page is scraped by FireCrawl. I …