-
### Source name
Google Drive
### Source link
https://drive.google.com
### Source language
All
### Other details
I open this from the perspective that if there are images that can be downloaded …
-
### Discussed in https://github.com/jgm/pandoc/discussions/9729
Originally posted by **Watterry** May 7, 2024
I use _link-citations: true_ to generate Hugo markdown (or you can say markdown_s…
-
Gousto is in some ways an optimal source for recipes. For each recipe (https://www.gousto.co.uk/cookbook/vegetarian-recipes/3-cheese-veg-packed-pasta-bake) they have a public API which provides JSON d…
tboby updated
1 month ago
-
ENVIRONMENT: production
URL: https://scrapers.peviitor.ro/src/tmax/index.html#
Browser: Google Chrome
Device: Laptop
OS: Windows 10
STEPS TO REPRODUCE:
Open the URL in the browser
Run scr…
-
@jamesturk Hi James, it's me Hailey. We worked together last year on the City Bureau City Scrapers project (I was building out the scrapers for Fresno).
Wanted to get your guidance on something. T…
-
Proposed issues for time and planning:
Issue #2: Develop Logger Module
Description: Develop logger.py to handle all logging needs of the application, including cycle start/end times, next cycle …
-
To test and help discuss the relevance and exact content of the data.
Help define the mapping from external sources, via Python scrapers, Google Refine scripts, or other APIs...
Likely a Google Sheet.
-
### Motivation
In the past few months, we have seen AI scraping bots become more and more prevalent, especially ClaudeBot. Personally, I've seen it do as much as 10K requests in 24 hours, with some…
-
The minute scrapers are running locally right now and require a manual backup to Google Drive.
This is not a feasible solution long-term and we should investigate automating the process, including…
-
I've noticed that some spiders are more API wrappers than traditional scrapers.
We should probably organize or tag these by type, and have some documentation about the differences, how you might d…