-
#p3nnywh1stl3 on Discord had a great suggestion for which tags to exclude to get tidy content from a website:
["script", "style", "nav", "header", "footer", ".advertisement", ".sidebar", ".nav", …
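To illustrate what excluding these tags buys, here is a minimal sketch using only Python's standard-library `html.parser`. It is not how the scraper itself applies the list. The names `TidyTextExtractor` and `tidy_text` are made up for this example, and only the entries visible in the truncated list above are included.

```python
from html.parser import HTMLParser

# Only the entries visible in the (truncated) suggestion are used here.
EXCLUDED_TAGS = {"script", "style", "nav", "header", "footer"}
EXCLUDED_CLASSES = {"advertisement", "sidebar", "nav"}
# Void elements never get a closing tag, so they must not start a skipped subtree.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class TidyTextExtractor(HTMLParser):
    """Collects text while skipping excluded tags and classes (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self._skip_tag = None   # tag name that opened the subtree being skipped
        self._skip_depth = 0    # nesting depth of that same tag name
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if self._skip_tag is not None:
            # Inside a skipped subtree: only track nesting of the same tag name.
            if tag == self._skip_tag:
                self._skip_depth += 1
            return
        if tag in VOID_TAGS:
            return
        classes = set((dict(attrs).get("class") or "").split())
        if tag in EXCLUDED_TAGS or classes & EXCLUDED_CLASSES:
            self._skip_tag = tag
            self._skip_depth = 1

    def handle_endtag(self, tag):
        if self._skip_tag is not None and tag == self._skip_tag:
            self._skip_depth -= 1
            if self._skip_depth == 0:
                self._skip_tag = None

    def handle_data(self, data):
        # Keep non-empty text that is outside any skipped subtree.
        if self._skip_tag is None and data.strip():
            self.chunks.append(data.strip())


def tidy_text(html: str) -> str:
    parser = TidyTextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)
```

Calling `tidy_text(page_html)` returns the remaining text joined by single spaces. Class matching here is a plain token comparison, which is far simpler than real CSS selector support.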
-
I am adding this just for your awareness; it would probably require a major overhaul of the code. There is another site, Exophase.com, that has similar import and search criteria. There is al…
-
This is a list of some cool community projects that should be added to Awesome-Transit:
- [ ] https://github.com/roughconsensusandrunningcode/TrainMonitor
- [ ] https://github.com/fredlockheed/db-f…
-
Sync and validate PAGASA regions, provinces and municipalities (names) data with the Philippine Standard Geographic Code (PSGC) data.
### Reference
[[1]](https://psa.gov.ph/classification/psgc/)…
-
After some investigation, I figured out that BIR's mobile app uses the `https://webservice.bir.no/api/` API. To be able to use the API, we will first have to log in using POST `https://webservice.bir.…
-
**Describe the Bug**
On the self-hosted service, scraping certain web pages returns results with the wrong encoding, while the official online demo handles the same pages fine.
**To Reproduce**
Steps to reproduce the …
-
For now, the hackathon information comes from just one website, `HackClub`, which has an API to retrieve the info, but we also need hackathon information from other websites, such as:
- MLH
- devpost
- devfolio
…
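One possible shape for this, as a rough sketch only: give every website its own small source class behind a common interface and merge the results. All names here (`Hackathon`, `HackathonSource`, `StaticSource`, `aggregate`) are hypothetical, and `StaticSource` returns canned records instead of calling any real site or API.

```python
from dataclasses import dataclass
from datetime import date
from typing import Iterable, List, Protocol


@dataclass(frozen=True)
class Hackathon:
    name: str
    start: date
    url: str
    source: str  # which website the record came from


class HackathonSource(Protocol):
    """Anything with a fetch() that yields Hackathon records."""

    def fetch(self) -> Iterable[Hackathon]: ...


class StaticSource:
    """Stand-in for a real per-site fetcher; returns the records it was given."""

    def __init__(self, records: Iterable[Hackathon]):
        self._records = list(records)

    def fetch(self) -> Iterable[Hackathon]:
        return list(self._records)


def aggregate(sources: Iterable[HackathonSource]) -> List[Hackathon]:
    """Merge all sources, keeping the first record per (name, start date)."""
    seen = {}
    for source in sources:
        for hackathon in source.fetch():
            key = (hackathon.name.strip().lower(), hackathon.start)
            seen.setdefault(key, hackathon)
    return sorted(seen.values(), key=lambda h: (h.start, h.name.lower()))
```

With this layout, adding another website means writing one more class with a `fetch()` method. Deduplicating on name and start date is an assumption for the sketch; real listings may need fuzzier matching.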
-
general section
- getting started (open source) - (stack, general advice on getting a dev build running with the prod API, list of active projects, explainer on open source vs proprietary tasks)
- fronten…
-
https://github.com/mendableai/firecrawl/blob/5ab52854b9d49983f1e2ce10e4b4fb8585c8de42/apps/api/src/scraper/WebScraper/crawler.ts#L299
-
## Problem
BlankerL's API is toast due to the sheer volume of requests. I've forked the webscraper code and begun installing it on my own system, where I can run it indefinitely.
Once this is done, I s…