-
This action should:
1. Run the whole pipeline
2. Run CI checks on the released ontology (I will supply these)
3. Make a release with the newly created file
4. If there are any supporting files created b…
-
Title.
https://github.com/traceloop/openllmetry
**Add Metrics For:**
- [ ] Video ingestion pipeline
- [ ] Audio ingestion pipeline
- [ ] Podcast ingestion pipeline
- [ ] Ebook ingestion pipeli…
-
## Describe the bug
Since yesterday I only get this error.
```
Error: Error when parsing watch.html, maybe YouTube made a change.
Please report this issue with the "1726751716488-watch.html" file …
-
The GitHub advisories are somewhat weird:
1. The GraphQL API data requires authentication and is incomplete (it does not contain external references)
2. The HTML data at https://github.com/advisories contains…
-
A customer reported an issue when scraping a place.
Place ID: `HAtz-IYcrY7rhD61TpTZqg` or `midlands-landscape-and-lawn-lexington`
This is a valid place on Yelp: https://www.yelp.com/biz/HAtz-IYc…
-
For some reason argu.co is getting a lot of function invocations, perhaps because the old site was relatively popular and some machines are now scraping it. The functions should not be invoked though, w…
-
There are many websites with details of Tamil books:
- https://www.projectmadurai.org/
- https://www.panuval.com
- tamilvu.org
- https://www.tamildigitallibrary.in/
- freetamilebooks.com
- https:/…
-
### Issue
When /web is used with httpx, it raises an error on an HTTP 302 redirect and doesn't scrape the content.
When /web is used with Playwright it follows redirects without complaint. I think th…
-
One useful plugin is puppeteer-extra-plugin-stealth. It lets you do test automation and web scraping without getting blocked.
https://github.com/berstend/puppeteer-extra/tree/master/packages/…
-
Scrape a personal GitHub profile to automatically import it into the Isomo dashboard.