-
Raw scraped data from Issue #4 would need to be processed before it can be used for training the models. We need a module that aggregates the raw data into a single dataset (.csv file) containing the …
-
### How are you running AnythingLLM?
Docker (local)
### What happened?
First of all, I love the idea of recursively scraping a lot of content via a bulk link scraper.
I think it needs to be ret…
-
Hi, first of all thank you for the code!
I am, however, having the problem that when scraping multiple pages of reviews for the same product, only the first page gets scraped. The other pages get "sc…
-
Hi,
I'm contacting you from the Department of Health in Ireland in relation to the data source quoted for Ireland which appears to be incorrect. For example the link provided for 2015-2019 (https:…
-
**Describe the bug**
A game cannot store over 1.4MB of data in a link as it starts deleting the ending lines of code.
**Reproduction Steps**
1. Make a game with lots of rules to check and exceed …
-
Hi, thank you for your hard work on Frigate.
The `/api/stats` API currently exposes fields like:
```
alley-left: {
audio_dBFS: 0,
audio_rms: 0,
camera_fps: 5.1,
capture_pid: 9…
-
Probably because every page has a `'//*[@id="game_abandonned"]'` present =(
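
To illustrate why a presence check on that id cannot tell pages apart, here is a minimal sketch using only the Python standard library. It assumes (this is not confirmed in the thread) that ordinary pages carry the element but hide it with an inline `display: none` style. The two sample pages and the helper names `has_marker` and `marker_is_shown` are hypothetical.

```python
import xml.etree.ElementTree as ET

ELEMENT_ID = "game_abandonned"  # id quoted in the comment above

# Hypothetical sample pages: both contain the element, only one shows it.
ACTIVE_PAGE = (
    '<html><body><div id="game_abandonned" style="display: none"/>'
    "</body></html>"
)
ABANDONED_PAGE = (
    '<html><body><div id="game_abandonned">Game abandoned</div>'
    "</body></html>"
)


def find_marker(page_xml: str):
    """Return the first descendant element with the marker id, or None."""
    root = ET.fromstring(page_xml)
    return root.find(f".//*[@id='{ELEMENT_ID}']")


def has_marker(page_xml: str) -> bool:
    """Presence check: True whenever the element exists, hidden or not."""
    return find_marker(page_xml) is not None


def marker_is_shown(page_xml: str) -> bool:
    """Stricter check: the element exists and is not hidden inline.

    Assumption: hiding is done via an inline 'display: none' style.
    """
    node = find_marker(page_xml)
    if node is None:
        return False
    style = node.get("style", "").replace(" ", "").lower()
    return "display:none" not in style


if __name__ == "__main__":
    for name, page in [("active", ACTIVE_PAGE), ("abandoned", ABANDONED_PAGE)]:
        print(name, has_marker(page), marker_is_shown(page))
```

If the real pages hide the element some other way (a CSS class, or a script), the second check would need to be adapted accordingly.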
-
### Description:
We have several websites containing Tibetan literature data that need to be scraped to gather as much valuable information as possible for training our LLM. The task involves not only…
-
Hello,
I posted a comment about this, but I think it would be better to open a new issue. When using version 2.9.0 on Manjaro with Chromium, I get a bunch of null values when trying to scrape compa…
-
For the new instructor chip we built, we need to know how long the prof. has been at OU. But given the current data, we can only know 6 years back in the scraped dataset. Would this be a relatively ea…