-
## Issue
Automated tests do not cover the core business logic (CSV upload, scraping).
## Expected
While 100% test coverage is not required for this code challenge, all critical paths of the a…
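
As a rough illustration of what covering the CSV upload path could look like, here is a minimal sketch in Python. `parse_csv_upload` is a hypothetical stand-in for the project's real upload logic, and the column names are assumptions, not taken from the codebase.

```python
import csv
import io


def parse_csv_upload(text):
    """Hypothetical stand-in for the upload logic: parse CSV text into row dicts."""
    reader = csv.DictReader(io.StringIO(text))
    if not reader.fieldnames:
        # An empty upload has no header row to map columns from
        raise ValueError("uploaded CSV has no header row")
    # Drop rows in which every value is blank
    return [row for row in reader if any(str(v or "").strip() for v in row.values())]


def test_parses_valid_rows():
    # Assumed columns "name" and "url"; the trailing "," row is blank and is skipped
    rows = parse_csv_upload("name,url\nExample,https://example.com\n,\n")
    assert rows == [{"name": "Example", "url": "https://example.com"}]


def test_rejects_empty_upload():
    try:
        parse_csv_upload("")
    except ValueError:
        return
    raise AssertionError("expected ValueError for an empty upload")
```

Both tests are written as plain functions so they can be collected by a runner such as pytest or called directly.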
-
using:
```
# Fragment Scrape
def scrape_scene(self, source, input):
    if isinstance(source, str):
        source = {"scraper_id": source}
    if isinstance(input, (str, int)):
        input = {"scene_id":…
```
-
Hi!
First of all, thank you very much for creating this amazing repo.
I was wondering if it is possible to choose just the fields we want to scrape instead of scraping the entire list available.…
-
There can be two scrapers using one Prometheus receiver: `external.signoz.io/scrape` and `prometheus.io/scrape`. The first is enabled by default, while the second is optionally enabled using p…
-
To Recode-Hive,
I hope you're well. I've reviewed our web scraping script and identified two key areas for improvement:
1. OCR Accuracy: Inconsistent text extraction due to varying screenshot qu…
-
### Component(s)
receiver/prometheus
### What happened?
## Description
Scraping a Prometheus pushgateway with `honor_labels: true` results in a scrape endpoint failure. Suspect this is due to t…
-
### What's wrong?
Most Prometheus exporters set the `instance` label to [the hostname where Alloy runs](https://github.com/grafana/agent/blob/main/internal/component/prometheus/exporter/exporter.go…
-
![image](https://github.com/wissemkarous/webscraping/assets/115191512/f8922cec-d2b9-43de-a35e-a62dd51000ca)
-
## Why
While the current demo code allows searching news via NewsAPI, NewsAPI has only one crypto news provider (CCN). It also doesn't allow for fine-grained semantic and time-based search tha…
-
### Brand name
LPG BioMarkt
German organic supermarket chain
### Wikidata ID
Q107983669
https://www.wikidata.org/wiki/Q107983669
https://www.wikidata.org/wiki/Special:EntityData/Q1079836…