-
There are often reasons to make multiple separate RQs in one Crawlee project (e.g., having `CheerioCrawler` for processing most of the pages and a separate keep-alive `PlaywrightCrawler` instance for …
-
Currently, our web frontend is not very SEO friendly. It is entirely written in JS, which is fine for modern web crawlers, but it violates some best practices.
**Enhancement:** The frontend should …
-
I've been noticing I had some followers who came and left again some days later. After doing some fast research why someone would do this, I found tools like these: https://github.com/imraazrally/Gree…
-
It would be nice to be able to share a downloaded website with the rest of the world at an online location that can be visited by a web browser.
The most widely available (and cheapest) kind of web…
-
@shriphani Awesome project!
I noticed you're using `clj-http` to [pull page content](https://github.com/shriphani/pegasus/blob/master/src/pegasus/queue.clj#L18), but what are your thoughts on making …
-
12876 08:59:22 ERROR Attempted to load invalid xml.
Exception: System.Xml.XmlException
Message: Data at the root level is invalid. Line 1, position 1.
Source: System.Xml
at System.Xml.XmlTextReader…
-
I got this error when trying to launch AIL
Anyone have had experience with this? Please let me know
```
Traceback (most recent call last):
File "./Flask_server.py", line 36, in
from bluepr…
-
**Description**
Elements should not only be defined clearly how they look on the page, but also programmatically. Especially for assistive technology users in order to understand with what element he/…
-
**Description**
Focus order should be in sequence and visible in order for user to know on which element he/she is at the moment. Especially for the users, using only keyboard.
**Preconditions**
Stat…
-
Hi,
I recently tried downloading the dataset, but it comes out such error:
```
ERROR: [youtube] ---LyFa59Go: Sign in to confirm you’re not a bot. This helps protect our community
```
It seems …