-
Got a site failing with this error:
```
Adhoc task failed: tool_crawler\task\adhoc_crawl_task,rawurlencode(): Argument #1 ($string) must be of type string, array given
Backtrace:
line 522 of /li…
-
How can I change the user_agent in PlaywrightCrawler?
Here's what I tried:
```python
from crawlee.playwright_crawler import PlaywrightCrawler
crawler = PlaywrightCrawler(browser_options={'user…
-
**Describe**
A clear and concise description of what the bug is.
After migrating to 1.11.28
When I try to download a certificate in PDF i Get an error 500
generating html certificate is OK
…
-
I'm planning to add a smart crawler that takes a set of user-defined objectives and continues crawling to satisfy them. Objectives can be a query requiring a sufficient amount of information to answer…
-
We have some "new" (some are few months old ...) CLI argument of browsertrix crawler to consider:
```
--seedFile, --urlFile If set, read a list of seed urls, on
…
-
или web crawler
-
Implement some way to stop crawler in obvious and controlled way from the user function. It should properly shutdown all resources and immediately stop crawler to send any requests. It should be mirro…
-
The library is sick
It would be an beautiful addon if we can add an selenium crawler.
Connecting to selenium webdriver or remote drivers
-
Before we start the crawl, we need to test the crawler's performance. So, we need to compare the manually observed groundtruth with the analysis results. We probably need a 100-site test set.
- Ho…
-
Consider this sample program:
```python
import asyncio
from crawlee.configuration import Configuration
from crawlee.parsel_crawler import ParselCrawler, ParselCrawlingContext
async def de…