-
#### Issue Description
I am trying to build and run Sparkler from source, following the example given in the README. I have injected a URL and it is visible in Solr.
I face a problem while …
-
**Describe the bug**
Hi, I love your mod btw... Fine-tuning the crowds brings more realism, and the game can't be boring now. Thank you for working on it.
Now on to the strange bug I just found: a surprisin…
-
As ISPs keep blocking the domains, and each time it happens you have to update the complete app to make that search engine work, if there is any way to fetch working domains from a JSON fil…
-
These are some notes I made on issues raised during the workshop at the IIPC conference:
- The title field is too short - 50 chars is not enough.
- "URLs prefix" scope is wrong/misleading, because th…
-
Hello,
This is a question about the log4j version employed by the Filesystem Collector.
I notice that the most recent release of the Filesystem Collector (2.9.1 at time of writing) uses log4j v…
-
Looking into why the gazebosim.org page is mostly ignored in Google searches, I found some hints suggesting that the gazebosim.org main page is totally invisible to search crawlers:
…
-
How can I change the browser?
```
import asyncio
from crawl4ai import AsyncWebCrawler
import base64
from crawl4ai.extraction_strategy import JsonCssExtractionStrategy
import json
async def …
-
### Description
`scrapy.shell.inspect_response` does not work with the `asyncio` reactor when using the `ipython` shell
### Steps to Reproduce
1. Create a spider with the following contents:
…
-
next_page = response.css('div.paging a::attr(href)')[-1].get()
if next_page is not None:
    yield response.follow(
        next_page,
        cookies=…
-
@erik-anderson [suggested](https://github.com/w3ctag/design-reviews/issues/342#issuecomment-818039680) over on the TAG review thread that we consider technical mechanisms in lieu of the "UA Policy" to…