-
For Pi-hole blocking to work, I think YaCy would need an option to query the DNS server every time a crawl is started rather than use its cache.
That way you can stop the crawling of sites having a Craw…
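
The check suggested above could be sketched roughly as follows. This is not YaCy code. It assumes Pi-hole is the resolver in use and answers blocked domains with the unspecified address (`0.0.0.0` or `::`), and the helper names `is_sinkholed`, `resolve` and `should_crawl` are hypothetical. The operating system's resolver may still cache answers, so this only illustrates the idea of resolving again at crawl start.

```python
import ipaddress
import socket


def is_sinkholed(addresses):
    """Return True if every resolved address is unspecified (0.0.0.0 or ::)."""
    if not addresses:
        return False
    return all(
        # Drop any IPv6 scope suffix such as "%eth0" before parsing.
        ipaddress.ip_address(address.split("%")[0]).is_unspecified
        for address in addresses
    )


def resolve(host):
    """Resolve the host through the system resolver at call time."""
    infos = socket.getaddrinfo(host, None)
    return sorted({info[4][0] for info in infos})


def should_crawl(host):
    """Hypothetical pre-crawl check: skip hosts that fail to resolve or are sinkholed."""
    try:
        addresses = resolve(host)
    except socket.gaierror:
        return False
    return not is_sinkholed(addresses)
```

A crawler would call `should_crawl(host)` once per crawl start instead of relying on an address resolved earlier.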
-
- [x] get a different permalink for each locale
- [ ] expose these permalinks in the client page
- [ ] detect current browser language (or allow any callback to do so)
- [ ] redirect according to t…
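
One possible way to handle the language-detection step is sketched below. It assumes detection happens on the server from the `Accept-Language` request header, which the task list does not specify. The function names and the permalink mapping are hypothetical.

```python
def parse_accept_language(header):
    """Return the language tags of an Accept-Language header, best first."""
    entries = []
    for position, part in enumerate(header.split(",")):
        piece = part.strip()
        if not piece:
            continue
        tag, _, params = piece.partition(";")
        quality = 1.0
        params = params.strip()
        if params.startswith("q="):
            try:
                quality = float(params[2:])
            except ValueError:
                quality = 0.0
        if quality <= 0:
            # q=0 means the language is explicitly not acceptable.
            continue
        # Sort by descending quality, then by original position.
        entries.append((-quality, position, tag.strip().lower()))
    return [tag for _, _, tag in sorted(entries)]


def pick_locale(header, permalinks, default):
    """Pick the first acceptable locale that has a permalink, else the default."""
    for tag in parse_accept_language(header):
        if tag in permalinks:
            return tag
        primary = tag.split("-")[0]
        if primary in permalinks:
            return primary
    return default


# Hypothetical per-locale permalinks for one page.
permalinks = {"en": "/en/about", "fr": "/fr/a-propos"}
locale = pick_locale("fr-CA,fr;q=0.9,en;q=0.8", permalinks, "en")
redirect_target = permalinks[locale]
```

Accepting any callable in place of `pick_locale` would cover the "allow any callback" part of the item.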
-
I've pulled a website with WebHTTrack that is a mix of PDF and HTML files. It seems that localgoogle can only index HTML files. Is there any way to solve the problem?
-
To get started with support for Opt-Out on French-language websites, the following is needed:
* Text that's usually used for links to "Terms Of Service" or "Terms Of Use" or "Conditions Of Use" or …
-
For a site whose articles, for a reason that escapes us, were not indexed by Google, we declared the sitemap in Google's Search Console in the hope (fulfilled) of fixing …
-
**Bug Description**
The summoner 君 on NA (a real account!) cannot be looked up using the by-name endpoint, even though they exist.
**Problem Description**
There is a summoner named `君` on NA …
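
The report does not establish the cause. One thing worth ruling out on the client side is whether the name is percent-encoded as UTF-8 before it is placed in the request path. The sketch below shows that encoding only. The base URL is a placeholder, not the real endpoint, and `build_by_name_url` is a hypothetical helper.

```python
from urllib.parse import quote


def build_by_name_url(base_url, summoner_name):
    """Append the name to the base URL as a percent-encoded path segment."""
    # safe="" also encodes "/" so the name stays a single path segment.
    return f"{base_url.rstrip('/')}/{quote(summoner_name, safe='')}"


# Placeholder base URL (assumption), used only to show the encoded form.
url = build_by_name_url("https://example.invalid/by-name", "君")
```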
-
Hi.
Pages cached by WP Rocket do not reflect the real last-modified date. The date only changes when the page cache is regenerated.
For example:
I write a blog post for Christmas Wishes and I publis…
-
Before we start the crawl, we need to test the crawler's performance by comparing the manually observed ground truth with the analysis results. We probably need a 100-site test set.
- Ho…
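
A minimal sketch of such a comparison, assuming the ground truth and the analysis results can each be reduced to a set of sites flagged as positive. The excerpt does not say what is being detected, so the site names and the `precision_recall` helper are hypothetical.

```python
def precision_recall(ground_truth, detected):
    """Compare manually labeled positives with the crawler's positives."""
    truth = set(ground_truth)
    found = set(detected)
    true_positives = len(truth & found)
    # Precision: share of the crawler's positives that are correct.
    precision = true_positives / len(found) if found else 0.0
    # Recall: share of the real positives that the crawler found.
    recall = true_positives / len(truth) if truth else 0.0
    return precision, recall


# Hypothetical labels for a small test set.
manual = {"a.example", "b.example", "c.example", "d.example"}
crawler = {"a.example", "b.example"}
precision, recall = precision_recall(manual, crawler)
```

With a 100-site test set, the same function would take the manually labeled sites and the crawler's flagged sites as its two arguments.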
-
I am using Debian 8.1, Java 8
{
"name" : "search",
"cluster_name" : "scribeweb",
"version" : {
"number" : "2.2.0",
"build_hash" : "8ff36d139e16f8720f2947ef62c8167a888992fe",
"build_t…
-
http://scrikerouleausfucosan.com/
Searching for anything with the word "boys" in it redirects to that address, which seems dead, followed by a long string.
Edit: any search with "boy" does t…