-
The theme for LD56 is Tiny Creatures.
- Parallelism
- 5 billion fleas
- Breeding Game --> Biological Horrors
- Insect/Tiny Creature collection to perform tasks
- Evolution through collecting sma…
-
### Context
Our documentation is written using Material for MKDocs, is written per-tool, and included in each repo. This means that anyone using the tools _also_ has a local copy of our docs which i…
-
Hello, everyone.
I set up a cluster with three Linux servers using elastic version 8.11.1. The information for each server is as follows: Server 1 ip-10.150.3.12 elasticsearch+kibana, server 2 ip…
-
As stated. When a crawl is running, if a search via the renderer search field is attempted, the web interface locks up completely. Attempts to load the web interface fail, with the browser waiting ind…
-
As identified in the SitemapXML forum support-thread, search engines can't crawl a given site if the above setting exists.
> Recently, Googlebot crawls without cookie.
> If he is force to use co…
-
Crawler seems to be mangling accents:
```
@inproceedings{Begoli_Camacho-Rodriguez_Hyde_Mior_Lemire_2018a, title={Apache Calcite: A Foundational Framework for Optimized Query Processing Over Hetero…
-
Hello lordnahor. Thanks for your sharing of the crawler on git hub.
Recently I try to use the http://www.ics.uci.edu/ as the seed to crawl.
First time I crawl 10 hour to get Persistent.shelve about 1…
-
### Have you read the Contributing Guidelines on issues?
- [X] I have read the [Contributing Guidelines on issues](https://github.com/facebook/docusaurus/blob/main/CONTRIBUTING.md#reporting-new-iss…
B4nan updated
2 years ago
-
Crawlers can now [use existing tables](https://docs.aws.amazon.com/glue/latest/dg/define-crawler.html#crawler-source-type) as a crawler source, which may give us the ability to deprecate our custom pa…
-
Meta issue where I collect some proposals for future work.
- reduce amount of irrelevant results
- [x] Do not follow outlinks of pages that were classified as irrelevant
- [x] Curate a blacklis…
noerw updated
6 years ago