-
-
```
What steps will reproduce the problem?
1. Feeding the crawler a list of websites to crawl.
2. running the crawling operation in a while loop
What is the expected output? What do you see instead?…
-
# Here are all the known issues:
### Multiplayer:
- [ ] Choppy movement when crawling / swimming
- [x] ~~Other players will seem to be crawling when very close to a block or wall. This effect only…
-
```
What steps will reproduce the problem?
1. Feeding the crawler a list of websites to crawl.
2. running the crawling operation in a while loop
What is the expected output? What do you see instead?…
-
```
What steps will reproduce the problem?
1. Feeding the crawler a list of websites to crawl.
2. running the crawling operation in a while loop
What is the expected output? What do you see instead?…
-
I'd like to basically re-open issue #4 and ask for more detailed guidance on crawling https. Simply enabling protocol-httpclient in nutch-site.xml seems to bypass this plugin
-
I do see the the spider is crawling and parsing the webpages.
What about the "rating" and "content" saved at "Item"? is there any way to check whether 'item" is generated?
-
It might be interesting to collect information during crawling, something I would like to know:
- response time
- maximum
- minimum
- average / 95 percentile
- size of response body
- …
-
resolve-order is obviously important for environments, but dependency-order is important (and indirectly exposed by the graphing functionalities and manually crawling through the resolved context), bu…
-
It would be great to support more content loaders, such as:
- Youtube video transcripts
- Direct Website crawling and via Sitemaps
- Google Drive folders
I stumbled upon [embedchain](https://…