-
What kind of issue is this?
- [ ] Question. This issue tracker is not the best place for questions. If you want to ask how to do
something, or to understand why something isn't working the…
-
Hello @jnioche,
we switched our URL and Status handling from a custom bolt to URL-frontier. But I recognized, that the Status-Bold is not acking any tuple. After going into the code, adding some lo…
-
What kind of issue is this?
- [ ] Question. This issue tracker is not the best place for questions. If you want to ask how to do
something, or to understand why something isn't working the…
-
In line 327 of the class is a test »entity.getContentLength()
-
With StormCrawler 2.3-SNAPSHOT, setting "maxDepth": 0 in the urlfilters.json prevents the seed injection into the ES index.
Expected behavior would be that the seeds would be injected and crawled w…
-
Bumps [jsoup](https://github.com/jhy/jsoup) from 1.14.3 to 1.15.1.
Release notes
Sourced from jsoup's releases.
jsoup 1.15.1 is out now with a bunch of improvements and bug fixes.
Change…
-
What kind of issue is this?
- [ ] Question. This issue tracker is not the best place for questions. If you want to ask how to do
something, or to understand why something isn't working the…
-
What kind of issue is this?
- [] Question. This issue tracker is not the best place for questions. If you want to ask how to do
something, or to understand why something isn't working the …
-
If a website has erroneous content we've got a crash of crawler with a java.lang.StackOverflowError in com.digitalpebble.stormcrawler.util.CharsetIdentification.getCharsetFromMeta(CharsetIdentificatio…
-
In version 1.17 the following error message appears in the Storm UI:
` java.lang.RuntimeException: java.lang.IllegalArgumentException: Class is not registered: com.digitalpebble.stormcrawler.persi…