Closed sylvainroussy closed 8 years ago
I just tried this config snippet with 2.5.2 and could not reproduce the error. Not finding the method suggest a invalid library version or similar issue.
Have you just unzipped a fresh copy of version 2.5.2 before trying this? In wonder if you have mixed jsoup library versions in your classpath? I have "jsoup-1.8.3.jar" in the test I just did and I can confirm the class/method reported in your error exists in that Jar. Another thing to consider maybe is the file being corrupted for some reason? I doubt this is the cause, but maybe try downloading it again just in case.
Hello! The version of my jsoup is 1.7.2 :
norconex-collector-http (2.5.0)
> norconex-collector-core (1.5.0)
> norconex-importer (2.5.2)
> tika-parsers (1.12)
> grib (4.5.5)
>jsoup (1.7.2)
No mixing jsoup in my pom.xml or classpath, adding a more recent version of Jsoup works.
Good catch. Norconex Importer had JSoup 1.8.3 as a managed dependency to by pass the version that comes with Tika, but that was not carried through to the HTTP Collector. This is now fixed in the latest snapshot release of HTTP Collector (it shall have JSoup 1.8.3 now).
I am closing this since we have a working fix.
The problem still persists. When using the HTTP collector às mvn dependency the jsoup is still being downloaded in version 1.7.2 and this breakes the DOMSplitter at line 171 String childEmbedRef = elm.cssSelector();
When using Maven, the issue is different, look here for a solution: https://github.com/Norconex/collector-http/issues/356
Thats exactly what I did :)
Hi!
I Get the following exception when I use the DOMSplitter :
java.lang.NoSuchMethodError: org.jsoup.nodes.Element.cssSelector()Ljava/lang/String; at com.norconex.importer.handler.splitter.impl.DOMSplitter.splitApplicableDocument(DOMSplitter.java:151)
With configuration: