Open rjoberon opened 9 years ago
This might be caused by bugs in Hadoop and HDFS, or by the fact that webarchive-commons depends on hadoop-core version 0.20.2-cdh3u6 (while we run CDH5 with version 2.5.0-mr1-cdh5.2.1). I tried to solve the problem using the Maven Shade plugin, unfortunately without success. I first thought this was because the Shade plugin only affects the classes of the current project and not the classes that dependencies bring into the uber JAR, but this answer suggests that this is not true.
These incompatibilities with Guava seem to be a common problem that has no good solution yet. :-(
I am trying to ensure that the libraries packaged in this project's `ia-hadoop-tools-jar-with-dependencies.jar` JAR file are loaded before the other libraries from Hadoop (because the latter contain an incompatible Guava version). Since I am using Hadoop 2, I am using the environment variables `HADOOP_USER_CLASSPATH_FIRST` and `HADOOP_CLASSPATH` to do this:

However, this script always fails with the following output:
It seems that the CDXGenerator does not get the command line arguments. I have no idea why. One guess is that the wrong library to parse the command line arguments is used now or that that library is missing.
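For anyone trying to reproduce this, here is a minimal sketch of the kind of setup described above. It is not the original script. The JAR location (`JOB_JAR`) is a hypothetical path, and the value `true` is an assumption: as far as I know, Hadoop 2 only checks that `HADOOP_USER_CLASSPATH_FIRST` is non-empty. The last step only runs if the `hadoop` command is installed, and lists the Guava JARs in the order they appear on the resolved classpath, which may help to check whether the bundled version really comes first.

```shell
#!/bin/sh
# Hypothetical location of the uber JAR; adjust to your build output.
JOB_JAR="${JOB_JAR:-./ia-hadoop-tools-jar-with-dependencies.jar}"

# Ask Hadoop 2 to put user-supplied entries ahead of its own libraries.
# Assumption: any non-empty value enables this; "true" is used here.
export HADOOP_USER_CLASSPATH_FIRST=true

# Prepend the uber JAR, keeping any classpath that was already set.
export HADOOP_CLASSPATH="$JOB_JAR${HADOOP_CLASSPATH:+:$HADOOP_CLASSPATH}"

echo "HADOOP_CLASSPATH=$HADOOP_CLASSPATH"

# If the hadoop CLI is available, show Guava JARs in classpath order.
if command -v hadoop >/dev/null 2>&1; then
    hadoop classpath | tr ':' '\n' | grep -i guava \
        || echo "no separate Guava JAR on the Hadoop classpath"
fi
```

Note that these variables only influence the JVM started by the `hadoop` command on the client side; whether the same ordering applies inside the map and reduce tasks is a separate question that this sketch does not address.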