When running some M/R jobs that use the TikaProcessor, I was getting ClassNotFound exceptions when targeting CDH 4.1.2.
For whatever reason, CDH does not include this class in their Hadoop core jar. For me, the simplest fix was to patch Behemoth to not use this specific exception but to catch a general Exception instead.
When running some M/R jobs that use the TikaProcessor, I was getting ClassNotFound exceptions when targeting CDH 4.1.2.
For whatever reason, CDH does not include this class in their Hadoop core jar. For me, the simplest fix was to patch Behemoth to not use this specific exception but to catch a general Exception instead.