DigitalPebble / behemoth

Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.
Other
281 stars 60 forks source link

Remove reference to CountersExceededException for compatibility with CDH 4.1 #44

Closed mumrah closed 11 years ago

mumrah commented 11 years ago

When running some M/R jobs that use the TikaProcessor, I was getting ClassNotFound exceptions when targeting CDH 4.1.2.

For whatever reason, CDH does not include this class in their Hadoop core jar. For me, the simplest fix was to patch Behemoth to not use this specific exception but to catch a general Exception instead.

jnioche commented 11 years ago

Thanks David