Closed pbutler closed 9 years ago
Please make sure to configure the 4mc input with: job.setInputFormatClass(FourMcTextInputFormat.class);
You can have a look at related example here: https://github.com/carlomedas/4mc/blob/master/java/hadoop-4mc/src/examples/text/TestTextInput.java
I consider this configuration issue closed, please reopen if reproducing.
I am running Hadoop 2.4.1. I run my jobs through mrjob (if that matters). When I run against an uncompressed file, splits happen and I automatically have more mappers than files. However when I run against .4mc files no splitting occurs. Running
hadoop fs -text file.4mc
works so I know it's decompressing okay and running a job against .4mc files works just no splitting occurs.One other thing I noticed is that if I use the files with the
.lz4_uc
extensionhadoop fs -text file.lz4_uc
using I get the following error:I am not sure if that's related or not.