DigitalPebble / behemoth

Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.
Other
281 stars 60 forks source link

Options to replace input with output of job #14

Open jnioche opened 13 years ago

jnioche commented 13 years ago

The jobs currently generate a new seqfile. it would be great to have a '-r input' option to replace the input with the output if the job is successful. We'd also have a '-i input -o output' for the current behaviour

butlermh commented 13 years ago

This commit adds this functionality https://github.com/butlermh/behemoth/commit/87ce262e9f41a07d1025c98790dcb3f9870591b2