openpreserve/nanite
Nanite - a friendly swarm of format-identifying robots.
openplanets.github.io/nanite/
c3po compatible outputs #23
Closed by willp-bl 10 years ago
willp-bl commented 10 years ago
c3po compatible outputs
store the serialized Tika parser Metadata object in the SequenceFile
begin refactoring to create a separate NaniteHadoop class
use 3.1.2-SNAPSHOT of heritrix-commons so uncompressed ARC files can be read (make this change in the warc-hadoop-recordreader master branch)
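
As a rough illustration of the first item, the sketch below turns a serializable metadata object into a byte array and back, which is the kind of value that could then be wrapped in a Hadoop BytesWritable and appended to a SequenceFile. It is not taken from the nanite source: the MetadataBytes class and its method names are hypothetical, and a plain HashMap stands in for Tika's Metadata class (which implements java.io.Serializable) so that the example needs only the JDK.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.util.HashMap;

// Hypothetical helper for illustration only; not part of nanite.
final class MetadataBytes {

    // Serialize any Serializable object to a byte array using Java serialization.
    static byte[] toBytes(Serializable obj) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(buffer)) {
            out.writeObject(obj);
        }
        // The object stream is closed (and therefore flushed) before we read the buffer.
        return buffer.toByteArray();
    }

    // Restore an object from bytes previously produced by toBytes.
    static Object fromBytes(byte[] bytes) throws IOException, ClassNotFoundException {
        try (ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(bytes))) {
            return in.readObject();
        }
    }

    public static void main(String[] args) throws Exception {
        // Stand-in for a Tika Metadata object (assumption made to keep the example JDK-only).
        HashMap<String, String> metadata = new HashMap<>();
        metadata.put("Content-Type", "application/pdf");

        // These bytes are what a job could store as the record value.
        byte[] value = toBytes(metadata);

        @SuppressWarnings("unchecked")
        HashMap<String, String> restored = (HashMap<String, String>) fromBytes(value);
        System.out.println(restored.get("Content-Type"));
    }
}
```

A downstream consumer such as c3po would need the same class on its classpath to deserialize the bytes, which is the main trade-off of Java serialization compared with a text format.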