At the KB we still have a large collection of ARC files that we would like to be able to inspect using the visualisation tool. Currently, the tool only indexes files with .warc and .warc.gz suffixes.
It seems it should be possible to support the ARC format without too much effort. It would entail widening the file filter criteria and making IndexProcessorWarc.java more general (there's already an IndexProcessorArc.java, but that's not being used and it's probably not necessary to differentiate between WARC and ARC at that level).
At the KB we still have a large collection of ARC files that we would like to be able to inspect using the visualisation tool. Currently, the tool only indexes files with .warc and .warc.gz suffixes.
It seems it should be possible to support the ARC format without too much effort. It would entail widening the file filter criteria and making IndexProcessorWarc.java more general (there's already an IndexProcessorArc.java, but that's not being used and it's probably not necessary to differentiate between WARC and ARC at that level).