Add Apache-CLI (modeled after Admin vocabularies)
-f input file of URLs to be crawled
-o output directory where text will be dumped
-n number of hops (default 0 - first page only)
-m number of terms (default 10)
-d enable differencing
Original issue reported on code.google.com by craig.wi...@unc.edu on 16 Dec 2011 at 5:26
Original issue reported on code.google.com by
craig.wi...@unc.edu
on 16 Dec 2011 at 5:26