Open RvanVeenendaal opened 4 years ago
The documentation of Heritrix's discovery path is in Heritrix's Glossary but indeed it wasn't very discoverable. I have slightly expanded the explanation and added a mention of hopsFromSeed so hopefully it will eventually start turning up in search results now. I agree that since the WARC standard mentions hopsFromSeed it should include an explanation of the values.
Can this be closed now that hopsFromSeed
is documented in the Community Annotations? Or is the motivation here that it should be in a new version of the specification?
While validating WARCs at the National Archives of the Netherlands we encountered the hopsFromSeed field. We could not find an explanation of the values, other than on Twitter or in source code of WARC tools. Please add the possible values (or those known to you) to the documentation. E.g. (from Twitter thread of 2015):