The current WrenderProcessor expects warcprox to be used to capture the rendered resources. Although the quality may suffer, it may be useful to add a mode that lets Heritrix3 (re)download the resources, to make (initial?) deployment simpler.
It would just be a case of enqueueing the other URLs in the request-response entries as E links, and handing processing along the chain rather than skipping the rest of the processors.
Should also gobble up all those delicious cookies...
The current WrenderProcessor expects warcprox to be used to capture the rendered resources. Although the quality may suffer, it may be useful to add a mode that lets Heritrix3 (re)download the resources, to make (initial?) deployment simpler.
It would just be a case of enqueueing the other URLs in the request-response entries as
E
links, and handing processing along the chain rather than skipping the rest of the processors.Should also gobble up all those delicious cookies...