machawk1 / warcreate

Chrome extension to "Create WARC files from any webpage"
https://warcreate.com
MIT License
205 stars 13 forks source link

Consider recording Memento Traces to WARCs #108

Open machawk1 opened 5 years ago

machawk1 commented 5 years ago

Memento Tracer is an up-and-coming concept that facilitates the recording of a set of instruction to capture the "essence" of a Web publication. The intent is that after recording the Trace, it can be uploaded and used as a basis set of instructions for preservation.

WARCreate could attempt to generate this trace and store it to a WARC. This may serve as a sort-of provenance of the preprocessor used (if any) to surface the content for preservation.

Among many other example, see the "unrolling/surfacing" of content in this https://twitter.com/joshuatj/status/1062509176360906753 video capture of @webrecorder (https://github.com/oduwsdl/archivefacebook did something similar).

machawk1 commented 4 years ago

This has a bit of crossover with Webrecorder's Autopilot. Further investigation is needed to determine the differences between the systems.

Ping @ikreymer for interest, as Memento Tracer seems like a means to also generate the specs that are used for Autopilot. Also CC @N0taN3rd, who I am told is partially responsible for Autopilot. Any thoughts appreciated.