-
Right now the only interface for getting at the record content is `record.content_stream().read()`, which is streaming. I can't do that twice. So if I'm passing a record around in a program and want t…
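One possible workaround, sketched under assumptions: read the stream once, keep the bytes, and hand out a fresh in-memory stream on each call. `FakeRecord` and `BufferedRecord` are hypothetical names used only for illustration; the sole thing taken from the issue is the `content_stream().read()` call shape. This is not the library's API.

```python
import io


class FakeRecord:
    """Hypothetical stand-in for a record whose content can only be streamed once."""

    def __init__(self, payload: bytes):
        self._stream = io.BytesIO(payload)

    def content_stream(self):
        # Always returns the same underlying stream, so a second read yields b"".
        return self._stream


class BufferedRecord:
    """Hypothetical wrapper: reads the wrapped record once, then serves from memory."""

    def __init__(self, record):
        self._record = record
        self._content = None

    def content(self) -> bytes:
        # Consume the one-shot stream on first access only.
        if self._content is None:
            self._content = self._record.content_stream().read()
        return self._content

    def content_stream(self):
        # A fresh in-memory stream on every call, so callers can re-read freely.
        return io.BytesIO(self.content())


record = BufferedRecord(FakeRecord(b"<html>hello</html>"))
first = record.content_stream().read()
second = record.content_stream().read()
```

The trade-off is memory: the whole payload is held in RAM, which may be unsuitable for very large records.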
-
A dry-run option that reports all actions without doing any actual archiving would be very useful for tuning this quite complicated task.
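A minimal sketch of what such a flag could look like, assuming a tool that funnels every action through one method. The `Archiver` class, its `dry_run` field, and the example URLs are all hypothetical and not taken from any existing project.

```python
from dataclasses import dataclass, field


@dataclass
class Archiver:
    """Hypothetical archiver: records every planned action, performs it only when not in dry-run."""

    dry_run: bool = False
    log: list = field(default_factory=list)
    archived: list = field(default_factory=list)

    def archive(self, url: str) -> None:
        # The action is always reported, whether or not it is carried out.
        self.log.append(f"archive {url}")
        if self.dry_run:
            return
        # Stand-in for the real archiving work.
        self.archived.append(url)


planner = Archiver(dry_run=True)
for url in ["https://example.org/a", "https://example.org/b"]:
    planner.archive(url)
```

Keeping the reporting and the side effect in the same method means the dry-run output cannot drift from what a real run would do.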
-
This stems from the discussion in https://github.com/datatogether/roadmap/issues/79: ipwb is MIT-licensed but uses some modules from https://github.com/webrecorder/pywb, which is GPLv3.
There have been num…
-
ARCRecord encapsulates streaming of record content, hardening against parsing mistakes. Unfortunately HTTP-headers are processed on the raw ARC-stream, allowing parsing of problematic headers to strea…
tokee updated
9 years ago
-
### Problem
Sharing and distributing a JupyterLite instance for local use currently requires zipping all the files and running a local web server.
### Proposed Solution
It would be nice to lev…
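For reference, the "local web server" step in the Problem statement can be sketched with Python's standard library alone. This assumes the unzipped site is a plain directory of static files; `serve_directory` is a hypothetical helper name, and nothing here is specific to JupyterLite.

```python
import functools
import http.server
import threading


def serve_directory(directory: str, port: int = 0):
    """Serve `directory` over HTTP on localhost in a background thread.

    Port 0 lets the operating system pick a free port; the chosen port is
    available afterwards as server.server_address[1].
    """
    handler = functools.partial(
        http.server.SimpleHTTPRequestHandler, directory=directory
    )
    server = http.server.ThreadingHTTPServer(("127.0.0.1", port), handler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server
```

Call `server.shutdown()` and `server.server_close()` when done. The equivalent one-off command is `python -m http.server --directory <path>`.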
-
On #18 @trifle wrote
> That said, I think the fact that almost everyone seems to need to fork warcproxy for their project is a sign that it might benefit from integrating changes back into the origin…
-
Is there a way to mmap some memory (a file, for example; I can do this simply via the FFI) and use that memory as a bytevector without copying? Some sort of function that sets a bytevector's storage, an…
-
Hi,
Would it be possible/feasible to add metadata extraction to roda-in at SIP creation time?
For users who are not archivists and know little about metadata but would like to create meaningful …
-
OK; I'm using ArchiveBox but want to store all the archived website data on my NAS/NFS share, so I set up an 'archivebox-archive' volume and mounted ArchiveBox's /data/archive to it. I left its /dat…
-
Hello,
I have followed the examples here: https://aws.amazon.com/blogs/compute/introducing-the-c-lambda-runtime/
When I run my internal execution so that it takes 5 seconds everything works fine…