Open PsypherPunk opened 9 years ago
I vote for returning the partial. Could be useful in the case of audio streams for example. We happened to run into such a capture today, where we got a 2 hour snapshot of http://mozart.wkar.msu.edu/wkar-fm-mp3
Wouldn't it make more sense to display an interstitial page explaining the issue and offering a link to the partial content? If we just present the partial content without any context, users are likely to conclude that there is an issue with the archive service.
This is another one of those times I'd prefer a iframe
approach, as the fact that this is known to be damaged from capture could be relayed around the edge.
If that's too difficult, I'd rather we provided an interstitial, but either way, I think we should be able to return the item as-is rather than hide it.
Sure, interstitial is a good idea, although my guess is that truncated urls are usually not gonna be html pages viewed at the top level. The examples mentioned here are a jpeg and an audio stream, so the interstitial wouldn't come into play in these cases (right?)
No, at least not when viewing them as embedded resources. Only when they are accessed directly.
We've just noticed a few
timeTrunc
errors in our crawl logs and the resultingWARC-Truncated: time
headers in our WARC records. Does OpenWayback handle these specifically?At the moment it just seems to silently fail to render anything. I'm not sure, however, what it should be doing. For instance:
Here we've a partial JPEG—should OpenWayback return the partial? Or treat it like a
revisit
and try to find the nearest record (although the hash would be wrong...)?