internetarchive / wayback

IA's public Wayback Machine (moved from SourceForge)
739 stars 132 forks source link

Remove or rewrite subresource integrity hashes #149

Open vincentbernat opened 7 years ago

vincentbernat commented 7 years ago

Hey!

Some sites are using subresource integrity to ensure CDN do not alter the linked resources (notably JS and CSS). Unfortunately, when such a page is archived, CSS and JS are altered to add some comments. This makes SRI hashes inexact and browsers won't load them. See this example.

nightpool commented 7 years ago

Just ran into this myself with some Github pages:

image

It seems to be intermittent (maybe they're doing a partial rollout?) but it's definitely affected some crawls I know of.