let4be / crusty

Broad Web Crawler
GNU General Public License v3.0
83 stars 3 forks source link

Lolhtml check if we can remove elements and if this saves some cpu cycles #31

Closed let4be closed 2 years ago

let4be commented 3 years ago

Right now it does some "wasteful serialization"which we just throw away Yet the lib is so damn fast it doesn't matter...

ideally we would like to completely disable HTML rewriting functionality, but I don't think it's currently possible, see https://github.com/cloudflare/lol-html/issues/91

let4be commented 2 years ago

This does not seem to affect performance