dgtlmoon / changedetection.io

The best and simplest free open source web page change detection, website watcher, restock monitor and notification service. Restock Monitor, change detection. Designed for simplicity - Simply monitor which websites had a text change for free. Free Open source web page change detection, Website defacement monitoring, Price change notification
https://changedetection.io
Apache License 2.0
16.69k stars 924 forks source link

[feature] Block loading of resources in Playwright e.g. stylesheets, fonts, media #2229

Open VonZubinski opened 6 months ago

VonZubinski commented 6 months ago

Not a duplicate but an extension of https://github.com/dgtlmoon/changedetection.io/issues/1476

Version and OS Subscription

Is your feature request related to a problem? Please describe. Loading unoptimised font files and media on a websites can result in increased data transfers and higher proxy costs due to large file sizes and redundant requests.

Describe the solution you'd like Playwright's ability to selectively block resources by type. This would allow for a fine-tuned approach, reducing unnecessary data transfers/carbon footprint and lowering proxy costs. Simple checkboxes under the Request > Advanced Options menu should be sufficient.

Describe the use-case and give concrete real-world examples A website we are monitoring, started to detect the Playwright browser and block our watchers. We are considering to use the proxy services of Brightdata / Oxylabs and are doing cost calculations:

We've noticed, that changes are only happing throughout the daytime and will be pausing the watchers with cronjobs & API calls for 10 hours at nighttime. This will reduce the costs for fonts by 735 $.

The potential cost savings with blocked font resources would be 1035 $ per year.

Additional context

Screenshot 2024-03-03 at 09 28 12
thecheesypizza commented 4 months ago

This would be super handy as I run into this a whole bunch