w3c / webdriver-bidi

Bidirectional WebDriver protocol for browser automation
https://w3c.github.io/webdriver-bidi/
336 stars 35 forks source link

Add network.setCacheBypass command #721

Open jgraham opened 1 month ago

jgraham commented 1 month ago

This currently only allows setting the request cache mode to 'no-store' which bypasses the cache entirely. If we have future use cases for setting different cache modes, this appraoch would be extensible to those use cases.


Preview | Diff

jgraham commented 1 month ago

Draft Fetch PR: https://github.com/whatwg/fetch/pull/1756

jgraham commented 1 month ago

One question here is whether this interacts with cache-mode specific headers in the expected way. In particular this will end up setting the headers so that not only do we not use the local cache, but we also don't populate any HTTP caches for this request.

OrKoN commented 4 weeks ago

@sadym-chromium would you mind reviewing?

juliandescottes commented 3 weeks ago

Had a quick look at how Firefox behaves when disabling cache from DevTools (which we intend to reuse for BiDi).

jgraham commented 3 weeks ago

So I'm not really sure what to do with this.

It seems like talking in terms of the cache mode has made an interop difference visible: Firefox neither reads from nor writes to the cache, but Chrome does write to it. That would affect what happens if you do a no-cache request followed by a cached request.

However for the non-HTTP caches, working purely in terms of the cache mode on the request seems like it maybe doesn't work well? Per fetch it seems like requests use the CORS preflight cache irrespective of their cache-mode. So we could have a webdriver-specific rule that we should clear the preflight cache for a request if the override mode is, say, no-store or reload, but then the WebDriver API has different behaviour from the DOM API. Or we could have a separate flag to bypass the CORS cache (i.e. something like network.setCacheMode({cacheMode, contexts, bypassCORS})`, but that seems potentially unwieldy.

This also applies to all the possible in-memory caches, but even more so since the details may differ between browsers.

Or the other option is to just support the bare "bypass all caches" use case instead of specifying a specific cache mode i.e. make the command network.setCacheBypass({bypass, contexts}). That seems close to what CDP has, but it's rather inflexible if we want to offer more control (similar to existing DOM APIs) in the future.

css-meeting-bot commented 2 weeks ago

The Browser Testing and Tools Working Group just discussed Bypass network cache behaviour.

The full IRC log of that discussion <jgraham> Topic: Bypass network cache behaviour
<jgraham> github: https://github.com/w3c/webdriver-bidi/pull/721
<orkon> jgraham: this one is about request interception and how we force bypassing the network cache. I put a PR which was designed around the idea of setting the cache mode on requests. So when you have a request, it will set the cache mode to no-store, which means no reading or writing to the http cache. Unfortunately, it does not quite match what
<orkon> browsers do at the moment when you disable caches in devtools: 1) cors preflight cache which is always used 2) image cache and possible other caches So the question is how much flexibility do we care about here? Should we try work with low level primitvies? Or do we actually only care about bypassing all caches?
<orkon> jgraham: is that going to be the easier way to solve this?
<jgraham> q?
<orkon> q+
<jgraham> ack orkon
<jgraham> ScribeNick: jgraham
<jgraham> orkon: We also found that Firefox uses no-store by default but chrome uses reload, so Chrome still writes responses to the cache. We couldn't find use cases which require controlling different kinds of caches individually. Maybe there are some wpt use cases, but we haven't run into them yet. For testing scenarios you mostly care about bypassing everything. Also for interception to catch every request.
<jgraham> So it sounds like a single bypass cache setting would be sufficient. We could extend in the future with additional parameters if we needed. Question is which cache mode should the bypass use, or how important is that?\
<jgraham> q+
<orkon> ScribeNick: orkon
<jgraham> ack next
<orkon> jgraham: I think it is important that we specify a specific cache mode. bypassing the cache meaning no writing to cache makes more sense.
<orkon> jgraham: we empty the cors cache or whether we just bypass it. But then when you disconnect webdriver session, the cache will be still populated with the cache
<orkon> jgraham: I noticed that the spec has primitivees for clearing the cache but not by-passsing
<orkon> jgraham: and what happens with preflight requests if you toggle the bypass setting
<orkon> q+
<jgraham> q?
<jgraham> ack orkon
<orkon> ScribeNick: jgraham
<jdescottes> q+
<jgraham> orkon: I need to look at what we do for the CORS cache. We can change the cache mode; if we use reload we don't need to change but Firefox would. For memory caches we clear the cache. I need to double check on bypass vs clear. For HTTP we definitely don't clear, but I'm not sure about the other cache types
<jgraham> ack jdescottes
<jgraham> ScribeNick: orkon
<orkon> jdescottes: in Firefox we only bypass but do not actively clear anything, at least with DevTools
<orkon> jgraham: I think bypassing makes more sense than clearing. It sounds like at least for some of the caches it is hard to implement and memory caches can be cleared at random. So perhaps it is less imporant.
<orkon> q+
<jgraham> ack next
<jgraham> ScribeNick: jgraham
<jgraham> orkon: I think what happens to the cache after you've done your tests is less important. it's more important to be able to test a cold start of the application and measure the time taken, data transferred, etc.
<jgraham> ScribeNick: orkon
<orkon> q?
<orkon> jgraham: we should change the command to just set cache bypass to true or false and for now we should set the specific cache mode and it should probably bypass the cors cache and be a bit more handwavy about memory caches
<orkon> q+
<jgraham> ack next
<jgraham> ScribeNick: jgraham
<jgraham> orkon: I think there's an open HTML issue about the image cache, but implementations may also have more caches than that.
<orkon> ScribeNick: orkon
<orkon> jgraham: it might be that for image cache we could specify better but we could also mention implementation-specific caches
<jgraham> q?