Large browser windows with lots of widgets seem to be causing lasting blurriness

totaam commented 9 years ago

Issue migrated from trac ticket # 967

component: core | priority: critical | resolution: worksforme

2015-08-26 03:27:52: afarr created the issue

Working with a 0.15.5 r10336 windows client (our build) against a 0.15.4 10209 fedora 21 server, I'm seeing some blurriness, especially when scrolling, which doesn't seem to be resolving itself after the scrolling stops.

I've collected client side logs (with -d client,regionrefresh) and server side logs (with -d encoding,regionrefresh).

Yes, I also collected some xpra info.

And, jut for fun... I'll include a screenshot of the bit of blurriness (it was considerably worse at points, but I did catch some... but was mostly seeing only with very large windows, so I won't try to include inline).

totaam commented 9 years ago

2015-08-26 03:32:40: afarr uploaded file `ticket967_blurry-info2.txt` (117.0 KiB)

xpra info for 0.15.4/5 blurry

totaam commented 9 years ago

2015-08-26 03:42:25: afarr uploaded file `ticket967_blurry-xpra-client-d-client-refresh.txt` (2806.9 KiB)

client side logs -d client,regionrefresh

totaam commented 9 years ago

2015-08-26 03:48:47: afarr uploaded file `ticket967_blurry-xpra-server-d-encoding-regionrefresh.txt.zip` (951.4 KiB)

server logs -d encoding,regionrefresh (so long I had to zip... but if something gives time index, they might be useful)

totaam commented 9 years ago

2015-08-26 03:50:08: afarr uploaded file `ticket967_0-15-5-blurry-screen-shot.PNG` (841.4 KiB)

screenshot of medium level of blurriness I was seeing with google-chrome (you've seen webkit)

totaam commented 9 years ago

2015-08-28 04:43:40: antoine changed owner from antoine to afarr

totaam commented 9 years ago

2015-08-28 04:43:40: antoine commented

About the debug flags:

client will include far too much to be useful

refresh is usually what you want when things aren't refreshing properly

regionrefresh is only for the video region - unlike refresh, it is meant to be constantly postponed since the video region should update regularly

But in this case, the refresh is not the cause of the problem, your xpra info shows:
window[3].size=(1226, 1635)
window[3].video_subregion.refresh_region[0]=(0, 85, 1226, 1550)
So almost all of the window is detected as a video region. The scrolling made us select the whole window as video (because there is no way to tell that it isn't: it is updating fast and in exactly the same area each time, just like video). And the heuristics kept this region afterwards, probably because so many things are animated on this page that they keep the "hit counter" high.

This means that the detection heuristics get it wrong: #410. So the debug flag that you want is probably (server-side): regiondetect.

totaam commented 9 years ago

2015-09-03 03:00:17: afarr commented

Tested some more with our set up. It seems easier to reproduce with our builds & configuration... will try to carve out a few minutes to test with encryption on with your builds to see if that might be the difference.

In any case, got the screen test to go blurry and remain so (msn.com, money tab... perhaps that page should stay blurry?) - will attach server logs with -d regiondetect for a portion of time with the text stuck at blurry, despite scrolling and mousing around the links and such, as well as a new xpra-info specific to our set up. I'll also attach a full-size screenshot of the page with most of it blurry, as well as an edited-for-size to link in-line.

The large-ish image sort of top-left is a rotating ad, which refreshes every... ohh, 1-2 seconds (?) ... and some of the other widgets seem to involve some motion (including a couple more ads that I didn't bother to capture in the screenshot). I suspect they may be responsible for just enough updates to keep the region detecting as video.

totaam commented 9 years ago

2015-09-03 03:01:18: afarr uploaded file `ticket967_our-server_blurry-regiondetect.txt` (669.0 KiB)

-d regiondetect server logs, our server, 0.15.5(ish)

totaam commented 9 years ago

2015-09-03 03:03:49: afarr uploaded file `ticket967_our-server_blurry-info.txt` (117.3 KiB)

xpra info, server side (of course) - our server (0.15.5 r10308 +/-)

totaam commented 9 years ago

2015-09-03 03:04:46: afarr uploaded file `ticket967_full-size-portion-of-screen_blurry-page.png` (942.1 KiB)

full size shot of page while blurry - our server/client

totaam commented 9 years ago

2015-09-03 03:07:41: afarr uploaded file `ticket967_shot-of-page-while-blurry_rolling-banner-ad.png` (318.9 KiB)

edited for in-line shot of blurry, to show widget concentration

totaam commented 9 years ago

2015-09-03 03:08:50: afarr commented

[[Image(ticket967_shot-of-page-while-blurry_rolling-banner-ad.png)]]

totaam commented 9 years ago

2015-09-03 12:43:44: antoine commented

From your regiondetect debug log, we can see at regular intervals:
testing      current video region       rectangle[0, 79, 2098, 1306]: 100% in,   0% out,  93% of window, score=103
identify video: most=100% damage count={R(0, 79, 2098, 1306): MutableInteger(400)}
So it finds that 100% of screen updates happen in the region that previously identified as video, that's roughly 20 to 40 repaints per second! (the calculations run at most every second - less when there is not much happening on screen)

Not only that, but if you look at the actual paint events themselves (the format is simple: timestamp, X, Y, WIDTH, HEIGHT), ie:
(1441237975.382138, 0, 79, 2098, 1306), (1441237975.402772, 0, 79, 2098, 1306), (1441237975.428191, 0, 79, 2098, 1306)
All of the events that I can see actually repaint the whole of that area! (it's easy to see if you just search the log output for the string 0, 79, 2098, 1306, what is not highlighted is the rest - not much!) Usually you get smaller sub-areas, especially with players like flash that paint the screen in horizontal chunks, or youtube which repaints the video and the controls around it separately, but in this case it is all in one huge area!

You should be able to confirm that we are recording the correct values for paint events by logging with -d encoding then grepping the output for damage. But the code is unambiguous in this area: we record all non refresh events in the list you see in the regiondetect log.

So at this point I think I will close this bug as invalid. The region detect code gets it right, and we're doing remarkably well considering the heavy paint traffic.

It looks to me like the browser is needlessly repainting things that have not moved. It could also be that this particular page is triggering those events through bad javascript code. I found a good page which explains the browsers' rendering process: How Browsers Work: Behind the scenes of modern web browsers If the problem comes from the browser's rendering engine rather than the page, this needs to be fixed as it will consume huge amounts of CPU for absolutely nothing.

Edit: originally said 400 updates per second, which was incorrect. We keep the most recent 400 events, and the time difference from oldest to newest is roughly between 10 and 20 seconds.

totaam commented 9 years ago

2015-09-09 00:23:50: afarr changed status from new to closed

totaam commented 9 years ago

2015-09-09 00:23:50: afarr set resolution to invalid

totaam commented 9 years ago

2015-09-09 00:23:50: afarr commented

Looks like closing on your end is probably the right thing to do. We'll have to handle it on our end.

I'll take the liberty of closing.

totaam commented 9 years ago

2015-10-28 00:00:15: maxmylyn changed status from closed to reopened

totaam commented 9 years ago

2015-10-28 00:00:15: maxmylyn removed resolution (was invalid)

totaam commented 9 years ago

2015-10-28 00:00:15: maxmylyn commented

I have been volunteered to re-open this ticket. All jokes aside, I am seeing identical behavior in the latest Chromium (the open source variant - not the closed source Chrome):

Server is a Fedora 21 VM running trunk r11057 - built from source

Client is a Fedora 20 hardware machine running trunk r11057 - built from source

Server is launched with xpra start :13 --bind-tcp=0.0.0.0:2200 --start-new-commands=yes --start-child=xterm

Client is connected with xpra attach tcp:IP_TO_SERVER:2200

Once connected, chromium-browser --show-paint-rects is launched.

With Chromium, navigate to Ebay (easiest by far to reproduce behavior), and enter a search term (for reference, I just look for VW Super Beetles)

From there, you can do two things to see the blurry-ness stick around. You can click on a posting that will time out shortly (within 3 hours), or just sit there on the search results page(new!). With Chromiums paint debug enabled, you can see that the post titles refresh every second, and if on a posting is timed to match the clock ticking down.

The Heuristics here aren't catching (but trying if XPRA_OPENGL_PAINT_BOX=1 is set) these partial refreshes, and instead are repainting the whole window with h264...this causes the whole thing to become blurry. In some cases, it does come in clear; but that's about 30% of the time in my experience today.

I'll attach a screenshot of the behavior. If you would like logs, please let me know what flags you want and I'll attach them; as the repro is relatively simple.

As an aside, all this is very reminiscent of #410 and #596 from almost 2 years ago...speaking of which, my 2 year Anniversary here is coming up in a few short months.

totaam commented 9 years ago

2015-10-28 00:00:54: maxmylyn uploaded file `Xpra_967_Full_Blurry.png` (1526.4 KiB)

Sitting at an Ebay search query and seeing the blurry stick constantly. This behaviour appears to stick around indefinitely.

totaam commented 9 years ago

2015-11-03 00:50:49: afarr commented

Repro'd for logs, win client 0.16.0 r11118 against fedora 21 0.16.0 r11118.

Using steps listed above (comment:6), with a slightly different ebay search site... [http://www.ebay.com/sch/i.html?_from=R40&_trksid=p2047675.m570.l1313.TR12.TRC2.A0.H0.Xsuper+beetle.TRS0&_nkw=super+beetle&_sacat=0].

Scrolling up & down and mousing all over all the various widgets, even with the chromium paint boxes flashing regularly, wasn't sufficient to induce blurriness with a 1920x1080 window (give or take).

Re-sizing the window, however, seems to trigger the blurriness pretty reliably. (Shrink the window, then resize back to +/- 1920x1080).

I set the test up to be as narrow a window as possible, then blew it at the last minute... launched server with logs being captured, but no flags enabled... then connected client without logs in order to set up the blurriness.

I then disconnected client and re-connected to running session with logs enabled, -d client,regionrefresh (which will explain the disconnect/re-connect you'll see in server logs). I then used control channel to enable the server logs (and noticed that trying to pass two arguments failed... I'll make another ticket for that) - in my hurry I'm not sure if I enabled regionrefresh first, or encoding, but you'll see a few long seconds in the server logs with one enabled only, before I managed to enable the other.

I then resized the chromium window (smaller, larger), but then tried to get a screenshot... which means more logs than were probably strictly necessary. Oops.

In my hurry I also managed to blow the xpra info at the time, but I repro'd again without logs running and grabbed a new xpra info (window sizes might be a little different, but otherwise the info should be good).

Just wanted to give as much info as possible, so you'll be able to ignore as much superfluous logs as possible.

Also, ran with --desktop-scaling=off, I'll attach logs and new screenshot (the repro done my maxmylyn was on a particularly low end client machine, wanted to be sure that wasn't the root cause, rather than just the reason it was so easy to repro)... and then I'll try again with scaling of 1.5 and 2, just to see if there are different results (I imagine there will be).

totaam commented 9 years ago

2015-11-03 01:24:07: afarr uploaded file `ticket967-beetle-repro-screenshot.PNG` (988.0 KiB)

one more repro screenshot, XPRA_OPENGL_PAINT_BOX=1, most of screen encoded h264, but only link areas updating, according to chromium paint boxes