focustense / StardewUI

UI/widget library for Stardew modding
MIT License
5 stars 1 forks source link

Performance optimization #55

Open focustense opened 3 hours ago

focustense commented 3 hours ago

This is a placeholder issue for ongoing performance optimization work.

After a lot of work on #18 and #44, I'm starting to see a lot of diminishing returns - operations such as dictionary lookups and string case conversions that ought to be in the nanosecond range taking milliseconds, possibly due to JIT overhead or even tracing overhead.

On a roughly 100 ms load of the Carousel demo, the breakdown translates to about:

Other miscellaneous stuff that's too small to even mention.

Maybe, possibly, there are some more opportunities for precompilation savings, but optimism is waning. This is very obviously a game of inches, it's hard to know for certain why certain things are slow, and of course this is only on the very first frame - once things are cached, the jank is never seen again even when the view is recreated. At this point, every 5-10 ms saved seems to take another few hours of analysis and tweaking, and it's going to take progressively longer for progressively smaller gains.

So I'm documenting these findings for the future, so I can close the major perf bugs now. While SDV is very, shall we say "variable" about times, the average has gone down from between almost 300 ms to slightly over 100 ms, and it's safe to say that all the low-hanging fruit has been picked.

Attached a trace for reference.

StardewUI_20241118_223329.json

focustense commented 2 hours ago

One thing I failed to note above is that the Carousel example (intentionally) uses some bindings that generate warnings to the log, e.g. for non-updatable bindings that aren't explicitly one-time.

It's possible that at least some of the unexplained overhead is actually lock contention on the SMAPI log. Would be worth testing the example with those warnings suppressed or corrected to see if this makes a difference. It could be an easy ~10 ms of savings.

(The lock contention is a known issue with how SMAPI does its logging today)