Open xpkoala opened 1 month ago
After removing some of the large saved queries the cpu usage did begin to drop.
Discussed during standup on Sept 5th. Since we currently have no users (as far as we are aware) with a similarly set up environment, we are reducing the priority of this as an ~unreleased bug.
This does warrant further investigation.
@xpkoala todo: scale up saved queries once more to achieve 100% cpu utilization and view results when:
remove all large queries and add 100-200 smaller queries (scheduled : 1hr; query reporting enabled) and view results when:
Recommendation from a customer when debugging a profile in their environment and seeing high encoding/json cpu time:
have you tried swapping encoding/json for goccy/go-json?
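One low-risk way to try that suggestion is to route encoding through a single swappable function, so the encoder can be changed in one place and profiled before and after. The sketch below uses only the standard library; `marshalFunc`, `marshalJSON`, `queryResult`, and `encodeResult` are hypothetical names for illustration and are not taken from the Fleet codebase. It assumes goccy/go-json is a drop-in replacement exposing a `Marshal` with the same signature as `encoding/json`, as its README describes; that should be confirmed against the pinned version before swapping.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// marshalFunc has the same signature as encoding/json's Marshal, so any
// compatible encoder can be assigned to it.
type marshalFunc func(v any) ([]byte, error)

// marshalJSON is the single place where the encoder is chosen.
// Assumption: to trial goccy/go-json, import it (for example as gojson)
// and assign gojson.Marshal here instead.
var marshalJSON marshalFunc = json.Marshal

// queryResult is a hypothetical payload standing in for a query report row set.
type queryResult struct {
	HostID uint                `json:"host_id"`
	Rows   []map[string]string `json:"rows"`
}

// encodeResult serializes a result through the swappable encoder.
func encodeResult(r queryResult) (string, error) {
	b, err := marshalJSON(r)
	if err != nil {
		return "", fmt.Errorf("encode query result: %w", err)
	}
	return string(b), nil
}

func main() {
	out, err := encodeResult(queryResult{
		HostID: 1,
		Rows:   []map[string]string{{"name": "osqueryd"}},
	})
	if err != nil {
		panic(err)
	}
	fmt.Println(out) // {"host_id":1,"rows":[{"name":"osqueryd"}]}
}
```

With the indirection in place, the same pprof capture can be repeated with each encoder to see whether the json encode time actually drops.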
Also a separate recommendation for general performance from the same customer:
ahhh, and prom hooked into http funcs. have you tried setting GOGC really high and setting GOMEMLIMIT? we had an issue with prom metrics and it constantly alloc-ing to the point it hurt performance, GOGC=2000 and GOMEMLIMIT (to the level I expected) basically fixed the problem
Should we consider these settings for environments and auto-allocation in our terraform modules/cloud deployments?
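For reference while evaluating this, the same two knobs can also be set from inside a Go process with `runtime/debug` (`SetMemoryLimit` requires Go 1.19 or later). This is a minimal sketch, not how Fleet configures itself: `applyGCTuning` is a hypothetical helper, and the 4 GiB limit is a placeholder that would need to be sized to the container or instance memory. In a Terraform module the simpler route is probably to pass `GOGC` and `GOMEMLIMIT` as environment variables to the task.

```go
package main

import (
	"fmt"
	"runtime/debug"
)

// applyGCTuning sets the GC target percentage and the soft memory limit,
// the programmatic equivalents of the GOGC and GOMEMLIMIT environment
// variables. It returns the previous values so they can be logged or restored.
func applyGCTuning(gcPercent int, memLimitBytes int64) (prevPercent int, prevLimit int64) {
	prevPercent = debug.SetGCPercent(gcPercent)
	prevLimit = debug.SetMemoryLimit(memLimitBytes)
	return prevPercent, prevLimit
}

func main() {
	// Placeholder limit: 4 GiB. Assumption only; size it to the memory
	// actually allocated to the process.
	const memLimit = 4 << 30

	// 2000 mirrors the GOGC=2000 value quoted in the recommendation.
	prevPercent, prevLimit := applyGCTuning(2000, memLimit)
	fmt.Printf("GC percent: %d -> 2000\n", prevPercent)
	fmt.Printf("memory limit: %d -> %d bytes\n", prevLimit, int64(memLimit))
}
```

A high GC percentage without a memory limit lets the heap grow much further between collections, so the two settings are best applied together.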
@rfairburn Any chance this is contributing to the load issues you're seeing today? If so, I'll prioritize so it gets looked at.
Fleet version:
minor-fleet-v4.56.0
Web browser and operating system: n/a
💥 Actual behavior
Seeing cpu usage maxed out on the loadtest environment. The web interface is still accessible, though a little sluggish. Saved queries of this size were not present on previous loadtest environments.
🧑‍💻 Steps to reproduce
🕯️ More info (optional)
Debug archive for the issue can be found here
pprof chart showing the long json encode status