We need more detailed monitoring of how the memory, file usage, etc. changes over time. At present the system is reporting data on a single endpoint but we are not aggregating it, only doing spot checks. We need to implement either a separate offline process or aggregation on server to track that.
I currently have code to track memory, files opened, GPU memory, and CPU memory. My plan for next week is to do basic testing and add any other stats requested. I plan on uploading my code to a branch soon.
We need more detailed monitoring of how the memory, file usage, etc. changes over time. At present the system is reporting data on a single endpoint but we are not aggregating it, only doing spot checks. We need to implement either a separate offline process or aggregation on server to track that.