neuralmagic / nm-vllm

A high-throughput and memory-efficient inference and serving engine for LLMs
https://nm-vllm.readthedocs.io
Other
251 stars 10 forks source link

NM Profiler : Update visualize_trace.py #370

Closed varun-sundar-rabindranath closed 3 months ago

varun-sundar-rabindranath commented 4 months ago

Update visualize trace utility.

Usage:

LucasWilkinson commented 4 months ago

They Y-label in the --plot-metric pct_cuda_time graph appears to be wrong above, but in the code it seems to be getting set correctly? I assume this has been fixed?

varun-sundar-rabindranath commented 4 months ago

They Y-label in the --plot-metric pct_cuda_time graph appears to be wrong above, but in the code it seems to be getting set correctly? I assume this has been fixed?

Hey Lucas. Yes, I noticed that and fixed it. Sorry, should have mentioned it somewhere.

varun-sundar-rabindranath commented 3 months ago

Migrated all changes including all of the layer-by-layer profiling code to https://github.com/neuralmagic/vllm/pull/3