issues
search
ThanatosShinji
/
onnx-tool
A parser, editor and profiler tool for ONNX models.
https://pypi.org/project/onnx-tool/
MIT License
383
stars
51
forks
source link
Support accurate LLM profiling and projection
#93
Closed
luoyu-intel
closed
5 days ago
luoyu-intel
commented
5 days ago
Add per-node latency projection.
Add multi devices latency projection.
Add model: llama2_7b
Add two devices: Intel Gaudi2H and NV H20