关于性能分析的一点疑惑 - Githubissues

openppl-public / ppl.llm.serving

Apache License 2.0

122 stars 13 forks source link

关于性能分析的一点疑惑 #60

Open Zhiy-Zhang opened 2 months ago

Zhiy-Zhang commented 2 months ago

What are the problems?(screenshots or detailed error messages)

想问下有性能分析的工具嘛？profiler相关，还是只能用nsight profile这种自己去看一些算子性能

What are the types of GPU/CPU you are using?

GPU：A100-80G-SXM4

What's the operating system ppl.llm.serving runs on?

Ubuntu 20.04.4 cuda：12.3 cudnn：8904 trt：9.2.0

What's the compiler and its version?

gcc 11.4 cmake version 3.27.9 Cuda compilation tools, release 12.3, V12.3.107

Which version(commit id or tag) of ppl.llm.serving is used?

commit id：51c3b3d5c5eba25c276a84388f04a2c9e198699f

Vincent-syr commented 2 months ago

serving整体的profiling信息根据宏“PPL_LLM_ENABLE_PROFILING”输出，默认是打开的，算子的profiling信息需要nsight去看，建议跑offline_inference。如果想要单step的kernel profling信息，可以参考https://github.com/openppl-public/ppl.nn/blob/master/tools/pplnn_llm.cc#L819 ，编译时“-DPPLNN_ENABLE_KERNEL_PROFILING=ON”