cli99 / llm-analysis — Issues
Latency and Memory Analysis of Transformer Models for Training and Inference
Apache License 2.0 · 343 stars · 40 forks
#25 Fix memory estimation · cli99 · closed 4 months ago · 0 comments
#24 How to get the analysis of model Qwen1.5-0.5B · qxpBlog · opened 6 months ago · 0 comments
#23 Modify the bug in weight memory calculation · BhAem · closed 7 months ago · 0 comments
#21 latency [BUG] · Akash08naik · opened 9 months ago · 3 comments
#20 mistral and mixtral inference [BUG] · Akash08naik · opened 9 months ago · 4 comments
#19 question about the memory calculation · ShouyangDong · closed 7 months ago · 2 comments
#18 A question about layernorm activation memory. · LinHanyueEsar · closed 9 months ago · 3 comments
#17 add more options for activation recomputation · cli99 · closed 11 months ago · 1 comment
#16 [REQUEST] Support for paged attention? · cnjsdfcy · closed 11 months ago · 2 comments
#15 add sharded data parallel all gather time estimation · cli99 · closed 11 months ago · 1 comment
#14 fix some typo · digger-yu · closed 11 months ago · 0 comments
#13 [REQUEST] How to get other GPU config · Echozqn · closed 11 months ago · 3 comments
#12 [BUG] Is it possible that hbm_memory_efficiency is not working in the code? · Echozqn · closed 11 months ago · 4 comments
#11 fix memory calculation for optimizer states, gradients, and activation · cli99 · closed 11 months ago · 1 comment
#10 Add gated linear unit support · mvpatel2000 · closed 11 months ago · 0 comments
#9 Support moe in training analysis · cli99 · closed 12 months ago · 0 comments
#8 Fix weight computation for MLP · weimingzha0 · closed 1 year ago · 0 comments
#7 [BUG] MLP intermediate dimension not used · weimingzha0 · closed 1 year ago · 0 comments
#6 BUG Fix · 9tong · closed 1 year ago · 1 comment
#5 add llama2 · cli99 · closed 1 year ago · 1 comment
#4 fix inference activation memory size and kv cache size calculation · cli99 · closed 1 year ago · 0 comments
#3 supports Llama 2 inference analysis · cli99 · closed 1 year ago · 0 comments
#2 fix some spelling error · digger-yu · closed 1 year ago · 0 comments
#1 Update README.md · digger-yu · closed 1 year ago · 1 comment