cli99 / llm-analysis

Latency and Memory Analysis of Transformer Models for Training and Inference
Apache License 2.0
343 stars 40 forks source link

supports Llama 2 inference analysis #3

Closed cli99 closed 1 year ago

cli99 commented 1 year ago

This PR add the following changes to support Llama 2 inference analysis: