cli99 / llm-analysis

Latency and Memory Analysis of Transformer Models for Training and Inference
Apache License 2.0
343 stars, 40 forks

Add gated linear unit support #10

Closed mvpatel2000 closed 11 months ago