neuralmagic/deepsparse
Sparsity-aware deep learning inference runtime for CPUs
https://neuralmagic.com/deepsparse/
[TextGeneration] Fix initialization; don't try v1 init for text gen
#1571, closed by dsikka 10 months ago
dsikka commented 10 months ago
Summary
To address this ticket:
https://app.asana.com/0/1205229323407165/1206459001098561
Also updates the condition for the KV cache capacity check so that the check is not repeated.