neuralmagic / deepsparse

Sparsity-aware deep learning inference runtime for CPUs
https://neuralmagic.com/deepsparse/
Other
2.97k stars 171 forks source link

Move TokenGenerator logits copy after deterministic #1421

Closed mgoin closed 9 months ago