neuralmagic / deepsparse

Sparsity-aware deep learning inference runtime for CPUs
https://neuralmagic.com/deepsparse/

[Text Generation Pipeline] Fix Non KV Cache Pipeline not calling parse_inputs #1552

Closed: dbogunowicz closed this 5 months ago