neuralmagic / deepsparse

Sparsity-aware deep learning inference runtime for CPUs
https://neuralmagic.com/deepsparse/
Other
2.97k stars 171 forks source link

[Cherry-Pick][Fix] deepsparse – tokenizer stride size issue with original transformers #1426

Closed dbogunowicz closed 9 months ago

dbogunowicz commented 9 months ago

Cherry-pick for #1425