Closed danielkorat closed 9 months ago
Check out this pull request on
See visual diffs & provide feedback on Jupyter Notebooks.
Powered by ReviewNB
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
We can ignore the failing test - it also occurs on main
, and I'm unable to reproduce it for now.
Nice changes!
Some extra comments:
Hi @tomaarsen 👋
This PR adds a notebook that demonstrates how to accelerate SetFit models using
optimum-intel
and achieve3.3x
latency speedup (bs=1) and3x-4x
throughput increase without any accuracy drop. Specifically, it applies static 8-bit quantization using INC to the SetFit model body.