Closed 18140663659 closed 1 year ago
Hi @18140663659 deepsparse is adding support for a text generation pipeline that supports running SparseGPT
and other generative LLMs (#1064).
As for actually sparsifying models to replicate SparseGPT
we are soon releasing a major update to Sparsify that includes this @jeanniefinks can provide more info on its release!
Hello @18140663659 As @bfineran mentioned, we are working on the next generation of Sparsify to enable optimizations like SparseGPT to be applied to your own models or generic use cases through a web app and local one-command APIs.
This Sparsify Alpha is set to release next week. If you want be notified when it goes live, fill out this form: https://neuralmagic.com/request-early-access-to-sparsify/ Specifically to use the SparseGPT algorithm on your models, you'll want to check out the Sparsify Alpha's One-Shot Pathway once it's live. More to come!
hi @18140663659 The Sparsify Alpha mentioned in the last comment is now live. Because it is an alpha, we are inviting a small subset of users like yourself to try it out and let us know what you think. Check out https://github.com/neuralmagic/sparsify. I will close out this thread for now but feel free to re-open as needed!
Describe the bug A clear and concise description of what the bug is.
Expected behavior A clear and concise description of what you expected to happen.
Environment Include all relevant environment information:
f7245c8
]:To Reproduce Exact steps to reproduce the behavior:
Errors If applicable, add a full print-out of any errors or exceptions that are raised or include screenshots to help explain your problem.
Additional context Add any other context about the problem here. Also include any relevant files.