whittle-org / whittle

Python library to compress LitGPT models for resource-efficient inference.
https://whittle-org.github.io/whittle/latest/
Apache License 2.0
10 stars, 4 forks

feat: Flexible intermediate sizes #158

Closed (gabikadlecova closed this 2 weeks ago)

gabikadlecova commented 2 weeks ago

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Minimal Example / How should this PR be tested?

Any other comments?
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.