MLX Transformers is a library that provides model implementation in MLX. It uses a similar model interface as HuggingFace Transformers and provides a way to load and run models in Apple Silicon devices.
Apache License 2.0
47
stars
4
forks
source link
This PR fixes the Cache used in Generation models #16
Proposed changes
This PR fixes the Cache used in Generation models