Support for encoder-decoder models like T5 and Flan

huggingface / swift-transformers

Swift Package to implement a transformers-like API in Swift

Apache License 2.0

718 stars 80 forks source link

Support for encoder-decoder models like T5 and Flan #8

Open pcuenca opened 1 year ago

zhiyuanzhai commented 11 months ago

Any updates?

pcuenca commented 11 months ago

Hi @zhiyuanzhai!

So far we are focused on improving performance of autoregressive models. Lessons learned could hopefully be transferred to encoder-decoder models down the line, although there are challenges to make attention caching work effectively.