Closed LorrinWWW closed 1 year ago
This PR requires the latest `transformers` (tested on 4.31.0) and flash-attention v2.
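A quick way to confirm an environment meets these requirements is to check the installed package versions before running anything. The sketch below is illustrative and not part of this PR: the helper names (`parse_release`, `meets_minimum`, `check_requirements`) are hypothetical, it assumes flash-attention is installed under its PyPI distribution name `flash-attn`, and it treats 4.31.0 (the tested version) and 2.0.0 as minimums, which is an assumption rather than something the PR states.

```python
import re
from importlib.metadata import PackageNotFoundError, version

# Assumed floors: 4.31.0 is the version the PR was tested on,
# and 2.0.0 stands in for "flash-attention v2".
ASSUMED_MINIMUMS = (("transformers", "4.31.0"), ("flash-attn", "2.0.0"))


def parse_release(v: str) -> tuple:
    """Return the leading numeric components, e.g. '4.31.0.dev0' -> (4, 31, 0)."""
    match = re.match(r"\d+(?:\.\d+)*", v)
    if match is None:
        raise ValueError(f"unrecognized version string: {v!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def meets_minimum(installed: str, minimum: str) -> bool:
    """Compare numeric release segments, padding the shorter one with zeros."""
    a, b = parse_release(installed), parse_release(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def check_requirements() -> list:
    """Return a list of human-readable problems; empty if everything looks fine."""
    problems = []
    for dist, minimum in ASSUMED_MINIMUMS:
        try:
            found = version(dist)
        except PackageNotFoundError:
            problems.append(f"{dist} is not installed (need >= {minimum})")
            continue
        if not meets_minimum(found, minimum):
            problems.append(f"{dist} {found} is older than {minimum}")
    return problems


if __name__ == "__main__":
    for line in check_requirements() or ["requirements look satisfied"]:
        print(line)
```

The check reads distribution metadata only, so it does not import `flash_attn` or need a GPU to run.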