zhongkaifu / Seq2SeqSharp

Seq2SeqSharp is a tensor based fast & flexible deep neural network framework written by .NET (C#). It has many highlighted features, such as automatic differentiation, different network types (Transformer, LSTM, BiLSTM and so on), multi-GPUs supported, cross-platforms (Windows, Linux, x86, x64, ARM), multimodal model for text and images and so on.
Other
193 stars 38 forks source link

Add Rotary Positional Embeddings and fix bugs in SiLU when inplace is enabled #71

Closed zhongkaifu closed 10 months ago

zhongkaifu commented 10 months ago
  1. Add Rotary Positional Embeddings
  2. Fix bugs in SiLU when inplace is enabled