zhongkaifu / Seq2SeqSharp

Seq2SeqSharp is a tensor based fast & flexible deep neural network framework written by .NET (C#). It has many highlighted features, such as automatic differentiation, different network types (Transformer, LSTM, BiLSTM and so on), multi-GPUs supported, cross-platforms (Windows, Linux, x86, x64, ARM), multimodal model for text and images and so on.
Other
193 stars 38 forks source link

Could Seq2SeqSharp contribute to Tensorflow.NET NLU features in win-win way? #19

Open GeorgeS2019 opened 3 years ago

GeorgeS2019 commented 3 years ago

Suggestions

LayerNormalization, MultiHeadAttention and Attention layers in C# (for example) have been implemented in Seq2SeqSharp

The questions:

zhongkaifu commented 3 years ago

Thanks @GeorgeS2019. That's a great idea. However, Tensorflow.NET uses Google's Tensorflow as backend, but Seq2SeqSharp has its owned backend, such as computing graph, CUDA kernel, memory management and others. So, it could be problem to mix them up. One approach maybe work is to migrate Seq2SeqSharp's attention layers and normalization layers (forward pass only) to Tensorflow.NET, but it requires the operations between Seq2SeqSharp and Tensorflow.NET are aligned.