monum / 311-translation


Improvements in current translation model [GSoC'23] #15

Open dhruvxx opened 1 year ago

dhruvxx commented 1 year ago

The current model is a fine-tuned Seq2Seq model, whereas my model is a transformer-based machine translation model. One drawback of Seq2Seq modeling is that Recurrent Neural Networks (RNNs) are used to construct the encoder and the decoder, so the encoder must compress the entire source sentence into a fixed-length representation. As sentence length increases, performance therefore degrades. Transformers, on the other hand, use self-attention to relate all parts of the source sentence directly, which makes them much more efficient and allows the decoder to see the entire input sequence all at once.

For example, the translation of 'overgrown weeds' by:

The current model is 'muy de la ciudad', which Google translates back as 'very from the city'.

My model is 'Hierbas sobrecrecidas', which Google translates back as 'overgrown grasses'.

The results from my model are closer to the correct translation than those of the current model. With further fine-tuning, I believe the generated results will become more accurate still.
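For reference, here is a minimal sketch of running a transformer-based translation model with Hugging Face's transformers library. The Helsinki-NLP/opus-mt-en-es checkpoint is an assumption for illustration, not necessarily the model proposed above:

```python
# Minimal sketch of transformer-based English->Spanish translation.
# Assumes the Hugging Face transformers library; the checkpoint name
# below is illustrative, not necessarily the one used in this issue.
from transformers import MarianMTModel, MarianTokenizer

model_name = "Helsinki-NLP/opus-mt-en-es"  # assumed checkpoint
tokenizer = MarianTokenizer.from_pretrained(model_name)
model = MarianMTModel.from_pretrained(model_name)

def translate(text: str) -> str:
    # Tokenize the source sentence; the decoder attends to the
    # full encoded input sequence via attention during generation.
    inputs = tokenizer(text, return_tensors="pt", padding=True)
    output_ids = model.generate(**inputs)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)

print(translate("overgrown weeds"))
```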

akash-334 commented 4 months ago

Can we use Gemini's API key and get the translation done by it? I have created a generative AI model that translates sentences from Hindi to English. The same approach can be applied to a variety of languages at one go and can be implemented in our code.
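For what it's worth, a minimal sketch of prompt-based translation through the Gemini API using the google-generativeai Python package might look like the following; the model name and prompt wording are assumptions for illustration, not settled choices for this repo:

```python
# Minimal sketch of prompt-based translation via the Gemini API.
# Assumes the google-generativeai package and a GEMINI_API_KEY
# environment variable; model name and prompt are illustrative.
import os
import google.generativeai as genai

genai.configure(api_key=os.environ["GEMINI_API_KEY"])
model = genai.GenerativeModel("gemini-1.5-flash")  # assumed model name

def translate(text: str, target_language: str = "English") -> str:
    # Ask the model to translate and return only the translation.
    prompt = (
        f"Translate the following sentence into {target_language}. "
        f"Return only the translation:\n{text}"
    )
    response = model.generate_content(prompt)
    return response.text.strip()

print(translate("overgrown weeds", target_language="Spanish"))
```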