EdinburghNLP / nematus

Open-Source Neural Machine Translation in Tensorflow
BSD 3-Clause "New" or "Revised" License
797 stars 269 forks source link

code generation #70

Closed bhagat02 closed 6 years ago

bhagat02 commented 6 years ago

In the code generation as you have mentioned to use Namatus for the nmt in that the input is a declaration,docstring and output is body . then how the code will be generated ?

in translation like en to german we know that hi means hello .. but over here how the translation actually working and how the tokenization has been done?

i am new to generation of code .. so need some advice to implement this model and i have also read the nematus how it is working but nothing is described about the translaion of declation,docstring+body i will be very thankfull to you if you can guide me how to use nematus for code generation ..

bhagat02 commented 6 years ago

and you have used tokenizer. perl .. moses decoder for the tokenization... why not nltk tokenizer.. how it is different ...

Avmb commented 6 years ago

Hi, I have indeed used the Moses tokenizer for the code generation baseline experiments. I haven't experimented with other tokenizers, feel free to try them.

bhagat02 commented 6 years ago

Hi,

Thank you so much for your reply.

Regards, Gaurav Bhagat

On Mon, Apr 23, 2018 at 4:32 PM, Antonio Valerio Miceli Barone < notifications@github.com> wrote:

Hi, I have indeed used the Moses tokenizer for the code generation baseline experiments. I haven't experimented with other tokenizers, feel free to try them.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/EdinburghNLP/nematus/issues/70#issuecomment-383596615, or mute the thread https://github.com/notifications/unsubscribe-auth/AdhRyxGabrpBVNjNiFeIOti5Zg7xhIDgks5treXtgaJpZM4S5BcN .