facebookresearch / CodeGen

Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pretrained models.
MIT License
710 stars 144 forks source link

Pretrain modell #80

Open Elvares opened 2 years ago

Elvares commented 2 years ago

Hi, I want to train model for translation pascal2java. I have small datasets about 2 Gb of pascal and 2 Gb of java. I train model mlm and then train transcoder model but translation doesn't work. When I try to translate pascal2java I got same function of pascal or bad translation of java. It can be problem with small dataset or overfittig?

Elvares commented 2 years ago

also when I train mlm model and then try to use it for transcoder model I got image Is it problem?