microsoft / CodeBERT


N-gram statistic model #234

Open JianyuZhao7 opened 1 year ago

JianyuZhao7 commented 1 year ago

Hi, I would like to ask how the bi-directional 3-gram model for the generator is implemented. Could you please share the code or any other material?

Thanks a lot.

Best.

fengzhangyin commented 1 year ago

We simply use a statistics-based language model: we first compute the tri-gram distribution on the training data and then use it to predict the masked words.
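
For readers looking for a starting point, here is a minimal sketch of what such a count-based generator might look like. It is not the authors' released code. The class name `BiTrigramGenerator`, the padding symbols, the toy corpus, and the choice to add the left-context and right-context counts together are all assumptions made for illustration; the thread does not say how the two directions are combined. The sketch also assumes the neighbours of the masked position are not themselves masked.

```python
from collections import Counter, defaultdict

MASK = "<mask>"          # hypothetical mask symbol
BOS, EOS = "<s>", "</s>"  # hypothetical padding symbols


class BiTrigramGenerator:
    """Count-based tri-gram model that looks at both sides of a masked token."""

    def __init__(self):
        # (w[i-2], w[i-1]) -> counts of w[i]
        self.left = defaultdict(Counter)
        # (w[i+1], w[i+2]) -> counts of w[i]
        self.right = defaultdict(Counter)
        # Fallback when neither context was seen in training
        self.unigram = Counter()

    @staticmethod
    def _pad(tokens):
        return [BOS, BOS] + list(tokens) + [EOS, EOS]

    def fit(self, sequences):
        """Collect tri-gram counts from an iterable of token lists."""
        for tokens in sequences:
            padded = self._pad(tokens)
            for i in range(2, len(padded) - 2):
                word = padded[i]
                self.unigram[word] += 1
                self.left[(padded[i - 2], padded[i - 1])][word] += 1
                self.right[(padded[i + 1], padded[i + 2])][word] += 1

    def predict(self, tokens, position):
        """Return the most likely token for tokens[position], or None."""
        padded = self._pad(tokens)
        i = position + 2
        scores = Counter()
        # Assumption: combine both directions by summing raw counts.
        scores.update(self.left.get((padded[i - 2], padded[i - 1]), {}))
        scores.update(self.right.get((padded[i + 1], padded[i + 2]), {}))
        if not scores:
            scores = self.unigram  # back off to overall token frequency
        if not scores:
            return None  # model has not been fitted
        return scores.most_common(1)[0][0]


# Toy training data (made up for this example)
corpus = [
    ["def", "add", "(", "a", ",", "b", ")"],
    ["def", "sub", "(", "a", ",", "b", ")"],
    ["return", "a", "+", "b"],
]

model = BiTrigramGenerator()
model.fit(corpus)

masked = ["def", "mul", "(", "a", MASK, "b", ")"]
prediction = model.predict(masked, masked.index(MASK))
print(prediction)
```

Summing raw counts is the simplest way to merge the two directions. Multiplying normalized probabilities, or sampling from the combined distribution instead of taking the most frequent token, would be equally plausible variants.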