Open ZhangZhuoSJTU opened 3 years ago
Hi, thanks for the question.
InferCode is similar to existing pretrained models such as BERT and other Transformer-based models: once you have a model pretrained on a pretext task, you can fine-tune it for a downstream task. You can read our paper for more details. But if you expect to predict the method name directly from the vectors, I'm afraid the results might not be good.
@bdqnghi
Thank you. Is your pretrained model only for Java? If we want to use it for multiple programming languages, should we train a model from scratch? If so, could you point us to details on how to train a model from scratch on a new dataset?
Hi all,
Thanks for bringing this excellent work up.
I have played with InferCode for a while and am thinking about how to properly predict method names.
I figured out that InferCode can generate encoded vectors, so I am thinking we may need to train an additional model that predicts names from those vectors. May I know whether this is the right way to go? If so, would you mind sharing some ideas or insights about what the architecture of the new model should look like?
Any suggestion would be very much appreciated. Thanks!
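To make the idea concrete, here is a minimal sketch of the simplest version of "train another model on top of the vectors": a softmax classifier that maps a fixed code vector to a label. Everything in it is an assumption for illustration, not InferCode's API or the authors' method. The vectors are synthetic placeholders standing in for vectors exported from InferCode, the label set (`get`, `set`, `parse`) is a hypothetical first-subtoken vocabulary, and `train_classifier` and `predict` are made-up helper names. Predicting full method names would more likely need a sequence decoder, which this sketch does not attempt.

```python
import math
import random

def softmax(logits):
    # Numerically stable softmax over a list of floats.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def scores(x, weights, biases):
    # One linear score per class.
    return [sum(w_j * x_j for w_j, x_j in zip(w, x)) + b
            for w, b in zip(weights, biases)]

def train_classifier(vectors, labels, num_classes, lr=0.1, epochs=100):
    # Plain per-sample SGD on the cross-entropy loss.
    dim = len(vectors[0])
    weights = [[0.0] * dim for _ in range(num_classes)]
    biases = [0.0] * num_classes
    for _ in range(epochs):
        for x, y in zip(vectors, labels):
            probs = softmax(scores(x, weights, biases))
            for c in range(num_classes):
                err = probs[c] - (1.0 if c == y else 0.0)
                for j in range(dim):
                    weights[c][j] -= lr * err * x[j]
                biases[c] -= lr * err
    return weights, biases

def predict(x, weights, biases):
    # Index of the highest-scoring class.
    s = scores(x, weights, biases)
    return max(range(len(s)), key=s.__getitem__)

# Hypothetical label vocabulary: the first subtoken of a method name.
VOCAB = ["get", "set", "parse"]
DIM = 8

# Synthetic placeholder data. In practice each vector would come from the
# pretrained encoder and each label from the real method name.
random.seed(0)
centers = [[3.0 if j % 3 == c else 0.0 for j in range(DIM)]
           for c in range(len(VOCAB))]
labels = [random.randrange(len(VOCAB)) for _ in range(90)]
vectors = [[v + random.gauss(0.0, 0.5) for v in centers[y]] for y in labels]

# Train on the first 60 samples, evaluate on the remaining 30.
weights, biases = train_classifier(vectors[:60], labels[:60], len(VOCAB))
held_out = list(zip(vectors[60:], labels[60:]))
accuracy = sum(predict(x, weights, biases) == y for x, y in held_out) / len(held_out)
print("held-out accuracy:", accuracy)
```

A frozen-vector classifier like this is only a baseline. As noted earlier in the thread, fine-tuning the pretrained model on the downstream task is likely to work better than predicting directly from fixed vectors.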
Hi, thanks for your comment.
Can you tell me whether you fine-tuned InferCode successfully?