shuyanzhou / docprompting

Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023
Apache License 2.0
232 stars 17 forks source link

RoBERTa Off shelf #20

Open elpitero opened 6 months ago

elpitero commented 6 months ago

Could you provide what RoBERTa model was used during research? We recreated your results with all provided models witch success, but RoBERTa seems to pose some problems. The screnshot bellow shows results presented in your paper (first row) and our results (second row). We used roberta large model from https://github.com/facebookresearch/fairseq/blob/main/examples/roberta/README.md We have yet to understand why the scores differ so much. obraz

shuyanzhou commented 6 months ago

Apology for the confusion, in Table 4 for the paper, RoBERTa refers to princeton-nlp/sup-simcse-roberta-large from Gao et al. The model can be found in their repo