Closed JiexingQi closed 1 year ago
Thanks @JiexingQi, this is indeed a mistake I made when I uploaded conala_docs.json. The file has been updated in Google Drive. Great catch!
You can also check out the dataset on Hugging Face: https://huggingface.co/datasets/neulab/docprompting-conala/tree/main
The unique ID for each document is indicated by the man_id entry in the fid.cmd_*.codet5.t10.json files.
I removed unnecessary path parts such as .reference.api during preprocessing.
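For anyone who wants to double-check the updated files, here is a minimal sketch of a consistency check keyed on man_id. The data layouts are assumptions, not something confirmed in this thread: the docs file is assumed to be a dict mapping each ID to its text, and each example in the fid file is assumed to carry a "ctxs" list of entries with "man_id" and "text" keys. The function name find_mismatched_docs is hypothetical.

```python
def find_mismatched_docs(docs, examples):
    """Return the sorted man_ids whose retrieved text differs from the docs pool.

    Assumed (hypothetical) layouts:
      docs:     dict mapping man_id -> full document text
      examples: list of dicts, each with a "ctxs" list of
                {"man_id": ..., "text": ...} entries
    """
    mismatched = set()
    for example in examples:
        for ctx in example.get("ctxs", []):
            man_id = ctx["man_id"]
            # A missing ID yields None, which also counts as a mismatch.
            if docs.get(man_id) != ctx.get("text"):
                mismatched.add(man_id)
    return sorted(mismatched)


# Tiny in-memory demo with made-up IDs and texts.
demo_docs = {"doc.a": "signature\nbody", "doc.b": "body only"}
demo_examples = [
    {"ctxs": [
        {"man_id": "doc.a", "text": "signature\nbody"},
        {"man_id": "doc.b", "text": "signature\nbody only"},
    ]}
]
print(find_mismatched_docs(demo_docs, demo_examples))
```

To run it on the real files, load each one with json.load and adjust the key names if the actual layout differs from the one assumed here.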
Thanks a lot.
Hi @shuyanzhou, I found that the doc content in conala_docs.json is not consistent with what you provided in the data/conala/fid.cmd_dev.codet5.t10.json file: conala_docs.json is missing the function signature and usage.
As the figure above shows, the function
in conala_docs.json is missing its first two lines of content, and the content at the tail also seems to be missing.
Could you provide the complete doc files? Thanks a lot.