microsoft / PyCodeGPT

A pre-trained GPT model for Python code completion and generation
MIT License
266 stars 44 forks source link

Training dataset acquisition request #25

Open z7r7y7 opened 3 months ago

z7r7y7 commented 3 months ago

Thank you for providing this excellent project! While running the code, I encountered an issue during the training phase where the PyCodeGPT dataset is mentioned, but I couldn't find a link to download it. Could you kindly provide the download link for the dataset?

If the dataset is confidential or not publicly available, would it be possible to share a custom data format template, so I can use my own dataset for training?

Thank you so much for your support and looking forward to your reply!