reddy-lab-code-research / PPOCoder

Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"
https://openreview.net/forum?id=0XBuaxqEcG
MIT License
96 stars 11 forks source link