Closed Liang-Jiaying closed 1 year ago
Fine-tuning and collecting human feedback for reinforcement learning are not feasible on my computer (nor, I believe, on other people's laptops), so I am not able to complete this code demo.