
Glavin001 / PeakProgrammer

Mastering coding precision with fine-tuned reinforcement learning

MIT License


Initial foundation #1

Open Glavin001 opened 1 year ago

Glavin001 commented 1 year ago

[ ] Pluggable fine-grained reward functions
[ ] Reward
[ ] Penalty
[ ] Completion-wise feedback
[ ] Sentence/Sequence-wise feedback
[ ] Token-wise feedback
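
One possible shape for the items above, as a rough sketch rather than the project's actual design: reward and penalty functions are registered by name and each returns one score per token, and the sequence-wise and completion-wise feedback are derived by summing those token scores. Every name here (`REGISTRY`, `register`, `penalize_todo`, `reward_return`, and the three `*_feedback` helpers) is hypothetical, and the two example rules are toy placeholders.

```python
from typing import Callable, Dict, List

# Assumption: a reward function maps a token list to one score per token.
RewardFn = Callable[[List[str]], List[float]]

# Hypothetical plug-in registry: name -> reward/penalty function.
REGISTRY: Dict[str, RewardFn] = {}


def register(name: str) -> Callable[[RewardFn], RewardFn]:
    """Decorator that adds a reward function to the registry under `name`."""
    def decorator(fn: RewardFn) -> RewardFn:
        REGISTRY[name] = fn
        return fn
    return decorator


@register("penalize_todo")
def penalize_todo(tokens: List[str]) -> List[float]:
    # Toy penalty: negative score on every "TODO" token.
    return [-1.0 if tok == "TODO" else 0.0 for tok in tokens]


@register("reward_return")
def reward_return(tokens: List[str]) -> List[float]:
    # Toy reward: positive score on every "return" token.
    return [0.5 if tok == "return" else 0.0 for tok in tokens]


def token_feedback(tokens: List[str], names: List[str]) -> List[float]:
    """Token-wise feedback: sum the selected functions' scores per token."""
    total = [0.0] * len(tokens)
    for name in names:
        scores = REGISTRY[name](tokens)
        total = [a + b for a, b in zip(total, scores)]
    return total


def sequence_feedback(tokens: List[str], names: List[str], sep: str = "\n") -> List[float]:
    """Sequence-wise feedback: one summed score per `sep`-delimited segment."""
    segments: List[float] = []
    current = 0.0
    for tok, score in zip(tokens, token_feedback(tokens, names)):
        current += score
        if tok == sep:
            segments.append(current)
            current = 0.0
    segments.append(current)  # trailing segment after the last separator
    return segments


def completion_feedback(tokens: List[str], names: List[str]) -> float:
    """Completion-wise feedback: a single scalar for the whole completion."""
    return sum(token_feedback(tokens, names))


if __name__ == "__main__":
    toks = ["def", "f", "\n", "TODO", "\n", "return", "x"]
    active = ["penalize_todo", "reward_return"]
    print(token_feedback(toks, active))
    print(sequence_feedback(toks, active))
    print(completion_feedback(toks, active))
```

Keeping the token level as the base granularity means a new reward or penalty only has to be written once; whether the coarser levels should be sums, means, or independently defined functions is left open here.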

Resources