HumanSignal / RLHF

A collection of links, tutorials, and best practices on how to collect data and build an end-to-end RLHF system to fine-tune generative AI models
175 stars, 36 forks

Added images and details for PPO training #3

Closed by JimmyWhitaker 1 year ago

JimmyWhitaker commented 1 year ago
niklub commented 1 year ago

Could we replace the image with one taken after dragging the left sidebar further to the left, so that the 2 options align horizontally instead of vertically?

image
niklub commented 1 year ago

Is it still WIP, or can I remove it?

image