mindspore-lab / mindrlhf

Apache License 2.0
26 stars, 12 forks

Create llama_reward_model_tutorial.md #58

Closed by ChessQian 8 months ago