openreasoner / openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
https://openreasoner.github.io/
MIT License
1.08k stars 79 forks source link

Does the training support standalone multi-card, distributed and larger models like qwen2.5 72b? #20

Open wphtrying opened 1 month ago

wphtrying commented 1 month ago

System Info

训练是否支持分布式以及更大模型比较qwen72b?

Who can help?

@morning9393

Information

Tasks

Reproduction

训练是否支持分布式以及更大模型比较qwen72b?

Expected behavior

期待训练成功

linyaoyang commented 1 month ago

同问,train_llm.sh训练1.5b的模型需要多少显存?能否用多卡训练?如何训练参数规模比较大的模型?

wphtrying commented 1 month ago

同问,train_llm.sh训练1.5b的模型需要多少显存?能否用多卡训练?如何训练参数规模比较大的模型?

看了下PPO的代码,应该是不支持单机多卡和分布式

git-yutong commented 4 weeks ago

哭了,空有两三台4090就是推理不进去。