RuBP17 / AlphaDou

A Doudizhu reinforcement learning AI
GNU General Public License v3.0
4 stars 1 forks source link

关于训练问题 #1

Open rubbyzhang opened 1 week ago

rubbyzhang commented 1 week ago
RuBP17 commented 1 week ago

我们在arxiv上提交了一篇同名文章,如果你对我们方法的具体细节感兴趣,可以阅读该文章。

We have submitted an article with the same title on arXiv. If you are interested in the specific details of our method, you can read the article.

rubbyzhang commented 4 days ago

谢谢你的回答,请问https://github.com/RuBP17/AlphaDou/tree/main/baseline/SLModel 是论文中训练好的模型吗? 还是说需要我自己重新进行训练

RuBP17 commented 4 days ago

不是,SLModel内的模型是用于测试的叫牌模型,它采用监督学习训练而来,是Douzero Resnet项目的叫牌模型。论文中的出牌模型以及叫牌模型均是由强化学习训练而来的。在这里未给出权重文件。

No, the model within the SLModel is a bid model used for testing, which was trained through supervised learning and is the bid model from the Douzero Resnet project. The cardplay model and the bid model mentioned in the paper were both trained through reinforcement learning. The weight files are not provided here.

EdwardPooh commented 4 days ago

@rubbyzhang

rubbyzhang commented 2 days ago

谢谢大佬的详细解答