datamllab / rlcard

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
http://www.rlcard.org
MIT License
2.91k stars 630 forks source link

DQN with Rule agent #208

Closed 274927760 closed 3 years ago

274927760 commented 3 years ago

Hello, I used two random agents to train dqn(doudizhu) ,but the effect was not good, even far behind the rule agent. However,when I used two rule agents to train it,I found that during the training process (up to 50000 rounds), the reward remained around 0.2,How can I improve the training effect of dqn(doudizhu)?

daochenzha commented 3 years ago

@274927760 Yes, the provided DQN does not work well in DouDizhu. We are working on a much better algorithm. Please stay tuned.

274927760 commented 3 years ago

I'm looking forward to your work very much. By the way, can you provide a more appropriate algorithm or idea to build a doudizhu agent?

274927760 commented 3 years ago

Maybe there is something wrong with my expression. I only want to know if there are any good suggestions about trainning a doudizhu agent,because I want to finish my graduation thesis as soon as possible.I‘m looking forward to your suggestions.Thanks!

daochenzha commented 3 years ago

@274927760 There are two issues in the current DouDizhu. First, the abstraction is not good since currently the kicker is randomly selected. This can be improved. Second, some on-policy algorithms could work better and be more efficient.

274927760 commented 3 years ago

Thanks a bundle!I‘ll have a try!

daochenzha commented 3 years ago

A strong DouDizhu agent is at https://github.com/datamllab/rlcard/tree/master/rlcard/agents/dmc_agent

and

https://github.com/kwai/DouZero