starry-sky6688 / MARL-Algorithms

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
1.46k stars 283 forks source link

custom data traing #105

Closed shandongchong closed 8 months ago

shandongchong commented 8 months ago

您好,使用自己业务端产生的历史数据,基于IQL策略进行训练,能实现吗,需要改动的大吗?

starry-sky6688 commented 8 months ago

这个代码库需要有仿真环境交互才能训练,只有历史数据不行;只有数据的话可以考虑使用offline RL的算法