PKU-MARL / HARL

Official implementation of HARL algorithms based on PyTorch.
484 stars 59 forks source link

Query about Integration of Transformer Models #38

Closed antonio-guillenperez closed 5 months ago

antonio-guillenperez commented 6 months ago

Hi,

I am interested in your work on heterogeneous-agent reinforcement learning and the algorithms you have developed. I noticed that your current implementation focuses on algorithms like HAPPO, HATRPO, etc., based on PyTorch. Given the success of transformer models in various domains, are there any plans to incorporate transformer neural networks into your HARL algorithms? This could potentially enhance the capability of handling more complex heterogeneous agent interactions.

Thank you for considering my query.

guazimao commented 5 months ago

Hi. Your idea is very interesting. Combining transformer with HARL algorithms indeed has the potential to further improve performance. Your idea is similar to the paper "Multi-Agent Reinforcement Learning is a Sequence Modeling Problem", which uses the transformer, sequential update scheme, and PPO update objectives. You can take a look at this paper and its repo. If you have any further questions, feel free to discuss with me.