TD3 - Githubissues

cogment / cogment-verse

Research platform for Human-in-the-loop learning (HILL) & Multi-Agent Reinforcement Learning (MARL)

https://cogment.ai/cogment_verse

Apache License 2.0

76 stars 14 forks source link

Closed saikrishna-1996 closed 1 year ago

saikrishna-1996 commented 1 year ago

ToDo: sample random action for first 25000 time steps. We need a working sample_space function (for continuous actions) for that.

cloderic commented 1 year ago

Add a little readme on the algorithm + some training traces (basics of a model card)