Research platform for Human-in-the-loop learning (HILL) & Multi-Agent Reinforcement Learning (MARL)
76
stars
14
forks
source link
Backport improvements to cogment verse (incl. Trial Datastore and model registry interactions, PPO fixes) #125
Open
cloderic opened 1 year ago