-
Hello everyone,
I am a contributor at ReinforcementLearning.jl and we are working on enhancing the support for Offline Reinforcement Learning for which one of the main goals is to make publicly av…
-
Is OpenChat trained by supervised learning or reinforcement learning?
-
Hi,bro,
Thank you so much for your contributions to offline safe reinforcement learning.
Firstly, I close the wandb logger. And when I run the code such as train_cdt by `python .\examples\train\…
-
Hi,
Thanks for the wonderful work! I have a question regarding the hyperparameters in the paper. Are the default hyperparameters stored in config.locomotion the same as those used in Figures 2, 5,…
-
Spring 2022:
-
Pose a question about one of the following articles:
“[Human-level control through deep reinforcement learning](https://www.nature.com/articles/nature14236)” 2015. V. Mnih...D. Hassabis. Nature 51…
-
Does Executorch support on-device training (online learning/model update on edge devices)? If yes, how to enable this? Thanks.
-
Hi there,
First of all, great work and thank you for opensourcing your code!
I have a question regarding reanalyze: you chose to reanalyze 99% of policy targets and 100% of value targets. I am j…
-
Take the liberty to ask, where is your open source dataset?
-
For training an effective agent, we probably need to explore in the order of 100K to 1M transitions and collect them in the replay memory. Collecting one state in our environment can be expensive as i…