-
Hello,
I’m interested in your paper, "Goal-Guided Transformer-Enabled Reinforcement Learning for Efficient Autonomous Navigation," and I’m currently working with your code.
In the paper, you vis…
-
DDPG, A2C, etc other deep reinforcement learning models (value vs policy, actor critic, critic only actor only)
Research paper will be attached below for references, 1-2 more will be a great place …
-
### Description
As a Reinforcement Learning (RL) researcher I enjoy using Ray for various projects. However, Ray had limited community support for the Slurm architecture (which my university uses). …
-
- [ ] [LLM-Agents-Papers/README.md at main · AGI-Edgerunners/LLM-Agents-Papers](https://github.com/AGI-Edgerunners/LLM-Agents-Papers/blob/main/README.md?plain=1)
# LLM-Agents-Papers
## :writing_hand…
-
Add a new paper: Evolutionary Reinforcement Learning: A Survey 2023
https://spj.science.org/doi/10.34133/icomputing.0025
Bai, H., Cheng, R., & Jin, Y. (2023). Evolutionary reinforcement learning: A …
-
Hello, I would like to ask why the loss fluctuates so dramatically. Does this have any impact on the training? Is the model converging?
-
Dear author:
Hello! I am a graduate student in a Chinese university. I am working on a project on multi-agent reinforcement learning. I hope to connect my algorithm to the environment you de…
-
- [ ] [system-2-research/README.md at main · open-thought/system-2-research](https://github.com/open-thought/system-2-research/blob/main/README.md?plain=1)
# OpenThought - System 2 Research Links
He…
-
Hi,
The paper looks impressive! Is there a plan to release the training dataset? I noticed that you used an enhanced theorem-proving dataset with 9,645k sequences, derived from DeepSeek-Prover-V1. …
-
Hello, I have recently been reproducing the algorithm proposed in your paper "Goal-Guided Transformer-Enabled Reinforcement Learning for Efficient Autonomous Navigation" and have encountered some issu…