reinforcement-learning-paper Search Results

OscarHuangWind/DRL-Transformer-SimtoReal-Navigation #18

Attention Flow Visualization

Hello, I’m interested in your paper, "Goal-Guided Transformer-Enabled Reinforcement Learning for Efficient Autonomous Navigation," and I’m currently working with your code. In the paper, you vis…

n-shintaro updated 1 week ago

supreme-gg-gg/rl-lab #12

Implement new architectures for deep RL

DDPG, A2C, etc other deep reinforcement learning models (value vs policy, actor critic, critic only actor only) Research paper will be attached below for references, 1-2 more will be a great place …

supreme-gg-gg updated 1 month ago

ray-project/ray #47875

[Docs] Ray + Slurm Container Orchestration

### Description As a Reinforcement Learning (RL) researcher I enjoy using Ray for various projects. However, Ray had limited community support for the Slurm architecture (which my university uses). …

destin-v updated 1 month ago

irthomasthomas/undecidability #951

LLM-Agents-Papers repo

- [ ] [LLM-Agents-Papers/README.md at main · AGI-Edgerunners/LLM-Agents-Papers](https://github.com/AGI-Edgerunners/LLM-Agents-Papers/blob/main/README.md?plain=1) # LLM-Agents-Papers ## :writing_hand…

ShellLM updated 5 days ago

yeshenpy/Awesome-Evolutionary-Reinforcement-Learning #4

Add a new paper: Evolutionary Reinforcement Learning: A Surv…

Add a new paper: Evolutionary Reinforcement Learning: A Survey 2023 https://spj.science.org/doi/10.34133/icomputing.0025 Bai, H., Cheng, R., & Jin, Y. (2023). Evolutionary reinforcement learning: A …

AntonioRodriguezUFAM updated 4 months ago

Shuijing725/CrowdNav_Prediction_AttnGraph #20

About loss

Hello, I would like to ask why the loss fluctuates so dramatically. Does this have any impact on the training? Is the model converging?

cancanzhu updated 6 hours ago

Future-Power-Networks/MAPDN #35

question about paper result

Dear author: Hello! I am a graduate student in a Chinese university. I am working on a project on multi-agent reinforcement learning. I hope to connect my algorithm to the environment you de…

cycaoyang updated 1 month ago

irthomasthomas/undecidability #940

open-thought/system-2-research

- [ ] [system-2-research/README.md at main · open-thought/system-2-research](https://github.com/open-thought/system-2-research/blob/main/README.md?plain=1) # OpenThought - System 2 Research Links He…

ShellLM updated 6 days ago

deepseek-ai/DeepSeek-Prover-V1.5 #2

Request for Release of Enhanced Theorem-Proving Dataset

Hi, The paper looks impressive! Is there a plan to release the training dataset? I noticed that you used an enhanced theorem-proving dataset with 9,645k sequences, derived from DeepSeek-Prover-V1. …

PrithwishJana updated 1 month ago

OscarHuangWind/DRL-Transformer-SimtoReal-Navigation #16

Mismatch Between the Determined Range of the check_pos Funct…

Hello, I have recently been reproducing the algorithm proposed in your paper "Goal-Guided Transformer-Enabled Reinforcement Learning for Efficient Autonomous Navigation" and have encountered some issu…

VoryKwin updated 1 month ago

1000+ results for reinforcement-learning-paper

1000+ results
for reinforcement-learning-paper