reinforcement-learning-algorithms Search Results

1000+ results
for reinforcement-learning-algorithms

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

facebookresearch/BenchMARL #52

Suggestion of integrating HARL algorithms

Hello. Thank you for your amazing work. I appreciate the efforts to provide a unified library of MARL algorithms and environments for benchmarking and reproducibility. To better achieve this goal, I s…

Ivan-Zhong updated 6 months ago
1
pathak22/noreward-rl #31

a GAN idea

Thank you for the work. I recently start working on reinforcement learning of mathematical research (with the formal language and deduction system of a proof assistant as the environment); it's not st…

alreadydone updated 5 years ago
1
flow-project/flow #707

Add an implementation of "Learning Multi-agent Communication…

It would be interesting to port a few basic communication environments/training procedures to Flow. In particular, a popular communications baselines is "Learning Multi-agent Communication with Backpr…

eugenevinitsky updated 1 year ago
10
ReScience/ReScience #27

Reviewer application

If you want to become a reviewer for ReScience, please post your information here. The format is: ``` [name](github account link) Scientific expertise - Language expertise ORCID: [xxxx](http…

rougier updated 1 month ago
161
tkipf/keras-gcn #24

Differences on the Cora dataset

The lables here at keras-gcn does not seem to corresponds with the labels of the gcn repository when you load the data. It's the same indices, but not the same values. Also if you sum y_train here, …

hechtlinger updated 6 years ago
3
ftn-ai-lab/ri-2023-siit #2

Reinforcement learning for 3D Volleyball game

# Reinforcement learning for 3D Volleyball game ## Team members: - Tamara Ilić SV45/2020 group 3 - Uroš Poček SV57/2020 group 3 ## Assistant: - Branislav Andjelic ## Problem being solved: …

UPocek updated 1 year ago
3
hpcaitech/ColossalAI #3574

How to evaluate the effect of PPO training in coati chat

I did the third step of PPO training, it was time consuming and unstable. The reward observed during training is between -300 and -10 as follows. Is this situation normal? What does a good PPO trainin…

guijuzhejiang updated 11 months ago
2
Emmanuel-Naive/MATD3 #3

Some questions about model

Hello, as a newcomer to reinforcement learning, I have some questions I would like to ask. 1. Are the hyperparameters in the model suitable for all scenarios? 2. When I was training the model with y…

IIIgnac updated 1 year ago
1
dotnet/machinelearning-samples #962

Add Simple Feature Vector TensorFlow Example

Hello everyone, I think I've finally figured out how to send a simple float feature vector down to a TensorFlow model, do some predictions on it and retrieve it back. It doesn't seem to be be very …

Bonifatius94 updated 1 year ago
1
CoffeeKumazaki/arXiv #8636

Ensemble Kalman Filter (EnKF) for Reinforcement Learning (RL…

Ensemble Kalman Filter (EnKF) for Reinforcement Learning (RL). (arXiv:2107.01244v1 [eess.SY]) https://ift.tt/2SUDfTO This paper is concerned with the problem of representing and learning the optimal c…

CoffeeKumazaki updated 3 years ago
1

上一页 1...11 12 13 14 15 16 17...100 下一页

1000+ results for reinforcement-learning-algorithms

1000+ results
for reinforcement-learning-algorithms