-
This code implements a tic-tac-toe game in which two agents play against each other. One of the agents follows a machine-learning approach called Q-learning to improve its moves over tim…
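A minimal sketch of the tabular Q-learning update such an agent might use (the state encoding and helper names below are illustrative assumptions, not the project's actual code):

```python
import random
from collections import defaultdict

# Q-table mapping (board_state, action) -> estimated value.
# board_state is assumed to be a tuple of 9 cells ('X', 'O', or ' ');
# action is the index (0-8) of the cell to play.
Q = defaultdict(float)

ALPHA = 0.1    # learning rate
GAMMA = 0.9    # discount factor
EPSILON = 0.1  # exploration rate

def choose_action(state, available_moves):
    """Epsilon-greedy selection over the empty cells."""
    if random.random() < EPSILON:
        return random.choice(available_moves)
    return max(available_moves, key=lambda a: Q[(state, a)])

def q_update(state, action, reward, next_state, next_moves):
    """Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    best_next = max((Q[(next_state, a)] for a in next_moves), default=0.0)
    Q[(state, action)] += ALPHA * (reward + GAMMA * best_next - Q[(state, action)])
```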
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Feature Description
The project aims to develop a reinforcement learning (RL) agent to optimize waste collecti…
-
### Expected behavior
When the script is executed, the SX operation should be applied to the qubit at index 1, as specified in the circuit definition. The QASM string generated from the circuit shoul…
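A minimal reproduction of the described setup might look like the following. This is a sketch assuming a recent Qiskit version where `qiskit.qasm2.dumps` is available (older versions exposed the equivalent via `QuantumCircuit.qasm()`); the exact circuit from the report is not shown here.

```python
from qiskit import QuantumCircuit, qasm2

# Build a two-qubit circuit and apply SX to the qubit at index 1,
# as described in the expected behavior above.
qc = QuantumCircuit(2)
qc.sx(1)

# The generated OpenQASM 2.0 string should contain an `sx q[1];` instruction.
print(qasm2.dumps(qc))
```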
-
https://geektutu.com/post/tensorflow2-gym-q-learning.html
TensorFlow 2.0 introductory series, part 7: using Q-Learning to play the OpenAI Gym game MountainCar-v0.
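The article's own code is not reproduced here; the following is a rough sketch of tabular Q-learning with a discretized observation space, which is one common way to apply Q-learning to MountainCar-v0 (bin counts, hyperparameters, and the pre-0.26 `gym` step API are assumptions).

```python
import numpy as np
import gym  # older gym API: reset() returns obs, step() returns a 4-tuple

env = gym.make("MountainCar-v0")

# Discretize the continuous (position, velocity) observation into bins
# so a tabular Q-learning agent can be used.
N_BINS = (18, 14)
low, high = env.observation_space.low, env.observation_space.high
bin_width = (high - low) / N_BINS

def discretize(obs):
    idx = ((obs - low) / bin_width).astype(int)
    return tuple(np.minimum(idx, np.array(N_BINS) - 1))

q_table = np.zeros(N_BINS + (env.action_space.n,))
alpha, gamma, epsilon = 0.1, 0.99, 0.1

for episode in range(2000):
    state = discretize(env.reset())
    done = False
    while not done:
        if np.random.random() < epsilon:
            action = env.action_space.sample()
        else:
            action = int(np.argmax(q_table[state]))
        obs, reward, done, _ = env.step(action)
        next_state = discretize(obs)
        # Q-learning update toward the greedy bootstrap target.
        target = reward + gamma * np.max(q_table[next_state])
        q_table[state + (action,)] += alpha * (target - q_table[state + (action,)])
        state = next_state
```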
-
I see that you are using a 0 vector for the rewards, and only updating the value that corresponds to the action here:
https://github.com/AxiomaticUncertainty/Deep-Q-Learning-for-Tic-Tac-Toe/blob/c5c0…
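The pattern being questioned appears to be constructing the training target as a zero vector and writing the bootstrapped value only at the taken action's index; a common alternative is to copy the network's own predictions so the untouched actions are not pulled toward zero. A rough sketch of both, with all names here being illustrative rather than the linked repository's code:

```python
import numpy as np

def target_zero_vector(q_pred, action, reward, q_next, gamma=0.9):
    """The pattern referred to above: start from a zero vector and set only the
    taken action's entry. Fitting this also drags the other actions' Q-values toward 0."""
    target = np.zeros_like(q_pred)
    target[action] = reward + gamma * np.max(q_next)
    return target

def target_copy_predictions(q_pred, action, reward, q_next, gamma=0.9):
    """A common alternative: copy the network's current predictions so only the
    taken action's entry produces a learning signal."""
    target = q_pred.copy()
    target[action] = reward + gamma * np.max(q_next)
    return target
```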
-
Hello,
I recently read your article "Microservice Deployment in Edge Computing based on Deep Q Learning" and reviewed your open-source code. I would like to know how you integrated your code into a Kub…
-
@enricoande
[1] https://github.com/enricoande/reinforcement_learning_examples/blob/95627db2a323535153e711a23f5519ecf7409f38/invertedpendulum/Sarsa/episodeFA.m#L35
It appears that here `phi` cor…
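For context, the linked file implements Sarsa with function approximation, where `phi` presumably denotes a feature vector. The standard Sarsa update with linear function approximation is, roughly (a general sketch in Python, not necessarily the exact form used in episodeFA.m):

```python
import numpy as np

def sarsa_fa_update(theta, phi, reward, phi_next, alpha=0.1, gamma=0.99):
    """One Sarsa update with linear function approximation.

    theta    : weight vector
    phi      : feature vector of the current state-action pair
    phi_next : feature vector of the next state-action pair
    """
    q_sa = theta @ phi
    q_next = theta @ phi_next
    td_error = reward + gamma * q_next - q_sa
    return theta + alpha * td_error * phi
```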
-
### Search before asking
- [X] I searched in the [issues](https://github.com/apache/pulsar/issues) and found nothing similar.
### Read release policy
- [X] I understand that unsupported versions d…
-
Hi Lucas,
I am implementing different algorithms on the different nets provided in the library, but I want to simulate the network with fixed timing and compare the reward function output for differe…