-
Trying to debug larger width environments (7 currently).
Things to try:
1. Different metric (average Q-value from the 2013 DQN paper, https://arxiv.org/pdf/1312.5602.pdf).
```
5.1 Training and Sta…
-
# Abstract
This project implements the NeurIPS 2019 paper:
q-means: A quantum algorithm for unsupervised machine learning
https://papers.nips.cc/paper/8667-q-means-a-quantum-algorithm-for-unsupervi…
-
### **Description**
Add an interaction mechanism to allow agents to "play a game" with each other, such as sharing resources or attacking. This feature will enable agents to engage in complex inter…
-
**As an** agent
**I want to** be able to use Q-Learning as a strategy
**so that** I can play snake
## Acceptance Criteria
### AC1
Given I am starting to play a game of snake
When I do n…
-
## ❓ Questions on how to use PyTorch3D
NOTE: Please look at the existing list of Issues tagged with the label [`question`](https://github.com/facebookresearch/pytorch3d/issues?q=label%3Aquest…
-
Good thing I kept all my research work private; my deep Q-network code has already been stolen.
Feel free to contact me if you need help with the CloudSim scheduling and energy parts. I have worked on reinforcement learnin…
-
### Proposal
Currently, there are only 2 datasets for [discrete](https://gymnasium.farama.org/api/spaces/fundamental/#gymnasium.spaces.Discrete)-action envs:
- [Fourrooms](https://minari.farama.…
-
Hello,
I recently read your article "Microservice Deployment in Edge Computing based on Deep Q Learning" and I read your open source code. I would like to know how you integrated your code into a Kub…
-
Q-learning algorithm for your graph traversal problem:
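
A minimal sketch of tabular Q-learning on a graph, assuming each node is a state and each outgoing edge is an action. The example graph, the reward values (`10.0` for reaching the goal, `-1.0` per other step), the hyperparameters, and the helper names `train_q_table` and `greedy_path` are illustrative assumptions, not taken from the original problem.

```python
import random


def train_q_table(graph, goal, episodes=500, alpha=0.5, gamma=0.9,
                  epsilon=0.2, max_steps=50, seed=0):
    """Tabular Q-learning: a state is a node, an action is an outgoing edge."""
    rng = random.Random(seed)
    # One Q-value per (node, neighbor) pair, initialised to zero.
    q = {node: {nbr: 0.0 for nbr in nbrs} for node, nbrs in graph.items()}
    # Start episodes from any non-goal node that has at least one edge.
    starts = [node for node in graph if node != goal and graph[node]]
    for _ in range(episodes):
        node = rng.choice(starts)
        for _ in range(max_steps):
            if node == goal or not graph[node]:
                break
            neighbors = graph[node]
            # Epsilon-greedy action selection.
            if rng.random() < epsilon:
                nxt = rng.choice(neighbors)
            else:
                nxt = max(neighbors, key=lambda n: q[node][n])
            # Assumed reward scheme: bonus at the goal, cost per other step.
            reward = 10.0 if nxt == goal else -1.0
            future = 0.0 if nxt == goal else max(q[nxt].values(), default=0.0)
            # Q-learning update toward the bootstrapped target.
            q[node][nxt] += alpha * (reward + gamma * future - q[node][nxt])
            node = nxt
    return q


def greedy_path(q, start, goal, max_steps=50):
    """Follow the highest-valued edge from each node until the goal is reached."""
    path = [start]
    while path[-1] != goal and len(path) <= max_steps and q[path[-1]]:
        current = q[path[-1]]
        path.append(max(current, key=current.get))
    return path


# Hypothetical directed graph as an adjacency list; "F" is the goal.
EXAMPLE_GRAPH = {
    "A": ["B", "C"],
    "B": ["A", "D"],
    "C": ["A", "D", "E"],
    "D": ["B", "C", "F"],
    "E": ["C", "F"],
    "F": [],
}

q_table = train_q_table(EXAMPLE_GRAPH, goal="F")
path = greedy_path(q_table, start="A", goal="F")
print(path)
```

Because every non-goal step costs `-1.0`, shorter routes end up with higher Q-values, so the greedy path tends toward a shortest route. For weighted graphs, one option is to use the negative edge weight as the step reward.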
-
Research reinforcement learning and the currently used algorithms (Q-Learning, Temporal Difference Learning, SARSA, etc.), and be able to list the pros and cons of those algorithms.
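
As a starting point for that comparison, here is a small sketch of the one place where Q-Learning and SARSA differ: the bootstrapped target in the temporal-difference update. The function names, the toy Q-table, and the numbers are assumptions made for illustration only.

```python
def q_learning_target(q, reward, next_state, gamma):
    """Off-policy target: bootstrap from the best action in the next state."""
    return reward + gamma * max(q[next_state].values())


def sarsa_target(q, reward, next_state, next_action, gamma):
    """On-policy target: bootstrap from the action the agent actually takes next."""
    return reward + gamma * q[next_state][next_action]


def td_update(q, state, action, target, alpha):
    """Shared temporal-difference step: move Q(state, action) toward the target."""
    q[state][action] += alpha * (target - q[state][action])


# Toy Q-table (hypothetical values).
q = {
    "s": {"left": 0.0, "right": 0.0},
    "s2": {"left": 1.0, "right": 3.0},
}

# Same transition, but the agent's next action is the exploratory "left".
off_policy = q_learning_target(q, reward=1.0, next_state="s2", gamma=0.5)
on_policy = sarsa_target(q, reward=1.0, next_state="s2",
                         next_action="left", gamma=0.5)

td_update(q, "s", "right", off_policy, alpha=0.5)
```

Q-Learning learns the value of the greedy policy regardless of how the agent explores, while SARSA learns the value of the policy it is actually following, exploration included. That difference is a common basis for pros and cons such as how cautious the learned behaviour is near risky states.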