-
There is currently support for most of the common (and some less common) ML algorithms in Sharp Learning. However, there does appear to be a lack in the area of Reinforcement Leaning and some might ob…
-
It seems that all the algorithms require that you pass a transition probability table and reward vector, however most of the usefullness of algorithms such as QLearning relies on the fact that it does…
-
Can you show us the file "openai_ros/src/openai_ros/task_envs/turtlebot3/config/turtlebot3_world.yaml"? i have a problem with the network "ValueError: Input 0 of layer "sequential" is incompatible wit…
-
```
What steps will reproduce the problem?
1. Download beliefbox and compile subdirectories
2. ./bin/online_algorithms --n_runs 2 --environment RiverSwim --gamma 1
--n_steps 1000 --epsilon 0.0 --algo…
-
I wrote a very simple simulation to test the Reinforcement Learning Module. I only set up the current action as input, and the output is "left" or "right". Going right feeds the reward 1 back into the…
-
File "/home/user/turtlebot_ws/src/dqn_qlearning_sarsa_mobile_robot_navigation/my_turtlebot3_training/src/start_dqlearn_training.py", line 19, in
from openai_ros.openai_ros_common import StartOpe…
-
I was trying to run the AWR algorithm on the HalfCheetah environment as given in the README. So, first of all there is no `run.py` code in the folder of AWR. I copied `run_script.py` to the root of AW…
-
**What would you like to submit?** (put an 'x' inside the bracket that applies)
- [x] question
- [ ] bug report
- [ ] feature request
**Issue description**
Hy,
I am currently working o…
-
# WIP: English version using Mermaid
## policy
- [ ] policy-based learning 基于策略函数的学习方法
- [ ] value-based learning 基于值函数的学习方法
- [x] 动态规划学习方法 (Dynamic programming learning)
-…
-
Hi, this is a nice project for hybrid action space, and I see you mentioned PDQN/HPPO in `README.md`. Do you have some experiment results about these algorithms in this environment. If not, we want to…