-
Thank you for your outstanding work on embodied multimodal navigation!
While trying to replicate OVON-related experiments, I realized that the `DATA_PATH` in the [script](https://github.com/Ram81/g…
-
`Agent`s are entities that, at least potentially, have a `sample_action` and an `update` method.
We exclude exploration strategies and curricula from the list.
_Implement_ means either to produce new code from the pape…
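
To make the `Agent` definition above concrete, here is a minimal Python sketch of that interface. Only the method names `sample_action` and `update` come from the text; the argument names, the `RandomAgent` class, and the transition tuple layout are assumptions made for illustration, not the project's actual API.

```python
import random
from typing import Any, Protocol, runtime_checkable


@runtime_checkable
class Agent(Protocol):
    """Anything exposing `sample_action` and `update` counts as an agent here."""

    # Argument names are assumed; the source only names the two methods.
    def sample_action(self, observation: Any) -> Any: ...

    def update(self, transition: Any) -> None: ...


class RandomAgent:
    """Hypothetical agent: picks uniformly from a fixed action set and learns nothing."""

    def __init__(self, actions, seed=0):
        self.actions = list(actions)
        self.rng = random.Random(seed)
        self.num_updates = 0

    def sample_action(self, observation):
        # The observation is ignored; a real policy would condition on it.
        return self.rng.choice(self.actions)

    def update(self, transition):
        # A learning agent would adjust its parameters here; this one only counts calls.
        self.num_updates += 1


agent = RandomAgent(actions=["left", "right"])
action = agent.sample_action(observation=None)
# Assumed transition layout: (observation, action, reward, next_observation).
agent.update(transition=(None, action, 0.0, None))
```

Because `Agent` is a structural `Protocol`, `RandomAgent` satisfies it without inheriting from it, which matches the loose "has these two methods" wording of the definition.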
-
Based on the available expert controller design examples (PI-based inner current/voltage control + droop control for power sharing), it will be very interesting to highlight the shortcomings and advan…
-
Hi, thanks for the amazing work!
We have successfully trained a state-based RL model on drawer and would like guidance on generating operation data based on point clouds and further training the visi…
-
Hi,
I'm very interested in your research and would like to run your code, but I've encountered a few issues.
1. While following the README, I attempted to perform Consistency-based Reinforcement L…
-
# Why
#### As a
user of `pyCMO`
#### I want
to be able to specify different reward models for my scenarios
#### So that
I can train RL agents
# Acceptance Criteria
#### Given
we currently only expo…
-
### Description
I am just giving this repo a try. As per the docs, I have trained the `problem=translate_ende_wmt32k` setup described in the tensor2tensor Basics walkthrough, and now I want to tune the model using…
-
Hello,
I am running the PPO algorithm from Ray RLlib. When I run the code, the output looks like this:
![Screenshot from 2024-07-18 04-49-46](https://github.com/user-attachments/assets/7e11e7a6-d5d8-4e…
-
Architecture Optimization - Search Strategy - Search Policy
# Reference
# Brief
Architecture Optimization | Description | Examples
-- | -- | --
Random & Grid Search |
EA - Evolutionar…
-
Hi everyone,
As a professional who has worked with a few RL frameworks in the past, I can confidently say that this is one of the cleanest, most user-friendly, and most advanced RL libraries I've encount…