-
Here we will track our progress on [Minari](https://github.com/Farama-Foundation/Minari) integration with CORL. Minari is a standard format for offline RL datasets, with popular reference datasets a…
-
As GC, we can see whether the advertiser added the customer's Discord tag, but unfortunately, if we are not in contact with the customer, we can't find him via the search system. We have to add him as friend…
-
Hi! Thank you for releasing your code. After reading the code, it seems like you used sequence length=1 for the experiments, and the reward is used in place of return_to_go in the decision transformer…
-
Hi,
Thank you for your dedicated work on PCC-Uspace.
When I followed the instructions in Deep_Learning_Readme.md, I found that the values of both Reward and Ewma Reward were so high as the snapshot…
Enjia updated
4 years ago
-
**Is your feature request related to a problem? Please describe.**
Research has observed that CQL (Conservative Q-Learning) may be too conservative and often results in a significantly…
-
What do we want out of our experiments? In the setting of offline RL, we want our algorithm to
1. Achieve reasonable success on the task
2. Show that adding distribution risk improves over vanilla …
-
https://arxiv.org/abs/2402.15567
-
### Description
For offline RL, it would be really useful to have the FQE direct and doubly robust methods updated to support continuous action spaces as well, not just discrete action spaces.
### Use cas…
-
2022/2/25
Package name:
* qlib.neutrader?
* Sound, brand
* Sounds like limited to "trading" scenario
* qlib.rl?
* Shorter, easier to remember
* Not exactly an RL f…
-
I only like to keep a few bars in the skills tab open, but every time I close the client, it resets which ones I have hidden and which ones I can see.
Also what is the "Offline Training" bar for…