offline-rl Search Results

1000+ results
for offline-rl

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

corl-team/CORL #2

Minari Integration with CORL

There we will track out progress for [Minari](https://github.com/Farama-Foundation/Minari) integration with CORL. Minari is a standard format for offline RL datasets, with popular reference datasets a…

Howuhh updated 7 months ago
3
Icecrown-Community/ICC-Member-Website-Public #135

can't reach the customers

As GC we can see, if the advertiser added the customer's discord tag, but unfortunatly if we are not in contact with the customer, we can't find him via the search system. We have to add him as friend…

Wanhedaa updated 2 years ago
2
Smart-Trafficlab/TransformerLight #3

sequence length and return to go

Hi! Thank you for releasing your code. After reading the code, it seems like you used sequence length=1 for the experiments, and the reward is used in place of return_to_go in the decision transformer…

sallyqiansun updated 8 months ago
1
PCCproject/PCC-Uspace #9

Problem about reward and loss in online training

Hi, Thank you for your dedicated work of PCC-Uspace. When I followed the instruction in Deep_Learning_Readme.md, I found that values of both Reward and Ewma Reward were so high as the snapshot…

Enjia updated 4 years ago
3
takuseno/d3rlpy #276

[REQUEST] Adding Cal-QL

**Is your feature request related to a problem? Please describe.** It has been observed in research that the CQL (conservative q-learning) may be too conservative and often results in a significantly…

zxp567 updated 1 year ago
2
hjsuh94/score_po #40

Thoughts on choosing experiments

What do we want out of our experiments? In the setting of offline RL, we want our algorithm to 1. Achieve reasonable success on the task 2. Show that adding distribution risk improves over vanilla …

hjsuh94 updated 1 year ago
5
ChufanSuki/read-paper-and-code #21

Foundation Policies with Hilbert Representations

https://arxiv.org/abs/2402.15567

ChufanSuki updated 4 months ago
3
ray-project/ray #38357

ray/RLlib/offline/estimators

### Description For offline rl it would be really useful to have the fqe direct and doubly robust methods be updated for continuous action spaces also, not just discrete action spaces. ### Use cas…

iaindocherty updated 1 year ago
1
microsoft/qlib #1011

[Proposal] Systematic RL support in qlib

2022/2/25 Package name: * qlib.neutrader? * Sound, brand * Sounds like limited to "trading" scenario * qlib.rl? * Shorter, easier to remember * Not exactly an RL f…

ultmaster updated 2 years ago
1
tvand7093/forgottenserver #44

Skills - Setting to save hidden bars

I only like to see a certain few bars in the skills tab open but every time I close the client it resets which ones I have hidden and which ones I can see. Also what is the "Offline Training" bar for…

paraknell updated 10 years ago
1

上一页 1...6 7 8 9 10 11 12...100 下一页

1000+ results for offline-rl

1000+ results
for offline-rl