-
As I've been taking lots of notes while reading papers related to Rainbow, I thought I'd set up the documentation website and flesh it out gradually. I'll link a pull request with a first version of t…
-
### What happened + What you expected to happen
The method `convert_to_torch_tensor` fails with the following TypeError:
```
TypeError: can't convert np.ndarray of type numpy.str_. The onl…
```
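This error typically comes from handing a string-dtype numpy array to `torch.from_numpy`, which only supports numeric and boolean dtypes. A minimal workaround sketch (assuming the strings are categorical labels; the function name and example values here are illustrative, not from the library) encodes them as integer codes before any tensor conversion:

```python
import numpy as np

def encode_string_obs(obs: np.ndarray) -> np.ndarray:
    """Map a string-dtype observation array to integer category codes.

    torch cannot convert numpy.str_ arrays, so each distinct string is
    replaced with a stable integer id before the batch reaches torch.
    """
    # np.unique returns the sorted unique labels and, with
    # return_inverse=True, the index of each element into that array.
    _, codes = np.unique(obs, return_inverse=True)
    return codes.astype(np.int64)

labels = np.array(["red", "green", "red", "blue"])
print(encode_string_obs(labels).tolist())  # → [2, 1, 2, 0]
```

The resulting `int64` array converts cleanly with `torch.from_numpy` (or can be one-hot encoded downstream).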
-
In many on-policy RL algorithms, we would like to pause the environment while we synchronously wait for the weights to be updated. Likewise, in other RL game environments, the game timer only steps every ti…
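One way to sketch this pause-while-updating behavior (purely illustrative, not tied to any particular RL library; the class and method names are hypothetical) is a thin environment wrapper whose `step` blocks on an event that the learner clears for the duration of a synchronous weight update:

```python
import threading

class PausableEnv:
    """Toy environment whose step() blocks while a weight update is in flight."""

    def __init__(self):
        self.t = 0
        self._running = threading.Event()
        self._running.set()  # start unpaused

    def pause(self):
        """Called by the learner just before a synchronous weight update."""
        self._running.clear()

    def resume(self):
        """Called by the learner once the new weights are in place."""
        self._running.set()

    def step(self):
        self._running.wait()  # block here while the update is running
        self.t += 1
        return self.t

env = PausableEnv()
env.pause()                       # learner begins a synchronous update
worker = threading.Thread(target=env.step)
worker.start()
worker.join(timeout=0.1)
assert env.t == 0                 # step() is blocked during the update
env.resume()                      # update finished, actors may continue
worker.join()
print(env.t)  # → 1
```

The same event-based gating could be pushed into an env wrapper's `step(action)` so that existing rollout code needs no changes.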
-
Hi, this is a nice project for hybrid action spaces, and I see you mention PDQN/HPPO in `README.md`. Do you have any experimental results for these algorithms in this environment? If not, we want to…
-
I'm interested in the condition-control part implemented with RL, but I'm confused about the RL training. Is the RL agent trained on a set of training data, or online on just a single image? From what I saw…
-
Hi everyone,
It has been cool to see the recent flurry of contributions to this package, especially by @jeremiahpslewis. In a [recent discussion](https://github.com/JuliaReinforcementLearning/Reinf…
-
**Thank you for your contribution!**
Of all the self-attention RL algorithms, which one has the best performance?
Thank you!
-
- [ ] I have marked all applicable categories:
+ [ ] exception-raising bug
+ [ ] RL algorithm bug
+ [ ] documentation request (i.e. "X is missing from the documentation.")
+ [ ] ne…
-
Dear author, I am implementing a multi-agent setting using Highway-v0. I am not able to achieve stable training, and the vehicles can run off the road without the episode terminating. I too…
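As a stopgap until the environment itself ends such episodes, a generic wrapper can terminate whenever an agent leaves the road. The sketch below uses a caller-supplied `on_road` predicate and a hypothetical penalty value, and follows the common five-tuple `step` convention; it is not highway-env's actual API, and the stub environment only exists to make the example runnable:

```python
class OffRoadTermination:
    """Wrap an env so that leaving the road terminates the episode."""

    def __init__(self, env, on_road, penalty=-1.0):
        self.env = env
        self.on_road = on_road      # predicate: obs -> bool (hypothetical)
        self.penalty = penalty      # extra reward on off-road termination

    def reset(self):
        return self.env.reset()

    def step(self, action):
        obs, reward, terminated, truncated, info = self.env.step(action)
        if not self.on_road(obs):
            terminated = True       # end the episode instead of letting it run on
            reward += self.penalty  # discourage driving off the road
        return obs, reward, terminated, truncated, info

# Minimal stub env: the "observation" is a lateral position shifted by `action`.
class StubEnv:
    def reset(self):
        self.y = 0.0
        return self.y

    def step(self, action):
        self.y += action
        return self.y, 1.0, False, False, {}

env = OffRoadTermination(StubEnv(), on_road=lambda y: abs(y) <= 2.0)
env.reset()
obs, reward, terminated, *_ = env.step(3.0)  # drives off the road
print(terminated, reward)  # → True 0.0
```

Terminating (rather than letting off-road agents keep collecting reward) also tends to make the return signal less noisy, which may help with the training instability.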
-
### What happened + What you expected to happen
I converted existing code that was working on 2.7 to 2.20 (the new API).
The error:
File "/opt/project/trading/training/model/rl/multi_agent/ppo/equity/trainer…