-
So, I have a custom AEC environment written in PettingZoo that has an action_mask (it's a board game with a large number of legal moves, so it's necessary to mask out illegal moves during training),…
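For readers unfamiliar with the pattern, here is a minimal sketch of sampling only legal actions from such a mask. The helper is hypothetical (not part of any specific library); it assumes the common PettingZoo convention of an `"action_mask"` entry in the observation dict, with 1 marking legal actions and 0 illegal ones.

```python
import numpy as np

def sample_legal_action(observation, rng=None):
    """Hypothetical helper: sample uniformly among legal actions.

    Assumes observation is a dict with an "action_mask" entry
    (1 = legal, 0 = illegal), as in many PettingZoo board games.
    """
    rng = rng or np.random.default_rng()
    mask = np.asarray(observation["action_mask"])
    legal = np.flatnonzero(mask)      # indices of the legal actions
    return int(rng.choice(legal))

obs = {"action_mask": np.array([0, 1, 0, 1])}
action = sample_legal_action(obs)
assert action in (1, 3)               # only legal moves are sampled
```

During training, the same mask is typically applied to the policy logits (e.g. setting illegal entries to a large negative value) so that illegal moves get zero probability.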
-
I'm not sure this needs to be addressed, or even should be, but I wanted to make sure people are aware of the issue:
In C++, a `compare_exchange` that fails is treated as a load for memory-model purposes. "If t…
-
Hi, I've got the test results after following your kind instructions, and thank you again. But the results are weird; here they are:
Corpus mode: Yelp
Pair mode: semantic
Epoch: 0 supervised loss…
-
### Required prerequisites
- [X] I have read the documentation.
- [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/safe-rlhf/issues) and [Discussions](https://github.com/PKU-…
-
Hi,
Thanks so much for maintaining this very easy-to-use library!
I wanted to report what I think is a bug in how transitions are overwritten in the (FIFO) buffer once it is full.
The `prev_transiti…
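For context, here is a minimal ring-buffer sketch (not the library's implementation) showing the overwrite behavior in question: once the buffer is full, the write index wraps around and clobbers the oldest transition, which is where bookkeeping about the previous transition can go stale.

```python
# Minimal FIFO ring-buffer sketch (illustrative only, not the library code).
class FIFOBuffer:
    def __init__(self, capacity):
        self.capacity = capacity
        self.data = [None] * capacity
        self.idx = 0      # next write position; wraps around when full
        self.size = 0

    def add(self, transition):
        self.data[self.idx] = transition          # overwrites the oldest entry
        self.idx = (self.idx + 1) % self.capacity
        self.size = min(self.size + 1, self.capacity)

buf = FIFOBuffer(3)
for t in range(5):        # insert 5 transitions into a capacity-3 buffer
    buf.add(t)
assert buf.data == [3, 4, 2]   # transitions 0 and 1 were overwritten
```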
-
## Describe the bug
I am trying to use the `DiscreteSACLoss` in a multi-agent environment, following this [tutorial](https://pytorch.org/rl/tutorials/multiagent_ppo.html). …
-
Hi all,
I was wondering whether the PPO-based MARL algorithms you use in the paper are taken from RLlib, or whether they are already available in the library without needing an RLlib interface.
I …
-
Your examples make it very easy to experiment with reinforcement learning simulations.
I'm running through them, but it's hard to tell whether the reinforcement learning is actually working.
How many times…
-
With the growing complexity of the RL/ML side of this project, the training code is starting to hit the limits of the current TFJS/Node capabilities, requiring some frameworks/algorithms to be re-implemented …
-
I'm trying to understand the following code fragment in the MPU6050_DMP6 example:
```
// check for overflow (this should never happen unless our code is too inefficient)
if ((mpuIntStatus…
```