-
**Describe the bug**
An error occurs when running multi-node LoRA fine-tuning:
```
failed (exitcode: -11) local_rank: 5 (pid: 11514) of binary: /home/jovyan/data-ws-enr/zconda/envs/swift_ft/bin/python
Traceback (most recent call last):
File…
```
-
Got an InvalidArgumentError after 26 minutes of training. I upgraded to the most recent TensorFlow as suggested and ran `pip install -U 'gym[all]' tqdm scipy`. I ran this on a Titan X and Ubuntu 16.1…
-
I want to train the robot in very few steps and very quickly in terms of wall time, but I haven't completed a training run on the robot yet. I should do that first as a sanity check, to make sure there is n…
-
```
AGENT NAME: A3C
1.1: A3C
TITLE CartPole
layer info [20, 10, [2, 1]]
layer info [20, 10, [2, 1]]
{'learning_rate': 0.005, 'linear_hidden_units': [20, 10], 'final_layer_activation': ['SOFTMAX', …
```
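If it helps to decode the dump: here is a minimal sketch of how a config like this could map onto a network, assuming a plain PyTorch MLP with hidden layers `[20, 10]` and a softmax head. `build_mlp` is illustrative, not the library's actual builder.

```python
import torch
import torch.nn as nn

def build_mlp(input_dim, hidden_units, output_dim):
    """Illustrative builder: ReLU hidden layers followed by a softmax
    head, mirroring the [20, 10] + SOFTMAX config in the dump above."""
    layers = []
    last = input_dim
    for units in hidden_units:
        layers += [nn.Linear(last, units), nn.ReLU()]
        last = units
    layers += [nn.Linear(last, output_dim), nn.Softmax(dim=-1)]
    return nn.Sequential(*layers)

# CartPole: 4 observation features in, 2 discrete actions out.
policy = build_mlp(input_dim=4, hidden_units=[20, 10], output_dim=2)
probs = policy(torch.randn(1, 4))  # action probabilities summing to 1
```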
-
I have been playing around with the DCBTrainer and found some potential inconsistencies.
1) **StatlogData** example found [here](https://genrl.readthedocs.io/en/latest/usage/tutorials/bandit/contex…
-
Hi,
Taking the "battlefield" for example, how can I control the red team only?
Then how can I change the number of the agents?
Thanks!
-
Hey Jiri,
I wonder if you could give some guidance on how to use keras-rl to create your own "gym" environment.
For example, I see that your board_gym.py is based on core.py, but what for…
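In case it helps, a minimal sketch of a custom Gym-style environment that keras-rl agents can consume, assuming the legacy `gym.Env` interface (reset returns an observation; step returns a 4-tuple) that keras-rl expects. `BoardEnv` and its board size are hypothetical, not taken from board_gym.py.

```python
import numpy as np
import gym
from gym import spaces

class BoardEnv(gym.Env):
    """Hypothetical 3x3 board environment exposing the standard
    gym.Env interface that keras-rl agents expect."""

    def __init__(self):
        self.action_space = spaces.Discrete(9)   # one move per cell
        self.observation_space = spaces.Box(
            low=-1, high=1, shape=(9,), dtype=np.float32)
        self.board = np.zeros(9, dtype=np.float32)

    def reset(self):
        self.board[:] = 0
        return self.board.copy()

    def step(self, action):
        done = False
        if self.board[action] == 0:        # legal move: mark the cell
            self.board[action] = 1.0
            done = bool(self.board.all())  # episode ends when board is full
            reward = 1.0 if done else 0.0
        else:                              # illegal move: small penalty
            reward = -1.0
        return self.board.copy(), reward, done, {}
```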
-
I currently have the problem that, a lot of times, the results Optuna optimization produces are not really optimal, due to the stochastic nature of RL training. For example, training 3 agents with…
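One common mitigation, sketched below, is to average the objective over several seeds per trial so a single lucky or unlucky run can't dominate. The `train_agent` routine here is a hypothetical stand-in for a real training run, not Optuna's API.

```python
import math
import random
import statistics
import optuna

def train_agent(lr: float, seed: int) -> float:
    """Stand-in for a real RL training run: returns a noisy score
    that peaks near lr = 1e-3 (purely illustrative)."""
    rng = random.Random(seed)
    return -abs(math.log10(lr) + 3.0) + rng.gauss(0.0, 0.5)

def objective(trial: optuna.Trial) -> float:
    lr = trial.suggest_float("learning_rate", 1e-5, 1e-2, log=True)
    # Average over several seeds to smooth out RL training noise.
    scores = [train_agent(lr, seed=s) for s in range(3)]
    return statistics.mean(scores)

study = optuna.create_study(direction="maximize")
study.optimize(objective, n_trials=50)
```

This trades more compute per trial for a less noisy objective, which usually matters more in RL than running extra trials.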
-
I wanted to mess around with imitation learning on a simple lane-following expert. Based on the README, I thought this would be easy to test out, but I had to edit several parts of the code, like to d…
-
## Hypothesis
The authors of the recently published AlphaZero research stated that this technique could easily be generalised to other problems without significant human effort, and that it approached better th…