train-agents Search Results

1000+ results
for train-agents


Unity-Technologies/ml-agents #6144

Unable to run HuggingFace's Tutorial Huggy due to TypeError:…

**Describe the bug** When trying to follow the HuggingFace's Tutorial Huggy ([Google Colab Link](https://colab.research.google.com/github/huggingface/deep-rl-class/blob/master/notebooks/bonus-unit1/b…

kuds updated 2 weeks ago
5
DLR-RM/stable-baselines3 #1500

check_env warning for FrameStacked observation in stable_bas…

### 🐛 Bug When using the "check_env" function of "stable_baselines3.common.env_checker" with an environment wrapped in a "FrameStack" wrapper from "gymnasium.wrappers", I get an error on the type of …

corentinlger updated 1 year ago
4
fiatrete/OpenDAN-Personal-AI-OS #41

Objected knowledge base, a specialized implementation for em…

## Vectorized Knowledge Large language models are trained on general corpora and without fine-tuning on user-specific data, they struggle to utilize user-related context effectively. Users accumu…

photosssa updated 1 year ago
2
tudngn/IKTF-NN #1

Question on the project!

Hi tudngn! I've seen your project on GitHub and I'm very interested to understand what you have done, because I'm working in a shepherd environment with an imitation learning technique with obstacles an…

totototo96 updated 3 years ago
4
hiyouga/LLaMA-Factory #4768

StopIteration error while processing data when training a reward model

### Reminder - [X] I have read the README and searched the existing issues. ### System Info CUDA_VISIBLE_DEVICES=0 llamafactory-cli train \ --stage rm \ --do_train True \ --model_nam…

SMR-S updated 3 days ago
6
hill-a/stable-baselines #835

Using Saved Model as Enemy Policy in Custom Environment (whi…

I am currently training in an environment that has multiple agents. In this case there are multiple snakes all on the same 11x11 grid moving around and eating food. There is one "player" snake and thr…

lukepolson updated 4 years ago
16
facebookresearch/hydra #1859

SLURM sweep with hydra

I try to sweep a set of hyperparams using the slurm submitit plugin. I run: `python run.py --multirun --config-name atari-slurm seed=1,2,3,4,5` And my config file looks something like this: …

slerman12 updated 2 years ago
22
Tencent/PocketFlow #258

Channel Pruning should reCreatePruner in channel pruning

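After DDPG training finishes, i.e. after __prune_rl(), shouldn't self.create_pruner() be called again? Without it, the new pruning seems to be applied on top of the last compress from the RL phase, which is probably not correct. Calling create_pruner() again seems like the better option. Could someone confirm whether this is right? ![屏幕快照 2019-03-21 下午8 56 15](https://user-images.gith…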

Nankaiming updated 5 years ago
10
javyduck/ChatScene #15

Code not working: ModuleNotFoundError: No module named 'agen…

~/ChatScene-main$ PYTHONPATH='./' python scripts/run_train.py --agent_cfg=adv_scenic.yaml --scenario_cfg=train_scenario_scenic.yaml --mode train_scenario --scenario_id 1 setGPU: Setting GPU to: 0 py…

Youjin1985 updated 1 month ago
1
hongzimao/decima-sim #34

A question about the result

Hi, I noticed that the duration of a task is decided by the code in node.py, which used np.random.randint to generate the cost-time. But if I replace it with np_random which has specified seed, the r…

hzx-ctrl updated 3 years ago
5

