actor-critic Search Results

1000+ results
for actor-critic

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

agi-brain/xuance #54

Support for 2D Observation Spaces in PPO with Torch

I'm working on DRL framework using the PPO agent with Torch and experienced a difference in how observation spaces are handled. The example in the [documentation](https://xuance.readthedocs.io/en/late…

Abdullah2020 updated 3 months ago
8
rail-berkeley/serl #66

A positional error occurred while running “ bash run_learner…

Hello everyone, I am deploying “SERL” in the real world and I fully follow the instructions on the webpage for hardware and software deployment. I first want to reproduce task 1: peg insertion Here …

iu777 updated 5 months ago
10
OpenRLHF/OpenRLHF #263

[Baseline] LLaMA2-7B RLHF training curves

``` deepspeed ./train_ppo.py \ --pretrain OpenLLMAI/Llama-2-7b-sft-model-ocra-500k \ --reward_pretrain OpenLLMAI/Llama-2-7b-rm-anthropic_hh-lmsys-oasst-webgpt \ --save_path ./ckpt/7b_l…

hijkzzz updated 4 months ago
2
OpenRLHF/OpenRLHF #221

Citation or comparison to trlX and NeMo-align.

Hi I notice you cite "70B+ Full Tuning with 16 A100" however this is also something that trlX (and that we worked very hard to add ;) ) supports via NeMo support. Similarly, this is something that …

LouisCastricato updated 4 months ago
3
Lakshadeep/pre-grasp-approaching #10

Two error about 'NameError' --------train/grasp_decision.py

# When I modify and run this code, new problems appear. It seems to be because the code is incomplete. Did I miss something? run ' python train/grasp_decision.py' ## error1 ``` 03/06/2024 10:39:0…

DevilCJS89 updated 5 months ago
3
dariusk/NaNoGenMo #39

"The Swallows of Summer" by Cat's Eye Technologies

Fifty thousand words, huh? I do fear that the plot will begin to suffer partway through no matter _how_ cleverly I code, but I'll give it a whirl.

catseye updated 10 years ago
42
Alescontrela/AMP_for_hardware #1

ValueError: Expected parameter loc (Tensor of shape (32880, …

**Describe the bug** 'ValueError: Expected parameter loc (Tensor of shape (32880, 12)) of distribution Normal(loc: torch.Size([32880, 12]), scale: torch.Size([32880, 12])) to satisfy the constraint R…

COST-97 updated 4 months ago
11
DLR-RM/stable-baselines3 #1941

[Question] How to set learning rate and scheduler for custom…

### ❓ Question Hi, everyone I would like to set a learning rate and scheduler for the feature extractor that differs from those of the actor-critic networks. Is there a way to do this? ### Chec…

edwardjjj updated 5 months ago
1
xiaobaishu0097/ECCV-VN #1

Does the training process need depth data？

The depth data is not included in the data set you shared, but this error message appeared during the training process. Training started from: 2020-09-03 10:23:59 Scene Data Exists! initialized o…

sx-zhang updated 10 months ago
6
thu-ml/tianshou #1157

Unable to replicate original PPO performance

- [x] I have marked all applicable categories: + [x] exception-raising bug + [x] RL algorithm bug + [ ] documentation request (i.e. "X is missing from the documentation.") + [ ] ne…

hexonfox updated 3 months ago
21

上一页 1...83 84 85 86 87 88 89...100 下一页

1000+ results for actor-critic

1000+ results
for actor-critic