deterministic-policy-gradients Search Results

138 results for deterministic-policy-gradients

Eclectic-Sheep/sheeprl #187

dreamerv3 trouble resuming? freeze?

Hello, sorry I don't actually know much about the mathematical formulas and whatnot behind rl and the algorithms, I've previously been just training stuff with SB3 PPO. Anyway, I installed sheeprl an…

Disastorm updated 9 months ago
26
ultralytics/ultralytics #1682

Add/Modify YOLOv8 backbone structure

I am trying to modify the YOLOv8 backbone by adding the attention module inside. However, I keep getting key error. I modified the .yaml file, modules.py and tasks.py but still not working. Anyone can…

chloewxrn updated 2 days ago
97
keras-team/keras-io #1457

Proposal to Add TD3 for Reinforcement Learning to Keras Exam…

Hi, I am interested in adding the implementation of [Twin Delayed Deep Deterministic Policy Gradients (TD3)](https://arxiv.org/abs/1802.09477) to the Keras examples repository. TD3 addresses th…

hamidriasat updated 1 year ago
2
fly51fly/aicoco #3

Teacher Aikeke's 24-hour trending shares

Selected Weibo posts

fly51fly updated 1 week ago
1906
DLR-RM/stable-baselines3 #1650

[Question] Miscellaneous questions

### ❓ Question Hi, I have several questions: # Entropy scheduler In SB3 it is possible to define the weight for the entropy loss function that it is used for example in A2C. I would like to de…

suargi updated 1 year ago
4
ultralytics/ultralytics #6223

Some problems about training

### Search before asking - [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…

Iwill-github updated 10 months ago
4
InternLM/xtuner #161

How can I do full parameter fine-tuning the model with FP16

I modified `llama2_7b_full_wizardlm_e1_copy.py` with alpaca_dataset and added parameter `torch_dtype=torch.float16` in model loading, as following: ``` model = dict( type=SupervisedFinetune, …

yuqie updated 1 year ago
18
qiaoyuet/arxiv_daily #97

New submissions for Mon, 29 Apr 24

## Keyword: differential privacy ### State-of-the-Art Approaches to Enhancing Privacy Preservation of Machine Learning Datasets: A Survey - **Authors:** Chaoyu Zhang - **Subjects:** Cryptography an…

qiaoyuet updated 5 months ago
1
rickstaa/stable-learning-control #15

Validate LAC/SAC pytorch translation

## User story ## In order to be able to ship the LAC/SAC pytorch implementation to the team we need to validate whether it gives the same results as the LAC/SAC tensorflow version. ## Consideratio…

rickstaa updated 1 year ago
8
mjadiaz/toy-models #9

6-transfer-learning

The idea is to use transfer learning, at the most basic level as a first step, to using the trained agents in other environments: - [x] Implement a perfect ring or circle in the `models.py` file. …

mjadiaz updated 1 year ago
1
