deterministic-policy-gradients Search Results

138 results
for deterministic-policy-gradients

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

rodluger/starry #281

not implemented error from basic tutorial

**Describe the bug** during the following code block in the [basic tutorial](https://starry.readthedocs.io/en/latest/notebooks/Basics/), the error below happened. the entire text of the error is past…

mikelty updated 3 years ago
4
hpcaitech/ColossalAI #3403

[BUG]: Cannot use pipeline and gemini at the same time

### 🐛 Describe the bug I previously attempted to submit a similar issue on #3383, but some of my imprecise expressions may cause unnecessary misunderstandings, which could increase the cost of unders…

liuzeming-yuxi updated 1 year ago
1
hpcaitech/ColossalAI #3383

[BUG]: Deadlock when using gemini and pipeline at the same t…

### 🐛 Describe the bug Hi~ We tried to use pipeline parallel + gemini to train a model.But it seems that there was a deadlock during communation.The following is a simple reproduction based on the [o…

liuzeming-yuxi updated 1 year ago
1
Computational-Content-Analysis-2020/Readings-Responses-Spring #32

Deep Classification, Embedding & Text Generation - Orientati…

LeCun, Yann, Yoshua Bengio & Geoffrey Hinton. 2015. “[Deep Learning](https://www.nature.com/articles/nature14539).” Nature 521: 436-444. Karpathy, Andrej. 2015. “[The Unreasonable Effectiveness of …

HyunkuKwon updated 4 years ago
17
facebookresearch/BenchMARL #106

> Benchmarl automatically makes a video.

> Benchmarl automatically makes a video. > > In particular you might want to set these parameters > > https://github.com/facebookresearch/BenchMARL/blob/a9309159d6d46d099bd3d395ef1…

armansouri9 updated 3 months ago
15
lucidrains/TimeSformer-pytorch #8

Discussion on training issues I have encountered

Thank you for the implementation for the paper. This is the first time I'm dealing with transformer model, I tried to train over Kinetics700 dataset using this model. and I just want to share some of …

zmy1116 updated 2 years ago
33
ray-project/ray #14878

[rllib] SAC numerical instability

### What is the problem? SAC calculates the gaussian log probability based on clamped values, which can result in very large values if the tanh saturates and as a consequence result in explodin…

dHonerkamp updated 4 months ago
1
NVIDIA/Megatron-LM #937

[BUG]Get an AtrributeError when trying to finetune llama3-8B…

**Describe the bug** I try to finetune `llama3-8B` model with multi nodes but get an AtrributeError when finishing loading mcore format checkpoint and starting to build datasets, the error is below: …

nakroy updated 3 weeks ago
5
tensorflow/tensorflow #58310

TF built from source produces different XLA compiler results

Click to expand! ### Issue Type xla ### Source source ### Tensorflow Version tf.__git_version__ = v2.10.0-rc3-6-g359c3cdfc5f ### Custom Code No ### OS Platform and Distri…

pranavladkat updated 1 year ago
7
A-suozhang/GetArxivDaily #35

New submissions for Mon, 17 Apr 23

## Keyword: efficient ### End-to-end codesign of Hessian-aware quantized neural networks for FPGAs and ASICs - **Authors:** Javier Campos, Zhen Dong, Javier Duarte, Amir Gholami, Michael W. Mahoney,…

A-suozhang updated 1 year ago
1

上一页 1...1 2 3 4 5 6 7...14 下一页

138 results for deterministic-policy-gradients

138 results
for deterministic-policy-gradients