Hi all,
I found that Adam-mini 1.0.1 cannot run with 4 shards; it throws an exception related to tensor reshaping:
```
File "/opt/conda/lib/python3.10/site-packages/adam_mini/adam_m…
```
-
At the moment, the Adam optimizer keeps the exponential moving averages decoupled from the bias correction, as in the original paper. However, it is possible to combine these operations into a sing…
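A minimal sketch of the idea (not the actual implementation from this issue): the two explicit bias-correction divisions can be folded into a single scalar step size, `alpha_t = lr * sqrt(1 - beta2**t) / (1 - beta1**t)`, so no corrected `m_hat`/`v_hat` tensors need to be materialized. Note that `eps` then sits in a slightly different place, so the two forms agree only approximately.

```python
import math

def adam_step_decoupled(m, v, g, t, lr=1e-3, b1=0.9, b2=0.999, eps=1e-8):
    """Standard Adam: EMA updates, then explicit bias correction."""
    m = b1 * m + (1 - b1) * g
    v = b2 * v + (1 - b2) * g * g
    m_hat = m / (1 - b1 ** t)          # bias-corrected first moment
    v_hat = v / (1 - b2 ** t)          # bias-corrected second moment
    return m, v, lr * m_hat / (math.sqrt(v_hat) + eps)

def adam_step_fused(m, v, g, t, lr=1e-3, b1=0.9, b2=0.999, eps=1e-8):
    """Both corrections folded into one scalar step size alpha_t.

    Equivalent up to where eps enters: here eps is effectively scaled
    by sqrt(1 - b2**t), so the update differs from the decoupled form
    by a term of order eps.
    """
    m = b1 * m + (1 - b1) * g
    v = b2 * v + (1 - b2) * g * g
    alpha_t = lr * math.sqrt(1 - b2 ** t) / (1 - b1 ** t)
    return m, v, alpha_t * m / (math.sqrt(v) + eps)
```

For a scalar parameter the two variants produce updates that agree to within roughly `eps`, while the fused form does one fewer division per tensor.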
-
As the title says.
-
Platforms: linux
This test was disabled because it is failing in CI. See [recent examples](https://hud.pytorch.org/flakytest?name=test_grad_scaler_with_preset_grad_scale_in_place_unscale_True_Adam_cu…
-
Here is the error:
```
File "/home/workspace/x-flux-main/src/flux/modules/layers.py", line 499, in __call__
output = attn.linear2(torch.cat((attn_1, attn.mlp_act(mlp)), 2))
torch.OutOfMemoryError…
```
-
The embedded workflow is set on the MiqRequestTask in `MiqProvisionRequestTemplate#service_options`, called from `MiqProvisionRequestTemplate#create_tasks_for_service`:
```
[----] D, [2024-07-12T13:16…
```
-
Platforms: linux
This test was disabled because it is failing in CI. See [recent examples](https://hud.pytorch.org/flakytest?name=test_grad_scaler_with_preset_grad_scale_in_place_unscale_False_Adam_c…
-
### 🐛 Describe the bug
When running with Adam in eager mode, roughly one third of our benchmark models fail the accuracy check, spread roughly uniformly across suites.
[list of failing models](https://github.com/pytorch/pytorch/…
-
According to the [docs](https://github.com/tensorflow/kfac/blob/master/kfac/python/keras/README.md), this optimizer is supposed to `converge much faster (>3.5x) and with fewer iterations (>14x) than S…