sgd-optimizer Search Results

pytorch/xla #8071

Optimizer Memory in AdamW/Adam vs SGD

## ❓ Questions and Help It is to my understanding that Adam should use more memory than SGD because it keeps track of more parameters. However, when I look at my profiles between Adam and SGD optim…

dangthatsright updated 1 month ago

qiboteam/boostvqe #83

Optimizer option (readability)

In `train_vqe` in `main.py`, the optimizer options are given by argument `optimizer_options`. However, the description in the `help` documentation is unclear (without example code, a general user woul…

Sam-XiaoyueLi updated 1 day ago

pytorch/pytorch #137373

`dampening`, `maximize`, `foreach`, `differentiable` and `fu…

### 🐛 Describe the bug [The doc](https://pytorch.org/docs/stable/generated/torch.optim.SGD.html) of `optim.SGD()` doesn't say that the type of `dampening`, `maximize`, `foreach`, `differentiable` a…

hyperkai updated 3 weeks ago

layumi/Person_reID_baseline_pytorch #174

Why SGD optimizer?

Hi, I wanted to know is there any specific reason that you are using SGD with momentum optimizer instead of more recent variants like Adam and AdaGrad? How will the model perform if I use Adam? …

Rajat-Mehta updated 4 years ago

ultralytics/yolov5 #13399

why different optimizer train get different result

### Search before asking - [X] I have searched the YOLOv5 [issues](https://github.com/ultralytics/yolov5/issues) and [discussions](https://github.com/ultralytics/yolov5/discussions) and found no simi…

tank1530532 updated 1 day ago

ashleve/lightning-hydra-template #457

How to define a conditional search space ?

Hi! I want to search the best optimizer for the given "mnist_example" from SGD and Adam. However, for SGD, I also want to know which momentum value is the best (which Adam doesn't need), but for …

tianshuocong updated 3 days ago

heal-research/pyoperon #23

L2 regularization for constant optimization

Hi, is there a way to penalize the magnitude of the constants (via, e.g., L2 regularization)? I am trying to fit a `SymbolicRegressor` with some noisy data and sometimes I get very large values for…

Smantii updated 2 weeks ago

hiyouga/LLaMA-Factory #5513

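Is it possible to add differentially private training to the train function? If I want to use opacus to add differential privacy during SFT fine-tuning, how should I…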

### Reminder - [X] I have read the README and searched the existing issues. ### System Info 0.9.0 ### Reproduction With opacus, you only need to wrap the training function with the privacy_engine.make_private function; where should I make this change for SFT? model = Ne…

DSW2001 updated 1 month ago

Darya0170/LabKZ #1

Comments on the second lab assignment

The code in the 11th cell from the top (counting only code cells, starting from 1) does not run; it raises a runtime error: Training with SGD optimizer ----------------------------------------…

Cubgl updated 6 days ago

tenstorrent/tt-metal #13643

[Bug Report] Binary operations which require ttnn::repeat ar…

**Describe the bug** Broadcast over batch dimension makes ops work much slower. We are using them in the optimizer step for each layer https://github.com/tenstorrent/TT-Tron/blob/main/sources/ttml/op…

dmakoviichuk-tt updated 4 days ago

1000+ results for sgd-optimizer