optim Search Results - Githubissues

1000+ results
for optim

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

allenai/OLMo #692

why CrossEntropyLoss is zero,i

### ❓ The question System/Peak GPU Memory (MB)=6,784 2024-08-06 09:59:26.181 intern-studio-160750:0 olmo.train:908 INFO [step=1/739328,epoch=0] optim/total_grad_norm=231.7 train/C…

aizhweiwei updated 2 weeks ago
2
mirage/mirage-crypto #241

[optim - C binding] Use a contiguous array/flat representati…

At the moment, in the C bindings, the points are represented as a struct with 3 fields. It creates 3 indirections when calling the C code. When performing many operations on the same point, the same m…

dannywillems updated 1 month ago
2
pytorch/torchtune #1278

[RFC] Optimizer CPU offload from torchao for single GPU low …

The recent addition of optimizer CPU offload in torchao can be useful for single GPU low memory config. https://github.com/pytorch/ao/tree/main/torchao/prototype/low_bit_optim#optimizer-cpu-offload…

gau-nernst updated 2 weeks ago
7
liweitj47/overnight-stock-movement-prediction #6

no module named optims ?

i tried to run the train.py but i encountered with this error " no module named optims. ". i thought it's a library so i tried to install it using pip , but it turned out not to be this case. so can …

sia-watsonlee updated 4 months ago
1
DassHydro/smash #301

Functions to manipulate the control vector and compute cost/…

For many applications, one needs to pass a function that evaluates the cost (or the log-posterior) from the control vector. For instance: 1. MCMC sampling: samples=MCMC(logpost_function, starting_poi…

benRenard updated 2 weeks ago
1
princeton-nlp/MeZO #37

question about MeZO-adam

Hi! I find MeZO-adam code in medium size folder, but it uses the Adam from pytorch.optim. Its not like the case in large_models that author re-write the inner_loop. Can you please explain it? Thank yo…

zhaoaustin updated 4 days ago
1
askalia-org/poney-racer #131

optim 2

optim 2

askalia updated 4 years ago
2
askalia-org/poney-racer #132

optim 3

optim 3

askalia updated 4 years ago
2
kohya-ss/sd-scripts #837

CAME optim

I'd like to check this optimizer if its not too hard to implement, it should be less mem usage than Adamw8bit https://github.com/yangluo7/CAME

betterftr updated 10 months ago
1
ai-safety-foundation/sparse_autoencoder #210

ImportError: cannot import name 'params_t' from 'torch.optim…

Ran this from the demo code: ``` import os # Check if we're in Colab try: import google.colab # noqa: F401 # type: ignore in_colab = True except ImportError: in_colab = False …

seansica updated 1 month ago
2

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for optim

1000+ results
for optim