-
Training the Stable Diffusion XL UNet with the accelerate library using FSDP: fsdp_offload_params: true; fsdp_sharding_strategy: SHARD_GRAD_OP
Environment:
accelerate-0.34.2
torch-2.4.1
CUDA Version: 12…
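For orientation, a minimal sketch of what those two config keys map to at the raw torch FSDP level (the `wrap_unet_for_fsdp` helper and its `unet` argument are illustrative; accelerate normally performs this wrapping itself via `accelerator.prepare`):
```python
import torch
from torch.distributed.fsdp import (
    CPUOffload,
    FullyShardedDataParallel as FSDP,
    ShardingStrategy,
)

def wrap_unet_for_fsdp(unet: torch.nn.Module) -> FSDP:
    """Mirror the accelerate config above at the raw torch level."""
    return FSDP(
        unet,
        # SHARD_GRAD_OP shards gradients and optimizer state across ranks
        # while keeping parameters unsharded between forward and backward.
        sharding_strategy=ShardingStrategy.SHARD_GRAD_OP,
        # fsdp_offload_params: true -> offload sharded params/grads to CPU.
        cpu_offload=CPUOffload(offload_params=True),
    )
```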
-
Goal: Align the implementation with GCP_OPT while retaining the cp_opt top-level interface as faithfully as possible.
Components:
- [x] cp_opt.m
- [x] ktensor/fg.m
- [x] tt_opt_lbfgsb.m
- [ ] tt_op…
-
Retraining from a checkpoint works perfectly with on-the-fly tokenization, but breaks when using nanoset: training restarts with a different lr, which does not match the one stored in lr_schedule.pt.
We also have…
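A minimal diagnostic sketch for the lr mismatch (the helper name and checkpoint path are assumptions, not the project's actual API): after resuming, compare the optimizer's live lr with the lr recorded in the scheduler checkpoint.
```python
import torch

def check_resumed_lr(optimizer: torch.optim.Optimizer,
                     sched_path: str = "lr_schedule.pt") -> None:
    # torch's LRScheduler.state_dict() stores the lr it last applied under
    # "_last_lr"; after a correct restart the live lr should match it.
    state = torch.load(sched_path, map_location="cpu")
    saved = state.get("_last_lr")
    live = [group["lr"] for group in optimizer.param_groups]
    print(f"saved last lr: {saved}, live lr after restart: {live}")
```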
-
The error message I get:
Error in py_call_impl(callable, dots$args, dots$keywords) :
ValueError: decay is deprecated in the new Keras optimizer, please check the docstring for valid arguments, or u…
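Two standard fixes for this Keras change, sketched on the Python side with illustrative values: replace `decay` with a learning-rate schedule, or (on TF 2.11-2.15) fall back to the legacy optimizer that still accepts `decay`.
```python
import tensorflow as tf

# Fix 1: replace the removed `decay` argument with a learning-rate schedule.
lr_schedule = tf.keras.optimizers.schedules.ExponentialDecay(
    initial_learning_rate=1e-3, decay_steps=10_000, decay_rate=0.96)
opt = tf.keras.optimizers.Adam(learning_rate=lr_schedule)

# Fix 2 (TF 2.11-2.15): the legacy optimizer still accepts `decay`.
opt_legacy = tf.keras.optimizers.legacy.Adam(learning_rate=1e-3, decay=1e-6)
```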
-
We can reproduce this problem using the following command: `torchrun --master_addr=127.0.0.1 --master_port=1234 --nnodes=1 --nproc-per-node=1 --node_rank=0 test_optimizer_state.py --sharding_type $SHA…
-
Hi,
I'm using Tiny YOLOv2 and I'm trying to use the Adam optimizer during training, so I added the following lines to the cfg:
![image](https://user-images.githubusercontent.com/33591581/65779761-abef6a80-e…
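For reference, darknet enables Adam through keys in the `[net]` section of the cfg (read in darknet's parser.c); a typical snippet, with illustrative values rather than the ones from the screenshot, looks like:
```
[net]
# switch from SGD to Adam
adam=1
B1=0.9
B2=0.999
eps=0.000001
learning_rate=0.001
```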
-
### 🐛 Describe the bug
In multiprocessing mode (i.e. FSDP/DDP), JSONDecodeErrors occur within torch._inductor.triton_heuristics.cached_autotune if the filesystem does not lock the file itself.…
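A torch-free illustration of the failure mode (the file name and contents are made up): if one rank reads the shared autotune cache while another rank is mid-write and the filesystem provides no locking, the reader sees truncated JSON.
```python
import json

path = "autotune_cache.json"
with open(path, "w") as f:
    f.write('{"best_config": {"BLOCK": 128')  # writer interrupted mid-write

with open(path) as f:
    json.load(f)  # raises json.decoder.JSONDecodeError on the partial file
```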
-
**Describe the bug**
I get a ValueError when running the DeepFM demo.
**To Reproduce**
```python
# fails with ValueError: Could not interpret optimizer identifier:
model.compile(optimiz…
```
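This ValueError typically means `compile()` received something Keras cannot resolve to an optimizer, commonly an optimizer class instead of an instance, or an optimizer imported from a mismatched `keras`/`tf.keras` install. A sketch of forms that do resolve (the toy model is illustrative, not the DeepFM one):
```python
import tensorflow as tf

# Toy stand-in for the DeepFM model (illustrative only).
model = tf.keras.Sequential([tf.keras.layers.Dense(1, activation="sigmoid")])

model.compile(optimizer="adam", loss="binary_crossentropy")   # string id: ok
model.compile(optimizer=tf.keras.optimizers.Adam(1e-3),       # instance: ok
              loss="binary_crossentropy")

# One common trigger of the error: passing the class, not an instance.
# model.compile(optimizer=tf.keras.optimizers.Adam, loss="binary_crossentropy")
```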
-
### Issue type
Bug
### Have you reproduced the bug with TensorFlow Nightly?
Yes
### Source
binary
### TensorFlow version
2.16.1
### Custom code
Yes
### OS platform and distribution
WSL Ubun…
-
I have tried to implement multithreaded sampling by changing:
```julia
function estimate_energy_with_samples(prob, samples)
#return mean(Base.Fix1(LogDensityProblems.logdensity, prob), eachsa…
```
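For comparison, a minimal sketch of one threaded approach (the function name and the Vector-of-samples assumption are illustrative, not the package's API): split the log-density evaluations across threads and average the partial results.
```julia
using Statistics: mean
using LogDensityProblems

function estimate_energy_threaded(prob, sample_vec::AbstractVector)
    partial = Vector{Float64}(undef, length(sample_vec))
    Threads.@threads for i in eachindex(sample_vec)
        # each thread writes to its own slot, so no locking is needed
        partial[i] = LogDensityProblems.logdensity(prob, sample_vec[i])
    end
    return mean(partial)
end
```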