-
Hi there,
I'm training an AM on 4 Tesla V100 GPUs, initially using SGD; each epoch takes ~12 min. Since I'm having some problems with the LR, I wanted to move to the NovoGrad optimizer, but unfortunately is…
-
File "D:\A_File\Project\RepLKNet-pytorch-main\optim_factory.py", line 19, in
from timm.optim.novograd import NovoGrad
ModuleNotFoundError: No module named 'timm.optim.novograd'
-
Great work!
Can you please add an implementation of the NovoGrad algorithm?
Support info:
paper: https://arxiv.org/abs/1905.11286
Novograd implementations:
https://github.com/NVIDIA/apex/blob/mas…
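For reference, the layer-wise update from the paper can be sketched in a few lines. This is my own simplified NumPy illustration, not the apex or timm implementation; the function name and default hyperparameters are assumptions:

```python
import numpy as np

def novograd_step(w, g, m, v, lr=0.01, beta1=0.95, beta2=0.98,
                  eps=1e-8, weight_decay=0.0):
    """One NovoGrad update for a single layer (illustrative sketch).

    w, g, m : per-layer weight, gradient, and first-moment arrays
    v       : per-layer *scalar* second moment -- the key difference
              from Adam, which keeps a per-element second moment
    Returns the updated (w, m, v).
    """
    g_norm_sq = float(np.sum(g * g))
    if v is None:                       # first step: initialize moments
        v = g_norm_sq
        m = g / (np.sqrt(v) + eps) + weight_decay * w
    else:
        v = beta2 * v + (1.0 - beta2) * g_norm_sq
        m = beta1 * m + (g / (np.sqrt(v) + eps) + weight_decay * w)
    w = w - lr * m
    return w, m, v
```

Because `v` is a single scalar per layer, the gradient is normalized layer-wise before the momentum accumulation, which is what gives NovoGrad its robustness to the learning-rate choice.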
-
In both of the Jupyter notebooks and the paper, I noticed that instead of Adam, the most commonly used optimizer for transformers, you used Adagrad for all of the experiments. Is there a reason …
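For context on the contrast being drawn: Adagrad accumulates the lifetime sum of squared gradients per element rather than an exponential moving average, so its effective step size only shrinks over training. A minimal NumPy sketch (my own illustration, not the repo's code):

```python
import numpy as np

def adagrad_step(w, g, acc, lr=0.1, eps=1e-10):
    """One Adagrad update (illustrative sketch).

    acc is the per-element running *sum* of squared gradients; unlike
    Adam there is no decay term, so past gradients are never forgotten.
    """
    acc = acc + g * g                   # lifetime sum of squared grads
    w = w - lr * g / (np.sqrt(acc) + eps)
    return w, acc
```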
-
Any plans to migrate the code to TF 2.0?
-
- [x] Auto mixed precision and loss scaling
- [x] Fix transducer embedding tflite conversion (tflite conversion raises a bug when using `tf.gather` in `tf.while_loop`)
- [x] Fix transducer tflite co…
-
self.lr = K.variable(lr, name='lr')
AttributeError: can't set attribute
Keras version: 2.3.1
How can I solve this issue?
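This error usually means the optimizer base class exposes `lr` as a read-only property (in recent Keras/tf.keras it aliases `learning_rate`), so plain attribute assignment fails. A minimal, framework-free sketch of the mechanism (the class here is an illustrative stand-in, not the Keras source):

```python
class Optimizer:
    """Mimics a base class that exposes `lr` as a read-only property
    (a simplified stand-in for illustration, not Keras code)."""

    def __init__(self, learning_rate=0.01):
        self.learning_rate = learning_rate

    @property
    def lr(self):                      # getter only: no setter defined
        return self.learning_rate


opt = Optimizer()
try:
    opt.lr = 0.1                       # property has no setter
except AttributeError as e:
    print(e)                           # e.g. "can't set attribute"

opt.learning_rate = 0.1                # assigning the backing attribute works
```

The usual fix in a custom optimizer subclass is therefore to set the backing hyperparameter (e.g. `self.learning_rate`) rather than assigning to `self.lr` directly.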
-
Hi,
Thanks so much for the wrapper API.
I was trying to do real-time keyword spotting (KWS) using your wrapper and my trained .tflite model, following the [google paper](https://arxiv.org/pdf/2005.067…
-
Currently, it is not possible to cache data to the GPU during training of DeepEdit to accelerate training, as described in the [Fast Training Tutorial](https://github.com/Project-MONAI/tutorials/blob/main…
-
There are a number of differences compared to the source:
* original article: https://arxiv.org/pdf/1910.10261.pdf
Differences:
1. The default training script uses the LibriTTS dataset instead of…