-
I changed the backbone from vgg to resnet50. First, I have to use a very low learning rate, otherwise training diverges. Second, mAP is usually lower than with the vgg-backbone adaptive teacher, even if I train lo…
-
| Team Name | Affiliation |
|---|---|
| Sharks | epfl; epfl; epfl |
- Paper: [Hyper-Regularization: An Adaptive Choice for the Learning Rate in Gradient Descent](https://openreview.net/pdf?id=S1e_ss…
-
Hi,
I have used the captioning nodes and they worked fine, but when I try to run the lora node, I get the issue below. It seems to have trouble recognising the checkpoint. From …
-
I was in contact with Victor Lafargue, who suggested I ping @danielhanchen for all cuML t-SNE-related questions. So here goes: three suggestions, including some questions.
1) Recent research…
-
edit: why do they let you post blank issues?
Here's a minimal example of what I was running into with the adaptive control notebook, which I got around by using a small learning rate and scaling th…
-
I am trying to create a GPU-based environment where a model, say resnet18, is being trained, and where the number of environments can be greater than 1. I am not familiar with Jax but I am planning to learn i…
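To illustrate what I mean by more than one environment, here is a rough sketch of the pattern I am hoping Jax supports; the `env_step` dynamics and state shapes are made up for illustration, only `jax.vmap` and `jax.jit` are actual Jax APIs:

```python
import jax
import jax.numpy as jnp

# Placeholder single-environment step: takes one env state and an action,
# returns the next state and a reward. Real environment logic would go here.
def env_step(state, action):
    next_state = state + action          # toy dynamics
    reward = -jnp.sum(next_state ** 2)   # toy reward
    return next_state, reward

# vmap turns the single-env step into a batched step over N environments,
# and jit compiles the batched step so it runs on the GPU.
batched_step = jax.jit(jax.vmap(env_step))

num_envs = 8
states = jnp.zeros((num_envs, 4))        # N independent env states
actions = jnp.ones((num_envs, 4))
states, rewards = batched_step(states, actions)
print(rewards.shape)                     # (8,)
```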
-
In certain training scenarios, I see extremely spiky cost trajectories through training. I bet this could be solved (at least partially) by implementing adagrad or some other adaptive learning rate sc…
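To make the suggestion concrete, here is a minimal Adagrad-style update sketch (plain NumPy, not tied to this codebase; the learning rate and epsilon values are just placeholders):

```python
import numpy as np

def adagrad_update(params, grads, accum, lr=0.1, eps=1e-8):
    """One Adagrad step: each parameter's step size is lr scaled down by the
    square root of its accumulated squared gradients, so coordinates with
    consistently large gradients take smaller steps and spikes are damped."""
    accum += grads ** 2
    params -= lr * grads / (np.sqrt(accum) + eps)
    return params, accum

# Usage: keep one accumulator per parameter array across iterations.
params = np.zeros(10)
accum = np.zeros_like(params)
for step in range(100):
    grads = np.random.randn(10)          # stand-in for real gradients
    params, accum = adagrad_update(params, grads, accum)
```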
-
Implement quick & dirty single-node version of FTRL with L1 regularization and adaptive learning rate: https://www.eecs.tufts.edu/~dsculley/papers/ad-click-prediction.pdf
First step: all features (…
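For reference, a rough single-node sketch of the per-coordinate FTRL-Proximal update described in that paper (logistic loss, sparse dict features); the class name and hyperparameter defaults are placeholders, not the final implementation:

```python
import math
from collections import defaultdict

class FTRLProximal:
    """FTRL-Proximal logistic regression with L1/L2 regularization and a
    per-coordinate adaptive learning rate, as in the linked paper."""

    def __init__(self, alpha=0.1, beta=1.0, l1=1.0, l2=0.1):
        self.alpha, self.beta, self.l1, self.l2 = alpha, beta, l1, l2
        self.z = defaultdict(float)   # per-coordinate lazy weight state
        self.n = defaultdict(float)   # per-coordinate sum of squared gradients

    def _weight(self, i):
        z = self.z[i]
        if abs(z) <= self.l1:
            return 0.0                # L1 keeps this coordinate exactly zero
        sign = 1.0 if z >= 0 else -1.0
        return -(z - sign * self.l1) / (
            (self.beta + math.sqrt(self.n[i])) / self.alpha + self.l2)

    def predict(self, x):
        """x: sparse dict of feature index -> value. Returns P(y = 1)."""
        wx = sum(self._weight(i) * v for i, v in x.items())
        return 1.0 / (1.0 + math.exp(-max(min(wx, 35.0), -35.0)))

    def update(self, x, y):
        """y in {0, 1}. Per-coordinate adaptive-learning-rate update."""
        p = self.predict(x)
        for i, v in x.items():
            g = (p - y) * v           # gradient of log loss for coordinate i
            sigma = (math.sqrt(self.n[i] + g * g) - math.sqrt(self.n[i])) / self.alpha
            self.z[i] += g - sigma * self._weight(i)
            self.n[i] += g * g
        return p
```

Features here are assumed to be sparse dicts of index → value; hashing them into a fixed-size space, as the paper does, can be layered on top of this.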
-
When training with batch size 4 on one H100, the speed is 1.27 seconds / it
When training with batch size 4 on 2x H100, the speed is 2.05 seconds / it
So basically we got almost no speed boost from multiple GPU t…
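As a rough sanity check on those numbers (assuming batch size 4 means per GPU, so 8 samples per step on 2x H100): one GPU processes about 4 / 1.27 ≈ 3.1 samples/s, while two GPUs process about 8 / 2.05 ≈ 3.9 samples/s, i.e. roughly a 1.2x speedup instead of the ideal 2x. If 4 is instead the global batch size, two GPUs are actually slower per sample than one.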
-
Add descriptions to the Parameters Appendix for Deep Learning parameters:
pretrained_autoencoder
overwrite_with_best_model
hidden
epochs
train_samples_per_iteration
target_ratio_comm_to_comp
…