batch-gradient-descent Search Results

1000+ results for batch-gradient-descent

Best match


vislearn/FrEIA #77

Requesting Advice on NF Methods

I am working on a project where I sample a set of n-dimensional points from a Gaussian distribution (of learnt parameters) as follows and then evaluate those points based on a loss function to update …

kayuksel updated 2 years ago
1
mratsim/Arraymancer #112

CUDA - Custom memory management to avoid expensive alloc/dea…

Memory allocation and release will probably become a bottleneck during forward and backward propagation. During the forward pass it will hold input tensors in cache. During backward pass it wi…

mratsim updated 6 years ago
8
p-koo/tfomics #4

PGD attacker as a tf.keras.Model subclass

The attacker API here could be made simpler, imho, if the adversarial training were implemented in `tf.keras.Model.train_step`. That function is called by `fit()` on every batch of data. By using …

kaczmarj updated 3 years ago
1
jl749/LAMB_optimizer #1

Paper Reading

# Deep learning is EXPENSIVE e.g. train ResNet50 with ImageNet dataset for 80 epochs 80 * 1.3M images * 7.7B ops per img # Solution? - **Data Parallelism (large batch training)** ![image](http…

jl749 updated 1 year ago
8
RUCKBReasoning/RESDSQL #67

Why is 24 GB of GPU memory not enough when training the Cross-Encoder?

Did I get something wrong, or does it really need this much GPU memory?

Mucalinda2436 updated 9 months ago
1
FluxML/Flux.jl #876

Model optimization fails (NaNs) with Zygote.pullback but wor…

@MikeInnes I have a very simple model that does not train on Flux#master due to NaNs from exploding gradients. However the exact same code works and trains as expected with `Zygote.pullback -> Tracker…

jessebett updated 4 years ago
14
tensorflow/model-remediation #29

[GSOC]: Active Sampling for Min-Max Fairness

**Project Description:** Min-max fairness is a natural and desirable notion of subgroup fairness. The goal of this project is to develop open source implementations of recent [research](https://arxiv.…

pranjal-awasthi updated 2 years ago
11
llSourcell/tensorflow_demo #7

Fixing errors in board.py associated with deprecated tf attr…

import input_data from tensorflow.examples.tutorials.mnist import input_data mnist = input_data.read_data_sets("MNIST_data/", one_hot=True) import tenso…

hermano360 updated 7 years ago
1
wiseodd/natural-gradients #2

possible bug in kfac

In the pytorch implementation of kfac, G1_ is computed as: G1_ = 1/m * a1.grad.t() @ a1.grad However, the a1.grad is different from the a_1 in (1) of kfac's paper. Specifically, when you do back…

renyiryry updated 5 years ago
2
henryliangt/usyd #32

perceptron ML7

![image](https://user-images.githubusercontent.com/23263731/200564311-220f4f34-3d31-4eb5-a5d1-6af2683cab5a.png) ![image](https://user-images.githubusercontent.com/23263731/200296378-68401e7c-…

henryliangt updated 2 years ago
10

