-
I recently read your paper 《GPT Understands, Too》, and there is a passage I don't fully understand; I hope you can help explain it: "1) Discreteness: the original word embedding e of M has already become highly discrete after pre-training. If h is initialized with random distr…
-
Hi,
I'm new to deep learning and CNNs. I intend to use MatConvNet for my class project on facial age estimation. I have a training set of 8,000 face images and a validation set of 100…
-
https://arxiv.org/abs/1712.01076v1
-
Hi Bhargav and Likun,
I am having trouble loading my own data for Homework 3.2, minibatch stochastic gradient descent (SGD). In the sample code you provided in the notebook, the MNIST data is loaded…
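Since the excerpt doesn't show how the notebook expects the data, here is a minimal sketch of swapping in a custom dataset for minibatch SGD; the file names, array shapes, and batch size are assumptions, not the homework's actual interface:

```python
import numpy as np

# Assumed: features as a NumPy array of shape (N, D), labels of shape (N,).
# The file names below are placeholders.
X = np.load("my_features.npy").astype(np.float32)
y = np.load("my_labels.npy").astype(np.int64)

def minibatches(X, y, batch_size=64, shuffle=True, seed=0):
    """Yield (X_batch, y_batch) pairs covering one epoch of minibatch SGD."""
    idx = np.arange(len(X))
    if shuffle:
        np.random.default_rng(seed).shuffle(idx)
    for start in range(0, len(X), batch_size):
        batch = idx[start:start + batch_size]
        yield X[batch], y[batch]

for X_batch, y_batch in minibatches(X, y, batch_size=64):
    pass  # one SGD update on (X_batch, y_batch) goes here
```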
-
As discussed with @siddharthteotia, consider adding some common statistical analysis methods to the SQL language.
A few examples:
1. Pearson's coefficient
2. Sampling (Bernoulli/stratified)
5. Histogram…
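For reference, a minimal sketch of what a Pearson's-coefficient aggregate would compute over two columns, written in Python rather than SQL and using only the running sums a single-pass aggregate would need to maintain (the function name and interface are illustrative, not a proposed API):

```python
import math

def pearson_r(xs, ys):
    """Pearson's correlation from running sums: n, sum(x), sum(y),
    sum(x*x), sum(y*y), sum(x*y)."""
    n = len(xs)
    sx, sy = sum(xs), sum(ys)
    sxx = sum(x * x for x in xs)
    syy = sum(y * y for y in ys)
    sxy = sum(x * y for x, y in zip(xs, ys))
    num = n * sxy - sx * sy
    den = math.sqrt(n * sxx - sx * sx) * math.sqrt(n * syy - sy * sy)
    return num / den
```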
-
Run an experiment to evaluate the performance of a simulated annealing gradient descent (SA-GD) approach compared to traditional gradient descent (GD). The purpose of this experiment is to understand …
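Since the excerpt doesn't specify the SA-GD formulation, here is one minimal sketch of such an experiment, assuming SA-GD is modelled as gradient descent plus temperature-scaled Gaussian noise with a geometric cooling schedule, run on a toy non-convex objective (the objective, step size, and schedule are all assumptions):

```python
import numpy as np

def f(x):
    # Toy non-convex objective with several local minima.
    return x**2 + 3.0 * np.sin(3.0 * x)

def grad_f(x):
    return 2.0 * x + 9.0 * np.cos(3.0 * x)

def gd(x0, lr=0.05, steps=500):
    x = x0
    for _ in range(steps):
        x = x - lr * grad_f(x)
    return x

def sa_gd(x0, lr=0.05, steps=500, t0=1.0, cooling=0.99, seed=0):
    # Gradient step plus Gaussian noise scaled by a temperature that
    # decays geometrically, so exploration fades into plain GD.
    rng = np.random.default_rng(seed)
    x, t = x0, t0
    for _ in range(steps):
        x = x - lr * grad_f(x) + np.sqrt(t) * rng.normal(0.0, lr)
        t *= cooling
    return x

x0 = 2.5
print("GD    ends at f(x) =", f(gd(x0)))
print("SA-GD ends at f(x) =", f(sa_gd(x0)))
```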
-
Right now the abstract optimizer::Solver class is extremely minimalistic, only offering the ability to solve a Problem that was passed in during construction (incidentally, I think it should be possib…
-
Hi, I read your C++ code of LINE for Windows; it is a very good implementation. But I have a question: why didn't you consider the read-write conflict when updating the embedding vector in the Update() functio…
-
#### Learning Goals
[Learning goals, bulleted/numbered list is preferred]
[e.g. learn the concept and use of train/validation/test datasets using scikit-learn]
Learn to preprocess images, use a…
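A minimal sketch of what that learning goal could translate to in practice, assuming the images are already loaded as NumPy arrays and using scikit-learn only for the splits (the placeholder data, split ratios, and preprocessing steps are assumptions):

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Assumed starting point: images as a uint8 array of shape (N, H, W, C), labels of shape (N,).
images = np.random.randint(0, 256, size=(1000, 64, 64, 3), dtype=np.uint8)  # placeholder data
labels = np.random.randint(0, 10, size=1000)

# Basic preprocessing: scale pixel values to [0, 1] and flatten each image.
X = images.astype(np.float32) / 255.0
X = X.reshape(len(X), -1)

# Split into train / validation / test (70 / 15 / 15, chosen arbitrarily).
X_train, X_tmp, y_train, y_tmp = train_test_split(X, labels, test_size=0.3, random_state=0)
X_val, X_test, y_val, y_test = train_test_split(X_tmp, y_tmp, test_size=0.5, random_state=0)

print(X_train.shape, X_val.shape, X_test.shape)
```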
-
* [Link](https://www.mitpressjournals.org/doi/10.1162/089976698300017746)
* Title: Natural Gradient Works Efficiently in Learning
* Keywords (optional):
* Authors (optional):
* Reason (opti…