batch-gradient-descent Search Results

tensorfork/tensorfork #35

BigGAN: D real+fake data augmentations (Zhao et al 2020b)

The new paper ["Image Augmentations for GAN Training"](https://arxiv.org/abs/2006.02595#google), Zhao et al 2020b, reports: > Data augmentations have been widely studied to improve the accuracy and…

gwern updated 3 years ago

yuenshome/yuenshome.github.io #1

MXNet Advanced Convex Optimization

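# Advanced Convex Optimization These are study notes for lesson 6 of the MXNet course, on advanced convex optimization. They cover the basic properties of convex functions and mainly explain convergence rates and the advanced optimization algorithms that follow, including momentum, Adagrad, AdaDelta, RMSProp, and Adam. The notes focus on the EMA (Exponential Moving Average) introduced by the momentum method, which also shows up in Adam and other optimization algorithms. ## 1. Good properties of convex functions…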

ysh329 updated 6 years ago

mizukihiraishi/Study-AI #3

Deep Learning, Part 1

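### Deep Learning day1 - 00_Prologue 1_Discriminative and generative models > Classifying machine learning models by the purpose of their inputs and outputs > Discriminative model: classifies data into target classes; converts high-volume data into low-volume data > Generative model: generates data of a specific class; converts low-volume data into high-volume data > p(Ck|x): under the condition that some data x is given…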

mizukihiraishi updated 2 years ago

ccc013/Study-Notes #1

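> PyTorch is a deep learning framework developed by Facebook. It is based on **Torch** and moved from the less commonly used Lua language to Python. Torch was a very well-known deep learning framework before TensorFlow was open-sourced. After PyTorch was open-sourced, its ease of use and dynamic computation graph drew a great deal of attention, and it became TensorFlow's biggest competitor. Its GitHub repository currently has 28,000+ stars. …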

ccc013 updated 5 years ago

chenghong-lin-nu/blog #6

DLND-Week3

## Intro to TensorFlow - In TensorFlow, data is encapsulated in an object called a tensor. ### Session - TensorFlow's API is built on a computational graph. - A TensorFlow Session **is an environment for running a graph.** ```p…

chenghong-lin-nu updated 6 years ago

lifeiteng/vall-e #94

Pretrained Model

Hi, Are you planning on releasing a pretrained model anytime? Thanks

nivibilla updated 1 year ago

percyliang/sempre #217

Purpose of method "foreach" in class CallFormula

Hi, I'm trying to get some deeper insight into the code of sempre. For the moment, I'm looking at the simple Java logical forms and their execution by the class JavaExecutor. The class CallFormula f…

stbusch updated 3 years ago

marrlab/DomainLab #819

implement invariant risk minimization

paper: https://arxiv.org/pdf/1907.02893 paper section to read: 3.2 Implementation details When estimating the objective (IRMv1) using mini-batches for stochastic gradient descent, one can obtain …

smilesun updated 5 months ago

elixir-nx/axon #592

The loss increases until it become NaN on XOR example

The "Modeling XOR with a neural network" example doesn't work. The loss increases until it becomes `NaN` [modeling_xor_with_a_neural_network.livemd.zip](https://github.com/user-attachments/files/16…

jn-jairo updated 2 months ago

jinzhuoran/RWKU #2

High Computation Cost

Hi, Thanks for sharing the impressive code! The computation cost of this repo is higher than expected. As [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory) suggested, a 7B model would only…

zhmzm updated 3 months ago

1000+ results for batch-gradient-descent