-
I need scripts to train different models with different hyperparameters in order to choose the smallest model that can give me better metrics. The scripts should do the following tasks:
- Find a go…
pab1s updated
3 months ago
-
## Intro to TensorFlow
- In TensorFlow, data is wrapped in an object called a tensor.
### Session
- TensorFlow’s API is built on top of a computational graph.
- TensorFlow Session **is an environment for running a graph.**
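
The split between building a graph and running it in a session can be illustrated with a toy in plain Python. This is not TensorFlow's API: `Node`, `ToySession`, and the helper functions are hypothetical names invented for this sketch, which only mimics the idea that defining operations computes nothing until a session runs them.

```python
class Node:
    """A node in a toy computational graph: an operation plus its input nodes."""

    def __init__(self, op, inputs=(), value=None):
        self.op = op
        self.inputs = inputs
        self.value = value  # only set for constants


def constant(v):
    return Node("const", value=v)


def add(a, b):
    return Node("add", (a, b))


def mul(a, b):
    return Node("mul", (a, b))


class ToySession:
    """Hypothetical stand-in for a session: evaluates a node on request."""

    def run(self, node):
        if node.op == "const":
            return node.value
        # Evaluate the inputs first, then apply this node's operation.
        args = [self.run(i) for i in node.inputs]
        if node.op == "add":
            return args[0] + args[1]
        if node.op == "mul":
            return args[0] * args[1]
        raise ValueError(f"unknown op: {node.op}")


# Building the graph performs no arithmetic; it only records the structure.
graph = mul(add(constant(2.0), constant(3.0)), constant(4.0))

# The computation (2 + 3) * 4 happens only when the session runs the graph.
result = ToySession().run(graph)
```

The design point is that `graph` is only a description; the same description could be run repeatedly or handed to a different executor.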
```p…
-
paper: https://arxiv.org/pdf/1907.02893
paper section to read:
3.2 Implementation details
When estimating the objective (IRMv1) using mini-batches for stochastic gradient
descent, one can obtain …
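
To make the mini-batch idea concrete, here is a minimal sketch in plain Python. It assumes a squared loss and a scalar dummy classifier fixed at w = 1.0, and it assumes the estimator multiplies the gradients computed on two disjoint halves of a mini-batch; check both assumptions against Section 3.2 of the paper. The function names and the toy data are hypothetical.

```python
def grad_wrt_dummy_w(preds, targets):
    """Gradient of mean((w * pred - y)**2) with respect to w, evaluated at w = 1.0."""
    n = len(preds)
    return sum(2.0 * (p - y) * p for p, y in zip(preds, targets)) / n


def irm_penalty_estimate(preds, targets):
    """Estimate the squared gradient norm from two disjoint halves of one mini-batch.

    The product of two independent unbiased gradient estimates is an unbiased
    estimate of the squared gradient, whereas squaring a single noisy estimate
    is biased upward by its variance.
    """
    half = len(preds) // 2
    g1 = grad_wrt_dummy_w(preds[:half], targets[:half])
    g2 = grad_wrt_dummy_w(preds[half:2 * half], targets[half:2 * half])
    return g1 * g2


# Toy mini-batch from a single environment (made-up numbers).
preds = [0.5, 1.5, 1.0, 2.0]
targets = [1.0, 1.0, 1.0, 1.0]
penalty = irm_penalty_estimate(preds, targets)
```

Note that, unlike a squared norm, this estimate can be negative on a single mini-batch; it is only correct in expectation.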
-
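# Advanced Convex Optimization
These are study notes for Lesson 6 of the MXNet course, on advanced convex optimization. They cover the basic properties of convex functions and focus on convergence rates and the more advanced optimization algorithms that follow: momentum, Adagrad, AdaDelta, RMSProp, and Adam. Particular attention goes to the EMA (Exponential Moving Average) introduced by the momentum method, which also appears in Adam and other optimization algorithms.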
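
The EMA can be sketched in a few lines of Python. This is a minimal illustration, assuming the common update v_t = beta * v_{t-1} + (1 - beta) * x_t starting from v_0 = 0, with the optional bias correction that Adam applies; the function name `ema` and its parameters are illustrative and not taken from the course notes.

```python
def ema(values, beta=0.9, bias_correction=True):
    """Exponential moving average of a sequence.

    Assumed update: v_t = beta * v_{t-1} + (1 - beta) * x_t, with v_0 = 0.
    With bias_correction, v_t is divided by (1 - beta**t) to offset the
    zero initialization, as Adam does for its moment estimates.
    """
    v = 0.0
    out = []
    for t, x in enumerate(values, start=1):
        v = beta * v + (1.0 - beta) * x
        out.append(v / (1.0 - beta ** t) if bias_correction else v)
    return out


# On a constant signal the corrected average recovers the constant,
# while the uncorrected one starts near zero and climbs slowly toward it.
corrected = ema([1.0, 1.0, 1.0, 1.0], beta=0.9)
uncorrected = ema([1.0, 1.0, 1.0, 1.0], beta=0.9, bias_correction=False)
```

In momentum-style optimizers the averaged sequence would be the gradients; Adam keeps one such average for the gradients and another for their squares.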
## 1. Properties of Convex Functions
OK…
-
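> PyTorch is a deep learning framework developed by Facebook on top of **Torch**, moving from the rarely used Lua language to Python. Torch was a very well-known deep learning framework before TensorFlow was open-sourced. After its own open-source release, PyTorch attracted a great deal of attention for its ease of use and its dynamic computation graph, and it became TensorFlow's biggest competitor. Its GitHub repository currently has more than 28,000 followers.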
…
-
Hello,
I discovered your apex tools for integrating mixed precision and FP16 training in PyTorch, which is a **great** idea to develop! Our servers are mainly equipped with TITAN V cards, hence I w…
-
FreeLB: Enhanced Adversarial Training for Language Understanding
Chen Zhu, Yu Cheng, Zhe Gan, Siqi Sun, Tom Goldstein, Jingjing Liu, ICLR 2020
- https://arxiv.org/abs/1909.11764
- [openreview (8-8…
-
I'm working on a [BayesianLinearRegression.jl](https://github.com/cscherrer/BayesianLinearRegression.jl) module, and I'd like for it to fit in with the Julia ecosystem in a natural way. @johnmyleswhit…