rnn-implementations Search Results

669 results
for rnn-implementations

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

alnfedorov/lowbitdnn-project #2

Primitives design

It's a well-known fact that many convolutions can be thought of as a direct matrix multiplication(Im2Col and more subtle ideas). cuDNN white-paper directly states that NVIDIA developers use precisely …

alnfedorov updated 4 years ago
9
pytorch/pytorch #61452

New module: Split log softmax with loss

## 🚀 Feature: SplitLogSoftmaxWithLoss An approximation of Log Softmax based on [splitsoftmax](https://gist.github.com/alisafaya/785e431539917cfbaab23281b77699d9). ## Motivation The cur…

alisafaya updated 3 years ago
2
microsoft/onnxruntime #6618

Request examples for GPU inference with C API

Dear friends, Could you please provide an example to show how place CPU data to GPU and inference with C API ? It would be appreciate if you like give some API more examples, such as Creat…

delldu updated 3 years ago
3
pytorch/pytorch #42545

CuDNN RNN bindings are basically all deprecated in cudnn 8

Many (all?!!) the rnn-related raw cudnn calls in https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/cudnn/Descriptors.h https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/native/…

mcarilli updated 4 years ago
2
hehefan/Recurrent-Attention-Model #2

Zero gradients in LocationNetwork

Thanks for your code! I think use tf.stop_gradient() for both "mean_loc" and "sample_loc" causes the gradients of location network to be None. Here is the gradients information: GlimpseNetwork/…

XoriieInpottn updated 5 years ago
1
pytorch/pytorch #20102

LSTM forget bias must be initialized properly

## 🚀 Feature LSTM forget bias must be initialized to 1 or 2 for better training. ## Motivation Please see: https://pdfs.semanticscholar.org/1154/0131eae85b2e11d53df7f1360eeb6476e7f4.pdf http:…

hengfun updated 1 year ago
6
worldmodels/worldmodels.github.io #3

Training time and procedure

Amazing work! I had a practical question about the time it took to train these models on the setup you described in the article. Would you be able to share more? In addition, would this repository be …

hundredblocks updated 6 years ago
2
pytorch/nestedtensor #344

Comment about nested tensor vs. ragged tensor

Guys, this is a very general comment and FYI... To some extent you guys seem to be viewing NestedTensor as a generic ragged-tensor data structure, similar to TensorFlow's RaggedTensor. I understan…

danpovey updated 3 years ago
1
ikostrikov/pytorch-a2c-ppo-acktr-gail #284

why PPO needs to store action_log_probs instead of using sto…

Hi, I am looking at the PPO implementation, and I am curious about this part (actually many other implementations are using this workflow as well, so I am also curious to see if I miss anything) …

Emerald01 updated 2 years ago
1
ematvey/hierarchical-attention-networks #11

ValueError in running worker.py

Sorry to bother you again. I used the tensorflow=1.2.1, python=3.6, run worker.py as your instructions, but it encountered an error. **ValueError:** Trying to share variable tcm/word/fw/multi_rnn_ce…

jianzhengming updated 6 years ago
12

上一页 1...2 3 4 5 6 7 8...67 下一页

669 results for rnn-implementations

669 results
for rnn-implementations