-
How did you initialize the cell state and the hidden state of the LSTM layers?
You gave an equation but didn't explain much. I wonder what the f_init function is. I read the code and guess it is a ta…
-
Has anyone exported the soundstream model to ONNX? I tried:
`torch.onnx.export(soundstream, audio, "soundstream.onnx")`
but it fails with
```
/home/divya/.local/lib/python3.8/site-packages/…
-
I tried to fine-tune the BERT model as an embedding model, which maps sentences to a space where the `cosine` similarity between two sentence embedding vectors can be interpreted as the sentence simil…
-
Because of the autoregressive nature of Transformers, I know that they are fairly slow when generating new sequences from scratch, but I was wondering if you had any tips or tricks on how to do faster…
-
https://hanxiao.github.io/2018/04/21/Teach-Machine-to-Comprehend-Text-and-Answer-Question-with-Tensorflow/
-
**Found a bottleneck: the attention layer**
I have found a potential bottleneck for why bug #22 occurred. It seems like the axial attention layer is some kind of bottleneck. I ran the network for 100…
-
## 論文リンク
http://www.anlp.jp/proceedings/annual_meeting/2019/pdf_dir/P5-25.pdf
## 著者
○松好祐紀, 滝口哲也, 有木康雄, 瀧和男, 片桐恵子 (神戸大)
-
**Paper**: http://openaccess.thecvf.com/content_ICCV_2019/papers/Choi_Looking_to_Relations_for_Future_Trajectory_Forecast_ICCV_2019_paper.pdf
**Summary**: Predict future trajectories of all objects…
-
When I try running my **nmt_code.sh** which contains:
python -m nmt.nmt \
--vocab_prefix=$path_file/nmt_data_new/ \
--train_prefix=$path_file…
-
Could it be that in the Mackey Glass experiment, the network is asked to approximate a 15-step delay, instead of simulating a complex system?
The definition of the X and Y data is as follows:
```
…