-
When I did this job recently, the most trouble I had was with the maths. So:
- https://julialang.org/blog/2013/04/distributed-numerical-optimization/
- https://julialang.org/blog/2017/09/gsoc-2017…
-
neural_machine_translation/rnn_search预测有时会报错(非必现),报错如下:
Traceback (most recent call last):
File "infer.py", line 137, in
infer()
File "infer.py", line 113, in infer
return_numpy=Fals…
-
This came up in #628, and I think it appeared for the first time for EMNLP 2019 because we pushed some simplifying changes to START that turned off LaTeX on their end.
Is it because `normalize_ant…
-
* arxiv(cs.cl) 2016
* PDF link: https://arxiv.org/pdf/1610.10099v1.pdf
* MT & Language Modeling, this paper introduced 'ByteNet', a character level dilated conv NN based encoder-decoder model, whic…
-
Following on the other issue I created #108 , I'm trying to teach an LSTM network to write a simple children's book. I'm getting odd behavior but really don't know what I'm doing to begin with. I'd lo…
-
## Description
RAdam
- RAdam is a new variant of Adam, by introducing a term to rectify the varianceof the adaptive learning rate
- Experimental results on language modeling and neural machine tra…
-
Hello,
Thank you very much for you tutorials.
I think I have an issue in my use case.
I use the seq2seq with attention model and I want to get embeddings of a sequence. I don't make translation …
-
For transformer-based neural machine translation (NMT), take English-Chinese for example, we pass English for encoder and use decoder input(Chinese) attend to encoder output, then final output.
Wh…
-
Please tell the detail of how to use your test sets to evaluate the MT model, thanks.
-
For transformer-based neural machine translation (NMT), take English-Chinese for example, we pass English for encoder and use decoder input(Chinese) attend to encoder output, then final output.
Wha…