-
Traceback (most recent call last):
File "", line 1, in
runfile('D:/nlp-tutorial/neural-machine-translation/nmt/train.py', wdir='D:/nlp-tutorial/neural-machine-translation/nmt')
File "C…
-
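## Introduction
The paper analyzes the calibration problem of machine translation at **inference** time: the probabilities the model assigns to its outputs do not match how often those outputs are actually correct. The evaluation metric is `ECE` (expected calibration error). The paper finds that the training `ECE` is far smaller than the inference `ECE`, which suggests that a lot of work is still needed to close the gap between training and inference.
The paper also analyzes several linguistic phenomena in `NMT`: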
`Frequency, Position, Fertility, Syntact…
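To make the `ECE` metric mentioned above concrete, here is a minimal sketch of the usual binned definition: predictions are grouped by confidence, and the gap between mean confidence and accuracy in each bin is averaged, weighted by bin size. The function name, the equal-width binning, and the assumption that a per-token correctness flag is already available are all illustrative choices, not details taken from the paper.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Binned ECE: sum over bins of (bin size / N) * |accuracy - mean confidence|.

    confidences: predicted probabilities in [0, 1], one per prediction.
    correct: 1 if the prediction was right, 0 otherwise (assumed given).
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    n = len(confidences)
    if n == 0:
        return 0.0

    # Per bin: [number of predictions, sum of confidences, number correct]
    bins = [[0, 0.0, 0] for _ in range(n_bins)]
    for p, c in zip(confidences, correct):
        idx = min(int(p * n_bins), n_bins - 1)  # p == 1.0 goes in the last bin
        bins[idx][0] += 1
        bins[idx][1] += p
        bins[idx][2] += c

    ece = 0.0
    for count, conf_sum, hits in bins:
        if count == 0:
            continue  # empty bins contribute nothing
        accuracy = hits / count
        mean_conf = conf_sum / count
        ece += (count / n) * abs(accuracy - mean_conf)
    return ece


if __name__ == "__main__":
    # Four predictions made with 0.9 confidence, three of them correct.
    print(expected_calibration_error([0.9, 0.9, 0.9, 0.9], [1, 1, 1, 0]))
```

A model that is over-confident at inference time would show up here as bins whose mean confidence exceeds their accuracy.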
-
https://arxiv.org/abs/1409.0473
https://www.youtube.com/watch?v=upskBSbA9cA&index=56&list=PLlMkM4tgfjnJhhd4wn5aj8fVTYJwIpWkS (reference)
* Honestly, I did not understand it well, so I looked it up in several places, as listed below.
-
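## Introduction
The paper applies Capsule Networks to modeling past-future information. It reads as fairly formulaic: after looking into how dynamic routing works and rereading the paper, it seems to basically apply the standard approach as-is. The writing is somewhat unclear. Training continues from a pre-trained baseline.
## Paper Information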
* Author: NJU
* [Paper](https://arxiv.org/pdf/1904.0…
-
# [Paper Review] Bahdanau Attention (2014) | woodong's log
Paper review of "Neural machine translation by jointly learning to align and translate" (2014). Bahdanau, D., Cho, K., & Bengio, Y. (2014). Neural machin…
-
## ❓ Questions and Help
#### What is your question?
For some English words, the model adds `.` to the end of the translation; for example: `ok`.
See the code which produces the following output:
`…
-
Hi, I saw that you added a dropout layer after the word embedding, which is not mentioned in the RNNsearch paper "Neural Machine Translation by Jointly Learning to Align and Translate". Does this trick improve some p…
-
https://arxiv.org/pdf/2104.08677.pdf
-
# BPE as input tokens of the transformer model
The transformer model proposed by "_Attention is all you need_" encodes the 4.5M sentence input data into a small vocabulary generated by learning sha…
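As a rough illustration of how a BPE vocabulary of the kind described above is built, the sketch below learns merge operations from a toy word-frequency table by repeatedly merging the most frequent adjacent symbol pair. The function name `learn_bpe`, the `</w>` end-of-word marker, and the tie-breaking rule are assumptions made for this example; it is not the tooling or configuration used by the transformer paper.

```python
from collections import Counter


def learn_bpe(word_freqs, num_merges):
    """Learn up to num_merges BPE merge operations from {word: frequency}."""
    # Start from characters, with an end-of-word marker on each word.
    vocab = {tuple(word) + ("</w>",): freq for word, freq in word_freqs.items()}
    merges = []
    for _ in range(num_merges):
        # Count adjacent symbol pairs, weighted by word frequency.
        pairs = Counter()
        for symbols, freq in vocab.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break  # every word is already a single symbol
        # Most frequent pair; ties broken by the pair itself (illustrative choice).
        best = max(pairs, key=lambda p: (pairs[p], p))
        merges.append(best)

        # Rewrite every word, replacing occurrences of the best pair.
        new_vocab = {}
        for symbols, freq in vocab.items():
            merged, i = [], 0
            while i < len(symbols):
                if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == best:
                    merged.append(symbols[i] + symbols[i + 1])
                    i += 2
                else:
                    merged.append(symbols[i])
                    i += 1
            key = tuple(merged)
            new_vocab[key] = new_vocab.get(key, 0) + freq
        vocab = new_vocab
    return merges


if __name__ == "__main__":
    print(learn_bpe({"abab": 1}, 1))
```

The number of merges controls the final vocabulary size, which is how a large corpus can be mapped onto a comparatively small set of subword tokens.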
-
I could not find the model.py file. Are the downloaded big_model and base_model meant to replace the models.py file? How should the big_model and base_model files be used?