-
## What is this paper about? 👋
- BERT-style masked language models (MLMs) are useful not only for encoding text but also for generating it
- The parallel decoding used by CMLM also lets you directly control the trade-off between output quality and speed
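
To make the quality/speed trade-off concrete, here is a minimal sketch of iterative parallel decoding in the mask-predict style. It is an illustration under stated assumptions, not the paper's implementation: `mask_predict`, `make_toy_predictor`, the `<mask>` string, and the linear re-masking schedule are all hypothetical choices made here, and the "model" is a toy stand-in that always returns the reference token with a made-up confidence.

```python
from typing import Callable, Dict, List, Tuple

MASK = "<mask>"  # hypothetical mask symbol, chosen for this sketch

# A predictor maps the current token list to one (token, confidence) per position.
Predictor = Callable[[List[str]], List[Tuple[str, float]]]


def mask_predict(predict: Predictor, length: int, iterations: int) -> List[str]:
    """Fill all masked positions in parallel, then re-mask the least confident ones.

    More iterations mean more model calls (slower) but more chances to revise
    low-confidence tokens; iterations=1 is a single parallel pass.
    """
    tokens = [MASK] * length
    probs = [0.0] * length
    for t in range(iterations):
        preds = predict(tokens)
        # Parallel step: every masked position is filled from the same model call.
        for i, (tok, p) in enumerate(preds):
            if tokens[i] == MASK:
                tokens[i], probs[i] = tok, p
        # Assumed schedule: the number of re-masked tokens decays linearly to zero.
        n_mask = length * (iterations - t - 1) // iterations
        if n_mask == 0:
            break
        worst = sorted(range(length), key=lambda i: probs[i])[:n_mask]
        for i in worst:
            tokens[i] = MASK
            probs[i] = 0.0
    return tokens


def make_toy_predictor(target: List[str]) -> Tuple[Predictor, Dict[str, int]]:
    """Toy stand-in for a trained model; also counts how often it is called."""
    calls = {"n": 0}

    def predict(tokens: List[str]) -> List[Tuple[str, float]]:
        calls["n"] += 1
        visible = sum(tok != MASK for tok in tokens)
        out = []
        for i, gold in enumerate(target):
            # Made-up confidence: rises with visible context, varies by position.
            conf = (visible + 1) / (len(target) + 1) * (1.0 - 0.05 * i)
            out.append((gold, conf))
        return out

    return predict, calls
```

Calling `mask_predict(predict, length, iterations=1)` costs one model call, while a larger `iterations` value costs one call per iteration; that iteration count is the knob that trades speed against quality in this sketch.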
## Abstract (Summary) 🕵🏻♂️
…
-
The wrong PDF is linked on the ACL Anthology site for the paper "Taskmaster-1: Toward a Realistic and Diverse Dialog Dataset" by Byrne et al.
The current paper that is linked shows a different…
-
In the original paper you state that the filtered TREC QA dataset has 1,229, 65, and 68 questions. However, in the repo you use `*-filtered.jsonl` from [here](https://github.com/mcrisc/lexdecomp/tree/…
-
@weilinie,
Thank you for the implementation. Do you always observe the same behavior in the NLL_gen loss curve with different initializations? I am not observing the same behavior on another c…
-
Hi there,
I have this idea for a peer-to-peer network construction technique and a lookup function that would enhance content-addressing by mapping data to high-dimensional vectors instead of purel…
-
It would be nice if we got pubchairs to decide on a short name for each volume, which could then be used in papers etc. If there is agreement on this, we could ask Softconf to add that field to the ST…
-
[PrivacyQA_EMNLP-master.zip](https://github.com/AbhilashaRavichander/PrivacyQA_EMNLP/files/4109839/PrivacyQA_EMNLP-master.zip)
The files don't have any content after cloning or downloading. Attachi…
-
Please include our paper [Mixture Content Selection for Diverse Sequence Generation (EMNLP 2019)](https://www.semanticscholar.org/paper/Mixture-Content-Selection-for-Diverse-Sequence-Cho-Seo/8270fcc6c…
-
@williamSYSU I have a small question. I am trying to understand how you handle out-of-vocabulary tokens, since there may be a maximum cap on the vocabulary size when we work with another dataset. I can see that …
-
**Question**
I am trying to build an ALBERT-based model for a question generation task. Which downstream head should I use?
An example would be greatly appreciated.
**Additional context**
```
impo…