kweonwooj / papers

summary of ML papers I've read

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding #114

kweonwooj opened this issue 6 years ago (status: Open)

Abstract

Details

Pre-Training (Transfer Learning)

ELMo vs OpenAI GPT vs BERT

BERT

Input Representation

Pre-training BERT

Fine-Tuning in BERT

Dataset (tasks)

Fine-Tuning

Performance

Ablation Study

What contributed the most?

Effect on Model Size

Effect of Training Steps

Feature-based vs Fine-Tuning

Personal Thoughts

Link: https://arxiv.org/pdf/1810.04805.pdf
Authors: Devlin et al., 2018