richardbaihe
/
paperreading
NLP papers
MIT License
2
stars
0
forks
issues
Arxiv 2021 | Lifelong Pretraining: Continually Adapting Language Models to Emerging Corpora
#71
richardbaihe
closed
2 years ago
0
Arxiv 2021 | The Power of Scale for Parameter-Efficient Prompt Tuning
#70
richardbaihe
opened
3 years ago
0
ACL 2021 | Few-Shot Question Answering by Pretraining Span Selection
#69
richardbaihe
opened
3 years ago
0
Arxiv 2021 | SimCSE: Simple Contrastive Learning of Sentence Embeddings
#68
richardbaihe
closed
3 years ago
0
ICLR 2021 | Random feature attention
#67
richardbaihe
closed
3 years ago
0
ICLR 2021 | WHEN DO CURRICULA WORK?
#66
richardbaihe
closed
3 years ago
0
NIPS 2020 | Uncertainty-aware Self-training for Text Classification with Few Labels
#65
richardbaihe
closed
3 years ago
0
ICLR 2020 | WHY GRADIENT CLIPPING ACCELERATES TRAINING: A THEORETICAL JUSTIFICATION FOR ADAPTIVITY
#64
richardbaihe
closed
3 years ago
1
ACL 2018 | Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context
#63
richardbaihe
closed
3 years ago
1
ICLR 2021 | MASTERING ATARI WITH DISCRETE WORLD MODELS
#62
richardbaihe
closed
3 years ago
0
AAAI 2021 | Self-Attention Attribution: Interpreting Information Interactions Inside Transformer
#61
richardbaihe
closed
3 years ago
0
Arxiv 2021 | Pitfalls of Static Language Modelling
#60
richardbaihe
closed
3 years ago
0
EMNLP 2020 | Probing Pretrained Language Models for Lexical Semantics
#59
richardbaihe
closed
3 years ago
0
NIPS 2020 | The Depth-to-Width Interplay in Self-Attention
#58
richardbaihe
closed
3 years ago
0
EMNLP 2020 | Blank Language Models
#57
richardbaihe
closed
3 years ago
0
Arxiv 2021 | Shortformer: Better Language Modeling using Shorter Inputs
#56
richardbaihe
closed
3 years ago
0
EMNLP 2020 | SLM: Learning a Discourse Language Representation with Sentence Unshuffling
#54
richardbaihe
closed
3 years ago
0
KDD 2003 | Towards Parameter-Free Data Mining
#53
richardbaihe
closed
3 years ago
0
Arxiv 2020 | GEDI: GENERATIVE DISCRIMINATOR GUIDED SEQUENCE GENERATION
#50
richardbaihe
closed
3 years ago
0
openreview 2021 | SUPERVISED CONTRASTIVE LEARNING FOR PRE-TRAINED LANGUAGE MODEL FINE-TUNING
#49
richardbaihe
closed
3 years ago
0
EMNLP 2020 | An Unsupervised Sentence Embedding Method by Mutual Information Maximization
#48
richardbaihe
closed
3 years ago
0
Arxiv 2020 | It’s Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
#47
richardbaihe
closed
3 years ago
0
ICLR 2021 | UNIVERSAL SENTENCE REPRESENTATIONS LEARNING WITH CONDITIONAL MASKED LANGUAGE MODEL
#46
richardbaihe
closed
3 years ago
0
EMNLP 2020 | Cross-Thought for Sentence Encoder Pre-training
#45
richardbaihe
closed
3 years ago
0
Arxiv 2020 | DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations
#44
richardbaihe
closed
3 years ago
0
Arxiv 2020 | Language models are few-shot learners
#43
richardbaihe
closed
4 years ago
0
ICLR 2020 | A MUTUAL INFORMATION MAXIMIZATION PERSPECTIVE OF LANGUAGE REPRESENTATION LEARNING
#42
richardbaihe
closed
3 years ago
0
Arxiv 2020 | Progressive Generation of Long Text
#41
richardbaihe
closed
3 years ago
0
Arxiv 2020 | Pre-training via Paraphrasing
#39
richardbaihe
closed
4 years ago
1
ACL 2020 | Pretraining with Contrastive Sentence Objectives Improves Discourse Performance of Language Models
#38
richardbaihe
closed
4 years ago
1
Arxiv 2020 | DeBERTa: Decoding-enhanced BERT with Disentangled Attention
#37
richardbaihe
closed
4 years ago
0
Linformer: Self-Attention with Linear Complexity
#36
richardbaihe
closed
4 years ago
0
Arxiv 2020 | GMAT: Global Memory Augmentation for Transformers
#35
richardbaihe
closed
4 years ago
0
ACL 2019 | Hierarchical Transformers for Multi-Document Summarization
#34
richardbaihe
closed
4 years ago
0
Arxiv 2019 | SparseTransformer
#33
richardbaihe
closed
4 years ago
0
Arxiv 2019 | BP-Transformer: Modeling Long-Range Context via Binary Partitioning
#32
richardbaihe
closed
4 years ago
0
Arxiv 2020 | Longformer: The Long-Document Transformer
#31
richardbaihe
closed
4 years ago
1
ACL 2020 | Adaptive Attention Span in Transformers
#30
richardbaihe
closed
4 years ago
0
Arxiv 2020 | Efficient Content-Based Sparse Attention with Routing Transformers
#29
richardbaihe
closed
4 years ago
0
Encoder-Agnostic Adaptation for Conditional Language Generation
#28
richardbaihe
closed
3 years ago
0
DYNAMIC EVALUATION OF TRANSFORMER LANGUAGE MODELS
#27
richardbaihe
closed
4 years ago
1
A Simple Framework for Contrastive Learning of Visual Representations
#26
richardbaihe
closed
3 years ago
0
Consistency of a Recurrent Language Model With Respect to Incomplete Decoding
#25
Impavidity
closed
3 years ago
0
ICML 2019 | The Evolved Transformer
#24
richardbaihe
closed
4 years ago
1
Arxiv 2019 | Towards a Human-like Open-Domain Chatbot
#23
richardbaihe
closed
4 years ago
1
ICML 2020 | PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization
#22
richardbaihe
closed
4 years ago
1
ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training
#21
richardbaihe
closed
3 years ago
0
ICLR 2020 | Reformer: The Efficient Transformer
#20
richardbaihe
closed
4 years ago
1
NAACL 2019 | Text Generation with Exemplar-based Adaptive Decoding
#19
richardbaihe
closed
4 years ago
1
ICLR 2020 | ELECTRA: PRE-TRAINING TEXT ENCODERS AS DISCRIMINATORS RATHER THAN GENERATORS
#18
richardbaihe
closed
4 years ago
0