issues
search
eagle705
/
presentation
presentation pdf collection
6
stars
1
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Scaling Laws for Fine-Grained Mixture of Experts
#38
eagle705
opened
1 month ago
0
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
#37
eagle705
opened
2 months ago
0
Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved with Text
#36
eagle705
opened
5 months ago
0
YaRN: Efficient Context Window Extension of Large Language Models
#35
eagle705
opened
7 months ago
0
개발자의 글쓰기 : Technical Writing #4
#34
eagle705
opened
10 months ago
0
NEFTune: Noisy Embeddings Improve Instruction Finetuning
#33
eagle705
opened
10 months ago
0
(LLaMA2 Long) Effective Long-Context Scaling of Foundation Models
#32
eagle705
opened
10 months ago
0
(Humpback) Self-Alignment with Instruction Backtranslation
#31
eagle705
opened
1 year ago
0
LongNet: Scaling Transformers to 1,000,000,000 Tokens
#30
eagle705
opened
1 year ago
0
PaLM 2 Technical Report
#29
eagle705
opened
1 year ago
0
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
#28
eagle705
opened
1 year ago
0
LLaMA: Open and Efficient Foundation Language Models
#27
eagle705
opened
1 year ago
0
(IA3) Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning
#26
eagle705
opened
1 year ago
0
Alpaca: A Strong Instruction-Following Model
#25
eagle705
opened
1 year ago
0
Toolformer: Language Models Can Teach Themselves to Use Tools
#24
eagle705
opened
1 year ago
0
SELF-INSTRUCT: Aligning Language Model with Self Generated Instructions
#23
eagle705
opened
1 year ago
0
(FLAN-T5) Scaling Instruction-Finetuned Language Models
#22
eagle705
opened
1 year ago
0
(FLAN) Finetuned Language Models Are Zero-Shot Learners
#21
eagle705
opened
1 year ago
0
(T0) Multitask Prompted Training Enables Zero-Shot Task Generalization
#20
eagle705
opened
1 year ago
0
How does GPT Obtain its Ability? Tracing Emergent Abilities of Language Models to their Sources
#19
eagle705
opened
1 year ago
0
(InstructGPT) Training language models to follow instructions with human feedback
#18
eagle705
opened
1 year ago
0
Training Compute-Optimal Large Language Models
#17
eagle705
opened
1 year ago
0
Robust Conversational Agents against Imperceptible Toxicity Triggers
#16
eagle705
opened
1 year ago
0
LMentry: A Language Model Benchmark of Elementary Language Tasks
#15
eagle705
opened
1 year ago
0
SOCIAL CHEMISTRY 101 - Learning to Reason about Social and Moral Norms
#14
eagle705
opened
1 year ago
0
A Contrastive Framework for Neural Text Generation
#13
eagle705
opened
1 year ago
0
Efficient Training of Language Models to Fill in the Middle (FIM)
#12
eagle705
opened
2 years ago
0
(ALiBi) TRAIN SHORT, TEST LONG: ATTENTION WITH LINEAR BIASES ENABLES INPUT LENGTH EXTRAPOLATION
#11
eagle705
opened
2 years ago
0
볼만한 논문 리스트
#10
eagle705
opened
2 years ago
1
COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining
#9
eagle705
opened
2 years ago
0
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
#8
eagle705
opened
2 years ago
0
RoBERTa: A Robustly Optimized BERT Pretraining Approach
#7
eagle705
opened
2 years ago
0
A Neural Network Solves and Generates Mathematics Problems by Program Synthesis: Calculus, Differential Equations, Linear Algebra, and More
#6
eagle705
opened
2 years ago
0
Knowledge Enhanced Contextual Word Representations
#5
eagle705
opened
2 years ago
0
CLINE: Contrastive Learning with Semantic Negative Examples for Natural Language Understanding
#4
eagle705
opened
2 years ago
0
GPT Understands, Too
#3
eagle705
opened
2 years ago
0
WARP: Word-level Adversarial ReProgramming
#2
eagle705
opened
2 years ago
0
Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training
#1
eagle705
opened
2 years ago
0