issues
search
tma15
/
paper-reading-list
3
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
#213
tma15
opened
11 months ago
0
Fine-tuning Language Models for Factuality
#212
tma15
opened
11 months ago
0
PROMPT ENGINEERING A PROMPT ENGINEER
#211
tma15
opened
11 months ago
0
Retrieving Skills from Job Descriptions: A Language Model Based Extreme Multi-label Classification Framework
#210
tma15
opened
12 months ago
0
Lost in the Middle: How Language Models Use Long Contexts
#209
tma15
opened
12 months ago
0
Infusing Context and Knowledge Awareness in Multi-turn Dialog Understanding
#208
tma15
opened
12 months ago
0
A Context-Aware Hierarchical BERT Fusion Network for Multi-turn Dialog Act Detection
#207
tma15
opened
1 year ago
0
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models
#206
tma15
opened
1 year ago
0
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models
#205
tma15
opened
1 year ago
0
Prompting with Pseudo-Code Instructions
#204
tma15
opened
1 year ago
0
Textbooks Are All You Need II: phi-1.5 technical report
#203
tma15
opened
1 year ago
0
EFFICIENT STREAMING LANGUAGE MODELS WITH ATTENTION SINKS
#202
tma15
opened
1 year ago
0
[KDD23] CADENCE: Offline Category Constrained and Diverse Query Generation for E-commerce Autosuggest
#201
tma15
opened
1 year ago
0
Out-of-Domain Intent Detection Considering Multi-turn Dialogue Contexts
#200
tma15
opened
1 year ago
0
Accelerating Large Language Model Decoding with Speculative Sampling
#199
tma15
opened
1 year ago
0
Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learning
#198
tma15
opened
1 year ago
0
Text Embeddings by Weakly-Supervised Contrastive Pre-training
#197
tma15
opened
1 year ago
0
Preference Ranking Optimization for Human Alignment
#196
tma15
opened
1 year ago
0
Retrieval-augmented Multi-label Text Classification
#195
tma15
opened
1 year ago
0
Surface-Based Retrieval Reduces Perplexity of Retrieval-Augmented Language Models
#194
tma15
opened
1 year ago
0
Long-range Language Modeling with Self-retrieval
#193
tma15
opened
1 year ago
0
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
#192
tma15
opened
1 year ago
0
Orca: Progressive Learning from Complex Explanation Traces of GPT-4
#191
tma15
opened
1 year ago
0
Textbooks Are All You Need
#190
tma15
opened
1 year ago
0
Large Language Models in the Workplace: A Case Study on Prompt Engineering for Job Type Classification
#189
tma15
opened
1 year ago
0
CHATDB: AUGMENTING LLMS WITH DATABASES AS THEIR SYMBOLIC MEMORY
#188
tma15
opened
1 year ago
0
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
#187
tma15
opened
1 year ago
0
Optimal Partial Transport based Sentence Selection for Long-form Document Matching
#186
tma15
opened
1 year ago
0
The Impact of Positional Encoding on Length Generalization in Transformers
#185
tma15
opened
1 year ago
0
Dropout Reduces Underfitting
#184
tma15
opened
1 year ago
0
How Does Generative Retrieval Scale to Millions of Passages?
#182
tma15
opened
1 year ago
0
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings
#181
tma15
opened
1 year ago
0
Text Classification via Large Language Models
#180
tma15
opened
1 year ago
0
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation
#179
tma15
opened
1 year ago
0
ResiDual: Transformer with Dual Residual Connections
#178
tma15
opened
1 year ago
0
Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks
#177
tma15
opened
1 year ago
0
Unlimiformer: Long-Range Transformers with Unlimited Length Input
#176
tma15
opened
1 year ago
0
SPACE-3: Unified Dialog Model Pre-training for Task-Oriented Dialog Understanding and Generation
#175
tma15
opened
1 year ago
0
ART: Automatic multi-step reasoning and tool-use for large language models
#174
tma15
opened
1 year ago
0
Text and Code Embeddings by Contrastive Pre-Training
#173
tma15
opened
1 year ago
0
GPT4Tools: Teaching LLM to Use Tools via Self-instruction
#172
tma15
opened
1 year ago
0
Augmented Language Models: a Survey
#171
tma15
opened
1 year ago
1
Scaling Transformer to 1M tokens and beyond with RMT
#170
tma15
opened
1 year ago
0
Why Do Better Loss Functions Lead to Less Transferable Features?
#169
tma15
opened
1 year ago
0
Sabiá: Portuguese Large Language Models
#168
tma15
opened
1 year ago
0
How to train your own Large Language Models
#167
tma15
opened
1 year ago
0
Mitigating Neural Network Overconfidence with Logit Normalization
#166
tma15
opened
1 year ago
0
NormSoftmax: Normalize the Input of Softmax to Accelerate and Stabilize Training
#165
tma15
opened
1 year ago
0
What’s in the RedPajama-Data-1T LLM training set
#164
tma15
opened
1 year ago
0
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
#163
tma15
opened
1 year ago
0
Next