tma15 paper-reading-list issues

tma15 / paper-reading-list

3 stars 0 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

#213 tma15 opened 11 months ago
0
Fine-tuning Language Models for Factuality

#212 tma15 opened 11 months ago
0
PROMPT ENGINEERING A PROMPT ENGINEER

#211 tma15 opened 11 months ago
0
Retrieving Skills from Job Descriptions: A Language Model Based Extreme Multi-label Classification Framework

#210 tma15 opened 12 months ago
0
Lost in the Middle: How Language Models Use Long Contexts

#209 tma15 opened 12 months ago
0
Infusing Context and Knowledge Awareness in Multi-turn Dialog Understanding

#208 tma15 opened 12 months ago
0
A Context-Aware Hierarchical BERT Fusion Network for Multi-turn Dialog Act Detection

#207 tma15 opened 1 year ago
0
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models

#206 tma15 opened 1 year ago
0
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models

#205 tma15 opened 1 year ago
0
Prompting with Pseudo-Code Instructions

#204 tma15 opened 1 year ago
0
Textbooks Are All You Need II: phi-1.5 technical report

#203 tma15 opened 1 year ago
0
EFFICIENT STREAMING LANGUAGE MODELS WITH ATTENTION SINKS

#202 tma15 opened 1 year ago
0
[KDD23] CADENCE: Offline Category Constrained and Diverse Query Generation for E-commerce Autosuggest

#201 tma15 opened 1 year ago
0
Out-of-Domain Intent Detection Considering Multi-turn Dialogue Contexts

#200 tma15 opened 1 year ago
0
Accelerating Large Language Model Decoding with Speculative Sampling

#199 tma15 opened 1 year ago
0
Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learning

#198 tma15 opened 1 year ago
0
Text Embeddings by Weakly-Supervised Contrastive Pre-training

#197 tma15 opened 1 year ago
0
Preference Ranking Optimization for Human Alignment

#196 tma15 opened 1 year ago
0
Retrieval-augmented Multi-label Text Classification

#195 tma15 opened 1 year ago
0
Surface-Based Retrieval Reduces Perplexity of Retrieval-Augmented Language Models

#194 tma15 opened 1 year ago
0
Long-range Language Modeling with Self-retrieval

#193 tma15 opened 1 year ago
0
Direct Preference Optimization: Your Language Model is Secretly a Reward Model

#192 tma15 opened 1 year ago
0
Orca: Progressive Learning from Complex Explanation Traces of GPT-4

#191 tma15 opened 1 year ago
0
Textbooks Are All You Need

#190 tma15 opened 1 year ago
0
Large Language Models in the Workplace: A Case Study on Prompt Engineering for Job Type Classification

#189 tma15 opened 1 year ago
0
CHATDB: AUGMENTING LLMS WITH DATABASES AS THEIR SYMBOLIC MEMORY

#188 tma15 opened 1 year ago
0
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models

#187 tma15 opened 1 year ago
0
Optimal Partial Transport based Sentence Selection for Long-form Document Matching

#186 tma15 opened 1 year ago
0
The Impact of Positional Encoding on Length Generalization in Transformers

#185 tma15 opened 1 year ago
0
Dropout Reduces Underfitting

#184 tma15 opened 1 year ago
0
How Does Generative Retrieval Scale to Millions of Passages?

#182 tma15 opened 1 year ago
0
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings

#181 tma15 opened 1 year ago
0
Text Classification via Large Language Models

#180 tma15 opened 1 year ago
0
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation

#179 tma15 opened 1 year ago
0
ResiDual: Transformer with Dual Residual Connections

#178 tma15 opened 1 year ago
0
Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks

#177 tma15 opened 1 year ago
0
Unlimiformer: Long-Range Transformers with Unlimited Length Input

#176 tma15 opened 1 year ago
0
SPACE-3: Unified Dialog Model Pre-training for Task-Oriented Dialog Understanding and Generation

#175 tma15 opened 1 year ago
0
ART: Automatic multi-step reasoning and tool-use for large language models

#174 tma15 opened 1 year ago
0
Text and Code Embeddings by Contrastive Pre-Training

#173 tma15 opened 1 year ago
0
GPT4Tools: Teaching LLM to Use Tools via Self-instruction

#172 tma15 opened 1 year ago
0
Augmented Language Models: a Survey

#171 tma15 opened 1 year ago
1
Scaling Transformer to 1M tokens and beyond with RMT

#170 tma15 opened 1 year ago
0
Why Do Better Loss Functions Lead to Less Transferable Features?

#169 tma15 opened 1 year ago
0
Sabiá: Portuguese Large Language Models

#168 tma15 opened 1 year ago
0
How to train your own Large Language Models

#167 tma15 opened 1 year ago
0
Mitigating Neural Network Overconfidence with Logit Normalization

#166 tma15 opened 1 year ago
0
NormSoftmax: Normalize the Input of Softmax to Accelerate and Stabilize Training

#165 tma15 opened 1 year ago
0
What’s in the RedPajama-Data-1T LLM training set

#164 tma15 opened 1 year ago
0
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention

#163 tma15 opened 1 year ago
0