Dahoas QDSyntheticData issues

Dahoas / QDSyntheticData

13 stars 16 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Create 2410.04715_Rule-based_Data_Selection.md

#366 lauraaisling opened 1 week ago
0
Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity

#365 Dahoas opened 2 weeks ago
0
Diversity-Rewarded CFG Distillation

#364 Dahoas opened 2 weeks ago
0
Rule-based Data Selection for Large Language Models

#363 Dahoas opened 2 weeks ago
0
$\textbf{Only-IF}$:Revealing the Decisive Effect of Instruction Diversity on Generalization

#362 Dahoas opened 2 weeks ago
0
Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models

#361 Dahoas opened 2 weeks ago
1
On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey

#360 Dahoas opened 3 weeks ago
0
Improving Pretraining Data Using Perplexity Correlations

#359 kanishkg opened 1 month ago
0
RL with KL penalties is better viewed as Bayesian inference

#358 Dahoas opened 1 month ago
0
QDC mermaid diagrams

#357 baberabb opened 2 months ago
0
Create 2404.14219_phi3

#356 lauraaisling opened 2 months ago
0
phi3 paper (for completeness of phi series)

#355 lauraaisling opened 2 months ago
0
phi1

#354 lauraaisling opened 2 months ago
0
Create phi_2.md

#353 lauraaisling opened 2 months ago
0
2408.03314 paper summary added

#352 ShayekhBinIslam opened 2 months ago
0
Create 2309.05463_phi1.5.md

#351 lauraaisling opened 2 months ago
0
Create 2306.11644_phi1.md

#350 lauraaisling closed 2 months ago
0
Llama 3.1 paper

#349 Dahoas opened 2 months ago
0
Orca: Progressive learning from complex explanation traces of gpt- 4

#348 Dahoas opened 2 months ago
0
Read some easy to hard papers to better understand complexity

#347 Dahoas opened 2 months ago
0
Make list of key papers

#346 Dahoas opened 2 months ago
0
Pre-training case study

#345 Dahoas opened 2 months ago
1
case study for code

#344 Dahoas opened 2 months ago
0
Case study for reasoning/math

#343 Dahoas opened 2 months ago
0
Add domain specific section for RLHF/Instruction tuning

#342 Dahoas opened 2 months ago
0
Make table of contents diagram

#341 Dahoas opened 2 months ago
0
Read phi and add to paper

#340 Dahoas opened 2 months ago
1
Adding diagrams for quality, diversity, complexity taxonomy

#339 Dahoas opened 2 months ago
0
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

#338 Dahoas opened 2 months ago
1
Self-Taught Evaluators

#337 Dahoas opened 2 months ago
0
Create Meta-Rewarding_2407.19594.md

#336 lauraaisling closed 2 months ago
0
Meta-Rewarding Language Models

#335 lauraaisling opened 2 months ago
0
Closes #301

#334 srishti-git1110 closed 2 months ago
0
docs: fix/add broken links

#333 nlile opened 2 months ago
0
Polish final draft

#332 Dahoas opened 2 months ago
1
Solicit feedback

#331 Dahoas opened 2 months ago
1
Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models

#330 mmhamdy opened 2 months ago
1
Add summary for RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold

#329 mmhamdy closed 2 months ago
0
ead paper summary

#328 veratr86 closed 2 months ago
0
Rethinking and Refining the Distinct Metric

#327 veratr86 opened 3 months ago
1
Create 2305.10601_Tree_of_thought.md

#326 lauraaisling closed 2 months ago
0
Add summary for Simple synthetic data reduces sycophancy in large language models

#325 mmhamdy closed 2 months ago
0
Simple synthetic data reduces sycophancy in large language models

#324 mmhamdy closed 2 months ago
0
adding evo prompt summary

#323 veratr86 closed 2 months ago
0
tinyBenchmarks: evaluating LLMs with fewer examples

#322 kanishkg opened 3 months ago
0
What makes a good data for alignment / Automatic instruction evolving

#321 Dahoas closed 3 months ago
0
Automatic Instruction Evolving for Large Language Models

#320 Dahoas opened 3 months ago
1
AgentInstruct: Toward Generative Teaching with Agentic Flows

#319 veratr86 closed 3 months ago
1
Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena

#318 mmhamdy opened 3 months ago
0
Issue 116

#317 lauraaisling closed 3 months ago
0