issues
search
Dahoas
/
QDSyntheticData
13
stars
16
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Create 2410.04715_Rule-based_Data_Selection.md
#366
lauraaisling
opened
1 week ago
0
Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity
#365
Dahoas
opened
2 weeks ago
0
Diversity-Rewarded CFG Distillation
#364
Dahoas
opened
2 weeks ago
0
Rule-based Data Selection for Large Language Models
#363
Dahoas
opened
2 weeks ago
0
$\textbf{Only-IF}$:Revealing the Decisive Effect of Instruction Diversity on Generalization
#362
Dahoas
opened
2 weeks ago
0
Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models
#361
Dahoas
opened
2 weeks ago
1
On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey
#360
Dahoas
opened
3 weeks ago
0
Improving Pretraining Data Using Perplexity Correlations
#359
kanishkg
opened
1 month ago
0
RL with KL penalties is better viewed as Bayesian inference
#358
Dahoas
opened
1 month ago
0
QDC mermaid diagrams
#357
baberabb
opened
2 months ago
0
Create 2404.14219_phi3
#356
lauraaisling
opened
2 months ago
0
phi3 paper (for completeness of phi series)
#355
lauraaisling
opened
2 months ago
0
phi1
#354
lauraaisling
opened
2 months ago
0
Create phi_2.md
#353
lauraaisling
opened
2 months ago
0
2408.03314 paper summary added
#352
ShayekhBinIslam
opened
2 months ago
0
Create 2309.05463_phi1.5.md
#351
lauraaisling
opened
2 months ago
0
Create 2306.11644_phi1.md
#350
lauraaisling
closed
2 months ago
0
Llama 3.1 paper
#349
Dahoas
opened
2 months ago
0
Orca: Progressive learning from complex explanation traces of gpt- 4
#348
Dahoas
opened
2 months ago
0
Read some easy to hard papers to better understand complexity
#347
Dahoas
opened
2 months ago
0
Make list of key papers
#346
Dahoas
opened
2 months ago
0
Pre-training case study
#345
Dahoas
opened
2 months ago
1
case study for code
#344
Dahoas
opened
2 months ago
0
Case study for reasoning/math
#343
Dahoas
opened
2 months ago
0
Add domain specific section for RLHF/Instruction tuning
#342
Dahoas
opened
2 months ago
0
Make table of contents diagram
#341
Dahoas
opened
2 months ago
0
Read phi and add to paper
#340
Dahoas
opened
2 months ago
1
Adding diagrams for quality, diversity, complexity taxonomy
#339
Dahoas
opened
2 months ago
0
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
#338
Dahoas
opened
2 months ago
1
Self-Taught Evaluators
#337
Dahoas
opened
2 months ago
0
Create Meta-Rewarding_2407.19594.md
#336
lauraaisling
closed
2 months ago
0
Meta-Rewarding Language Models
#335
lauraaisling
opened
2 months ago
0
Closes #301
#334
srishti-git1110
closed
2 months ago
0
docs: fix/add broken links
#333
nlile
opened
2 months ago
0
Polish final draft
#332
Dahoas
opened
2 months ago
1
Solicit feedback
#331
Dahoas
opened
2 months ago
1
Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models
#330
mmhamdy
opened
2 months ago
1
Add summary for RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold
#329
mmhamdy
closed
2 months ago
0
ead paper summary
#328
veratr86
closed
2 months ago
0
Rethinking and Refining the Distinct Metric
#327
veratr86
opened
3 months ago
1
Create 2305.10601_Tree_of_thought.md
#326
lauraaisling
closed
2 months ago
0
Add summary for Simple synthetic data reduces sycophancy in large language models
#325
mmhamdy
closed
2 months ago
0
Simple synthetic data reduces sycophancy in large language models
#324
mmhamdy
closed
2 months ago
0
adding evo prompt summary
#323
veratr86
closed
2 months ago
0
tinyBenchmarks: evaluating LLMs with fewer examples
#322
kanishkg
opened
3 months ago
0
What makes a good data for alignment / Automatic instruction evolving
#321
Dahoas
closed
3 months ago
0
Automatic Instruction Evolving for Large Language Models
#320
Dahoas
opened
3 months ago
1
AgentInstruct: Toward Generative Teaching with Agentic Flows
#319
veratr86
closed
3 months ago
1
Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena
#318
mmhamdy
opened
3 months ago
0
Issue 116
#317
lauraaisling
closed
3 months ago
0
Next