issues
search
AkihikoWatanabe
/
paper_notes
たまに追加される論文メモ
https://AkihikoWatanabe.github.io/paper_notes
17
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Molmo, AI2, 2024.09
#1426
AkihikoWatanabe
opened
1 day ago
2
No Language Left Behind: Scaling Human-Centered Machine Translation, NLLB Team+, N/A, arXiv'22
#1425
AkihikoWatanabe
opened
2 days ago
1
UL2: Unifying Language Learning Paradigms, Yi Tay+, N/A, arXiv'22
#1424
AkihikoWatanabe
opened
2 days ago
0
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method, Biao Zhang+, N/A, arXiv'24
#1423
AkihikoWatanabe
opened
2 days ago
1
Llama 3.2: Revolutionizing edge AI and vision with open, customizable models, Meta, 2024.09
#1422
AkihikoWatanabe
opened
3 days ago
2
beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems, Vojtěch Vančura+, N/A, RecSys'24
#1421
AkihikoWatanabe
opened
3 days ago
1
Enhancing Performance and Scalability of Large-Scale Recommendation Systems with Jagged Flash Attention, Rengan Xu+, N/A, arXiv'24
#1420
AkihikoWatanabe
opened
3 days ago
0
LLMの効率化・高速化を支えるアルゴリズム, Tatsuya Urabe, 2024.09
#1419
AkihikoWatanabe
opened
3 days ago
0
LLM-jp-3 1.8B・3.7B・13B の公開, LLM.jp, 2024.09
#1418
AkihikoWatanabe
opened
3 days ago
3
LLM-jp Corpus v3, LLM.jp, 2024.09
#1417
AkihikoWatanabe
opened
3 days ago
1
Bump webrick from 1.8.1 to 1.8.2
#1416
dependabot[bot]
closed
3 days ago
0
NLP Experimental Design, Graham Neubig, 2024
#1415
AkihikoWatanabe
opened
3 days ago
0
Improving Language Understanding by Generative Pre-Training, OpenAI, 2018
#1414
AkihikoWatanabe
opened
3 days ago
2
Finetuned Language Models Are Zero-Shot Learners, Jason Wei+, N/A, ICLR'22
#1413
AkihikoWatanabe
opened
3 days ago
1
Direct Preference Optimization: Your Language Model is Secretly a Reward Model, Rafael Rafailov+, N/A, arXiv'23
#1412
AkihikoWatanabe
opened
3 days ago
1
Recommendation with Generative Models, Yashar Deldjoo+, N/A, arXiv'24
#1411
AkihikoWatanabe
opened
4 days ago
1
Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024, Hossein A. Rahmani+, N/A, arXiv'24
#1410
AkihikoWatanabe
opened
4 days ago
1
Don't Use LLMs to Make Relevance Judgments, Ian Soboroff, N/A, arXiv'24
#1409
AkihikoWatanabe
opened
4 days ago
1
Backtracking Improves Generation Safety, Yiming Zhang+, N/A, arXiv'24
#1408
AkihikoWatanabe
opened
4 days ago
1
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models, Yixiao Li+, N/A, arXiv'23
#1407
AkihikoWatanabe
opened
4 days ago
0
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning, Zayne Sprague+, N/A, arXiv'24
#1406
AkihikoWatanabe
opened
4 days ago
1
Bump nokogiri from 1.15.4 to 1.16.5
#1405
dependabot[bot]
closed
5 days ago
0
Bump rexml from 3.2.6 to 3.3.6
#1404
dependabot[bot]
closed
5 days ago
0
Calibrated Recommendation, Herald Steck, Netflix, RecSys'18
#1403
AkihikoWatanabe
opened
1 week ago
1
Bump google-protobuf from 3.24.4 to 3.25.5
#1402
dependabot[bot]
closed
1 week ago
0
Instruction Tuning with GPT-4, Baolin Peng+, N/A, arXiv'23
#1401
AkihikoWatanabe
opened
1 week ago
1
PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning, Zhihan Zhang+, N/A, ACL'24
#1400
AkihikoWatanabe
opened
1 week ago
1
Sohu, etched, 2024.06
#1399
AkihikoWatanabe
opened
1 week ago
1
When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs, Ryo Kamoi+, N/A, arXiv'24
#1398
AkihikoWatanabe
opened
1 week ago
4
STaR: Bootstrapping Reasoning With Reasoning, Eric Zelikman+, N/A, arXiv'22
#1397
AkihikoWatanabe
opened
1 week ago
1
クリックを最大化しない推薦システム, Ryoma Sato, 2024.01
#1396
AkihikoWatanabe
opened
1 week ago
2
mise-en-place
#1395
AkihikoWatanabe
opened
1 week ago
1
Leveraging User-Generated Reviews for Recommender Systems with Dynamic Headers, Shanu Vashishtha+, N/A, arXiv'24
#1394
AkihikoWatanabe
opened
2 weeks ago
1
Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources, Alisia Lupidi+, N/A, arXiv'24
#1393
AkihikoWatanabe
opened
2 weeks ago
2
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning, Zhiheng Xi+, N/A, arXiv'24
#1392
AkihikoWatanabe
opened
2 weeks ago
0
ReFT: Reasoning with Reinforced Fine-Tuning, Trung Quoc Luong+, N/A, arXiv'24
#1391
AkihikoWatanabe
opened
2 weeks ago
0
OpenAI o1, 2024.09
#1390
AkihikoWatanabe
opened
2 weeks ago
7
Pluggyとは
#1389
AkihikoWatanabe
opened
2 weeks ago
1
Generative Verifiers: Reward Modeling as Next-Token Prediction, Lunjun Zhang+, N/A, arXiv'24
#1388
AkihikoWatanabe
opened
2 weeks ago
1
PaperQA2
#1387
AkihikoWatanabe
opened
2 weeks ago
1
From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models, Sean Welleck+, N/A, arXiv'24
#1386
AkihikoWatanabe
opened
2 weeks ago
2
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers, Chenglei Si+, N/A, arXiv'24
#1385
AkihikoWatanabe
opened
2 weeks ago
1
A few prompt engineering tips that Ilya Sutskever picked up at OpenAI, Ilya Sutskever, 2024.09
#1384
AkihikoWatanabe
opened
2 weeks ago
0
Late Chunking: Balancing Precision and Cost in Long Context Retrieval, Pierse+, 2024.09
#1383
AkihikoWatanabe
opened
2 weeks ago
1
Large Language Models Cannot Self-Correct Reasoning Yet, Jie Huang+, N/A, arXiv'23
#1382
AkihikoWatanabe
opened
3 weeks ago
0
A Survey on Human Preference Learning for Large Language Models, Ruili Jiang+, N/A, arXiv'24
#1381
AkihikoWatanabe
opened
3 weeks ago
0
Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning, Ming Li+, N/A, arXiv'23
#1380
AkihikoWatanabe
opened
3 weeks ago
1
ml-engineering
#1379
AkihikoWatanabe
opened
3 weeks ago
1
Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies, Liangming Pan+, N/A, arXiv'23
#1378
AkihikoWatanabe
opened
3 weeks ago
1
Self-Reflection in LLM Agents: Effects on Problem-Solving Performance, Matthew Renze+, N/A, arXiv'24
#1377
AkihikoWatanabe
opened
3 weeks ago
0
Next