issues
search
AkihikoWatanabe
/
paper_notes
たまに追加される論文メモ
https://AkihikoWatanabe.github.io/paper_notes
20
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Introducing quantized Llama models with increased speed and a reduced memory footprint, Meta, 2024.10
#1471
AkihikoWatanabe
opened
3 weeks ago
0
Ilya Sutskever’s Top 30 Reading List
#1470
AkihikoWatanabe
opened
3 weeks ago
0
Aya Expanse, Cohere, 2024.10
#1469
AkihikoWatanabe
opened
3 weeks ago
1
Generative Reward Models, Dakota Mahan+, N/A, arXiv'24
#1468
AkihikoWatanabe
opened
3 weeks ago
0
What Matters in Transformers? Not All Attention is Needed, Shwai He+, N/A, arXiv'24
#1467
AkihikoWatanabe
opened
3 weeks ago
2
Differential Transformer, Tianzhu Ye+, N/A, arXiv'24
#1466
AkihikoWatanabe
opened
3 weeks ago
11
nGPT: Normalized Transformer with Representation Learning on the Hypersphere, Ilya Loshchilov+, N/A, arXiv'24
#1465
AkihikoWatanabe
opened
3 weeks ago
1
Self-Taught Evaluators, Tianlu Wang+, N/A, arXiv'24
#1464
AkihikoWatanabe
opened
3 weeks ago
1
Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely, Siyun Zhao+, N/A, arXiv'24
#1463
AkihikoWatanabe
opened
3 weeks ago
1
Prompt-Engineering-Guide, DAIR.AI
#1462
AkihikoWatanabe
opened
3 weeks ago
1
Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation, Satyapriya Krishna+, N/A, arXiv'24
#1461
AkihikoWatanabe
opened
3 weeks ago
1
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations, Hadas Orgad+, N/A, arXiv'24
#1460
AkihikoWatanabe
opened
3 weeks ago
1
Addition is All You Need for Energy-efficient Language Models, Hongyin Luo+, N/A, arXiv'24
#1459
AkihikoWatanabe
opened
3 weeks ago
0
ToolGen: Unified Tool Retrieval and Calling via Generation, Renxi Wang+, N/A, arXiv'24
#1458
AkihikoWatanabe
opened
3 weeks ago
5
MLE-Bench, OpenAI, 2024.10
#1457
AkihikoWatanabe
opened
3 weeks ago
1
Thinking LLMs: General Instruction Following with Thought Generation, Tianhao Wu+, N/A, arXiv'24
#1456
AkihikoWatanabe
opened
4 weeks ago
1
Llama-3.1-Nemotron-70B-Instruct, Nvidia, 2024.10
#1454
AkihikoWatanabe
opened
1 month ago
2
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation, Fabian Paischer+, N/A, arXiv'24
#1453
AkihikoWatanabe
opened
1 month ago
1
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models, Iman Mirzadeh+, N/A, arXiv'24
#1452
AkihikoWatanabe
opened
1 month ago
1
Overcoming catastrophic forgetting in neural networks, James Kirkpatrick+, N/A, arXiv'16
#1451
AkihikoWatanabe
opened
1 month ago
1
Unsloth
#1450
AkihikoWatanabe
opened
1 month ago
1
COSMO: A large-scale e-commerce common sense knowledge generation and serving system at Amazon , Yu+, SIGMOD/PODS '24
#1449
AkihikoWatanabe
opened
1 month ago
3
SINC: Self-Supervised In-Context Learning for Vision-Language Tasks, Yi-Syuan Chen+, N/A, ICCV'23
#1448
AkihikoWatanabe
opened
1 month ago
0
Streamlit, 2020.12
#1447
AkihikoWatanabe
opened
1 month ago
1
What Does BERT Learn about the Structure of Language?, Jawahar+, ACL'19
#1446
AkihikoWatanabe
opened
1 month ago
2
今日から始める大規模言語モデルのプロダクト活用, y_matsuwitter, 2024.10
#1445
AkihikoWatanabe
opened
1 month ago
0
MovieGen, Meta, 2024.10
#1444
AkihikoWatanabe
opened
1 month ago
0
Gemma-2-Baku, 2024.10
#1443
AkihikoWatanabe
opened
1 month ago
0
textlesslib, FAIR, 2022.02
#1442
AkihikoWatanabe
opened
1 month ago
1
Gemma-2-JPN, 2024.10
#1441
AkihikoWatanabe
opened
1 month ago
1
AutoGen, Microsoft, 2024.10
#1440
AkihikoWatanabe
opened
1 month ago
1
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning, Armen Aghajanyan+, N/A, ACL'21
#1439
AkihikoWatanabe
opened
1 month ago
2
生成AIを活用したシステム開発 の現状と展望 - 生成AI時代を見据えたシステム開発に向けて-, 株式会社日本総合研究所 先端技術ラボ, 2024.09
#1438
AkihikoWatanabe
opened
1 month ago
2
ECCV2024-Papers-with-Code, 2024.09
#1437
AkihikoWatanabe
opened
1 month ago
3
非プロダクトマネージャーのためのプロダクトマネジメント入門, 神原淳史, 2024.09
#1436
AkihikoWatanabe
opened
1 month ago
3
COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark, Koki Maeda+, N/A, ECCV'24
#1435
AkihikoWatanabe
opened
1 month ago
1
What matters when building vision-language models?, Hugo Laurençon+, N/A, arXiv'24
#1434
AkihikoWatanabe
opened
1 month ago
1
API設計まとめ, KNR109, 2024.02
#1433
AkihikoWatanabe
opened
1 month ago
0
Long-CLIP: Unlocking the Long-Text Capability of CLIP, Beichen Zhang+, N/A, ECCV'24
#1432
AkihikoWatanabe
opened
1 month ago
0
Evaluating the Effectiveness of LLM-Evaluators (aka LLM-as-Judge), 2024.09
#1431
AkihikoWatanabe
opened
1 month ago
1
RAGの実装戦略まとめ, Jin Watanabe, 2024.03
#1430
AkihikoWatanabe
opened
1 month ago
0
Logic-of-Thought: Injecting Logic into Contexts for Full Reasoning in Large Language Models, Tongxuan Liu+, N/A, arXiv'24
#1429
AkihikoWatanabe
opened
1 month ago
1
NotebookLM, Google
#1428
AkihikoWatanabe
opened
1 month ago
1
Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling, Hritik Bansal+, N/A, arXiv'24
#1427
AkihikoWatanabe
opened
1 month ago
1
Molmo, AI2, 2024.09
#1426
AkihikoWatanabe
opened
1 month ago
2
No Language Left Behind: Scaling Human-Centered Machine Translation, NLLB Team+, N/A, arXiv'22
#1425
AkihikoWatanabe
opened
1 month ago
1
UL2: Unifying Language Learning Paradigms, Yi Tay+, N/A, arXiv'22
#1424
AkihikoWatanabe
opened
1 month ago
0
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method, Biao Zhang+, N/A, ICLR'24
#1423
AkihikoWatanabe
opened
1 month ago
1
Llama 3.2: Revolutionizing edge AI and vision with open, customizable models, Meta, 2024.09
#1422
AkihikoWatanabe
opened
1 month ago
2
beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems, Vojtěch Vančura+, N/A, RecSys'24
#1421
AkihikoWatanabe
opened
1 month ago
1
Previous
Next