AkihikoWatanabe paper_notes issues

AkihikoWatanabe / paper_notes

たまに追加される論文メモ

https://AkihikoWatanabe.github.io/paper_notes

20 stars 0 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Introducing quantized Llama models with increased speed and a reduced memory footprint, Meta, 2024.10

#1471 AkihikoWatanabe opened 3 weeks ago
0
Ilya Sutskever’s Top 30 Reading List

#1470 AkihikoWatanabe opened 3 weeks ago
0
Aya Expanse, Cohere, 2024.10

#1469 AkihikoWatanabe opened 3 weeks ago
1
Generative Reward Models, Dakota Mahan+, N/A, arXiv'24

#1468 AkihikoWatanabe opened 3 weeks ago
0
What Matters in Transformers? Not All Attention is Needed, Shwai He+, N/A, arXiv'24

#1467 AkihikoWatanabe opened 3 weeks ago
2
Differential Transformer, Tianzhu Ye+, N/A, arXiv'24

#1466 AkihikoWatanabe opened 3 weeks ago
11
nGPT: Normalized Transformer with Representation Learning on the Hypersphere, Ilya Loshchilov+, N/A, arXiv'24

#1465 AkihikoWatanabe opened 3 weeks ago
1
Self-Taught Evaluators, Tianlu Wang+, N/A, arXiv'24

#1464 AkihikoWatanabe opened 3 weeks ago
1
Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely, Siyun Zhao+, N/A, arXiv'24

#1463 AkihikoWatanabe opened 3 weeks ago
1
Prompt-Engineering-Guide, DAIR.AI

#1462 AkihikoWatanabe opened 3 weeks ago
1
Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation, Satyapriya Krishna+, N/A, arXiv'24

#1461 AkihikoWatanabe opened 3 weeks ago
1
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations, Hadas Orgad+, N/A, arXiv'24

#1460 AkihikoWatanabe opened 3 weeks ago
1
Addition is All You Need for Energy-efficient Language Models, Hongyin Luo+, N/A, arXiv'24

#1459 AkihikoWatanabe opened 3 weeks ago
0
ToolGen: Unified Tool Retrieval and Calling via Generation, Renxi Wang+, N/A, arXiv'24

#1458 AkihikoWatanabe opened 3 weeks ago
5
MLE-Bench, OpenAI, 2024.10

#1457 AkihikoWatanabe opened 3 weeks ago
1
Thinking LLMs: General Instruction Following with Thought Generation, Tianhao Wu+, N/A, arXiv'24

#1456 AkihikoWatanabe opened 4 weeks ago
1
Llama-3.1-Nemotron-70B-Instruct, Nvidia, 2024.10

#1454 AkihikoWatanabe opened 1 month ago
2
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation, Fabian Paischer+, N/A, arXiv'24

#1453 AkihikoWatanabe opened 1 month ago
1
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models, Iman Mirzadeh+, N/A, arXiv'24

#1452 AkihikoWatanabe opened 1 month ago
1
Overcoming catastrophic forgetting in neural networks, James Kirkpatrick+, N/A, arXiv'16

#1451 AkihikoWatanabe opened 1 month ago
1
Unsloth

#1450 AkihikoWatanabe opened 1 month ago
1
COSMO: A large-scale e-commerce common sense knowledge generation and serving system at Amazon , Yu+, SIGMOD/PODS '24

#1449 AkihikoWatanabe opened 1 month ago
3
SINC: Self-Supervised In-Context Learning for Vision-Language Tasks, Yi-Syuan Chen+, N/A, ICCV'23

#1448 AkihikoWatanabe opened 1 month ago
0
Streamlit, 2020.12

#1447 AkihikoWatanabe opened 1 month ago
1
What Does BERT Learn about the Structure of Language?, Jawahar+, ACL'19

#1446 AkihikoWatanabe opened 1 month ago
2
今日から始める大規模言語モデルのプロダクト活用, y_matsuwitter, 2024.10

#1445 AkihikoWatanabe opened 1 month ago
0
MovieGen, Meta, 2024.10

#1444 AkihikoWatanabe opened 1 month ago
0
Gemma-2-Baku, 2024.10

#1443 AkihikoWatanabe opened 1 month ago
0
textlesslib, FAIR, 2022.02

#1442 AkihikoWatanabe opened 1 month ago
1
Gemma-2-JPN, 2024.10

#1441 AkihikoWatanabe opened 1 month ago
1
AutoGen, Microsoft, 2024.10

#1440 AkihikoWatanabe opened 1 month ago
1
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning, Armen Aghajanyan+, N/A, ACL'21

#1439 AkihikoWatanabe opened 1 month ago
2
生成AIを活用したシステム開発の現状と展望 - 生成AI時代を見据えたシステム開発に向けて-, 株式会社日本総合研究所先端技術ラボ, 2024.09

#1438 AkihikoWatanabe opened 1 month ago
2
ECCV2024-Papers-with-Code, 2024.09

#1437 AkihikoWatanabe opened 1 month ago
3
非プロダクトマネージャーのためのプロダクトマネジメント入門, 神原淳史, 2024.09

#1436 AkihikoWatanabe opened 1 month ago
3
COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark, Koki Maeda+, N/A, ECCV'24

#1435 AkihikoWatanabe opened 1 month ago
1
What matters when building vision-language models?, Hugo Laurençon+, N/A, arXiv'24

#1434 AkihikoWatanabe opened 1 month ago
1
API設計まとめ, KNR109, 2024.02

#1433 AkihikoWatanabe opened 1 month ago
0
Long-CLIP: Unlocking the Long-Text Capability of CLIP, Beichen Zhang+, N/A, ECCV'24

#1432 AkihikoWatanabe opened 1 month ago
0
Evaluating the Effectiveness of LLM-Evaluators (aka LLM-as-Judge), 2024.09

#1431 AkihikoWatanabe opened 1 month ago
1
RAGの実装戦略まとめ, Jin Watanabe, 2024.03

#1430 AkihikoWatanabe opened 1 month ago
0
Logic-of-Thought: Injecting Logic into Contexts for Full Reasoning in Large Language Models, Tongxuan Liu+, N/A, arXiv'24

#1429 AkihikoWatanabe opened 1 month ago
1
NotebookLM, Google

#1428 AkihikoWatanabe opened 1 month ago
1
Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling, Hritik Bansal+, N/A, arXiv'24

#1427 AkihikoWatanabe opened 1 month ago
1
Molmo, AI2, 2024.09

#1426 AkihikoWatanabe opened 1 month ago
2
No Language Left Behind: Scaling Human-Centered Machine Translation, NLLB Team+, N/A, arXiv'22

#1425 AkihikoWatanabe opened 1 month ago
1
UL2: Unifying Language Learning Paradigms, Yi Tay+, N/A, arXiv'22

#1424 AkihikoWatanabe opened 1 month ago
0
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method, Biao Zhang+, N/A, ICLR'24

#1423 AkihikoWatanabe opened 1 month ago
1
Llama 3.2: Revolutionizing edge AI and vision with open, customizable models, Meta, 2024.09

#1422 AkihikoWatanabe opened 1 month ago
2
beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems, Vojtěch Vančura+, N/A, RecSys'24

#1421 AkihikoWatanabe opened 1 month ago
1

Previous Next