issues
search
AkihikoWatanabe
/
paper_notes
たまに追加される論文メモ
https://AkihikoWatanabe.github.io/paper_notes
20
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
microsoft/orca-agentinstruct-1M-v1, Microsoft, 2024.11
#1521
AkihikoWatanabe
opened
11 hours ago
0
Adaptive Decoding via Latent Preference Optimization, Shehzaad Dhuliawala+, arXiv'24
#1520
AkihikoWatanabe
opened
1 day ago
0
ローカルLLMのリリース年表, npaka, 随時更新, 2024.11
#1519
AkihikoWatanabe
opened
1 day ago
1
TensorRT-LLMによる推論高速化, Hiroshi Matsuda, NVIDIA AI Summit 2024
#1518
AkihikoWatanabe
opened
2 days ago
2
A Large-Scale Study of Relevance Assessments with Large Language Models: An Initial Look, Shivani Upadhyay+, arXiv'24
#1517
AkihikoWatanabe
opened
2 days ago
3
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding, Haolin Chen+, arXiv'24
#1516
AkihikoWatanabe
opened
3 days ago
1
Bump rexml from 3.3.6 to 3.3.9
#1515
dependabot[bot]
closed
3 days ago
0
LLM Prompt Tuning Playbook, 2024.11
#1514
AkihikoWatanabe
opened
3 days ago
1
Artificial Intelligence, Scientific Discovery, and Product Innovation, Aidan Toner-Rodgers, MIT, 2024.11
#1513
AkihikoWatanabe
opened
3 days ago
0
Scaling Laws for Precision, Tanishq Kumar+, arXiv'24
#1512
AkihikoWatanabe
opened
3 days ago
1
Why Do You Grok? A Theoretical Analysis of Grokking Modular Addition, Mohamad Amin Mohamadi+, arXiv'24
#1511
AkihikoWatanabe
opened
3 days ago
0
ALLoRA: Adaptive Learning Rate Mitigates LoRA Fatal Flaws, Hai Huang+, arXiv'24
#1510
AkihikoWatanabe
opened
3 days ago
0
A Theoretical Understanding of Chain-of-Thought: Coherent Reasoning and Error-Aware Demonstration, Yingqian Cui+, arXiv'24
#1509
AkihikoWatanabe
opened
3 days ago
1
Copilot Arena, CMU and UC Berkeley, 2024.11
#1508
AkihikoWatanabe
opened
3 days ago
3
LBPE: Long-token-first Tokenization to Improve Large Language Models, Haoran Lian+, arXiv'24
#1507
AkihikoWatanabe
opened
4 days ago
1
LLMs as Research Tools: A Large Scale Survey of Researchers' Usage and Perceptions, Zhehui Liao+, arXiv'24
#1506
AkihikoWatanabe
opened
4 days ago
0
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models, Weixin Liang+, arXiv'24
#1505
AkihikoWatanabe
opened
4 days ago
1
DELIFT: Data Efficient Language model Instruction Fine Tuning, Ishika Agarwal+, arXiv'24
#1504
AkihikoWatanabe
opened
4 days ago
0
GUI Agents with Foundation Models: A Comprehensive Survey, Shuai Wang+, arXiv'24
#1503
AkihikoWatanabe
opened
4 days ago
2
Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation, Xiwen Wei+, arXiv'24
#1502
AkihikoWatanabe
opened
4 days ago
1
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters, Charlie Snell+, arXiv'24
#1501
AkihikoWatanabe
opened
4 days ago
2
The Surprising Effectiveness of Test-Time Training for Abstract Reasoning, 2024.11
#1500
AkihikoWatanabe
opened
5 days ago
0
Beyond Browsing: API-Based Web Agents, Yueqi Song+, arXiv'24
#1499
AkihikoWatanabe
opened
5 days ago
2
Precise Zero-Shot Dense Retrieval without Relevance Labels, Luyu Gao+, arXiv'22
#1498
AkihikoWatanabe
opened
5 days ago
0
HyQE: Ranking Contexts with Hypothetical Query Embeddings, Weichao Zhou+, arXiv'24
#1497
AkihikoWatanabe
opened
5 days ago
2
Personalization of Large Language Models: A Survey, Zhehao Zhang+, arXiv'24
#1496
AkihikoWatanabe
opened
6 days ago
0
Number Cookbook: Number Understanding of Language Models and How to Improve It, Haotong Yang+, arXiv'24
#1495
AkihikoWatanabe
opened
1 week ago
2
sarashina2-8x70B, SBIntuitions, 2024.11
#1494
AkihikoWatanabe
opened
1 week ago
3
The Fastest Access to Enterprise-Grade Cloud GPUs, Lambda
#1493
AkihikoWatanabe
opened
1 week ago
4
LoRA vs Full Fine-tuning: An Illusion of Equivalence, Reece Shuttleworth+, arXiv'24
#1492
AkihikoWatanabe
opened
1 week ago
3
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs, Sheng-Chieh Lin+, arXiv'24
#1491
AkihikoWatanabe
opened
1 week ago
1
A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness, Fali Wang+, arXiv'24
#1490
AkihikoWatanabe
opened
1 week ago
2
Self-Consistency Preference Optimization, Archiki Prasad+, arXiv'24
#1489
AkihikoWatanabe
opened
1 week ago
5
RAGの改善方法に関する情報のまとめ(再掲), GENZITSU, 2023.10
#1488
AkihikoWatanabe
opened
1 week ago
0
ZeRO: DeepSpeedの紹介, レトリバ, 2021.07
#1487
AkihikoWatanabe
opened
1 week ago
8
Data Extraction Attacks in Retrieval-Augmented Generation via Backdoors, Yuefeng Peng+, arXiv'24
#1486
AkihikoWatanabe
opened
1 week ago
4
ほぼリアルタイム!?爆速で動作する日本語特化の文字起こしAI!『kotoba-whisper-v2.0』, 遼介 大堀, 2024.11
#1485
AkihikoWatanabe
opened
1 week ago
2
Beyond Accuracy: Evaluating the Reasoning Behavior of Large Language Models -- A Survey, Philipp Mondorf+, arXiv'24
#1484
AkihikoWatanabe
opened
1 week ago
2
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent, Xingwu Sun+, arXiv'24
#1483
AkihikoWatanabe
opened
1 week ago
1
ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate, Shohei Taniguchi+, NeurIPS'24
#1482
AkihikoWatanabe
opened
1 week ago
2
Beyond Utility: Evaluating LLM as Recommender, Chumeng Jiang+, arXiv'24
#1481
AkihikoWatanabe
opened
1 week ago
1
Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling, Yingfa Chen+, arXiv'24
#1480
AkihikoWatanabe
opened
1 week ago
0
Lingua, Meta
#1479
AkihikoWatanabe
opened
1 week ago
1
システム開発プロジェクト応用第一 第5,6回 Gitによるバージョン管理, 内田公太, 2020.01
#1478
AkihikoWatanabe
opened
1 week ago
2
On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability, Kevin Wang+, N/A, arXiv'24, 2024.11
#1477
AkihikoWatanabe
opened
2 weeks ago
1
Looking Inward: Language Models Can Learn About Themselves by Introspection, Felix J Binder+, N/A, arXiv'24, 2024.11
#1476
AkihikoWatanabe
opened
2 weeks ago
2
Beyond Full Fine-tuning: Harnessing the Power of LoRA for Multi-Task Instruction Tuning, Xin+, LREC-COLING'24
#1475
AkihikoWatanabe
opened
2 weeks ago
3
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks, Yizhong Wang+, N/A, EMNLP'22
#1474
AkihikoWatanabe
opened
2 weeks ago
1
NEFTune: Noisy Embeddings Improve Instruction Finetuning, Neel Jain+, N/A, ICLR'24
#1473
AkihikoWatanabe
opened
2 weeks ago
1
KTO: Model Alignment as Prospect Theoretic Optimization, Kawin Ethayarajh+, N/A, arXiv'24
#1472
AkihikoWatanabe
opened
2 weeks ago
1
Next