AkihikoWatanabe paper_notes issues

AkihikoWatanabe / paper_notes

Paper notes, added to occasionally

https://AkihikoWatanabe.github.io/paper_notes

20 stars 0 forks

issues

microsoft/orca-agentinstruct-1M-v1, Microsoft, 2024.11

#1521 AkihikoWatanabe opened 11 hours ago
0
Adaptive Decoding via Latent Preference Optimization, Shehzaad Dhuliawala+, arXiv'24

#1520 AkihikoWatanabe opened 1 day ago
0
Release Timeline of Local LLMs, npaka, updated as needed, 2024.11

#1519 AkihikoWatanabe opened 1 day ago
1
Accelerating Inference with TensorRT-LLM, Hiroshi Matsuda, NVIDIA AI Summit 2024

#1518 AkihikoWatanabe opened 2 days ago
2
A Large-Scale Study of Relevance Assessments with Large Language Models: An Initial Look, Shivani Upadhyay+, arXiv'24

#1517 AkihikoWatanabe opened 2 days ago
3
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding, Haolin Chen+, arXiv'24

#1516 AkihikoWatanabe opened 3 days ago
1
Bump rexml from 3.3.6 to 3.3.9

#1515 dependabot[bot] closed 3 days ago
0
LLM Prompt Tuning Playbook, 2024.11

#1514 AkihikoWatanabe opened 3 days ago
1
Artificial Intelligence, Scientific Discovery, and Product Innovation, Aidan Toner-Rodgers, MIT, 2024.11

#1513 AkihikoWatanabe opened 3 days ago
0
Scaling Laws for Precision, Tanishq Kumar+, arXiv'24

#1512 AkihikoWatanabe opened 3 days ago
1
Why Do You Grok? A Theoretical Analysis of Grokking Modular Addition, Mohamad Amin Mohamadi+, arXiv'24

#1511 AkihikoWatanabe opened 3 days ago
0
ALLoRA: Adaptive Learning Rate Mitigates LoRA Fatal Flaws, Hai Huang+, arXiv'24

#1510 AkihikoWatanabe opened 3 days ago
0
A Theoretical Understanding of Chain-of-Thought: Coherent Reasoning and Error-Aware Demonstration, Yingqian Cui+, arXiv'24

#1509 AkihikoWatanabe opened 3 days ago
1
Copilot Arena, CMU and UC Berkeley, 2024.11

#1508 AkihikoWatanabe opened 3 days ago
3
LBPE: Long-token-first Tokenization to Improve Large Language Models, Haoran Lian+, arXiv'24

#1507 AkihikoWatanabe opened 4 days ago
1
LLMs as Research Tools: A Large Scale Survey of Researchers' Usage and Perceptions, Zhehui Liao+, arXiv'24

#1506 AkihikoWatanabe opened 4 days ago
0
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models, Weixin Liang+, arXiv'24

#1505 AkihikoWatanabe opened 4 days ago
1
DELIFT: Data Efficient Language model Instruction Fine Tuning, Ishika Agarwal+, arXiv'24

#1504 AkihikoWatanabe opened 4 days ago
0
GUI Agents with Foundation Models: A Comprehensive Survey, Shuai Wang+, arXiv'24

#1503 AkihikoWatanabe opened 4 days ago
2
Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation, Xiwen Wei+, arXiv'24

#1502 AkihikoWatanabe opened 4 days ago
1
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters, Charlie Snell+, arXiv'24

#1501 AkihikoWatanabe opened 4 days ago
2
The Surprising Effectiveness of Test-Time Training for Abstract Reasoning, 2024.11

#1500 AkihikoWatanabe opened 5 days ago
0
Beyond Browsing: API-Based Web Agents, Yueqi Song+, arXiv'24

#1499 AkihikoWatanabe opened 5 days ago
2
Precise Zero-Shot Dense Retrieval without Relevance Labels, Luyu Gao+, arXiv'22

#1498 AkihikoWatanabe opened 5 days ago
0
HyQE: Ranking Contexts with Hypothetical Query Embeddings, Weichao Zhou+, arXiv'24

#1497 AkihikoWatanabe opened 5 days ago
2
Personalization of Large Language Models: A Survey, Zhehao Zhang+, arXiv'24

#1496 AkihikoWatanabe opened 6 days ago
0
Number Cookbook: Number Understanding of Language Models and How to Improve It, Haotong Yang+, arXiv'24

#1495 AkihikoWatanabe opened 1 week ago
2
sarashina2-8x70B, SBIntuitions, 2024.11

#1494 AkihikoWatanabe opened 1 week ago
3
The Fastest Access to Enterprise-Grade Cloud GPUs, Lambda

#1493 AkihikoWatanabe opened 1 week ago
4
LoRA vs Full Fine-tuning: An Illusion of Equivalence, Reece Shuttleworth+, arXiv'24

#1492 AkihikoWatanabe opened 1 week ago
3
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs, Sheng-Chieh Lin+, arXiv'24

#1491 AkihikoWatanabe opened 1 week ago
1
A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness, Fali Wang+, arXiv'24

#1490 AkihikoWatanabe opened 1 week ago
2
Self-Consistency Preference Optimization, Archiki Prasad+, arXiv'24

#1489 AkihikoWatanabe opened 1 week ago
5
A Summary of Information on Ways to Improve RAG (Repost), GENZITSU, 2023.10

#1488 AkihikoWatanabe opened 1 week ago
0
ZeRO: An Introduction to DeepSpeed, Retrieva, 2021.07

#1487 AkihikoWatanabe opened 1 week ago
8
Data Extraction Attacks in Retrieval-Augmented Generation via Backdoors, Yuefeng Peng+, arXiv'24

#1486 AkihikoWatanabe opened 1 week ago
4
Almost Real-Time!? A Blazing-Fast Transcription AI Specialized for Japanese: "kotoba-whisper-v2.0", 遼介大堀, 2024.11

#1485 AkihikoWatanabe opened 1 week ago
2
Beyond Accuracy: Evaluating the Reasoning Behavior of Large Language Models -- A Survey, Philipp Mondorf+, arXiv'24

#1484 AkihikoWatanabe opened 1 week ago
2
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent, Xingwu Sun+, arXiv'24

#1483 AkihikoWatanabe opened 1 week ago
1
ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate, Shohei Taniguchi+, NeurIPS'24

#1482 AkihikoWatanabe opened 1 week ago
2
Beyond Utility: Evaluating LLM as Recommender, Chumeng Jiang+, arXiv'24

#1481 AkihikoWatanabe opened 1 week ago
1
Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling, Yingfa Chen+, arXiv'24

#1480 AkihikoWatanabe opened 1 week ago
0
Lingua, Meta

#1479 AkihikoWatanabe opened 1 week ago
1
System Development Project Applications I, Sessions 5 and 6: Version Control with Git, 内田公太, 2020.01

#1478 AkihikoWatanabe opened 1 week ago
2
On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability, Kevin Wang+, N/A, arXiv'24, 2024.11

#1477 AkihikoWatanabe opened 2 weeks ago
1
Looking Inward: Language Models Can Learn About Themselves by Introspection, Felix J Binder+, N/A, arXiv'24, 2024.11

#1476 AkihikoWatanabe opened 2 weeks ago
2
Beyond Full Fine-tuning: Harnessing the Power of LoRA for Multi-Task Instruction Tuning, Xin+, LREC-COLING'24

#1475 AkihikoWatanabe opened 2 weeks ago
3
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks, Yizhong Wang+, N/A, EMNLP'22

#1474 AkihikoWatanabe opened 2 weeks ago
1
NEFTune: Noisy Embeddings Improve Instruction Finetuning, Neel Jain+, N/A, ICLR'24

#1473 AkihikoWatanabe opened 2 weeks ago
1
KTO: Model Alignment as Prospect Theoretic Optimization, Kawin Ethayarajh+, N/A, arXiv'24

#1472 AkihikoWatanabe opened 2 weeks ago
1