Open shure-dev opened 7 months ago
Text2Motion: From Natural Language Instructions to Feasible Plans https://arxiv.org/abs/2303.12153
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation https://arxiv.org/abs/2401.04092
Learning to Compress Prompts with Gist Tokens https://arxiv.org/abs/2304.08467
Large Language Models as Tool Makers https://arxiv.org/abs/2305.17126
Generative Agents: Interactive Simulacra of Human Behavior
Instruction-tuning Aligns LLMs to the Human Brain
LLM-BRAIn: AI-driven Fast Generation of Robot Behaviour Tree based on Large Language Model
Divergences between Language Models and Human Brains
LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination
DSPy
LangGraph
RoCo: Dialectic Multi-Robot Collaboration with Large Language Models
RAG (retrieval augmented generation)
Embedchain: The Open Source RAG Framework
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
How the Segment Anything Model is applied and used in many cases
Focusing on perception
CogAgent: A Visual Language Model for GUI Agents
Self-Discover: Large Language Models Self-Compose Reasoning Structures
Large Language Model (LLM) as a System of Multiple Expert Agents: An Approach to solve the Abstraction and Reasoning Corpus (ARC) Challenge
Language Action Model
An Interactive Agent Foundation Model https://arxiv.org/abs/2402.05929
Predictive Minds: LLMs As Atypical Active Inference Agents
Thinking for Doing
InfiAgent: A Multi-Tool Agent for AI Operating Systems
MindAgent: Emergent Gaming Interaction
This issue tracks papers that will be added to this repo in the future.