-
- Functorch: memory blowup due to `vmap`
- ASDL/`asdfghjkl`: can't backprop through the Jacobians, so it can't be used for continuous BO
- BackPACK: requires an inflexible extension mechanism
We need a Jacobian …
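One candidate that avoids the "can't backprop through the Jacobians" limitation is PyTorch's `torch.func` transforms, which compose with each other, so a gradient can be taken through a Jacobian-based objective. A minimal sketch — the toy model `f` and the squared-Frobenius objective are illustrative assumptions, not the project's actual code:

```python
import torch
from torch.func import jacrev, grad

# Toy "network": a single tanh layer (illustrative assumption).
def f(x, w):
    return torch.tanh(w @ x)

x = torch.randn(4)
w = torch.randn(3, 4)

# Scalar objective defined on the Jacobian of f w.r.t. the parameters w.
def jac_norm(w):
    J = jacrev(f, argnums=1)(x, w)  # shape (3, 3, 4): output dim x w.shape
    return (J ** 2).sum()

# grad composes with jacrev, i.e. we can backprop *through* the Jacobian,
# which is what a continuous-BO objective built on Jacobians would require.
g = grad(jac_norm)(w)
print(g.shape)  # torch.Size([3, 4])
```

Whether this sidesteps the `vmap` memory blowup depends on the Jacobian's size; for large outputs a `vjp`/`jvp`-based formulation may still be needed.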
-
Hello, every time I try the tutorials I run into the following problem. Is there any solution? Thanks for your guidance. The version is Fatellm2.1.
[INFO] [2024-05-02 03:36:43,059] [202405020336083952670] [52:140389839259392] - [base_saver.execute_update] [line:223]: UPDATE "t_task" SET …
-
**What problem or use case are you trying to solve?**
We are trying to reduce the costs associated with using Large Language Models (LLMs) in the OpenDevin project. This involves optimizing the usa…
-
## Better integration of LLM kernel and OS kernel
- [ ] Translate the current implementation into a more efficient one (while staying cross-platform)
- [ ] Multi-thread/Multi-proce…
-
This document outlines the long-term features in the AIOS roadmap for Q3 2024. Feel free to discuss any of the following topics, and add any other topics you'd like to talk about in this issue.
## …
-
# URL
- https://arxiv.org/abs/2402.13598
# Affiliations
- Lin Ning, N/A
- Luyang Liu, N/A
- Jiaxing Wu, N/A
- Neo Wu, N/A
- Devora Berlowitz, N/A
- Sushant Prakash, N/A
- Bradley Green, …
-
I want to perform inference on quantized LLAMA (W8A16) on ARM-v9 (with SVE) using oneDNN. The LLAMA weights are per-group quantized.
Based on my understanding, I need to prepack the weights to redu…
-
I'm wondering if I can get an easier pipeline by loading the awq weights with vllm:
```
from vllm import LLM, SamplingParams

prompts = [
    "Hello, my name is",
    "The president of the Uni…
-
### Required prerequisites
- [X] I have searched the [Issue Tracker](https://github.com/camel-ai/camel/issues) and [Discussions](https://github.com/camel-ai/camel/discussions) that this hasn't alre…
-
### Describe the feature
Hi InternLM Team,
Thanks for your great work and the powerful InternLM2.5 models. I'm currently conducting research on efficient long-context LLM inference, [MInference](…