-
### Summary
### Motivation
The WasmEdge Runtime is poised to offer robust inference support for AI models and Large
Language Models (LLMs) such as llama3 and phi-3-mini. Recognizing the critical …
-
Hello,
I’ve been running some tests with the `nano_llm.vision.video` module and live camera streaming on an AGX Orin 64GB, with the following parameters:
--model Efficient-Large-Model/VI…
-
[Improved text ranking with few shot prompting](https://blog.vespa.ai/improving-text-ranking-with-few-shot-prompting/)
- This blog post discusses using large language models (LLMs) to generate labe…
-
Usually, these sorts of evaluations are run on large datasets of Q&A interactions. DeepEval's interface, however, is implemented in a way that calls to the LLM evaluator agents are made sequentially and…
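One common remedy for this kind of sequential bottleneck is to issue the evaluator calls concurrently, since each call mostly waits on network I/O to the LLM API. A minimal sketch, assuming a generic `evaluate_case` stand-in for the real evaluator call (this is not DeepEval's actual API):

```python
from concurrent.futures import ThreadPoolExecutor

def evaluate_case(case):
    # Stand-in for a single LLM evaluator call (hypothetical);
    # in practice this would block on an HTTP request to the model.
    return {"case": case, "score": 1.0}

def evaluate_all(cases, max_workers=8):
    # Issue evaluator calls concurrently instead of one at a time.
    # Threads suit this workload because each call is I/O-bound.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(evaluate_case, cases))
```

With 8 workers, a batch of N calls takes roughly N/8 round-trips instead of N, subject to the provider's rate limits.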
-
**Is your feature request related to a problem? Please describe.**
I am often frustrated by the limitation of being able to use only a single QueryTransformer at a time. This constraint makes it ch…
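A lightweight way to lift the single-transformer limit is a composite that chains several transformers, feeding each one the previous one's output. A hypothetical sketch (the class name and the plain-callable interface are assumptions for illustration, not the library's actual API):

```python
class CompositeQueryTransformer:
    """Hypothetical composite: applies several query transformers in order."""

    def __init__(self, transformers):
        # Each transformer is a callable taking a query string
        # and returning a rewritten query string.
        self.transformers = transformers

    def transform(self, query):
        # Chain the transformers: each one sees the previous output.
        for t in self.transformers:
            query = t(query)
        return query
```

Ordering matters here: a rewriting transformer placed before an expansion transformer operates on the raw query, whereas the reverse order expands the already-rewritten query.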
-
Hi, this is an OpenCompass community volunteer.
OpenCompass is an open-source, efficient, and comprehensive evaluation suite and platform designed for large models. Looking forward to adding StreamEva…
-
If a GPU is available on the user's machine, using it instead of the CPU to process GIF files would be a much more efficient and effective solution in terms of processing time.
…
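One defensive pattern for this is to detect GPU availability at startup and fall back to the CPU path when none is found. A minimal sketch, assuming PyTorch as the GPU framework (an assumption; the project may use a different backend):

```python
def pick_device():
    """Prefer the GPU when one is available; otherwise fall back to CPU."""
    try:
        import torch  # assumption: torch is the GPU framework in use
        if torch.cuda.is_available():
            return "cuda"
    except ImportError:
        # torch not installed: the CPU path remains a safe default
        pass
    return "cpu"
```

Frame-level GIF processing parallelizes well, so the same per-frame function can be dispatched to whichever device this check selects.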
-
### Summary of the Enhanced LLM Inference System
**Objective**: To create a robust, transparent, and efficient system for large language model (LLM) inference using CUDA, ensuring reproducibility, qu…
-
### Question Validation
- [x] I have searched both the documentation and Discord for an answer.
### Question
**Understanding the Problem Statement**
**Problem Statement:**
When querying a vecto…
-
In the demos I’ve seen of Leon AI, it appeared rather slow. I have no idea whether this was a limitation of the hardware or whether there were inefficiencies that might be improved upon. [GPT4All](https://github.c…