-
**Describe the solution you'd like**
With the recent [TensorRT-LLM support for Whisper](https://github.com/NVIDIA/TensorRT-LLM/tree/main/examples/whisper), and now that PyTriton supports TensorRT-LLM…
-
### 🚀 The feature, motivation and pitch
There is a new DP sharding strategy that is more flexible and general; see https://arxiv.org/abs/2311.00257, AMSP: Reducing Communication Overhead o…
-
### Summary of the Enhanced LLM Inference System
**Objective**: To create a robust, transparent, and efficient system for large language model (LLM) inference using CUDA, ensuring reproducibility, qu…
-
Hello team,
We typically use `gather_all_token_logits` to collect the logit tensors for post-processing. Especially for large vocabulary sizes (128,000), this can require a lot of GPU memory. For ex…
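As a back-of-the-envelope illustration of why gathering all token logits is memory-hungry (a sketch, not tied to any specific TensorRT-LLM API; the batch size and sequence length below are hypothetical), the footprint scales as batch × sequence length × vocabulary size × bytes per element:

```python
def logits_memory_gib(batch_size: int, seq_len: int,
                      vocab_size: int, bytes_per_elem: int = 2) -> float:
    """Estimate memory (GiB) of a [batch, seq_len, vocab] logits tensor.

    bytes_per_elem: 2 for fp16/bf16, 4 for fp32.
    """
    return batch_size * seq_len * vocab_size * bytes_per_elem / (1024 ** 3)

# e.g. batch 8, 4096 tokens, a 128,000-entry vocabulary, fp16 logits:
print(logits_memory_gib(8, 4096, 128_000))  # → 7.8125 (GiB) for this one tensor
```

Even at fp16, a single gathered logits tensor at these (assumed) sizes is several GiB, which is why returning only top-k logits or post-processing on the fly is often preferable.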
-
**Is your feature request related to a problem? Please describe.**
I am often frustrated by the limitation of being able to use only a single QueryTransformer at a time. This constraint makes it ch…
-
Consistency Large Language Models: A Family of Efficient Parallel Decoders
https://hao-ai-lab.github.io/blogs/cllm/
Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?
https://arxiv.or…
-
-
I just wrote a layout engine library with the help of an LLM; for simple HTML it is much more efficient than wkhtml: https://github.com/html2any/layout. You can try it.
It supports flex layout, CSS, page sp…
-
**Why**
To streamline user interactions with the large language model (LLM) in the chat application, users will be able to quickly select from a variety of predefined prompt templates. This featur…
-
**What would you like to be added/modified:**
1. Build a collaborative code-intelligence agent alignment dataset for LLMs:
- The dataset should include behavioral trajectories, feedback, and i…