-
### Is your feature request related to a problem? Please describe.
GroupChat uses a nested conversation between two agents. Currently it does not utilise the recent TransformMessages capability nor…
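To make the request concrete, here is a minimal pure-Python sketch of what a message transform applied before the nested conversation could look like. `limit_history` and the message dicts are hypothetical illustrations of the idea, not the actual AutoGen `TransformMessages` API:

```python
# Sketch: trim the outer chat history before seeding the nested conversation.
# `limit_history` is a hypothetical transform, not an AutoGen API.

def limit_history(messages, max_messages=4):
    """Keep the system message (if any) plus the last `max_messages` turns."""
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]
    return system + rest[-max_messages:]

history = [{"role": "system", "content": "You are a helpful agent."}]
history += [{"role": "user", "content": f"turn {i}"} for i in range(10)]

trimmed = limit_history(history, max_messages=4)
# The inner (nested) conversation would be seeded with `trimmed`
# instead of the full history, bounding context growth per turn.
```

The point is that GroupChat's inner chat could apply such a transform each time it re-enters the nested conversation, rather than always forwarding the full transcript.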
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find a…
-
**Describe the bug**
Hey Team, I'm trying to quantize Mixtral 8x22B with the W8A8 recipe, and it fails with two different issues depending on the version:
1)
`…
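For readers unfamiliar with the recipe, the weight half of W8A8 is symmetric int8 quantization. The sketch below is a conceptual pure-Python illustration of that step, not the implementation from the failing recipe above; `quantize_int8` and `dequantize` are hypothetical names:

```python
# Illustrative symmetric per-tensor int8 weight quantization (the "W8" of W8A8).

def quantize_int8(weights):
    """Return (int8 values, scale) for symmetric per-tensor quantization."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127.0 if max_abs else 1.0
    # Map each float into the signed int8 range [-128, 127].
    return [max(-128, min(127, round(w / scale))) for w in weights], scale

def dequantize(q, scale):
    """Recover approximate float weights from int8 values and the scale."""
    return [v * scale for v in q]

w = [0.5, -1.27, 0.03, 1.27]
q, s = quantize_int8(w)
w_hat = dequantize(q, s)  # close to w, up to the quantization step
```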
-
### Describe the issue
First of all, thank you for your great contributions.
I have a similar question to the [issue 146](https://github.com/microsoft/LLMLingua/issues/146), I cannot reproduce the…
-
### Library name
TinyChatEngine
### Library description
TinyChatEngine: On-Device LLM Inference Library
### Source repository URL
https://github.com/mit-han-lab/TinyChatEngine
### Project homepa…
-
## List
- tutorials
- [ ] #4 - @seochan99
- [ ] #5 - @seochan99
- [ ] #6 - @seochan99
- [ ] #17 - @bananana0118
- [ ] graph.mdx
- [ ] index.mdx
- [ ] llm_chain.mdx
- [ ]…
-
Basically, what I want to achieve is re-ranking and prompt compression before adding the retrieved docs to the context.
I read that it could drastically improve RAG performance. I think right now t…
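A minimal sketch of that pipeline, assuming a pluggable scorer and compressor: `rerank`, `compress`, and the toy lexical-overlap scorer below are stand-ins for a real cross-encoder re-ranker and a tool like LLMLingua, not any library's actual API:

```python
# Hypothetical RAG post-retrieval pipeline: re-rank, then compress.

def rerank(query, docs, score, top_k=2):
    """Keep the top_k docs by relevance score."""
    return sorted(docs, key=lambda d: score(query, d), reverse=True)[:top_k]

def compress(doc, max_words=5):
    # Placeholder "compression": naive truncation. A real compressor would
    # drop low-information tokens instead of cutting at a word count.
    return " ".join(doc.split()[:max_words])

def build_context(query, docs, score, top_k=2):
    return "\n".join(compress(d) for d in rerank(query, docs, score, top_k))

docs = [
    "cats are mammals that purr and sleep a lot",
    "the query term appears here so this doc scores high",
    "unrelated text about weather patterns",
]
# Toy lexical-overlap scorer standing in for a re-ranker model.
overlap = lambda q, d: len(set(q.split()) & set(d.split()))
context = build_context("query term", docs, overlap, top_k=1)
```

Swapping `overlap` for a cross-encoder and `compress` for a learned compressor is where the claimed RAG quality/latency gains would come from.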
-
Sorry to raise this problem without offering a systematic analysis.
It will likely take me more time to do a more complete investigation of the "compression" ability of LLMs, as many may support "compressio…
-
# URL
- https://arxiv.org/abs/2403.09636
# Affiliations
- Piotr Nawrot, N/A
- Adrian Łańcucki, N/A
- Marcin Chochowski, N/A
- David Tarjan, N/A
- Edoardo M. Ponti, N/A
# Abstract
- Transfo…
-
As mentioned in the paper, key concepts might be omitted or corrupted by the compression, such that GPT can't process the compressed prompt.
You also mention there is an approach to op…
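The failure mode and one mitigation can be sketched in a few lines: protect key tokens from removal during compression. The `force_tokens` argument below mirrors the idea of telling the compressor which terms must survive, but the function and its drop heuristic are toy assumptions, not a real compressor:

```python
# Toy compressor: drops short tokens as "low information" unless protected.

def compress(words, force_tokens=()):
    keep = {t.lower() for t in force_tokens}
    return [w for w in words if len(w) > 3 or w.lower() in keep]

prompt = "take the full 50 mg dose with food every morning".split()

naive = compress(prompt)                              # "50 mg" is lost
protected = compress(prompt, force_tokens=["50", "mg"])  # dosage survives
```

The naive pass silently drops the dosage, which is exactly the kind of corruption that makes the downstream model unable to use the compressed prompt; forcing the critical tokens through preserves it.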