llm-compression Search Results

614 results for llm-compression


Robitx/gp.nvim #122

my adventures with GpWhisper: log to files the different com…

Now that I have my GPU used by localai I wanted to try whisper locally via `:GpWhisper` after installing sox and I got a not very helpful: ``` Gp: Whisper query exited: 2, 0 ``` I had installed …

teto updated 2 months ago
18
microsoft/MInference #20

[Question]: Question about KV-cache storage

### Describe the issue Thank you for the amazing work! 1. Does the model store the whole kv-cache of prefilling and generation on device? If so, how can the device hold the memory of 1M kv value…

DerrickYLJ updated 3 months ago
5
philschmid/llm-sagemaker-sample #4

Having a greater chunk length than 2048 in packing leads to …

Hi @philschmid, When I try to increase the chunk length to be greater than 2048, the training fails and runs into an OOM error on g5.4xlarge. Totally makes sense why it's happening, my question i…

abhimasand updated 11 months ago
16
meta-introspector/meta-meme #141

Hierarchy

Designing a Multi-Layered Hierarchy of Control. You: I'm working on an idea for a multi-layered hierarchy of control. Copilot: That sounds like an interesting project! A multi-layered hierarchy of co…

jmikedupont2 updated 6 months ago
5
neomutt/neomutt #4387

Segfault when replying to a forwarded `message/rfc822` part

Possibly related to #4177 but it also seems sufficiently different… ## Expected Behaviour When I enter attachment view of a message that's forwarding another message, I can hit `` on the `…

madduck updated 3 weeks ago
10
langfuse/langfuse #2155

bug: run not found and Generations are not stacked under Spa…

### Describe the bug When using Langchain ContextualCompressionRetriever, "run not found" was raised. ``` Traceback (most recent call last): File "/lib/python3.11/site-packages/langfuse/cal…

nathan-vo810 updated 3 months ago
2
langchain-ai/langchainjs #6912

Cannot pass document by retriever and throws "text.replace i…

### Checked other resources - [X] I added a very descriptive title to this issue. - [X] I searched the LangChain.js documentation with the integrated search. - [X] I used the GitHub search to find a …

lynicis updated 6 days ago
3
vllm-project/llm-compressor #43

IndexError: tuple index out of range

**Describe the bug** A clear and concise description of what the bug is. When trying to quantize the StarCoder2 models, I run into an index error due to estimates of the quantization. Specifically,…

Lin-K76 updated 2 months ago
2
foxcpu/Programming-Language-Trends #2

javascript weekly news

javascript weekly news

foxcpu updated 3 months ago
23
TencentARC/Open-MAGVIT2 #5

[question] any plans to train higher compression ratio?

Many thanks to the authors of this project! Bytedance's [TiTok](https://arxiv.org/pdf/2406.07550) uses a 1D codebook and achieves an impressive, super-high compression ratio (256x256 to 32 tokens); this is ver…

eisneim updated 4 months ago
3
