-
**Describe the bug**
After a model is generated by running `big_model_fp8.py`, lm_eval does not work unless the .py files from the original base model are transferred into the generated model folder. Happe…
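Until the export carries those files over itself, a minimal workaround sketch (the paths are hypothetical; point them at your local checkpoints) that replicates the manual transfer described above could look like:

```python
import shutil
from pathlib import Path

# Hypothetical paths; adjust to your local checkpoints.
base_model_dir = Path("/models/original-base-model")
generated_dir = Path("/models/big_model_fp8-output")

# Copy the custom modeling/config .py files that the FP8 export did not
# carry over, so lm_eval can load the generated model with trust_remote_code.
for py_file in base_model_dir.glob("*.py"):
    shutil.copy2(py_file, generated_dir / py_file.name)
```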
-
When running the provided example code for the TTT (Learning to Learn at Test Time) model, the output generated by the model is not coherent or meaningful. The expected output for the prompt "Greeting…
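One variable worth ruling out is decoding randomness; a minimal greedy-decoding sketch with transformers (the checkpoint id and prompt are placeholders, not the values from the report) would be:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical checkpoint id; substitute the TTT weights being tested.
model_id = "your-org/ttt-1b"
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

inputs = tok("Hello", return_tensors="pt")  # placeholder prompt
# Greedy decoding (do_sample=False) removes sampling noise when judging
# whether the model's output is coherent.
out = model.generate(**inputs, max_new_tokens=50, do_sample=False)
print(tok.decode(out[0], skip_special_tokens=True))
```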
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain.js documentation with the integrated search.
- [X] I used the GitHub search to find a …
-
During evaluation, I noticed that the models have different context lengths, such as qwen 128k, and that length is measured using the gpt-3.5-turbo tokenizer. For qwen 128k, should it be set as 1…
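If the harness really does count tokens with the gpt-3.5-turbo tokenizer, a quick check with tiktoken (my own sketch, not the project's code) makes the potential mismatch with Qwen's tokenizer easy to see:

```python
import tiktoken

# gpt-3.5-turbo maps to the cl100k_base encoding in tiktoken.
enc = tiktoken.encoding_for_model("gpt-3.5-turbo")

def token_length(text: str) -> int:
    # Count tokens the way the evaluation presumably does: with the
    # gpt-3.5-turbo tokenizer, regardless of which model is evaluated.
    return len(enc.encode(text))

print(token_length("An example passage from a long-context benchmark."))
```

Qwen's tokenizer will generally produce a different count for the same text, so a 128k budget measured in gpt-3.5-turbo tokens is not exactly 128k Qwen tokens.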
-
### Is there an existing issue for the same bug?
- [X] I have checked the troubleshooting document at https://docs.all-hands.dev/modules/usage/troubleshooting
- [X] I have checked the existing iss…
-
- [X] I have checked that a similar [feature request](https://github.com/Genymobile/scrcpy/issues?q=is%3Aopen+is%3Aissue+label%3A%22feature+request%22) does not already exist.
I know this kind of …
-
### System Info / 系統信息
On Windows 11, with Python 3.10.9 and CUDA 12.1:
transformers 4.41.0
xinference 0.13.1
torch 2.3.1+cu121
torchaudio …
-
**Describe the bug**
Hello vLLM team, thank you for your outstanding work. I think llm-compressor really fills a need: one simple, unified quantization framework for vLLM.
So the bug I am enc…
-
What I basically want to achieve is re-ranking and prompt compression before adding the retrieved docs to the context.
I read that this could drastically improve RAG performance. I think right now t…
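To illustrate the re-ranking half (a sketch under my own assumptions; the cross-encoder checkpoint and the `rerank` helper are mine, not an existing API), a minimal pass over the retrieved docs could look like:

```python
from sentence_transformers import CrossEncoder

# Model choice is an assumption; any cross-encoder re-ranker works similarly.
reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")

def rerank(query: str, docs: list[str], top_k: int = 3) -> list[str]:
    # Score every (query, doc) pair, then keep the highest-scoring docs.
    scores = reranker.predict([(query, doc) for doc in docs])
    ranked = sorted(zip(scores, docs), key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in ranked[:top_k]]

docs = ["retrieved chunk A ...", "retrieved chunk B ...", "retrieved chunk C ..."]
print(rerank("what does the user ask about?", docs, top_k=2))
```

Prompt compression would then run on the surviving top-k docs before they are appended to the context.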