-
The tokenizer evaluation section only reports the tokenizer's own intrinsic metrics, such as compression rate.
However, a tokenizer with a higher compression rate does not necessarily yield a better model. Could you also report results at the level of the final model?
For example, the BLEU scores in the SentencePiece experiments:
https://github.com/google/sentencepiece/blob/master/doc/experiments.md#english…
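To make the metric in question concrete: "compression rate" is often measured as characters per token. A minimal sketch, using a whitespace split as a stand-in for a real tokenizer such as SentencePiece (the function name and sample text are illustrative, not from the original post):

```python
# "Compression rate" here means characters per token: higher = fewer tokens
# for the same text. The whitespace split stands in for a real tokenizer.
def compression_rate(text: str, tokenize) -> float:
    tokens = tokenize(text)
    return len(text) / max(len(tokens), 1)

rate = compression_rate("ab cd", str.split)  # 5 chars / 2 tokens = 2.5
# A higher rate means the tokenizer packs more characters per token, but
# only downstream metrics (e.g. BLEU) show whether the model itself improves.
```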
-
### Describe the bug
Ubuntu 20.04
Python 3.11
xinference (latest)
### To Reproduce
To help us reproduce this bug, please provide the information below:
2023-12-08 11:41:10,825 - modelscope - INFO - PyTo…
-
## DSPy and ColBERT with Omar Khattab! - Weaviate Podcast - 85
[0:00](https://www.youtube.com/watch?v=CDung1LnLbY&t=0s) Weaviate at NeurIPS 2023!
[0:38](https://www.youtube.com/watch?v=CDung1LnLbY…
-
As part of the Llama 3.1 release, Meta is releasing an RFC for ‘Llama Stack’, a comprehensive set of interfaces/APIs for ML developers building on top of Llama foundation models. We are looking for f…
-
Let's go into self-reflective, auto-semiotic, stream-of-consciousness, free-style, note-taking, neologism-constructing mode. Consider the construction of the polynomial, each prime base carefully chosen…
-
I'm currently trying out the ollama app on my iMac (i7/Vega64) and I can't seem to get it to use my GPU.
I have tried running it with num_gpu 1 but that generated the warnings below.
`
2023/11/…
-
If I combine multiple strategies such as GPTQ + LLM-Pruner + LoRA, could the compression ratio of the LLM be greatly improved while still keeping acceptable performance?
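Back-of-envelope arithmetic for why stacking might help: quantization and pruning shrink the model along different axes, so their size reductions roughly multiply, while LoRA adds a small adapter rather than shrinking the base. The factors below are assumed illustrative numbers, not measured results, and real interaction effects (e.g. accuracy loss compounding) are not modeled:

```python
# Assumed numbers for illustration only:
#   4-bit GPTQ vs fp16 weights  -> ~4x smaller
#   LLM-Pruner removing ~20% of parameters -> keep_fraction = 0.8
def combined_ratio(quant_factor: float, keep_fraction: float) -> float:
    """Multiplicative size reduction from quantization + pruning."""
    return quant_factor / keep_fraction

ratio = combined_ratio(quant_factor=4.0, keep_fraction=0.8)  # -> 5.0x
```

Whether the quality stays acceptable after stacking is an empirical question; the techniques can interact (pruning a quantized model is not the same as quantizing a pruned one).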
-
My reproduction of the results on location 9 of the NQ dataset in the LongLLMLingua paper using the prompt compressor resulted in a large discrepancy from the original results. My hyperparameters are …
-
Description
When running the code, we successfully obtain a compressed model. However, when prompted with an input, the model generates random and repetitive outputs, often repeating the same letters…
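One way to make "repetitive output" concrete when reporting this kind of regression is to score the fraction of duplicated character n-grams in a generation. This helper is a hypothetical diagnostic, not part of the project being discussed:

```python
# Illustrative diagnostic: fraction of duplicated character n-grams.
# 0.0 = no repeated n-grams; values near 1.0 = highly repetitive text,
# e.g. the same letters repeated over and over.
def repetition_score(text: str, n: int = 3) -> float:
    grams = [text[i:i + n] for i in range(len(text) - n + 1)]
    if not grams:
        return 0.0
    return 1.0 - len(set(grams)) / len(grams)

repetition_score("aaaaaa")   # -> 0.75 (one unique trigram out of four)
repetition_score("abcdef")   # -> 0.0  (all trigrams unique)
```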
-
Hi there!
First of all, I appreciate the team for putting in the work for this research paper. I would like to preface this by saying that my comments here should just be considered a point of disc…