-
👋 Hello Neural Magic community developers,
I encountered an issue while computing the perplexity of a locally converted Llama3-8B sparse model using the llm-compressor library. I'm referring to the spars…
-
### What happened?
Hi, I've recently been learning the gguf-py library. I used gguf-py to write a script that produces a GGUF file, but when I tried to load the file with llama-cli, it sai…
-
Hi,
Can you help add a runtimeClass to the NIMCache and all the other CRDs?
I got this error:
```
Traceback (most recent call last):
  File "/usr/local/bin/download-to-cache", line 5, in <module>
    from vllm_nv…
```
-
I used the command `tune run generate --config custom_quantization.yaml prompt='Explain some topic'` to run inference on a fine-tuned Phi-3 model through torchtune.
Config custom_quantization.y…
-
Hi,
When running `mtq.quantize` with `"calibrator": "historgam"` in my config, I got the following assert error:
```
File "modelopt/torch/quantization/model_calib.py", line 220, in modelopt.torch.…
-
### System Info
```
pip install git+https://github.com/huggingface/transformers.git
pip install tokenizers==0.20.0
pip install accelerate==0.34.2
pip install git+https://github.com/huggingface/tr…
-
In the recent update to Modelling/Vector_Quantization.ipynb, code block [6] uses the variable `dataset_name`, which is not defined.
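A minimal sketch of the kind of fix needed: define `dataset_name` before the cell that references it. The value `"cifar10"` here is an assumed placeholder, not necessarily the dataset the notebook intends.

```python
# Hypothetical fix: define dataset_name before code block [6] runs.
# "cifar10" is an assumed placeholder value; substitute whatever
# dataset identifier the notebook actually expects.
dataset_name = "cifar10"
print(f"Using dataset: {dataset_name}")
```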
-
### Describe the bug
When I load the text_encoder like this:
```python
model_id = "black-forest-labs/FLUX.1-schnell"
text_encoder = T5EncoderModel.from_pretrained(
    model_id,
    subfolder="t…
-
Hi TensorRT-LLM team, your work is incredible.
By following the README file for [multimodal models](https://github.com/NVIDIA/TensorRT-LLM/blob/main/examples/multimodal/README.md), we were able to run…
-
### 🐛 Describe the bug
`python torchchat.py generate stories110M --quant torchchat/quant_config/cuda.json --prompt "It was a dark and stormy night, and"`
Using device=cuda Tesla T4
Loading model...…