-
In Colab, I ran some tests quantizing some AI models.
Once quantized, the files are usually between 4 and 7 GB.
The only thing that seems to work is to move them momentarily to Google Drive and th…
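Moving a multi-GB quantized file off Colab's local disk and into a mounted Drive folder can be scripted; a minimal sketch, assuming Drive has already been mounted (e.g. via `google.colab.drive.mount('/content/drive')`) and with placeholder paths:

```python
import shutil
from pathlib import Path

def move_to_drive(src: str, dst_dir: str) -> Path:
    """Copy a large local file into a (mounted) Drive folder, then
    delete the local copy to free Colab disk space."""
    dst_dir_path = Path(dst_dir)
    dst_dir_path.mkdir(parents=True, exist_ok=True)
    dst = dst_dir_path / Path(src).name
    shutil.copy2(src, dst)   # streamed copy, does not load the file into RAM
    Path(src).unlink()       # remove the local copy once it is safely on Drive
    return dst

# Hypothetical usage in Colab (paths are placeholders):
# move_to_drive("/content/model-q4.gguf", "/content/drive/MyDrive/models")
```

`shutil.copy2` streams in chunks, so even 4–7 GB files copy without extra memory pressure; deleting the source afterwards is what frees the Colab disk.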
-
Hi, I am trying to apply torch2trt to the [FairMot model](https://github.com/ifzhang/FairMOT). It depends on an external library, DCNv2.
1) With the option fp16_mode=True, DCNv2 cannot be converted correctly and me…
-
Thank you very much for your work.
I referred to your code to modify YOLOv5. With W4A8 quantization there is nearly a 3-point accuracy loss. Have you experimented with YOLOv5?
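W4A8 means 4-bit weights and 8-bit activations, and much of a few-point accuracy drop comes from the coarse 4-bit weight rounding. A minimal sketch of symmetric per-tensor 4-bit weight quantization (names are illustrative, not from the repo) makes that rounding error visible:

```python
import numpy as np

def quantize_w4(w: np.ndarray):
    """Symmetric per-tensor 4-bit quantization: integer levels in [-8, 7]."""
    scale = np.abs(w).max() / 7.0
    q = np.clip(np.round(w / scale), -8, 7).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.standard_normal((128, 128)).astype(np.float32)
q, s = quantize_w4(w)
# Mean absolute rounding error is bounded by scale/2 per element.
err = float(np.abs(w - dequantize(q, s)).mean())
```

Per-channel scales (one scale per output row instead of per tensor) usually cut this error substantially and are the first thing to check when W4A8 loses several points.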
-
Hello, I am getting an error when running the sample below.
The requested file does not exist in the original source,
so I copied and used the preprocessor_config.json file from a model in the same family.
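The workaround described, reusing `preprocessor_config.json` from a same-family checkpoint, can be scripted; a minimal sketch, with placeholder directory names:

```python
import json
from pathlib import Path

def borrow_preprocessor_config(donor_dir: str, target_dir: str) -> dict:
    """Copy preprocessor_config.json from a same-family model directory
    that has one into a checkpoint directory that is missing it."""
    cfg = json.loads((Path(donor_dir) / "preprocessor_config.json").read_text())
    (Path(target_dir) / "preprocessor_config.json").write_text(json.dumps(cfg, indent=2))
    return cfg

# Hypothetical usage (paths are placeholders):
# borrow_preprocessor_config("models/base-model", "models/my-finetune")
```

This only works when the two checkpoints genuinely share a preprocessor (same image size, normalization, tokenizer family, etc.), which is worth verifying field by field.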
…
-
Thank you for your efforts.
I'm curious to know whether there is any code or script for quantizing my own 2-bit Stable Diffusion models, rather than relying on the pre-existing model available on Goog…
-
## I fine-tuned my model on Mistral-7B-Instruct-v0.2 using QLoRA, then merged it back into the base model (I need to use vLLM). But I always get CUDA out of memory, even when I use an instance that has 48GB CPU…
-
### Your current environment
(venv-vllm-54) (base) root@I1ba088648b009018e4:/hy-tmp# nvidia-smi
Tue Aug 6 10:29:16 2024
+--------------------------------------------------------------------…
-
# Christian Mills - Training Keypoint R-CNN Models with PyTorch
Learn how to train Keypoint R-CNN models on custom datasets with PyTorch.
[https://christianjmills.com/posts/pytorch-train-keypoint-rc…
-
Hey, I'm using the MX datatypes. It seems that the aten.linear.default function has not been implemented, which causes the linear layers inside the attention layers to fail with the MX datatypes.
Can you…
-
Hey, I want to quantize my Qwen2 model, but it seems the files are not found even though it clones and installs llama.cpp correctly. When quantizing the model I get this:
```txt
python3: can't …