-
Prior to filing: check whether this should be a bug report rather than a feature request. Everything supported, including the compatible versions of TensorFlow, is listed in the overview page of each technique. …
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Current Behavior
D:\Anaconda\envs\langchain\python.exe E:/langchain-ChatGLM-master/cli_demo.py
D:\Anaconda\en…
-
### Your current environment
The output of `python collect_env.py`
```text
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A…
-
```python
# Text Encoder
text_encoder = T5EncoderModel.from_pretrained(model_path, subfolder="text_encoder", torch_dtype=weight_dtype).to(device)
quantize_(text_encoder, quantization())
# Tran…
```
-
### 🚀 The feature, motivation and pitch
Please consider adding support for GPTQ and AWQ quantized Mixtral models.
I guess that after #4012 it's technically possible.
### Alternatives
_No r…
-
Hi, a few weeks ago @morettif and I fine-tuned `Llama70B` with QLoRA on an H100:
- `r=32`
- `alpha=64`
- `quantize=bnb.nf4-dq`
- `precision=bf16-true`
- `weight_decay=0`
- `batch_size=32`
-…
-
Hello @edgchen1 @wejoncy, I tried to quantize the mars model used in DeepSORT tracking. Using the example in `image_classification/cpu` I was able to quantize my mars model. The size of the model has reduc…
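The size reduction reported here comes from storing weights as int8 instead of fp32. Below is a minimal sketch of symmetric per-tensor int8 weight quantization, the general idea behind such tools; it is a hand-rolled illustration, not ONNX Runtime's actual implementation.

```python
# Symmetric int8 quantization: the scale maps the largest absolute weight
# onto the int8 range, so each fp32 weight becomes one byte (roughly a 4x
# size reduction for the weights). Illustrative sketch only.

def quantize_int8(weights):
    scale = max(abs(w) for w in weights) / 127.0
    q = [max(-128, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    return [x * scale for x in q]

w = [0.5, -1.27, 0.01, 1.0]
q, scale = quantize_int8(w)
print(q)                        # [50, -127, 1, 100]
print(dequantize(q, scale)[1])  # ~ -1.27 (small rounding error)
```

The rounding step is also where the small accuracy drop people observe after quantization comes from: dequantized weights only approximate the originals.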
-
I have a 3070 Ti and was wondering whether running this training pipeline on consumer-grade hardware is possible. If not, what are the recommended hardware requirements and the cost of training?
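A rough way to answer this yourself is to estimate training memory from the parameter count and bytes per parameter. The sketch below assumes full fine-tuning with fp16 weights and fp32 Adam states, and the 7B model size is an assumption for illustration (the issue does not say which model is trained); activations are ignored, so real usage is higher.

```python
# Back-of-the-envelope training VRAM estimate: weights + gradients +
# Adam optimizer states (m and v in fp32). Activations are NOT counted,
# so this is a lower bound. Illustrative only.

def training_vram_gb(n_params: float, weight_bytes: int = 2,
                     grad_bytes: int = 2, optim_bytes: int = 8) -> float:
    """Total bytes per parameter, converted to GB."""
    return n_params * (weight_bytes + grad_bytes + optim_bytes) / 1e9

print(training_vram_gb(7e9))     # 84.0 GB -- far beyond a 3070 Ti's 8 GB
print(training_vram_gb(1.24e8))  # ~1.5 GB -- a GPT-2-sized model would fit
```

This is why techniques like LoRA/QLoRA (which shrink the gradient and optimizer terms to the adapter only) are usually the route to consumer-GPU training.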
-
## ❓ InternalError when running llava model
I'm new to mlc-llm and I'm not sure if this is a bug or me doing something incorrectly. So far I have not managed to run any model successfully. I have tr…
plufz updated 1 month ago
-
Hi, I am trying to run the `Llama-3.1 8b + Unsloth 2x faster finetuning.ipynb` notebook you provided in the README. However, when I run the second cell on Google Colab, I get this error:
``` bash
------…