-
Hey, while running the 4-bit quantized model from https://huggingface.co/ThetaCursed/Ovis1.6-Gemma2-9B-bnb-4bit, I am getting the following error:
```
{
"name": "RuntimeError",
"message": "self an…
-
Good day.
After loading a saved LoRA model, I save it as a merged model. And after loading it from the merged checkpoint, I get generation like '+++++ 1000000000000000000000000000000000000000000000000…
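For reference, a minimal sketch of the merge workflow being described, assuming a peft LoRA adapter on top of a transformers base model; the model names and paths are placeholders:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Placeholder names; substitute the actual base model and adapter directory.
base = AutoModelForCausalLM.from_pretrained("base-model", torch_dtype=torch.float16)
tokenizer = AutoTokenizer.from_pretrained("base-model")

# Load the saved LoRA adapter and fold its weights into the base model.
model = PeftModel.from_pretrained(base, "lora-adapter-dir")
merged = model.merge_and_unload()

# Save the merged checkpoint for standalone loading later.
merged.save_pretrained("merged-dir")
tokenizer.save_pretrained("merged-dir")
```

One commonly reported cause of degenerate output like this is merging the adapter into a quantized (e.g. 4-bit) base instead of a full-precision copy of the base model.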
-
As mentioned in the title.
-
I tried to modify your example code to run this model on a low-VRAM card with a BNB 4-bit or 8-bit quantization config.
When using the bnb 4-bit config like below:
```python
qnt_config = BitsAndBytesConfig(load…
```
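For comparison, a typical 4-bit setup looks roughly like the sketch below; the quant type, compute dtype, and the placeholder model name are assumptions rather than values from the truncated snippet:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Common NF4 4-bit settings; adjust to the card's VRAM budget.
qnt_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
    bnb_4bit_use_double_quant=True,
)

# "some/model" is a placeholder; the issue does not say which model is loaded.
model = AutoModelForCausalLM.from_pretrained(
    "some/model",
    quantization_config=qnt_config,
    device_map="auto",
)
```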
-
The highest-capability models that can run on the latest iPhone would be useful. The best I've found that fits in 8 GB of RAM is Qwen 2.5 7B at 4-bit?
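As a rough sanity check on that 8 GB figure (back-of-envelope only; it ignores the KV cache, activations, and iOS per-app memory limits):

```python
# Back-of-envelope weight-memory estimate for a 7B model at 4-bit.
params = 7e9
bits_per_weight = 4.5  # assumption: ~4 bits plus quantization scales/overhead
weight_gb = params * bits_per_weight / 8 / 1e9
print(f"approx. weight memory: {weight_gb:.1f} GB")  # ~3.9 GB, under 8 GB of RAM
```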
-
After quantizing mlx-community/miqumaid-v3-70b with the command `mlx_lm.convert --hf-path miqumaid-v3-70b --mlx-path miqumaid-v3-70b-4bit -q --qbits 4`, the resulting model miqumaid-v3-70b-4bit cannot be infe…
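For reference, a minimal sketch of how the converted model would typically be smoke-tested with the mlx_lm Python API; the prompt and token budget are arbitrary:

```python
from mlx_lm import load, generate

# Load the locally converted 4-bit model directory.
model, tokenizer = load("miqumaid-v3-70b-4bit")

# Quick smoke test; prompt and max_tokens are arbitrary.
text = generate(model, tokenizer, prompt="Hello, how are you?", max_tokens=64)
print(text)
```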
-
How many GPUs are needed to fine-tune? I have tried 16 GPUs (96 GB each) but got CUDA OUT OF MEMORY.
-
I've tried to run argmaxinc/mlx-stable-diffusion-3.5-large-4bit-quantized without conda, using venv instead.
I am using pinned Python 3.10 and PyTorch 2.4.0 versions to avoid compatibility issues.
…
-
Code:
```
from unsloth import FastLanguageModel
import torch
max_seq_length = 16384 # Choose any! We auto support RoPE Scaling internally!
dtype = None # None for auto detection. Float16 for Te…
```
-
This happens when I try to load any '-bnb-4bit' model, but not when loading, for instance, 'unsloth/Meta-Llama-3.1-8B'.
No error is shown in the terminal.
`model, tokenizer = FastLanguageModel.from_pretrained(
…
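For reference, the typical full form of that call from the unsloth examples looks roughly like this; the model name and the load_in_4bit flag below are illustrative, not taken from the truncated snippet:

```python
from unsloth import FastLanguageModel

# Illustrative values; the original call above is truncated.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Meta-Llama-3.1-8B-bnb-4bit",  # assumed pre-quantized checkpoint
    max_seq_length=16384,
    dtype=None,            # auto-detect
    load_in_4bit=True,     # use the bitsandbytes 4-bit weights
)
```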