-
```
        W = W.t().to(dtype)
    else:
        W = layer.weight
    return W, bias
```
Only `layer.weight` is saved; the `bias` is never set before `return W, bias`.
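A minimal sketch of the kind of fix implied here, assuming the fragment above comes from a weight-fetching helper (the function name, the `transpose` flag, and the dtype default are hypothetical):
```
import torch

def get_weight_and_bias(layer: torch.nn.Linear, dtype=torch.float16, transpose=False):
    # Hypothetical helper mirroring the fragment above, but also populating
    # the bias instead of leaving it unset.
    if transpose:
        W = layer.weight.t().to(dtype)
    else:
        W = layer.weight
    bias = layer.bias.to(dtype) if layer.bias is not None else None
    return W, bias
```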
-
Pulled over from the Cats issue tracker; the suggestion is by @hanetzer: https://github.com/teamneoneko/Cats-Blender-Plugin-Unofficial-/issues/172
As the title says. Basic gist: if you use this option as is, you…
-
### Add Hardware Compatibility Check for FP8 Quantization
#### Issue Summary
In our current implementation, we provide three APIs for model computation in FP8 format. However, for dynamic activati…
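For illustration, a minimal sketch of such a compatibility check, assuming a CUDA backend; the helper name and the capability cutoff (SM 8.9 for Ada / SM 9.0 for Hopper, where native FP8 kernels are available) are assumptions rather than the project's actual API:
```
import torch

def device_supports_fp8() -> bool:
    # Hypothetical check: native FP8 (E4M3/E5M2) support generally requires
    # NVIDIA Ada (SM 8.9) or Hopper (SM 9.0) and newer GPUs.
    if not torch.cuda.is_available():
        return False
    return torch.cuda.get_device_capability() >= (8, 9)
```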
-
This is a very good piece of work. Could you please explain why, after adding Ledit in the second stage of training, the weights of `adapter_modules` are fixed and only the weights of `id_encoder` are…
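For readers, a generic PyTorch sketch of the freezing pattern the question refers to; the module definitions below are stand-ins with hypothetical shapes, not the repository's actual `adapter_modules` or `id_encoder`:
```
import torch
import torch.nn as nn

# Stand-in modules with hypothetical shapes.
adapter_modules = nn.ModuleList([nn.Linear(768, 768) for _ in range(2)])
id_encoder = nn.Sequential(nn.Linear(768, 768), nn.ReLU(), nn.Linear(768, 768))

# Second-stage pattern being asked about: freeze adapter_modules,
# train only id_encoder.
for p in adapter_modules.parameters():
    p.requires_grad_(False)

optimizer = torch.optim.AdamW(
    [p for p in id_encoder.parameters() if p.requires_grad], lr=1e-5
)
```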
-
Training the model with `train_script.py`, I save only the weights for every epoch using the following code:
```
# ModelCheckpoint callback to save model weights
checkpoint_callback = ModelChe…
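# (Not part of the original, truncated snippet.) A minimal weights-only,
# per-epoch checkpoint sketch, assuming Keras is the framework in use;
# the filepath pattern is a placeholder.
from tensorflow.keras.callbacks import ModelCheckpoint

checkpoint_callback = ModelCheckpoint(
    filepath="weights_epoch_{epoch:02d}.weights.h5",
    save_weights_only=True,  # store only the weights, not the full model
    save_freq="epoch",       # write a checkpoint at the end of every epoch
)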
-
Hi @danielhanchen,
I am unable to use "unsloth/gemma-2b-bnb-4bit" via vLLM. I am getting the error below while loading the model on an NVIDIA T4 or NVIDIA V100 GPU.
`engine_args = EngineArgs(model="u…
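For context, a minimal sketch of the offline vLLM entry point that fails here, assuming the standard `LLM` API rather than the reporter's exact `EngineArgs` call; the prompt and sampling settings are placeholders:
```
from vllm import LLM, SamplingParams

# Model name taken from the report; loading fails on T4/V100 per the report.
llm = LLM(model="unsloth/gemma-2b-bnb-4bit", dtype="half")
outputs = llm.generate(["Hello"], SamplingParams(max_tokens=16))
```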
-
### Describe the bug
Trying to load a local CKPT file using the `from_single_file()` method fails. It works fine with the .safetensors file from the same repo (Runway ML SD).
### Reproduction
```
import to…
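# (Not part of the original, truncated reproduction.) A minimal hedged sketch
# of the call being described; the local .ckpt path is a placeholder.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_single_file(
    "./v1-5-pruned-emaonly.ckpt",  # local CKPT file that fails per the report
    torch_dtype=torch.float16,
)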
-
Can sparsity and quantization be used simultaneously to further improve inference speed? Do you have any plans in this regard? Looking forward to your reply @robertgshaw2-neuralmagic
-
I am trying to run the INT4 quantization examples from `examples/3.x_api/pytorch/nlp/huggingface_models/language-modeling/quantization/weight_only`, but a package is missing from the requirement.tx…
-
I can successfully deploy llama3-8b-instruct with EAGLE. But there is a problem when deploying qwen2-7b-instruct with EAGLE.
I have converted the EAGLE-Qwen2-7B-Instruct model according to [vllm/mod…
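For reference, a minimal sketch of the kind of speculative-decoding setup being described, assuming vLLM's `speculative_model` / `num_speculative_tokens` engine arguments are available in the version in use; the draft-model path and token count are placeholders, not a confirmed fix:
```
from vllm import LLM

# Hypothetical configuration mirroring the report: Qwen2-7B-Instruct as the
# target model and a locally converted EAGLE draft model.
llm = LLM(
    model="Qwen/Qwen2-7B-Instruct",
    speculative_model="./EAGLE-Qwen2-7B-Instruct-converted",  # placeholder path
    num_speculative_tokens=5,
)
```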