weight-only Search Results

NVIDIA/TensorRT-LLM #1810

Is it "INT8 or FP8" with "--use_weight_only --weight_only_pr…

### System Info GPU - A10 ### Who can help? @Tracin ### Information - [X] The official example scripts - [ ] My own modified scripts ### Tasks - [X] An officially supported task in the `…

aiiAtelier updated 1 month ago

NVIDIA/TensorRT-LLM #1922

Support int type zero-points in weight-only GEMM

Currently some quantized huggingface models save zero-points in int4 datatype directly, like [Qwen/Qwen2-7B-Instruct-GPTQ-Int4](https://huggingface.co/Qwen/Qwen2-7B-Instruct-GPTQ-Int4) and [Qwen/Qwen2…

xiaonans updated 3 weeks ago

flowtyone/ComfyUI-Flowty-LDSR #8

Weights only load failed.

Weights only load failed. Re-running `torch.load` with `weights_only` set to `False` will likely succeed, but it can result in arbitrary code execution.Do it only if you get the file from a trusted so…

richkel updated 4 months ago

psp1g/papers #44

feature request: give less weight to chatters who only type …

Active chatters are added to the pool, and more frequent chats increase a chatter's weight to be selected. It should favor people who are not typing only in emotes

LittleBigBug updated 1 month ago

intel/neural-compressor #1980

how to evaluate AWQ ?

https://github.com/intel/neural-compressor/blob/master/docs/source/quantization_weight_only.md#examples how to set eval_func? https://github.com/intel/neural-compressor/blob/master/examples/3…

chunniunai220ml updated 1 week ago

Weifeng-Chen/prompt2prompt #5

why only controlling the weights for the unconditional predi…

hi, thanks for your excellent repo and i have much fun with trying it i have a quesion about the attention control on the unconditional prediction only in AttentionControl (p2p_utils.py) according …

garychan22 updated 2 weeks ago

comfyanonymous/ComfyUI_bitsandbytes_NF4 #26

Please add option to adjust GPU Weight since my gpu only has…

Please add option to adjust GPU Weight since my gpu only has 6GB Vram my RTX 3060 laptop can run with normal fp8 within 100-150 sec, but it talk super long with nf4 (my gpu run 99% all the time an…

ThepExcel updated 3 weeks ago

teamneoneko/Cats-Blender-Plugin-Unofficial- #172

Remove Zero Weight bones: list option and/or only operate on…

As title. Basic gist, if you use this option as is, you may nuke your parent bones for things like skirts, ears, or breasts. It would be nice to have a tickbox to prevent deletion of bones with chi…

hanetzer updated 2 days ago

NVIDIA/TensorRT-LLM #1235

"No valid weight only groupwise GEMM tactic" error during in…

### System Info - GPU Type: V100 ![WhatsApp Image 2024-03-05 at 9 50 36 AM](https://github.com/NVIDIA/TensorRT-LLM/assets/24196798/e9546886-695b-482b-96d4-1d4024935d7f) ### Who can help? @Tracin…

palVikram updated 2 weeks ago

zhu-xlab/SSL4EO-S12 #28

Documentation: Code Entry Point and Data Preprocessing Detai…

Thank you for providing the pre-trained weights B13_rn18_moco_0099_ckpt.pth. Could you please specify the exact code entry point used to generate these weights? The codebase includes multiple data…

douglasmacdonald updated 3 days ago

1000+ results for weight-only

1000+ results
for weight-only