quant Search Results - Githubissues

1000+ results
for quant

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

datawhalechina/DOPMC #149

whale-quant

### 你是否已经阅读并同意《Datawhale开源项目指南》？ - [X] 我已阅读并同意[《Datawhale开源项目指南》](https://github.com/datawhalechina/DOPMC/blob/main/GUIDE.md) ### 你是否已经阅读并同意《Datawhale开源项目行为准则》？ - [X] 我已阅读并同意[《Datawhale开源项目行为…

2951121599 updated 11 months ago
6
vllm-project/vllm #4744

[Usage]: Vllm AutoAWQ with 4-GPU doesnt utilize GPU

### Your current environment ... ### How would you like to use vllm I have downloaded a model. Now on my 4 GPU instance I attempt to quantize it using AutoAWQ. Whenever I run the script below I ge…

danielstankw updated 3 weeks ago
2
modelscope/ms-swift #2306

量化qwen2-audio-7b 报错

详细报错信息 Traceback (most recent call last): File "/mnt/d/ai/swift/swift/cli/export.py", line 5, in export_main() File "/mnt/d/ai/swift/swift/utils/run_utils.py", line 32, in x_main res…

Liufeiran123 updated 3 weeks ago
6
ExploreASL/ExploreASL #1797

Add AgeSex2Hct option to Quantification

### Description We realized that the function xASL_quant_AgeSex2Hct is not anywhere implemented in ExploreASL: ![Screenshot 2024-09-27 at 15 21 19](https://github.com/user-attachments/assets/c345a…

BeatrizPadrela updated 1 month ago
3
EricLBuehler/mistral.rs #893

Docker Build Failure: mistralrs-quant Fails with "No such fi…

## Minimum Reproducible Example ## Steps to Reproduce: 1. Run the Docker build command: ```bash docker build -f Dockerfile.cuda-all . -t mistral ``` 2. Observe the error during th…

ShivamSphn updated 1 day ago
8
usefulsensors/qc_npu_benchmark #1

There's useless DQ node in matmul_model_quant_io.onnx

There's useless DQ node in matmul_model_quant_io.onnx ![useless_dq_node](https://github.com/user-attachments/assets/2ef0506f-c8c0-4f8c-a600-db621643e51f) Also have some questions: 1. The model …

HectorSVC updated 1 month ago
3
NVIDIA/TensorRT-LLM #1770

Fail to build w4a8_awq on Llama 13b

### System Info ubuntu 20.04 tensorrt 10.0.1 tensorrt-cu12 10.0.1 tensorrt-cu12-bindings 10.0.1 tensorrt-cu12-libs 10.0.1 tensorrt-llm …

Hongbosherlock updated 4 days ago
12
pytorch/ao #752

[Feature Request] Fused fp8 matmul kernel (quant + dequant +…

Hey, team, AO provides awesome FP8 support with torch compile to get speed and memory improvement, however since torch compile is not always easily applicable for some models such as [MoE HF implement…

qingquansong updated 1 month ago
3
KarryRen/Karry-Studies-Math #1

We need to learn MORE math about Quant !

KarryRen updated 2 months ago
1
pytorch/pytorch #137574

[feature request] Provide FlexAttention as a new available/s…

### 🚀 The feature, motivation and pitch Originally discussed here with @drisspg : - https://github.com/pytorch/pytorch/pull/137526#issuecomment-2401115408 This would be good for exercising Flex…

vadimkantorov updated 1 week ago
2

上一页 1...4 5 6 7 8 9 10...100 下一页

1000+ results for quant

1000+ results
for quant