quantizing Search Results

1000+ results for quantizing

ostris/ai-toolkit #70

OverflowError: cannot fit 'int' into an index-sized integer

E:\ai-toolkit>python run.py first2 Running 1 job Error running job: Could not find config file first2 ======================================== Result: - 0 completed jobs - 1 failure =======…

Aderek514 updated 2 months ago · 6 comments

neuralmagic/AutoFP8 #38

fp8 vs bf16 performance problem

After quantizing an AutoModelForSequenceClassification model using autofp8, I observed a slight drop in performance. The left chart shows the inference time for bf16 linear layers, while the right cha…

AllenDou updated 2 months ago · 5 comments

pytorch/ao #533

[RFC] Add Auto-Round support

Hi, here is the INC team from Intel. Thank you for developing this amazing project. ### Motivation Our team has developed Auto-Round, a new weight-only quantization algorithm. It has achieved …

yiliu30 updated 2 months ago · 4 comments

Stability-AI/StableLM #17

GPU support Table & VRAM usage

It would be great to get the instructions to run the 3B model locally on a gaming GPU (e.g. 3090/4090 with 24GB VRAM). ### Confirmed GPUs From this thread | GPU Model | VRAM (GB) | Tuned-3b | T…

enricoros updated 1 year ago · 34 comments

ultralytics/ultralytics #3693

tflite yolov8 model not performing well after exporting it …

### Search before asking - [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…

srikar242 updated 2 weeks ago · 14 comments

xlang-ai/instructor-embedding #88

quantization and gpu acceleration of the quantized model.

Wasn't sure if this was technically a new issue but just in case I'm reposting here: I figured out how to dynamically quantize the instructor-xl model, but at the point that it's supposed to create…

BBC-Esq updated 2 months ago · 1 comment

pytorch/torchchat #753

[LAUNCH BLOCKER] Executorch segmentation fault https://githu…

Executorch issue: https://github.com/pytorch/executorch/issues/3588 https://github.com/pytorch/torchchat/actions/runs/9047866134/job/24860312456?pr=751 ``` + python3 torchchat.py export storie…

mikekgfb updated 3 months ago · 3 comments

ModelCloud/GPTQModel #383

[BUG] OOM when quantize the llama-3.1-405B-instruct model

**Describe the bug** Quantizing mlp.down_proj in layer 0 of 125: 0%| | 0/126 [00:44

nctu6 updated 1 week ago · 1 comment

openvinotoolkit/openvino_notebooks #1445

Serving the OpenVINO Model In OpenShift

I was going through the **[Clip Zero-Shot Image Classification](https://github.com/openvinotoolkit/openvino_notebooks/tree/main/notebooks/228-clip-zero-shot-image-classification)** section and I repli…

ChamanSahil updated 2 months ago · 5 comments

casper-hansen/AutoAWQ #219

ValueError: OC is not multiple of cta_N = 64

Hello. I have a question about `oc_batch_size`. https://github.com/casper-hansen/AutoAWQ/blob/63d2aaec7b3849eadd6fee8df767cf92c30ee65c/awq/quantize/quantizer.py#L262 As you can see above, `oc_batch_…

emphasis10 updated 2 months ago · 7 comments
