-
Thank you for submitting a good paper. I have three questions regarding GPUs:
1. Is there a way to make the code work using multiple GPUs?
2. What GPU was it trained on?
3. Did you use a model that …
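On the multi-GPU question: without seeing the paper's code, one common pattern is to wrap the model in PyTorch's `nn.DataParallel`, which splits each batch across all visible GPUs. This is a minimal sketch, assuming a standard `nn.Module`; the `nn.Linear` model here is a hypothetical stand-in for the paper's network.

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for the paper's network.
model = nn.Linear(16, 4)

# DataParallel replicates the model on every visible GPU and splits
# each input batch along dim 0; with one (or zero) GPUs it is a no-op.
if torch.cuda.device_count() > 1:
    model = nn.DataParallel(model)

device = "cuda" if torch.cuda.is_available() else "cpu"
model = model.to(device)

x = torch.randn(8, 16, device=device)
out = model(x)
print(out.shape)  # torch.Size([8, 4])
```

For serious multi-GPU training, `torch.nn.parallel.DistributedDataParallel` is usually preferred over `DataParallel`, but it requires launching one process per GPU.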
-
![image](https://github.com/user-attachments/assets/d8ea00ec-7106-4e30-bec3-02273b322218)
Hello, thank you very much for your work. …
-
## Where are we?
Exporting a PyTorch model for the ExecuTorch runtime goes through multiple AoT (Ahead-of-Time) stages.
At a high level, there are three stages.
1. `exir.capture`: This captures model’s graph …
-
### System Info
```shell
The examples provided do not work correctly. I think there have been updates in the Intel Neural Compressor toolkit, which is now 3.0, and in the Habana quantization toolkit, and…
-
The project is so cool. Using TensorRT or OpenVINO to optimize the model to a lower precision could improve edge-inference performance. By the way, is the project accepting pull requests?
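To illustrate the lower-precision point: even before involving TensorRT or OpenVINO, casting weights from FP32 to FP16 halves the model's memory footprint, which is a large part of the edge-inference win. A minimal sketch in plain PyTorch (the `nn.Linear` model is a hypothetical stand-in; in practice the optimizer toolkit performs the conversion during export):

```python
import torch
import torch.nn as nn

# Hypothetical edge model; TensorRT / OpenVINO would normally handle
# the precision conversion as part of their optimization pipeline.
model = nn.Linear(256, 256)

# Parameter storage before the cast (float32 = 4 bytes/element).
fp32_bytes = sum(p.numel() * p.element_size() for p in model.parameters())

# Cast all weights to float16 (2 bytes/element).
model_fp16 = model.half()
fp16_bytes = sum(p.numel() * p.element_size() for p in model_fp16.parameters())

print(fp32_bytes, fp16_bytes)  # FP16 weights use half the memory
```

The runtime speedup from FP16/INT8 depends on the hardware having fast low-precision kernels, which is exactly what TensorRT and OpenVINO target.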
-
When running the command `tune run generate ./custom_quantization_generation_config.yaml`, I encountered the following error:
`AttributeError: module 'torchtune.utils' has no attribute 'gen…
-
## ❓ Questions and Help
Hello,
Great paper, kudos!
After reading it, I was wondering whether it is possible to use these quantization methods on an already-trained model from Hugging Face Transformers, or shal…
-
### Your current environment
PyTorch version: 2.4.1+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubuntu 22.04.3 LTS (x86_64)
GCC version: (U…
-
I have my quantized YOLOv8m model and I'm trying to run inference, but I'm facing some errors.
When I run these lines of code:
"
ov_model = YOLO("YOLO8_quantization/quantization_OpenVino/quantized_re…
-
According to [this Refact blog post](https://refact.ai/blog/2023/self-hosted-15b-code-model/):
> Check out the [docs on self-hosting](https://github.com/smallcloudai/refact-self-hosting) to get you…