-
I just installed RawTherapee on a Windows 10 laptop that comes with a decent trackpad and a handful of useful gestures. I'm most interested in pinch-to-zoom and swiping to scroll between photos.
Looking…
-
@giorgiodaneri and I are working on a university project about mapping various transformer architectures onto different CiM implementations.
It would be nice to have more details about the workloa…
-
Hi,
I'm having trouble getting any of the devices I've tested to lock onto a simulated location, using an Adalm-Pluto:
Linux pluto 4.14.0-41915-gc2041af #279 SMP PREEMPT Mon Jan 14 13:13:47 CET 2019 armv7l GNU/…
-
**Is your feature request related to a problem? Please describe.**
I need to reduce the model size of YOLOv10 while maintaining performance.
**Describe the solution you'd like**
Sparse and Quantizatio…
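A minimal sketch of one way to combine the two in PyTorch, assuming a generic `torch.nn.Module` stand-in (the layers below are placeholders, not the actual YOLOv10 architecture): unstructured L1 pruning to introduce sparsity, followed by dynamic int8 quantization.

```python
import torch
import torch.nn.utils.prune as prune

# Placeholder model; a real YOLOv10 checkpoint would be loaded here instead.
model = torch.nn.Sequential(
    torch.nn.Conv2d(3, 16, 3),
    torch.nn.ReLU(),
    torch.nn.Flatten(),
    torch.nn.Linear(16 * 30 * 30, 10),
)

# Sparsify: zero out 50% of the smallest-magnitude weights in each layer.
for module in model.modules():
    if isinstance(module, (torch.nn.Conv2d, torch.nn.Linear)):
        prune.l1_unstructured(module, name="weight", amount=0.5)
        prune.remove(module, "weight")  # bake the sparsity into the weight tensor

# Quantize: dynamic int8 quantization of the Linear layers.
quantized = torch.ao.quantization.quantize_dynamic(
    model, {torch.nn.Linear}, dtype=torch.qint8
)
```

Note that pruning alone does not shrink the file: the zeros still occupy dense storage unless a sparse serialization or structured pruning is applied afterwards.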
-
A model can be in fp16 instead of fp32 when quantizing to int8/uint8. For example, brevitas quantizes such fp16 models to int8. In such cases, ONNX models have "cast"/"convert" nodes before and …
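A minimal sketch of the pattern being described, built by hand with `onnx.helper` (the tensor names and scale/zero-point values are illustrative): a `Cast` node upconverts the fp16 activation before `QuantizeLinear` consumes it.

```python
import onnx
from onnx import TensorProto, helper

# Cast the fp16 activation up to fp32 before QuantizeLinear, since
# QuantizeLinear traditionally takes a float32 input.
cast = helper.make_node("Cast", inputs=["x_fp16"], outputs=["x_fp32"],
                        to=TensorProto.FLOAT)
quant = helper.make_node("QuantizeLinear",
                         inputs=["x_fp32", "scale", "zero_point"],
                         outputs=["x_int8"])
graph = helper.make_graph(
    [cast, quant], "fp16_quant_pattern",
    inputs=[helper.make_tensor_value_info("x_fp16", TensorProto.FLOAT16, [1, 8])],
    outputs=[helper.make_tensor_value_info("x_int8", TensorProto.INT8, [1, 8])],
    initializer=[
        helper.make_tensor("scale", TensorProto.FLOAT, [], [0.02]),
        helper.make_tensor("zero_point", TensorProto.INT8, [], [0]),
    ],
)
model = helper.make_model(graph)
onnx.checker.check_model(model)
```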
-
## 🐛 Bug: Quantization - we need a better solution for tracking quantization backend settings in a model
Currently, there are various points of confusion:
1. a target backend (qnnpack / fbgemm) is…
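For context, a sketch of the two separate knobs in the current eager-mode API that this issue is about; nothing in the API ties them together:

```python
import torch

# 1. The qconfig chosen at prepare time (observer / fake-quant settings
#    that match a target backend's kernel constraints).
qconfig = torch.ao.quantization.get_default_qconfig("fbgemm")

# 2. The engine used at inference time to dispatch quantized kernels
#    (raises if the build does not support the requested engine).
torch.backends.quantized.engine = "fbgemm"

# A model prepared with the fbgemm qconfig can silently run with the
# engine set to "qnnpack", which is one of the points of confusion.
print(torch.backends.quantized.engine)
```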
-
### System Info
transformers==4.42.3
torch==2.3.0
### Who can help?
_No response_
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officially…
-
I noticed that the default number of position bits when quantizing positions with gltfpack is 14, while the component type is uint16, which means that 2 bits of data are cropped unle…
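The arithmetic, as a short sketch (the normalization of positions to [0, 1] is assumed): 14-bit values top out at 16383, so the upper 2 bits of each uint16 component go unused.

```python
# 14-bit position quantization stored in a uint16 component, as gltfpack
# does by default: values span [0, 2**14 - 1].
bits = 14
max_q = (1 << bits) - 1           # 16383

def quantize(x: float) -> int:
    """Map a normalized position in [0, 1] onto the 14-bit grid."""
    return round(x * max_q)

assert quantize(1.0) == 16383     # fits in uint16 (max 65535)
assert quantize(1.0) < (1 << 16)  # 2 of the 16 bits carry no data
```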
-
Good day everyone. I am trying to run the llama agentic system on an RTX 4090 with FP8 quantization for the inference model and meta-llama/Llama-Guard-3-8B-INT8 for the Guard. With sufficiently small max_seq_…
-
## ❔Question
Has anyone tried **post-training dynamic quantization** of a model?
When I quantize, the model size doubles and inference time is the same as the FP32 model.
Based on pytorch tutor…
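A minimal sketch for checking both symptoms, assuming a generic model with `Linear` layers (not the asker's actual network): dynamic quantization only rewrites the listed module types, so a model dominated by other layer types can stay effectively fp32 while the saved file still grows from extra metadata.

```python
import io
import torch

def serialized_size(m: torch.nn.Module) -> int:
    """Size of the saved state_dict in bytes."""
    buf = io.BytesIO()
    torch.save(m.state_dict(), buf)
    return buf.getbuffer().nbytes

model = torch.nn.Sequential(
    torch.nn.Linear(512, 512),
    torch.nn.ReLU(),
    torch.nn.Linear(512, 10),
)

# Only the listed module types (Linear here) are replaced; anything
# else, e.g. Conv2d, is left in fp32.
quantized = torch.ao.quantization.quantize_dynamic(
    model, {torch.nn.Linear}, dtype=torch.qint8
)

print(serialized_size(model), "->", serialized_size(quantized))
print(quantized)  # quantized layers appear as DynamicQuantizedLinear
```

If the printed module tree shows no `DynamicQuantizedLinear` entries, nothing was actually quantized, which would explain the unchanged inference time.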