-
### Describe the issue
I have a pre-trained CNN model saved as a TensorFlow SavedModel, and I converted it to an **.onnx form** as well as a **static quantized .onnx form**, and their inference latency at the…
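A quick way to compare the two models is to time each ONNX Runtime session's `run` call directly. The sketch below is a generic timing harness; the `measure_latency` helper and the session names in the comments are illustrative, not part of any library:

```python
import time
import statistics

def measure_latency(run_fn, warmup=5, iters=50):
    """Time a single-inference callable; returns the median latency in ms."""
    for _ in range(warmup):                     # warm up caches / lazy init
        run_fn()
    samples = []
    for _ in range(iters):
        t0 = time.perf_counter()
        run_fn()
        samples.append((time.perf_counter() - t0) * 1e3)
    return statistics.median(samples)

# Hypothetical usage with two onnxruntime.InferenceSession objects:
#   fp32_ms = measure_latency(lambda: fp32_session.run(None, {"input": x}))
#   int8_ms = measure_latency(lambda: int8_session.run(None, {"input": x}))
# Placeholder workload so the harness itself is runnable:
ms = measure_latency(lambda: sum(i * i for i in range(10_000)))
print(f"median latency: {ms:.3f} ms")
```

Using the median rather than the mean makes the comparison robust to one-off scheduling spikes.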
vonJJ updated 4 months ago
-
Status: Draft
Updated: 09/18/2024
# Objective
In this doc we’ll cover how the different optimization techniques in torchao are structured and how to contribute to them.
# torchao Stack Ove…
-
### 📚 The doc issue
How do I run the example execution_runner .exe, after building it with CMake from the tutorial https://pytorch.org/executorch/stable/getting-started-setup.html, with multiple threa…
-
I'd like to use this as a drop-in for `wick` and just wanted to check a couple of things. Your paper states that
> Similarly, it would be desirable to expand WICK&D to mixed fermionic/bosonic fiel…
-
## 🚀 Feature
Currently (as of 1.8.1) torch.nn.quantized.functional.conv1d/2d/3d/linear always require the output to be requantized to 8 bits. Conv operators ask for an output scale and zero point, while …
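For context, the requantization these operators perform can be modeled in a few lines: the int32 accumulator carries scale `input_scale * weight_scale`, and the output scale and zero point map it back to int8. This is an illustrative sketch of per-tensor affine requantization, not PyTorch's actual kernel:

```python
def requantize_to_int8(acc_int32, in_scale, w_scale, out_scale, out_zero_point):
    """Requantize a conv/linear int32 accumulator to a signed 8-bit output.

    The accumulator's implicit scale is in_scale * w_scale; the result is
    re-expressed at out_scale / out_zero_point and saturated to [-128, 127].
    """
    real = acc_int32 * (in_scale * w_scale)           # dequantize accumulator
    q = round(real / out_scale) + out_zero_point      # quantize to output params
    return max(-128, min(127, q))                     # clamp to the int8 range

print(requantize_to_int8(1200, in_scale=0.02, w_scale=0.01,
                         out_scale=0.05, out_zero_point=0))  # → 5
```

The feature request above amounts to skipping the `round`/clamp step and returning the accumulator (or a dequantized float) instead.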
-
I have pushed SD performance to the maximum. Currently I can generate 200 images per second on my 4090 using 1-step sd-turbo, the onediff compiler, the stable-fast compiler, and my own optimizations. …
-
`Tensor` objects (or more generally: anything fulfilling the `AbstractTensor` concept) are expected to divide their indices into bra and ket indices. There seems to be a connection (at least notation-…
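As a toy illustration of that bra/ket partition (the class and method names below are hypothetical, not the library's actual `AbstractTensor` interface):

```python
from dataclasses import dataclass

@dataclass
class IndexedTensor:
    """Toy stand-in for an AbstractTensor: indices split into bra and ket.

    Hypothetical sketch only: a contraction pairs a ket index of one tensor
    with a matching bra index of another.
    """
    label: str
    bra: tuple = ()
    ket: tuple = ()

    def contractions_with(self, other):
        """Ket indices of self that can contract against bra indices of other."""
        return sorted(set(self.ket) & set(other.bra))

f = IndexedTensor("f", bra=("p",), ket=("q",))
g = IndexedTensor("g", bra=("q", "r"), ket=("s", "t"))
print(f.contractions_with(g))  # → ['q']
```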
-
### Supported
- FullyConnected
### Not yet
- Conv
- DepthwiseConv
- BatchMatMul
- LSTM
- RNN
-
### What
Let's support int8 quantization in circle-quantizer.
### Why
Onert-micro supports int8 quantized kernels and contains faster CMSIS-NN kernels, which work with int8 quantization, not …
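For reference, the symmetric per-tensor int8 scheme that CMSIS-NN-style kernels typically consume can be sketched as follows (an illustrative helper, not circle-quantizer's actual code):

```python
def quantize_int8(values):
    """Symmetric per-tensor int8 quantization.

    The scale maps the largest magnitude onto 127; the zero point is fixed
    at 0, which is what symmetric signed schemes assume.
    """
    max_abs = max(abs(v) for v in values) or 1.0
    scale = max_abs / 127.0
    q = [max(-128, min(127, round(v / scale))) for v in values]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float values from int8 codes."""
    return [qi * scale for qi in q]

q, s = quantize_int8([-1.0, 0.0, 0.25, 0.6, 1.0])
print(q)  # → [-127, 0, 32, 76, 127]
```

Dequantizing `q` with `s` recovers the inputs to within one quantization step, which is the error budget int8 kernels are designed around.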
-
When I deploy on the Qualcomm HTP, I want the bias quantization to be 32 bits, but I can't find a parameter in AIMET's configuration file to set the bias quantization bit width. There are only settings…
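For background, most int8 backends derive the 32-bit bias parameters rather than exposing them as a setting: the bias shares the accumulator's scale, `input_scale * weight_scale`, with zero point 0, so it can be added directly to the int32 accumulator. A sketch of that convention (illustrative helper, not AIMET's API):

```python
def quantize_bias_int32(bias, input_scale, weight_scale):
    """Quantize a float bias vector to 32 bits the way int8 backends expect.

    bias_scale = input_scale * weight_scale, zero point 0, saturated to the
    signed 32-bit range so it matches the conv/linear accumulator.
    """
    bias_scale = input_scale * weight_scale
    lo, hi = -(2**31), 2**31 - 1
    return [max(lo, min(hi, round(b / bias_scale))) for b in bias]

print(quantize_bias_int32([0.5, -0.25], input_scale=0.02, weight_scale=0.01))
# → [2500, -1250]
```

Because the scale is fully determined by the input and weight scales, a separate bias bit-width knob would be redundant under this convention, which may be why the configuration file only exposes the other settings.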