-
Hi, when I compile syntaxnet model branch "documents-from-tensor", error occurs as follow:
ERROR: /home/darren.wy/.cache/bazel/_bazel_darren.wy/ac581bc1223ed80290130d37625e326f/external/org_tensorflow…
-
I noticed that only negative minimum values are preserved as zero points with the code.
https://github.com/mit-han-lab/llm-awq/blob/f0b4b68004f76d562658143cddea5aad8c1b8266/awq/quantize/quantizer.py#…
-
Hi
I am looking for a INT8 version of GEMM in OpenCL. If I am correct, CLBlast does not yet support it. Pls correct me if I am wrong and comment on the usage (perhaps a sample app etc.,).
Suppo…
-
## Quantization Method for conv, deconv and fc Layers.
Here I want to implement the quanzization on operation in conv, deconv and fc layers. Much quantization method are included in this paper: Ristr…
-
Hey,
I'm looking to perform `int8 * int8 -> fp32`. where at the output stage I dequantise the `int32_t` result into `float` (and then potentially add a bias. I was following the example from https:…
-
-
I saw the doc/quantization_example.cc.
there is a problem about the result_scale and result_zero_point?
how to make sure about it ?
you count the real result ,them calculate it ?
but if we…
-
Here is my understanding of the existing state of things and what I think we should be doing to make our lower-bit kernels more performant at both small and larger batch sizes. I'm making this an RFC …
-
Nice project
very intresting projects.
I tested it on nexus5 (snapdragon 800)
Are you planning to develop the project further?
for example
- new Nets: googlenet inception, squeezenet or etc
- new …
-
(update on 8/30)
## v1.0.0alpha on 12 September, 2016
- [ ] Add model serialization API @nyanp
- [x] Add protobuf parser for tensorflow @Wangyida
- [x] Fix compiler warnings and style issues @nyanp
#…
nyanp updated
5 years ago