-
**Describe the bug**
Running a forward pass on a `DeepSpeedTransformerInference` layer, with a sequence length of ~1000 tokens, results in an illegal memory access CUDA error.
**To Reproduce**
He…
-
**Describe the bug**
Fused GEMM example gives the wrong result for some values of `problemSize1.K`.
**Steps/Code to reproduce bug**
Set the following problem sizes in `examples/13_two_tensor_op_f…
-
Hello,
I have trained an `OPQ128_512,IVF262144_HNSW32,PQ128x6` index, and it took some time! Would it be possible to avoid training again and just transfer the IVF index to another index, but with a …
-
In the `arch/pretrained_model` directory, I use the following shell command to convert the model to TFLite.
```shell
tflite_convert ^
--output_file MobileFaceNet_9925_9680.tflite ^
--graph_def_file MobileFaceNet_99…
```
-
Hi,
I am trying to run
`n2d2 resnet-18-v1-onnx.ini -seed 1 -w /dev/null -export CPP -nbbits -32`
but after compiling the network the accuracy is 0%,
whereas testing the ONNX model directly gives 70%.
…
-
# Summary
I have updated the overview figure because AQ and 4-bit PQ have been implemented. Let me know if there are papers that should appear in this figure :)
![teaser2_main_202220225](ht…
-
Example: https://mila.quebec/en/publications/
It would be nice to reuse the same code as the Mila website. I'm not sure whether that's 'easily' possible via RTD.
-
### Discussed in https://github.com/fraunhoferhhi/vvenc/discussions/147
Originally posted by **Harshitha35** March 17, 2022
Hi. I have 2 questions. A quick response would be really helpful!
1…
-
Hello!
Thanks for providing the scripts for running baselines. The following one-liner:
```
python -u track1_baseline_faiss/baseline_faiss.py --dataset bigann-100M \
--indexkey OPQ64_128,I…
```
-
### Sanity checks
- [X] My issue relates to a specific CLI completion spec (e.g. `git checkout` is missing options in `git` completion spec). If your issue is more general, please create your issue h…