Open etrommer opened 2 years ago
A comparison with TFApprox was requested.
This should be kept separate from production code, similar to #6.
The test case is not fully defined yet. Most likely scenario: a comparison of Conv2D inference speed.
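Since the test case is still open, here is a rough sketch of what a framework-agnostic timing harness for the Conv2D comparison could look like. It is not the benchmark used in this repository. The `benchmark` function, its parameters, and the returned keys are all hypothetical names chosen for illustration. The callable `fn` is assumed to wrap a single forward pass of the layer under test.

```python
import statistics
import time
from typing import Callable, Optional


def benchmark(
    fn: Callable[[], object],
    warmup: int = 3,
    repeats: int = 10,
    sync: Optional[Callable[[], object]] = None,
) -> dict:
    """Time repeated calls to `fn` and return summary statistics in seconds.

    `sync` is an optional hook (assumption: a device synchronization call)
    that is invoked before the timer is stopped, so that asynchronously
    launched GPU work is included in the measurement.
    """
    # Warm-up runs are not timed; they absorb one-off costs such as
    # kernel compilation, autotuning, and memory allocation.
    for _ in range(warmup):
        fn()
    if sync is not None:
        sync()

    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        if sync is not None:
            sync()
        samples.append(time.perf_counter() - start)

    return {
        "median_s": statistics.median(samples),
        "min_s": min(samples),
        "repeats": repeats,
    }
```

For a GPU run, `fn` would wrap the Conv2D forward pass of each framework with identical input shapes, and `sync` would be the framework's device synchronization call (for PyTorch, `torch.cuda.synchronize`). Reporting the median rather than the mean keeps occasional outliers from skewing the comparison.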
Build environment used for TorchApprox: CUDA 10.1, TensorFlow 2.3.0, cuDNN 7.6.5
CUDA versions >= 11.0 do not compile due to an incompatible API.
Rerun after merging #8
conv_layer_bench.zip
Rerun using all High-Throughput models
conv_layer_benchmark.zip