Closed by bjin 5 years ago
Will it be something like RAVU? Sorry, just curious...
@fhlfibh It's just an attempt at a general speedup for all CNN super-resolution algorithms. I was playing with some CNN models back in September, but the model size was rather large, so I started on this first. I don't have much time to work on it now, so I hope someone can finish it, or make use of the existing code (if any).
I tried to implement the 3x3 Winograd convolution algorithm months ago, based on the scripts from the wincnn repo. In theory it could make a 3x3 convolution2d layer (in CNN models) 2.25 times faster. It would mostly benefit ResNet-based models, but could also help other CNN models.

The initial result is not promising: it's slower than the naive implementation. The code can be found in the conv2d-slow branch. I tried two approaches:

- 2x2 group and mat4-vec4 multiplication: just slightly slower than the naive approach.
- 4x4 group and mat4-mat4 multiplication: about 2 times slower.

A proper implementation requires finding where the overhead comes from first. Some low-level primitives might also be required, probably from vendor-specific extensions.
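
For readers unfamiliar with where the 2.25 figure comes from, here is a minimal sketch in plain Python, assuming the F(2x2, 3x3) variant of Winograd convolution with the commonly published transform matrices (the kind wincnn generates). It is not the GLSL code from the conv2d-slow branch; the function names and the use of `Fraction` for exact arithmetic are illustrative choices only.

```python
from fractions import Fraction

# Standard F(2x2, 3x3) Winograd transform matrices (assumed variant).
H = Fraction(1, 2)
BT = [[1, 0, -1, 0], [0, 1, 1, 0], [0, -1, 1, 0], [0, 1, 0, -1]]  # input transform
G = [[1, 0, 0], [H, H, H], [H, -H, H], [0, 0, 1]]                 # kernel transform
AT = [[1, 1, 1, 0], [0, 1, -1, -1]]                               # output transform


def matmul(a, b):
    """Plain matrix product of two nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(m):
    return [list(row) for row in zip(*m)]


def winograd_2x2_3x3(d, g):
    """2x2 output tile from a 4x4 input tile d and a 3x3 kernel g."""
    u = matmul(matmul(G, g), transpose(G))    # 4x4, can be precomputed per kernel
    v = matmul(matmul(BT, d), transpose(BT))  # 4x4, additions and subtractions only
    m = [[u[i][j] * v[i][j] for j in range(4)] for i in range(4)]  # 16 multiplies
    return matmul(matmul(AT, m), transpose(AT))


def direct_2x2_3x3(d, g):
    """Same 2x2 tile computed naively: 9 multiplies per output, 36 in total."""
    return [[sum(d[i + a][j + b] * g[a][b] for a in range(3) for b in range(3))
             for j in range(2)] for i in range(2)]


DIRECT_MULS = 2 * 2 * 3 * 3    # 36 multiplies per 2x2 output tile
WINOGRAD_MULS = 4 * 4          # 16 element-wise multiplies per tile
SPEEDUP = Fraction(DIRECT_MULS, WINOGRAD_MULS)  # 9/4, i.e. 2.25

if __name__ == "__main__":
    d = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
    g = [[1, 0, -1], [2, 0, -2], [1, 0, -1]]
    assert winograd_2x2_3x3(d, g) == direct_2x2_3x3(d, g)
    print(float(SPEEDUP))
```

Note that the 2.25 ratio counts only the element-wise multiplications. The additions in the input and output transforms are not free, which may be part of the overhead mentioned above.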