Closed ChristianFeldmann closed 2 years ago
I think everything is added. We will have to see about compilation on other systems but at least on windows using msbuild it works for me so far. I ran some numbers and holy moly this thing is fast:
So this seems to work. However, I don't think it would directly work with 16 bit transforms too. But maybe we don't ever want to do that anyways and just always do a 8 bit DCT because the additional bits don't give us more precision.
I got the asm files to compile using:
Now we have to make this work with cmake.