Closed Sauerstoffdioxid closed 4 years ago
On how CPU did you run it? It looks like I added support for AVX512 after 2020-03-06, but this is only tested on an emulator under linux (Intel SDE).
Try the "--info 16" option, If you see the message "SIMD type: 4" - then this is running with AVX512 vectorization. You can specify the "--cpu 3" option (curently undocumented) to force the use of AVX2.
:information_source: This is the conclusion of the quantization tables; the process of dequantization itself does not even begin. Indication of the process itself:
...
0011 000f 001b 001f 0026 0030 003e 0054
component[0] : size W0xH0
component[1] : size W1xH1
component[2] : size W2xH2
quantsmooth = XXXX.XXXms
It says SIMD type: 3
when run with --info 16. So I guess it uses AVX2? In any case, I'm fairly certain my CPU doesn't support that either (older Intel Core). However, trying with --cpu 2
it does work.
Thank you for the quick response 👍
This is interesting, it looks like my CPU features detection code is not working correctly for your CPU (or you have some early or development model). "--cpu 2" forces to use SSE2. Changes from 2020-03-06 in the detection code are that FMA detection is disabled. This is because I thought AVX2 processors always come with FMA support. Also due to a VirtualBox bug, it clears the FMA flag for the VM, even if the host processor has this flag.
It would be helpful if you sent me a screenshot of what the CPU-Z tool tells about your CPU.
And about VirtualBox... VirtualBox developers have ignored this problem for years.
https://www.virtualbox.org/ticket/15471 https://forums.virtualbox.org/viewtopic.php?f=6&t=84213
I'm testing windows version in a VirtualBox VM, and need to deal with this.
Thanks, the FMA bit is not needed here because there is no AVX2 support, I don't understand why my code works incorrectly for this case.
Found a bug, so I will make a new release with a fix and document that --cpu
switch.
Thank you for the quick fix! The new version fixed the issue.
Just stumbled across this today, and it looks like a really neat program. However, I can't get the current Windows builds to produce any output file. The latest working version seems to be 2020-03-06 afaict. I'm on Windows 10 64bit, version 10.0.19041.264, if that matters. Here's the full console out put if that helps