AcademySoftwareFoundation / OpenImageIO

Reading, writing, and processing images in a wide variety of file formats, using a format-agnostic API, aimed at VFX applications.
https://openimageio.readthedocs.org
Apache License 2.0
1.97k stars 597 forks source link

ctest failed about openimageio@2.3.16.0 on ubuntu_aarch64 #3458

Closed Tom-python0121 closed 1 year ago

Tom-python0121 commented 2 years ago

hello, I meet a problem:ctest failed about openimageio@2.3.16.0 on ubuntu_aarch64

Steps to reproduce the issue

root@ubuntu:build#ls
CMakeCache.txt  CPackConfig.cmake        CTestTestfile.cmake    Makefile        OpenImageIOConfig.cmake         OpenImageIOTargets.cmake  bin                  cpack    install_manifest.txt  src
CMakeFiles      CPackSourceConfig.cmake  DartConfiguration.tcl  OpenImageIO.pc  OpenImageIOConfigVersion.cmake  Testing                   cmake_install.cmake  include  lib                   testsuite

root@ubuntu:build# ctest -j126

The following tests FAILED:
          2 - oiiotool (Failed)
          4 - oiiotool-copy (Failed)
          6 - maketx (Failed)
          7 - oiiotool-maketx (Failed)
         30 - python-imagebufalgo (Failed)
         31 - python-texturesys (Failed)
         32 - python-texturesys.batch (Failed)
         36 - gif-broken (Failed)
         46 - openexr-damaged (Failed)
         49 - psd (Failed)
         50 - psd-colormodes (Failed)
         51 - raw-broken (Failed)
         56 - tiff-depths (Failed)
         58 - webp-broken (Failed)
         59 - zfile (Failed)
         73 - unit_simd (Failed)
Errors while running CTest

Can you tell me how to solve it?

lgritz commented 2 years ago

The tests with "broken" in the name are because you didn't build OIIO with support for those formats. For example, the "webp-broken" failure is because OIIO's build scripts didn't find any webp libraries at build time. The related tests have "-broken" appended so we can tell this is what happened. We usually run ctest with -E broken to skip those tests if we only want to test the formats we are trying to support on a partcular build.

Not sure about the others, though. It's hard to tell just from this summary list what happened.

Can you ctest -E broken --force-new-ctest-process --output-on-failure and show us the full output?

Tom-python0121 commented 2 years ago

@lgritz it shows:

Steps to reproduce the issue

root@ubuntu:build# ctest -E broken --force-new-ctest-process --output-on-failure
 1/84 Test #58: webp-broken ......................***Failed    2.78 sec
oiiotool ERROR: read : OpenImageIO could not find a format reader for "../oiio-images/webp/1.webp". Is it a file format that OpenImageIO doesn't know about?
 2/84 Test #59: zfile ............................***Failed    2.78 sec
command = ../../bin/oiiotool  -colorconfig ../common/OpenColorIO/nuke-default/config.ocio -pattern fill:topleft=0.1:topright=0.5:bottomleft=1.0:bottomright=0.3 64x64 1 -chnames Z -d float -o out.zfile >> out.txt  ;
../../bin/oiiotool --info -v -a --hash -stats out.zfile >> out.txt ;

Diff out.txt vs ref/out.txt was:
-------
--- out.txt Mon Jul  4 17:47:51 2022
+++ ref/out.txt Wed Jun  1 23:37:19 2022
@@ -1,6 +1,6 @@
 Reading out.zfile
 out.zfile            :   64 x   64, 1 channel, float zfile
-    SHA-1: EF2AEAE96A84C10881A388BDA2CD35CAD29F14FC
+    SHA-1: 5CFBFC862EC731262ABDDF70D684BAE2E3FC54FC
     channel list: z
     worldtocamera: 1, 0, 0, 0, 0, 1, 0, 0, 0, 0, 1, 0, 0, 0, 0, 1
     worldtoscreen: 1, 0, 0, 0, 0, 1, 0, 0, 0, 0, 1, 0, 0, 0, 0, 1

NO MATCH for out.txt
FAIL out.txt
-----out.txt----->
Reading out.zfile
out.zfile            :   64 x   64, 1 channel, float zfile
    SHA-1: EF2AEAE96A84C10881A388BDA2CD35CAD29F14FC
    channel list: z
19/84 Test #36: gif-broken .......................***Failed    2.86 sec
oiiotool ERROR: read : OpenImageIO could not find a format reader for "../oiio-images/gif_animation.gif". Is it a file format that OpenImageIO doesn't know about?
38/84 Test #49: psd ..............................***Failed    4.15 sec
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_123.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_123.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_123.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_123.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_123_nomaxcompat.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_123_nomaxcompat.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_123_nomaxcompat.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_123_nomaxcompat.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_bitmap.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_indexed_trans.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_rgb_8.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_rgba_8.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_rgba_8.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash src/different-mask-size.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash src/different-mask-size.psd
command = ../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_123.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_123_nomaxcompat.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_bitmap.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_indexed_trans.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_rgb_8.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_rgb_16.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_rgb_32.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_rgba_8.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  src/different-mask-size.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  src/layer-mask.psd >> out.txt ;

#### Error: this command failed:  ../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_123.psd >> out.txt
FAIL
#### Error: this command failed:  ../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_123_nomaxcompat.psd >> out.txt
38/84 Test #49: psd ..............................***Failed    4.15 sec
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_123.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_123.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_123.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_123.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_123_nomaxcompat.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_123_nomaxcompat.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_123_nomaxcompat.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_123_nomaxcompat.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_bitmap.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_indexed_trans.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_rgb_8.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_rgba_8.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash ../oiio-images/psd_rgba_8.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash src/different-mask-size.psd
oiiotool ERROR: -info : SHA-1: could not read image
Full command line was:
> oiiotool --info -v -a --hash src/different-mask-size.psd
command = ../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_123.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_123_nomaxcompat.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_bitmap.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_indexed_trans.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_rgb_8.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_rgb_16.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_rgb_32.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_rgba_8.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  src/different-mask-size.psd >> out.txt ;
../../bin/oiiotool --info -v -a --hash  src/layer-mask.psd >> out.txt ;

#### Error: this command failed:  ../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_123.psd >> out.txt
FAIL
#### Error: this command failed:  ../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_123_nomaxcompat.psd >> out.txt
FAIL
#### Error: this command failed:  ../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_bitmap.psd >> out.txt
FAIL
#### Error: this command failed:  ../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_indexed_trans.psd >> out.txt
FAIL
#### Error: this command failed:  ../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_rgb_8.psd >> out.txt
FAIL
#### Error: this command failed:  ../../bin/oiiotool --info -v -a --hash  ../oiio-images/psd_rgba_8.psd >> out.txt
FAIL
47/84 Test #32: python-texturesys.batch ..........***Failed    5.00 sec
oiiotool ERROR: -text : OpenImageIO was not compiled with FreeType for font rendering
48/84 Test #31: python-texturesys ................***Failed    5.03 sec
oiiotool ERROR: -text : OpenImageIO was not compiled with FreeType for font rendering
54/84 Test #51: raw-broken .......................***Failed    7.61 sec
oiiotool ERROR: read : OpenImageIO could not find a format reader for "../oiio-images/raw/RAW_FUJI_F700.RAF". Is it a file format that OpenImageIO doesn't know about?
65/84 Test #50: psd-colormodes ...................***Failed   11.46 sec
iconvert ERROR copying "src/pattern2-8-rgb.psd" to "pattern2-8-rgb.psd.tif" :
71/84 Test #46: openexr-damaged ..................***Failed   13.66 sec
Diff out.txt vs ref/out-exr2.2-alt.txt was:
-------
--- out.txt Mon Jul  4 17:48:04 2022
+++ ref/out-exr2.2-alt.txt  Wed Jun  1 23:37:19 2022
@@ -1,6 +1,6 @@
 tmpsrc/asan_heap-oob_4cb169_255_cc7ac9cde4b8634b31cb41c8fe89b92d_exr.exr :   64 x   64, 2 channel, deep half openexr
-oiiotool ERROR: read : Failed OpenEXR read: Error reading sample count data from image file "tmpsrc/asan_heap-oob_4cb169_255_cc7ac9cde4b8634b31cb41c8fe89b92d_exr.exr". Deep scanline sampleCount data corrupt at chunk 0 (negative sample count detected)
-Failed OpenEXR read: Error reading sample count data from image file "tmpsrc/asan_heap-oob_4cb169_255_cc7ac9cde4b8634b31cb41c8fe89b92d_exr.exr". Deep scanline sampleCount data corrupt at chunk 0 (negative sample count detected)
+oiiotool ERROR: read : Failed OpenEXR read: Error reading sample count data from image file "tmpsrc/asan_heap-oob_4cb169_255_cc7ac9cde4b8634b31cb41c8fe89b92d_exr.exr". Data decoding (rle) failed.
+Failed OpenEXR read: Error reading sample count data from image file "tmpsrc/asan_heap-oob_4cb169_255_cc7ac9cde4b8634b31cb41c8fe89b92d_exr.exr". Data decoding (rle) failed.
 Full command line was:
 > oiiotool -colorconfig ../common/OpenColorIO/nuke-default/config.ocio -n -info tmpsrc/asan_heap-oob_4cb169_255_cc7ac9cde4b8634b31cb41c8fe89b92d_exr.exr -o out.exr
73/84 Test #56: tiff-depths ......................***Failed   14.93 sec
NO MATCH for out.txt
FAIL out.txt
-----out.txt----->
Reading ../oiio-images/libtiffpic/depth/flower-minisblack-02.tif
../oiio-images/libtiffpic/depth/flower-minisblack-02.tif :   73 x   43, 1 channel, uint2 tiff
    SHA-1: F6BD9D10FB0DD8E9AC62DEBBB743A78FC48D3C9B
    channel list: Y
    compression: "none"
    DocumentName: "flower-minisblack-02.tif"
    Orientation: 1 (normal)
    PixelAspectRatio: 1
    planarconfig: "contig"
    ResolutionUnit: "in"
    Software: "GraphicsMagick 1.2 unreleased Q32 http://www.GraphicsMagick.org/"
    XResolution: 72
    YResolution: 72
    oiio:BitsPerSample: 2
    tiff:Compression: 1
    tiff:PhotometricInterpretation: 1
    tiff:PlanarConfiguration: 1
    tiff:RowsPerStrip: 431
Comparing "../oiio-images/libtiffpic/depth/flower-minisblack-02.tif" and "flower-minisblack-02.tif"
79/84 Test #30: python-imagebufalgo ..............***Failed   19.30 sec
Comparing "text.tif" and "../../../spack-src/testsuite/oiiotool-text/ref/text.tif"
idiff ERROR: Could not read text.tif:
    Invalid image file "text.tif": Could not open file: text.tif: No such file or directory
Comparing "text.tif" and "../../../spack-src/testsuite/oiiotool-text/ref/text-alt.tif"
idiff ERROR: Could not read text.tif:
    Invalid image file "text.tif": Could not open file: text.tif: No such file or directory
Comparing "text.tif" and "../../../spack-src/testsuite/oiiotool-text/ref/text-freetype2.6.5.tif"
idiff ERROR: Could not read text.tif:
    Invalid image file "text.tif": Could not open file: text.tif: No such file or directory
Comparing "text.tif" and "../../../spack-src/testsuite/oiiotool-text/ref/text-freetype2.7.tif"
idiff ERROR: Could not read text.tif:
    Invalid image file "text.tif": Could not open file: text.tif: No such file or directory
idiff ERROR: Could not read text.tif:
    Invalid image file "text.tif": Could not open file: text.tif: No such file or directory
Comparing "textcentered.tif" and "ref/textcentered.tif"
320 x 240, 3 channel
  Mean error = 0.0175388
  RMS error = 0.126291
  Peak SNR = 17.9726
  Max error  = 1 @ (236, 105, R)  values are 0, 0, 0 vs 1, 1, 1
  1745 pixels (2.27%) over 0.008
  1756 pixels (2.29%) over 0.004
FAILURE
84/84 Test #73: unit_simd ........................***Failed   73.44 sec
84/84 Test #73: unit_simd ........................***Failed   73.44 sec
OIIO SIMD support is: 
Hardware SIMD support is: 

  null benchmark 4:   352.5 Mvals/sec, (352.5 Mcalls/sec)
  null benchmark 8:   352.6 Mvals/sec, (352.6 Mcalls/sec)

vfloat4

load/store  vfloat4
  partial load 1 : 101 0 0 0
  partial store 1 : 1 0 0 0
  partial load 2 : 101 102 0 0
  partial store 2 : 1 2 0 0
  partial load 3 : 101 102 103 0
  partial store 3 : 1 2 3 0
  partial load 4 : 101 102 103 104
  partial store 4 : 1 2 3 4
  load scalar:   756.4 Mvals/sec, (189.1 Mcalls/sec)
  load vec:   705.1 Mvals/sec, (176.3 Mcalls/sec)
  store vec:   705.1 Mvals/sec, (176.3 Mcalls/sec)
  load 4 comps:   660.4 Mvals/sec, (165.1 Mcalls/sec)
  load 3 comps:   528.7 Mvals/sec, (176.2 Mcalls/sec)
  load 2 comps:   350.9 Mvals/sec, (175.4 Mcalls/sec)
  load 1 comps:   189.1 Mvals/sec, (189.1 Mcalls/sec)
  store 4 comps:   482.2 Mvals/sec, (120.6 Mcalls/sec)
  store 3 comps:   345.8 Mvals/sec, (115.3 Mcalls/sec)
  store 2 comps:   238.2 Mvals/sec, (119.1 Mcalls/sec)
  store 1 comps:   120.7 Mvals/sec, (120.7 Mcalls/sec)
load/store with conversion vfloat4
  load from unsigned short[]:   666.6 Mvals/sec, (166.6 Mcalls/sec)
  load from short[]:   680.3 Mvals/sec, (170.1 Mcalls/sec)
  load from unsigned char[]:   679.2 Mvals/sec, (169.8 Mcalls/sec)
  load from char[]:   658.3 Mvals/sec, (164.6 Mcalls/sec)
  load from half[]:   679.7 Mvals/sec, (169.9 Mcalls/sec)
  store to half[]:   422.1 Mvals/sec, (105.5 Mcalls/sec)
masked loadstore  vfloat4
  masked load with int mask:   560.6 Mvals/sec, (140.2 Mcalls/sec)
  masked load with bool mask:   586.1 Mvals/sec, (146.5 Mcalls/sec)
  masked store with int mask:   163.9 Mvals/sec, (163.9 Mcalls/sec)
  masked store with bool mask:   149.9 Mvals/sec, (149.9 Mcalls/sec)
scatter & gather  vfloat4
  gather:   645.3 Mvals/sec, (161.3 Mcalls/sec)
  gather_mask:   472.4 Mvals/sec, (118.1 Mcalls/sec)
  scatter:   625.0 Mvals/sec, (156.3 Mcalls/sec)
  scatter_mask:   454.3 Mvals/sec, (113.6 Mcalls/sec)
component_access  vfloat4
  operator[i]:   333.6 Mvals/sec, (333.6 Mcalls/sec)
  operator[2]:   364.7 Mvals/sec, (364.7 Mcalls/sec)
  operator[0]:   340.8 Mvals/sec, (340.8 Mcalls/sec)
  extract<2> :   325.1 Mvals/sec, (325.1 Mcalls/sec)
  extract<0> :   335.5 Mvals/sec, (335.5 Mcalls/sec)
  insert<2> :   207.6 Mvals/sec, (207.6 Mcalls/sec)
arithmetic  vfloat4
  operator+:   466.3 Mvals/sec, (116.6 Mcalls/sec)
  operator-:   388.7 Mvals/sec, (97.2 Mcalls/sec)
  operator- (neg):   361.3 Mvals/sec, (90.3 Mcalls/sec)
  operator*:   411.0 Mvals/sec, (102.8 Mcalls/sec)
  operator* (scalar):   454.3 Mvals/sec, (113.6 Mcalls/sec)
  operator/:   355.1 Mvals/sec, (88.8 Mcalls/sec)
  abs:   403.1 Mvals/sec, (100.8 Mcalls/sec)
  reduce_add:  1221.7 Mvals/sec, (305.4 Mcalls/sec)
  reference: add scalar:   175.8 Mvals/sec, (175.8 Mcalls/sec)
  reference: mul scalar:   188.7 Mvals/sec, (188.7 Mcalls/sec)
  reference: div scalar:   153.0 Mvals/sec, (153.0 Mcalls/sec)
comparisons  vfloat4
  operator< :   412.3 Mvals/sec, (103.1 Mcalls/sec)
  operator> :   382.3 Mvals/sec, (95.6 Mcalls/sec)
  operator<=:   404.4 Mvals/sec, (101.1 Mcalls/sec)
  operator>=:   486.0 Mvals/sec, (121.5 Mcalls/sec)
  operator==:   431.0 Mvals/sec, (107.8 Mcalls/sec)
  operator!=:   433.6 Mvals/sec, (108.4 Mcalls/sec)
shuffle  vfloat4
  shuffle<...> :  1320.6 Mvals/sec, (330.1 Mcalls/sec)
  shuffle<0> :   543.6 Mvals/sec, (135.9 Mcalls/sec)
  shuffle<1> :  1363.8 Mvals/sec, (340.9 Mcalls/sec)
  shuffle<2> :  1363.8 Mvals/sec, (340.9 Mcalls/sec)
  shuffle<3> :  1342.3 Mvals/sec, (335.6 Mcalls/sec)
swizzle  vfloat4
blend  vfloat4
  blend:   885.0 Mvals/sec, (221.2 Mcalls/sec)
  blend0:   978.7 Mvals/sec, (244.7 Mcalls/sec)
  blend0not:   894.3 Mvals/sec, (223.6 Mcalls/sec)
transpose  vfloat4
 before transpose:
    0 1 2 3
    4 5 6 7
    8 9 10 11
    12 13 14 15
 after transpose:
    0 4 8 12
    1 5 9 13
    2 6 10 14
    3 7 11 15
vectorops  vfloat4
  vdot:   578.8 Mvals/sec, (144.7 Mcalls/sec)
  dot:   158.7 Mvals/sec, (158.7 Mcalls/sec)
  vdot3:   586.8 Mvals/sec, (146.7 Mcalls/sec)
  dot3:   158.7 Mvals/sec, (158.7 Mcalls/sec)
fused  vfloat4
  madd old *+:   413.2 Mvals/sec, (103.3 Mcalls/sec)
  madd fused:   436.7 Mvals/sec, (109.2 Mcalls/sec)
  msub old *-:   402.8 Mvals/sec, (100.7 Mcalls/sec)
  msub fused:   370.6 Mvals/sec, (92.7 Mcalls/sec)
  nmadd old (-*)+:   443.2 Mvals/sec, (110.8 Mcalls/sec)
  nmadd fused:   403.1 Mvals/sec, (100.8 Mcalls/sec)
  nmsub old -(*+):   314.4 Mvals/sec, (78.6 Mcalls/sec)
  nmsub fused:   320.7 Mvals/sec, (80.2 Mcalls/sec)
mathfuncs vfloat4
  simd abs:   815.5 Mvals/sec, (203.9 Mcalls/sec)
  simd sign:   399.2 Mvals/sec, (99.8 Mcalls/sec)
  simd ceil:   501.6 Mvals/sec, (125.4 Mcalls/sec)
  simd floor:   404.4 Mvals/sec, (101.1 Mcalls/sec)
  simd round:   405.6 Mvals/sec, (101.4 Mcalls/sec)
  simd operator/:   358.2 Mvals/sec, (89.5 Mcalls/sec)
  simd safe_div:   451.9 Mvals/sec, (113.0 Mcalls/sec)
  simd rcp_fast:   358.4 Mvals/sec, (89.6 Mcalls/sec)
  float ifloor:   352.6 Mvals/sec, (352.6 Mcalls/sec)
  simd ifloor:   680.0 Mvals/sec, (170.0 Mcalls/sec)
  float floorfrac:   143.7 Mvals/sec, (143.7 Mcalls/sec)
  simd floorfrac:   245.4 Mvals/sec, (61.3 Mcalls/sec)
  float expf:    23.0 Mvals/sec, (23.0 Mcalls/sec)
  float fast_exp:   102.1 Mvals/sec, (102.1 Mcalls/sec)
  simd exp:    23.9 Mvals/sec, (6.0 Mcalls/sec)
  simd fast_exp:   102.7 Mvals/sec, (25.7 Mcalls/sec)
  float logf:    56.3 Mvals/sec, (56.3 Mcalls/sec)
  fast_log:   106.4 Mvals/sec, (106.4 Mcalls/sec)
  simd log:    55.9 Mvals/sec, (14.0 Mcalls/sec)
  simd fast_log:    30.3 Mvals/sec, (7.6 Mcalls/sec)
  float powf:    16.7 Mvals/sec, (16.7 Mcalls/sec)
  simd fast_pow_pos:    19.4 Mvals/sec, (4.9 Mcalls/sec)
  float sqrt:   133.3 Mvals/sec, (133.3 Mcalls/sec)
  simd sqrt:   281.0 Mvals/sec, (70.2 Mcalls/sec)
  float rsqrt:   179.3 Mvals/sec, (179.3 Mcalls/sec)
  simd rsqrt:   281.0 Mvals/sec, (70.2 Mcalls/sec)
  simd rsqrt_fast:   281.0 Mvals/sec, (70.2 Mcalls/sec)

vfloat3

load/store  vfloat3
  partial load 1 : 101 0 0
  partial store 1 : 1 0 0
  partial load 2 : 101 102 0
  partial store 2 : 1 2 0
  partial load 3 : 101 102 103
  partial store 3 : 1 2 3
  load scalar:   515.7 Mvals/sec, (171.9 Mcalls/sec)
  load vec:   483.7 Mvals/sec, (161.2 Mcalls/sec)
  store vec:   515.6 Mvals/sec, (171.9 Mcalls/sec)
  load 3 comps:   515.7 Mvals/sec, (171.9 Mcalls/sec)
  load 2 comps:   343.8 Mvals/sec, (171.9 Mcalls/sec)
  load 1 comps:   171.9 Mvals/sec, (171.9 Mcalls/sec)
  store 3 comps:   346.7 Mvals/sec, (115.6 Mcalls/sec)
  store 2 comps:   241.8 Mvals/sec, (120.9 Mcalls/sec)
  store 1 comps:   120.9 Mvals/sec, (120.9 Mcalls/sec)
load/store with conversion vfloat3
  load from unsigned short[]:   931.4 Mvals/sec, (310.5 Mcalls/sec)
  load from short[]:   931.4 Mvals/sec, (310.5 Mcalls/sec)
  load from unsigned char[]:   917.7 Mvals/sec, (305.9 Mcalls/sec)
  load from char[]:   931.4 Mvals/sec, (310.5 Mcalls/sec)
  load from half[]:   931.4 Mvals/sec, (310.5 Mcalls/sec)
  store to half[]:   412.6 Mvals/sec, (137.5 Mcalls/sec)
component_access  vfloat3
  operator[i]:   340.9 Mvals/sec, (340.9 Mcalls/sec)
  operator[2]:   365.0 Mvals/sec, (365.0 Mcalls/sec)
  operator[0]:   341.1 Mvals/sec, (341.1 Mcalls/sec)
  extract<2> :   365.0 Mvals/sec, (365.0 Mcalls/sec)
  extract<0> :   335.5 Mvals/sec, (335.5 Mcalls/sec)
  insert<2> :   208.0 Mvals/sec, (208.0 Mcalls/sec)
arithmetic  vfloat3
/home/stage/root/spack-stage-openimageio-2.3.16.0-i2pthrcpiigiwslfqrtrjbei46dpp3wi/spack-src/src/libutil/simd_test.cpp:983:
FAILED: a/b == div
    values were '1 2 2.33333' and '1 2 2.33333'
  operator+:   288.6 Mvals/sec, (96.2 Mcalls/sec)
  operator-:   270.6 Mvals/sec, (90.2 Mcalls/sec)
  operator- (neg):   298.2 Mvals/sec, (99.4 Mcalls/sec)
  operator*:   287.3 Mvals/sec, (95.8 Mcalls/sec)
  operator* (scalar):   278.8 Mvals/sec, (92.9 Mcalls/sec)
  operator/:   252.3 Mvals/sec, (84.1 Mcalls/sec)
  abs:   515.7 Mvals/sec, (171.9 Mcalls/sec)
  reduce_add:   960.0 Mvals/sec, (320.0 Mcalls/sec)
  add Imath::V3f:   479.7 Mvals/sec, (159.9 Mcalls/sec)
  add Imath::V3f with simd:   161.9 Mvals/sec, (54.0 Mcalls/sec)
  sub Imath::V3f:   510.9 Mvals/sec, (170.3 Mcalls/sec)
  mul Imath::V3f:   479.7 Mvals/sec, (159.9 Mcalls/sec)
  div Imath::V3f:   339.0 Mvals/sec, (113.0 Mcalls/sec)
  reference: add scalar:   176.3 Mvals/sec, (176.3 Mcalls/sec)
  reference: mul scalar:   189.1 Mvals/sec, (189.1 Mcalls/sec)
  reference: div scalar:   176.3 Mvals/sec, (176.3 Mcalls/sec)
vectorops  vfloat3
  vdot:   269.8 Mvals/sec, (89.9 Mcalls/sec)
  dot:   352.6 Mvals/sec, (352.6 Mcalls/sec)
  dot vfloat3:   192.6 Mvals/sec, (192.6 Mcalls/sec)
  dot Imath::V3f:   177.8 Mvals/sec, (177.8 Mcalls/sec)
  dot Imath::V3f with simd:   190.8 Mvals/sec, (190.8 Mcalls/sec)
  normalize Imath:    78.6 Mvals/sec, (78.6 Mcalls/sec)
  normalize Imath with simd:    35.6 Mvals/sec, (35.6 Mcalls/sec)
  normalize Imath with simd fast:    35.5 Mvals/sec, (35.5 Mcalls/sec)
  normalize simd:   157.8 Mvals/sec, (52.6 Mcalls/sec)
  normalize simd fast:   191.0 Mvals/sec, (63.7 Mcalls/sec)
fused  vfloat3
  madd old *+:   185.9 Mvals/sec, (62.0 Mcalls/sec)
  madd fused:   415.7 Mvals/sec, (103.9 Mcalls/sec)
  msub old *-:   184.6 Mvals/sec, (61.5 Mcalls/sec)
  msub fused:   456.1 Mvals/sec, (114.0 Mcalls/sec)
  nmadd old (-*)+:   191.3 Mvals/sec, (63.8 Mcalls/sec)
  nmadd fused:   441.9 Mvals/sec, (110.5 Mcalls/sec)
  nmsub old -(*+):   168.8 Mvals/sec, (56.3 Mcalls/sec)
  nmsub fused:   319.2 Mvals/sec, (79.8 Mcalls/sec)

vfloat8

load/store  vfloat8
  partial load 1 : 101 0 0 0 0 0 0 0
  partial store 1 : 1 0 0 0 0 0 0 0
  partial load 2 : 101 102 0 0 0 0 0 0
  partial store 2 : 1 2 0 0 0 0 0 0
  partial load 3 : 101 102 103 0 0 0 0 0
  partial store 3 : 1 2 3 0 0 0 0 0
  partial load 4 : 101 102 103 104 0 0 0 0
  partial store 4 : 1 2 3 4 0 0 0 0
  partial load 5 : 101 102 103 104 105 0 0 0
  partial store 5 : 1 2 3 4 5 0 0 0
  partial load 6 : 101 102 103 104 105 106 0 0
  partial store 6 : 1 2 3 4 5 6 0 0
  partial load 7 : 101 102 103 104 105 106 107 0
  partial store 7 : 1 2 3 4 5 6 7 0
  partial load 8 : 101 102 103 104 105 106 107 108
  partial store 8 : 1 2 3 4 5 6 7 8
  load scalar:  1161.1 Mvals/sec, (145.1 Mcalls/sec)
  load vec:   635.4 Mvals/sec, (79.4 Mcalls/sec)
  store vec:   631.7 Mvals/sec, (79.0 Mcalls/sec)
  load 8 comps:   795.4 Mvals/sec, (99.4 Mcalls/sec)
  load 7 comps:   692.8 Mvals/sec, (99.0 Mcalls/sec)
  load 6 comps:   805.9 Mvals/sec, (134.3 Mcalls/sec)
  load 5 comps:   812.5 Mvals/sec, (162.5 Mcalls/sec)
  load 4 comps:   650.0 Mvals/sec, (162.5 Mcalls/sec)
  load 3 comps:   520.0 Mvals/sec, (173.3 Mcalls/sec)
  load 2 comps:   346.7 Mvals/sec, (173.3 Mcalls/sec)
  load 1 comps:   165.7 Mvals/sec, (165.7 Mcalls/sec)
  store 8 comps:   614.8 Mvals/sec, (76.8 Mcalls/sec)
  store 7 comps:   556.2 Mvals/sec, (79.5 Mcalls/sec)
  store 6 comps:   688.9 Mvals/sec, (114.8 Mcalls/sec)
  store 5 comps:   601.1 Mvals/sec, (120.2 Mcalls/sec)
  store 4 comps:   480.9 Mvals/sec, (120.2 Mcalls/sec)
  store 3 comps:   360.4 Mvals/sec, (120.1 Mcalls/sec)
  store 2 comps:   229.4 Mvals/sec, (114.7 Mcalls/sec)
  store 1 comps:   120.2 Mvals/sec, (120.2 Mcalls/sec)
load/store with conversion vfloat8
  load from unsigned short[]:   935.7 Mvals/sec, (117.0 Mcalls/sec)
  load from short[]:   940.7 Mvals/sec, (117.6 Mcalls/sec)
  load from unsigned char[]:   939.5 Mvals/sec, (117.4 Mcalls/sec)
  load from char[]:   940.6 Mvals/sec, (117.6 Mcalls/sec)
  load from half[]:   940.2 Mvals/sec, (117.5 Mcalls/sec)
  store to half[]:   416.8 Mvals/sec, (52.1 Mcalls/sec)
masked loadstore  vfloat8
  masked load with int mask:   827.3 Mvals/sec, (103.4 Mcalls/sec)
  masked load with bool mask:   918.9 Mvals/sec, (114.9 Mcalls/sec)
  masked store with int mask:   112.7 Mvals/sec, (112.7 Mcalls/sec)
  masked store with bool mask:   112.2 Mvals/sec, (112.2 Mcalls/sec)
scatter & gather  vfloat8
  gather:   898.0 Mvals/sec, (112.2 Mcalls/sec)
  gather_mask:   569.3 Mvals/sec, (71.2 Mcalls/sec)
  scatter:   827.7 Mvals/sec, (103.5 Mcalls/sec)
  scatter_mask:   605.0 Mvals/sec, (75.6 Mcalls/sec)
component_access  vfloat8
  operator[i]:   340.9 Mvals/sec, (340.9 Mcalls/sec)
  operator[2]:   335.6 Mvals/sec, (335.6 Mcalls/sec)
  operator[0]:   365.0 Mvals/sec, (365.0 Mcalls/sec)
  extract<2> :   340.9 Mvals/sec, (340.9 Mcalls/sec)
  extract<0> :   365.0 Mvals/sec, (365.0 Mcalls/sec)
  insert<2> :   191.7 Mvals/sec, (191.7 Mcalls/sec)
arithmetic  vfloat8
  operator+:   561.6 Mvals/sec, (70.2 Mcalls/sec)
  operator-:   574.2 Mvals/sec, (71.8 Mcalls/sec)
  operator- (neg):   649.4 Mvals/sec, (81.2 Mcalls/sec)
  operator*:   561.4 Mvals/sec, (70.2 Mcalls/sec)
  operator* (scalar):   573.1 Mvals/sec, (71.6 Mcalls/sec)
  operator/:   305.4 Mvals/sec, (38.2 Mcalls/sec)
  abs:   633.5 Mvals/sec, (79.2 Mcalls/sec)
  reduce_add:   589.2 Mvals/sec, (73.7 Mcalls/sec)
  reference: add scalar:   176.3 Mvals/sec, (176.3 Mcalls/sec)
  reference: mul scalar:   189.1 Mvals/sec, (189.1 Mcalls/sec)
  reference: div scalar:   176.3 Mvals/sec, (176.3 Mcalls/sec)
comparisons  vfloat8
  operator< :   563.4 Mvals/sec, (70.4 Mcalls/sec)
  operator> :   631.9 Mvals/sec, (79.0 Mcalls/sec)
  operator<=:   591.1 Mvals/sec, (73.9 Mcalls/sec)
  operator>=:   563.5 Mvals/sec, (70.4 Mcalls/sec)
  operator==:   563.5 Mvals/sec, (70.4 Mcalls/sec)
  operator!=:   561.6 Mvals/sec, (70.2 Mcalls/sec)
shuffle  vfloat8
  shuffle<...> :  2224.1 Mvals/sec, (278.0 Mcalls/sec)
  shuffle<0> :  2599.9 Mvals/sec, (325.0 Mcalls/sec)
  shuffle<1> :  2599.9 Mvals/sec, (325.0 Mcalls/sec)
  shuffle<2> :  2599.9 Mvals/sec, (325.0 Mcalls/sec)
  shuffle<3> :  2599.9 Mvals/sec, (325.0 Mcalls/sec)
  shuffle<4> :  2599.9 Mvals/sec, (325.0 Mcalls/sec)
  shuffle<5> :  2599.9 Mvals/sec, (325.0 Mcalls/sec)
  shuffle<6> :  2599.9 Mvals/sec, (325.0 Mcalls/sec)
  shuffle<7> :  2599.9 Mvals/sec, (325.0 Mcalls/sec)
blend  vfloat8
  blend:   932.3 Mvals/sec, (116.5 Mcalls/sec)
  blend0:   603.2 Mvals/sec, (75.4 Mcalls/sec)
  blend0not:   614.3 Mvals/sec, (76.8 Mcalls/sec)
fused  vfloat8
  madd old *+:   226.9 Mvals/sec, (28.4 Mcalls/sec)
  madd fused:   206.1 Mvals/sec, (25.8 Mcalls/sec)
  msub old *-:   226.2 Mvals/sec, (28.3 Mcalls/sec)
  msub fused:   205.7 Mvals/sec, (25.7 Mcalls/sec)
  nmadd old (-*)+:   216.9 Mvals/sec, (27.1 Mcalls/sec)
  nmadd fused:   204.7 Mvals/sec, (25.6 Mcalls/sec)
  nmsub old -(*+):   137.9 Mvals/sec, (17.2 Mcalls/sec)
  nmsub fused:   149.5 Mvals/sec, (18.7 Mcalls/sec)
mathfuncs vfloat8
  simd abs:   780.2 Mvals/sec, (97.5 Mcalls/sec)
  simd sign:   190.6 Mvals/sec, (23.8 Mcalls/sec)
  simd ceil:   779.9 Mvals/sec, (97.5 Mcalls/sec)
  simd floor:   778.8 Mvals/sec, (97.4 Mcalls/sec)
  simd round:   779.0 Mvals/sec, (97.4 Mcalls/sec)
  simd operator/:   358.0 Mvals/sec, (44.8 Mcalls/sec)
  simd safe_div:   449.5 Mvals/sec, (56.2 Mcalls/sec)
  simd rcp_fast:   358.0 Mvals/sec, (44.7 Mcalls/sec)
  float ifloor:   346.7 Mvals/sec, (346.7 Mcalls/sec)
  simd ifloor:   906.6 Mvals/sec, (113.3 Mcalls/sec)
  float floorfrac:   170.5 Mvals/sec, (170.5 Mcalls/sec)
  simd floorfrac:   294.3 Mvals/sec, (36.8 Mcalls/sec)
  float expf:    23.0 Mvals/sec, (23.0 Mcalls/sec)
  float fast_exp:   102.3 Mvals/sec, (102.3 Mcalls/sec)
  simd exp:    24.9 Mvals/sec, (3.1 Mcalls/sec)
  simd fast_exp:    85.4 Mvals/sec, (10.7 Mcalls/sec)
  float logf:    56.2 Mvals/sec, (56.2 Mcalls/sec)
  fast_log:   104.8 Mvals/sec, (104.8 Mcalls/sec)
  simd log:    56.2 Mvals/sec, (7.0 Mcalls/sec)
  simd fast_log:    17.0 Mvals/sec, (2.1 Mcalls/sec)
  float powf:    16.8 Mvals/sec, (16.8 Mcalls/sec)
  simd fast_pow_pos:    15.1 Mvals/sec, (1.9 Mcalls/sec)
  float sqrt:   133.3 Mvals/sec, (133.3 Mcalls/sec)
  simd sqrt:   280.8 Mvals/sec, (35.1 Mcalls/sec)
  float rsqrt:   179.3 Mvals/sec, (179.3 Mcalls/sec)
  simd rsqrt:   280.8 Mvals/sec, (35.1 Mcalls/sec)
  simd rsqrt_fast:   280.8 Mvals/sec, (35.1 Mcalls/sec)

vfloat16

load/store  vfloat16
  partial load 1 : 101 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  partial store 1 : 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  partial load 2 : 101 102 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  partial store 2 : 1 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  partial load 3 : 101 102 103 0 0 0 0 0 0 0 0 0 0 0 0 0
  partial store 3 : 1 2 3 0 0 0 0 0 0 0 0 0 0 0 0 0
  partial load 4 : 101 102 103 104 0 0 0 0 0 0 0 0 0 0 0 0
  partial store 4 : 1 2 3 4 0 0 0 0 0 0 0 0 0 0 0 0
  partial load 5 : 101 102 103 104 105 0 0 0 0 0 0 0 0 0 0 0
  partial store 5 : 1 2 3 4 5 0 0 0 0 0 0 0 0 0 0 0
  partial load 6 : 101 102 103 104 105 106 0 0 0 0 0 0 0 0 0 0
  partial store 6 : 1 2 3 4 5 6 0 0 0 0 0 0 0 0 0 0
  partial load 7 : 101 102 103 104 105 106 107 0 0 0 0 0 0 0 0 0
  partial store 7 : 1 2 3 4 5 6 7 0 0 0 0 0 0 0 0 0
  partial load 8 : 101 102 103 104 105 106 107 108 0 0 0 0 0 0 0 0
  partial store 8 : 1 2 3 4 5 6 7 8 0 0 0 0 0 0 0 0
  partial load 9 : 101 102 103 104 105 106 107 108 109 0 0 0 0 0 0 0
  partial store 9 : 1 2 3 4 5 6 7 8 9 0 0 0 0 0 0 0
  partial load 10 : 101 102 103 104 105 106 107 108 109 110 0 0 0 0 0 0
  partial store 10 : 1 2 3 4 5 6 7 8 9 10 0 0 0 0 0 0
  partial load 11 : 101 102 103 104 105 106 107 108 109 110 111 0 0 0 0 0
  partial store 11 : 1 2 3 4 5 6 7 8 9 10 11 0 0 0 0 0
  partial load 12 : 101 102 103 104 105 106 107 108 109 110 111 112 0 0 0 0
  partial store 12 : 1 2 3 4 5 6 7 8 9 10 11 12 0 0 0 0
  partial load 13 : 101 102 103 104 105 106 107 108 109 110 111 112 113 0 0 0
  partial store 13 : 1 2 3 4 5 6 7 8 9 10 11 12 13 0 0 0
  partial load 14 : 101 102 103 104 105 106 107 108 109 110 111 112 113 114 0 0
  partial store 14 : 1 2 3 4 5 6 7 8 9 10 11 12 13 14 0 0
  partial load 15 : 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 0
  partial store 15 : 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 0
  partial load 16 : 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116
  partial store 16 : 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
  load scalar:  1589.0 Mvals/sec, (99.3 Mcalls/sec)
  load vec:  1433.6 Mvals/sec, (89.6 Mcalls/sec)
  store vec:   924.1 Mvals/sec, (57.8 Mcalls/sec)
  load 16 comps:  1171.0 Mvals/sec, (73.2 Mcalls/sec)
  load 13 comps:  1171.6 Mvals/sec, (90.1 Mcalls/sec)
  load 9 comps:   675.4 Mvals/sec, (75.0 Mcalls/sec)
  load 8 comps:   615.4 Mvals/sec, (76.9 Mcalls/sec)
  load 7 comps:   673.7 Mvals/sec, (96.2 Mcalls/sec)
  load 6 comps:   662.3 Mvals/sec, (110.4 Mcalls/sec)
  load 5 comps:   589.2 Mvals/sec, (117.8 Mcalls/sec)
  load 4 comps:   561.7 Mvals/sec, (140.4 Mcalls/sec)
  load 3 comps:   338.6 Mvals/sec, (112.9 Mcalls/sec)
  load 2 comps:   233.4 Mvals/sec, (116.7 Mcalls/sec)
  load 1 comps:   112.7 Mvals/sec, (112.7 Mcalls/sec)
  store 16 comps:  1187.8 Mvals/sec, (74.2 Mcalls/sec)
  store 13 comps:   811.2 Mvals/sec, (62.4 Mcalls/sec)
  store 9 comps:   668.2 Mvals/sec, (74.2 Mcalls/sec)
  store 8 comps:   515.2 Mvals/sec, (64.4 Mcalls/sec)
  store 7 comps:   509.4 Mvals/sec, (72.8 Mcalls/sec)
  store 6 comps:   689.4 Mvals/sec, (114.9 Mcalls/sec)
  store 5 comps:   601.1 Mvals/sec, (120.2 Mcalls/sec)
  store 4 comps:   480.5 Mvals/sec, (120.1 Mcalls/sec)
  store 3 comps:   360.7 Mvals/sec, (120.2 Mcalls/sec)
  store 2 comps:   229.8 Mvals/sec, (114.9 Mcalls/sec)
  store 1 comps:   120.2 Mvals/sec, (120.2 Mcalls/sec)
load/store with conversion vfloat16
  load from unsigned short[]:  1090.6 Mvals/sec, (68.2 Mcalls/sec)
  load from short[]:  1078.7 Mvals/sec, (67.4 Mcalls/sec)
  load from unsigned char[]:  1105.0 Mvals/sec, (69.1 Mcalls/sec)
  load from char[]:  1101.7 Mvals/sec, (68.9 Mcalls/sec)
  load from half[]:  1005.0 Mvals/sec, (62.8 Mcalls/sec)
  store to half[]:   453.0 Mvals/sec, (28.3 Mcalls/sec)
masked loadstore  vfloat16
  masked load with int mask:   551.9 Mvals/sec, (34.5 Mcalls/sec)
  masked load with bool mask:   556.9 Mvals/sec, (34.8 Mcalls/sec)
  masked store with int mask:    35.6 Mvals/sec, (35.6 Mcalls/sec)
  masked store with bool mask:    35.6 Mvals/sec, (35.6 Mcalls/sec)
scatter & gather  vfloat16
  gather:   858.0 Mvals/sec, (53.6 Mcalls/sec)
  gather_mask:   427.4 Mvals/sec, (26.7 Mcalls/sec)
  scatter:   367.3 Mvals/sec, (23.0 Mcalls/sec)
  scatter_mask:   399.9 Mvals/sec, (25.0 Mcalls/sec)
component_access  vfloat16
  operator[i]:   335.5 Mvals/sec, (335.5 Mcalls/sec)
  operator[2]:   341.1 Mvals/sec, (341.1 Mcalls/sec)
  operator[0]:   365.0 Mvals/sec, (365.0 Mcalls/sec)
  extract<2> :   335.6 Mvals/sec, (335.6 Mcalls/sec)
  extract<0> :   365.0 Mvals/sec, (365.0 Mcalls/sec)
  insert<2> :   157.6 Mvals/sec, (157.6 Mcalls/sec)
arithmetic  vfloat16
  operator+:   383.6 Mvals/sec, (24.0 Mcalls/sec)
  operator-:   388.5 Mvals/sec, (24.3 Mcalls/sec)
  operator- (neg):   725.6 Mvals/sec, (45.4 Mcalls/sec)
  operator*:   389.9 Mvals/sec, (24.4 Mcalls/sec)
  operator* (scalar):   507.7 Mvals/sec, (31.7 Mcalls/sec)
  operator/:   201.4 Mvals/sec, (12.6 Mcalls/sec)
  abs:   816.7 Mvals/sec, (51.0 Mcalls/sec)
  reduce_add:   247.7 Mvals/sec, (15.5 Mcalls/sec)
  reference: add scalar:   176.3 Mvals/sec, (176.3 Mcalls/sec)
  reference: mul scalar:   189.1 Mvals/sec, (189.1 Mcalls/sec)
  reference: div scalar:   176.3 Mvals/sec, (176.3 Mcalls/sec)
comparisons  vfloat16
  operator< :   505.8 Mvals/sec, (31.6 Mcalls/sec)
  operator> :   507.8 Mvals/sec, (31.7 Mcalls/sec)
  operator<=:   507.5 Mvals/sec, (31.7 Mcalls/sec)
  operator>=:   499.9 Mvals/sec, (31.2 Mcalls/sec)
  operator==:   508.4 Mvals/sec, (31.8 Mcalls/sec)
  operator!=:   492.6 Mvals/sec, (30.8 Mcalls/sec)
shuffle  vfloat16
  shuffle4<> :  1286.1 Mvals/sec, (80.4 Mcalls/sec)
  shuffle<> :  1286.1 Mvals/sec, (80.4 Mcalls/sec)
blend  vfloat16
  blend:   472.5 Mvals/sec, (29.5 Mcalls/sec)
  blend0:   422.8 Mvals/sec, (26.4 Mcalls/sec)
  blend0not:   406.3 Mvals/sec, (25.4 Mcalls/sec)
fused  vfloat16
  madd old *+:   220.4 Mvals/sec, (13.8 Mcalls/sec)
  madd fused:   191.8 Mvals/sec, (12.0 Mcalls/sec)
  msub old *-:   223.0 Mvals/sec, (13.9 Mcalls/sec)
  msub fused:   180.4 Mvals/sec, (11.3 Mcalls/sec)
  nmadd old (-*)+:   221.2 Mvals/sec, (13.8 Mcalls/sec)
  nmadd fused:   178.2 Mvals/sec, (11.1 Mcalls/sec)
  nmsub old -(*+):   175.7 Mvals/sec, (11.0 Mcalls/sec)
  nmsub fused:   164.1 Mvals/sec, (10.3 Mcalls/sec)
mathfuncs vfloat16
  simd abs:   649.2 Mvals/sec, (40.6 Mcalls/sec)
  simd sign:   182.7 Mvals/sec, (11.4 Mcalls/sec)
  simd ceil:   656.2 Mvals/sec, (41.0 Mcalls/sec)
  simd floor:   656.2 Mvals/sec, (41.0 Mcalls/sec)
  simd round:   647.1 Mvals/sec, (40.4 Mcalls/sec)
  simd operator/:   238.2 Mvals/sec, (14.9 Mcalls/sec)
  simd safe_div:   452.1 Mvals/sec, (28.3 Mcalls/sec)
  simd rcp_fast:   228.2 Mvals/sec, (14.3 Mcalls/sec)
  float ifloor:   352.6 Mvals/sec, (352.6 Mcalls/sec)
  simd ifloor:   401.2 Mvals/sec, (25.1 Mcalls/sec)
  float floorfrac:   171.9 Mvals/sec, (171.9 Mcalls/sec)
  simd floorfrac:   205.1 Mvals/sec, (12.8 Mcalls/sec)
  float expf:    23.0 Mvals/sec, (23.0 Mcalls/sec)
  float fast_exp:   102.1 Mvals/sec, (102.1 Mcalls/sec)
  simd exp:    25.0 Mvals/sec, (1.6 Mcalls/sec)
  simd fast_exp:    83.3 Mvals/sec, (5.2 Mcalls/sec)
  float logf:    56.3 Mvals/sec, (56.3 Mcalls/sec)
  fast_log:   107.0 Mvals/sec, (107.0 Mcalls/sec)
  simd log:    56.3 Mvals/sec, (3.5 Mcalls/sec)
  simd fast_log:    15.1 Mvals/sec, (0.9 Mcalls/sec)
  float powf:    16.8 Mvals/sec, (16.8 Mcalls/sec)
  simd fast_pow_pos:    13.1 Mvals/sec, (0.8 Mcalls/sec)
  float sqrt:   133.3 Mvals/sec, (133.3 Mcalls/sec)
  simd sqrt:   239.0 Mvals/sec, (14.9 Mcalls/sec)
  float rsqrt:   179.3 Mvals/sec, (179.3 Mcalls/sec)
  simd rsqrt:   276.7 Mvals/sec, (17.3 Mcalls/sec)
  simd rsqrt_fast:   276.7 Mvals/sec, (17.3 Mcalls/sec)

vint4

load/store  vint4
  partial load 1 : 101 0 0 0
  partial store 1 : 1 0 0 0
  partial load 2 : 101 102 0 0
  partial store 2 : 1 2 0 0
  partial load 3 : 101 102 103 0
  partial store 3 : 1 2 3 0
  partial load 4 : 101 102 103 104
  partial store 4 : 1 2 3 4
  load scalar:   705.1 Mvals/sec, (176.3 Mcalls/sec)
  load vec:   705.1 Mvals/sec, (176.3 Mcalls/sec)
  store vec:   705.1 Mvals/sec, (176.3 Mcalls/sec)
  load 4 comps:   705.1 Mvals/sec, (176.3 Mcalls/sec)
  load 3 comps:   495.2 Mvals/sec, (165.1 Mcalls/sec)
  load 2 comps:   352.5 Mvals/sec, (176.3 Mcalls/sec)
  load 1 comps:   176.3 Mvals/sec, (176.3 Mcalls/sec)
  store 4 comps:   483.7 Mvals/sec, (120.9 Mcalls/sec)
  store 3 comps:   346.6 Mvals/sec, (115.5 Mcalls/sec)
  store 2 comps:   241.9 Mvals/sec, (120.9 Mcalls/sec)
  store 1 comps:   120.9 Mvals/sec, (120.9 Mcalls/sec)
load/store with conversion vint4
  load from int[]:   827.5 Mvals/sec, (206.9 Mcalls/sec)
  load from unsigned short[]:   770.4 Mvals/sec, (192.6 Mcalls/sec)
  load from short[]:   770.4 Mvals/sec, (192.6 Mcalls/sec)
  load from unsigned char[]:   696.4 Mvals/sec, (174.1 Mcalls/sec)
  load from char[]:   715.6 Mvals/sec, (178.9 Mcalls/sec)
  store to unsigned short[]:  1280.0 Mvals/sec, (320.0 Mcalls/sec)
  store to unsigned char[]:   709.6 Mvals/sec, (177.4 Mcalls/sec)
masked loadstore  vint4
  masked load with int mask:   637.9 Mvals/sec, (159.5 Mcalls/sec)
  masked load with bool mask:   592.8 Mvals/sec, (148.2 Mcalls/sec)
  masked store with int mask:   165.0 Mvals/sec, (165.0 Mcalls/sec)
  masked store with bool mask:   150.5 Mvals/sec, (150.5 Mcalls/sec)
scatter & gather  vint4
  gather:   623.8 Mvals/sec, (156.0 Mcalls/sec)
  gather_mask:   515.3 Mvals/sec, (128.8 Mcalls/sec)
  scatter:   653.6 Mvals/sec, (163.4 Mcalls/sec)
  scatter_mask:   514.7 Mvals/sec, (128.7 Mcalls/sec)
component_access  vint4
  operator[i]:   341.1 Mvals/sec, (341.1 Mcalls/sec)
  operator[2]:   335.6 Mvals/sec, (335.6 Mcalls/sec)
  operator[0]:   365.0 Mvals/sec, (365.0 Mcalls/sec)
  extract<2> :   341.1 Mvals/sec, (341.1 Mcalls/sec)
  extract<0> :   365.0 Mvals/sec, (365.0 Mcalls/sec)
  insert<2> :   207.9 Mvals/sec, (207.9 Mcalls/sec)
arithmetic  vint4
  operator+:   422.5 Mvals/sec, (105.6 Mcalls/sec)
  operator-:   433.3 Mvals/sec, (108.3 Mcalls/sec)
  operator- (neg):   472.6 Mvals/sec, (118.2 Mcalls/sec)
  operator*:   377.5 Mvals/sec, (94.4 Mcalls/sec)
  operator* (scalar):   449.7 Mvals/sec, (112.4 Mcalls/sec)
  operator/:   432.2 Mvals/sec, (108.0 Mcalls/sec)
  abs:   423.9 Mvals/sec, (106.0 Mcalls/sec)
  reduce_add:  1280.0 Mvals/sec, (320.0 Mcalls/sec)
  reference: add scalar:   177.8 Mvals/sec, (177.8 Mcalls/sec)
  reference: mul scalar:   190.8 Mvals/sec, (190.8 Mcalls/sec)
  reference: div scalar:   177.8 Mvals/sec, (177.8 Mcalls/sec)
bitwise  vint4
  operator&:   413.3 Mvals/sec, (103.3 Mcalls/sec)
  operator|:   418.6 Mvals/sec, (104.7 Mcalls/sec)
  operator^:   432.6 Mvals/sec, (108.1 Mcalls/sec)
  operator!:   472.8 Mvals/sec, (118.2 Mcalls/sec)
  andnot:   405.1 Mvals/sec, (101.3 Mcalls/sec)
  reduce_and:   315.2 Mvals/sec, (315.2 Mcalls/sec)
  reduce_or :   320.0 Mvals/sec, (320.0 Mcalls/sec)
comparisons  vint4
  operator< :   432.5 Mvals/sec, (108.1 Mcalls/sec)
  operator> :   409.5 Mvals/sec, (102.4 Mcalls/sec)
  operator<=:   275.9 Mvals/sec, (69.0 Mcalls/sec)
  operator>=:   275.9 Mvals/sec, (69.0 Mcalls/sec)
  operator==:   432.5 Mvals/sec, (108.1 Mcalls/sec)
  operator!=:   308.1 Mvals/sec, (77.0 Mcalls/sec)
shuffle  vint4
  shuffle<...> :  1320.6 Mvals/sec, (330.1 Mcalls/sec)
  shuffle<0> :  1341.8 Mvals/sec, (335.5 Mcalls/sec)
  shuffle<1> :  1364.3 Mvals/sec, (341.1 Mcalls/sec)
  shuffle<2> :  1363.8 Mvals/sec, (340.9 Mcalls/sec)
  shuffle<3> :  1363.8 Mvals/sec, (340.9 Mcalls/sec)
blend  vint4
  blend:   476.2 Mvals/sec, (119.1 Mcalls/sec)
  blend0:   511.4 Mvals/sec, (127.9 Mcalls/sec)
  blend0not:   499.1 Mvals/sec, (124.8 Mcalls/sec)
test converting vint4 to uint16 
  load from uint16:  1280.0 Mvals/sec, (320.0 Mcalls/sec)
  convert to uint16:  1260.6 Mvals/sec, (315.2 Mcalls/sec)
test converting vint4 to uint8 
  load from uint8:   708.8 Mvals/sec, (177.2 Mcalls/sec)
  convert to uint16:   684.5 Mvals/sec, (171.1 Mcalls/sec)
shift  vint4
  [-80000000 -80000000 -80000000 -80000000] >>  1 == [-40000000 -40000000 -40000000 -40000000]
  [-80000000 -80000000 -80000000 -80000000] srl 1 == [40000000 40000000 40000000 40000000]
  [-80000000 -80000000 -80000000 -80000000] >>  4 == [-8000000 -8000000 -8000000 -8000000]
  [-80000000 -80000000 -80000000 -80000000] srl 4 == [8000000 8000000 8000000 8000000]
  [-1 -1 -1 -1] >>  1 == [-1 -1 -1 -1]
  [-1 -1 -1 -1] srl 1 == [7fffffff 7fffffff 7fffffff 7fffffff]
  [-1 -1 -1 -1] >>  4 == [-1 -1 -1 -1]
  [-1 -1 -1 -1] srl 4 == [fffffff fffffff fffffff fffffff]
  [ffff ffff ffff ffff] >>  1 == [7fff 7fff 7fff 7fff]
  [ffff ffff ffff ffff] srl 1 == [7fff 7fff 7fff 7fff]
  [ffff ffff ffff ffff] >>  4 == [fff fff fff fff]
  [ffff ffff ffff ffff] srl 4 == [fff fff fff fff]
  [3 3 3 3] >>  1 == [1 1 1 1]
  [3 3 3 3] srl 1 == [1 1 1 1]
  [3 3 3 3] >>  4 == [0 0 0 0]
  [3 3 3 3] srl 4 == [0 0 0 0]
  operator<<:   419.3 Mvals/sec, (104.8 Mcalls/sec)
  operator>>:   391.6 Mvals/sec, (97.9 Mcalls/sec)
  srl       :   464.7 Mvals/sec, (116.2 Mcalls/sec)
  rotl      :   239.6 Mvals/sec, (59.9 Mcalls/sec)
transpose  vint4
 before transpose:
    0 1 2 3
    4 5 6 7
    8 9 10 11
    12 13 14 15
 after transpose:
    0 4 8 12
    1 5 9 13
    2 6 10 14
    3 7 11 15

vint8

load/store  vint8
  partial load 1 : 101 0 0 0 0 0 0 0
  partial store 1 : 1 0 0 0 0 0 0 0
  partial load 2 : 101 102 0 0 0 0 0 0
  partial store 2 : 1 2 0 0 0 0 0 0
  partial load 3 : 101 102 103 0 0 0 0 0
  partial store 3 : 1 2 3 0 0 0 0 0
  partial load 4 : 101 102 103 104 0 0 0 0
  partial store 4 : 1 2 3 4 0 0 0 0
  partial load 5 : 101 102 103 104 105 0 0 0
  partial store 5 : 1 2 3 4 5 0 0 0
  partial load 6 : 101 102 103 104 105 106 0 0
  partial store 6 : 1 2 3 4 5 6 0 0
  partial load 7 : 101 102 103 104 105 106 107 0
  partial store 7 : 1 2 3 4 5 6 7 0
  partial load 8 : 101 102 103 104 105 106 107 108
  partial store 8 : 1 2 3 4 5 6 7 8
  load scalar:  1341.6 Mvals/sec, (167.7 Mcalls/sec)
  load vec:   596.7 Mvals/sec, (74.6 Mcalls/sec)
  store vec:   635.9 Mvals/sec, (79.5 Mcalls/sec)
  load 8 comps:   797.7 Mvals/sec, (99.7 Mcalls/sec)
  load 7 comps:   693.3 Mvals/sec, (99.0 Mcalls/sec)
  load 6 comps:   804.6 Mvals/sec, (134.1 Mcalls/sec)
  load 5 comps:   812.5 Mvals/sec, (162.5 Mcalls/sec)
  load 4 comps:   650.0 Mvals/sec, (162.5 Mcalls/sec)
  load 3 comps:   520.0 Mvals/sec, (173.3 Mcalls/sec)
  load 2 comps:   346.7 Mvals/sec, (173.3 Mcalls/sec)
  load 1 comps:   156.7 Mvals/sec, (156.7 Mcalls/sec)
  store 8 comps:   633.9 Mvals/sec, (79.2 Mcalls/sec)
  store 7 comps:   576.5 Mvals/sec, (82.4 Mcalls/sec)
  store 6 comps:   725.5 Mvals/sec, (120.9 Mcalls/sec)
  store 5 comps:   577.5 Mvals/sec, (115.5 Mcalls/sec)
  store 4 comps:   462.2 Mvals/sec, (115.6 Mcalls/sec)
  store 3 comps:   346.7 Mvals/sec, (115.6 Mcalls/sec)
  store 2 comps:   241.9 Mvals/sec, (120.9 Mcalls/sec)
  store 1 comps:   120.9 Mvals/sec, (120.9 Mcalls/sec)
load/store with conversion vint8
  load from int[]:   964.7 Mvals/sec, (120.6 Mcalls/sec)
  load from unsigned short[]:  1027.2 Mvals/sec, (128.4 Mcalls/sec)
  load from short[]:   962.7 Mvals/sec, (120.3 Mcalls/sec)
  load from unsigned char[]:  1027.2 Mvals/sec, (128.4 Mcalls/sec)
  load from char[]:   941.5 Mvals/sec, (117.7 Mcalls/sec)
  store to unsigned short[]:  1000.6 Mvals/sec, (125.1 Mcalls/sec)
  store to unsigned char[]:  1005.2 Mvals/sec, (125.6 Mcalls/sec)
masked loadstore  vint8
  masked load with int mask:   848.0 Mvals/sec, (106.0 Mcalls/sec)
  masked load with bool mask:   886.2 Mvals/sec, (110.8 Mcalls/sec)
  masked store with int mask:   113.0 Mvals/sec, (113.0 Mcalls/sec)
  masked store with bool mask:   112.8 Mvals/sec, (112.8 Mcalls/sec)
scatter & gather  vint8
  gather:   896.0 Mvals/sec, (112.0 Mcalls/sec)
  gather_mask:   634.8 Mvals/sec, (79.3 Mcalls/sec)
  scatter:   827.2 Mvals/sec, (103.4 Mcalls/sec)
  scatter_mask:   611.3 Mvals/sec, (76.4 Mcalls/sec)
component_access  vint8
  operator[i]:   341.1 Mvals/sec, (341.1 Mcalls/sec)
  operator[2]:   365.0 Mvals/sec, (365.0 Mcalls/sec)
  operator[0]:   335.6 Mvals/sec, (335.6 Mcalls/sec)
  extract<2> :   365.0 Mvals/sec, (365.0 Mcalls/sec)
  extract<0> :   341.1 Mvals/sec, (341.1 Mcalls/sec)
  insert<2> :   191.7 Mvals/sec, (191.7 Mcalls/sec)
arithmetic  vint8
  operator+:   635.2 Mvals/sec, (79.4 Mcalls/sec)
  operator-:   635.1 Mvals/sec, (79.4 Mcalls/sec)
  operator- (neg):   792.3 Mvals/sec, (99.0 Mcalls/sec)
  operator*:   635.3 Mvals/sec, (79.4 Mcalls/sec)
  operator* (scalar):   605.3 Mvals/sec, (75.7 Mcalls/sec)
  operator/:   472.6 Mvals/sec, (59.1 Mcalls/sec)
  abs:   784.8 Mvals/sec, (98.1 Mcalls/sec)
  reduce_add:  1300.0 Mvals/sec, (162.5 Mcalls/sec)
  reference: add scalar:   177.8 Mvals/sec, (177.8 Mcalls/sec)
  reference: mul scalar:   190.8 Mvals/sec, (190.8 Mcalls/sec)
  reference: div scalar:   177.8 Mvals/sec, (177.8 Mcalls/sec)
bitwise  vint8
  operator&:   636.4 Mvals/sec, (79.5 Mcalls/sec)
  operator|:   635.3 Mvals/sec, (79.4 Mcalls/sec)
  operator^:   635.1 Mvals/sec, (79.4 Mcalls/sec)
  operator!:   792.3 Mvals/sec, (99.0 Mcalls/sec)
  andnot:   637.2 Mvals/sec, (79.6 Mcalls/sec)
  reduce_and:   133.2 Mvals/sec, (133.2 Mcalls/sec)
  reduce_or :   133.2 Mvals/sec, (133.2 Mcalls/sec)
comparisons  vint8
  operator< :   636.3 Mvals/sec, (79.5 Mcalls/sec)
  operator> :   564.0 Mvals/sec, (70.5 Mcalls/sec)
  operator<=:   359.4 Mvals/sec, (44.9 Mcalls/sec)
  operator>=:   359.3 Mvals/sec, (44.9 Mcalls/sec)
  operator==:   634.9 Mvals/sec, (79.4 Mcalls/sec)
  operator!=:   362.9 Mvals/sec, (45.4 Mcalls/sec)
shuffle  vint8
  shuffle<...> :  2483.7 Mvals/sec, (310.5 Mcalls/sec)
  shuffle<0> :  2641.1 Mvals/sec, (330.1 Mcalls/sec)
  shuffle<1> :  2641.1 Mvals/sec, (330.1 Mcalls/sec)
  shuffle<2> :  2642.0 Mvals/sec, (330.3 Mcalls/sec)
  shuffle<3> :  2641.1 Mvals/sec, (330.1 Mcalls/sec)
  shuffle<4> :  2641.1 Mvals/sec, (330.1 Mcalls/sec)
  shuffle<5> :  2641.1 Mvals/sec, (330.1 Mcalls/sec)
  shuffle<6> :  2599.9 Mvals/sec, (325.0 Mcalls/sec)
  shuffle<7> :  2599.9 Mvals/sec, (325.0 Mcalls/sec)
blend  vint8
  blend:   934.7 Mvals/sec, (116.8 Mcalls/sec)
  blend0:   600.6 Mvals/sec, (75.1 Mcalls/sec)
  blend0not:   582.1 Mvals/sec, (72.8 Mcalls/sec)
test converting vint8 to uint16 
  load from uint16:  1003.3 Mvals/sec, (125.4 Mcalls/sec)
  convert to uint16:  1003.6 Mvals/sec, (125.5 Mcalls/sec)
test converting vint8 to uint8 
  load from uint8:  1005.8 Mvals/sec, (125.7 Mcalls/sec)
  convert to uint16:  1005.2 Mvals/sec, (125.6 Mcalls/sec)
shift  vint8
  [-80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000] >>  1 == [-40000000 -40000000 -40000000 -40000000 -40000000 -40000000 -40000000 -40000000]
  [-80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000] srl 1 == [40000000 40000000 40000000 40000000 40000000 40000000 40000000 40000000]
  [-80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000] >>  4 == [-8000000 -8000000 -8000000 -8000000 -8000000 -8000000 -8000000 -8000000]
  [-80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000] srl 4 == [8000000 8000000 8000000 8000000 8000000 8000000 8000000 8000000]
  [-1 -1 -1 -1 -1 -1 -1 -1] >>  1 == [-1 -1 -1 -1 -1 -1 -1 -1]
  [-1 -1 -1 -1 -1 -1 -1 -1] srl 1 == [7fffffff 7fffffff 7fffffff 7fffffff 7fffffff 7fffffff 7fffffff 7fffffff]
  [-1 -1 -1 -1 -1 -1 -1 -1] >>  4 == [-1 -1 -1 -1 -1 -1 -1 -1]
  [-1 -1 -1 -1 -1 -1 -1 -1] srl 4 == [fffffff fffffff fffffff fffffff fffffff fffffff fffffff fffffff]
  [ffff ffff ffff ffff ffff ffff ffff ffff] >>  1 == [7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff]
  [ffff ffff ffff ffff ffff ffff ffff ffff] srl 1 == [7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff]
  [ffff ffff ffff ffff ffff ffff ffff ffff] >>  4 == [fff fff fff fff fff fff fff fff]
  [ffff ffff ffff ffff ffff ffff ffff ffff] srl 4 == [fff fff fff fff fff fff fff fff]
  [3 3 3 3 3 3 3 3] >>  1 == [1 1 1 1 1 1 1 1]
  [3 3 3 3 3 3 3 3] srl 1 == [1 1 1 1 1 1 1 1]
  [3 3 3 3 3 3 3 3] >>  4 == [0 0 0 0 0 0 0 0]
  [3 3 3 3 3 3 3 3] srl 4 == [0 0 0 0 0 0 0 0]
  operator<<:   635.2 Mvals/sec, (79.4 Mcalls/sec)
  operator>>:   634.7 Mvals/sec, (79.3 Mcalls/sec)
  srl       :   786.0 Mvals/sec, (98.3 Mcalls/sec)
  rotl      :   309.4 Mvals/sec, (38.7 Mcalls/sec)

vint16

load/store  vint16
  partial load 1 : 101 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  partial store 1 : 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  partial load 2 : 101 102 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  partial store 2 : 1 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  partial load 3 : 101 102 103 0 0 0 0 0 0 0 0 0 0 0 0 0
  partial store 3 : 1 2 3 0 0 0 0 0 0 0 0 0 0 0 0 0
  partial load 4 : 101 102 103 104 0 0 0 0 0 0 0 0 0 0 0 0
  partial store 4 : 1 2 3 4 0 0 0 0 0 0 0 0 0 0 0 0
  partial load 5 : 101 102 103 104 105 0 0 0 0 0 0 0 0 0 0 0
  partial store 5 : 1 2 3 4 5 0 0 0 0 0 0 0 0 0 0 0
  partial load 6 : 101 102 103 104 105 106 0 0 0 0 0 0 0 0 0 0
  partial store 6 : 1 2 3 4 5 6 0 0 0 0 0 0 0 0 0 0
  partial load 7 : 101 102 103 104 105 106 107 0 0 0 0 0 0 0 0 0
  partial store 7 : 1 2 3 4 5 6 7 0 0 0 0 0 0 0 0 0
  partial load 8 : 101 102 103 104 105 106 107 108 0 0 0 0 0 0 0 0
  partial store 8 : 1 2 3 4 5 6 7 8 0 0 0 0 0 0 0 0
  partial load 9 : 101 102 103 104 105 106 107 108 109 0 0 0 0 0 0 0
  partial store 9 : 1 2 3 4 5 6 7 8 9 0 0 0 0 0 0 0
  partial load 10 : 101 102 103 104 105 106 107 108 109 110 0 0 0 0 0 0
  partial store 10 : 1 2 3 4 5 6 7 8 9 10 0 0 0 0 0 0
  partial load 11 : 101 102 103 104 105 106 107 108 109 110 111 0 0 0 0 0
  partial store 11 : 1 2 3 4 5 6 7 8 9 10 11 0 0 0 0 0
  partial load 12 : 101 102 103 104 105 106 107 108 109 110 111 112 0 0 0 0
  partial store 12 : 1 2 3 4 5 6 7 8 9 10 11 12 0 0 0 0
  partial load 13 : 101 102 103 104 105 106 107 108 109 110 111 112 113 0 0 0
  partial store 13 : 1 2 3 4 5 6 7 8 9 10 11 12 13 0 0 0
  partial load 14 : 101 102 103 104 105 106 107 108 109 110 111 112 113 114 0 0
  partial store 14 : 1 2 3 4 5 6 7 8 9 10 11 12 13 14 0 0
  partial load 15 : 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 0
  partial store 15 : 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 0
  partial load 16 : 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116
  partial store 16 : 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
  load scalar:  2066.1 Mvals/sec, (129.1 Mcalls/sec)
  load vec:  1167.5 Mvals/sec, (73.0 Mcalls/sec)
  store vec:  1063.5 Mvals/sec, (66.5 Mcalls/sec)
  load 16 comps:  1171.7 Mvals/sec, (73.2 Mcalls/sec)
  load 13 comps:  1171.6 Mvals/sec, (90.1 Mcalls/sec)
  load 9 comps:   675.7 Mvals/sec, (75.1 Mcalls/sec)
  load 8 comps:   616.0 Mvals/sec, (77.0 Mcalls/sec)
  load 7 comps:   678.4 Mvals/sec, (96.9 Mcalls/sec)
  load 6 comps:   662.3 Mvals/sec, (110.4 Mcalls/sec)
  load 5 comps:   564.5 Mvals/sec, (112.9 Mcalls/sec)
  load 4 comps:   616.0 Mvals/sec, (154.0 Mcalls/sec)
  load 3 comps:   335.4 Mvals/sec, (111.8 Mcalls/sec)
  load 2 comps:   259.6 Mvals/sec, (129.8 Mcalls/sec)
  load 1 comps:   112.9 Mvals/sec, (112.9 Mcalls/sec)
  store 16 comps:  1227.7 Mvals/sec, (76.7 Mcalls/sec)
  store 13 comps:   816.4 Mvals/sec, (62.8 Mcalls/sec)
  store 9 comps:   693.1 Mvals/sec, (77.0 Mcalls/sec)
  store 8 comps:   639.6 Mvals/sec, (80.0 Mcalls/sec)
  store 7 comps:   577.6 Mvals/sec, (82.5 Mcalls/sec)
  store 6 comps:   725.5 Mvals/sec, (120.9 Mcalls/sec)
  store 5 comps:   577.8 Mvals/sec, (115.6 Mcalls/sec)
  store 4 comps:   462.0 Mvals/sec, (115.5 Mcalls/sec)
  store 3 comps:   346.7 Mvals/sec, (115.6 Mcalls/sec)
  store 2 comps:   241.9 Mvals/sec, (120.9 Mcalls/sec)
  store 1 comps:   120.9 Mvals/sec, (120.9 Mcalls/sec)
load/store with conversion vint16
  load from int[]:  1167.0 Mvals/sec, (72.9 Mcalls/sec)
  load from unsigned short[]:  1125.3 Mvals/sec, (70.3 Mcalls/sec)
  load from short[]:  1150.0 Mvals/sec, (71.9 Mcalls/sec)
  load from unsigned char[]:  1158.4 Mvals/sec, (72.4 Mcalls/sec)
  load from char[]:  1164.2 Mvals/sec, (72.8 Mcalls/sec)
  store to unsigned short[]:  1189.3 Mvals/sec, (74.3 Mcalls/sec)
  store to unsigned char[]:  1244.7 Mvals/sec, (77.8 Mcalls/sec)
masked loadstore  vint16
  masked load with int mask:   557.8 Mvals/sec, (34.9 Mcalls/sec)
  masked load with bool mask:   569.8 Mvals/sec, (35.6 Mcalls/sec)
  masked store with int mask:    35.5 Mvals/sec, (35.5 Mcalls/sec)
  masked store with bool mask:    35.6 Mvals/sec, (35.6 Mcalls/sec)
scatter & gather  vint16
  gather:   825.5 Mvals/sec, (51.6 Mcalls/sec)
  gather_mask:   403.4 Mvals/sec, (25.2 Mcalls/sec)
  scatter:   407.2 Mvals/sec, (25.4 Mcalls/sec)
  scatter_mask:   400.3 Mvals/sec, (25.0 Mcalls/sec)
component_access  vint16
  operator[i]:   340.9 Mvals/sec, (340.9 Mcalls/sec)
  operator[2]:   365.0 Mvals/sec, (365.0 Mcalls/sec)
  operator[0]:   340.9 Mvals/sec, (340.9 Mcalls/sec)
  extract<2> :   365.0 Mvals/sec, (365.0 Mcalls/sec)
  extract<0> :   335.5 Mvals/sec, (335.5 Mcalls/sec)
  insert<2> :   166.4 Mvals/sec, (166.4 Mcalls/sec)
arithmetic  vint16
  operator+:   693.9 Mvals/sec, (43.4 Mcalls/sec)
  operator-:   658.7 Mvals/sec, (41.2 Mcalls/sec)
  operator- (neg):   785.2 Mvals/sec, (49.1 Mcalls/sec)
  operator*:   576.8 Mvals/sec, (36.1 Mcalls/sec)
  operator* (scalar):   527.1 Mvals/sec, (32.9 Mcalls/sec)
  operator/:   457.0 Mvals/sec, (28.6 Mcalls/sec)
  abs:   716.0 Mvals/sec, (44.8 Mcalls/sec)
  reduce_add:   950.6 Mvals/sec, (59.4 Mcalls/sec)
  reference: add scalar:   177.8 Mvals/sec, (177.8 Mcalls/sec)
  reference: mul scalar:   190.8 Mvals/sec, (190.8 Mcalls/sec)
  reference: div scalar:   177.8 Mvals/sec, (177.8 Mcalls/sec)
bitwise  vint16
  operator&:   694.6 Mvals/sec, (43.4 Mcalls/sec)
  operator|:   661.4 Mvals/sec, (41.3 Mcalls/sec)
  operator^:   623.0 Mvals/sec, (38.9 Mcalls/sec)
  operator!:   692.2 Mvals/sec, (43.3 Mcalls/sec)
  andnot:   688.2 Mvals/sec, (43.0 Mcalls/sec)
  reduce_and:    53.2 Mvals/sec, (53.2 Mcalls/sec)
  reduce_or :    53.2 Mvals/sec, (53.2 Mcalls/sec)
comparisons  vint16
  operator< :   582.8 Mvals/sec, (36.4 Mcalls/sec)
  operator> :   582.9 Mvals/sec, (36.4 Mcalls/sec)
  operator<=:   322.1 Mvals/sec, (20.1 Mcalls/sec)
  operator>=:   319.5 Mvals/sec, (20.0 Mcalls/sec)
  operator==:   569.7 Mvals/sec, (35.6 Mcalls/sec)
  operator!=:   449.8 Mvals/sec, (28.1 Mcalls/sec)
shuffle  vint16
  shuffle4<> :  1041.4 Mvals/sec, (65.1 Mcalls/sec)
  shuffle<> :  1041.0 Mvals/sec, (65.1 Mcalls/sec)
blend  vint16
  blend:   509.0 Mvals/sec, (31.8 Mcalls/sec)
  blend0:   398.3 Mvals/sec, (24.9 Mcalls/sec)
  blend0not:   386.8 Mvals/sec, (24.2 Mcalls/sec)
test converting vint16 to uint16 
  load from uint16:  1248.3 Mvals/sec, (78.0 Mcalls/sec)
  convert to uint16:  1182.9 Mvals/sec, (73.9 Mcalls/sec)
test converting vint16 to uint16 
  load from uint16:  1246.9 Mvals/sec, (77.9 Mcalls/sec)
  convert to uint16:  1184.0 Mvals/sec, (74.0 Mcalls/sec)
shift  vint16
  [-80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000] >>  1 == [-40000000 -40000000 -40000000 -40000000 -40000000 -40000000 -40000000 -40000000 -40000000 -40000000 -40000000 -40000000 -40000000 -40000000 -40000000 -40000000]
  [-80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000] srl 1 == [40000000 40000000 40000000 40000000 40000000 40000000 40000000 40000000 40000000 40000000 40000000 40000000 40000000 40000000 40000000 40000000]
  [-80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000] >>  4 == [-8000000 -8000000 -8000000 -8000000 -8000000 -8000000 -8000000 -8000000 -8000000 -8000000 -8000000 -8000000 -8000000 -8000000 -8000000 -8000000]
  [-80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000 -80000000] srl 4 == [8000000 8000000 8000000 8000000 8000000 8000000 8000000 8000000 8000000 8000000 8000000 8000000 8000000 8000000 8000000 8000000]
  [-1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1] >>  1 == [-1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1]
  [-1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1] srl 1 == [7fffffff 7fffffff 7fffffff 7fffffff 7fffffff 7fffffff 7fffffff 7fffffff 7fffffff 7fffffff 7fffffff 7fffffff 7fffffff 7fffffff 7fffffff 7fffffff]
  [-1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1] >>  4 == [-1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1]
  [-1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1] srl 4 == [fffffff fffffff fffffff fffffff fffffff fffffff fffffff fffffff fffffff fffffff fffffff fffffff fffffff fffffff fffffff fffffff]
  [ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff] >>  1 == [7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff]
  [ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff] srl 1 == [7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff 7fff]
  [ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff] >>  4 == [fff fff fff fff fff fff fff fff fff fff fff fff fff fff fff fff]
  [ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff] srl 4 == [fff fff fff fff fff fff fff fff fff fff fff fff fff fff fff fff]
  [3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3] >>  1 == [1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1]
  [3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3] srl 1 == [1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1]
  [3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3] >>  4 == [0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0]
  [3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3] srl 4 == [0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0]
  operator<<:   695.5 Mvals/sec, (43.5 Mcalls/sec)
  operator>>:   669.8 Mvals/sec, (41.9 Mcalls/sec)
  srl       :   713.7 Mvals/sec, (44.6 Mcalls/sec)
  rotl      :   313.9 Mvals/sec, (19.6 Mcalls/sec)

vbool4

shuffle  vbool4
  shuffle<...> :  1205.9 Mvals/sec, (301.5 Mcalls/sec)
  shuffle<0> :  1300.0 Mvals/sec, (325.0 Mcalls/sec)
  shuffle<1> :  1321.0 Mvals/sec, (330.3 Mcalls/sec)
  shuffle<2> :  1321.0 Mvals/sec, (330.3 Mcalls/sec)
  shuffle<3> :  1320.6 Mvals/sec, (330.1 Mcalls/sec)
component_access  vbool4
bitwise  vbool4
  operator&:   432.6 Mvals/sec, (108.1 Mcalls/sec)
  operator|:   483.7 Mvals/sec, (120.9 Mcalls/sec)
  operator^:   402.5 Mvals/sec, (100.6 Mcalls/sec)
  operator!:   423.7 Mvals/sec, (105.9 Mcalls/sec)
  reduce_and:   168.7 Mvals/sec, (168.7 Mcalls/sec)
  reduce_or :   155.7 Mvals/sec, (155.7 Mcalls/sec)

vbool8

shuffle  vbool8
  shuffle<...> :  1934.7 Mvals/sec, (241.8 Mcalls/sec)
  shuffle<0> :  2521.3 Mvals/sec, (315.2 Mcalls/sec)
  shuffle<1> :  2560.0 Mvals/sec, (320.0 Mcalls/sec)
  shuffle<2> :  2560.0 Mvals/sec, (320.0 Mcalls/sec)
  shuffle<3> :  2560.0 Mvals/sec, (320.0 Mcalls/sec)
  shuffle<4> :  2521.3 Mvals/sec, (315.2 Mcalls/sec)
  shuffle<5> :  2560.0 Mvals/sec, (320.0 Mcalls/sec)
  shuffle<6> :  2560.0 Mvals/sec, (320.0 Mcalls/sec)
  shuffle<7> :  2521.3 Mvals/sec, (315.2 Mcalls/sec)
component_access  vbool8
bitwise  vbool8
  operator&:   635.2 Mvals/sec, (79.4 Mcalls/sec)
  operator|:   786.6 Mvals/sec, (98.3 Mcalls/sec)
  operator^:   637.5 Mvals/sec, (79.7 Mcalls/sec)
  operator!:   636.8 Mvals/sec, (79.6 Mcalls/sec)
  reduce_and:   124.8 Mvals/sec, (124.8 Mcalls/sec)
  reduce_or :   118.5 Mvals/sec, (118.5 Mcalls/sec)

vbool16

component_access  vbool16
bitwise  vbool16
  operator&:  2919.7 Mvals/sec, (182.5 Mcalls/sec)
  operator|:  2919.2 Mvals/sec, (182.4 Mcalls/sec)
  operator^:  3139.7 Mvals/sec, (196.2 Mcalls/sec)
  operator!:  4322.0 Mvals/sec, (270.1 Mcalls/sec)
  reduce_and:   340.9 Mvals/sec, (340.9 Mcalls/sec)
  reduce_or :   330.3 Mvals/sec, (330.3 Mcalls/sec)

Odds and ends

constants 
  vfloat4 = float(const):  1459.9 Mvals/sec, (365.0 Mcalls/sec)
  vfloat4 = Zero():  1341.8 Mvals/sec, (335.5 Mcalls/sec)
  vfloat4 = One():  1341.8 Mvals/sec, (335.5 Mcalls/sec)
  vfloat4 = Iota():  1459.9 Mvals/sec, (365.0 Mcalls/sec)
  vfloat8 = float(const):  2560.0 Mvals/sec, (320.0 Mcalls/sec)
  vfloat8 = Zero():  2599.9 Mvals/sec, (325.0 Mcalls/sec)
  vfloat8 = One():  2560.0 Mvals/sec, (320.0 Mcalls/sec)
  vfloat8 = Iota():  2728.5 Mvals/sec, (341.1 Mcalls/sec)
  vfloat16 = float(const):  4557.1 Mvals/sec, (284.8 Mcalls/sec)
  vfloat16 = Zero():  4621.6 Mvals/sec, (288.9 Mcalls/sec)
  vfloat16 = One():  4620.3 Mvals/sec, (288.8 Mcalls/sec)
  vfloat16 = Iota():  4823.6 Mvals/sec, (301.5 Mcalls/sec)
special 
metaprogramming 
Testing matrix ops: 
  P = (1 0 0)
  Mtrans = (  1.000000e+00   0.000000e+00   0.000000e+00   0.000000e+00
   0.000000e+00   1.000000e+00   0.000000e+00   0.000000e+00
   0.000000e+00   0.000000e+00   1.000000e+00   0.000000e+00
   1.000000e+01   1.100000e+01   1.200000e+01   1.000000e+00)

  Mrot   = ( -4.371139e-08  -0.000000e+00  -1.000000e+00  -0.000000e+00
   0.000000e+00   1.000000e+00   0.000000e+00   0.000000e+00
   1.000000e+00   0.000000e+00  -4.371139e-08   0.000000e+00
   0.000000e+00   0.000000e+00   0.000000e+00   1.000000e+00)

  P translated = 11 11 12
  P rotated = -4.37114e-08 0 -1
  P rotated by the transpose = -4.37114e-08 0 -1
  Mrot transposed = ( -4.371139e-08   0.000000e+00   1.000000e+00   0.000000e+00
  -0.000000e+00   1.000000e+00   0.000000e+00   0.000000e+00
  -1.000000e+00   0.000000e+00  -4.371139e-08   0.000000e+00
  -0.000000e+00   0.000000e+00   0.000000e+00   1.000000e+00)

  V4 * M44 Imath:   107.4 Mvals/sec, (107.4 Mcalls/sec)
  M44 * V4 simd:    31.5 Mvals/sec, (31.5 Mcalls/sec)
  V4 * M44 simd:   111.3 Mvals/sec, (111.3 Mcalls/sec)
  transformp Imath:    76.6 Mvals/sec, (76.6 Mcalls/sec)
  transformp Imath with simd:    60.2 Mvals/sec, (60.2 Mcalls/sec)
  transformp simd:    62.2 Mvals/sec, (62.2 Mcalls/sec)
  transpose m44:    68.2 Mvals/sec, (68.2 Mcalls/sec)
  transpose m44 with simd:    68.2 Mvals/sec, (68.2 Mcalls/sec)
  m44 inverse Imath:    21.8 Mvals/sec, (21.8 Mcalls/sec)
  m44 inverse_simd:    18.4 Mvals/sec, (18.4 Mcalls/sec)
  m44 inverse_simd native simd:    18.4 Mvals/sec, (18.4 Mcalls/sec)

Total time: 1m 13.4s
ERRORS!
lgritz commented 2 years ago

Probably a few different things are going on.

Like I said, any of the tests that have "-broken" in the name are simply cases where you haven't built the format support you need for those tests. That's fine. Running ctest with -E broken will skip those tests.

The unit-simd test looks like just a mathematical precision error, some LSB change on the ARM chip compared to Intel maybe?

The openexr-damaged, sorry, that's expected to fail. It's trying to test broken exr files, but it turns out that the precise error messages it prints seems to change quite a bit from one openexr version to another, so I haven't been able to make a stable "reference output" to compare against. Again, that's a test we tend to skip in our own CI.

All the others -- hmmm, I'm not sure. Let's pick just one and investigate more carefully. Just for fun, let's pick psd:

(cd build/testsuite/psd ; ../runtest.py >& out.log)
tar czvf psdresults.tgz build/testsuite/psd

And either attach that file (if not too big) or email it to lg AT openimageio.org.

Tom-python0121 commented 2 years ago

@lgritz I use the openimageio@2.3.17.0 to test and Found the same mistake but The x86_64 test passed in the same test method.The following figure shows the errors reported in the psd-colormodes test in the aarch64 test。

Steps to reproduce the issue

55/69 Test #36: psd-colormodes ...................***Failed   12.66 sec
iconvert ERROR copying "src/pattern2-8-rgb.psd" to "pattern2-8-rgb.psd.tif" :

iconvert ERROR copying "src/pattern2-8-cmyk.psd" to "pattern2-8-cmyk.psd.tif" :

iconvert ERROR copying "src/pattern2-8-multichannel.psd" to "pattern2-8-multichannel.psd.tif" :

iconvert ERROR copying "src/pattern2-8-grayscale.psd" to "pattern2-8-grayscale.psd.tif" :

iconvert ERROR copying "src/pattern2-8-indexed.psd" to "pattern2-8-indexed.psd.tif" :

Comparing "pattern2-8-rgb.psd.tif" and "ref/out.txt"
idiff ERROR: Could not read ref/out.txt:
        Invalid image file "ref/out.txt": OpenImageIO could not find a format reader for "ref/out.txt". Is it a file format that OpenImageIO doesn't know about?
Comparing "pattern2-8-rgb.psd.tif" and "ref/pattern2-alpha.tif"
200 x 150, 3 channel
  Mean error = 0.640395
  RMS error = 0.765747
  Peak SNR = 2.3183
  Max error  = 1 @ (0, 0, R)  values are 0, 0, 0 vs 1, 1, 1, 1
  30000 pixels (100%) over 0.016
  30000 pixels (100%) over 0.008
FAILURE
Comparing "pattern2-8-rgb.psd.tif" and "ref/pattern2-cmyk.tif"
200 x 150, 3 channel
  Mean error = 0.540351
  RMS error = 0.684572
  Peak SNR = 3.29161
  Max error  = 1 @ (0, 0, R)  values are 0, 0, 0 vs 1, 1, 1
  30000 pixels (100%) over 0.016
  30000 pixels (100%) over 0.008
FAILURE
Comparing "pattern2-8-rgb.psd.tif" and "ref/pattern2-gray.tif"
200 x 150, 3 channel
  Mean error = 0.172971
  RMS error = 0.376413
  Peak SNR = 8.48672
  Max error  = 1 @ (0, 0, R)  values are 0, 0, 0 vs 1
  27696 pixels (92.3%) over 0.016
  27696 pixels (92.3%) over 0.008
FAILURE
Comparing "pattern2-8-rgb.psd.tif" and "ref/pattern2.tif"
200 x 150, 3 channel
  Mean error = 0.520527
  RMS error = 0.669694
  Peak SNR = 3.48247
  Max error  = 1 @ (0, 0, R)  values are 0, 0, 0 vs 1, 1, 1
  27696 pixels (92.3%) over 0.016
  27696 pixels (92.3%) over 0.008
FAILURE
idiff ERROR: Could not read ref/pattern2-8-rgb.psd.tif:
        Invalid image file "ref/pattern2-8-rgb.psd.tif": Could not open file: ref/pattern2-8-rgb.psd.tif: No such file or directory
Comparing "pattern2-16-rgb.psd.tif" and "ref/out.txt"
idiff ERROR: Could not read ref/out.txt:
        Invalid image file "ref/out.txt": OpenImageIO could not find a format reader for "ref/out.txt". Is it a file format that OpenImageIO doesn't know about?
Comparing "pattern2-16-rgb.psd.tif" and "ref/pattern2-alpha.tif"
200 x 150, 3 channel
  Mean error = 0.250003
  RMS error = 0.5
  Peak SNR = 6.0206
  Max error  = 1 @ (0, 0, A)  values are 1, 1, 1 vs 1, 1, 1, 1
  30000 pixels (100%) over 0.016
  30000 pixels (100%) over 0.008
FAILURE
Comparing "pattern2-16-rgb.psd.tif" and "ref/pattern2-cmyk.tif"
200 x 150, 3 channel
  Mean error = 0.042967
  RMS error = 0.0829656
  Peak SNR = 21.622
  Max error  = 0.33727 @ (53, 51, B)  values are 0.501945, 0, 0.501945 vs 0.380392, 0, 0.839216
  18432 pixels (61.4%) over 0.016
  18432 pixels (61.4%) over 0.008
FAILURE
Comparing "pattern2-16-rgb.psd.tif" and "ref/pattern2-gray.tif"
200 x 150, 3 channel
  Mean error = 0.387581
  RMS error = 0.557595
  Peak SNR = 5.07362
  Max error  = 1 @ (0, 0, G)  values are 1, 1, 1 vs 1
  27696 pixels (92.3%) over 0.016
  27696 pixels (92.3%) over 0.008
FAILURE
Comparing "pattern2-16-rgb.psd.tif" and "ref/pattern2.tif"
PASS
Comparing "pattern2-8-cmyk.psd.tif" and "ref/out.txt"
idiff ERROR: Could not read ref/out.txt:
        Invalid image file "ref/out.txt": OpenImageIO could not find a format reader for "ref/out.txt". Is it a file format that OpenImageIO doesn't know about?
Comparing "pattern2-8-cmyk.psd.tif" and "ref/pattern2-alpha.tif"
200 x 150, 3 channel
  Mean error = nan
  RMS error = nan
  Peak SNR = nan
  Max error  = inf @ (7, 115, R)  values are nan, 1.49931e+38, 9.41159e+36 vs 1, 1, 1, 1
  30000 pixels (100%) over 0.016
  30000 pixels (100%) over 0.008
FAILURE
Comparing "pattern2-8-cmyk.psd.tif" and "ref/pattern2-cmyk.tif"
200 x 150, 3 channel
  Mean error = nan
  RMS error = nan
  Peak SNR = nan
  Max error  = inf @ (7, 115, R)  values are nan, 1.49931e+38, 9.41159e+36 vs 1, 1, 1
  29999 pixels (100%) over 0.016
  30000 pixels (100%) over 0.008
FAILURE
Comparing "pattern2-8-cmyk.psd.tif" and "ref/pattern2-gray.tif"
200 x 150, 3 channel
  Mean error = nan
  RMS error = nan
  Peak SNR = nan
  Max error  = inf @ (7, 115, R)  values are nan, 1.49931e+38, 9.41159e+36 vs 1
  29453 pixels (98.2%) over 0.016
  29460 pixels (98.2%) over 0.008
FAILURE
Comparing "pattern2-8-cmyk.psd.tif" and "ref/pattern2.tif"
200 x 150, 3 channel
  Mean error = nan
  RMS error = nan
  Peak SNR = nan
  Max error  = inf @ (7, 115, R)  values are nan, 1.49931e+38, 9.41159e+36 vs 1, 1, 1
  29459 pixels (98.2%) over 0.016
  29466 pixels (98.2%) over 0.008
FAILURE
idiff ERROR: Could not read ref/pattern2-8-cmyk.psd.tif:
        Invalid image file "ref/pattern2-8-cmyk.psd.tif": Could not open file: ref/pattern2-8-cmyk.psd.tif: No such file or directory
Comparing "pattern2-16-cmyk.psd.tif" and "ref/out.txt"
idiff ERROR: Could not read ref/out.txt:
        Invalid image file "ref/out.txt": OpenImageIO could not find a format reader for "ref/out.txt". Is it a file format that OpenImageIO doesn't know about?
Comparing "pattern2-16-cmyk.psd.tif" and "ref/pattern2-alpha.tif"
200 x 150, 3 channel
  Mean error = 0.282226
  RMS error = 0.505133
  Peak SNR = 5.93189
  Max error  = 1 @ (0, 0, A)  values are 1, 1, 1 vs 1, 1, 1, 1
  30000 pixels (100%) over 0.016
  30000 pixels (100%) over 0.008
FAILURE
Comparing "pattern2-16-cmyk.psd.tif" and "ref/pattern2-cmyk.tif"
PASS
Comparing "pattern2-8-multichannel.psd.tif" and "ref/out.txt"
idiff ERROR: Could not read ref/out.txt:
        Invalid image file "ref/out.txt": OpenImageIO could not find a format reader for "ref/out.txt". Is it a file format that OpenImageIO doesn't know about?
Comparing "pattern2-8-multichannel.psd.tif" and "ref/pattern2-alpha.tif"
200 x 150, 3 channel
  Mean error = 0.640395
  RMS error = 0.765747
  Peak SNR = 2.3183
  Max error  = 1 @ (0, 0, R)  values are 0, 0, 0 vs 1, 1, 1, 1
  30000 pixels (100%) over 0.016
  30000 pixels (100%) over 0.008
FAILURE
Comparing "pattern2-8-multichannel.psd.tif" and "ref/pattern2-cmyk.tif"
200 x 150, 3 channel
  Mean error = 0.540351
  RMS error = 0.684572
  Peak SNR = 3.29161
  Max error  = 1 @ (0, 0, R)  values are 0, 0, 0 vs 1, 1, 1
  30000 pixels (100%) over 0.016
  30000 pixels (100%) over 0.008
FAILURE
Comparing "pattern2-8-multichannel.psd.tif" and "ref/pattern2-gray.tif"
200 x 150, 3 channel
  Mean error = 0.172971
  RMS error = 0.376413
  Peak SNR = 8.48672
  Max error  = 1 @ (0, 0, R)  values are 0, 0, 0 vs 1
  27696 pixels (92.3%) over 0.016
  27696 pixels (92.3%) over 0.008
FAILURE
Comparing "pattern2-8-multichannel.psd.tif" and "ref/pattern2.tif"
200 x 150, 3 channel
  Mean error = 0.520527
  RMS error = 0.669694
  Peak SNR = 3.48247
  Max error  = 1 @ (0, 0, R)  values are 0, 0, 0 vs 1, 1, 1
  27696 pixels (92.3%) over 0.016
  27696 pixels (92.3%) over 0.008
FAILURE
idiff ERROR: Could not read ref/pattern2-8-multichannel.psd.tif:
        Invalid image file "ref/pattern2-8-multichannel.psd.tif": Could not open file: ref/pattern2-8-multichannel.psd.tif: No such file or directory
Comparing "pattern2-16-multichannel.psd.tif" and "ref/out.txt"
idiff ERROR: Could not read ref/out.txt:
        Invalid image file "ref/out.txt": OpenImageIO could not find a format reader for "ref/out.txt". Is it a file format that OpenImageIO doesn't know about?
Comparing "pattern2-16-multichannel.psd.tif" and "ref/pattern2-alpha.tif"
200 x 150, 3 channel
  Mean error = 0.250003
  RMS error = 0.5
  Peak SNR = 6.0206
  Max error  = 1 @ (0, 0, A)  values are 1, 1, 1 vs 1, 1, 1, 1
  30000 pixels (100%) over 0.016
  30000 pixels (100%) over 0.008
FAILURE
Comparing "pattern2-16-multichannel.psd.tif" and "ref/pattern2-cmyk.tif"
200 x 150, 3 channel
  Mean error = 0.042967
  RMS error = 0.0829656
  Peak SNR = 21.622
  Max error  = 0.33727 @ (53, 51, B)  values are 0.501945, 0, 0.501945 vs 0.380392, 0, 0.839216
  18432 pixels (61.4%) over 0.016
  18432 pixels (61.4%) over 0.008
FAILURE
Comparing "pattern2-16-multichannel.psd.tif" and "ref/pattern2-gray.tif"
200 x 150, 3 channel
  Mean error = 0.387581
  RMS error = 0.557595
  Peak SNR = 5.07362
  Max error  = 1 @ (0, 0, G)  values are 1, 1, 1 vs 1
  27696 pixels (92.3%) over 0.016
  27696 pixels (92.3%) over 0.008
FAILURE
Comparing "pattern2-16-multichannel.psd.tif" and "ref/pattern2.tif"
PASS
Comparing "pattern2-8-grayscale.psd.tif" and "ref/out.txt"
idiff ERROR: Could not read ref/out.txt:
        Invalid image file "ref/out.txt": OpenImageIO could not find a format reader for "ref/out.txt". Is it a file format that OpenImageIO doesn't know about?
Comparing "pattern2-8-grayscale.psd.tif" and "ref/pattern2-alpha.tif"
200 x 150, 1 channel
  Mean error = 0.640395
  RMS error = 0.765747
  Peak SNR = 2.3183
  Max error  = 1 @ (0, 0, Y)  values are 0 vs 1, 1, 1, 1
  30000 pixels (100%) over 0.016
  30000 pixels (100%) over 0.008
FAILURE
Comparing "pattern2-8-grayscale.psd.tif" and "ref/pattern2-cmyk.tif"
200 x 150, 1 channel
  Mean error = 0.540351
  RMS error = 0.684572
  Peak SNR = 3.29161
  Max error  = 1 @ (0, 0, Y)  values are 0 vs 1, 1, 1
  30000 pixels (100%) over 0.016
  30000 pixels (100%) over 0.008
FAILURE
Comparing "pattern2-8-grayscale.psd.tif" and "ref/pattern2-gray.tif"
200 x 150, 1 channel
  Mean error = 0.518912
  RMS error = 0.651966
  Peak SNR = 3.7155
  Max error  = 1 @ (0, 0, Y)  values are 0 vs 1
  27696 pixels (92.3%) over 0.016
  27696 pixels (92.3%) over 0.008
FAILURE
Comparing "pattern2-8-grayscale.psd.tif" and "ref/pattern2.tif"
200 x 150, 1 channel
  Mean error = 0.520527
  RMS error = 0.669694
  Peak SNR = 3.48247
  Max error  = 1 @ (0, 0, Y)  values are 0 vs 1, 1, 1
  27696 pixels (92.3%) over 0.016
  27696 pixels (92.3%) over 0.008
FAILURE
idiff ERROR: Could not read ref/pattern2-8-grayscale.psd.tif:
        Invalid image file "ref/pattern2-8-grayscale.psd.tif": Could not open file: ref/pattern2-8-grayscale.psd.tif: No such file or directory
Comparing "pattern2-16-grayscale.psd.tif" and "ref/out.txt"
idiff ERROR: Could not read ref/out.txt:
        Invalid image file "ref/out.txt": OpenImageIO could not find a format reader for "ref/out.txt". Is it a file format that OpenImageIO doesn't know about?
Comparing "pattern2-16-grayscale.psd.tif" and "ref/pattern2-alpha.tif"
200 x 150, 1 channel
  Mean error = 0.540691
  RMS error = 0.695116
  Peak SNR = 3.15886
  Max error  = 1 @ (0, 0, G)  values are 0.999969 vs 1, 1, 1, 1
  30000 pixels (100%) over 0.016
  30000 pixels (100%) over 0.008
FAILURE
Comparing "pattern2-16-grayscale.psd.tif" and "ref/pattern2-cmyk.tif"
200 x 150, 1 channel
  Mean error = 0.395826
  RMS error = 0.57531
  Peak SNR = 4.80197
  Max error  = 1 @ (0, 0, G)  values are 0.999969 vs 1, 1, 1
  30000 pixels (100%) over 0.016
  30000 pixels (100%) over 0.008
FAILURE
Comparing "pattern2-16-grayscale.psd.tif" and "ref/pattern2-gray.tif"
PASS
Comparing "pattern2-8-indexed.psd.tif" and "ref/out.txt"
idiff ERROR: Could not read ref/out.txt:
        Invalid image file "ref/out.txt": OpenImageIO could not find a format reader for "ref/out.txt". Is it a file format that OpenImageIO doesn't know about?
Comparing "pattern2-8-indexed.psd.tif" and "ref/pattern2-alpha.tif"
200 x 150, 4 channel
  Mean error = 0.640395
  RMS error = 0.765747
  Peak SNR = 2.3183
  Max error  = 1 @ (0, 0, R)  values are 0, 0, 0, 0 vs 1, 1, 1, 1
  30000 pixels (100%) over 0.016
  30000 pixels (100%) over 0.008
FAILURE
Comparing "pattern2-8-indexed.psd.tif" and "ref/pattern2-cmyk.tif"
200 x 150, 4 channel
  Mean error = 0.405263
  RMS error = 0.592857
  Peak SNR = 4.541
  Max error  = 1 @ (0, 0, R)  values are 0, 0, 0, 0 vs 1, 1, 1
  30000 pixels (100%) over 0.016
  30000 pixels (100%) over 0.008
FAILURE
Comparing "pattern2-8-indexed.psd.tif" and "ref/pattern2-gray.tif"
200 x 150, 4 channel
  Mean error = 0.129728
  RMS error = 0.325983
  Peak SNR = 9.7361
  Max error  = 1 @ (0, 0, R)  values are 0, 0, 0, 0 vs 1
  27696 pixels (92.3%) over 0.016
  27696 pixels (92.3%) over 0.008
FAILURE
Comparing "pattern2-8-indexed.psd.tif" and "ref/pattern2.tif"
200 x 150, 4 channel
  Mean error = 0.390395
  RMS error = 0.579972
  Peak SNR = 4.73186
  Max error  = 1 @ (0, 0, R)  values are 0, 0, 0, 0 vs 1, 1, 1
  27696 pixels (92.3%) over 0.016
  27696 pixels (92.3%) over 0.008
FAILURE
idiff ERROR: Could not read ref/pattern2-8-indexed.psd.tif:
        Invalid image file "ref/pattern2-8-indexed.psd.tif": Could not open file: ref/pattern2-8-indexed.psd.tif: No such file or directory
-rw-r--r--. 1 root root 1642 Jul  5 16:42 out.txt
-rw-rw-r--. 1 root root 1334 Jul  2 08:57 ref/out.txt
newsymlink /home/stage/root/spack-stage-openimageio-2.3.17.0-ik6ogbre7fflaqql5vorke6icq6lpiy6/spack-src/testsuite/psd-colormodes/ref ./ref
newsymlink /home/stage/root/spack-stage-openimageio-2.3.17.0-ik6ogbre7fflaqql5vorke6icq6lpiy6/spack-src/testsuite/psd-colormodes/src ./src
newsymlink /home/stage/root/spack-stage-openimageio-2.3.17.0-ik6ogbre7fflaqql5vorke6icq6lpiy6/spack-src/testsuite/psd-colormodes ./data
command = ../../bin/iconvert  src/pattern2-8-rgb.psd  pattern2-8-rgb.psd.tif >> out.txt ;
../../bin/idiff  -a src/pattern2-8-rgb.psd -fail 0.008 -failpercent 0.02 -hardfail 0.012 -warn 0.016  pattern2-8-rgb.psd.tif >> out.txt ;
../../bin/iconvert  src/pattern2-16-rgb.psd  pattern2-16-rgb.psd.tif >> out.txt ;
../../bin/idiff  -a src/pattern2-16-rgb.psd -fail 0.008 -failpercent 0.02 -hardfail 0.012 -warn 0.016  pattern2-16-rgb.psd.tif >> out.txt ;
../../bin/iconvert  src/pattern2-8-cmyk.psd  pattern2-8-cmyk.psd.tif >> out.txt ;
../../bin/idiff  -a src/pattern2-8-cmyk.psd -fail 0.008 -failpercent 0.02 -hardfail 0.012 -warn 0.016  pattern2-8-cmyk.psd.tif >> out.txt ;
../../bin/iconvert  src/pattern2-16-cmyk.psd  pattern2-16-cmyk.psd.tif >> out.txt ;
../../bin/idiff  -a src/pattern2-16-cmyk.psd -fail 0.008 -failpercent 0.02 -hardfail 0.012 -warn 0.016  pattern2-16-cmyk.psd.tif >> out.txt ;
../../bin/iconvert  src/pattern2-8-multichannel.psd  pattern2-8-multichannel.psd.tif >> out.txt ;
../../bin/idiff  -a src/pattern2-8-multichannel.psd -fail 0.008 -failpercent 0.02 -hardfail 0.012 -warn 0.016  pattern2-8-multichannel.psd.tif >> out.txt ;
../../bin/iconvert  src/pattern2-16-multichannel.psd  pattern2-16-multichannel.psd.tif >> out.txt ;
../../bin/idiff  -a src/pattern2-16-multichannel.psd -fail 0.008 -failpercent 0.02 -hardfail 0.012 -warn 0.016  pattern2-16-multichannel.psd.tif >> out.txt ;
../../bin/iconvert  src/pattern2-8-grayscale.psd  pattern2-8-grayscale.psd.tif >> out.txt ;
../../bin/idiff  -a src/pattern2-8-grayscale.psd -fail 0.008 -failpercent 0.02 -hardfail 0.012 -warn 0.016  pattern2-8-grayscale.psd.tif >> out.txt ;
../../bin/iconvert  src/pattern2-16-grayscale.psd  pattern2-16-grayscale.psd.tif >> out.txt ;
../../bin/idiff  -a src/pattern2-16-grayscale.psd -fail 0.008 -failpercent 0.02 -hardfail 0.012 -warn 0.016  pattern2-16-grayscale.psd.tif >> out.txt ;
../../bin/iconvert  src/pattern2-8-indexed.psd  pattern2-8-indexed.psd.tif >> out.txt ;
../../bin/idiff  -a src/pattern2-8-indexed.psd -fail 0.008 -failpercent 0.02 -hardfail 0.012 -warn 0.016  pattern2-8-indexed.psd.tif >> out.txt ;

#### Error: this command failed:  ../../bin/iconvert  src/pattern2-8-rgb.psd  pattern2-8-rgb.psd.tif >> out.txt
FAIL
#### Error: this command failed:  ../../bin/iconvert  src/pattern2-8-cmyk.psd  pattern2-8-cmyk.psd.tif >> out.txt
FAIL
#### Error: this command failed:  ../../bin/idiff  -a src/pattern2-8-cmyk.psd -fail 0.008 -failpercent 0.02 -hardfail 0.012 -warn 0.016  pattern2-8-cmyk.psd.tif >> out.txt
FAIL
#### Error: this command failed:  ../../bin/iconvert  src/pattern2-8-multichannel.psd  pattern2-8-multichannel.psd.tif >> out.txt
FAIL
#### Error: this command failed:  ../../bin/iconvert  src/pattern2-8-grayscale.psd  pattern2-8-grayscale.psd.tif >> out.txt
FAIL
#### Error: this command failed:  ../../bin/iconvert  src/pattern2-8-indexed.psd  pattern2-8-indexed.psd.tif >> out.txt
FAIL
NO MATCH for pattern2-8-rgb.psd.tif
FAIL pattern2-8-rgb.psd.tif
PASS: pattern2-16-rgb.psd.tif matches ref/pattern2.tif
NO MATCH for pattern2-8-cmyk.psd.tif
FAIL pattern2-8-cmyk.psd.tif
PASS: pattern2-16-cmyk.psd.tif matches ref/pattern2-cmyk.tif
NO MATCH for pattern2-8-multichannel.psd.tif
FAIL pattern2-8-multichannel.psd.tif
PASS: pattern2-16-multichannel.psd.tif matches ref/pattern2.tif
NO MATCH for pattern2-8-grayscale.psd.tif
FAIL pattern2-8-grayscale.psd.tif
PASS: pattern2-16-grayscale.psd.tif matches ref/pattern2-gray.tif
NO MATCH for pattern2-8-indexed.psd.tif
FAIL pattern2-8-indexed.psd.tif
Diff out.txt vs ref/out.txt was:
-------
--- out.txt     Tue Jul  5 16:42:42 2022
+++ ref/out.txt Sat Jul  2 08:57:35 2022
@@ -3,14 +3,7 @@
 Comparing "src/pattern2-16-rgb.psd" and "pattern2-16-rgb.psd.tif"
 PASS
 Comparing "src/pattern2-8-cmyk.psd" and "pattern2-8-cmyk.psd.tif"
-200 x 150, 3 channel
-  Mean error = nan
-  RMS error = nan
-  Peak SNR = nan
-  Max error  = 1.49916e+38 @ (142, 134, G)  values are 5.53718e-39, 6.27186e-39, 3.7523e-38 vs 1.49256e+38, 1.49916e+38, 1.49916e+38
-  21349 pixels (71.2%) over 0.016
-  21414 pixels (71.4%) over 0.008
-FAILURE
+PASS
 Comparing "src/pattern2-16-cmyk.psd" and "pattern2-16-cmyk.psd.tif"
 PASS
 Comparing "src/pattern2-8-multichannel.psd" and "pattern2-8-multichannel.psd.tif"
@@ -23,16 +16,21 @@
 PASS
 Comparing "src/pattern2-8-indexed.psd" and "pattern2-8-indexed.psd.tif"
 PASS
-Comparing "pattern2-8-rgb.psd.tif" and "ref/pattern2-8-rgb.psd.tif"
+Comparing "pattern2-8-rgb.psd.tif" and "ref/pattern2.tif"
+PASS
 Comparing "pattern2-16-rgb.psd.tif" and "ref/pattern2.tif"
 PASS
-Comparing "pattern2-8-cmyk.psd.tif" and "ref/pattern2-8-cmyk.psd.tif"
+Comparing "pattern2-8-cmyk.psd.tif" and "ref/pattern2-cmyk.tif"
+PASS
 Comparing "pattern2-16-cmyk.psd.tif" and "ref/pattern2-cmyk.tif"
 PASS
-Comparing "pattern2-8-multichannel.psd.tif" and "ref/pattern2-8-multichannel.psd.tif"
+Comparing "pattern2-8-multichannel.psd.tif" and "ref/pattern2.tif"
+PASS
 Comparing "pattern2-16-multichannel.psd.tif" and "ref/pattern2.tif"
 PASS
-Comparing "pattern2-8-grayscale.psd.tif" and "ref/pattern2-8-grayscale.psd.tif"
+Comparing "pattern2-8-grayscale.psd.tif" and "ref/pattern2-gray.tif"
+PASS
 Comparing "pattern2-16-grayscale.psd.tif" and "ref/pattern2-gray.tif"
 PASS
-Comparing "pattern2-8-indexed.psd.tif" and "ref/pattern2-8-indexed.psd.tif"
+Comparing "pattern2-8-indexed.psd.tif" and "ref/pattern2-alpha.tif"
+PASS

Diff out.txt vs ref/out.txt was:
-------
--- out.txt     Tue Jul  5 16:42:42 2022
+++ ref/out.txt Sat Jul  2 08:57:35 2022
@@ -3,14 +3,7 @@
 Comparing "src/pattern2-16-rgb.psd" and "pattern2-16-rgb.psd.tif"
 PASS
 Comparing "src/pattern2-8-cmyk.psd" and "pattern2-8-cmyk.psd.tif"
-200 x 150, 3 channel
-  Mean error = nan
-  RMS error = nan
-  Peak SNR = nan
-  Max error  = 1.49916e+38 @ (142, 134, G)  values are 5.53718e-39, 6.27186e-39, 3.7523e-38 vs 1.49256e+38, 1.49916e+38, 1.49916e+38
-  21349 pixels (71.2%) over 0.016
-  21414 pixels (71.4%) over 0.008
-FAILURE
+PASS
 Comparing "src/pattern2-16-cmyk.psd" and "pattern2-16-cmyk.psd.tif"
 PASS
 Comparing "src/pattern2-8-multichannel.psd" and "pattern2-8-multichannel.psd.tif"
@@ -23,16 +16,21 @@
 PASS
 Comparing "src/pattern2-8-indexed.psd" and "pattern2-8-indexed.psd.tif"
 PASS
-Comparing "pattern2-8-rgb.psd.tif" and "ref/pattern2-8-rgb.psd.tif"
+Comparing "pattern2-8-rgb.psd.tif" and "ref/pattern2.tif"
+PASS
 Comparing "pattern2-16-rgb.psd.tif" and "ref/pattern2.tif"
 PASS
-Comparing "pattern2-8-cmyk.psd.tif" and "ref/pattern2-8-cmyk.psd.tif"
+Comparing "pattern2-8-cmyk.psd.tif" and "ref/pattern2-cmyk.tif"
+PASS
 Comparing "pattern2-16-cmyk.psd.tif" and "ref/pattern2-cmyk.tif"
 PASS
-Comparing "pattern2-8-multichannel.psd.tif" and "ref/pattern2-8-multichannel.psd.tif"
+Comparing "pattern2-8-multichannel.psd.tif" and "ref/pattern2.tif"
+PASS
 Comparing "pattern2-16-multichannel.psd.tif" and "ref/pattern2.tif"
 PASS
-Comparing "pattern2-8-grayscale.psd.tif" and "ref/pattern2-8-grayscale.psd.tif"
+Comparing "pattern2-8-grayscale.psd.tif" and "ref/pattern2-gray.tif"
+PASS
 Comparing "pattern2-16-grayscale.psd.tif" and "ref/pattern2-gray.tif"
 PASS
-Comparing "pattern2-8-indexed.psd.tif" and "ref/pattern2-8-indexed.psd.tif"
+Comparing "pattern2-8-indexed.psd.tif" and "ref/pattern2-alpha.tif"
+PASS

Unexpected error: <class 'UnicodeDecodeError'>
Unexpected error: <class 'UnicodeDecodeError'>
Unexpected error: <class 'UnicodeDecodeError'>
Unexpected error: <class 'UnicodeDecodeError'>
NO MATCH for out.txt
FAIL out.txt
-----out.txt----->
Comparing "src/pattern2-8-rgb.psd" and "pattern2-8-rgb.psd.tif"
PASS
Comparing "src/pattern2-16-rgb.psd" and "pattern2-16-rgb.psd.tif"
PASS
Comparing "src/pattern2-8-cmyk.psd" and "pattern2-8-cmyk.psd.tif"
200 x 150, 3 channel
  Mean error = nan
  RMS error = nan
  Peak SNR = nan
  Max error  = 1.49916e+38 @ (142, 134, G)  values are 5.53718e-39, 6.27186e-39, 3.7523e-38 vs 1.49256e+38, 1.49916e+38, 1.49916e+38
  21349 pixels (71.2%) over 0.016
  21414 pixels (71.4%) over 0.008
FAILURE
Comparing "src/pattern2-16-cmyk.psd" and "pattern2-16-cmyk.psd.tif"
PASS
Comparing "src/pattern2-8-multichannel.psd" and "pattern2-8-multichannel.psd.tif"
PASS
Comparing "src/pattern2-16-multichannel.psd" and "pattern2-16-multichannel.psd.tif"
PASS
Comparing "src/pattern2-8-grayscale.psd" and "pattern2-8-grayscale.psd.tif"
PASS
Comparing "src/pattern2-16-grayscale.psd" and "pattern2-16-grayscale.psd.tif"
PASS
Comparing "src/pattern2-8-indexed.psd" and "pattern2-8-indexed.psd.tif"
PASS
Comparing "pattern2-8-rgb.psd.tif" and "ref/pattern2-8-rgb.psd.tif"
Comparing "pattern2-16-rgb.psd.tif" and "ref/pattern2.tif"
PASS
Comparing "pattern2-8-cmyk.psd.tif" and "ref/pattern2-8-cmyk.psd.tif"
Comparing "pattern2-16-cmyk.psd.tif" and "ref/pattern2-cmyk.tif"
PASS
Comparing "pattern2-8-multichannel.psd.tif" and "ref/pattern2-8-multichannel.psd.tif"
Comparing "pattern2-16-multichannel.psd.tif" and "ref/pattern2.tif"
PASS
Comparing "pattern2-8-grayscale.psd.tif" and "ref/pattern2-8-grayscale.psd.tif"
Comparing "pattern2-16-grayscale.psd.tif" and "ref/pattern2-gray.tif"
PASS
Comparing "pattern2-8-indexed.psd.tif" and "ref/pattern2-8-indexed.psd.tif"
<----------
-----ref/out.txt----->
Comparing "src/pattern2-8-rgb.psd" and "pattern2-8-rgb.psd.tif"
PASS
Comparing "src/pattern2-16-rgb.psd" and "pattern2-16-rgb.psd.tif"
PASS
Comparing "src/pattern2-8-cmyk.psd" and "pattern2-8-cmyk.psd.tif"
PASS
Comparing "src/pattern2-16-cmyk.psd" and "pattern2-16-cmyk.psd.tif"
PASS
Comparing "src/pattern2-8-multichannel.psd" and "pattern2-8-multichannel.psd.tif"
PASS
Comparing "src/pattern2-16-multichannel.psd" and "pattern2-16-multichannel.psd.tif"
PASS
Comparing "src/pattern2-8-grayscale.psd" and "pattern2-8-grayscale.psd.tif"
PASS
Comparing "src/pattern2-16-grayscale.psd" and "pattern2-16-grayscale.psd.tif"
PASS
Comparing "src/pattern2-8-indexed.psd" and "pattern2-8-indexed.psd.tif"
PASS
Comparing "pattern2-8-rgb.psd.tif" and "ref/pattern2.tif"
PASS
Comparing "pattern2-16-rgb.psd.tif" and "ref/pattern2.tif"
PASS
Comparing "pattern2-8-cmyk.psd.tif" and "ref/pattern2-cmyk.tif"
PASS
Comparing "pattern2-16-cmyk.psd.tif" and "ref/pattern2-cmyk.tif"
PASS
Comparing "pattern2-8-multichannel.psd.tif" and "ref/pattern2.tif"
PASS
Comparing "pattern2-16-multichannel.psd.tif" and "ref/pattern2.tif"
PASS
Comparing "pattern2-8-grayscale.psd.tif" and "ref/pattern2-gray.tif"
PASS
Comparing "pattern2-16-grayscale.psd.tif" and "ref/pattern2-gray.tif"
PASS
Comparing "pattern2-8-indexed.psd.tif" and "ref/pattern2-alpha.tif"
PASS
<----------
Diff was:
-------
--- out.txt     Tue Jul  5 16:42:42 2022
+++ ref/out.txt Sat Jul  2 08:57:35 2022
@@ -3,14 +3,7 @@
 Comparing "src/pattern2-16-rgb.psd" and "pattern2-16-rgb.psd.tif"
 PASS
 Comparing "src/pattern2-8-cmyk.psd" and "pattern2-8-cmyk.psd.tif"
-200 x 150, 3 channel
-  Mean error = nan
-  RMS error = nan
-  Peak SNR = nan
-  Max error  = 1.49916e+38 @ (142, 134, G)  values are 5.53718e-39, 6.27186e-39, 3.7523e-38 vs 1.49256e+38, 1.49916e+38, 1.49916e+38
-  21349 pixels (71.2%) over 0.016
-  21414 pixels (71.4%) over 0.008
-FAILURE
+PASS
 Comparing "src/pattern2-16-cmyk.psd" and "pattern2-16-cmyk.psd.tif"
 PASS
 Comparing "src/pattern2-8-multichannel.psd" and "pattern2-8-multichannel.psd.tif"
@@ -23,16 +16,21 @@
 PASS
 Comparing "src/pattern2-8-indexed.psd" and "pattern2-8-indexed.psd.tif"
 PASS
-Comparing "pattern2-8-rgb.psd.tif" and "ref/pattern2-8-rgb.psd.tif"
+Comparing "pattern2-8-rgb.psd.tif" and "ref/pattern2.tif"
+PASS
 Comparing "pattern2-16-rgb.psd.tif" and "ref/pattern2.tif"
 PASS
-Comparing "pattern2-8-cmyk.psd.tif" and "ref/pattern2-8-cmyk.psd.tif"
+Comparing "pattern2-8-cmyk.psd.tif" and "ref/pattern2-cmyk.tif"
+PASS
 Comparing "pattern2-16-cmyk.psd.tif" and "ref/pattern2-cmyk.tif"
 PASS
-Comparing "pattern2-8-multichannel.psd.tif" and "ref/pattern2-8-multichannel.psd.tif"
+Comparing "pattern2-8-multichannel.psd.tif" and "ref/pattern2.tif"
+PASS
 Comparing "pattern2-16-multichannel.psd.tif" and "ref/pattern2.tif"
 PASS
-Comparing "pattern2-8-grayscale.psd.tif" and "ref/pattern2-8-grayscale.psd.tif"
+Comparing "pattern2-8-grayscale.psd.tif" and "ref/pattern2-gray.tif"
+PASS
 Comparing "pattern2-16-grayscale.psd.tif" and "ref/pattern2-gray.tif"
 PASS
-Comparing "pattern2-8-indexed.psd.tif" and "ref/pattern2-8-indexed.psd.tif"
+Comparing "pattern2-8-indexed.psd.tif" and "ref/pattern2-alpha.tif"
+PASS

Currently, these test cases report errors in aarch64, but the test is successful in x86_64: 90% tests passed, 7 tests failed out of 69

Total Test time (real) = 94.69 sec

The following tests FAILED: 2 - maketx (Failed) 32 - openexr-damaged (Failed) 35 - psd (Failed) 36 - psd-colormodes (Failed) 41 - tiff-depths (Failed) 44 - zfile (Failed) 58 - unit_simd (Failed) Errors while running CTest

lgritz commented 2 years ago

Can you please do a couple more things (after the failure above):

ls -al -R build/testsuite/psd           # and post the output

tar czvf psdresults.tgz build/testsuite/psd       # and email me the tgz file
Tom-python0121 commented 2 years ago

it shows:

Steps to reproduce the issue

[root@localhost spack-stage-openimageio-2.3.17.0-ik6ogbre7fflaqql5vorke6icq6lpiy6]# ls -al -R build/testsuite/psd
build/testsuite/psd:
total 60
drwxr-xr-x.   2 root root    94 Jul  5 16:42 .
drwxr-xr-x. 135 root root  8192 Jul  5 16:32 ..
lrwxrwxrwx.   1 root root   106 Jul  5 16:42 data -> /home/stage/root/spack-stage-openimageio-2.3.17.0-ik6ogbre7fflaqql5vorke6icq6lpiy6/spack-src/testsuite/psd
-rw-r--r--.   1 root root     0 Jul  5 16:42 out.err.txt
-rw-r--r--.   1 root root 39250 Jul  5 16:42 out.txt
-rw-r--r--.   1 root root  6526 Jul  5 16:42 out.txt.diff
lrwxrwxrwx.   1 root root   110 Jul  5 16:42 ref -> /home/stage/root/spack-stage-openimageio-2.3.17.0-ik6ogbre7fflaqql5vorke6icq6lpiy6/spack-src/testsuite/psd/ref
lrwxrwxrwx.   1 root root   110 Jul  5 16:42 src -> /home/stage/root/spack-stage-openimageio-2.3.17.0-ik6ogbre7fflaqql5vorke6icq6lpiy6/spack-src/testsuite/psd/src
[root@localhost spack-stage-openimageio-2.3.17.0-ik6ogbre7fflaqql5vorke6icq6lpiy6]# tar czvf psdresults.tgz build/testsuite/psd
build/testsuite/psd/
build/testsuite/psd/ref
build/testsuite/psd/src
build/testsuite/psd/data
build/testsuite/psd/out.txt
build/testsuite/psd/out.err.txt
build/testsuite/psd/out.txt.diff

sorry, because I am using a Huawei internal network and cannot email you the tgz file

lgritz commented 2 years ago

For PSD (and maybe the others), I'm betting that there's some subtle bug related to the machine being "big endian" (someplace we are failing to swap bytes, or perhaps are swapping bytes when that is only required on Intel but not ARM). It is very hard to guess without having a big-endian machine on which I can try to debug it.

Does anybody out there have easy access to an aarch64 linux, or a Mac ARM machine who can try this test and see directly what's going on?

lgritz commented 2 years ago

I tried this and just merged it: https://github.com/OpenImageIO/oiio/pull/3469 So if you pull the current master and try with it, maybe we'll get a better error message about exactly what point within the PSD read had the problems.

Tom-python0121 commented 2 years ago

@lgritz I have tested the latest 2.4.5.0 version and found that the following test cases report errors after ctest is used:

Steps to reproduce the issue

[root@bigdata build]ctest -E '(oiiotool-subimage|oiiotool-text|broken|texture-.*|unit_timer)' -j126  --output-on-failure
Total time: 1m 31.7s
ERRORS!
91% tests passed, 4 tests failed out of 47

Total Test time (real) =  92.45 sec

The following tests FAILED:
          1 - cmake-consumer (Failed)
          5 - oiiotool (Failed)
          8 - oiiotool-copy (Failed)
         17 - maketx (Failed)
         18 - oiiotool-maketx (Failed)
         37 - unit_simd (Failed)
Errors while running CTest