masterofeye / libyuv

Automatically exported from code.google.com/p/libyuv
0 stars 0 forks source link

YUY2ToARGB performance #495

Closed GoogleCodeExporter closed 8 years ago

GoogleCodeExporter commented 8 years ago
Originally was multistep SSSE3/AVX2.
On AVX2:
YUY2ToARGB_Opt (431 ms)
    68.37%  libyuv_unittest  libyuv_unittest      [.] I422ToARGBRow_AVX2                                                     
    20.79%  libyuv_unittest  libyuv_unittest      [.] YUY2ToUV422Row_AVX2                                                    
     5.51%  libyuv_unittest  libyuv_unittest      [.] YUY2ToYRow_AVX2   

Then changed to 1 step SSSE3
YUY2ToARGB_Opt (396 ms)
    97.04%  libyuv_unittest  libyuv_unittest    [.] YUY2ToARGBRow_SSSE3  

Then ported to AVX2
YUY2ToARGB_Opt (274 ms)
    95.73%  libyuv_unittest  libyuv_unittest      [.] YUY2ToARGBRow_AVX2

Original issue reported on code.google.com by fbarch...@google.com on 24 Sep 2015 at 6:32

GoogleCodeExporter commented 8 years ago

Original comment by fbarch...@google.com on 25 Sep 2015 at 11:20