NVIDIA / Fuser

A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
Other
271 stars 53 forks source link

warp-specialization #3400

Open zasdfgbnm opened 1 week ago

zasdfgbnm commented 1 week ago
 Time (%)  Total Time (ns)  Instances  Avg (ns)  Med (ns)  Min (ns)  Max (ns)  StdDev (ns)
                    Name
 --------  ---------------  ---------  --------  --------  --------  --------  -----------  ----------------------------------------------------------------------------------------------------
     36.1           146046          1  146046.0  146046.0    146046    146046          0.0  <unnamed>::nvfuser_none_f0_c0_r0_g0(<unnamed>::Tensor<<unnamed>::__half, (int)3, (int)3>, <unnamed>…
     21.7            87839          1   87839.0   87839.0     87839     87839          0.0  nvjet_hsh_256x128_64x4_1x2_h_bz_coopA_NTT

60.1%