gogyzzz / iip_sph_pp

C library for speech pre-processing.
Mozilla Public License 2.0

Broadcasting test case info #112

Closed kooBH closed 6 years ago

kooBH commented 6 years ago

d_broadcast_0_add.bin : 1x1x4 + 1x128x4 = 1x128x4
d_broadcast_1_add.bin : 1x1x4 + 128x1x4 = 128x1x4
d_broadcast_2_add.bin : 1x128x4 + 1x1x4 = 1x128x4
d_broadcast_3_add.bin : 128x1x4 + 1x1x1 = 128x1x4
d_broadcast_4_mul.bin : 1x1x4 * 128x128x1 = 128x128x4
d_broadcast_5_mul.bin : 128x128x4 * 1x1x1 = 128x128x4
d_broadcast_6_mul.bin : 1x128x1 * 128x1x4 = 128x128x4
d_broadcast_7_mul.bin : 128x1x1 * 1x128x4 = 128x128x4
d_broadcast_8_mul.bin : 1x128x1 * 128x128x4 = 128x128x4
d_broadcast_9_div.bin : 128x1x1 / 128x128x1 = 128x128x1
d_broadcast_10_div.bin : 128x128x1 / 1x128x1 = 128x128x1
d_broadcast_11_div.bin : 128x128x1 / 128x1x1 = 128x128x1
d_broadcast_12_div.bin : 128x128x1 / 128x128x1 = 128x128x1
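
For reference, the shapes listed above are consistent with the usual broadcasting rule, where a dimension of size 1 is stretched to match the other operand. The following is a minimal sketch of that rule for 3-D shapes, not the library's actual code: `broadcast_shape3` and `broadcast_add3` are hypothetical names, and the row-major contiguous layout is an assumption made only for this illustration.

```c
#include <stddef.h>

/* Hypothetical helper (not part of iip_sph_pp): compute the broadcast
 * result shape of two 3-D shapes. A dimension of size 1 is stretched to
 * the other operand's size. Returns 1 on success, 0 if incompatible. */
int broadcast_shape3(const size_t a[3], const size_t b[3], size_t out[3])
{
    for (int i = 0; i < 3; i++) {
        if (a[i] == b[i])   out[i] = a[i];
        else if (a[i] == 1) out[i] = b[i];
        else if (b[i] == 1) out[i] = a[i];
        else return 0; /* sizes differ and neither is 1 */
    }
    return 1;
}

/* Hypothetical element-wise add with broadcasting.
 * ASSUMPTION: row-major contiguous storage; the library's real layout
 * may differ. `out` must hold o[0]*o[1]*o[2] elements. */
int broadcast_add3(const double *x, const size_t a[3],
                   const double *y, const size_t b[3],
                   double *out, size_t o[3])
{
    if (!broadcast_shape3(a, b, o)) return 0;

    for (size_t i = 0; i < o[0]; i++) {
        for (size_t j = 0; j < o[1]; j++) {
            for (size_t k = 0; k < o[2]; k++) {
                /* A size-1 dimension always reads index 0. */
                size_t xi = (a[0] == 1) ? 0 : i;
                size_t xj = (a[1] == 1) ? 0 : j;
                size_t xk = (a[2] == 1) ? 0 : k;
                size_t yi = (b[0] == 1) ? 0 : i;
                size_t yj = (b[1] == 1) ? 0 : j;
                size_t yk = (b[2] == 1) ? 0 : k;

                out[(i * o[1] + j) * o[2] + k] =
                    x[(xi * a[1] + xj) * a[2] + xk] +
                    y[(yi * b[1] + yj) * b[2] + yk];
            }
        }
    }
    return 1;
}
```

Calling `broadcast_shape3` with `{1,1,4}` and `{128,1,4}` yields `{128,1,4}`, matching the `d_broadcast_1_add.bin` case; the mul and div cases would follow the same shape rule with a different operator in the inner loop.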

kooBH commented 6 years ago

d_1_4_2_cMatmul.bin : d_1_4_2.bin * d_4_4_2.bin = (1,4,2)
d_4_4_2_Abs.bin
d_4_4_2_Add.bin : Add self
d_4_4_2_Asum.bin // needs discussion
d_4_4_2_Ceil.bin
d_4_4_2_Diagonal.bin : d0 1 d2
d_4_4_2_EXP.bin
d_4_4_2_Floor.bin
d_4_4_2_Invert.bin
d_4_4_2_Log10.bin
d_4_4_2_Log2.bin
d_4_4_2_Log.bin
d_4_4_2_Permute132.bin : d0 d2 d1
d_4_4_2_Permute213.bin : d1 d0 d2
d_4_4_2_Permute231.bin : d1 d2 d0
d_4_4_2_Permute312.bin : d2 d0 d1
d_4_4_2_Permute321.bin : d2 d1 d0
d_4_4_2_Pow.bin : ^2
d_4_4_2_Repmat.bin : d0*3 d1*2 d2
d_4_4_2_Reshape.bin : d0*d1*d0 1 1
d_4_4_2_Round.bin
d_4_4_2_Scale.bin // * 2
d_4_4_2_Sqrt.bin
d_4_4_2_Sum.bin // needs discussion
d_4_4_2_Trace.bin : 1 1 d2
d_4_4_2_Transpose.bin : d1 d0 d2
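
The Permute entries above name the output dimensions in terms of the input dimensions d0, d1, d2, with the digits in the file name read as 1-based dimension numbers (for example, Permute231 gives d1 d2 d0). A small sketch of that shape mapping, assuming this reading of the names; `permute_shape3` is a hypothetical helper, not a function from the library:

```c
#include <stddef.h>

/* Hypothetical helper (not part of iip_sph_pp): compute the output shape
 * of a 3-D permute. `perm` holds 1-based source dimension numbers as they
 * appear in the file names, e.g. {2, 3, 1} for Permute231.
 * Returns 1 on success, 0 if `perm` is not a valid permutation of 1..3. */
int permute_shape3(const size_t in[3], const int perm[3], size_t out[3])
{
    int seen[3] = {0, 0, 0};

    for (int i = 0; i < 3; i++) {
        int p = perm[i] - 1;           /* convert to 0-based index */
        if (p < 0 || p > 2 || seen[p]) /* out of range or repeated */
            return 0;
        seen[p] = 1;
        out[i] = in[p];                /* output dim i takes input dim p */
    }
    return 1;
}
```

With the 4x4x2 test input, `{3, 1, 2}` (Permute312, listed as d2 d0 d1) would give an output shape of 2x4x4.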

Roughly this format (128 -> 4, 4 -> 2):
Broadcast_0_add.bin : 1x1x4 + 1x128x4 = 1x128x4
Broadcast_1_add.bin : 1x1x4 + 128x1x4 = 128x1x4
Broadcast_2_add.bin : 1x128x4 + 1x1x4 = 1x128x4
Broadcast_3_add.bin : 128x1x4 + 1x1x1 = 128x1x4
Broadcast_4_mul.bin : 1x1x4 * 128x128x1 = 128x128x4
Broadcast_5_mul.bin : 128x128x4 * 1x1x1 = 128x128x4
Broadcast_6_mul.bin : 1x128x1 * 128x1x4 = 128x128x4
Broadcast_7_mul.bin : 128x1x1 * 1x128x4 = 128x128x4
Broadcast_8_mul.bin : 1x128x1 * 128x128x4 = 128x128x4
Broadcast_9_div.bin : 128x1x1 / 128x128x1 = 128x128x1
Broadcast_10_div.bin : 128x128x1 / 1x128x1 = 128x128x1
Broadcast_11_div.bin : 128x128x1 / 128x1x1 = 128x128x1
Broadcast_12_div.bin : 128x128x1 / 128x128x1 = 128x128x1

The c files follow the same rules.

c_1_4_2_cMatmul.bin
c_4_4_2_cAbs.bin
c_4_4_2_cAdd.bin
c_4_4_2_cAsum.bin
c_4_4_2_cCeil.bin
c_4_4_2_cDiagonal.bin
c_4_4_2_cExp.bin
c_4_4_2_cFloor.bin
c_4_4_2_cInvert.bin
c_4_4_2_cLog10.bin
c_4_4_2_cLog2.bin
c_4_4_2_cLog.bin
c_4_4_2_cPow.bin
c_4_4_2_uPow.bin : ^ (2 + 2i)
c_4_4_2_cRepmat.bin
c_4_4_2_cReshape.bin
c_4_4_2_cRound.bin
c_4_4_2_cScale.bin
c_4_4_2_uScale.bin : * (2 + 2i)
c_4_4_2_cSqrt.bin
c_4_4_2_cSum.bin
c_4_4_2_cTrace.bin
c_4_4_2_cTranspose.bin
c_4_4_2_Permute132.bin
c_4_4_2_Permute213.bin
c_4_4_2_Permute231.bin
c_4_4_2_Permute312.bin
c_4_4_2_Permute321.bin
c_4_4_2_Repmat.bin
cBroadcast_0_add.bin
cBroadcast_10_div.bin
cBroadcast_11_div.bin
cBroadcast_12_div.bin
cBroadcast_1_add.bin
cBroadcast_2_add.bin
cBroadcast_3_add.bin
cBroadcast_4_mul.bin
cBroadcast_5_mul.bin
cBroadcast_6_mul.bin
cBroadcast_7_mul.bin
cBroadcast_8_mul.bin
cBroadcast_9_div.bin

kooBH commented 6 years ago

Corrections

idx 1 : b1 -> b0

idx 3 : b1 -> a0