Closed (kooBH closed 6 years ago)
d_1_4_2_cMatmul.bin : d_1_4_2.bin * d_4_4_2.bin = (1,4,2)
d_4_4_2_Abs.bin
d_4_4_2_Add.bin : Add self
d_4_4_2_Asum.bin // needs discussion
d_4_4_2_Ceil.bin
d_4_4_2_Diagonal.bin : d0 1 d2
d_4_4_2_EXP.bin
d_4_4_2_Floor.bin
d_4_4_2_Invert.bin
d_4_4_2_Log10.bin
d_4_4_2_Log2.bin
d_4_4_2_Log.bin
d_4_4_2_Permute132.bin : d0 d2 d1
d_4_4_2_Permute213.bin : d1 d0 d2
d_4_4_2_Permute231.bin : d1 d2 d0
d_4_4_2_Permute312.bin : d2 d0 d1
d_4_4_2_Permute321.bin : d2 d1 d0
d_4_4_2_Pow.bin : ^2
d_4_4_2_Repmat.bin : d0*3 d1*2 d2
d_4_4_2_Reshape.bin : d0*d1*d2 1 1
d_4_4_2_Round.bin
d_4_4_2_Scale.bin : * 2
d_4_4_2_Sqrt.bin
d_4_4_2_Sum.bin // needs discussion
d_4_4_2_Trace.bin : 1 1 d2
d_4_4_2_Transpose.bin : d1 d0 d2
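As a quick cross-check of the shape annotations in the list above, here is a minimal sketch that maps each shape-changing op to its expected output shape. The function name `expected_shape` and the table are hypothetical helpers, not part of the library. Reshape is assumed to flatten to d0*d1*d2 so that the element count is preserved. Ops without an explicit shape in the list (including Sum and Asum, which are still under discussion) are deliberately left out.

```python
def expected_shape(op, shape):
    """Return the output shape listed for `op` on a 3-D input `shape`.

    Hypothetical helper: only ops with an explicit shape annotation
    in the file list are covered; anything else raises KeyError.
    """
    d0, d1, d2 = shape
    rules = {
        "Diagonal": (d0, 1, d2),
        "Permute132": (d0, d2, d1),
        "Permute213": (d1, d0, d2),
        "Permute231": (d1, d2, d0),
        "Permute312": (d2, d0, d1),
        "Permute321": (d2, d1, d0),
        "Repmat": (d0 * 3, d1 * 2, d2),
        # Assumption: reshape flattens every element into the first axis.
        "Reshape": (d0 * d1 * d2, 1, 1),
        "Trace": (1, 1, d2),
        "Transpose": (d1, d0, d2),
    }
    return rules[op]


if __name__ == "__main__":
    for name in ("Permute231", "Repmat", "Reshape", "Trace"):
        print(name, expected_shape(name, (4, 4, 2)))
```

A test loader could compare the shape stored in each .bin file against this table before comparing values.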
Roughly this format (128 -> 4, 4 -> 2)
Broadcast_0_add.bin : 1x1x4 + 1x128x4 = 1x128x4
Broadcast_1_add.bin : 1x1x4 + 128x1x4 = 128x1x4
Broadcast_2_add.bin : 1x128x4 + 1x1x4 = 1x128x4
Broadcast_3_add.bin : 128x1x4 + 1x1x1 = 128x1x4
Broadcast_4_mul.bin : 1x1x4 * 128x128x1 = 128x128x4
Broadcast_5_mul.bin : 128x128x4 * 1x1x1 = 128x128x4
Broadcast_6_mul.bin : 1x128x1 * 128x1x4 = 128x128x4
Broadcast_7_mul.bin : 128x1x1 * 1x128x4 = 128x128x4
Broadcast_8_mul.bin : 1x128x1 * 128x128x4 = 128x128x4
Broadcast_9_div.bin : 128x1x1 / 128x128x1 = 128x128x1
Broadcast_10_div.bin : 128x128x1 / 1x128x1 = 128x128x1
Broadcast_11_div.bin : 128x128x1 / 128x1x1 = 128x128x1
Broadcast_12_div.bin : 128x128x1 / 128x128x1 = 128x128x1
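The broadcast cases above all seem to follow one rule: on each axis the two sizes either match or one of them is 1, and the result takes the larger size. This is inferred from the listed shapes, not stated in the issue, so treat the sketch below as an assumption. `broadcast_shape` is a hypothetical name.

```python
def broadcast_shape(a, b):
    """Broadcast two equal-rank shapes axis by axis.

    Assumed rule (inferred from the listed cases): per axis the sizes
    must be equal, or one of them must be 1.
    """
    if len(a) != len(b):
        raise ValueError("shapes must have the same rank")
    out = []
    for x, y in zip(a, b):
        if x == y or y == 1:
            out.append(x)
        elif x == 1:
            out.append(y)
        else:
            raise ValueError(f"cannot broadcast {a} with {b}")
    return tuple(out)


if __name__ == "__main__":
    print(broadcast_shape((1, 1, 4), (1, 128, 4)))    # Broadcast_0_add
    print(broadcast_shape((1, 128, 1), (128, 1, 4)))  # Broadcast_6_mul
    print(broadcast_shape((128, 1, 1), (128, 128, 1)))  # Broadcast_9_div
```

The same check works with the reduced sizes (128 -> 4, 4 -> 2), since the rule depends only on which axes are 1.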
The c files follow the same rules
c_1_4_2_cMatmul.bin
c_4_4_2_cAbs.bin
c_4_4_2_cAdd.bin
c_4_4_2_cAsum.bin
c_4_4_2_cCeil.bin
c_4_4_2_cDiagonal.bin
c_4_4_2_cExp.bin
c_4_4_2_cFloor.bin
c_4_4_2_cInvert.bin
c_4_4_2_cLog10.bin
c_4_4_2_cLog2.bin
c_4_4_2_cLog.bin
c_4_4_2_cPow.bin
c_4_4_2_uPow.bin : ^ (2 + 2i)
c_4_4_2_cRepmat.bin
c_4_4_2_cReshape.bin
c_4_4_2_cRound.bin
c_4_4_2_cScale.bin
c_4_4_2_uScale.bin : * (2 + 2i)
c_4_4_2_cSqrt.bin
c_4_4_2_cSum.bin
c_4_4_2_cTrace.bin
c_4_4_2_cTranspose.bin
c_4_4_2_Permute132.bin
c_4_4_2_Permute213.bin
c_4_4_2_Permute231.bin
c_4_4_2_Permute312.bin
c_4_4_2_Permute321.bin
c_4_4_2_Repmat.bin
cBroadcast_0_add.bin
cBroadcast_10_div.bin
cBroadcast_11_div.bin
cBroadcast_12_div.bin
cBroadcast_1_add.bin
cBroadcast_2_add.bin
cBroadcast_3_add.bin
cBroadcast_4_mul.bin
cBroadcast_5_mul.bin
cBroadcast_6_mul.bin
cBroadcast_7_mul.bin
cBroadcast_8_mul.bin
cBroadcast_9_div.bin
Corrections
idx 1 : b1 -> b0
idx 3 : b1 -> a0
d_broadcast_0_add.bin : 1x1x4 + 1x128x4 = 1x128x4
d_broadcast_1_add.bin : 1x1x4 + 128x1x4 = 128x1x4
d_broadcast_2_add.bin : 1x128x4 + 1x1x4 = 1x128x4
d_broadcast_3_add.bin : 128x1x4 + 1x1x1 = 128x1x4
d_broadcast_4_mul.bin : 1x1x4 * 128x128x1 = 128x128x4
d_broadcast_5_mul.bin : 128x128x4 * 1x1x1 = 128x128x4
d_broadcast_6_mul.bin : 1x128x1 * 128x1x4 = 128x128x4
d_broadcast_7_mul.bin : 128x1x1 * 1x128x4 = 128x128x4
d_broadcast_8_mul.bin : 1x128x1 * 128x128x4 = 128x128x4
d_broadcast_9_div.bin : 128x1x1 / 128x128x1 = 128x128x1
d_broadcast_10_div.bin : 128x128x1 / 1x128x1 = 128x128x1
d_broadcast_11_div.bin : 128x128x1 / 128x1x1 = 128x128x1
d_broadcast_12_div.bin : 128x128x1 / 128x128x1 = 128x128x1