Oneflow-Inc / DLPerf

DeepLearning Framework Performance Profiling Toolkit
Apache License 2.0
275 stars 27 forks source link

add extract gpt result script #142

Open ouyangyu opened 3 years ago

ouyangyu commented 3 years ago

Filtered Result median value

case memory (MiB) lantency (ms) throuthput(sample/sec)
1n1g_dp1_mp1_pp1_mbs16_gbs16_na1_l24_hs1536_nah24_sl2048 30008 2223.93 7.19
1n1g_dp1_mp1_pp1_mbs1_gbs1_na1_l24_hs2304_nah24_sl2048 30130 286.76 3.49
1n1g_dp1_mp1_pp1_mbs2_gbs2_na1_l24_hs2304_nah24_sl2048 31080 489.99 4.08
1n1g_dp1_mp1_pp1_mbs4_gbs4_na1_l24_hs2304_nah24_sl2048 32984 896.91 4.46
1n4g_dp4_mp1_pp1_mbs1_gbs4_na1_l24_hs2304_nah24_sl2048 33870 305.59 13.09
1n4g_dp4_mp1_pp1_mbs2_gbs8_na1_l24_hs2304_nah24_sl2048 34808 510.87 15.66
1n4g_dp4_mp1_pp1_mbs4_gbs16_na1_l24_hs2304_nah24_sl2048 36724 917.67 17.44
1n8g_dp1_mp8_pp1_mbs32_gbs32_na1_l16_hs2304_nah16_sl2048 14110 977.59 32.73
1n8g_dp1_mp8_pp1_mbs32_gbs32_na1_l24_hs1536_nah24_sl2048 13214 1049.49 30.49
1n8g_dp1_mp8_pp1_mbs32_gbs32_na1_l24_hs2304_nah24_sl2048 17744 1516.38 21.10
1n8g_dp1_mp8_pp1_mbs64_gbs64_na1_l16_hs2304_nah16_sl2048 24748 1949.71 32.83
1n8g_dp1_mp8_pp1_mbs64_gbs64_na1_l24_hs1536_nah24_sl2048 23504 2063.59 31.01
1n8g_dp1_mp8_pp1_mbs64_gbs64_na1_l24_hs2304_nah24_sl2048 31270 3016.09 21.22
1n8g_dp2_mp4_pp1_mbs16_gbs32_na1_l24_hs1536_nah24_sl2048 12584 806.30 39.69
1n8g_dp2_mp4_pp1_mbs16_gbs32_na1_l24_hs2304_nah24_sl2048 16476 1174.57 27.24
1n8g_dp2_mp4_pp1_mbs32_gbs64_na1_l24_hs2304_nah24_sl2048 25402 2294.77 27.89
1n8g_dp4_mp2_pp1_mbs16_gbs64_na1_l24_hs1536_nah24_sl2048 19880 1331.34 48.07
1n8g_dp4_mp2_pp1_mbs16_gbs64_na1_l24_hs2304_nah24_sl2048 26304 1963.60 32.60
1n8g_dp8_mp1_pp1_mbs16_gbs128_na1_l24_hs1536_nah24_sl2048 32112 2263.92 56.54
1n8g_dp8_mp1_pp1_mbs1_gbs8_na1_l24_hs2304_nah24_sl2048 33870 312.95 25.56
1n8g_dp8_mp1_pp1_mbs2_gbs16_na1_l24_hs2304_nah24_sl2048 34820 518.97 30.83
1n8g_dp8_mp1_pp1_mbs4_gbs32_na1_l24_hs2304_nah24_sl2048 36730 928.78 34.45