issues
search
Oneflow-Inc
/
DLPerf
DeepLearning Framework Performance Profiling Toolkit
Apache License 2.0
276
stars
27
forks
source link
add extract gpt result script
#142
Open
ouyangyu
opened
3 years ago
ouyangyu
commented
3 years ago
Filtered Result
median value
case
memory (MiB)
lantency (ms)
throuthput(sample/sec)
1n1g_dp1_mp1_pp1_mbs16_gbs16_na1_l24_hs1536_nah24_sl2048
30008
2223.93
7.19
1n1g_dp1_mp1_pp1_mbs1_gbs1_na1_l24_hs2304_nah24_sl2048
30130
286.76
3.49
1n1g_dp1_mp1_pp1_mbs2_gbs2_na1_l24_hs2304_nah24_sl2048
31080
489.99
4.08
1n1g_dp1_mp1_pp1_mbs4_gbs4_na1_l24_hs2304_nah24_sl2048
32984
896.91
4.46
1n4g_dp4_mp1_pp1_mbs1_gbs4_na1_l24_hs2304_nah24_sl2048
33870
305.59
13.09
1n4g_dp4_mp1_pp1_mbs2_gbs8_na1_l24_hs2304_nah24_sl2048
34808
510.87
15.66
1n4g_dp4_mp1_pp1_mbs4_gbs16_na1_l24_hs2304_nah24_sl2048
36724
917.67
17.44
1n8g_dp1_mp8_pp1_mbs32_gbs32_na1_l16_hs2304_nah16_sl2048
14110
977.59
32.73
1n8g_dp1_mp8_pp1_mbs32_gbs32_na1_l24_hs1536_nah24_sl2048
13214
1049.49
30.49
1n8g_dp1_mp8_pp1_mbs32_gbs32_na1_l24_hs2304_nah24_sl2048
17744
1516.38
21.10
1n8g_dp1_mp8_pp1_mbs64_gbs64_na1_l16_hs2304_nah16_sl2048
24748
1949.71
32.83
1n8g_dp1_mp8_pp1_mbs64_gbs64_na1_l24_hs1536_nah24_sl2048
23504
2063.59
31.01
1n8g_dp1_mp8_pp1_mbs64_gbs64_na1_l24_hs2304_nah24_sl2048
31270
3016.09
21.22
1n8g_dp2_mp4_pp1_mbs16_gbs32_na1_l24_hs1536_nah24_sl2048
12584
806.30
39.69
1n8g_dp2_mp4_pp1_mbs16_gbs32_na1_l24_hs2304_nah24_sl2048
16476
1174.57
27.24
1n8g_dp2_mp4_pp1_mbs32_gbs64_na1_l24_hs2304_nah24_sl2048
25402
2294.77
27.89
1n8g_dp4_mp2_pp1_mbs16_gbs64_na1_l24_hs1536_nah24_sl2048
19880
1331.34
48.07
1n8g_dp4_mp2_pp1_mbs16_gbs64_na1_l24_hs2304_nah24_sl2048
26304
1963.60
32.60
1n8g_dp8_mp1_pp1_mbs16_gbs128_na1_l24_hs1536_nah24_sl2048
32112
2263.92
56.54
1n8g_dp8_mp1_pp1_mbs1_gbs8_na1_l24_hs2304_nah24_sl2048
33870
312.95
25.56
1n8g_dp8_mp1_pp1_mbs2_gbs16_na1_l24_hs2304_nah24_sl2048
34820
518.97
30.83
1n8g_dp8_mp1_pp1_mbs4_gbs32_na1_l24_hs2304_nah24_sl2048
36730
928.78
34.45
Filtered Result
median value