Tendo33/oneflow-test
oneflow test
LibAI_bert_large_pretrain_graph_nl24_nah16_hs1024_FP16_acfalse_DP4_MP2_PP2_zerofalse_stage0_mbs4_gbs64_acc4_2n8g
#3 · Open · Tendo33 opened 1 year ago
Tendo33 commented 1 year ago
case3
| NVIDIA_GeForce_RTX_3080_Ti | master@b51cb72 | rank_per_process@a442869 | naive@a442869 |
| -- | -- | -- | -- |
| LibAI_bert_large_pretrain_graph_nl24_nah16_hs1024_FP16_acfalse_DP4_MP2_PP2_zerofalse_stage0_mbs4_gbs64_acc4_2n8g | building plan Done! Cost time: 18.92s.<br>building graph Done! Cost time: 19.91s.<br>node0: 8814MiB–8960MiB<br>node1: 3638MiB–3638MiB<br>[master_output.log](https://oneflow-test.oss-cn-beijing.aliyuncs.com/OneAutoTest/onebench/libai/sunjinfeng_bert_test/case3/b51cb72_master3/LibAI_bert_large_pretrain_graph_nl24_nah16_hs1024_FP16_acfalse_DP4_MP2_PP2_zerofalse_stage0_mbs4_gbs64_acc4_2n8g/output.log) | building plan Done! Cost time: 15.94s.<br>building graph Done! Cost time: 22.09s.<br>node0: 8808MiB–8960MiB<br>node1: 3638MiB–3638MiB<br>[rank_per_process_output.log](https://oneflow-test.oss-cn-beijing.aliyuncs.com/OneAutoTest/onebench/libai/sunjinfeng_bert_test/case3/a442869_env_rank3/LibAI_bert_large_pretrain_graph_nl24_nah16_hs1024_FP16_acfalse_DP4_MP2_PP2_zerofalse_stage0_mbs4_gbs64_acc4_2n8g/output.log) | building plan Done! Cost time: 18.92s.<br>building graph Done! Cost time: 22.16s.<br>node0: 8808MiB–8954MiB<br>node1: 3638MiB–3638MiB<br>[naive_output.log](https://oneflow-test.oss-cn-beijing.aliyuncs.com/OneAutoTest/onebench/libai/sunjinfeng_bert_test/case3/a442869_naive3/LibAI_bert_large_pretrain_graph_nl24_nah16_hs1024_FP16_acfalse_DP4_MP2_PP2_zerofalse_stage0_mbs4_gbs64_acc4_2n8g/output.log) |
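For anyone reading the case name, here is a rough sketch that parses it and sanity-checks the parallelism and batch-size fields. The field meanings are my assumption and are not stated in this issue: `DP`/`MP`/`PP` as data/model/pipeline parallel sizes, `mbs` as micro batch size, `acc` as gradient accumulation steps, `gbs` as global batch size, and `2n8g` as 2 nodes with 8 GPUs each. `parse_case` and `check_case` are hypothetical helper names, not part of LibAI or OneFlow.

```python
import re

# Case name copied from the issue title.
CASE = (
    "LibAI_bert_large_pretrain_graph_nl24_nah16_hs1024_FP16_acfalse_"
    "DP4_MP2_PP2_zerofalse_stage0_mbs4_gbs64_acc4_2n8g"
)


def parse_case(name: str) -> dict:
    """Extract integer fields from a case name (assumed naming scheme)."""
    patterns = {
        "dp": r"_DP(\d+)",    # assumed: data parallel size
        "mp": r"_MP(\d+)",    # assumed: model (tensor) parallel size
        "pp": r"_PP(\d+)",    # assumed: pipeline parallel size
        "mbs": r"_mbs(\d+)",  # assumed: micro batch size
        "gbs": r"_gbs(\d+)",  # assumed: global batch size
        "acc": r"_acc(\d+)",  # assumed: gradient accumulation steps
    }
    fields = {key: int(re.search(pat, name).group(1)) for key, pat in patterns.items()}
    # Trailing "<N>n<G>g" is assumed to mean N nodes with G GPUs per node.
    nodes, gpus = re.search(r"_(\d+)n(\d+)g$", name).groups()
    fields["nodes"] = int(nodes)
    fields["gpus_per_node"] = int(gpus)
    return fields


def check_case(fields: dict) -> dict:
    """Check that the parsed sizes are mutually consistent."""
    world_size = fields["nodes"] * fields["gpus_per_node"]
    return {
        # DP * MP * PP should cover every GPU exactly once.
        "world_size_ok": fields["dp"] * fields["mp"] * fields["pp"] == world_size,
        # Global batch = micro batch * accumulation steps * data parallel size.
        "batch_ok": fields["mbs"] * fields["acc"] * fields["dp"] == fields["gbs"],
    }
```

Under those assumptions, `check_case(parse_case(CASE))` reports both checks as passing: 4 × 2 × 2 = 16 GPUs across 2 nodes of 8, and 4 × 4 × 4 = 64 for the global batch size.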
Global loss curve comparison
50-step loss curve comparison
100-step loss curve comparison