修改oneflow测试脚本,log信息提取文件
script
├── 300k_iters.sh # 300k iterations test, display loss and auc every 1000 iterations.
├── 500_iters.sh # 500 iterations test, display loss and auc every iteration.
├── bsz_x2.sh # Batch Size Double Test
├── fix_bsz.sh # test with different number of devices and fixing batch size per device and with different number of devices and fixing total batch size
├── train_nn_graph.sh #base script
├── vocab_x2.sh # Vocabulary Size Double Test
tool
├── gpu_memory_usage.py # log maximum GPU device memory usage during testing
├──extract_info_from_log.py # extract information from log files
├── extract_info_from_log.sh # bash extract_info_from_log.py
修改oneflow测试脚本,log信息提取文件 script ├── 300k_iters.sh # 300k iterations test, display loss and auc every 1000 iterations. ├── 500_iters.sh # 500 iterations test, display loss and auc every iteration. ├── bsz_x2.sh # Batch Size Double Test ├── fix_bsz.sh # test with different number of devices and fixing batch size per device and with different number of devices and fixing total batch size ├── train_nn_graph.sh #base script ├── vocab_x2.sh # Vocabulary Size Double Test tool ├── gpu_memory_usage.py # log maximum GPU device memory usage during testing ├──extract_info_from_log.py # extract information from log files ├── extract_info_from_log.sh # bash extract_info_from_log.py