My reproduction steps are as follows.
The data-loading log looks normal:
16541
Model construction finished.
training number is 1800, dev number is 1100, test_num is 1100
Maximum train sequence length: 96, dev sequence length: 110, test sequence length: 110
data is ready
After training for 1000 epochs, the log shows:
gBatch 19000, lBatch 19, loss 0.13127, loss_crf 0.07497, loss_ft 0.00051
At epoch 999, official dev f1 : 0.950363, precision : 0.950363, recall : 0.950363
At this run, the maximum dev f1:0.950363, dev precision:0.950363, dev recall:0.950363
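Judging from the code, this figure appears to be character-level accuracy rather than a correction metric. The predictions in ckpt/SIGHAN15_0.5/dev_pred.txt also contain many errors, which is inconsistent with the 80% sentence-level correction F1 reported in the paper.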
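To make the difference between the two metrics concrete, here is a minimal sketch in Python. It is not the repository's evaluation code. The function names and the toy sentences are made up for illustration, and the sentence-level definition used here (a sentence counts as correctly fixed only if it contained an error and the prediction matches the gold sentence exactly) is one common convention that I am assuming, not necessarily the one used in the paper.

```python
def char_accuracy(golds, preds):
    """Fraction of characters where the prediction equals the gold label."""
    total = correct = 0
    for gold, pred in zip(golds, preds):
        for g, p in zip(gold, pred):
            total += 1
            correct += int(g == p)
    return correct / total if total else 0.0


def sentence_correction_prf(srcs, golds, preds):
    """Sentence-level correction precision, recall and F1 (assumed definition).

    precision = fully corrected sentences / sentences the model changed
    recall    = fully corrected sentences / sentences that contained an error
    """
    fixed = changed = erroneous = 0
    for src, gold, pred in zip(srcs, golds, preds):
        if pred != src:
            changed += 1
        if gold != src:
            erroneous += 1
            if pred == gold:
                fixed += 1
    precision = fixed / changed if changed else 0.0
    recall = fixed / erroneous if erroneous else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical toy data: source sentences, gold corrections, model predictions.
srcs = ["我喜换你", "今天天气好", "他在加里", "我门走吧"]
golds = ["我喜欢你", "今天天气好", "他在家里", "我们走吧"]
preds = ["我喜欢你", "今天天汽好", "他在加里", "我们走吧"]

acc = char_accuracy(golds, preds)
precision, recall, f1 = sentence_correction_prf(srcs, golds, preds)
print(f"char accuracy: {acc:.4f}")
print(f"sentence-level P/R/F1: {precision:.4f} / {recall:.4f} / {f1:.4f}")
```

On this toy data the character accuracy is 15/17 while the sentence-level correction F1 is only 2/3, which shows how a high character-level number can coexist with a much lower sentence-level score.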
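How can I reproduce the results in the paper? Ideally the answer would include both the training steps and the testing steps. Thanks!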