Two things are not working. First, as you can see, I passed --overwrite --keep-best to avoid storing every iteration of the model, but all iterations are still saved, which consumes a huge amount of disk space.
Second, although I passed --log model/train.log --valid-log model/valid.log, no log files are generated in the model dir.
Only this output is written to STDERR:
[2019-04-22 08:44:28] Ep. 1 : Up. 1000 : Sen. 3,804 : Cost 194.71755981 : Time 269.17s : 449.69 words/s
[2019-04-22 08:44:52] Ep. 1 : Up. 2000 : Sen. 7,665 : Cost 143.95985413 : Time 23.48s : 5032.12 words/s
[2019-04-22 08:45:15] Ep. 1 : Up. 3000 : Sen. 11,452 : Cost 124.35760498 : Time 23.47s : 5041.61 words/s
[2019-04-22 08:45:39] Ep. 1 : Up. 4000 : Sen. 15,217 : Cost 109.81844330 : Time 23.45s : 5089.74 words/s
[2019-04-22 08:46:02] Ep. 1 : Up. 5000 : Sen. 19,040 : Cost 100.38690948 : Time 23.53s : 5157.90 words/s
[2019-04-22 08:46:26] Ep. 1 : Up. 6000 : Sen. 22,764 : Cost 92.49450684 : Time 23.38s : 5089.51 words/s
[2019-04-22 08:46:49] Ep. 1 : Up. 7000 : Sen. 26,602 : Cost 82.52687073 : Time 23.41s : 5092.01 words/s
[2019-04-22 08:47:12] Ep. 1 : Up. 8000 : Sen. 30,290 : Cost 82.85927582 : Time 23.33s : 5119.25 words/s
[2019-04-22 08:47:36] Ep. 1 : Up. 9000 : Sen. 34,026 : Cost 77.60969543 : Time 23.34s : 5158.21 words/s
[2019-04-22 08:47:59] Ep. 1 : Up. 10000 : Sen. 37,991 : Cost 73.47486877 : Time 23.48s : 5236.94 words/s
[2019-04-22 08:47:59] Saving model weights and runtime parameters to model/model.iter10000.npz
[2019-04-22 08:48:00] Saving model weights and runtime parameters to model/model.npz
[2019-04-22 08:48:01] Saving Adam parameters to model/model.npz.optimizer.npz
Hi, I am running the transformer example using my own data as follows:
Am I doing something wrong here?