bart2001 closed this issue 6 years ago
Model name: sejong_max_char_per_token_50 (trained with max_characters_per_token=50)
options.json:
{
  "all_clip_norm_val": 10.0,
  "batch_size": 64,
  "bidirectional": true,
  "char_cnn": {
    "activation": "relu",
    "embedding": { "dim": 16 },
    "filters": [ [1, 32], [2, 32], [3, 64], [4, 128], [5, 256], [6, 512], [7, 1024] ],
    "max_characters_per_token": 50,
    "n_characters": 261,
    "n_highway": 2
  },
  "dropout": 0.1,
  "lstm": {
    "cell_clip": 3,
    "dim": 4096,
    "n_layers": 2,
    "proj_clip": 3,
    "projection_dim": 512,
    "use_skip_connections": true
  },
  "n_epochs": 10,
  "n_negative_samples_batch": 45,
  "n_tokens_vocab": 4488,
  "n_train_tokens": 32119740,
  "unroll_steps": 20
}
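For context on what `max_characters_per_token: 50` allows with Korean text, here is a rough sketch. It assumes a bilm-tf-style character vocabulary in which each token is encoded as UTF-8 bytes and two of the slots are reserved for begin-of-word and end-of-word markers. That is an assumption about the training code, not something stated in this issue. The function names are hypothetical.

```python
# Sketch: how many Hangul syllables fit in one token under a byte-level
# character vocabulary. Assumes 2 of the max_characters_per_token slots are
# reserved for begin/end-of-word markers (an assumption, see above).

def max_hangul_syllables(max_characters_per_token: int,
                         reserved_markers: int = 2) -> int:
    """Number of whole Hangul syllables that fit without truncation."""
    usable_bytes = max_characters_per_token - reserved_markers
    bytes_per_syllable = len("가".encode("utf-8"))  # 3 bytes in UTF-8
    return usable_bytes // bytes_per_syllable


def fits_without_truncation(token: str,
                            max_characters_per_token: int = 50,
                            reserved_markers: int = 2) -> bool:
    """True if the token's UTF-8 bytes fit in the usable slots."""
    usable_bytes = max_characters_per_token - reserved_markers
    return len(token.encode("utf-8")) <= usable_bytes


if __name__ == "__main__":
    print(max_hangul_syllables(50))
    print(fits_without_truncation("세종"))
```

Under these assumptions, a value of 50 leaves 48 bytes per token, which is 16 Hangul syllables. Longer tokens would be cut off at the byte level.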
Training in progress
Final performance
Batch 125400, train_perplexity=2.9646606
Total time: 65514.2766084671
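As a sanity check on the log above, the expected number of batches can be estimated from options.json. This sketch assumes the trainer computes the total as `n_epochs * n_train_tokens / (batch_size * unroll_steps * n_gpus)`, which is how bilm-tf-style training scripts usually do it, and that the total time is in seconds. The GPU count is not stated in this issue, so `n_gpus` is a guess.

```python
# Sketch: estimate the total batch count and per-batch time from the
# options.json values and the final log line. n_gpus is an assumption.

def expected_batches(n_epochs: int, n_train_tokens: int, batch_size: int,
                     unroll_steps: int, n_gpus: int) -> int:
    """Total batches, assuming each GPU consumes batch_size * unroll_steps tokens."""
    tokens_per_batch = batch_size * unroll_steps * n_gpus
    return (n_epochs * n_train_tokens) // tokens_per_batch


def training_summary(total_seconds: float, last_batch: int) -> dict:
    """Convert the logged total time into hours and seconds per batch."""
    return {
        "hours": total_seconds / 3600.0,
        "seconds_per_batch": total_seconds / last_batch,
    }


if __name__ == "__main__":
    for n_gpus in (1, 2):
        print(n_gpus, expected_batches(10, 32119740, 64, 20, n_gpus))
    print(training_summary(65514.2766084671, 125400))
```

With `n_gpus=2` the estimate is 125467 batches, which is consistent with a last logged batch of 125400 if progress is printed every 100 batches. With `n_gpus=1` the estimate is roughly twice that. The logged total time works out to about 18.2 hours, or roughly 0.52 seconds per batch.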