关于使用TIMIT数据集的问题

Walleclipse / Deep_Speaker-speaker_recognition_system

Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)

245 stars 81 forks source link

Closed ZJG0 closed 3 years ago

ZJG0 commented 4 years ago

您好，我使用train.py训练,一直卡在这个地方，是数据的问题吗？ QQ截图20200313160042

Walleclipse commented 4 years ago

应该不是数据的问题，可能就是选择最优喂入数据的 select batch 这一步很慢。你可以改一些参数使其加快或者换成 random_batch，请查看 issue 37 。

ZJG0 commented 4 years ago

您好，我已经使用random_batch.py训练完模型，但是结果不太理想，我又在这个基础上使用了select__batch.py进行训练，但是已经跑了80个小时了，依旧卡在select阶段，不知道就是这么慢还是我的数据有问题？

sophia-396 commented 4 years ago

您好，我使用了checkpoints/model_17200_0.54980.h5的权重做迁移学习，使用了您的网络结构，在原本的CNN模型加了一层softmax层，并且只训练最后一层。用的是TIMIT数据集，结果出现如下，能否帮忙分析以下是什么问题呢？

Walleclipse commented 4 years ago

我不太了解 TIMIT数据集，但是看这张图似乎，模型对train set过拟合了。没有泛化到 validation set上。我觉得你可以训练所有层，而不只是训练最后一个softmax层。毕竟数据很不一样，训练最后一层可能远远不够迁移到新的数据上。