Closed liangyanfeng closed 5 years ago
There are other inconsistencies between paper and code worth noting:
m
: 0.35 (code) / 0.4 (paper)Still, I would like to thank the author for publishing this code with pretrained weights.
Thanks. This was my very first work on this project, so I think there are a lot of room for improving this work.
https://github.com/WeidiXie/VGG-Speaker-Recognition/blob/master/src/backbone.py#L221
y = MaxPooling2D((3, 1), strides=(2, 1), name='mpool2')(x5)