scutcsq / DWFormer

DWFormer: Dynamic Window Transformer for Speech Emotion Recognition (ICASSP 2023 Oral)

In the lmdb part of Dataset, why is the length fixed at 324, when the average length over all the utterances is 354? Any pointers would be appreciated. #13

Closed 18wangsss closed 7 months ago

scutcsq commented 7 months ago

As far as I recall, 80% of the speech feature length comes out to roughly this value?
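For readers wondering how a fixed length can end up below the average, here is a minimal sketch of one possible reading of the reply: the fixed length is chosen so that about 80% of the utterances fit without truncation. This is an assumption, not the repository's confirmed preprocessing. The helper names `pick_target_len` and `pad_or_truncate` are hypothetical, and the list of lengths is toy data invented for illustration, not the real dataset statistics.

```python
import math


def pick_target_len(lengths, coverage_pct=80):
    """Nearest-rank percentile: the smallest length L such that at least
    coverage_pct percent of the utterances have length <= L."""
    ordered = sorted(lengths)
    rank = max(1, math.ceil(coverage_pct * len(ordered) / 100))
    return ordered[rank - 1]


def pad_or_truncate(frames, target_len, feat_dim):
    """Return exactly target_len frames: cut the tail of long utterances,
    append all-zero frames to short ones."""
    clipped = [list(f) for f in frames[:target_len]]
    padding = [[0.0] * feat_dim for _ in range(target_len - len(clipped))]
    return clipped + padding


# Toy frame counts (invented): a few very long utterances pull the mean up.
lengths = [120, 200, 250, 300, 310, 320, 324, 324, 600, 792]

target_len = pick_target_len(lengths)        # 324 for this toy list
mean_len = sum(lengths) / len(lengths)       # 354.0 for this toy list

# Hypothetical feature dimension of 4, just to keep the example small.
short = pad_or_truncate([[1.0] * 4] * 120, target_len, feat_dim=4)
long_ = pad_or_truncate([[1.0] * 4] * 792, target_len, feat_dim=4)
print(target_len, mean_len, len(short), len(long_))
```

In this toy list the mean is higher than the 80% cutoff because the length distribution has a long tail, which is one way a fixed length can sit below the average length.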