ina-foss / inaSpeechSegmenter

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
MIT License
717 stars 127 forks source link

could I input waveform data? #36

Closed ucas010 closed 4 years ago

ucas010 commented 4 years ago

hi,dear I see the input is the mp3 file,but could I set the waveform data ? Or wav file is Ok??[tried,but not success]

could you help me ? thx

DavidDoukhan commented 4 years ago

All major sound format are supported by inaSpeechSegmenter (mp3, wav, ogg, avi, mp4, ...) Could you provide more details in your issue ?