princeton-ddss / SpeechMLPipeline

SpeechMLPipeline is a complete pipeline to deploy Machine Learning Models to generate labelled and timestamped transcripts from audio inputs
MIT License
0 stars 1 forks source link

Features Development #14

Closed fjying closed 10 months ago

fjying commented 1 year ago

Develop Features for the Inputs to Speaker Segmentation

fjying commented 10 months ago

Instead of developing handcrafted features, would directly apply audio analysis packages to diarization