Audio Speech Recognition

UMass-Rescue / CombinedTechStack

Handle and process large amounts of media data with plug-and-play machine learning models

https://umass-rescue.github.io/CombinedTechStack/

MIT License

1 stars 1 forks source link

Audio Speech Recognition #16

Open christopherdoan opened 3 years ago

christopherdoan commented 3 years ago

Use machine learning to perform audio speech recognition and transcription. Use the transcription to pull out valuable information

[ ] Perform speech recognition on long audio files
[x] From an audio transcription, perform Named Entity Recognition
[ ] Be able to discern all important name entities after NER operation (ranking?)
[ ] Associated named entities with time stamp from audio

christopherdoan commented 3 years ago

I am working on this issue. CD 3/29/2021

christopherdoan commented 3 years ago

Able to add video model: Speech_Rec_NERMicroservice in the prediction/models directory. Has non-deterministic bugs (sometimes it wont send results to backend)