UMass-Rescue / CombinedTechStack

Handle and process large amounts of media data with plug-and-play machine learning models
https://umass-rescue.github.io/CombinedTechStack/
MIT License
1 stars 1 forks source link

Audio Speech Recognition #16

Open christopherdoan opened 3 years ago

christopherdoan commented 3 years ago

Use machine learning to perform audio speech recognition and transcription. Use the transcription to pull out valuable information

christopherdoan commented 3 years ago

I am working on this issue. CD 3/29/2021

christopherdoan commented 3 years ago

Able to add video model: Speech_Rec_NERMicroservice in the prediction/models directory. Has non-deterministic bugs (sometimes it wont send results to backend)