10up / classifai

Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence.
https://classifaiplugin.com
GNU General Public License v2.0
564 stars 52 forks source link

Integrate Watson API for speech-to-text #47

Open jeffpaul opened 5 years ago

jeffpaul commented 5 years ago

Splitting this out from #2

Sidsector9 commented 1 year ago

Additional info:

The IBM Watson Speech to Text provides 3 interfaces for speech recognition:

See more on Advantages of the WebSocket interface.

@jeffpaul This service provides a lot of features, few of them such as: Speaker labels, Profanity filtering and Background audio suppression, etc.

Can you expand on the use case of this feature? That way we can list out the features that can go with the implementation.

jeffpaul commented 1 year ago

@Sidsector9 I believe the original thought on this enhancements was taking live speech and generating text from that for captions, so the speaker labels bit you highlighted probably most applies here.

jeffpaul commented 1 year ago

Could similarly look at using OpenAI's Whisper for this.