wit-ai / wit-unity

Wit-Unity is a Unity C# wrapper around the Wit.ai REST APIs and is a core component of Voice SDK.

Wanting to disable recording from the App Voice Service #93

Open ElDuderini opened 4 months ago

ElDuderini commented 4 months ago

I'm having trouble setting up a more custom implementation of this SDK. I want to handle transcription on-device with OpenAI's Whisper so that I can also use it for a wake word that activates voice commands: https://github.com/Macoron/whisper.unity

However, since there is no option to keep the microphone from recording while the script is active, the two conflict over access to the microphone. I tried a workaround where the Wit voice experience game object was enabled and disabled depending on whether the wake word had been triggered. This worked for the first command, but activation stopped working as it should when the second voice command was issued.

I know this SDK offers Activate(String), so I would like to manage transcription entirely through Whisper and then send the resulting string to Wit for processing after the wake word is triggered.

I'm using version 66 of the Oculus Voice SDK.

yolanother commented 3 months ago

Sorry for the delay. This is definitely possible. There is a hook for custom transcription services that plugs directly into the activation process, or you can do as you asked and call AVE.Activate(String). If you are having mic conflicts, there are ways to work around them by using a shared mic buffer. I think I put up a video on doing that at some point, where I hooked up one of the Unity on-device ASR assets. I'm working on a project now that does this as well, so I will be reverifying that this works soon. I also think I set up something similar a while back using PocketSphinx, which might help you as well (https://www.youtube.com/live/jeSx6VzpMhk).
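
For readers following the Activate(String) route discussed above, here is a minimal sketch of the gating step: take each finished transcript from the on-device recognizer, check it for a wake word, and forward only the text after the wake word. `WakeWordGate`, `OnTranscript`, and the "hey wit" phrase are hypothetical names chosen for illustration; they are not part of the Voice SDK or whisper.unity. The class is plain C# with no Unity dependency, and the delegate is where the Activate(String) call would be wired in.

```csharp
using System;

// Hypothetical helper for illustration; not part of the Voice SDK or whisper.unity.
public sealed class WakeWordGate
{
    private readonly string _wakeWord;
    private readonly Action<string> _sendToWit;

    // sendToWit is whatever forwards text for processing, for example a
    // lambda that calls Activate(String) on the voice service.
    public WakeWordGate(string wakeWord, Action<string> sendToWit)
    {
        if (string.IsNullOrWhiteSpace(wakeWord))
            throw new ArgumentException("Wake word must not be empty.", nameof(wakeWord));
        _wakeWord = wakeWord.Trim();
        _sendToWit = sendToWit ?? throw new ArgumentNullException(nameof(sendToWit));
    }

    // Call with each completed transcript from the on-device recognizer.
    // Returns true when a command was found after the wake word and forwarded.
    public bool OnTranscript(string transcript)
    {
        if (string.IsNullOrWhiteSpace(transcript)) return false;

        int index = transcript.IndexOf(_wakeWord, StringComparison.OrdinalIgnoreCase);
        if (index < 0) return false; // no wake word, ignore this utterance

        // Keep only what follows the wake word and strip surrounding punctuation.
        string command = transcript
            .Substring(index + _wakeWord.Length)
            .Trim(' ', ',', '.', '!', '?');
        if (command.Length == 0) return false; // wake word alone, nothing to send

        _sendToWit(command);
        return true;
    }
}
```

In a Unity script this could be constructed as `new WakeWordGate("hey wit", text => voiceExperience.Activate(text))`, assuming `voiceExperience` is a reference to the scene's App Voice Experience and that the Activate(String) overload mentioned in this thread is available in your SDK version. How the transcript reaches `OnTranscript` depends on the recognizer you use and is left out here.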