robotology / assistive-rehab

Assistive and Rehabilitative Robotics
https://robotology.github.io/assistive-rehab/doc/mkdocs/site
BSD 3-Clause "New" or "Revised" License
20 stars 11 forks source link

Implement verbal interaction #192

Closed vvasco closed 5 years ago

vvasco commented 5 years ago

Examples of potential conversation can be found here:

Given these considerations, we have to understand which tool to use and start to create a potential conversation for a meaningful verbal interaction.

cc @vtikha

vtikha commented 5 years ago

:thought_balloon: :speech_balloon:

The best option then for this is to use the architecture developed for the IBM JL. Do you confirm that we will be using English? If yes, then:

pattacini commented 5 years ago

Unfortunately, the first language we've got to use is Italian 🤔

vtikha commented 5 years ago

we therefore need to use google services for speech to text.

vvasco commented 5 years ago

Ok so we would have:

vtikha commented 5 years ago

I completely agree @vvasco , but the mics are terrible on R1. In any case, the mics are plug and play therefore I can switch and test both easily.

vvasco commented 5 years ago

We decided to use Google services for interpreting potential questions. As we don't plan to implement a conversation for now, but rather a simple natural question and answer mechanism, we can avoid IBM services and eventually resort to them later to include a more complex interaction.

The speech interpreter will manage questions (triggered by "hey robot" or hand raise), interpret them and send a corresponding keyword to managerTUG (port name: /managerTUG/speech-interp:i), which will formulate the answer accordingly and send it to iSpeak. For now, managerTUG manages the following keywords:

Important: we have to think of how to manage that the question generates an event, which can occur at any time and which the modules involved must be aware of.

cc @vtikha @pattacini

vtikha commented 5 years ago

The following PR deals with this by adding:

vvasco commented 5 years ago

Done in #225.