walkingmachinegraveyard / sara_commun

Commun packages for the Sara robot
4 stars 0 forks source link

Speech Recognition & Audio Detection Test - Robocup2k17 #129

Open gortium opened 7 years ago

gortium commented 7 years ago

Speech Recognition \& Audio Detection Test

This test is divided in two phases. First the robot must answer a set of questions to an operator at the first attempt without asking for confirmation. The operator is not allowed to move to the robot or shout to the robot.

For the second phase, the focus is to carry out Sound Source Localization (SSL) and Automatic Speech Recognition (ASR) at the same time. To this effect, volunteers will stand around the robot and a randomly chosen volunteer will ask a question. The robot is expected to turn towards that volunteer and answer the question. If the robot is NOT able to carry out SSL and ASR at the same time (e.g. it needs to turn towards the speaking volunteer using SSL before doing ASR), the robot is allowed to request to repeat the question (with a penalty). In addition, if SSL cannot be carried out, the robot may still answer the question without turning, but no points will be provided for SSL.

Goal

The robot must be able to properly recognize and answer to a specific set of questions without ask for confirmation. Also, the robot shall be able to react to a speaking operator which is not facing towards to.

Focus

This test focuses on voice recognition and audio-source localization in a noisy environment, with moving sound sources. This test may also be held outside the arena.

Setup

  1. The apartment is in its normal state.

  2. All doors of the apartment are open, except for the entry door.

Task

1. Direct speech recognition:

The robot should move (or be moved) to a previously specified point inside the arena. A TC member will ask 5 questions from the set of 50 predefined questions in front of the robot. The robot should answer the question without asking confirmation. A question will only be asked once; there are no repetitions of a question.

2. Indirect speech recognition:

A group of several volunteers will stand around the robot as in Figure \ref{fig:asrsetup}, standing still, facing the robot, and between 0.75 to 1.0 meters away from it. The number of volunteers may change from the Figure. A random person of this group will ask a question of the same set described in the first part of the test. Before answering, the robot can do one or both of the following (each providing different amount of points):

This process will repeat 5 times. No attempt may receive negative points. When the robot provides an answer it should not ask for confirmation.

Questions

In previous years, the robots only had to answer questions from a list of pre-defined trivia questions. In addition to 10 predefined trivia-questions, the robot will be asked some questions about the RoboCup arena and the objects within it, i.e. the scenario.

Note theat the robot does not have to go and find out the answer; it should be able to infer the answers from knowledge it already has.

Additional rules and remarks

- Continue rule:

Continue rule (Section 3.9.15) can not be used during this test.

- Question timeout:

If the robot does not answer within 10 seconds, the question is considered as missed, and the TC member will proceed with the next question.

- Understanding the answer:

If the robot's answer is not understood by the operator, it is considered as \textit{incorrect}, and the TC member will proceed with the next question. It is thus advised that the robot provide answers such that it is clear that the robot understood the question. For example, if the question is What is the capital of Germany?'', instead of just answeringBerlin'', it is advised that the robot answers something to the effect of ``The capital of Germany is Berlin''.

- Question repetition:

In the second phase, if the robot turns towards the operator to be asked once again, it should clearly state that it requires a repetition of the question once it finishes turning, so as to cue the operator to do so.

Data recording

Please record the following data (See section 3.4):