This test is divided in two phases. First the robot must answer a set of questions to an operator at the first attempt without asking for confirmation. The operator is not allowed to move to the robot or shout to the robot.
For the second phase, the focus is to carry out Sound Source Localization (SSL) and Automatic Speech Recognition (ASR) at the same time. To this effect, volunteers will stand around the robot and a randomly chosen volunteer will ask a question. The robot is expected to turn towards that volunteer and answer the question. If the robot is NOT able to carry out SSL and ASR at the same time (e.g. it needs to turn towards the speaking volunteer using SSL before doing ASR), the robot is allowed to request to repeat the question (with a penalty). In addition, if SSL cannot be carried out, the robot may still answer the question without turning, but no points will be provided for SSL.
Goal
The robot must be able to properly recognize and answer to a specific set of questions without ask for confirmation. Also, the robot shall be able to react to a speaking operator which is not facing towards to.
Focus
This test focuses on voice recognition and audio-source localization in a noisy environment, with moving sound sources. This test may also be held outside the arena.
Setup
The apartment is in its normal state.
All doors of the apartment are open, except for the entry door.
Task
1. Direct speech recognition:
The robot should move (or be moved) to a previously specified point inside the arena.
A TC member will ask 5 questions from the set of 50 predefined questions in front of the robot.
The robot should answer the question without asking confirmation.
A question will only be asked once; there are no repetitions of a question.
The operator shall be standing still and facing to the robot.
The operator shall be between 0.75 and 1.0 meters away from the robot position.
The operator shall be between -60 and 60 from the robot's center (front range).
2. Indirect speech recognition:
A group of several volunteers will stand around the robot as in Figure \ref{fig:asrsetup}, standing still, facing the robot, and between 0.75 to 1.0 meters away from it. The number of volunteers may change from the Figure. A random person of this group will ask a question of the same set described in the first part of the test. Before answering, the robot can do one or both of the following (each providing different amount of points):
Answer the question,
Turn towards the person that asked the question,
Ask for only one repetition of the question, if needed (penalty).
This process will repeat 5 times. No attempt may receive negative points. When the robot provides an answer it should not ask for confirmation.
Questions
In previous years, the robots only had to answer questions from a list of pre-defined trivia questions. In addition to 10 predefined trivia-questions, the robot will be asked some questions about the RoboCup arena and the objects within it, i.e. the scenario.
What color is the shampoo? -> The shampoo is blue.
What is the color of the sponge -> The sponge is green.
What type of thing is the beer? -> Beer a kind of drink.
Where can I find the beer? -> The beer is in the fridge, nice and cool.
Tell me where the plates are. -> The plate is in the cupboard.
How big is the egg? -> The egg is a couple of centimeters big.
What shape is the egg? -> It is egg-shaped of course.
What shape is the soap? -> The soap is a flat block.
Where is the shampoo? -> The shampoo is in the closet.
What objects are in the closet? -> The shampoo, soap, the sponge and a cloth. Or: The closet contains toiletries.
Note theat the robot does not have to go and find out the answer; it should be able to infer the answers from knowledge it already has.
Additional rules and remarks
- Continue rule:
Continue rule (Section 3.9.15) can not be used during this test.
- Question timeout:
If the robot does not answer within 10 seconds, the question is considered as missed, and the TC member will proceed with the next question.
- Understanding the answer:
If the robot's answer is not understood by the operator, it is considered as \textit{incorrect}, and the TC member will proceed with the next question. It is thus advised that the robot provide answers such that it is clear that the robot understood the question. For example, if the question is What is the capital of Germany?'', instead of just answeringBerlin'', it is advised that the robot answers something to the effect of ``The capital of Germany is Berlin''.
- Question repetition:
In the second phase, if the robot turns towards the operator to be asked once again, it should clearly state that it requires a repetition of the question once it finishes turning, so as to cue the operator to do so.
Data recording
Please record the following data (See section 3.4):
Speech Recognition \& Audio Detection Test
This test is divided in two phases. First the robot must answer a set of questions to an operator at the first attempt without asking for confirmation. The operator is not allowed to move to the robot or shout to the robot.
For the second phase, the focus is to carry out Sound Source Localization (SSL) and Automatic Speech Recognition (ASR) at the same time. To this effect, volunteers will stand around the robot and a randomly chosen volunteer will ask a question. The robot is expected to turn towards that volunteer and answer the question. If the robot is NOT able to carry out SSL and ASR at the same time (e.g. it needs to turn towards the speaking volunteer using SSL before doing ASR), the robot is allowed to request to repeat the question (with a penalty). In addition, if SSL cannot be carried out, the robot may still answer the question without turning, but no points will be provided for SSL.
Goal
The robot must be able to properly recognize and answer to a specific set of questions without ask for confirmation. Also, the robot shall be able to react to a speaking operator which is not facing towards to.
Focus
This test focuses on voice recognition and audio-source localization in a noisy environment, with moving sound sources. This test may also be held outside the arena.
Setup
The apartment is in its normal state.
All doors of the apartment are open, except for the entry door.
Task
1. Direct speech recognition:
The robot should move (or be moved) to a previously specified point inside the arena. A TC member will ask 5 questions from the set of 50 predefined questions in front of the robot. The robot should answer the question without asking confirmation. A question will only be asked once; there are no repetitions of a question.
The operator shall be standing still and facing to the robot.
The operator shall be between 0.75 and 1.0 meters away from the robot position.
The operator shall be between -60 and 60 from the robot's center (front range).
2. Indirect speech recognition:
A group of several volunteers will stand around the robot as in Figure \ref{fig:asrsetup}, standing still, facing the robot, and between 0.75 to 1.0 meters away from it. The number of volunteers may change from the Figure. A random person of this group will ask a question of the same set described in the first part of the test. Before answering, the robot can do one or both of the following (each providing different amount of points):
Answer the question,
Turn towards the person that asked the question,
Ask for only one repetition of the question, if needed (penalty).
This process will repeat 5 times. No attempt may receive negative points. When the robot provides an answer it should not ask for confirmation.
Questions
In previous years, the robots only had to answer questions from a list of pre-defined trivia questions. In addition to 10 predefined trivia-questions, the robot will be asked some questions about the RoboCup arena and the objects within it, i.e. the scenario.
Note theat the robot does not have to go and find out the answer; it should be able to infer the answers from knowledge it already has.
Additional rules and remarks
- Continue rule:
Continue rule (Section 3.9.15) can not be used during this test.
- Question timeout:
If the robot does not answer within 10 seconds, the question is considered as missed, and the TC member will proceed with the next question.
- Understanding the answer:
If the robot's answer is not understood by the operator, it is considered as \textit{incorrect}, and the TC member will proceed with the next question. It is thus advised that the robot provide answers such that it is clear that the robot understood the question. For example, if the question is
What is the capital of Germany?'', instead of just answering
Berlin'', it is advised that the robot answers something to the effect of ``The capital of Germany is Berlin''.- Question repetition:
In the second phase, if the robot turns towards the operator to be asked once again, it should clearly state that it requires a repetition of the question once it finishes turning, so as to cue the operator to do so.
Data recording
Please record the following data (See section 3.4):
Audio
Commands