This paper presents an interactive quizmaster robot that can manage a multiparty speech-based quiz game. The basic flow of the quiz game is that (1) the robot reads a question, (2) one or more players answer it, and (3) the robot judges the correctness of the answers. We categorize such speech-based quiz games into school-type interaction and auction-type interaction. The former asks players to say 'Yes' to get the right to answer before answering a question and the latter allows players to directly answer a question without any advance notice. To realize such interaction, the robot needs the capability of recognizing utterances from multiple people using its own microphones (i.e. ears), even if those utterances are made simultaneously. To cope with such situations, the robot estimates which player made the fastest utterance and recognizes it by localizing and separating a mixture of audio signals. Experiments were conducted to evaluate the success rates of the fastest player identification and speech recognition. The results showed that the robot could identify the fastest speakers with a success rate of 90.0% more accurately than humans when only one speaker slightly preceded the other speakers. We found that although the success rate of speech recognition for the fastest speakers did not reach that of humans, the robot attains amusing quiz game interaction.
ASJC Scopus subject areas
- コンピュータ サイエンスの応用