Extract voice information using high-speed camera

Mariko Akutsu, Yasuhiro Oikawa, Yoshio Yamasaki

Research output: Conference article

5 Citations (Scopus)


Conversation is one of the most important channels of human communication, and speech recognition technologies have been developed to support it. In a conversation, not only the content of an utterance but also its intonation and tone carry important information about the speaker's intention. Microphones are typically used to record voices when studying human speech. However, because microphones must be placed within the space, their presence affects the physical behavior of the sound field. To address this problem, we have proposed a recording method that uses a high-speed camera. A high-speed camera that records sound vibrations can capture two or more points within its field of view simultaneously and can record from a distance without interfering with the sound field. In this study, we extract voice information from high-speed videos that capture both the face and the neck of the subject. This method makes it possible to record skin vibrations, which carry the individual characteristics of a voice, and to estimate the sound waves by image processing. The experimental results show that a high-speed camera is capable of recording voice information.
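
The abstract does not say which image processing method is used. As a rough illustration only, the sketch below assumes one simple possibility: treating the mean brightness of a skin region in each frame as a vibration signal and reading its dominant frequency from an FFT. The function names, the region of interest, the frame rate, and the synthetic frames are all hypothetical and are not taken from the paper.

```python
import numpy as np

def roi_intensity_signal(frames, roi):
    """Return a zero-mean 1-D signal: mean brightness of a region per frame.

    frames: array of shape (n_frames, height, width), grayscale.
    roi: (top, bottom, left, right) pixel bounds; a hypothetical skin patch.
    """
    top, bottom, left, right = roi
    patch = frames[:, top:bottom, left:right].astype(np.float64)
    signal = patch.mean(axis=(1, 2))   # one sample per video frame
    return signal - signal.mean()      # remove the constant brightness offset

def dominant_frequency(signal, fps):
    """Return the frequency (Hz) of the strongest non-DC spectral component."""
    spectrum = np.abs(np.fft.rfft(signal))
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / fps)
    return freqs[np.argmax(spectrum[1:]) + 1]   # skip the DC bin

# Synthetic stand-in for a high-speed video: brightness modulated by a tone.
fps = 10000                      # assumed frame rate, frames per second
n_frames = 2000
t = np.arange(n_frames) / fps
tone_hz = 200.0                  # assumed vibration frequency
brightness = 100.0 + 5.0 * np.sin(2 * np.pi * tone_hz * t)
frames = np.ones((n_frames, 16, 16)) * brightness[:, None, None]

sig = roi_intensity_signal(frames, (4, 12, 4, 12))
peak = dominant_frequency(sig, fps)
print(f"dominant frequency: {peak:.1f} Hz")
```

The frame rate sets the sampling rate of the recovered signal, so under this assumed approach only vibrations below half the frame rate can be represented.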

Journal: Proceedings of Meetings on Acoustics
Publication status: Published - 2013 Jun 19
Event: 21st International Congress on Acoustics, ICA 2013 - 165th Meeting of the Acoustical Society of America - Montreal, QC, Canada
Duration: 2013 Jun 2 to 2013 Jun 7

ASJC Scopus subject areas

  • Acoustics and Ultrasonics
