3D human head geometry estimation from a speech

Akinobu Maejima*, Shigeo Morishima

*Corresponding author for this work

Research output: Conference contribution

Abstract

We can picture an acquaintance's appearance just by hearing their voice if we have met them within the past few years. This suggests that some relationship exists between voice and appearance. If 3D head geometry could be estimated from a voice, applications such as avatar generation and character modeling for video games would become possible. Many researchers have reported on the relationship between the acoustic features of a voice and the corresponding dynamic visual features, including lip, tongue, and jaw movements or vocal articulation during speech; however, there have been few reports on the relationship between acoustic features and static 3D head geometry. In this paper, we focus on estimating 3D head geometry from a voice. Acoustic features vary with the speech context and its intonation, so we restrict the context to the five Japanese vowels. Under this assumption, we estimate 3D head geometry with a Feedforward Neural Network (FNN) trained on the correspondence between an individual's acoustic features, extracted from a Japanese vowel, and the 3D head geometry generated from a 3D range scan. The performance of our method is shown by both closed and open tests. As a result, we found that 3D head geometry that is acoustically similar to an input voice could be estimated under this limited condition.
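
The mapping described in the abstract can be illustrated with a minimal sketch of a one-hidden-layer feedforward network that regresses a geometry vector from an acoustic feature vector. This is not the authors' implementation: the abstract does not state which acoustic features, geometry parameterization, network size, or training procedure were used, so the dimensions, the tanh hidden layer, the mean-squared-error loss, and the synthetic data below are all assumptions made for illustration. "Closed test" is taken here to mean evaluation on the training speakers and "open test" evaluation on held-out speakers, which is also an assumption.

```python
import numpy as np

def init_fnn(n_in, n_hidden, n_out, rng):
    """Create weights for a one-hidden-layer feedforward network."""
    return {
        "W1": rng.normal(0.0, 0.1, (n_in, n_hidden)),
        "b1": np.zeros(n_hidden),
        "W2": rng.normal(0.0, 0.1, (n_hidden, n_out)),
        "b2": np.zeros(n_out),
    }

def forward(params, x):
    """Map acoustic vectors x (N, n_in) to geometry vectors (N, n_out)."""
    h = np.tanh(x @ params["W1"] + params["b1"])
    return h @ params["W2"] + params["b2"], h

def train(params, x, y, lr=0.1, epochs=2000):
    """Full-batch gradient descent on mean squared error."""
    losses = []
    for _ in range(epochs):
        pred, h = forward(params, x)
        err = pred - y
        losses.append(float(np.mean(err ** 2)))
        # Backpropagate the MSE gradient through both layers.
        g_out = 2.0 * err / err.size
        g_h = (g_out @ params["W2"].T) * (1.0 - h ** 2)
        params["W2"] -= lr * (h.T @ g_out)
        params["b2"] -= lr * g_out.sum(axis=0)
        params["W1"] -= lr * (x.T @ g_h)
        params["b1"] -= lr * g_h.sum(axis=0)
    return losses

def mse(params, x, y):
    """Mean squared error between predicted and reference geometry."""
    pred, _ = forward(params, x)
    return float(np.mean((pred - y) ** 2))

# Hypothetical sizes: 40 speakers, 12 acoustic features, 6 geometry parameters.
rng = np.random.default_rng(0)
N_SPEAKERS, N_ACOUSTIC, N_GEOMETRY = 40, 12, 6

# Synthetic stand-in data; real inputs would be features extracted from a
# vowel recording and geometry parameters derived from a 3D range scan.
acoustic = rng.normal(size=(N_SPEAKERS, N_ACOUSTIC))
hidden_map = rng.normal(size=(N_ACOUSTIC, N_GEOMETRY))
geometry = np.tanh(0.3 * (acoustic @ hidden_map))

# Hold out the last 10 speakers for the open test.
train_x, train_y = acoustic[:30], geometry[:30]
test_x, test_y = acoustic[30:], geometry[30:]

params = init_fnn(N_ACOUSTIC, 16, N_GEOMETRY, rng)
losses = train(params, train_x, train_y)

closed_error = mse(params, train_x, train_y)  # speakers seen in training
open_error = mse(params, test_x, test_y)      # speakers not seen in training
print(f"closed-test MSE: {closed_error:.4f}, open-test MSE: {open_error:.4f}")
```

Comparing the two errors shows how well such a mapping generalizes: a closed-test error much lower than the open-test error would suggest the network has memorized the training speakers rather than learned a transferable relationship.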

Original language: English
Title of host publication: ACM SIGGRAPH 2012 Posters, SIGGRAPH'12
Publication status: Published - Sep 6, 2012
Event: ACM Special Interest Group on Computer Graphics and Interactive Techniques Conference, SIGGRAPH'12 - Los Angeles, CA, United States
Duration: Aug 5, 2012 to Aug 9, 2012

Publication series

Name: ACM SIGGRAPH 2012 Posters, SIGGRAPH'12

Conference

Conference: ACM Special Interest Group on Computer Graphics and Interactive Techniques Conference, SIGGRAPH'12
Country/Territory: United States
City: Los Angeles, CA
Period: Aug 5, 2012 to Aug 9, 2012

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Computer Vision and Pattern Recognition
  • Software

