Speech-to-image media conversion based on VQ and neural network

Shigeo Morishima, Hiroshi Harashima

Research output: Conference contribution

4 Citations (Scopus)

Abstract

Automatic media conversion schemes from speech to a facial image and a construction of a real-time image synthesis system are presented. The purpose of this research is to realize an intelligent human-machine interface or intelligent communication system with synthesized human face images. A human face image is reconstructed on the display of a terminal using a 3-D surface model and texture mapping technique. Facial motion images are synthesized by transformation of the 3-D model. In the motion driving method, based on vector quantization and the neural network, the synthesized head image can appear to speak some given words and phrases naturally, in synchronization with voice signals from a speaker.

Original language: English
Title of host publication: Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
Editors: Anon
Publisher: Publ by IEEE
Pages: 2865-2868
Number of pages: 4
ISBN (Print): 078030033
Publication status: Published - 1 Dec 1991
Externally published: Yes
Event: Proceedings of the 1991 International Conference on Acoustics, Speech, and Signal Processing - ICASSP 91 - Toronto, Ont, Can
Duration: 14 May 1991 to 17 May 1991

Publication series

Name: Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
4
ISSN (Print): 0736-7791

Other

Other: Proceedings of the 1991 International Conference on Acoustics, Speech, and Signal Processing - ICASSP 91
City: Toronto, Ont, Can
Period: 91/5/14 to 91/5/17

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering
