Online meeting recognizer with multichannel speaker diarization

Shoko Araki, Takaaki Hori, Masakiyo Fujimoto, Shinji Watanabe, Takuya Yoshioka, Tomohiro Nakatani, Atsushi Nakamura

研究成果: Conference contribution

10 被引用数 (Scopus)

抄録

We present our newly developed real-time conversation analyzer for group meetings. The goal of the system is to estimate automatically "who speaks when and what" in an online manner. In our system, "who speaks when" information is first obtained by estimating the directions of arrival (DOAs) of signals. Then, "who speaks what" is estimated with our automatic speech recognition (ASR) system, after suppressing reverberation, background noise, and interference speakers' voices. In this paper, we focus particularly on the speaker diarization ("who speaks when" estimation) method, and we show that the speaker diarization information helps the ASR to reduce insertion errors.

本文言語English
ホスト出版物のタイトルConference Record of the 44th Asilomar Conference on Signals, Systems and Computers, Asilomar 2010
ページ1697-1701
ページ数5
DOI
出版ステータスPublished - 2010 12 1
外部発表はい
イベント44th Asilomar Conference on Signals, Systems and Computers, Asilomar 2010 - Pacific Grove, CA, United States
継続期間: 2010 11 72010 11 10

出版物シリーズ

名前Conference Record - Asilomar Conference on Signals, Systems and Computers
ISSN(印刷版)1058-6393

Other

Other44th Asilomar Conference on Signals, Systems and Computers, Asilomar 2010
CountryUnited States
CityPacific Grove, CA
Period10/11/710/11/10

ASJC Scopus subject areas

  • Signal Processing
  • Computer Networks and Communications

フィンガープリント 「Online meeting recognizer with multichannel speaker diarization」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル