Estimation of speaking style in speech corpora focusing on speech transcriptions

Raymond Shen, Hideaki Kikuchi

研究成果: Conference contribution

抄録

Recent developments in computer technology have allowed the construction and widespread application of large-scale speech corpora. To foster ease of data retrieval for people interested in utilising these speech corpora, we attempt to characterise speaking style across some of them. In this paper, we first introduce the 3 scales of speaking style proposed by Eskenazi in 1993. We then use morphological features extracted from speech transcriptions that have proven effective in style discrimination and author identification in the field of natural language processing to construct an estimation model of speaking style. More specifically, we randomly choose transcriptions from various speech corpora as text stimuli with which to conduct a rating experiment on speaking style perception; then, using the features extracted from those stimuli and the rating results, we construct an estimation model of speaking style by a multi-regression analysis. After the cross validation (leave-1-out), the results show that among the 3 scales of speaking style, the ratings of 2 scales can be estimated with high accuracies, which prove the effectiveness of our method in the estimation of speaking style.

本文言語English
ホスト出版物のタイトルProceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014
編集者Nicoletta Calzolari, Khalid Choukri, Sara Goggi, Thierry Declerck, Joseph Mariani, Bente Maegaard, Asuncion Moreno, Jan Odijk, Helene Mazo, Stelios Piperidis, Hrafn Loftsson
出版社European Language Resources Association (ELRA)
ページ2747-2752
ページ数6
ISBN(電子版)9782951740884
出版ステータスPublished - 2014
イベント9th International Conference on Language Resources and Evaluation, LREC 2014 - Reykjavik, Iceland
継続期間: 2014 5月 262014 5月 31

出版物シリーズ

名前Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014

Other

Other9th International Conference on Language Resources and Evaluation, LREC 2014
国/地域Iceland
CityReykjavik
Period14/5/2614/5/31

ASJC Scopus subject areas

  • 言語学および言語
  • 図書館情報学
  • 教育
  • 言語および言語学

フィンガープリント

「Estimation of speaking style in speech corpora focusing on speech transcriptions」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル