Study on automated audio descriptions overlapping live television commentary

Manon Ichiki, Toshihiro Shimizu, Atsushi Imai, Tohru Takagi, Mamoru Iwabuchi, Kiyoshi Kurihara, Taro Miyazaki, Tadashi Kumano, Hiroyuki Kaneko, Shoei Sato, Nobumasa Seiyama, Yuko Yamanouchi, Hideki Sumiyoshi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We are conducting research on “automated audio description (AAD)” which automatically generates audio descriptions from real-time competition data for visually impaired people to enjoy live sports programs. However, there is a problem that AAD overlaps with the live television commentary voice, making it difficult to hear each other’s comment. In this paper, first, we show that the game situation is conveyed effectively when visually impaired persons listen to the AAD alone. Then we state the results of experiments on the following items to solve the overlap issue: (1) There is a difference in optimum volume level between live commentary and AAD, (2) The ease of listening differs depending on the difference in the characteristics of text-to-speech synthesizer for AAD, (3) Playing back AAD through a speaker placed differently from the TV speaker makes both voice sounds easier to listen to. We had clues to solve that depending on the presentation method of AAD, we can make AAD easy to listen to even when AAD overlaps the live television commentary.

Original languageEnglish
Title of host publicationComputers Helping People with Special Needs - 16th International Conference, ICCHP 2018, Proceedings
EditorsKlaus Miesenberger, Georgios Kouroupetroglou
PublisherSpringer-Verlag
Pages220-224
Number of pages5
ISBN (Print)9783319942766
DOIs
Publication statusPublished - 2018 Jan 1
Externally publishedYes
Event16th International Conference on Computers Helping People with Special Needs, ICCHP 2018 - Linz, Austria
Duration: 2018 Jul 112018 Jul 13

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10896 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th International Conference on Computers Helping People with Special Needs, ICCHP 2018
CountryAustria
CityLinz
Period18/7/1118/7/13

Fingerprint

Television
Overlapping
Overlap
Visually Impaired
Text-to-speech
Sports
Person
Acoustic waves
Game
Real-time
Experiment
Experiments
Voice

Keywords

  • Audio description
  • Text-to-speech synthesizer
  • Visually impaired

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Ichiki, M., Shimizu, T., Imai, A., Takagi, T., Iwabuchi, M., Kurihara, K., ... Sumiyoshi, H. (2018). Study on automated audio descriptions overlapping live television commentary. In K. Miesenberger, & G. Kouroupetroglou (Eds.), Computers Helping People with Special Needs - 16th International Conference, ICCHP 2018, Proceedings (pp. 220-224). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10896 LNCS). Springer-Verlag. https://doi.org/10.1007/978-3-319-94277-3_36

Study on automated audio descriptions overlapping live television commentary. / Ichiki, Manon; Shimizu, Toshihiro; Imai, Atsushi; Takagi, Tohru; Iwabuchi, Mamoru; Kurihara, Kiyoshi; Miyazaki, Taro; Kumano, Tadashi; Kaneko, Hiroyuki; Sato, Shoei; Seiyama, Nobumasa; Yamanouchi, Yuko; Sumiyoshi, Hideki.

Computers Helping People with Special Needs - 16th International Conference, ICCHP 2018, Proceedings. ed. / Klaus Miesenberger; Georgios Kouroupetroglou. Springer-Verlag, 2018. p. 220-224 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10896 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Ichiki, M, Shimizu, T, Imai, A, Takagi, T, Iwabuchi, M, Kurihara, K, Miyazaki, T, Kumano, T, Kaneko, H, Sato, S, Seiyama, N, Yamanouchi, Y & Sumiyoshi, H 2018, Study on automated audio descriptions overlapping live television commentary. in K Miesenberger & G Kouroupetroglou (eds), Computers Helping People with Special Needs - 16th International Conference, ICCHP 2018, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 10896 LNCS, Springer-Verlag, pp. 220-224, 16th International Conference on Computers Helping People with Special Needs, ICCHP 2018, Linz, Austria, 18/7/11. https://doi.org/10.1007/978-3-319-94277-3_36
Ichiki M, Shimizu T, Imai A, Takagi T, Iwabuchi M, Kurihara K et al. Study on automated audio descriptions overlapping live television commentary. In Miesenberger K, Kouroupetroglou G, editors, Computers Helping People with Special Needs - 16th International Conference, ICCHP 2018, Proceedings. Springer-Verlag. 2018. p. 220-224. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-319-94277-3_36
Ichiki, Manon ; Shimizu, Toshihiro ; Imai, Atsushi ; Takagi, Tohru ; Iwabuchi, Mamoru ; Kurihara, Kiyoshi ; Miyazaki, Taro ; Kumano, Tadashi ; Kaneko, Hiroyuki ; Sato, Shoei ; Seiyama, Nobumasa ; Yamanouchi, Yuko ; Sumiyoshi, Hideki. / Study on automated audio descriptions overlapping live television commentary. Computers Helping People with Special Needs - 16th International Conference, ICCHP 2018, Proceedings. editor / Klaus Miesenberger ; Georgios Kouroupetroglou. Springer-Verlag, 2018. pp. 220-224 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{c43a2abde6204d71905d658f854fc2df,
title = "Study on automated audio descriptions overlapping live television commentary",
abstract = "We are conducting research on “automated audio description (AAD)” which automatically generates audio descriptions from real-time competition data for visually impaired people to enjoy live sports programs. However, there is a problem that AAD overlaps with the live television commentary voice, making it difficult to hear each other’s comment. In this paper, first, we show that the game situation is conveyed effectively when visually impaired persons listen to the AAD alone. Then we state the results of experiments on the following items to solve the overlap issue: (1) There is a difference in optimum volume level between live commentary and AAD, (2) The ease of listening differs depending on the difference in the characteristics of text-to-speech synthesizer for AAD, (3) Playing back AAD through a speaker placed differently from the TV speaker makes both voice sounds easier to listen to. We had clues to solve that depending on the presentation method of AAD, we can make AAD easy to listen to even when AAD overlaps the live television commentary.",
keywords = "Audio description, Text-to-speech synthesizer, Visually impaired",
author = "Manon Ichiki and Toshihiro Shimizu and Atsushi Imai and Tohru Takagi and Mamoru Iwabuchi and Kiyoshi Kurihara and Taro Miyazaki and Tadashi Kumano and Hiroyuki Kaneko and Shoei Sato and Nobumasa Seiyama and Yuko Yamanouchi and Hideki Sumiyoshi",
year = "2018",
month = "1",
day = "1",
doi = "10.1007/978-3-319-94277-3_36",
language = "English",
isbn = "9783319942766",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer-Verlag",
pages = "220--224",
editor = "Klaus Miesenberger and Georgios Kouroupetroglou",
booktitle = "Computers Helping People with Special Needs - 16th International Conference, ICCHP 2018, Proceedings",

}

TY - GEN

T1 - Study on automated audio descriptions overlapping live television commentary

AU - Ichiki, Manon

AU - Shimizu, Toshihiro

AU - Imai, Atsushi

AU - Takagi, Tohru

AU - Iwabuchi, Mamoru

AU - Kurihara, Kiyoshi

AU - Miyazaki, Taro

AU - Kumano, Tadashi

AU - Kaneko, Hiroyuki

AU - Sato, Shoei

AU - Seiyama, Nobumasa

AU - Yamanouchi, Yuko

AU - Sumiyoshi, Hideki

PY - 2018/1/1

Y1 - 2018/1/1

N2 - We are conducting research on “automated audio description (AAD)” which automatically generates audio descriptions from real-time competition data for visually impaired people to enjoy live sports programs. However, there is a problem that AAD overlaps with the live television commentary voice, making it difficult to hear each other’s comment. In this paper, first, we show that the game situation is conveyed effectively when visually impaired persons listen to the AAD alone. Then we state the results of experiments on the following items to solve the overlap issue: (1) There is a difference in optimum volume level between live commentary and AAD, (2) The ease of listening differs depending on the difference in the characteristics of text-to-speech synthesizer for AAD, (3) Playing back AAD through a speaker placed differently from the TV speaker makes both voice sounds easier to listen to. We had clues to solve that depending on the presentation method of AAD, we can make AAD easy to listen to even when AAD overlaps the live television commentary.

AB - We are conducting research on “automated audio description (AAD)” which automatically generates audio descriptions from real-time competition data for visually impaired people to enjoy live sports programs. However, there is a problem that AAD overlaps with the live television commentary voice, making it difficult to hear each other’s comment. In this paper, first, we show that the game situation is conveyed effectively when visually impaired persons listen to the AAD alone. Then we state the results of experiments on the following items to solve the overlap issue: (1) There is a difference in optimum volume level between live commentary and AAD, (2) The ease of listening differs depending on the difference in the characteristics of text-to-speech synthesizer for AAD, (3) Playing back AAD through a speaker placed differently from the TV speaker makes both voice sounds easier to listen to. We had clues to solve that depending on the presentation method of AAD, we can make AAD easy to listen to even when AAD overlaps the live television commentary.

KW - Audio description

KW - Text-to-speech synthesizer

KW - Visually impaired

UR - http://www.scopus.com/inward/record.url?scp=85049774170&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85049774170&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-94277-3_36

DO - 10.1007/978-3-319-94277-3_36

M3 - Conference contribution

AN - SCOPUS:85049774170

SN - 9783319942766

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 220

EP - 224

BT - Computers Helping People with Special Needs - 16th International Conference, ICCHP 2018, Proceedings

A2 - Miesenberger, Klaus

A2 - Kouroupetroglou, Georgios

PB - Springer-Verlag

ER -