Dynamic subtitle placement considering the region of interest and speaker location

Wataru Akahori, Tatsunori Hirai, Shigeo Morishima

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

3 Citations (Scopus)

Abstract

This paper presents a subtitle placement method that reduces unnecessary eye movements. Although a previous study discussed methods that vary the position of subtitles, such subtitles may overlap the region of interest (ROI). Therefore, we propose a dynamic subtitling method that uses eye-tracking data to prevent subtitles from overlapping important regions. The proposed method calculates the ROI from the eye-tracking data of multiple viewers and positions subtitles immediately under it, so that the subtitles do not overlap the ROI. Furthermore, we detect speakers in a scene based on audio and visual information and position subtitles near the speaker to help viewers recognize who is speaking. Experimental results show that the proposed method enables viewers to watch the ROI and the subtitles for a longer duration than traditional subtitles do, and that it is effective in enhancing the comfort and utility of the viewing experience.
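The placement idea in the abstract can be illustrated with a minimal sketch. The abstract does not state how the ROI is computed or how the subtitle box is sized, so everything here is an assumption made for illustration: the ROI is taken as the mean gaze position of all viewers plus or minus `k` standard deviations per axis, and the names `Box`, `estimate_roi`, `place_subtitle`, `k`, and `margin` are hypothetical, not taken from the paper.

```python
from dataclasses import dataclass
from statistics import mean, pstdev


@dataclass
class Box:
    left: float
    top: float
    right: float
    bottom: float


def estimate_roi(gaze_points, k=1.0):
    """Assumed ROI model: mean gaze position +/- k standard deviations per axis.

    gaze_points is a list of (x, y) pixel positions, one or more per viewer.
    """
    xs = [x for x, _ in gaze_points]
    ys = [y for _, y in gaze_points]
    cx, cy = mean(xs), mean(ys)
    sx, sy = pstdev(xs), pstdev(ys)
    return Box(cx - k * sx, cy - k * sy, cx + k * sx, cy + k * sy)


def place_subtitle(roi, sub_w, sub_h, frame_w, frame_h,
                   speaker_x=None, margin=10.0):
    """Put a sub_w x sub_h subtitle box just below the ROI.

    If a speaker's horizontal position is known, the box is centred on it;
    otherwise it is centred on the ROI. The box is then clamped to the frame.
    Note: when the ROI sits near the bottom edge, clamping can push the box
    back up into the ROI; this sketch does not handle that case.
    """
    anchor_x = speaker_x if speaker_x is not None else (roi.left + roi.right) / 2
    left = anchor_x - sub_w / 2
    top = roi.bottom + margin
    left = min(max(left, 0.0), frame_w - sub_w)
    top = min(max(top, 0.0), frame_h - sub_h)
    return Box(left, top, left + sub_w, top + sub_h)


if __name__ == "__main__":
    # Gaze samples from four hypothetical viewers on a 640x360 frame.
    points = [(100, 100), (120, 110), (110, 90), (130, 100)]
    roi = estimate_roi(points)
    print(place_subtitle(roi, 200, 40, 640, 360))
    print(place_subtitle(roi, 200, 40, 640, 360, speaker_x=300))
```

Centring on the speaker while keeping the vertical position tied to the ROI is only one way to combine the two cues; the paper may weigh them differently.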

Original language: English
Title of host publication: VISAPP
Editors: Francisco Imai, Alain Tremeau, Jose Braz
Publisher: SciTePress
Pages: 102-109
Number of pages: 8
ISBN (Electronic): 9789897582271
DOIs: https://doi.org/10.5220/0006262201020109
Publication status: Published - 2017 Jan 1
Externally published: Yes
Event: 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2017 - Porto, Portugal
Duration: 2017 Feb 27 to 2017 Mar 1

Publication series

Name: VISIGRAPP 2017 - Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
Volume: 6

Other

Other: 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2017
Country: Portugal
City: Porto
Period: 17/2/27 to 17/3/1

Keywords

  • Dynamic Subtitles
  • Eye-Tracking
  • Region of Interest
  • Speaker Detection
  • User Experience

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence


Cite this

    Akahori, W., Hirai, T., & Morishima, S. (2017). Dynamic subtitle placement considering the region of interest and speaker location. In F. Imai, A. Tremeau, & J. Braz (Eds.), VISAPP (pp. 102-109). (VISIGRAPP 2017 - Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications; Vol. 6). SciTePress. https://doi.org/10.5220/0006262201020109