Low-complexity coarse-level mode-mapping based H.264/AVC to H.264/SVC spatial transcoding for video conferencing

Lei Sun, Jie Leng, Jia Su, Yiqing Huang, Hiroomi Motohashi, Takeshi Ikenaga

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

Scalable Video Coding (SVC) was standardized as an extension of H.264/AVC with the intention to provide flexible adaptation to heterogeneous networks and different end-user requirements, which provides great scalability in multi-point applications such as video conferencing. However, due to the existence of H.264/AVC-based systems, transcoding between AVC and SVC becomes necessary. Most existing works focus on temporal transcoding, quality transcoding or SVC-to-AVC spatial transcoding while the straightforward re-encoding method requires high computational cost. This paper proposes a low-complexity AVC-to-SVC spatial transcoder based on coarse-level mode mapping for video conferencing scenes. First, to omit unnecessary motion estimations (ME) for layers with reduced resolution, an ME skipping scheme based on AVC mode distribution is proposed with an adaptive search range. Then a probabilityprofile based scheme is proposed for further mode skipping. After that 3 coarse-level mode-mapping methods are presented for fast mode decision and the adaptive usage of the 3 methods is discussed. Finally, motion vector (MV) refinement is introduced for further lower-layer time reduction. As for the top layer, direct encapsulation is proposed to preserve better quality and another scheme involving inter-layer predictions is also provided for bandwidth-crucial applications. Simulation results show that proposed transcoder achieves up to 92.6% time reduction without significant coding efficiency loss compared to re-encoding method.

Original languageEnglish
Pages (from-to)1313-1323
Number of pages11
JournalIEICE Transactions on Information and Systems
VolumeE95-D
Issue number5
DOIs
Publication statusPublished - 2012 May

Fingerprint

Scalable video coding
Video conferencing
Motion estimation
Heterogeneous networks
Encapsulation
Scalability
Bandwidth
Costs

Keywords

  • AVC-SVC transcoding
  • Lowcomplexity
  • Mode-mapping
  • Spatial scalability
  • Video conferencing

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Software
  • Artificial Intelligence
  • Hardware and Architecture
  • Computer Vision and Pattern Recognition

Cite this

Low-complexity coarse-level mode-mapping based H.264/AVC to H.264/SVC spatial transcoding for video conferencing. / Sun, Lei; Leng, Jie; Su, Jia; Huang, Yiqing; Motohashi, Hiroomi; Ikenaga, Takeshi.

In: IEICE Transactions on Information and Systems, Vol. E95-D, No. 5, 05.2012, p. 1313-1323.

Research output: Contribution to journalArticle

Sun, Lei ; Leng, Jie ; Su, Jia ; Huang, Yiqing ; Motohashi, Hiroomi ; Ikenaga, Takeshi. / Low-complexity coarse-level mode-mapping based H.264/AVC to H.264/SVC spatial transcoding for video conferencing. In: IEICE Transactions on Information and Systems. 2012 ; Vol. E95-D, No. 5. pp. 1313-1323.
@article{1cff5d1e415c4501a9b607b504966b2b,
title = "Low-complexity coarse-level mode-mapping based H.264/AVC to H.264/SVC spatial transcoding for video conferencing",
abstract = "Scalable Video Coding (SVC) was standardized as an extension of H.264/AVC with the intention to provide flexible adaptation to heterogeneous networks and different end-user requirements, which provides great scalability in multi-point applications such as video conferencing. However, due to the existence of H.264/AVC-based systems, transcoding between AVC and SVC becomes necessary. Most existing works focus on temporal transcoding, quality transcoding or SVC-to-AVC spatial transcoding while the straightforward re-encoding method requires high computational cost. This paper proposes a low-complexity AVC-to-SVC spatial transcoder based on coarse-level mode mapping for video conferencing scenes. First, to omit unnecessary motion estimations (ME) for layers with reduced resolution, an ME skipping scheme based on AVC mode distribution is proposed with an adaptive search range. Then a probabilityprofile based scheme is proposed for further mode skipping. After that 3 coarse-level mode-mapping methods are presented for fast mode decision and the adaptive usage of the 3 methods is discussed. Finally, motion vector (MV) refinement is introduced for further lower-layer time reduction. As for the top layer, direct encapsulation is proposed to preserve better quality and another scheme involving inter-layer predictions is also provided for bandwidth-crucial applications. Simulation results show that proposed transcoder achieves up to 92.6{\%} time reduction without significant coding efficiency loss compared to re-encoding method.",
keywords = "AVC-SVC transcoding, Lowcomplexity, Mode-mapping, Spatial scalability, Video conferencing",
author = "Lei Sun and Jie Leng and Jia Su and Yiqing Huang and Hiroomi Motohashi and Takeshi Ikenaga",
year = "2012",
month = "5",
doi = "10.1587/transinf.E95.D.1313",
language = "English",
volume = "E95-D",
pages = "1313--1323",
journal = "IEICE Transactions on Information and Systems",
issn = "0916-8532",
publisher = "Maruzen Co., Ltd/Maruzen Kabushikikaisha",
number = "5",

}

TY - JOUR

T1 - Low-complexity coarse-level mode-mapping based H.264/AVC to H.264/SVC spatial transcoding for video conferencing

AU - Sun, Lei

AU - Leng, Jie

AU - Su, Jia

AU - Huang, Yiqing

AU - Motohashi, Hiroomi

AU - Ikenaga, Takeshi

PY - 2012/5

Y1 - 2012/5

N2 - Scalable Video Coding (SVC) was standardized as an extension of H.264/AVC with the intention to provide flexible adaptation to heterogeneous networks and different end-user requirements, which provides great scalability in multi-point applications such as video conferencing. However, due to the existence of H.264/AVC-based systems, transcoding between AVC and SVC becomes necessary. Most existing works focus on temporal transcoding, quality transcoding or SVC-to-AVC spatial transcoding while the straightforward re-encoding method requires high computational cost. This paper proposes a low-complexity AVC-to-SVC spatial transcoder based on coarse-level mode mapping for video conferencing scenes. First, to omit unnecessary motion estimations (ME) for layers with reduced resolution, an ME skipping scheme based on AVC mode distribution is proposed with an adaptive search range. Then a probabilityprofile based scheme is proposed for further mode skipping. After that 3 coarse-level mode-mapping methods are presented for fast mode decision and the adaptive usage of the 3 methods is discussed. Finally, motion vector (MV) refinement is introduced for further lower-layer time reduction. As for the top layer, direct encapsulation is proposed to preserve better quality and another scheme involving inter-layer predictions is also provided for bandwidth-crucial applications. Simulation results show that proposed transcoder achieves up to 92.6% time reduction without significant coding efficiency loss compared to re-encoding method.

AB - Scalable Video Coding (SVC) was standardized as an extension of H.264/AVC with the intention to provide flexible adaptation to heterogeneous networks and different end-user requirements, which provides great scalability in multi-point applications such as video conferencing. However, due to the existence of H.264/AVC-based systems, transcoding between AVC and SVC becomes necessary. Most existing works focus on temporal transcoding, quality transcoding or SVC-to-AVC spatial transcoding while the straightforward re-encoding method requires high computational cost. This paper proposes a low-complexity AVC-to-SVC spatial transcoder based on coarse-level mode mapping for video conferencing scenes. First, to omit unnecessary motion estimations (ME) for layers with reduced resolution, an ME skipping scheme based on AVC mode distribution is proposed with an adaptive search range. Then a probabilityprofile based scheme is proposed for further mode skipping. After that 3 coarse-level mode-mapping methods are presented for fast mode decision and the adaptive usage of the 3 methods is discussed. Finally, motion vector (MV) refinement is introduced for further lower-layer time reduction. As for the top layer, direct encapsulation is proposed to preserve better quality and another scheme involving inter-layer predictions is also provided for bandwidth-crucial applications. Simulation results show that proposed transcoder achieves up to 92.6% time reduction without significant coding efficiency loss compared to re-encoding method.

KW - AVC-SVC transcoding

KW - Lowcomplexity

KW - Mode-mapping

KW - Spatial scalability

KW - Video conferencing

UR - http://www.scopus.com/inward/record.url?scp=84860642314&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84860642314&partnerID=8YFLogxK

U2 - 10.1587/transinf.E95.D.1313

DO - 10.1587/transinf.E95.D.1313

M3 - Article

AN - SCOPUS:84860642314

VL - E95-D

SP - 1313

EP - 1323

JO - IEICE Transactions on Information and Systems

JF - IEICE Transactions on Information and Systems

SN - 0916-8532

IS - 5

ER -