A drift-constrained frequency-domain ultra-low-delay H.264/SVC to H.264/AVC transcoder with medium-grain quality scalability for videoconferencing

Lei Sun, Zhenyu Liu, Takeshi Ikenaga

Research output: Contribution to journalArticle

Abstract

Scalable Video Coding (SVC) is an extension of H.264/ AVC, aiming to provide the ability to adapt to heterogeneous networks or requirements. It offers great flexibility for bitstream adaptation in multipoint applications such as videoconferencing. However, transcoding between SVC and AVC is necessary due to the existence of legacy AVC-based systems. The straightforward re-encoding method requires great computational cost, and delay-sensitive applications like videoconferencing require much faster transcoding scheme. This paper proposes an ultra-lowdelay SVC-to-AVC MGS (Medium-Grain quality Scalability) transcoder for videoconferencing applications. Transcoding is performed in pure frequency domain with partial decoding/encoding in order to achieve significant speed-up. Three fast transcoding methods in frequency domain are proposed for macroblocks with different coding modes in non-KEY pictures. KEY pictures are transcoded by reusing the base layer motion data, and error propagation is constrained between KEY pictures. Simulation results show that proposed transcoder achieves averagely 38.5 times speed-up compared with the re-encoding method, while introducing merely 0.71 dB BDPSNR coding quality loss for videoconferencing sequences as compared with the re-encoding algorithm.

Original languageEnglish
Pages (from-to)1253-1263
Number of pages11
JournalIEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
VolumeE96-A
Issue number6
DOIs
Publication statusPublished - 2013 Jun

Fingerprint

Transcoding
Scalable video coding
Video Coding
Frequency Domain
Scalability
Encoding
Speedup
Coding
Heterogeneous networks
Decoding
Error Propagation
Heterogeneous Networks
Computational Cost
Flexibility
Partial
Necessary
Motion
Costs
Requirements
Simulation

Keywords

  • Frequency domain
  • Low delay
  • MGS scalability
  • SVC-to-AVC transcoding
  • Videoconferencing

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Computer Graphics and Computer-Aided Design
  • Applied Mathematics
  • Signal Processing

Cite this

@article{9ad1c628d9a04b019de9c4c6f0fa461e,
title = "A drift-constrained frequency-domain ultra-low-delay H.264/SVC to H.264/AVC transcoder with medium-grain quality scalability for videoconferencing",
abstract = "Scalable Video Coding (SVC) is an extension of H.264/ AVC, aiming to provide the ability to adapt to heterogeneous networks or requirements. It offers great flexibility for bitstream adaptation in multipoint applications such as videoconferencing. However, transcoding between SVC and AVC is necessary due to the existence of legacy AVC-based systems. The straightforward re-encoding method requires great computational cost, and delay-sensitive applications like videoconferencing require much faster transcoding scheme. This paper proposes an ultra-lowdelay SVC-to-AVC MGS (Medium-Grain quality Scalability) transcoder for videoconferencing applications. Transcoding is performed in pure frequency domain with partial decoding/encoding in order to achieve significant speed-up. Three fast transcoding methods in frequency domain are proposed for macroblocks with different coding modes in non-KEY pictures. KEY pictures are transcoded by reusing the base layer motion data, and error propagation is constrained between KEY pictures. Simulation results show that proposed transcoder achieves averagely 38.5 times speed-up compared with the re-encoding method, while introducing merely 0.71 dB BDPSNR coding quality loss for videoconferencing sequences as compared with the re-encoding algorithm.",
keywords = "Frequency domain, Low delay, MGS scalability, SVC-to-AVC transcoding, Videoconferencing",
author = "Lei Sun and Zhenyu Liu and Takeshi Ikenaga",
year = "2013",
month = "6",
doi = "10.1587/transfun.E96.A.1253",
language = "English",
volume = "E96-A",
pages = "1253--1263",
journal = "IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences",
issn = "0916-8508",
publisher = "Maruzen Co., Ltd/Maruzen Kabushikikaisha",
number = "6",

}

TY - JOUR

T1 - A drift-constrained frequency-domain ultra-low-delay H.264/SVC to H.264/AVC transcoder with medium-grain quality scalability for videoconferencing

AU - Sun, Lei

AU - Liu, Zhenyu

AU - Ikenaga, Takeshi

PY - 2013/6

Y1 - 2013/6

N2 - Scalable Video Coding (SVC) is an extension of H.264/ AVC, aiming to provide the ability to adapt to heterogeneous networks or requirements. It offers great flexibility for bitstream adaptation in multipoint applications such as videoconferencing. However, transcoding between SVC and AVC is necessary due to the existence of legacy AVC-based systems. The straightforward re-encoding method requires great computational cost, and delay-sensitive applications like videoconferencing require much faster transcoding scheme. This paper proposes an ultra-lowdelay SVC-to-AVC MGS (Medium-Grain quality Scalability) transcoder for videoconferencing applications. Transcoding is performed in pure frequency domain with partial decoding/encoding in order to achieve significant speed-up. Three fast transcoding methods in frequency domain are proposed for macroblocks with different coding modes in non-KEY pictures. KEY pictures are transcoded by reusing the base layer motion data, and error propagation is constrained between KEY pictures. Simulation results show that proposed transcoder achieves averagely 38.5 times speed-up compared with the re-encoding method, while introducing merely 0.71 dB BDPSNR coding quality loss for videoconferencing sequences as compared with the re-encoding algorithm.

AB - Scalable Video Coding (SVC) is an extension of H.264/ AVC, aiming to provide the ability to adapt to heterogeneous networks or requirements. It offers great flexibility for bitstream adaptation in multipoint applications such as videoconferencing. However, transcoding between SVC and AVC is necessary due to the existence of legacy AVC-based systems. The straightforward re-encoding method requires great computational cost, and delay-sensitive applications like videoconferencing require much faster transcoding scheme. This paper proposes an ultra-lowdelay SVC-to-AVC MGS (Medium-Grain quality Scalability) transcoder for videoconferencing applications. Transcoding is performed in pure frequency domain with partial decoding/encoding in order to achieve significant speed-up. Three fast transcoding methods in frequency domain are proposed for macroblocks with different coding modes in non-KEY pictures. KEY pictures are transcoded by reusing the base layer motion data, and error propagation is constrained between KEY pictures. Simulation results show that proposed transcoder achieves averagely 38.5 times speed-up compared with the re-encoding method, while introducing merely 0.71 dB BDPSNR coding quality loss for videoconferencing sequences as compared with the re-encoding algorithm.

KW - Frequency domain

KW - Low delay

KW - MGS scalability

KW - SVC-to-AVC transcoding

KW - Videoconferencing

UR - http://www.scopus.com/inward/record.url?scp=84878547614&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84878547614&partnerID=8YFLogxK

U2 - 10.1587/transfun.E96.A.1253

DO - 10.1587/transfun.E96.A.1253

M3 - Article

AN - SCOPUS:84878547614

VL - E96-A

SP - 1253

EP - 1263

JO - IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences

JF - IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences

SN - 0916-8508

IS - 6

ER -