Video conferencing oriented low-complexity coarse-level mode-mapping based H.264/AVC to H.264/SVC spatial transcoding

Lei Sun, Jie Leng, Jia Su, Yiqing Huang, Hiroomi Motohashi, Takeshi Ikenaga

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

As an extension of H.264/AVC, Scalable Video Coding (SVC) provides flexible adaptation to heterogeneous networks and end-users, which provides great scalability for applications such as video conferencing. However, due to the existence of legacy H.264/AVC-based systems, transcoding between AVC and SVC becomes necessary. Currently there are few works done on AVC-to-SVC spatial transcoding, and re-encoding method involves high computational cost. This paper proposes a low-complexity coarse-level mode-mapping based AVC-to-SVC spatial transcoder for video conferencing applications. First, to omit unnecessary motion estimations (ME) for layers with reduced resolution, an ME skipping scheme based on AVC mode distribution is proposed with an adaptive search range. After that an adaptive coarse-level mode-mapping method is presented for fast mode decision. Finally, motion vector (MV) refinement is introduced for further lower-layer time reduction. As for the top layer, direct encapsulation is proposed to preserve better quality and another scheme involving inter-layer predictions is also provided for bandwidth-crucial applications. Simulation results show that proposed transcoder achieves up to 90.6% time reduction without significant coding efficiency loss compared to re-encoding method.

Original languageEnglish
Title of host publicationAPSIPA ASC 2011 - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011
Pages621-628
Number of pages8
Publication statusPublished - 2011
EventAsia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011, APSIPA ASC 2011 - Xi'an
Duration: 2011 Oct 182011 Oct 21

Other

OtherAsia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011, APSIPA ASC 2011
CityXi'an
Period11/10/1811/10/21

Fingerprint

Scalable video coding
Video conferencing
Motion estimation
Heterogeneous networks
Encapsulation
Scalability
Bandwidth
Costs

ASJC Scopus subject areas

  • Information Systems
  • Signal Processing

Cite this

Sun, L., Leng, J., Su, J., Huang, Y., Motohashi, H., & Ikenaga, T. (2011). Video conferencing oriented low-complexity coarse-level mode-mapping based H.264/AVC to H.264/SVC spatial transcoding. In APSIPA ASC 2011 - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011 (pp. 621-628)

Video conferencing oriented low-complexity coarse-level mode-mapping based H.264/AVC to H.264/SVC spatial transcoding. / Sun, Lei; Leng, Jie; Su, Jia; Huang, Yiqing; Motohashi, Hiroomi; Ikenaga, Takeshi.

APSIPA ASC 2011 - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011. 2011. p. 621-628.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Sun, L, Leng, J, Su, J, Huang, Y, Motohashi, H & Ikenaga, T 2011, Video conferencing oriented low-complexity coarse-level mode-mapping based H.264/AVC to H.264/SVC spatial transcoding. in APSIPA ASC 2011 - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011. pp. 621-628, Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011, APSIPA ASC 2011, Xi'an, 11/10/18.
Sun L, Leng J, Su J, Huang Y, Motohashi H, Ikenaga T. Video conferencing oriented low-complexity coarse-level mode-mapping based H.264/AVC to H.264/SVC spatial transcoding. In APSIPA ASC 2011 - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011. 2011. p. 621-628
Sun, Lei ; Leng, Jie ; Su, Jia ; Huang, Yiqing ; Motohashi, Hiroomi ; Ikenaga, Takeshi. / Video conferencing oriented low-complexity coarse-level mode-mapping based H.264/AVC to H.264/SVC spatial transcoding. APSIPA ASC 2011 - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011. 2011. pp. 621-628
@inproceedings{07f4fb31b52b4314846ea313b6b1ff6d,
title = "Video conferencing oriented low-complexity coarse-level mode-mapping based H.264/AVC to H.264/SVC spatial transcoding",
abstract = "As an extension of H.264/AVC, Scalable Video Coding (SVC) provides flexible adaptation to heterogeneous networks and end-users, which provides great scalability for applications such as video conferencing. However, due to the existence of legacy H.264/AVC-based systems, transcoding between AVC and SVC becomes necessary. Currently there are few works done on AVC-to-SVC spatial transcoding, and re-encoding method involves high computational cost. This paper proposes a low-complexity coarse-level mode-mapping based AVC-to-SVC spatial transcoder for video conferencing applications. First, to omit unnecessary motion estimations (ME) for layers with reduced resolution, an ME skipping scheme based on AVC mode distribution is proposed with an adaptive search range. After that an adaptive coarse-level mode-mapping method is presented for fast mode decision. Finally, motion vector (MV) refinement is introduced for further lower-layer time reduction. As for the top layer, direct encapsulation is proposed to preserve better quality and another scheme involving inter-layer predictions is also provided for bandwidth-crucial applications. Simulation results show that proposed transcoder achieves up to 90.6{\%} time reduction without significant coding efficiency loss compared to re-encoding method.",
author = "Lei Sun and Jie Leng and Jia Su and Yiqing Huang and Hiroomi Motohashi and Takeshi Ikenaga",
year = "2011",
language = "English",
pages = "621--628",
booktitle = "APSIPA ASC 2011 - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011",

}

TY - GEN

T1 - Video conferencing oriented low-complexity coarse-level mode-mapping based H.264/AVC to H.264/SVC spatial transcoding

AU - Sun, Lei

AU - Leng, Jie

AU - Su, Jia

AU - Huang, Yiqing

AU - Motohashi, Hiroomi

AU - Ikenaga, Takeshi

PY - 2011

Y1 - 2011

N2 - As an extension of H.264/AVC, Scalable Video Coding (SVC) provides flexible adaptation to heterogeneous networks and end-users, which provides great scalability for applications such as video conferencing. However, due to the existence of legacy H.264/AVC-based systems, transcoding between AVC and SVC becomes necessary. Currently there are few works done on AVC-to-SVC spatial transcoding, and re-encoding method involves high computational cost. This paper proposes a low-complexity coarse-level mode-mapping based AVC-to-SVC spatial transcoder for video conferencing applications. First, to omit unnecessary motion estimations (ME) for layers with reduced resolution, an ME skipping scheme based on AVC mode distribution is proposed with an adaptive search range. After that an adaptive coarse-level mode-mapping method is presented for fast mode decision. Finally, motion vector (MV) refinement is introduced for further lower-layer time reduction. As for the top layer, direct encapsulation is proposed to preserve better quality and another scheme involving inter-layer predictions is also provided for bandwidth-crucial applications. Simulation results show that proposed transcoder achieves up to 90.6% time reduction without significant coding efficiency loss compared to re-encoding method.

AB - As an extension of H.264/AVC, Scalable Video Coding (SVC) provides flexible adaptation to heterogeneous networks and end-users, which provides great scalability for applications such as video conferencing. However, due to the existence of legacy H.264/AVC-based systems, transcoding between AVC and SVC becomes necessary. Currently there are few works done on AVC-to-SVC spatial transcoding, and re-encoding method involves high computational cost. This paper proposes a low-complexity coarse-level mode-mapping based AVC-to-SVC spatial transcoder for video conferencing applications. First, to omit unnecessary motion estimations (ME) for layers with reduced resolution, an ME skipping scheme based on AVC mode distribution is proposed with an adaptive search range. After that an adaptive coarse-level mode-mapping method is presented for fast mode decision. Finally, motion vector (MV) refinement is introduced for further lower-layer time reduction. As for the top layer, direct encapsulation is proposed to preserve better quality and another scheme involving inter-layer predictions is also provided for bandwidth-crucial applications. Simulation results show that proposed transcoder achieves up to 90.6% time reduction without significant coding efficiency loss compared to re-encoding method.

UR - http://www.scopus.com/inward/record.url?scp=84866865153&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84866865153&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84866865153

SP - 621

EP - 628

BT - APSIPA ASC 2011 - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011

ER -