Fast H.264/AVC DIRECT mode decision based on mode selection and predicted rate-distortion cost

Xiaocong Jin, Jun Sun, Yiqing Huang, Jia Su, Takeshi Ikenaga

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

Different encoding modes for variable block size are available in the H.264/AVC standard in order to offer better coding quality. However, this also introduces huge computation time due to the exhaustive check for all modes. In this paper, a fast spatial DIRECT mode decision method for profiles supporting B frame encoding (main profile, high profile, etc.) in H.264/AVC is proposed. Statistical analysis on multiple video sequences is carried out, and the strong relationship of mode selection and rate-distortion (RD) cost between the current DIRECT macroblock (MB) and the co-located MBs is observed. With the check of mode condition, predicted RD cost threshold and dynamic parameter update model, the complex mode decision process can be terminated at an early stage even for small QP cases. Simulation results demonstrate the proposed method can achieve much better performance than the original exhaustive ratedistortion optimization (RDO) based mode decision algorithm by reducing up to 56.8% of encoding time for IBPBP picture group and up to 67.8% of encoding time for IBBPBBP picture group while incurring only negligible bit increment and quality degradation.

Original languageEnglish
Pages (from-to)1653-1662
Number of pages10
JournalIEICE Transactions on Information and Systems
VolumeE94-D
Issue number8
DOIs
Publication statusPublished - 2011 Aug

Fingerprint

Costs
Statistical methods
Degradation

Keywords

  • Adaptive
  • Fast mode decision
  • H.264/AVC
  • Spatial DIRECT

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Software
  • Artificial Intelligence
  • Hardware and Architecture
  • Computer Vision and Pattern Recognition

Cite this

Fast H.264/AVC DIRECT mode decision based on mode selection and predicted rate-distortion cost. / Jin, Xiaocong; Sun, Jun; Huang, Yiqing; Su, Jia; Ikenaga, Takeshi.

In: IEICE Transactions on Information and Systems, Vol. E94-D, No. 8, 08.2011, p. 1653-1662.

Research output: Contribution to journalArticle

@article{a8bf5b8b3eb743c98cf8f35c9f93627f,
title = "Fast H.264/AVC DIRECT mode decision based on mode selection and predicted rate-distortion cost",
abstract = "Different encoding modes for variable block size are available in the H.264/AVC standard in order to offer better coding quality. However, this also introduces huge computation time due to the exhaustive check for all modes. In this paper, a fast spatial DIRECT mode decision method for profiles supporting B frame encoding (main profile, high profile, etc.) in H.264/AVC is proposed. Statistical analysis on multiple video sequences is carried out, and the strong relationship of mode selection and rate-distortion (RD) cost between the current DIRECT macroblock (MB) and the co-located MBs is observed. With the check of mode condition, predicted RD cost threshold and dynamic parameter update model, the complex mode decision process can be terminated at an early stage even for small QP cases. Simulation results demonstrate the proposed method can achieve much better performance than the original exhaustive ratedistortion optimization (RDO) based mode decision algorithm by reducing up to 56.8{\%} of encoding time for IBPBP picture group and up to 67.8{\%} of encoding time for IBBPBBP picture group while incurring only negligible bit increment and quality degradation.",
keywords = "Adaptive, Fast mode decision, H.264/AVC, Spatial DIRECT",
author = "Xiaocong Jin and Jun Sun and Yiqing Huang and Jia Su and Takeshi Ikenaga",
year = "2011",
month = "8",
doi = "10.1587/transinf.E94.D.1653",
language = "English",
volume = "E94-D",
pages = "1653--1662",
journal = "IEICE Transactions on Information and Systems",
issn = "0916-8532",
publisher = "Maruzen Co., Ltd/Maruzen Kabushikikaisha",
number = "8",

}

TY - JOUR

T1 - Fast H.264/AVC DIRECT mode decision based on mode selection and predicted rate-distortion cost

AU - Jin, Xiaocong

AU - Sun, Jun

AU - Huang, Yiqing

AU - Su, Jia

AU - Ikenaga, Takeshi

PY - 2011/8

Y1 - 2011/8

N2 - Different encoding modes for variable block size are available in the H.264/AVC standard in order to offer better coding quality. However, this also introduces huge computation time due to the exhaustive check for all modes. In this paper, a fast spatial DIRECT mode decision method for profiles supporting B frame encoding (main profile, high profile, etc.) in H.264/AVC is proposed. Statistical analysis on multiple video sequences is carried out, and the strong relationship of mode selection and rate-distortion (RD) cost between the current DIRECT macroblock (MB) and the co-located MBs is observed. With the check of mode condition, predicted RD cost threshold and dynamic parameter update model, the complex mode decision process can be terminated at an early stage even for small QP cases. Simulation results demonstrate the proposed method can achieve much better performance than the original exhaustive ratedistortion optimization (RDO) based mode decision algorithm by reducing up to 56.8% of encoding time for IBPBP picture group and up to 67.8% of encoding time for IBBPBBP picture group while incurring only negligible bit increment and quality degradation.

AB - Different encoding modes for variable block size are available in the H.264/AVC standard in order to offer better coding quality. However, this also introduces huge computation time due to the exhaustive check for all modes. In this paper, a fast spatial DIRECT mode decision method for profiles supporting B frame encoding (main profile, high profile, etc.) in H.264/AVC is proposed. Statistical analysis on multiple video sequences is carried out, and the strong relationship of mode selection and rate-distortion (RD) cost between the current DIRECT macroblock (MB) and the co-located MBs is observed. With the check of mode condition, predicted RD cost threshold and dynamic parameter update model, the complex mode decision process can be terminated at an early stage even for small QP cases. Simulation results demonstrate the proposed method can achieve much better performance than the original exhaustive ratedistortion optimization (RDO) based mode decision algorithm by reducing up to 56.8% of encoding time for IBPBP picture group and up to 67.8% of encoding time for IBBPBBP picture group while incurring only negligible bit increment and quality degradation.

KW - Adaptive

KW - Fast mode decision

KW - H.264/AVC

KW - Spatial DIRECT

UR - http://www.scopus.com/inward/record.url?scp=79961087154&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79961087154&partnerID=8YFLogxK

U2 - 10.1587/transinf.E94.D.1653

DO - 10.1587/transinf.E94.D.1653

M3 - Article

AN - SCOPUS:79961087154

VL - E94-D

SP - 1653

EP - 1662

JO - IEICE Transactions on Information and Systems

JF - IEICE Transactions on Information and Systems

SN - 0916-8532

IS - 8

ER -