Motion estimation optimization for H.264/AVC using source image edge features

Zhenyu Liu, Junwei Zhou, Satoshi Goto, Takeshi Ikenaga

Research output: Contribution to journal › Article

12 Citations (Scopus)

Abstract

The H.264/AVC coding standard uses variable-block-size motion-compensated prediction with multiple reference frames to achieve a pronounced improvement in compression efficiency. Accordingly, the computational cost of motion estimation increases in proportion to the product of the number of reference frames and the number of inter modes. The mathematical analysis in this paper shows that motion-compensated prediction errors are mainly determined by the detailed textures in the source image. An image block rich in texture contains many high-frequency components, which makes the variable-block-size and multiple-reference-frame techniques essential. On the basis of rate-distortion theory, the spatial homogeneity of an image block is defined as a relative concept with respect to the current quantization step. For a homogeneous block, the futile reference frames and inter modes can be eliminated efficiently. It is further shown that the sum of absolute differences (SAD) of an image block is mainly determined by the sum of its edge gradient amplitudes and the current quantization step. Consequently, an image-content-based early termination algorithm is proposed, and it outperforms the original method adopted by the JVT reference software. Moreover, a dynamic search range algorithm based on the edge gradient amplitude of the source image block is analyzed. One notable advantage of the proposed edge-based algorithms is their suitability for macroblock-pipelined architectures; another is their orthogonality to fast block-matching algorithms. Experimental results show that when these algorithms are integrated with the hybrid unsymmetrical-cross multi-hexagon-grid search, 31.4-60.0% of motion estimation time can be saved on average, while the average BDPSNR loss is 0.0497 dB over all tested sequences.
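
The following is a minimal sketch of the idea described in the abstract, not the paper's actual algorithm. It assumes a simple gradient measure (absolute differences between neighbouring pixels), a hypothetical homogeneity rule (average gradient per pixel below `k` times the quantization step), and a hypothetical pruning policy that keeps a single reference frame and inter mode for homogeneous blocks. The function names, the constant `k`, and the example values of 5 reference frames and 7 inter modes are illustrative assumptions.

```python
# Illustrative sketch only: the gradient operator, the threshold rule, and
# the constant k are assumptions, not the formulation used in the paper.

def edge_gradient_sum(block):
    """Sum of absolute differences between horizontally and vertically
    adjacent pixels in a 2-D block (list of equal-length rows)."""
    rows, cols = len(block), len(block[0])
    total = 0
    for y in range(rows):
        for x in range(cols):
            if x + 1 < cols:
                total += abs(block[y][x + 1] - block[y][x])
            if y + 1 < rows:
                total += abs(block[y + 1][x] - block[y][x])
    return total


def is_homogeneous(block, qstep, k=1.0):
    """Hypothetical rule: a block is homogeneous relative to the current
    quantization step if its average gradient per pixel is below k * qstep."""
    pixels = len(block) * len(block[0])
    return edge_gradient_sum(block) / pixels < k * qstep


def candidate_search_count(block, qstep, num_ref_frames, num_inter_modes):
    """Number of (reference frame, inter mode) pairs left to search.

    Without pruning, the cost grows with the product of the two counts.
    Assumed pruning: a homogeneous block keeps only one pair.
    """
    if is_homogeneous(block, qstep):
        return 1
    return num_ref_frames * num_inter_modes


# A flat 16x16 macroblock and a checkerboard-textured one.
flat = [[128] * 16 for _ in range(16)]
checker = [[255 if (x + y) % 2 else 0 for x in range(16)] for y in range(16)]

for name, mb in (("flat", flat), ("checker", checker)):
    print(name, edge_gradient_sum(mb), candidate_search_count(mb, 10.0, 5, 7))
```

Because the threshold scales with the quantization step, the same block can count as textured at a fine quantization step and as homogeneous at a coarse one, which is the sense in which homogeneity is relative.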

Original language: English
Article number: 4914856
Pages (from-to): 1095-1107
Number of pages: 13
Journal: IEEE Transactions on Circuits and Systems for Video Technology
Volume: 19
Issue number: 8
DOI: 10.1109/TCSVT.2009.2022796
Publication status: Published - 2009 Aug

Fingerprint

Motion estimation
Textures

Keywords

  • Edge gradient
  • Fast mode decision
  • H.264/AVC
  • Motion estimation (ME)
  • Multiple reference frame (MRF)
  • Variable block size (VBS)

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Media Technology

Cite this

Motion estimation optimization for H.264/AVC using source image edge features. / Liu, Zhenyu; Zhou, Junwei; Goto, Satoshi; Ikenaga, Takeshi.

In: IEEE Transactions on Circuits and Systems for Video Technology, Vol. 19, No. 8, 4914856, 08.2009, p. 1095-1107.

@article{1b8e29e7f79c4986b7c5ab9e4925b149,
title = "Motion estimation optimization for H.264/AVC using source image edge features",
keywords = "Edge gradient, Fast mode decision, H.264/AVC, Motion estimation (ME), Multiple reference frame (MRF), Variable block size (VBS)",
author = "Zhenyu Liu and Junwei Zhou and Satoshi Goto and Takeshi Ikenaga",
year = "2009",
month = "8",
doi = "10.1109/TCSVT.2009.2022796",
language = "English",
volume = "19",
pages = "1095--1107",
journal = "IEEE Transactions on Circuits and Systems for Video Technology",
issn = "1051-8215",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "8",

}

TY - JOUR

T1 - Motion estimation optimization for H.264/AVC using source image edge features

AU - Liu, Zhenyu

AU - Zhou, Junwei

AU - Goto, Satoshi

AU - Ikenaga, Takeshi

PY - 2009/8

Y1 - 2009/8

KW - Edge gradient

KW - Fast mode decision

KW - H.264/AVC

KW - Motion estimation (ME)

KW - Multiple reference frame (MRF)

KW - Variable block size (VBS)

UR - http://www.scopus.com/inward/record.url?scp=69449108169&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=69449108169&partnerID=8YFLogxK

U2 - 10.1109/TCSVT.2009.2022796

DO - 10.1109/TCSVT.2009.2022796

M3 - Article

VL - 19

SP - 1095

EP - 1107

JO - IEEE Transactions on Circuits and Systems for Video Technology

JF - IEEE Transactions on Circuits and Systems for Video Technology

SN - 1051-8215

IS - 8

M1 - 4914856

ER -