Region-of-interest based H.264 encoder for videophone with a hardware macroblock level face detector

Tianruo Zhang, Chen Liu, Minghui Wang, Satoshi Goto

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

9 Citations (Scopus)

Abstract

Region-of-interest (ROI) coding can be applied in an H.264 video encoder to enhance subjective quality and reduce computational complexity. Targeting a low-cost, real-time hardware encoder for videophones with faces as the ROI, this paper proposes a face detection algorithm that classifies each macroblock (MB) as part of a face or not. The algorithm uses a unique estimation-and-verification process and can be combined with an H.264 encoder through an MB-level pipeline architecture. It detects 97.91% of the MBs in faces. A VLSI architecture for the proposed face detection algorithm is designed, achieving an area of 4.3k gates and a power consumption of only 1.45 mW at 100 MHz. An ROI-based H.264 encoder with dynamic parameters is also proposed to enhance subjective quality and reduce rate-distortion optimization (RDO) complexity. The PSNR in the ROI increases by 4.8 dB at a similar bit rate, and encoding time is reduced to 54.4% in videophone-like sequences.
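As a rough illustration of how an MB-level face mask could drive encoder parameters, the sketch below maps a per-macroblock face/non-face decision to a quantization parameter (QP). The abstract does not say which parameters the proposed encoder adapts, so the function names, the choice of QP as the adapted parameter, and the offsets (`roi_delta`, `bg_delta`) are all assumptions made for illustration, not the authors' method. Only the 16x16 macroblock size and the 0 to 51 QP range for 8-bit video come from H.264 itself.

```python
from typing import List, Tuple

MB_SIZE = 16          # H.264 macroblocks cover 16x16 luma samples
QP_MIN, QP_MAX = 0, 51  # valid QP range for 8-bit H.264 video


def mb_grid(width: int, height: int) -> Tuple[int, int]:
    """Return (columns, rows) of macroblocks, rounding up for partial MBs."""
    cols = (width + MB_SIZE - 1) // MB_SIZE
    rows = (height + MB_SIZE - 1) // MB_SIZE
    return cols, rows


def clamp_qp(qp: int) -> int:
    """Keep a QP value inside the legal H.264 range."""
    return max(QP_MIN, min(QP_MAX, qp))


def roi_qp_map(
    face_mask: List[List[bool]],
    base_qp: int = 30,
    roi_delta: int = -4,  # hypothetical: finer quantization inside faces
    bg_delta: int = 2,    # hypothetical: coarser quantization elsewhere
) -> List[List[int]]:
    """Build a per-macroblock QP map from a face/non-face mask.

    face_mask[r][c] is True when the detector marks macroblock (r, c)
    as part of a face.
    """
    return [
        [clamp_qp(base_qp + (roi_delta if is_face else bg_delta))
         for is_face in row]
        for row in face_mask
    ]


if __name__ == "__main__":
    cols, rows = mb_grid(176, 144)  # QCIF, a common videophone resolution
    # Hypothetical mask: a small face region near the frame centre.
    mask = [[3 <= r <= 5 and 4 <= c <= 6 for c in range(cols)]
            for r in range(rows)]
    for row in roi_qp_map(mask):
        print(" ".join(f"{qp:2d}" for qp in row))
```

Because the mask is produced one macroblock at a time, a map like this can in principle be filled in the same MB-level pipeline stage that precedes encoding, which is the kind of coupling the abstract describes.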

Original language: English
Title of host publication: 2009 IEEE International Workshop on Multimedia Signal Processing, MMSP '09
Publication status: Published - 2009
Event: 2009 IEEE International Workshop on Multimedia Signal Processing, MMSP '09 - Rio De Janeiro
Duration: 2009 Oct 5 → 2009 Oct 7

Other

Other: 2009 IEEE International Workshop on Multimedia Signal Processing, MMSP '09
City: Rio De Janeiro
Period: 09/10/5 → 09/10/7

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
  • Signal Processing
