Region-of-interest based H.264 encoding parameter allocation for low power video communication

Minghui Wang, Tianruo Zhang, Chen Liu, Satoshi Goto

Research output: Chapter in Book/Report/Conference proceedingConference contribution

7 Citations (Scopus)

Abstract

H.264 is the state-of-the-art in modern video compression standards. Its extremely high compression ratio meets the requirements the video communication between portable devices. Since the power is limited in portable devices, the huge computation of H.264 is a critical problem for hardware design. According to the human visual system (HVS) research, human vision is only able to focus on one area in a frame, which is defined as region-of-interest (ROI). In most cases, human face attracts the most attention of the device user. This phenomenon gives a chance to code all macroblocks unequally. In this work, the ROI is detected with an encoder-oriented fast algorithm, using chrominance and texture contrast features. After the ROI is detected, the encoder will allocated the coding parameters respectively in ROI and non-ROI. As a result, it keeps fine quality in ROI, saves much throughput in non-ROI, and greatly reduces the computation. The ROI detector and the encoder are also designed to be decoding-friendly and hardware-friendly.

Original languageEnglish
Title of host publicationProceedings of 2009 5th International Colloquium on Signal Processing and Its Applications, CSPA 2009
Pages233-237
Number of pages5
DOIs
Publication statusPublished - 2009
Event2009 5th International Colloquium on Signal Processing and Its Applications, CSPA 2009 - Kuala Lumpur
Duration: 2009 Mar 62009 Mar 8

Other

Other2009 5th International Colloquium on Signal Processing and Its Applications, CSPA 2009
CityKuala Lumpur
Period09/3/609/3/8

Fingerprint

Hardware
Communication
Image compression
Decoding
Textures
Throughput
Detectors

Keywords

  • Dynamical parameter allocation
  • H.264/AVC
  • Human visual system (HVS)
  • Region-of-interest (ROI)
  • Skin color

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Signal Processing

Cite this

Wang, M., Zhang, T., Liu, C., & Goto, S. (2009). Region-of-interest based H.264 encoding parameter allocation for low power video communication. In Proceedings of 2009 5th International Colloquium on Signal Processing and Its Applications, CSPA 2009 (pp. 233-237). [5069223] https://doi.org/10.1109/CSPA.2009.5069223

Region-of-interest based H.264 encoding parameter allocation for low power video communication. / Wang, Minghui; Zhang, Tianruo; Liu, Chen; Goto, Satoshi.

Proceedings of 2009 5th International Colloquium on Signal Processing and Its Applications, CSPA 2009. 2009. p. 233-237 5069223.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Wang, M, Zhang, T, Liu, C & Goto, S 2009, Region-of-interest based H.264 encoding parameter allocation for low power video communication. in Proceedings of 2009 5th International Colloquium on Signal Processing and Its Applications, CSPA 2009., 5069223, pp. 233-237, 2009 5th International Colloquium on Signal Processing and Its Applications, CSPA 2009, Kuala Lumpur, 09/3/6. https://doi.org/10.1109/CSPA.2009.5069223
Wang M, Zhang T, Liu C, Goto S. Region-of-interest based H.264 encoding parameter allocation for low power video communication. In Proceedings of 2009 5th International Colloquium on Signal Processing and Its Applications, CSPA 2009. 2009. p. 233-237. 5069223 https://doi.org/10.1109/CSPA.2009.5069223
Wang, Minghui ; Zhang, Tianruo ; Liu, Chen ; Goto, Satoshi. / Region-of-interest based H.264 encoding parameter allocation for low power video communication. Proceedings of 2009 5th International Colloquium on Signal Processing and Its Applications, CSPA 2009. 2009. pp. 233-237
@inproceedings{24d9e6cfbfd94b79a854e62bda9cbc74,
title = "Region-of-interest based H.264 encoding parameter allocation for low power video communication",
abstract = "H.264 is the state-of-the-art in modern video compression standards. Its extremely high compression ratio meets the requirements the video communication between portable devices. Since the power is limited in portable devices, the huge computation of H.264 is a critical problem for hardware design. According to the human visual system (HVS) research, human vision is only able to focus on one area in a frame, which is defined as region-of-interest (ROI). In most cases, human face attracts the most attention of the device user. This phenomenon gives a chance to code all macroblocks unequally. In this work, the ROI is detected with an encoder-oriented fast algorithm, using chrominance and texture contrast features. After the ROI is detected, the encoder will allocated the coding parameters respectively in ROI and non-ROI. As a result, it keeps fine quality in ROI, saves much throughput in non-ROI, and greatly reduces the computation. The ROI detector and the encoder are also designed to be decoding-friendly and hardware-friendly.",
keywords = "Dynamical parameter allocation, H.264/AVC, Human visual system (HVS), Region-of-interest (ROI), Skin color",
author = "Minghui Wang and Tianruo Zhang and Chen Liu and Satoshi Goto",
year = "2009",
doi = "10.1109/CSPA.2009.5069223",
language = "English",
isbn = "9781424441501",
pages = "233--237",
booktitle = "Proceedings of 2009 5th International Colloquium on Signal Processing and Its Applications, CSPA 2009",

}

TY - GEN

T1 - Region-of-interest based H.264 encoding parameter allocation for low power video communication

AU - Wang, Minghui

AU - Zhang, Tianruo

AU - Liu, Chen

AU - Goto, Satoshi

PY - 2009

Y1 - 2009

N2 - H.264 is the state-of-the-art in modern video compression standards. Its extremely high compression ratio meets the requirements the video communication between portable devices. Since the power is limited in portable devices, the huge computation of H.264 is a critical problem for hardware design. According to the human visual system (HVS) research, human vision is only able to focus on one area in a frame, which is defined as region-of-interest (ROI). In most cases, human face attracts the most attention of the device user. This phenomenon gives a chance to code all macroblocks unequally. In this work, the ROI is detected with an encoder-oriented fast algorithm, using chrominance and texture contrast features. After the ROI is detected, the encoder will allocated the coding parameters respectively in ROI and non-ROI. As a result, it keeps fine quality in ROI, saves much throughput in non-ROI, and greatly reduces the computation. The ROI detector and the encoder are also designed to be decoding-friendly and hardware-friendly.

AB - H.264 is the state-of-the-art in modern video compression standards. Its extremely high compression ratio meets the requirements the video communication between portable devices. Since the power is limited in portable devices, the huge computation of H.264 is a critical problem for hardware design. According to the human visual system (HVS) research, human vision is only able to focus on one area in a frame, which is defined as region-of-interest (ROI). In most cases, human face attracts the most attention of the device user. This phenomenon gives a chance to code all macroblocks unequally. In this work, the ROI is detected with an encoder-oriented fast algorithm, using chrominance and texture contrast features. After the ROI is detected, the encoder will allocated the coding parameters respectively in ROI and non-ROI. As a result, it keeps fine quality in ROI, saves much throughput in non-ROI, and greatly reduces the computation. The ROI detector and the encoder are also designed to be decoding-friendly and hardware-friendly.

KW - Dynamical parameter allocation

KW - H.264/AVC

KW - Human visual system (HVS)

KW - Region-of-interest (ROI)

KW - Skin color

UR - http://www.scopus.com/inward/record.url?scp=70349897545&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=70349897545&partnerID=8YFLogxK

U2 - 10.1109/CSPA.2009.5069223

DO - 10.1109/CSPA.2009.5069223

M3 - Conference contribution

SN - 9781424441501

SP - 233

EP - 237

BT - Proceedings of 2009 5th International Colloquium on Signal Processing and Its Applications, CSPA 2009

ER -