Approximate DCT Design for Video Encoding Based on Novel Truncation Scheme

Heming Sun, Zhengxue Cheng, Amir Masoud Gharehbaghi, Shinji Kimura, Masahiro Fujita

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

This paper presents an energy- and area-efficient architecture for approximated discrete cosine transform (DCT). Due to the good compression ability, DCT is widely exploited in signal processing. However, it is computationally intensive especially for large transform sizes. In this paper, we have reduced the computation cost of DCT by truncating a couple of least significant bits (LSB), most significant bits (MSB), and zero columns. First, considering that the contribution of LSBs is weakened because of the final right shift operation, we have eliminated the computation process for some LSBs. For the addition of the remaining LSBs, a parallel carry propagation adder is proposed to reduce the calculation latency. Second, owing to the phenomenon that high-frequency components are quite small in natural scenes, a couple of MSBs are selectively truncated according to their positions. Third, quantization is taken into account for the system-level optimization. The quantized results of all-zero columns are utilized to skip the column transforms afterward. The experimental results show that at most 32% area consumption and 60% power consumption can be reduced compared with the originally accurate DCT, while the compression efficiency loss caused by the DCT approximation is negligible for High Efficiency Video Coding.

Original languageEnglish
JournalIEEE Transactions on Circuits and Systems I: Regular Papers
DOIs
Publication statusAccepted/In press - 2018 Jan 1

Fingerprint

Discrete cosine transforms
Adders
Image coding
Signal processing
Electric power utilization
Costs

Keywords

  • approximate computing
  • DCT
  • HEVC
  • truncation.
  • VVC

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Cite this

Approximate DCT Design for Video Encoding Based on Novel Truncation Scheme. / Sun, Heming; Cheng, Zhengxue; Gharehbaghi, Amir Masoud; Kimura, Shinji; Fujita, Masahiro.

In: IEEE Transactions on Circuits and Systems I: Regular Papers, 01.01.2018.

Research output: Contribution to journalArticle

@article{c4b5e023c97c42378bdef8db634a7c1e,
title = "Approximate DCT Design for Video Encoding Based on Novel Truncation Scheme",
abstract = "This paper presents an energy- and area-efficient architecture for approximated discrete cosine transform (DCT). Due to the good compression ability, DCT is widely exploited in signal processing. However, it is computationally intensive especially for large transform sizes. In this paper, we have reduced the computation cost of DCT by truncating a couple of least significant bits (LSB), most significant bits (MSB), and zero columns. First, considering that the contribution of LSBs is weakened because of the final right shift operation, we have eliminated the computation process for some LSBs. For the addition of the remaining LSBs, a parallel carry propagation adder is proposed to reduce the calculation latency. Second, owing to the phenomenon that high-frequency components are quite small in natural scenes, a couple of MSBs are selectively truncated according to their positions. Third, quantization is taken into account for the system-level optimization. The quantized results of all-zero columns are utilized to skip the column transforms afterward. The experimental results show that at most 32{\%} area consumption and 60{\%} power consumption can be reduced compared with the originally accurate DCT, while the compression efficiency loss caused by the DCT approximation is negligible for High Efficiency Video Coding.",
keywords = "approximate computing, DCT, HEVC, truncation., VVC",
author = "Heming Sun and Zhengxue Cheng and Gharehbaghi, {Amir Masoud} and Shinji Kimura and Masahiro Fujita",
year = "2018",
month = "1",
day = "1",
doi = "10.1109/TCSI.2018.2882474",
language = "English",
journal = "IEEE Transactions on Circuits and Systems II: Express Briefs",
issn = "1549-8328",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Approximate DCT Design for Video Encoding Based on Novel Truncation Scheme

AU - Sun, Heming

AU - Cheng, Zhengxue

AU - Gharehbaghi, Amir Masoud

AU - Kimura, Shinji

AU - Fujita, Masahiro

PY - 2018/1/1

Y1 - 2018/1/1

N2 - This paper presents an energy- and area-efficient architecture for approximated discrete cosine transform (DCT). Due to the good compression ability, DCT is widely exploited in signal processing. However, it is computationally intensive especially for large transform sizes. In this paper, we have reduced the computation cost of DCT by truncating a couple of least significant bits (LSB), most significant bits (MSB), and zero columns. First, considering that the contribution of LSBs is weakened because of the final right shift operation, we have eliminated the computation process for some LSBs. For the addition of the remaining LSBs, a parallel carry propagation adder is proposed to reduce the calculation latency. Second, owing to the phenomenon that high-frequency components are quite small in natural scenes, a couple of MSBs are selectively truncated according to their positions. Third, quantization is taken into account for the system-level optimization. The quantized results of all-zero columns are utilized to skip the column transforms afterward. The experimental results show that at most 32% area consumption and 60% power consumption can be reduced compared with the originally accurate DCT, while the compression efficiency loss caused by the DCT approximation is negligible for High Efficiency Video Coding.

AB - This paper presents an energy- and area-efficient architecture for approximated discrete cosine transform (DCT). Due to the good compression ability, DCT is widely exploited in signal processing. However, it is computationally intensive especially for large transform sizes. In this paper, we have reduced the computation cost of DCT by truncating a couple of least significant bits (LSB), most significant bits (MSB), and zero columns. First, considering that the contribution of LSBs is weakened because of the final right shift operation, we have eliminated the computation process for some LSBs. For the addition of the remaining LSBs, a parallel carry propagation adder is proposed to reduce the calculation latency. Second, owing to the phenomenon that high-frequency components are quite small in natural scenes, a couple of MSBs are selectively truncated according to their positions. Third, quantization is taken into account for the system-level optimization. The quantized results of all-zero columns are utilized to skip the column transforms afterward. The experimental results show that at most 32% area consumption and 60% power consumption can be reduced compared with the originally accurate DCT, while the compression efficiency loss caused by the DCT approximation is negligible for High Efficiency Video Coding.

KW - approximate computing

KW - DCT

KW - HEVC

KW - truncation.

KW - VVC

UR - http://www.scopus.com/inward/record.url?scp=85058131307&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85058131307&partnerID=8YFLogxK

U2 - 10.1109/TCSI.2018.2882474

DO - 10.1109/TCSI.2018.2882474

M3 - Article

AN - SCOPUS:85058131307

JO - IEEE Transactions on Circuits and Systems II: Express Briefs

JF - IEEE Transactions on Circuits and Systems II: Express Briefs

SN - 1549-8328

ER -