Design of an MPEG-4 general audio coder for improving speech quality

T. Moriya, A. Jin, N. Iwakami, Tatsuya Mori

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

This paper proposes a design for an ISO/IEC MPEG-4 general audio encoder to improve the speech quality at low bit rates. The main contributions to the improvement are using i) a higher sampling rate to get higher time resolution for a given frame length and ii) adaptive preprocessing to reduce the bandwidth. Listening tests at 8 and 16 kbit/s showed that compared with a conventional audio coder and a speech-specific coder, the proposed coder provided better speech quality than the conventional audio coder (MP3) while keeping the quality for music. For speech signals, however, the speech-specific coder (MPEG-4 CELP) produced significantly better quality than the audio coders. The proposed design will be especially useful for low-bit-rate audio-visual delivery applications which may include both speech and music signals.

Original languageEnglish
Title of host publication2000 IEEE Workshop on Speech Coding - Proceedings: Meeting the Challenges of the New Millennium
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages139-141
Number of pages3
ISBN (Print)0780364163, 9780780364165
DOIs
Publication statusPublished - 2000
Externally publishedYes
EventIEEE Workshop on Speech Coding - Delavan, United States
Duration: 2000 Sep 172000 Sep 20

Other

OtherIEEE Workshop on Speech Coding
CountryUnited States
CityDelavan
Period00/9/1700/9/20

Fingerprint

Sampling
Bandwidth

Keywords

  • Bandwidth
  • Bit rate
  • Digital audio players
  • IEC
  • ISO
  • MPEG 4 Standard
  • Sampling methods
  • Signal design
  • Speech
  • Testing

ASJC Scopus subject areas

  • Signal Processing
  • Software

Cite this

Moriya, T., Jin, A., Iwakami, N., & Mori, T. (2000). Design of an MPEG-4 general audio coder for improving speech quality. In 2000 IEEE Workshop on Speech Coding - Proceedings: Meeting the Challenges of the New Millennium (pp. 139-141). [878429] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/SCFT.2000.878429

Design of an MPEG-4 general audio coder for improving speech quality. / Moriya, T.; Jin, A.; Iwakami, N.; Mori, Tatsuya.

2000 IEEE Workshop on Speech Coding - Proceedings: Meeting the Challenges of the New Millennium. Institute of Electrical and Electronics Engineers Inc., 2000. p. 139-141 878429.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Moriya, T, Jin, A, Iwakami, N & Mori, T 2000, Design of an MPEG-4 general audio coder for improving speech quality. in 2000 IEEE Workshop on Speech Coding - Proceedings: Meeting the Challenges of the New Millennium., 878429, Institute of Electrical and Electronics Engineers Inc., pp. 139-141, IEEE Workshop on Speech Coding, Delavan, United States, 00/9/17. https://doi.org/10.1109/SCFT.2000.878429
Moriya T, Jin A, Iwakami N, Mori T. Design of an MPEG-4 general audio coder for improving speech quality. In 2000 IEEE Workshop on Speech Coding - Proceedings: Meeting the Challenges of the New Millennium. Institute of Electrical and Electronics Engineers Inc. 2000. p. 139-141. 878429 https://doi.org/10.1109/SCFT.2000.878429
Moriya, T. ; Jin, A. ; Iwakami, N. ; Mori, Tatsuya. / Design of an MPEG-4 general audio coder for improving speech quality. 2000 IEEE Workshop on Speech Coding - Proceedings: Meeting the Challenges of the New Millennium. Institute of Electrical and Electronics Engineers Inc., 2000. pp. 139-141
@inproceedings{fae0e1e27c1944f0a710de37da51330a,
title = "Design of an MPEG-4 general audio coder for improving speech quality",
abstract = "This paper proposes a design for an ISO/IEC MPEG-4 general audio encoder to improve the speech quality at low bit rates. The main contributions to the improvement are using i) a higher sampling rate to get higher time resolution for a given frame length and ii) adaptive preprocessing to reduce the bandwidth. Listening tests at 8 and 16 kbit/s showed that compared with a conventional audio coder and a speech-specific coder, the proposed coder provided better speech quality than the conventional audio coder (MP3) while keeping the quality for music. For speech signals, however, the speech-specific coder (MPEG-4 CELP) produced significantly better quality than the audio coders. The proposed design will be especially useful for low-bit-rate audio-visual delivery applications which may include both speech and music signals.",
keywords = "Bandwidth, Bit rate, Digital audio players, IEC, ISO, MPEG 4 Standard, Sampling methods, Signal design, Speech, Testing",
author = "T. Moriya and A. Jin and N. Iwakami and Tatsuya Mori",
year = "2000",
doi = "10.1109/SCFT.2000.878429",
language = "English",
isbn = "0780364163",
pages = "139--141",
booktitle = "2000 IEEE Workshop on Speech Coding - Proceedings: Meeting the Challenges of the New Millennium",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - Design of an MPEG-4 general audio coder for improving speech quality

AU - Moriya, T.

AU - Jin, A.

AU - Iwakami, N.

AU - Mori, Tatsuya

PY - 2000

Y1 - 2000

N2 - This paper proposes a design for an ISO/IEC MPEG-4 general audio encoder to improve the speech quality at low bit rates. The main contributions to the improvement are using i) a higher sampling rate to get higher time resolution for a given frame length and ii) adaptive preprocessing to reduce the bandwidth. Listening tests at 8 and 16 kbit/s showed that compared with a conventional audio coder and a speech-specific coder, the proposed coder provided better speech quality than the conventional audio coder (MP3) while keeping the quality for music. For speech signals, however, the speech-specific coder (MPEG-4 CELP) produced significantly better quality than the audio coders. The proposed design will be especially useful for low-bit-rate audio-visual delivery applications which may include both speech and music signals.

AB - This paper proposes a design for an ISO/IEC MPEG-4 general audio encoder to improve the speech quality at low bit rates. The main contributions to the improvement are using i) a higher sampling rate to get higher time resolution for a given frame length and ii) adaptive preprocessing to reduce the bandwidth. Listening tests at 8 and 16 kbit/s showed that compared with a conventional audio coder and a speech-specific coder, the proposed coder provided better speech quality than the conventional audio coder (MP3) while keeping the quality for music. For speech signals, however, the speech-specific coder (MPEG-4 CELP) produced significantly better quality than the audio coders. The proposed design will be especially useful for low-bit-rate audio-visual delivery applications which may include both speech and music signals.

KW - Bandwidth

KW - Bit rate

KW - Digital audio players

KW - IEC

KW - ISO

KW - MPEG 4 Standard

KW - Sampling methods

KW - Signal design

KW - Speech

KW - Testing

UR - http://www.scopus.com/inward/record.url?scp=84947325358&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84947325358&partnerID=8YFLogxK

U2 - 10.1109/SCFT.2000.878429

DO - 10.1109/SCFT.2000.878429

M3 - Conference contribution

AN - SCOPUS:84947325358

SN - 0780364163

SN - 9780780364165

SP - 139

EP - 141

BT - 2000 IEEE Workshop on Speech Coding - Proceedings: Meeting the Challenges of the New Millennium

PB - Institute of Electrical and Electronics Engineers Inc.

ER -