Abstract
We describe the implementation of a cellular-phone-based speech translation system built without a telephone-quality speech database or special CT hardware. The purpose is to quickly build a prototype service system that can be used for data collection with real users. To train the acoustic model for the speech recognition system, available high-quality databases were made usable in two ways: (1) by appropriate downsampling and filtering, and (2) by piping, similar to the NTIMIT and CTIMIT paradigms. An evaluation of acoustic models with filtered, piped, and real cellular-phone data is given. Recognition rates are at the same level as for wideband speech.
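The downsampling-and-filtering approach mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state sample rates, passband edges, or a filter design, so everything here is an assumption made for illustration: a 16 kHz source, an 8 kHz target, a 300 to 3400 Hz passband, a Hamming-windowed FIR filter, and the hypothetical function names `design_bandpass`, `fir_filter`, and `to_telephone_band`. It is not the authors' procedure.

```python
import math


def design_bandpass(num_taps, low_hz, high_hz, fs):
    """Return Hamming-windowed sinc band-pass FIR taps (num_taps should be odd)."""
    mid = (num_taps - 1) / 2
    taps = []
    for n in range(num_taps):
        k = n - mid
        if k == 0:
            # Limit of the ideal band-pass impulse response at k = 0.
            ideal = 2.0 * (high_hz - low_hz) / fs
        else:
            ideal = (math.sin(2 * math.pi * high_hz * k / fs)
                     - math.sin(2 * math.pi * low_hz * k / fs)) / (math.pi * k)
        window = 0.54 - 0.46 * math.cos(2 * math.pi * n / (num_taps - 1))
        taps.append(ideal * window)
    return taps


def fir_filter(samples, taps):
    """Direct-form FIR filtering; samples before the start are treated as zero."""
    out = []
    for i in range(len(samples)):
        acc = 0.0
        for j, tap in enumerate(taps):
            if i - j >= 0:
                acc += tap * samples[i - j]
        out.append(acc)
    return out


def to_telephone_band(samples, fs_in=16000, fs_out=8000,
                      low_hz=300.0, high_hz=3400.0, num_taps=101):
    """Band-limit wideband samples, then keep every (fs_in // fs_out)-th sample.

    All default values are illustrative assumptions, not taken from the paper.
    """
    if fs_in % fs_out != 0:
        raise ValueError("fs_in must be an integer multiple of fs_out")
    factor = fs_in // fs_out
    taps = design_bandpass(num_taps, low_hz, high_hz, fs_in)
    # The upper band edge lies below fs_out / 2, so the band-pass filter
    # also serves as the anti-aliasing filter before decimation.
    filtered = fir_filter(samples, taps)
    return filtered[::factor]


if __name__ == "__main__":
    # One 100 ms test tone at 1 kHz, sampled at the assumed 16 kHz rate.
    tone = [math.sin(2 * math.pi * 1000 * n / 16000) for n in range(1600)]
    narrowband = to_telephone_band(tone)
    print(len(tone), "->", len(narrowband), "samples")
```

Filtering before decimation matters because any energy above half the target rate would otherwise fold back into the speech band. The piping approach, by contrast, passes recordings through a real telephone channel and so cannot be reproduced in a few lines of code.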
Original language | English |
---|---|
Title of host publication | 6th International Conference on Spoken Language Processing, ICSLP 2000 |
Publisher | International Speech Communication Association |
ISBN (Electronic) | 7801501144, 9787801501141 |
Publication status | Published - 2000 |
Externally published | Yes |
Event | 6th International Conference on Spoken Language Processing, ICSLP 2000 - Beijing, China; Duration: 2000 Oct 16 → 2000 Oct 20 |
Other
Other | 6th International Conference on Spoken Language Processing, ICSLP 2000 |
---|---|
Country | China |
City | Beijing |
Period | 00/10/16 → 00/10/20 |
ASJC Scopus subject areas
- Linguistics and Language
- Language and Linguistics