Find Research Outputs

Search in all content

Filters for Research Output

Search concepts
Selected filters

Speech act annotation for domain specific multilingual expression services

Bourdon, J. & Ishida, T., 2008 Dec 1, Proceedings of the 2nd International Symposium on Universal Communication, ISUC 2008. p. 243-250 8 p. 4724469. (Proceedings of the 2nd International Symposium on Universal Communication, ISUC 2008).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Speech and Language Processing for Generating Content Description Metadata for Broadcast News

Hayashi, Y., Matsunaga, S. & Matsuo, Y., 2003 Jun, In : NTT Technical Review. 1, 3, p. 62-65 4 p.

Research output: Contribution to journalArticle

1 Citation (Scopus)

Speech-Based and Video-Supported Indexing of Multimedia Broadcast News

Hayashi, Y., Ohtsuki, K., Bessho, K., Mizuno, O., Matsuo, Y., Matsunaga, S., Hayashi, M., Hasegawa, T. & Ikeda, N., 2003, In : SIGIR Forum (ACM Special Interest Group on Information Retrieval). SPEC. ISS., p. 441-442 2 p.

Research output: Contribution to journalConference article

21 Citations (Scopus)

Speech Codec for Multimedia Services and Speech Coding Software: DualSpeech

Kaneko, T. & Nishino, Y., 1998 Dec 1, In : NTT R and D. 47, 5, p. 541-548 8 p.

Research output: Contribution to journalArticle

1 Citation (Scopus)

SPEECH CODER USING PHASE EQUALIZATION AND VECTOR QUANTIZATION.

Moriya, T. & Honda, M., 1986 Dec 1, In : ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. p. 1701-1704 4 p.

Research output: Contribution to journalConference article

16 Citations (Scopus)

Speech coding based on a multi-layer neural network

Morishima, S., Harashima, H. & Katayama, Y., 1990 Dec 1, In : Conference Record - International Conference on Communications. 2, p. 429-433 5 p.

Research output: Contribution to journalConference article

3 Citations (Scopus)
7 Citations (Scopus)

Speech Communication: Editorial

Hirose, K., Hirst, D. & Sagisaka, Y., 2005 Jul 1, In : Speech Communication. 46, 3-4, p. 217-219 3 p.

Research output: Contribution to journalArticle

1 Citation (Scopus)

Speech Communication: Editorial

De Mori, R., Sagisaka, Y. & Alwan, A., 2003 May 1, In : Speech Communication. 40, 3, p. 259-260 2 p.

Research output: Contribution to journalEditorial

Speech compensation to time-scale modified auditory feedback

Ogane, R. & Honda, M., 2011, 9th International Seminar on Speech Production 2011, ISSP 2011. 9th International Seminar on Speech Production 2011, p. 321-328 8 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

SPEECH CONVERSATION SYSTEM OF THE MUSICIAN ROBOT.

Kobayashi, T., Komori, Y., Hashimoto, N., Iwata, K., Fukazawa, Y. & Shirai, K., 1985 Dec 1, p. 483-488. 6 p.

Research output: Contribution to conferencePaper

4 Citations (Scopus)

Speech dialogue management system for human interface employing visual anthropomorphous agent

Hiramoto, Y., Dohi, H. & Ishizuka, M., 1994, Robot and Human Communication - Proceedings of the IEEE International Workshop. Piscataway, NJ, United States: IEEE, p. 277-282 6 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Speech emotional features measured by power-law distribution based on electroglottography

Chen, L., Mao, X., Xue, Y. & Ishizuka, M., 2012, BIOSIGNALS 2012 - Proceedings of the International Conference on Bio-Inspired Systems and Signal Processing. p. 131-136 6 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Speech enhancement and recognition using multi-task learning of long short-term memory recurrent neural networks

Chen, Z., Watanabe, S., Erdogan, H. & Hershey, J. R., 2015, In : Unknown Journal. 2015-January, p. 3274-3278 5 p.

Research output: Contribution to journalArticle

61 Citations (Scopus)

Speech Enhancement Based on Bayesian Low-Rank and Sparse Decomposition of Multichannel Magnitude Spectrograms

Bando, Y., Itoyama, K., Konyo, M., Tadokoro, S., Nakadai, K., Yoshii, K., Kawahara, T. & Okuno, H. G., 2018 Feb 1, In : IEEE/ACM Transactions on Audio Speech and Language Processing. 26, 2, p. 215-230 16 p.

Research output: Contribution to journalArticle

10 Citations (Scopus)
1 Citation (Scopus)
3 Citations (Scopus)

Speech enhancement using end-to-end speech recognition objectives

Subramanian, A. S., Wang, X., Baskar, M. K., Watanabe, S., Taniguchi, T., Tran, D. & Fujita, Y., 2019 Oct, 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2019. Institute of Electrical and Electronics Engineers Inc., p. 234-238 5 p. 8937250. (IEEE Workshop on Applications of Signal Processing to Audio and Acoustics; vol. 2019-October).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Speech enhancement using square microphone array for mobile devices

Takada, S., Ogawa, T., Akagiri, K. & Kobayashi, T., 2008 Sep 16, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP. p. 313-316 4 p. 4517609. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR

Weninger, F., Erdogan, H., Watanabe, S., Vincent, E., Le Roux, J., Hershey, J. R. & Schuller, B., 2015, Latent Variable Analysis and Signal Separation - 12th International Conference, LVA/ICA 2015, Proceedings. Koldovský, Z., Vincent, E., Yeredor, A. & Tichavský, P. (eds.). Springer Verlag, p. 91-99 9 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 9237).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

223 Citations (Scopus)

SPEECH I/O SYSTEM REALIZING FLEXIBLE CONVERSATION FOR ROBOT - THE CONVERSATIONAL SYSTEM OF WABOT-2.

Shirai, K., Kobayashi, T., Komori, Y., Hashimoto, N., Iwata, K., Fukazawa, Y. & Yazawa, J., 1985 Dec 1, In : Waseda Daigaku Rikogaku Kenkyusho Hokoku/Bulletin of Science and Engineering Research Laboratory, Waseda University. 112, p. 53-79 27 p.

Research output: Contribution to journalArticle

Speech planning of an anthropomorphic talking robot for consonant sounds production

Nishikawa, K., Imai, A., Ogawara, T., Takanobu, H., Mochida, T. & Takanishi, A., 2002 Jan 1, In : Proceedings - IEEE International Conference on Robotics and Automation. 2, p. 1830-1835 6 p.

Research output: Contribution to journalConference article

10 Citations (Scopus)

Speech Processing for Digital Home Assistants: Combining signal processing with deep-learning techniques

Haeb-Umbach, R., Watanabe, S., Nakatani, T., Bacchiani, M., Hoffmeister, B., Seltzer, M. L., Zen, H. & Souden, M., 2019 Nov, In : IEEE Signal Processing Magazine. 36, 6, p. 111-124 14 p., 8887564.

Research output: Contribution to journalReview article

9 Citations (Scopus)
19 Citations (Scopus)

Speech production model based on articulatory movements

Honda, M. & Kaburagi, T., 1995 Jan 1, In : NTT R and D. 44, 1, p. 87-92 6 p.

Research output: Contribution to journalArticle

Speech production of an advanced talking robot based on human acoustic theory

Nishikawa, K., Takanobu, H., Mochida, T., Honda, M. & Takanishi, A., 2004 Jul 5, In : Proceedings - IEEE International Conference on Robotics and Automation. 2004, 4, p. 3213-3219 7 p.

Research output: Contribution to journalConference article

7 Citations (Scopus)

Speech-Quality Assessment Methods for Speech-Coding Systems

Kitawaki, N., Honda, M. & Itoh, K., 1984 Oct, In : IEEE Communications Magazine. 22, 10, p. 26-33 8 p.

Research output: Contribution to journalArticle

37 Citations (Scopus)

Speech quality assessment of CS-ACELP

Hayashi, S., Kataoka, A., Moriya, T. & Kaneko, T., 1996 Jul 1, In : NTT Review. 8, 4, p. 36-41 6 p.

Research output: Contribution to journalArticle

Speech recognition based on acoustically derived segment units

Fukada, T., Bacchiani, M., Paliwal, K. K. & Sagisaka, Y., 1996 Dec 1, p. 1077-1080. 4 p.

Research output: Contribution to conferencePaper

8 Citations (Scopus)

Speech recognition based on student's t-distribution derived from total Bayesian framework

Watanabe, S. & Nakamura, A., 2006 Jan 1, In : IEICE Transactions on Information and Systems. E89-D, 3, p. 970-980 11 p.

Research output: Contribution to journalArticle

4 Citations (Scopus)

Speech recognition chip for monosyllables

Nakamura, K., Zhu, Q., Maruoka, S., Horiyama, T., Kimura, S. & Watanabe, K., 2001 Jan 1, Proceedings of the ASP-DAC 2001: Asia and South Pacific Design Automation Conference 2001. Institute of Electrical and Electronics Engineers Inc., p. 396-399 4 p. 913339. (Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC; vol. 2001-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Speech recognition for a humanoid with motor noise utilizing missing feature theory

Nishimura, Y., Ishizuka, M., Nakadai, K., Nakano, M. & Tsujino, H., 2006, Proceedings of the 2006 6th IEEE-RAS International Conference on Humanoid Robots, HUMANOIDS. p. 26-33 8 p. 4115576

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Citations (Scopus)

Speech recognition in living rooms: Integrated speech enhancement and recognition system based on spatial, spectral and temporal modeling of sounds

Delcroix, M., Kinoshita, K., Nakatani, T., Araki, S., Ogawa, A., Hori, T., Watanabe, S., Fujimoto, M., Yoshioka, T., Oba, T., Kubo, Y., Souden, M., Hahm, S. J. & Nakamura, A., 2013 May 3, In : Computer Speech and Language. 27, 3, p. 851-873 23 p., 547.

Research output: Contribution to journalArticle

19 Citations (Scopus)

Speech recognition in nonstationary noise based on parallel HMMs and spectral subtraction

Mine, R., Kobayashi, T. & Shirai, K., 1996 Dec, In : Systems and Computers in Japan. 27, 14, p. 37-44 8 p.

Research output: Contribution to journalArticle

2 Citations (Scopus)

Speech recognition in the blind condition based on multiple directivity patterns using a microphone array

Sekiya, T. & Kobayashi, T., 2005 Jan 1, 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing. Institute of Electrical and Electronics Engineers Inc., p. I373-I376 1415128. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. I).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Speech recognition of a named entity

Tomita, T., Okimoto, Y., Yamamoto, H. & Sagisaka, Y., 2005 Jan 1, 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing. Institute of Electrical and Electronics Engineers Inc., p. I1057-I1060 1415299. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. I).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Speech recognition of double talk using SAFIA-based audio segregation

Sekiya, T., Ogawa, T. & Kobayashi, T., 2003, EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology. International Speech Communication Association, p. 1285-1288 4 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Speech recognition of foreign out-of-vocabulary words using a hierarchical language model

Yamamoto, H., Kikui, G., Nakamura, S. & Sagisaka, Y., 2006 Jan 1, INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP. International Speech Communication Association, p. 1870-1873 4 p. (INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP; vol. 4).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Speech recognition technology combined with three dimensional lip movement

Komiya, K., Ishikawa, R. & Momose, K., 2001 Jan 1, In : Proceedings of SPIE - The International Society for Optical Engineering. 4298, p. 95-102 8 p.

Research output: Contribution to journalConference article

Speech robot mimicking human articulatory motion

Fukui, K., Kusano, T., Mukaeda, Y., Suzuki, Y., Takanishi, A. & Honda, M., 2010 Dec 1, p. 1021-1024. 4 p.

Research output: Contribution to conferencePaper

4 Citations (Scopus)

Speech segment network approach for optimization of synthesis unit set

Iwahashi, N. & Sagisaka, Y., 1995 Oct, In : Computer Speech and Language. 9, 4, p. 335-352 18 p.

Research output: Contribution to journalArticle

2 Citations (Scopus)
25 Citations (Scopus)

Speech shift: Direct speech-input-mode switching through intentional control of voice pitch

Goto, M., Omoto, Y., Itou, K. & Kobayashi, T., 2003 Jan 1, p. 1201-1204. 4 p.

Research output: Contribution to conferencePaper

2 Citations (Scopus)
42 Citations (Scopus)

Speech spectrum transformation by speaker interpolation

Ituahashi, N. & Sagisaka, Y., 1994, In : ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 1, p. I461-I464 389256.

Research output: Contribution to journalConference article

14 Citations (Scopus)

Speech spotter: On-demand speech recognition in human-human conversation on the telephone or in face-to-face situations

Goto, M., Kitayama, K., Itou, K. & Kobayashi, T., 2004 Jan 1, p. 1533-1536. 4 p.

Research output: Contribution to conferencePaper

8 Citations (Scopus)

Speech starter: Noise-robust endpoint detection by using filled pauses

Kitayama, K., Goto, M., Itou, K. & Kobayashi, T., 2003, EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology. International Speech Communication Association, p. 1237-1240 4 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Citations (Scopus)

Speech synthesis by mimicking articulatory movements

Honda, M., Kaburagi, T. & Okadome, T., 1999 Dec 1, In : Proceedings of the IEEE International Conference on Systems, Man and Cybernetics. 2, p. II-463 - II-468

Research output: Contribution to journalConference article

4 Citations (Scopus)
71 Citations (Scopus)