Non-native English speakers' speech correction, based on domain focused document

Kacper Pawel Radzikowski, Le Wang, Osamu Yoshie

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

With the increase in exchange programs, many international students worldwide can face communication problems. Dur-ing lectures, usually English language is used for the commu-nication between students and teachers. However both sides, not necessarily being native speakers of English, may misun-derstand each other. In this paper we propose a method for correction of non-native English speakers' speech, based on the domain focused electronic document. The method relies on the results of speech recognition (SR) software, and uses them altogether with the document. Our approach consists of three steps. Firstly, document analysis in the preprocess-ing phase. Secondly, finding the document part correspond-ing to sentence from SR software, realised using the Hidden Markov Model (HMM) based method. Finally, the correc-tion by calculating the score for each of candidate sentences, based on the result of SR software. The probability score combines keywords comparison, BM25F method and HMM based method scores. Highest score candidate is chosen as replacement.

Original languageEnglish
Title of host publication18th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2016 - Proceedings
PublisherAssociation for Computing Machinery
Pages276-281
Number of pages6
VolumePart F126325
ISBN (Electronic)9781450348072
DOIs
Publication statusPublished - 2016 Nov 28
Event18th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2016 - Singapore, Singapore
Duration: 2016 Nov 282016 Nov 30

Other

Other18th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2016
CountrySingapore
CitySingapore
Period16/11/2816/11/30

Fingerprint

Speech recognition
Hidden Markov models
Students
Communication

Keywords

  • BM25F
  • Domain document
  • HMM
  • Sentence cor-rection
  • Speech recognition correction

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
  • Software

Cite this

Radzikowski, K. P., Wang, L., & Yoshie, O. (2016). Non-native English speakers' speech correction, based on domain focused document. In 18th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2016 - Proceedings (Vol. Part F126325, pp. 276-281). Association for Computing Machinery. https://doi.org/10.1145/3011141.3011169

Non-native English speakers' speech correction, based on domain focused document. / Radzikowski, Kacper Pawel; Wang, Le; Yoshie, Osamu.

18th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2016 - Proceedings. Vol. Part F126325 Association for Computing Machinery, 2016. p. 276-281.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Radzikowski, KP, Wang, L & Yoshie, O 2016, Non-native English speakers' speech correction, based on domain focused document. in 18th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2016 - Proceedings. vol. Part F126325, Association for Computing Machinery, pp. 276-281, 18th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2016, Singapore, Singapore, 16/11/28. https://doi.org/10.1145/3011141.3011169
Radzikowski KP, Wang L, Yoshie O. Non-native English speakers' speech correction, based on domain focused document. In 18th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2016 - Proceedings. Vol. Part F126325. Association for Computing Machinery. 2016. p. 276-281 https://doi.org/10.1145/3011141.3011169
Radzikowski, Kacper Pawel ; Wang, Le ; Yoshie, Osamu. / Non-native English speakers' speech correction, based on domain focused document. 18th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2016 - Proceedings. Vol. Part F126325 Association for Computing Machinery, 2016. pp. 276-281
@inproceedings{50d24942e5064c10a14d457a4bf3d73e,
title = "Non-native English speakers' speech correction, based on domain focused document",
abstract = "With the increase in exchange programs, many international students worldwide can face communication problems. Dur-ing lectures, usually English language is used for the commu-nication between students and teachers. However both sides, not necessarily being native speakers of English, may misun-derstand each other. In this paper we propose a method for correction of non-native English speakers' speech, based on the domain focused electronic document. The method relies on the results of speech recognition (SR) software, and uses them altogether with the document. Our approach consists of three steps. Firstly, document analysis in the preprocess-ing phase. Secondly, finding the document part correspond-ing to sentence from SR software, realised using the Hidden Markov Model (HMM) based method. Finally, the correc-tion by calculating the score for each of candidate sentences, based on the result of SR software. The probability score combines keywords comparison, BM25F method and HMM based method scores. Highest score candidate is chosen as replacement.",
keywords = "BM25F, Domain document, HMM, Sentence cor-rection, Speech recognition correction",
author = "Radzikowski, {Kacper Pawel} and Le Wang and Osamu Yoshie",
year = "2016",
month = "11",
day = "28",
doi = "10.1145/3011141.3011169",
language = "English",
volume = "Part F126325",
pages = "276--281",
booktitle = "18th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2016 - Proceedings",
publisher = "Association for Computing Machinery",

}

TY - GEN

T1 - Non-native English speakers' speech correction, based on domain focused document

AU - Radzikowski, Kacper Pawel

AU - Wang, Le

AU - Yoshie, Osamu

PY - 2016/11/28

Y1 - 2016/11/28

N2 - With the increase in exchange programs, many international students worldwide can face communication problems. Dur-ing lectures, usually English language is used for the commu-nication between students and teachers. However both sides, not necessarily being native speakers of English, may misun-derstand each other. In this paper we propose a method for correction of non-native English speakers' speech, based on the domain focused electronic document. The method relies on the results of speech recognition (SR) software, and uses them altogether with the document. Our approach consists of three steps. Firstly, document analysis in the preprocess-ing phase. Secondly, finding the document part correspond-ing to sentence from SR software, realised using the Hidden Markov Model (HMM) based method. Finally, the correc-tion by calculating the score for each of candidate sentences, based on the result of SR software. The probability score combines keywords comparison, BM25F method and HMM based method scores. Highest score candidate is chosen as replacement.

AB - With the increase in exchange programs, many international students worldwide can face communication problems. Dur-ing lectures, usually English language is used for the commu-nication between students and teachers. However both sides, not necessarily being native speakers of English, may misun-derstand each other. In this paper we propose a method for correction of non-native English speakers' speech, based on the domain focused electronic document. The method relies on the results of speech recognition (SR) software, and uses them altogether with the document. Our approach consists of three steps. Firstly, document analysis in the preprocess-ing phase. Secondly, finding the document part correspond-ing to sentence from SR software, realised using the Hidden Markov Model (HMM) based method. Finally, the correc-tion by calculating the score for each of candidate sentences, based on the result of SR software. The probability score combines keywords comparison, BM25F method and HMM based method scores. Highest score candidate is chosen as replacement.

KW - BM25F

KW - Domain document

KW - HMM

KW - Sentence cor-rection

KW - Speech recognition correction

UR - http://www.scopus.com/inward/record.url?scp=85014968612&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85014968612&partnerID=8YFLogxK

U2 - 10.1145/3011141.3011169

DO - 10.1145/3011141.3011169

M3 - Conference contribution

AN - SCOPUS:85014968612

VL - Part F126325

SP - 276

EP - 281

BT - 18th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2016 - Proceedings

PB - Association for Computing Machinery

ER -