Speech-to-text applications’ accuracy in English language learners’ speech transcription

July 28, 2024, 1:34 a.m.
Dec. 16, 2024, 7:21 p.m.
Dec. 16, 2024, 7:21 p.m.
[['https://scholarspace.manoa.hawaii.edu/bitstreams/246910c3-ff36-469c-b0a8-2f5d0507d989/download', '28_01_10125-73555.pdf']]
[['https://scholarspace.manoa.hawaii.edu/bitstreams/5ab06867-ded6-478f-836e-62d68ac8dbbb/download', 'full_text']]
Volume 28 Number 1, 2024
Hirai, Akiyo Kovalyova, Angelina
2024-04-04T20:37:39Z
2024-04-04T20:37:39Z
2024
2024-04-08
Speech-to-text applications have great potential for helping students with English language comprehension and pronunciation practice. This study explores the functionality of five speech-to-text (STT) applications (Google Docs voice typing tool, Apple Dictation, Windows 10 Dictation, Dictation.io [a website service], and “Transcribe” [an app on iOS]) to measure their speech transcription accuracy of American English. The experiment involved 30 nonnative speakers, who were asked to perform four speaking tasks and whose speeches were recorded and transcribed with these applications. The transcriptions produced by the applications were then compared with human-made transcriptions to evaluate the accuracy rate of each application’s speech transcription ability. The results revealed that the accuracy rate of speech transcriptions depends not only on the applications’ automatic speech recognition ability but also on the types of speech produced, as well as each speaker’s L1 influence on L2 (English). The study also offers examples of Japanese speakers’ pronunciation errors attained through STT transcription, demonstrating great pedagogical potential for pronunciation practice and assessment in English classrooms.
21
21
Hirai, A., & Kovalyova, A. (2024). Speech-to-text applications’ accuracy in English language learners’ speech transcription. Language Learning & Technology, 28(1), 1–21. https://hdl.handle.net/10125/73555
1094-3501
https://hdl.handle.net/10125/73555
eng
1
Language Learning & Technology
University of Hawaii National Foreign Language Resource Center Center for Language & Technology
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
https://creativecommons.org/licenses/by-nc-nd/4.0/
/item/10125-73555/
1
Automatic Speech Recognition, Speech-to-text Applications, Pronunciation, Loanwords
Speech-to-text applications’ accuracy in English language learners’ speech transcription
Article Text
28