Artificial Intelligence Integration in Second Language Pronunciation Training

A Mixed-Methods Study on Learning Outcomes

  • Sarwadi Sarwadi University of Qamaru Huda Badaruddin

Abstract

The increasing use of Artificial Intelligence (AI) in language learning has opened new opportunities for improving second language (L2) pronunciation skills. This study aims to evaluate the effectiveness of integrating artificial intelligence into second language pronunciation training by examining its impact on learners' outcomes through a mixed-methods approach. Quantitative results from 60 participants showed the AI group (M=4.89) significantly outperformed the control group (M=3.71) in post-tests (p=0.001), with an 85% vs 65% improvement rate. Qualitative interviews revealed three key findings: (1) real-time feedback enhanced engagement and self-confidence, (2) notable improvements in intonation and stress patterns, but (3) limitations in recognizing regional accents. The results demonstrate that AI-assisted pronunciation training offers clear advantages over traditional methods in enhancing L2 pronunciation. However, limitations in accent recognition highlight the need for further development of AI speech processing technologies. This study provides empirical support for integrating AI into language education, highlighting its potential and areas that require improvement.

Downloads

Download data is not yet available.

References

Burston, J. (2015). Twenty years of MALL project implementation: A meta-analysis of learning outcomes. ReCALL, 27(1), 4–20. https://doi.org/10.1017/S0958344014000159

Celce-Murcia, M., Brinton, D. M., & Goodwin, J. M. (2010). Teaching pronunciation: A course book and reference guide (2nd ed.). Cambridge University Press.

Chapelle, C. A. (2003). English language learning and technology: Lectures on applied linguistics in the age of information and communication technology. John Benjamins.

Chun, D. M. (2012). Computer-assisted pronunciation teaching. In C. A. Chapelle (Ed.), The encyclopedia of applied linguistics (pp. 1–5). Wiley-Blackwell.

Chun, D. M. (2019). Technology-assisted pronunciation instruction. In M. O’Brien & J. Levis (Eds.), Pronunciation instruction: A research-based approach (pp. 115–134). Routledge.

Chen, Y., Wang, Y., & Zhang, L. (2021). Effects of automatic speech recognition on EFL learners’ pronunciation and motivation. Computer Assisted Language Learning, 34(3), 265–284.

Derwing, T. M., & Munro, M. J. (2005). Second language accent and pronunciation teaching: A research-based approach. TESOL Quarterly, 39(3), 379–397. https://doi.org/10.2307/3588486

Escudero, P., & Wanrooij, K. (2010). The effect of L1 orthography on non-native vowel perception. Language and Speech, 53(3), 343–365.

Godwin-Jones, R. (2018). Using mobile technology to develop language skills and cultural understanding. Language Learning & Technology, 22(3), 3–17. https://doi.org/10.10125/44607

Handley, Z. (2009). Is text-to-speech synthesis ready for use in computer-assisted language learning? Speech Communication, 51(10), 906–919. https://doi.org/10.1016/j.specom.2008.12.004

Huang, Z., Zhang, Y., & Glass, J. (2020). Exploring speaker adaptation for end-to-end speech recognition. arXiv preprint, arXiv:2005.04290.

Krashen, S. D. (1982). Principles and practice in second language acquisition. Pergamon Press.

Kukulska-Hulme, A., & Shield, L. (2008). An overview of mobile assisted language learning: From content delivery to supported collaboration and interaction. ReCALL, 20(3), 271–289. https://doi.org/10.1017/S0958344008000335

Kukulska-Hulme, A. (2020). Intelligent assistants in language learning: Friends or foes? International Journal of Emerging Technologies in Learning (iJET), 15(10), 226–235. https://doi.org/10.3991/ijet.v15i10.xxxx

Lantolf, J. P., & Thorne, S. L. (2006). Sociocultural theory and the genesis of second language development. Oxford University Press.

Levis, J. M. (2005). Changing contexts and shifting paradigms in pronunciation teaching. TESOL Quarterly, 39(3), 369–377. https://doi.org/10.2307/3588485

Levis, J. M. (2007). Computer technology in teaching and researching pronunciation. In M. C. Pennington (Ed.), Phonology in context (pp. 184–202). Palgrave Macmillan.

Li, Q., Sun, Y., & Luo, W. (2023). Real-time feedback in ASR-based pronunciation training: A learner-centered approach. ReCALL, 35(2), 158–174.

Li, Z., Zou, B., & Xie, H. (2021). Learning L2 pronunciation with automatic speech recognition technology: Past, present, and future. Computer Assisted Language Learning, 34(8), 1155–1189. https://doi.org/10.1080/09588221.2019.1629902

McCrocklin, S. (2019). ASR-based dictation: Scaffolding towards independence. Language Learning & Technology, 23(1), 32–47.

Mayer, R. E. (2009). Multimedia learning (2nd ed.). Cambridge University Press.

Shadiev, R., Wang, X., & Huang, Y. M. (2020). Investigating the effect of virtual reality on foreign language learning: A meta-analysis. Interactive Learning Environments, 30(1), 1–16. https://doi.org/10.1080/10494820.2020.1722719

Stockwell, G. (2012). Using mobile phones for vocabulary activities: Examining the effect of the platform. Language Learning & Technology, 16(3), 95–110.

Vygotsky, L. S. (1978). Mind in society: The development of higher psychological processes. Harvard University Press.

Wang, Y., Luo, R., & Wang, L. (2023). Effectiveness of AI-assisted pronunciation learning: Evidence from a longitudinal study. Language Learning & Technology, 27(1), 45–68. https://doi.org/10.10125/44730

Witt, S. M. (2012). Automatic error detection in pronunciation training: Where we are and where we need to go. In Proceedings of the International Symposium on Automatic Detection of Errors in Pronunciation Training (pp. 1–8).

Xu, H., Yunus, M. M., & Hashim, H. (2022). The use of artificial intelligence in second language learning: A systematic literature review. International Journal of Learning, Teaching and Educational Research, 21(1), 132–147.

Yuan, Y., & Chen, W. (2023). Challenges of accented speech recognition in ASR-assisted language learning tools. Frontiers in Psychology, 14, 1210187.

Zhang, R., & Yin, B. (2022). The effectiveness of automatic speech recognition in ESL/EFL pronunciation: A meta-analysis. ReCALL, 34(1), 31–51.
Published
2025-06-30
How to Cite
SARWADI, Sarwadi. Artificial Intelligence Integration in Second Language Pronunciation Training. Pioneer: Journal of Language and Literature, [S.l.], v. 17, n. 1, p. 80-91, june 2025. ISSN 2655-8718. Available at: <https://unars.ac.id/ojs/index.php/pioneer/article/view/6329>. Date accessed: 05 dec. 2025. doi: https://doi.org/10.36841/pioneer.v17i1.6329.
Section
Articles