Interactive web-based text-to-speech and speech recognition media for enhancing Arabic listening proficiency
DOI:
https://doi.org/10.30603/al.v11i1.7257Keywords:
Arabic listening;, interactive web-based;, text-to-speech;, speech recognition mediaAbstract
Background: Arabic listening lessons often rely on passive audio repetition, leaving students with little opportunity to check their understanding or receive immediate feedback. This condition slows their progress, especially in recognizing key phonemes and responding accurately.
Aims: This study aims to develop and validate a web-based interactive listening tool using text-to-speech (TTS) and speech-recognition (ASR) to support students’ Arabic listening proficiency.
Methods: The research followed the Dick, Carey & Carey model through the stages of analysis, design, development, and expert validation. Data were collected through needs analysis, classroom observations, and interviews with teachers to identify learners’ challenges in Arabic listening. After the prototype was created, expert judgment sheets were used to gather validation data from design and content specialists. The responses were then analyzed using percentage-based validity scoring to determine the feasibility of the media. The prototype integrated web-based TTS and ASR modules, specifying platform architecture, speech engines, supported languages, and technical parameters to ensure replicability.
Results: The results show that the media supports active listening practice and helps address common learner difficulties, particularly in phoneme recognition and comprehension. TTS provided clear and natural audio, while ASR offered direct corrective feedback. Expert assessments indicated high feasibility, with a design validity score of 92% and content validity of 87%.
Implications: The findings indicate that TTS–ASR integration can ease teachers’ correction workload and help students practice more independently, offering practical potential for broader classroom use.
Downloads
References
Alandejani, J. A., & Sayed, G. (2024). The implementation of high-impact practices and communication competency using the Arabic language. Cogent Education, 11(1), 2401253. https://doi.org/10.1080/2331186X.2024.2401253
Aldhafiri, M. D. (2020). The effectiveness of using interactive white boards in improving the Arabic listening skills of undergraduates majoring in Arabic language at Kuwaiti universities. Education and Information Technologies, 25(5), 3577–3591. https://doi.org/10.1007/s10639-020-10107-5
Al-Issa, A. S. M. (2020). The language planning situation in the Sultanate of Oman. Current Issues in Language Planning, 21(4), 347–414. https://doi.org/10.1080/14664208.2020.1764729
Alsuhaibani, Y., Mahdi, H. S., Al Khateeb, A., Al Fadda, H. A., & Alkadi, H. (2024). Web-based pronunciation training and learning consonant clusters among EFL learners. Acta Psychologica, 249, 104459. https://doi.org/10.1016/j.actpsy.2024.104459
Ardasheva, Y., Wang, Z., Adesope, O. O., & Valentine, J. C. (2017). Exploring effectiveness and moderators of language learning strategy instruction on second language and self-regulated learning outcomes. Review of Educational Research, 87(3), 544–582. https://doi.org/10.3102/0034654316689135
Asadi, I. A., Kawar, K., & Tarabeh, G. (2025). Development of verb and noun word patterns in Arabic: A comparison between typically developing children and those with reading difficulties. Journal of Speech, Language, and Hearing Research, 68(8), 3976–3988. https://doi.org/10.1044/2025_JSLHR-24-00673
Bozorgian, H., & Shamsi, E. (2025). A review of research on metacognitive instruction for listening development. International Journal of Listening, 39(1), 1–16. https://doi.org/10.1080/10904018.2023.2197008
Chalghoumi, H., Al-Thani, D., Hassan, A., Hammad, S., & Othman, A. (2022). Research on older persons’ access and use of technology in the Arab region: Critical overview and future directions. Applied Sciences, 12(14), 7258. https://doi.org/10.3390/app12147258
Chemnad, K., & Othman, A. (2023). Advancements in Arabic text-to-speech systems: A 22-year literature review. IEEE Access, 11, 30929–30954. https://doi.org/10.1109/ACCESS.2023.3260844
Chen, C.-H. (2020). Impacts of augmented reality and a digital game on students’ science learning with reflection prompts in multimedia learning. Educational Technology Research and Development, 68(6), 3057–3076. https://doi.org/10.1007/s11423-020-09834-w
Dhonburi Rajabhat University, Thailand, Prasongngern, P., Soontornwipast, K., & Ed.D., Language Institute, Thammasart University, Thailand. (2023). Effects of listening strategy instruction incorporating intensive and extensive listening on listening skills and metacognitive awareness. International Journal of Instruction, 16(4), 155–172. https://doi.org/10.29333/iji.2023.16410a
Ebrahimzadeh, Y., & Ebadi, S. (2025). An exploration into the development of Iranian EFL learners’ oral fluency through online dynamic assessment: A case study. Cogent Education, 12(1), 2549788. https://doi.org/10.1080/2331186X.2025.2549788
Erdiana, L., Dziqy, A. N. A., Farouq, A. A., & Slamet, J. (2025). Enhancing listening comprehension in non-English majors through AI-integrated gamified formative assessment. Applied Research on English Language, 14(3). https://doi.org/10.22108/are.2025.144695.2475
Fadillah, R., & Bariyyah, N. A. (2024). Implementation of the use of artificial intelligence (AI) based animation media to enhance vocabulary learning skills of elementary school students. SHS Web of Conferences, 202, 06004. https://doi.org/10.1051/shsconf/202420206004
Fanoush, T., Al-Khatib, W. G., Amro, M., Alzahrani, A., & Elshafei, M. (2025). Mispronunciation detection and diagnosis for young Arabic learners using transfer learning. IEEE Access, 13, 175047–175068. https://doi.org/10.1109/ACCESS.2025.3616335
Ghanipour, F., & Bozorgian, H. (2025). EFL learners’ challenges in metacognitive listening intervention. International Journal of Applied Linguistics, ijal.12862. https://doi.org/10.1111/ijal.12862
Hill, J. R., & Hannafin, M. J. (2001). Teaching and learning in digital environments: The resurgence of resource-based learning. Educational Technology Research and Development, 49(3), 37–52. https://doi.org/10.1007/BF02504914
Itriq, M., & Mohd Noor, M. H. (2025). Arabic hate speech detection using deep learning: A state-of-the-art survey of advances, challenges, and future directions (2020–2024). PeerJ Computer Science, 11, e3133. https://doi.org/10.7717/peerj-cs.3133
Jones, D. (2007). Speaking, listening, planning and assessing: The teacher’s role in developing metacognitive awareness. Early Child Development and Care, 177(6–7), 569–579. https://doi.org/10.1080/03004430701378977
Kumar, Y., Koul, A., & Singh, C. (2023). Deep learning approaches in text-to-speech system: A systematic review and recent research perspective. Multimedia Tools and Applications, 82(10), 15171–15197. https://doi.org/10.1007/s11042-022-13943-4
Lee, J., & Ko, Y. (2025). Comparing metaverse and face-to-face instruction for enhancing English skills and attitudes among EFL adolescents. Education and Information Technologies, 30(6), 8215–8244. https://doi.org/10.1007/s10639-024-13162-4
Mahajan, M. (2022). A literature review of the opportunities and barriers in computer-assisted language learning (CALL). Journal of English Language and Literature, 09(03), 37–48. https://doi.org/10.54513/joell.2022.9305
Mohammed, T. (2022). Designing an Arabic speaking and listening skills e-course: Resources, activities, and students’ perceptions. Electronic Journal of E-Learning, 20(1), 53–68. https://doi.org/10.34190/ejel.20.1.2177
Mohd Sidek, H., & Mikail, I. (2017). Arabic as a second language listening comprehension: Instruction and assessment. Ulum Islamiyyah, 20, 21–34. https://doi.org/10.33102/uij.vol20no0.32
Morzy, M. (2025). Spoken language processing: Conversational AI for spontaneous human dialogues (1st ed.). Springer Nature Switzerland. https://doi.org/10.1007/978-3-031-88566-2
Oumaima, Z., Abdelouafi, M., & Meryem, E. H. (2018). Text-to-speech technology for Arabic language learners. In 2018 IEEE 5th International Congress on Information Science and Technology (CiSt) (pp. 432–436). https://doi.org/10.1109/CIST.2018.8596372
Rakhlin, N. V., Li, N., Aljughaiman, A., & Grigorenko, E. L. (2025). “Speech is golden”: The importance of colloquial Arabic for reading standard Arabic for beginning readers. Journal of Speech, Language, and Hearing Research, 68(3S), 1441–1467. https://doi.org/10.1044/2024_JSLHR-23-00522
Rehman, N., Huang, X., Mahmood, A., Maqbool, S., & Javed, S. (2024). Enhancing the quality of research synopsis of international students through peer feedback: A case study. New Directions for Child and Adolescent Development, 2024(1), 1271802. https://doi.org/10.1155/2024/1271802
Sun, W. (2023). The impact of automatic speech recognition technology on second language pronunciation and speaking skills of EFL learners: A mixed methods investigation. Frontiers in Psychology, 14, 1210187. https://doi.org/10.3389/fpsyg.2023.1210187
Toker, S. (2022). The progress of 21st-century skills throughout instructional design projects: A quasi-experimental comparison of rapid prototyping and Dick and Carey models. Education and Information Technologies, 27(2), 1959–1992. https://doi.org/10.1007/s10639-021-10673-2
Ubaidillah, U., Millah, F. I., & Sapitri, N. (2024). The use of online media “alefbata.com” in improving Arabic listening skills: Experimental study. Al-Ta’rib: Jurnal Ilmiah Program Studi Pendidikan Bahasa Arab IAIN Palangka Raya, 12(1), 103–114. https://doi.org/10.23971/altarib.v12i1.7852
Wang, X., Gao, Y., Sun, F., & Wang, Q. (2024). Unveiling the tapestry of teacher belief research: Tracing the present and forging the future through bibliometric analysis. Current Psychology, 43(17), 15659–15672. https://doi.org/10.1007/s12144-023-05546-5
Yan, W., Li, B., & Lowell, V. L. (2025). Integrating artificial intelligence and extended reality in language education: A systematic literature review (2017–2024). Education Sciences, 15(8), 1066. https://doi.org/10.3390/educsci15081066
Zhang, P., & Graham, S. (2020). Learning vocabulary through listening: The role of vocabulary knowledge and listening proficiency. Language Learning, 70(4), 1017–1053. https://doi.org/10.1111/lang.12411
Zhang, Y., & Dong, C. (2024). Exploring the digital transformation of generative AI-assisted foreign language education: A socio-technical systems perspective based on mixed-methods. Systems, 12(11), 462. https://doi.org/10.3390/systems12110462
Zikrillah, Erlina, E., Rafli, Z., & Amrulloh, M. A. (2025). The contribution of bilingualism to the enhancement of Arabic listening and speaking skills in language instruction. Jurnal Pendidikan Bahasa, 14(1), 183–200. https://doi.org/10.31571/bahasa.v14i1.9058
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Luthfiyatuz Zuhriyah, Asep Sunarko, Ahmad Zuhdi, Moh Ali Khusain

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Copyright Notice
Authors who publish in Al-Lisan: Jurnal Bahasa (e-Journal) agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.






