Interactive web-based text-to-speech and speech recognition media for enhancing Arabic listening proficiency

Authors

  • Luthfiyatuz Zuhriyah Universitas Sains Al Qur'an
  • Asep Sunarko Universitas Sains Al Qur'an
  • Ahmad Zuhdi Universitas Sains Al Qur’an
  • Moh Ali Khusain Islamic Studies, Arrohuni Faqih Institut, Kenitra

DOI:

https://doi.org/10.30603/al.v11i1.7257

Keywords:

Arabic listening;, interactive web-based;, text-to-speech;, speech recognition media

Abstract

Background: Arabic listening lessons often rely on passive audio repetition, leaving students with little opportunity to check their understanding or receive immediate feedback. This condition slows their progress, especially in recognizing key phonemes and responding accurately.

Aims: This study aims to develop and validate a web-based interactive listening tool using text-to-speech (TTS) and speech-recognition (ASR) to support students’ Arabic listening proficiency.

Methods: The research followed the Dick, Carey & Carey model through the stages of analysis, design, development, and expert validation. Data were collected through needs analysis, classroom observations, and interviews with teachers to identify learners’ challenges in Arabic listening. After the prototype was created, expert judgment sheets were used to gather validation data from design and content specialists. The responses were then analyzed using percentage-based validity scoring to determine the feasibility of the media. The prototype integrated web-based TTS and ASR modules, specifying platform architecture, speech engines, supported languages, and technical parameters to ensure replicability.

Results: The results show that the media supports active listening practice and helps address common learner difficulties, particularly in phoneme recognition and comprehension. TTS provided clear and natural audio, while ASR offered direct corrective feedback. Expert assessments indicated high feasibility, with a design validity score of 92% and content validity of 87%.

Implications: The findings indicate that TTS–ASR integration can ease teachers’ correction workload and help students practice more independently, offering practical potential for broader classroom use.

Downloads

Download data is not yet available.

References

Alandejani, J. A., & Sayed, G. (2024). The implementation of high-impact practices and communication competency using the Arabic language. Cogent Education, 11(1), 2401253. https://doi.org/10.1080/2331186X.2024.2401253

Aldhafiri, M. D. (2020). The effectiveness of using interactive white boards in improving the Arabic listening skills of undergraduates majoring in Arabic language at Kuwaiti universities. Education and Information Technologies, 25(5), 3577–3591. https://doi.org/10.1007/s10639-020-10107-5

Al-Issa, A. S. M. (2020). The language planning situation in the Sultanate of Oman. Current Issues in Language Planning, 21(4), 347–414. https://doi.org/10.1080/14664208.2020.1764729

Alsuhaibani, Y., Mahdi, H. S., Al Khateeb, A., Al Fadda, H. A., & Alkadi, H. (2024). Web-based pronunciation training and learning consonant clusters among EFL learners. Acta Psychologica, 249, 104459. https://doi.org/10.1016/j.actpsy.2024.104459

Ardasheva, Y., Wang, Z., Adesope, O. O., & Valentine, J. C. (2017). Exploring effectiveness and moderators of language learning strategy instruction on second language and self-regulated learning outcomes. Review of Educational Research, 87(3), 544–582. https://doi.org/10.3102/0034654316689135

Asadi, I. A., Kawar, K., & Tarabeh, G. (2025). Development of verb and noun word patterns in Arabic: A comparison between typically developing children and those with reading difficulties. Journal of Speech, Language, and Hearing Research, 68(8), 3976–3988. https://doi.org/10.1044/2025_JSLHR-24-00673

Bozorgian, H., & Shamsi, E. (2025). A review of research on metacognitive instruction for listening development. International Journal of Listening, 39(1), 1–16. https://doi.org/10.1080/10904018.2023.2197008

Chalghoumi, H., Al-Thani, D., Hassan, A., Hammad, S., & Othman, A. (2022). Research on older persons’ access and use of technology in the Arab region: Critical overview and future directions. Applied Sciences, 12(14), 7258. https://doi.org/10.3390/app12147258

Chemnad, K., & Othman, A. (2023). Advancements in Arabic text-to-speech systems: A 22-year literature review. IEEE Access, 11, 30929–30954. https://doi.org/10.1109/ACCESS.2023.3260844

Chen, C.-H. (2020). Impacts of augmented reality and a digital game on students’ science learning with reflection prompts in multimedia learning. Educational Technology Research and Development, 68(6), 3057–3076. https://doi.org/10.1007/s11423-020-09834-w

Dhonburi Rajabhat University, Thailand, Prasongngern, P., Soontornwipast, K., & Ed.D., Language Institute, Thammasart University, Thailand. (2023). Effects of listening strategy instruction incorporating intensive and extensive listening on listening skills and metacognitive awareness. International Journal of Instruction, 16(4), 155–172. https://doi.org/10.29333/iji.2023.16410a

Ebrahimzadeh, Y., & Ebadi, S. (2025). An exploration into the development of Iranian EFL learners’ oral fluency through online dynamic assessment: A case study. Cogent Education, 12(1), 2549788. https://doi.org/10.1080/2331186X.2025.2549788

Erdiana, L., Dziqy, A. N. A., Farouq, A. A., & Slamet, J. (2025). Enhancing listening comprehension in non-English majors through AI-integrated gamified formative assessment. Applied Research on English Language, 14(3). https://doi.org/10.22108/are.2025.144695.2475

Fadillah, R., & Bariyyah, N. A. (2024). Implementation of the use of artificial intelligence (AI) based animation media to enhance vocabulary learning skills of elementary school students. SHS Web of Conferences, 202, 06004. https://doi.org/10.1051/shsconf/202420206004

Fanoush, T., Al-Khatib, W. G., Amro, M., Alzahrani, A., & Elshafei, M. (2025). Mispronunciation detection and diagnosis for young Arabic learners using transfer learning. IEEE Access, 13, 175047–175068. https://doi.org/10.1109/ACCESS.2025.3616335

Ghanipour, F., & Bozorgian, H. (2025). EFL learners’ challenges in metacognitive listening intervention. International Journal of Applied Linguistics, ijal.12862. https://doi.org/10.1111/ijal.12862

Hill, J. R., & Hannafin, M. J. (2001). Teaching and learning in digital environments: The resurgence of resource-based learning. Educational Technology Research and Development, 49(3), 37–52. https://doi.org/10.1007/BF02504914

Itriq, M., & Mohd Noor, M. H. (2025). Arabic hate speech detection using deep learning: A state-of-the-art survey of advances, challenges, and future directions (2020–2024). PeerJ Computer Science, 11, e3133. https://doi.org/10.7717/peerj-cs.3133

Jones, D. (2007). Speaking, listening, planning and assessing: The teacher’s role in developing metacognitive awareness. Early Child Development and Care, 177(6–7), 569–579. https://doi.org/10.1080/03004430701378977

Kumar, Y., Koul, A., & Singh, C. (2023). Deep learning approaches in text-to-speech system: A systematic review and recent research perspective. Multimedia Tools and Applications, 82(10), 15171–15197. https://doi.org/10.1007/s11042-022-13943-4

Lee, J., & Ko, Y. (2025). Comparing metaverse and face-to-face instruction for enhancing English skills and attitudes among EFL adolescents. Education and Information Technologies, 30(6), 8215–8244. https://doi.org/10.1007/s10639-024-13162-4

Mahajan, M. (2022). A literature review of the opportunities and barriers in computer-assisted language learning (CALL). Journal of English Language and Literature, 09(03), 37–48. https://doi.org/10.54513/joell.2022.9305

Mohammed, T. (2022). Designing an Arabic speaking and listening skills e-course: Resources, activities, and students’ perceptions. Electronic Journal of E-Learning, 20(1), 53–68. https://doi.org/10.34190/ejel.20.1.2177

Mohd Sidek, H., & Mikail, I. (2017). Arabic as a second language listening comprehension: Instruction and assessment. Ulum Islamiyyah, 20, 21–34. https://doi.org/10.33102/uij.vol20no0.32

Morzy, M. (2025). Spoken language processing: Conversational AI for spontaneous human dialogues (1st ed.). Springer Nature Switzerland. https://doi.org/10.1007/978-3-031-88566-2

Oumaima, Z., Abdelouafi, M., & Meryem, E. H. (2018). Text-to-speech technology for Arabic language learners. In 2018 IEEE 5th International Congress on Information Science and Technology (CiSt) (pp. 432–436). https://doi.org/10.1109/CIST.2018.8596372

Rakhlin, N. V., Li, N., Aljughaiman, A., & Grigorenko, E. L. (2025). “Speech is golden”: The importance of colloquial Arabic for reading standard Arabic for beginning readers. Journal of Speech, Language, and Hearing Research, 68(3S), 1441–1467. https://doi.org/10.1044/2024_JSLHR-23-00522

Rehman, N., Huang, X., Mahmood, A., Maqbool, S., & Javed, S. (2024). Enhancing the quality of research synopsis of international students through peer feedback: A case study. New Directions for Child and Adolescent Development, 2024(1), 1271802. https://doi.org/10.1155/2024/1271802

Sun, W. (2023). The impact of automatic speech recognition technology on second language pronunciation and speaking skills of EFL learners: A mixed methods investigation. Frontiers in Psychology, 14, 1210187. https://doi.org/10.3389/fpsyg.2023.1210187

Toker, S. (2022). The progress of 21st-century skills throughout instructional design projects: A quasi-experimental comparison of rapid prototyping and Dick and Carey models. Education and Information Technologies, 27(2), 1959–1992. https://doi.org/10.1007/s10639-021-10673-2

Ubaidillah, U., Millah, F. I., & Sapitri, N. (2024). The use of online media “alefbata.com” in improving Arabic listening skills: Experimental study. Al-Ta’rib: Jurnal Ilmiah Program Studi Pendidikan Bahasa Arab IAIN Palangka Raya, 12(1), 103–114. https://doi.org/10.23971/altarib.v12i1.7852

Wang, X., Gao, Y., Sun, F., & Wang, Q. (2024). Unveiling the tapestry of teacher belief research: Tracing the present and forging the future through bibliometric analysis. Current Psychology, 43(17), 15659–15672. https://doi.org/10.1007/s12144-023-05546-5

Yan, W., Li, B., & Lowell, V. L. (2025). Integrating artificial intelligence and extended reality in language education: A systematic literature review (2017–2024). Education Sciences, 15(8), 1066. https://doi.org/10.3390/educsci15081066

Zhang, P., & Graham, S. (2020). Learning vocabulary through listening: The role of vocabulary knowledge and listening proficiency. Language Learning, 70(4), 1017–1053. https://doi.org/10.1111/lang.12411

Zhang, Y., & Dong, C. (2024). Exploring the digital transformation of generative AI-assisted foreign language education: A socio-technical systems perspective based on mixed-methods. Systems, 12(11), 462. https://doi.org/10.3390/systems12110462

Zikrillah, Erlina, E., Rafli, Z., & Amrulloh, M. A. (2025). The contribution of bilingualism to the enhancement of Arabic listening and speaking skills in language instruction. Jurnal Pendidikan Bahasa, 14(1), 183–200. https://doi.org/10.31571/bahasa.v14i1.9058

Downloads

Published

2026-02-28

How to Cite

Zuhriyah, L., Sunarko, A., Zuhdi, A., & Khusain, M. A. (2026). Interactive web-based text-to-speech and speech recognition media for enhancing Arabic listening proficiency. Al-Lisan: Jurnal Bahasa (e-Journal), 11(1), 71–86. https://doi.org/10.30603/al.v11i1.7257