The Benefits and Limitations of the Use of Generative Artificial Intelligence Tools in the Acquisition of Productive Skills in English as a Foreign Language—A Systematic Analysis
Abstract
1. Introduction
2. Theoretical Background
- In what areas is the use of GenAI technology effective in the acquisition of EFL writing skills?
- In what areas is the use of GenAI technology effective in the acquisition of EFL speaking skills?
- What limitations and challenges are reported in the selected studies regarding the use of GenAI technology in EFL?
3. Methods
- Time span of 2024–2025, in order to report the most current trends of GenAI on EFL.
- Linguistics and educational subject area categories of articles were selected since this search is implemented from an educational standpoint.
- Journal articles written in English.
- Research that focused on the acquisition of EFL.
- Research that used quantitative means of measurements, quasi-experimental, or mixed methodology.
- Studies that utilize GenAI tools. At this point, it should be mentioned that we included tools that had some elements of GenAI technologies and were not exclusively constructed on the idea of producing human-like products.
- Any articles published before the set date of 2024.
- Mechanics or computer technologies subject area categories of articles.
- Studies that did not make a direct reference to a GenAI tool or tools.
- Studies written in languages other than English.
- Literature overviews and systematic or scoping reviews.
- Studies that made reference to L2 acquisition which was other than the English language.
- Studies that included qualitative means of measurements, in order to ensure empirical focus on skill development.
- Studies that examined relevant topics in accordance with students’ or teachers’ views and perceptions. We deliberately excluded studies that based their results solely on students’ or teachers’ perceptions, as we primarily wanted to focus on empirical outcomes. Thus, we intended to focus on the measured effects on writing and speaking skills, as well as documented challenges and limitations, within empirical contexts.
4. Results
4.1. RQ1: Benefits on Writing Skills
4.2. RQ2: Benefits on Speaking Skills
4.3. RQ3: Limitations and Concerns
5. Discussion
5.1. GenAI in EFL Writing
5.2. GenAI in EFL Speaking
5.3. Concerns and Limitations
5.4. Pedagogical Implications
5.5. Limitations and Future Research
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| GenAI | Generative Artificial Intelligence |
| AI | Artificial Intelligence |
| EFL | English as a second/foreign language |
| AR | Augmented Reality |
| VR | Virtual Reality |
| L2 | Second language |
| ML | Machine Learning |
| PLL | Personalized Language Learning |
References
- Hsu, M.; Chen, P.; Yu, C. Proposing a task-oriented chatbot system for EFL learners speaking practice. Interact. Learn. Environ. 2021, 31, 4297–4308. [Google Scholar] [CrossRef]
- Du, Y.; Gao, H. Determinants affecting teachers’ adoption of AI-based applications in EFL context: An analysis of analytic hierarchy process. Educ. Inf. Technol. 2022, 27, 9357–9384. [Google Scholar] [CrossRef]
- Bygate, M. Speaking; Oxford University Press: Oxford, UK, 1987. [Google Scholar]
- Swain, M. Communicative competence: Some roles of comprehensible input and comprehensible output in its development. In Input in Second Language Acquisition; Gass, S., Madden, C., Eds.; Newbury House: Rowley, MA, USA, 1985; pp. 235–253. [Google Scholar]
- Canale, M.; Swain, M. Theoretical Bases of Communicative Approaches to Second Language Teaching and Testing. Appl. Linguist. 1980, 1, 1–47. [Google Scholar] [CrossRef]
- Swain, M. Three functions of output in second language learning. In Principle and Practice in Applied Linguistics: Studies in Honour; Widdowson, H.G., Cook, G., Seidlhofer, B., Eds.; Oxford University Press: Oxford, UK, 1995; pp. 125–144. [Google Scholar]
- Zhuang, M.; Long, S.; Martin, F.; Castellanos-Reyes, D. The affordances of Artificial Intelligence (AI) and ethical considerations across the instruction cycle: A systematic review of AI in online higher education. Internet High. Educ. 2025, 67, 101039. [Google Scholar] [CrossRef]
- Higgins, J.P.T.; Thomas, J.; Chandler, J.; Cumpston, M.; Li, T.; Page, M.J.; Welch, V.A. Cochrane Handbook for Systematic Reviews of Interventions, Version 6.3; Wiley: Hoboken, NJ, USA, 2022. Available online: https://training.cochrane.org/handbook (accessed on 30 May 2025).
- Peters, M.D.J.; Godfrey, C.M.; McInerney, P.; Munn, Z.; Tricco, A.C.; Khalil, H. Scoping Reviews. In JBI Manual for Evidence Synthesis; Aromataris, E., Munn, Z., Eds.; JBI: St. Miami, FL, USA, 2020; Chapter 11. [Google Scholar] [CrossRef]
- Divekar, R.; Drozdal, J.; Chabot, S.; Zhou, Y. Foreign language acquisition via Artificial Intelligence and Extended Reality: Design and evaluation. Comput. Assist. Lang. Learn. 2021, 35, 2332–2360. [Google Scholar] [CrossRef]
- Zhang, Z.; Huang, X. The impact of chatbots based on large language models on second language vocabulary acquisition. Heliyon 2024, 10, e25370. [Google Scholar] [CrossRef]
- Li, K.C.; Wong, B.T.M. How learning has been personalized: A review of literature from 2009 to 2018. In Blended Learning: Educational Innovation for Personalized Learning, Proceedings of the ICBL 2019, Hradec Kralove, Czech Republic, 2–4 July 2019; Cheung, S., Lee, L.K., Simonova, I., Kozel, T., Kwok, L.F., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2019; Volume 11546, pp. 72–84. [Google Scholar] [CrossRef]
- Mastura, T. Assessing learners’ communicative competence in different testing system. Online J. Sustain. Leadersh. Res. 2021, 1, 154–158. [Google Scholar] [CrossRef]
- Timpe-Laughlin, V.; Sydorenko, T.; Daurio, P. Using spoken dialogue technology for L2 speaking practice: What do teachers think? Comput. Assist. Lang. Learn. 2020, 35, 1194–1217. [Google Scholar] [CrossRef]
- Young, R. Interactional competence in learning, teaching, and testing. In Handbook of Research in Second Language Teaching and Learning; Routledge, E.H., Ed.; Taylor Francis Group: Abingdon, UK, 2011; pp. 426–443. [Google Scholar] [CrossRef]
- Reem, A. Chatting with AI Bot: Vocabulary Learning Assistant for Saudi EFL Learners. Engl. Lang. Teach. 2021, 14, 135. [Google Scholar] [CrossRef]
- Long, M.H. The role of the linguistic environment in second language acquisition. In Handbook of Second Language Acquisition; Ritchie, W.C., Bhatia, T.K., Eds.; Academic Press: New York, NY, USA, 1996; pp. 413–468. [Google Scholar]
- Karataş, F.; Gunyel, F.; Abedi, F.; Karadeniz, D. Incorporating AI in foreign language education: An investigation into ChatGPT’s effect on foreign language learners. Educ. Inf. Technol. 2024, 3, 3195–3217. [Google Scholar] [CrossRef]
- Koraishi, O. Teaching English in the age of AI: Embracing ChatGPT to optimize EFL materials and assessment. LET Linguist. Lit. Engl. Teach. J. 2023, 3, 55–72. Available online: https://langedutech.com/letjournal/index.php/let/article/view/48 (accessed on 14 October 2025).
- Liu, J.; Hamid, H.A.; Bao, X. Motivation and achievement in EFL: The power of instructional approach. Front. Educ. 2025, 10, 1614388. [Google Scholar] [CrossRef]
- Liu, X.J.; Wang, J.; Zou, B. Evaluating an AI speaking assessment tool: Score accuracy, perceived validity, and oral peer feedback as feedback enhancement. J. Engl. Acad. Purp. 2025, 75, 101505. [Google Scholar] [CrossRef]
- Lo, C.K.; Yu, P.L.H.; Xu, S.; Ng, D.T.K.; Jong, M.S.Y. Exploring the application of ChatGPT in ESL/EFL education and related research issues: A systematic review of empirical studies. Smart Learn. Environ. 2024, 11, 50. [Google Scholar] [CrossRef]
- Hoang Mai Tram, N.; Trung Nguyen, T.; Duc Tran, C. ChatGPT as a tool for self-learning English among EFL learners: A multi-methods study. System 2024, 127, 103528. [Google Scholar] [CrossRef]
- Huang, J.; Li, S. Opportunities and challenges in the application of ChatGpt in foreign language teaching. Int. J. Educ. Soc. Sci. Res. 2023, 6, 75–89. [Google Scholar] [CrossRef]
- Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef]
- Abuhussein, H.F.; Badah, A. The Role of AI-based Writing Tools on L2 Writing Competency: Evidence from Palestinian EFL Learners. Dirasat Hum. Soc. Sci. 2025, 52, 6566. [Google Scholar] [CrossRef]
- Afandi, I. Assessing text comprehension proficiency: Indonesian higher education students vs. ChatGPT. XLinguae 2024, 17, 49–68. [Google Scholar] [CrossRef]
- Al-Husban, N.A. The impact of AI-assisted language learning tools on augmenting university EFL students’ speaking skills in Jordan. J. Appl. Learn. Teach. 2025, 8, 116–127. [Google Scholar] [CrossRef]
- Alsaweed, W.; Aljebreen, S. Investigating the Accuracy of ChatGPT as a Writing Error Correction Tool. Int. J. Comput.-Assist. Lang. Learn. Teach. 2024, 14, 18. [Google Scholar] [CrossRef]
- Alsofyani, A.H.; Barzanji, A.M. The Effects of ChatGPT-Generated Feedback on Saudi EFL Learners’ Writing Skills and Perception at the Tertiary Level: A Mixed-Methods Study. J. Educ. Comput. Res. 2025, 63, 431–463. [Google Scholar] [CrossRef]
- Alwasidi, M.A.; Al-Khalifah, K.S. Assessing the Impact of ChatGPT on EFL Students’ Writing Productivity and Proficiency. J. Lang. Teach. Res. 2025, 16, 986–995. [Google Scholar] [CrossRef]
- Asadi, M.; Ebadi, S.; Mohammadi, L. The impact of integrating ChatGPT with teachers’ feedback on EFL writing skills. Think. Ski. Creat. 2025, 56, 101766. [Google Scholar] [CrossRef]
- Asadi, M.; Ebadi, S.; Salman, A.R.; Taheri, R.; Mohammadi, L. Investigating the effects of AI-assisted teacher instruction on online IELTS writing. Soc. Sci. Humanit. Open 2025, 12, 101747. [Google Scholar] [CrossRef]
- Bodaubekov, A.; Agaidarova, S.; Zhussipbek, T.; Gaipov, D.; Balta, N. Leveraging AI to enhance writing skills of senior TFL students in Kazakhstan: A case study using “Write Improve”. Contemp. Educ. Technol. 2025, 17, 15687. [Google Scholar] [CrossRef]
- Chen, A.; Jia, J.; Li, Y.; Fu, L. Investigating the Effect of Role-Play Activity With GenAI Agent on EFL Students’ Speaking Performance. J. Educ. Comput. Res. 2025, 63, 99–125. [Google Scholar] [CrossRef]
- Chen, C.; Gong, Y. The Role of AI-Assisted Learning in Academic Writing: A Mixed-Methods Study on Chinese as a Second Language Students. Educ. Sci. 2025, 15, 141. [Google Scholar] [CrossRef]
- Chen, S.-Y.R. Enhancing ESP Writing Skills Through the Use of Termsoup: An Exploratory Study. Technol. Lang. Teach. Learn. 2025, 7, 102355. [Google Scholar] [CrossRef]
- Darmawansah, D.; Rachman, D.; Febiyani, F.; Hwang, G.-J. ChatGPT-supported collaborative argumentation: Integrating collaboration script and argument mapping to enhance EFL students’ argumentation skills. Educ. Inf. Technol. 2025, 30, 3803–3827. [Google Scholar] [CrossRef]
- Duong, T.; Suppasetseree, S. The Effects of an Artificial Intelligence Voice Chatbot on Improving Vietnamese Undergraduate Students’ English Speaking Skills. Int. J. Learn. Teach. Educ. Res. 2024, 23, 293–321. [Google Scholar] [CrossRef]
- Duong, T.-N.-A.; Chen, H.-L. An AI Chatbot for EFL Writing: Students’ Usage Tendencies, Writing Performance, and Perceptions. J. Educ. Comput. Res. 2025, 63, 406–430. [Google Scholar] [CrossRef]
- ElEbyary, K.; Shabara, R.; Boraie, D. The differential role of AI-operated WCF in L2 students’ noticing of errors and its impact on writing scores. Lang. Test. Asia 2024, 14, 59. [Google Scholar] [CrossRef]
- Gao, H.; Hashim, H.; Md Yunus, M. Assessing the reliability and relevance of DeepSeek in EFL writing evaluation: A generalizability theory approach. Lang. Test. Asia 2025, 15, 33. [Google Scholar] [CrossRef]
- Guo, K.; Pan, M.; Li, Y.; Lai, C. Effects of an AI-supported approach to peer feedback on university EFL students’ feedback quality and writing ability. Internet High. Educ. 2024, 63, 100962. [Google Scholar] [CrossRef]
- Han, J. Exploring ChatGPT-supported teacher feedback in the EFL context. System 2024, 126, 103502. [Google Scholar] [CrossRef]
- Jmaiel, H.A.; Abukhait, R.O.; Mohamed, A.M.; Shaaban, T.S.; Al-khresheh, M.H.; AL-Qadri, A.H. The role of ChatGPT in enhancing EFL students’ ESP writing skills: An experimental study of gender and major differences. Discov. Educ. 2025, 4, 240. [Google Scholar] [CrossRef]
- Kanoksilapatham, B.; Takrudkaew, T. Integrating ChatGPT to Enhance University Students‘Communication Skills: Pedagogical Considerations. World J. Engl. Lang. 2025, 15, 173–183. [Google Scholar] [CrossRef]
- Kemelbekova, Z.; Degtyareva, X.; Yessenaman, S.; Ismailova, D.; Seidaliyeva, G. AI in teaching English as a foreign language: Effectiveness and prospects in Kazakh higher education. XLinguae 2024, 17, 69–83. [Google Scholar] [CrossRef]
- Kim, M.; Chon, Y.V. The impact of self-revision, machine translation, and ChatGPT on L2 writing: Raters’ assessments, linguistic complexity, and error correction. Assess. Writ. 2025, 65, 100950. [Google Scholar] [CrossRef]
- Lai, Z.C.-C. Enhancing EFL Oral Proficiency Through a ChatGPT-Integrated BOPPPS Learning Framework. Int. J. Online Pedagog. Course Des. 2025, 15, 1–21. [Google Scholar] [CrossRef]
- Liebenow, L.W.; Schmidt, F.T.C.; Meyer, J.; Fleckenstein, J. Self-assessment accuracy in the age of artificial Intelligence: Differential effects of LLM-generated feedback. Comput. Educ. 2025, 237, 105385. [Google Scholar] [CrossRef]
- Mekheimer, M. Generative AI-assisted feedback and EFL writing: A study on proficiency, revision frequency and writing quality. Discov. Educ. 2025, 4, 170. [Google Scholar] [CrossRef]
- Mingyan, M.; Noordin, N.; Razali, A.B. Improving EFL speaking performance among undergraduate students with an AI-powered mobile app in after-class assignments: An empirical investigation. Humanit. Soc. Sci. Commun. 2025, 12, 370. [Google Scholar] [CrossRef]
- Mirzoyeva, L.; Makhanova, Z.; Ibrahim, M.K.; Snezhko, Z. Formation of auditory and speech competences in learning English based on neural network technologies: Psycholinguistic aspect. Cogent Educ. 2024, 11, 2404264. [Google Scholar] [CrossRef]
- Ozdere, M. AI in Academic Writing: Assessing the Effectiveness, Grading Consistency, and Student Perspectives of ChatGPT and You.com for EFL Students. Int. J. Technol. Educ. 2025, 8, 123–154. [Google Scholar] [CrossRef]
- Pack, A.; Barrett, A.; Escalante, J. Large language models and automated essay scoring of English language learner writing: Insights into validity and reliability. Comput. Educ. Artif. Intell. 2024, 6, 100234. [Google Scholar] [CrossRef]
- Polakova, P.; Ivenz, P. The impact of ChatGPT feedback on the development of EFL students’ writing skills. Cogent Educ. 2024, 11, 2410101. [Google Scholar] [CrossRef]
- Rahmi, R.; Amalina, Z.; Rodgers, A. Does It Really Help? Exploring the Impact of Al-Generated Writing Assistant on the Students’ English Writing. Stud. Engl. Lang. Educ. 2024, 11, 998–1012. [Google Scholar] [CrossRef]
- Sawangwan, S. ChatGPT vs. Teacher Roles in Developing EFL Writing. Int. J. Comput. Assist. Lang. Learn. Teach. 2024, 14, 1–21. [Google Scholar] [CrossRef]
- Sayed, B.T.; Bani Younes, Z.B.; Alkhayyat, A.; Adhamova, I.; Teferi, H. To be with artificial intelligence in oral test or not to be: A probe into the traces of success in speaking skill, psychological well-being, autonomy, and academic buoyancy. Lang. Test. Asia 2024, 14, 49. [Google Scholar] [CrossRef]
- Schenck, A. ChatGPT Is Powerful, but Does It Have Power Distance? Int. J. Adult Educ. Technol. 2024, 15, 1–17. [Google Scholar] [CrossRef]
- Shaalan, I.E.-N.A.W. The Impact of Artificial Intelligence and Machine Learning on Linguistic Accuracy, Fluency, and Self-Direction Among Advanced EFL Students. Theory Pract. Lang. Stud. 2025, 15, 1987–1995. [Google Scholar] [CrossRef]
- Shabara, R.; ElEbyary, K.; Boraie, D. TEACHERS or CHATGPT: The ISSUE of ACCURACY and CONSISTENCY in L2 ASSESSMENT. Teach. Engl. Technol. 2024, 24, 71–92. [Google Scholar] [CrossRef]
- Shafiee Rad, H. Revolutionizing L2 speaking proficiency, willingness to communicate, and perceptions through artificial intelligence: A case of Speeko application. Innov. Lang. Learn. Teach. 2024, 18, 364–379. [Google Scholar] [CrossRef]
- Shikun, S.; Grigoryan, G.; Ning, H.; Harutyunyan, H. AI Chatbots: Developing English Language Proficiency in EFL Classroom. Arab. World Engl. J. 2024, 1, 292–305. [Google Scholar] [CrossRef]
- Tai, T.-Y. Exploring the effects of intelligent personal assistants on bilingual education teachers’ L2 speaking proficiency. Interact. Learn. Environ. 2025, 33, 440–451. [Google Scholar] [CrossRef]
- Tai, T.-Y.; Chen, H.H.-J. Navigating elementary EFL speaking skills with generative AI chatbots: Exploring individual and paired interactions. Comput. Educ. 2024, 220, 105112. [Google Scholar] [CrossRef]
- Tran, T.T.T. Enhancing EFL Writing Revision Practices: The Impact of AI- and Teacher-Generated Feedback and Their Sequences. Educ. Sci. 2025, 15, 232. [Google Scholar] [CrossRef]
- Tsai, C.-Y.; Lin, Y.-T.; Brown, I.K. Impacts of ChatGPT-assisted writing for EFL English majors: Feasibility and challenges. Educ. Inf. Technol. 2024, 29, 22427–22445. [Google Scholar] [CrossRef]
- Uchida, S.; Negishi, M. Assigning CEFR-J levels to English learners’ writing: An approach using lexical metrics and generative AI. Res. Methods Appl. Linguist. 2025, 4, 100199. [Google Scholar] [CrossRef]
- Uyar, A.; Büyükahıska, D. Artificial intelligence as an automated essay scoring tool: A focus on ChatGPT. Int. J. Assess. Tools Educ. 2025, 12, 20–32. [Google Scholar] [CrossRef]
- Yasmin, M.; Fatima, W.; Irshad, I. Evaluating ChatGPT’s effectiveness in enhancing argumentative writing: A quasi-experimental study of EFL learners in Pakistan. Sustain. Futures 2025, 10, 100809. [Google Scholar] [CrossRef]
- Yavuz, F.; Çelik, Ö.; Yavaş Çelik, G. Utilizing large language models for EFL essay grading: An examination of reliability and validity in rubric-based assessments. Br. J. Educ. Technol. 2025, 56, 150–166. [Google Scholar] [CrossRef]
- Yildiz, C. ChatGPT Integration in EFL Education: A Path to Enhanced Speaking Self-Efficacy. Novitas-ROYAL 2024, 18, 167–182. [Google Scholar] [CrossRef]
- Zhang, R.; Zou, D.; Cheng, G.; Xie, H. Flow in ChatGPT-based logic learning and its influences on logic and self-efficacy in English argumentative writing. Comput. Hum. Behav. 2025, 162, 108457. [Google Scholar] [CrossRef]
- Zhao, D. The impact of AI-enhanced natural language processing tools on writing proficiency: An analysis of language precision, content summarization, and creative writing facilitation. Educ. Inf. Technol. 2025, 30, 8055–8086. [Google Scholar] [CrossRef]
- Zheldibayeva, R. GenAI as a Learning Buddy for Non-English Majors: Effects on Listening and Writing Performance. Educ. Process Int. J. 2025, 14, 51. [Google Scholar] [CrossRef]
- Zhou, Q.; Hashim, H.; Sulaiman, N.A. Supporting english speaking practice in higher education: The impact of AI chatbot-integrated mobile-assisted blended learning framework. Educ. Inf. Technol. 2025, 30, 14629–14660. [Google Scholar] [CrossRef]
- Lai, Z.C.-C. The Impact of AI-Assisted Blended Learning on Writing Efficacy and Resilience. Int. J. Comput.-Assist. Lang. Learn. Teach. 2025, 15, 1–21. [Google Scholar] [CrossRef]








| Author(s) | Title | Year | AI Tool(s) | Methodology | Country | Sample Size |
|---|---|---|---|---|---|---|
| Abuhussein H.F.; Badah A. [26] | The Role of AI-based Writing Tools on L2 Writing Competency: Evidence from Palestinian EFL Learners | 2025 | Grammarly | mixed methods | Jordan | 73 |
| Afandi, I. [27] | Assessing text comprehension proficiency: Indonesian higher education students vs ChatGPT | 2024 | ChatGPT | quantitative | Indonesia | 288 |
| Al-Husban N.A. [28] | The impact of AI-assisted language learning tools on augmenting university EFL students’ speaking skills in Jordan | 2025 | ELSA | controlled experiment | Jordan | 34 |
| Alsaweed W.; Aljebreen S. [29] | Investigating the Accuracy of ChatGPT as a Writing Error Correction Tool | 2024 | ChatGPT | mixed methods | Saudi Arabia | |
| Alsofyani A.H.; Barzanji A.M. [30] | The Effects of ChatGPT-Generated Feedback on Saudi EFL Learners’ Writing Skills and Perception at the Tertiary Level: A Mixed-Methods Study | 2025 | ChatGPT | mixed methods | Saudi Arabia | 102 |
| Alwasidi M.A.; Al-Khalifah K.S. [31] | Assessing the Impact of ChatGPT on EFL Students’ Writing Productivity and Proficiency | 2025 | ChatGPT | controlled experiment | Saudi Arabia | 52 |
| Asadi M.; Ebadi S.; Mohammadi L. [32] | The impact of integrating ChatGPT with teachers’ feedback on EFL writing skills | 2025 | ChatGPT | mixed methods | Iran | 68 |
| Asadi M.; Ebadi S.; Salman A.R.; Taheri R.; Mohammadi L. [33] | Investigating the effects of AI-assisted teacher instruction on online IELTS writing | 2025 | Copilot | mixed methods | Iran | 14 |
| Bodaubekov A.; Agaidarova S.; Zhussipbek T.; Gaipov D.; Balta N. [34] | Leveraging AI to enhance writing skills of senior TFL students in Kazakhstan: A case study using “Write & Improve” | 2025 | Write & Improve | quasi-experimental | Kazakhstan | 4 instructors + 134 students |
| Chen A.; Jia J.; Li Y.; Fu L. [35] | Investigating the Effect of Role-Play Activity With GenAI Agent on EFL Students’ Speaking Performance | 2025 | Human-AI role-play | quasi-experimental | China | 53 |
| Chen C.; Gong Y. [36] | The Role of AI-Assisted Learning in Academic Writing: A Mixed-Methods Study on Chinese as a Second Language Students | 2025 | ChatGPT | mixed methods | China | 50 |
| Chen S.-Y.R. [37] | Enhancing ESP Writing Skills Through the Use of Termsoup: An Exploratory Study | 2025 | Termsoup | controlled experiment | Taiwan | 10 |
| Darmawansah D.; Rachman D.; Febiyani F.; Hwang G.-J. [38] | ChatGPT-supported collaborative argumentation: Integrating collaboration script and argument mapping to enhance EFL students’ argumentation skills | 2025 | ChatGPT | quasi-experimental | 67 | |
| Duong T.; Suppasetseree S. [39] | The Effects of an Artificial Intelligence Voice Chatbot on Improving Vietnamese Undergraduate Students’ English Speaking Skills | 2024 | quasi-experimental | Vietnam | 30 | |
| Duong T.-N.-A.; Chen H.-L. [40] | An AI Chatbot for EFL Writing: Students’ Usage Tendencies, Writing Performance, and Perceptions | 2025 | Writing assistant Bot (WAB) | mixed methods | Vietnam | 47 |
| ElEbyary K.; Shabara R.; Boraie D. [41] | The differential role of AI-operated WCF in L2 students’ noticing of errors and its impact on writing scores | 2024 | Grammarly, E-rater | quasi-experimental | Egypt | 75 |
| Gao H.; Hashim H.; Md Yunus M. [42] | Assessing the reliability and relevance of DeepSeek in EFL writing evaluation: a generalizability theory approach | 2025 | DeepSeek | mixed methods | China | 92 |
| Guo K.; Pan M.; Li Y.; Lai C. [43] | Effects of an AI-supported approach to peer feedback on university EFL students’ feedback quality and writing ability | 2024 | Eva | controlled experiment | China | 60 |
| Han J.; Mimi Li [44] | Exploring ChatGPT-supported teacher feedback in the EFL context | 2024 | ChatGPT | controlled experiment | China | 4 instructors + 102 students |
| Jmaiel H.A.; Abukhait R.O.; Mohamed A.M.; Shaaban T.S.; Al-khresheh M.H.; AL-Qadri A.H. [45] | The role of ChatGPT in enhancing EFL students’ ESP writing skills: an experimental study of gender and major differences | 2025 | ChatGPT | controlled experiment | 117 | |
| Kanoksilapatham B.; Takrudkaew T. [46] | Integrating ChatGPT to Enhance University Students‘Communication Skills: Pedagogical Considerations | 2025 | ChatGPT | mixed methods | Thailand | 93 |
| Kemelbekova Z.; Degtyareva X.; Yessenaman S.; Ismailova D.; Seidaliyeva G. [47] | AI in teaching English as a foreign language: Effectiveness and prospects in Kazakh higher education | 2024 | controlled experiment | Kazakhstan | 51 | |
| Kim M.; Chon Y.V. [48] | The impact of self-revision, machine translation, and ChatGPT on L2 writing: Raters’ assessments, linguistic complexity, and error correction | 2025 | ChatGPT | controlled experiment | Korea | 79 |
| Lai Z.C.-C. [49] | Enhancing EFL Oral Proficiency Through a ChatGPT-Integrated BOPPPS Learning Framework | 2025 | BOPPS, ChatGPT | controlled experiment | Taiwan | 89 |
| Liebenow L.W.; Schmidt F.T.C.; Meyer J.; Fleckenstein J. [50] | Self-assessment accuracy in the age of artificial Intelligence: Differential effects of LLM-generated feedback | 2025 | ChatGPT | controlled experiment | German | 459 |
| Liu J.; Hamid H.A.; Bao X. [20] | Motivation and achievement in EFL: the power of instructional approach | 2025 | DouBao | mixed methods | China | 43 |
| Liu X.J.; Wang J.; Zou B. [21] | Evaluating an AI speaking assessment tool: Score accuracy, perceived validity, and oral peer feedback as feedback enhancement | 2025 | EAP Talk | mixed methods | 64 students + 5 teachers | |
| Mekheimer M. [51] | Generative AI-assisted feedback and EFL writing: a study on proficiency, revision frequency and writing quality | 2025 | Grammarly | mixed methods | Egypt | 60 |
| Mingyan M.; Noordin N.; Razali A.B. [52] | Improving EFL speaking performance among undergraduate students with an AI-powered mobile app in after-class assignments: an empirical investigation | 2025 | Liulishuo | quasi-experimental | China | 63 |
| Mirzoyeva L.; Makhanova Z.; Ibrahim M.K.; Snezhko Z. [53] | Formation of auditory and speech competences in learning English based on neural network technologies: psycholinguistic aspect | 2024 | Speechace, Rosetta Stone | mixed methods | Kazakhstan | 180 |
| Ozdere, M [54] | AI in Academic Writing: Assessing the Effectiveness, Grading Consistency, and Student Perspectives of ChatGPT and You.com for EFL Students | 2025 | ChatGPT, You.com | mixed methods | Turkey | 16 |
| Pack A.; Barrett A.; Escalante J. [55] | Large language models and automated essay scoring of English language learner writing: Insights into validity and reliability | 2024 | Google’s PaLM 2, Anthropic’s Claude 2, ChatGPT | quasi-experimental | Asia-Pacific | 119 |
| Polakova P.; Ivenz P. [56] | The impact of ChatGPT feedback on the development of EFL students’ writing skills | 2024 | ChatGPT | quasi-experimental | 110 | |
| Rahmi R.; Amalina Z.; Andriansyah; Rodgers A. [57] | Does It Really Help? Exploring the Impact of Al-Generated Writing Assistant on the Students’ English Writing | 2024 | ParagraphAI | mixed methods | Indonesia | 4 |
| Sawangwan S. [58] | ChatGPT vs Teacher Roles in Developing EFL Writing | 2024 | ChatGPT | mixed methods | Thailand | 50 |
| Sayed B.T.; Bani Younes Z.B.; Alkhayyat A.; Adhamova I.; Teferi H. [59] | To be with artificial intelligence in oral test or not to be: a probe into the traces of success in speaking skill, psychological well-being, autonomy, and academic buoyancy | 2024 | ChatGPT | mixed methods | Ethiopia | 28 |
| Schenck, A [60] | ChatGPT Is Powerful, but Does It Have Power Distance? A Study of Culturally Imbued Discourse in AI-Generated Essays | 2024 | ChatGPT | quantitative | 200 | |
| Shaalan I.E.-N.A.W. [61] | The Impact of Artificial Intelligence and Machine Learning on Linguistic Accuracy, Fluency, and Self-Direction Among Advanced EFL Students | 2025 | Grammarly, Replica, Anki | mixed methods | Saudi Arabia | 120 |
| Shabara R.; ElEbyary K.; Boraie D. [62] | TEACHERS or CHATGPT: The ISSUE of ACCURACY and CONSISTENCY in L2 ASSESSMENT | 2024 | ChatGPT | quantitative | Egypt | 100 |
| Shafiee Rad H. [63] | Revolutionizing L2 speaking proficiency, willingness to communicate, and perceptions through artificial intelligence: a case of Speeko application | 2024 | Speeko | mixed methods | 66 | |
| Shikun, S; Grigoryan, G; Huichun, N; Harutyunyan, H [64] | AI Chatbots: Developing English Language Proficiency in EFL Classroom | 2024 | Talk Pal, Bard, Dou Bao | mixed methods | China | 90 |
| Tai T.-Y. [65] | Exploring the effects of intelligent personal assistants on bilingual education teachers’ L2 speaking proficiency | 2025 | Google Assistant | mixed methods | Taiwan | 30 |
| Tai T.-Y.; Chen H.H.-J. [66] | Navigating elementary EFL speaking skills with generative AI chatbots: Exploring individual and paired interactions | 2024 | CooIE Bot | mixed methods | Taiwan | 85 |
| Tran T.T.T. [67] | Enhancing EFL Writing Revision Practices: The Impact of AI- and Teacher-Generated Feedback and Their Sequences | 2025 | Gemini | controlled experiment | Vietnam | 14 |
| Tsai C.-Y.; Lin Y.-T.; Brown I.K. [68] | Impacts of ChatGPT-assisted writing for EFL English majors: Feasibility and challenges | 2024 | ChatGPT | controlled experiment | Taiwan | 44 |
| Uchida S.; Negishi M. [69] | Assigning CEFR-J levels to English learners’ writing: An approach using lexical metrics and generative AI | 2025 | CEFR-based Writing Level Analyzer (CWLA) | Japan | ||
| Uyar, AC; Büyükahiska, D [70] | Artificial intelligence as an automated essay scoring tool: A focus on ChatGPT | 2025 | ChatGPT | quantitative | 10 | |
| Yasmin M.; Fatima W.; Irshad I. [71] | Evaluating ChatGPT’s effectiveness in enhancing argumentative writing: A quasi-experimental study of EFL learners in Pakistan | 2025 | quasi-experimental | Pakistan | 30 | |
| Yavuz F.; Çelik Ö.; Yavaş Çelik G. [72] | Utilizing large language models for EFL essay grading: An examination of reliability and validity in rubric-based assessments | 2025 | ChatGPT, Google’s Bard | 15 | ||
| Yildiz C. [73] | ChatGPT Integration in EFL Education: A Path to Enhanced Speaking Self-Efficacy | 2024 | ChatGPT | mixed methods | 65 | |
| Zhang R.; Zou D.; Cheng G.; Xie H. [74] | Flow in ChatGPT-based logic learning and its influences on logic and self-efficacy in English argumentative writing | 2025 | ChatGPT | mixed methods | China | 40 |
| Zhao D. [75] | The impact of AI-enhanced natural language processing tools on writing proficiency: an analysis of language precision, content summarization, and creative writing facilitation | 2025 | quasi-experimental | 295 | ||
| Zheldibayeva R. [76] | GenAI as a Learning Buddy for Non-English Majors: Effects on Listening and Writing Performance | 2025 | quasi-experimental | 93 | ||
| Zhou Q.; Hashim H.; Sulaiman N.A. [77] | Supporting English speaking practice in higher education: the impact of AI chatbot-integrated mobile-assisted blended learning framework | 2025 | mixed methods | China | 72 |
| Reported Limitations of Generative AI Tools in EFL Learning |
|---|
| Advantages not maintained at follow-up stage |
| ChatGPT-generated feedback as effective as teacher-generated feedback |
| Connectors |
| Content density |
| Cost for premium access |
| Disproportionate improvement raises concerns about fairness |
| Fluctuating scores |
| Further fine-tuning |
| Generic language |
| Human raters’ scores are higher |
| Increased noticing of errors with teacher generated feedback |
| Lower consistent reliability than teachers |
| No significant difference with traditional methods |
| No significant effect on students’ self-assessment |
| Overreliance |
| Performance in controlled tasks |
| Pronunciation showed no significant effect |
| Reduced verb cohesion |
| Self efficacy |
| Social dynamics |
| Same models have higher interrater reliability than others |
| Teacher instruction led to higher scores |
| Teachers’ scores higher accuracy |
| The prompt quality influences effectiveness |
| Unclear messages |
| Vocabulary |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Andreou, G.; Christani, P. The Benefits and Limitations of the Use of Generative Artificial Intelligence Tools in the Acquisition of Productive Skills in English as a Foreign Language—A Systematic Analysis. Appl. Sci. 2025, 15, 11476. https://doi.org/10.3390/app152111476
Andreou G, Christani P. The Benefits and Limitations of the Use of Generative Artificial Intelligence Tools in the Acquisition of Productive Skills in English as a Foreign Language—A Systematic Analysis. Applied Sciences. 2025; 15(21):11476. https://doi.org/10.3390/app152111476
Chicago/Turabian StyleAndreou, Georgia, and Pinelopi Christani. 2025. "The Benefits and Limitations of the Use of Generative Artificial Intelligence Tools in the Acquisition of Productive Skills in English as a Foreign Language—A Systematic Analysis" Applied Sciences 15, no. 21: 11476. https://doi.org/10.3390/app152111476
APA StyleAndreou, G., & Christani, P. (2025). The Benefits and Limitations of the Use of Generative Artificial Intelligence Tools in the Acquisition of Productive Skills in English as a Foreign Language—A Systematic Analysis. Applied Sciences, 15(21), 11476. https://doi.org/10.3390/app152111476

