Analyzing the Impact of a Structured LLM Workshop in Different Education Levels
Abstract
:1. Introduction
2. Materials and Methods
2.1. Start of the Iterative Process
2.2. End of Iterative Process
3. Results
3.1. Practical Outcome and Preliminary Data Analysis of the Workshop Results
- -
- AI needs constant reminders about who the characters are, as it often hallucinates new companions.
- -
- The 3000-word minimum required constant new prompts that contained explanations and previous data.
- -
- The image results were imperfect; for most people, at least several tries were necessary to achieve results that had all fingers attached and had no missing limbs or broken arm shapes.
- -
- Generated images were different than what the expectations were—revealing the necessity for improving prompt skills.
- -
- The most important—it is not enough to just copy the information from an AI chat bot, because without sufficient understanding, it cannot be used effectively. There needs to be a strong link between information, technology, and application.
3.2. Surveys and Interviews
- (1)
- Pruning the questions with answers, which were not compliant with the Likert Scale format.
- (2)
- Random errors were eliminated as much as possible, due to the nature of collecting the responses—the survey was online, so no errors due to wrong reporting were possible.
- (3)
- Systematic errors were prevented due to choosing only the most important questions throughout the questionnaire. All questions that had the possibility of including such errors were pruned.
- (4)
- The possibility of misunderstanding the questions was mitigated as much as possible by introducing equalized scaling answers—“1 = bad/less, 5 = good/more”.
3.2.1. Sample Size and Description
3.2.2. Data Analysis Methods
3.2.3. Group Comparison Analysis
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Zhu, Y.; Wang, T.T.; Wang, C.; Quan, W.; Tang, M.W. Complexity-Driven Trust Dynamics in Human–Robot Interactions: Insights from AI-Enhanced Collaborative Engagements. Appl. Sci. 2023, 13, 12989. [Google Scholar] [CrossRef]
- Yang, C.; Kim, J.; Kang, D.; Eom, D.S. Vision AI System Development for Improved Productivity in Challenging Industrial Environments: A Sustainable and Efficient Approach. Appl. Sci. 2024, 14, 2750. [Google Scholar] [CrossRef]
- Kevin, M.; Baeza-Yates, R. Responsible AI in Farming: A Multi-Criteria Framework for Sustainable Technology Design. Appl. Sci. 2024, 14, 437. [Google Scholar] [CrossRef]
- Raiaan, M.A.K.; Mukta, M.S.H.; Fatema, K.; Fahad, N.M.; Sakib, S.; Mim, M.M.J.; Ahmad, J.; Ali, M.E.; Azam, S. A review on large Language Models: Architectures, applications, taxonomies, open issues and challenges. IEEE Access 2024, 12, 26839–26874. [Google Scholar] [CrossRef]
- Sufi, F. Generative Pre-Trained Transformer (GPT) in Research: A Systematic Review on Data Augmentation. Information 2024, 15, 99. [Google Scholar] [CrossRef]
- Silva, D.; Anjalee; Janaka; Wijekoon, L.; Liyanarachchi, R.; Panchendrarajan, R.; Rajapaksha, W. AI insights: A case study on utilizing ChatGPT intelligence for research paper analysis. arXiv 2024, arXiv:2403.03293. [Google Scholar]
- Stöckli, L.; Joho, L.; Lehner, F.; Hanne, T. The Personification of ChatGPT (GPT-4)—Understanding Its Personality and Adaptability. Information 2024, 6, 300. [Google Scholar] [CrossRef]
- Choi, D.; Lee, S.; Kim, S.I.; Lee, K.; Yoo, H.J.; Lee, S.; Hong, H. Unlock Life with a Chat (GPT): Integrating Conversational AI with Large Language Models into Everyday Lives of Autistic Individuals. In Proceedings of the CHI Conference on Human Factors in Computing Systems, Honolulu, HI, USA, 11–16 May 2024; pp. 1–17. Available online: https://dl.acm.org/doi/full/10.1145/3613904.3641989 (accessed on 20 June 2024).
- Almasre, M. Development and Evaluation of a Custom GPT for the Assessment of Students’ Designs in a Typography Course. Educ. Sci. 2024, 14, 148. [Google Scholar] [CrossRef]
- Bravo, F.A.; Cruz-Bohorquez, J.M. Engineering Education in the Age of AI: Analysis of the Impact of Chatbots on Learning in Engineering. Educ. Sci. 2024, 14, 484. [Google Scholar] [CrossRef]
- Valova, I.; Mladenova, T.; Kanev, G. Students’ Perception of ChatGPT Usage in Education. Int. J. Adv. Comput. Sci. Appl. 2024, 15, 466–473. [Google Scholar] [CrossRef]
- Marques, N.; Silva, R.R.; Bernardino, J. Using ChatGPT in Software Requirements Engineering: A Comprehensive Review. Future Internet 2024, 16, 180. [Google Scholar] [CrossRef]
- Anchev, A.; Georgieva, T.; Ivanov, A. Experimental Comparison of Standard and Trajectories Movement Algorithm using Intelligent Camera. In Proceedings of the 2023 4th International Conference on Communications, Information, Electronic and Energy Systems (CIEES), Plovdiv, Bulgaria, 23–25 November 2023; IEEE: Piscataway, NJ, USA; pp. 1–6. Available online: https://ieeexplore.ieee.org/abstract/document/10378783/ (accessed on 20 June 2024).
- Ivanov, A.; Ivanova, M.; Anchev, A. Mathematical Models Incorporated in a Digital Workflow for Designing an Anthropomorphous Robot. In Computer Science On-line Conference; Springer International Publishing: Cham, Switzerland, 2019; pp. 306–314. Available online: https://link.springer.com/chapter/10.1007/978-3-030-19813-8_31 (accessed on 20 June 2024).
- Ivanova, M.; Grosseck, G.; Holotescu, C. Unveiling Insights: A Bibliometric Analysis of Artificial Intelligence in Teaching. Informatics 2024, 11, 10. [Google Scholar] [CrossRef]
- Guan, C.; Mou, J.; Jiang, Z. Artificial intelligence innovation in education: A twenty-year data-driven historical analysis. Int. J. Innov. Stud. 2020, 4, 134–147. [Google Scholar] [CrossRef]
- Hou, C.; Hua, L.; Lin, Y.; Zhang, J.; Liu, G.; Xiao, Y. Application and exploration of artificial intelligence and edge computing in long-distance education on mobile network. Mob. Netw. Appl. 2021, 26, 2164–2175. [Google Scholar] [CrossRef]
- Markos, A.; Prentzas, J.; Sidiropoulou, M. Pre-Service Teachers’ Assessment of ChatGPT’s Utility in Higher Education: SWOT and Content Analysis. Electronics 2024, 13, 1985. [Google Scholar] [CrossRef]
- Sánchez-Ruiz, L.M.; Moll-López, S.; Nuñez-Pérez, A.; Moraño-Fernández, J.A.; Vega-Fleitas, E. ChatGPT challenges blended learning methodologies in engineering education: A case study in mathematics. Appl. Sci. 2023, 13, 6039. [Google Scholar] [CrossRef]
- Calvo, I.; Carrascal, E.; González, J.M.; Armentia, A.; Gil-García, J.M.; Barambones, O.; Basogain, X.; Tazo-Herran, I.; Apiñaniz, E. A Methodology to Introduce Sustainable Development Goals in Engineering Degrees by Means of Multidisciplinary Projects. Educ. Sci. 2024, 14, 583. [Google Scholar] [CrossRef]
- Kyllonen, P.C. Inequality, education, workforce preparedness, and complex problem solving. J. Intell. 2018, 6, 33. [Google Scholar] [CrossRef] [PubMed]
- Perevoznic, F.M.; Dragomir, V.D. Achieving the 2030 Agenda: Mapping the Landscape of Corporate Sustainability Goals and Policies in the European Union. Sustainability 2024, 16, 2971. [Google Scholar] [CrossRef]
- Ionescu, G.H.; Jianu, E.; Patrichi, I.C.; Ghiocel, F.; Țenea, L.; Iancu, D. Assessment of sustainable development goals (SDG) implementation in Bulgaria and future developments. Sustainability 2021, 13, 12000. [Google Scholar] [CrossRef]
- Saif, N.; Khan, S.U.; Shaheen, I.; ALotaibi, F.A.; Alnfiai, M.M.; Arif, M. Chat-GPT; validating Technology Acceptance Model (TAM) in education sector via ubiquitous learning mechanism. Comput. Hum. Behav. 2024, 154, 108097. [Google Scholar] [CrossRef]
- Albadarin, Y.; Saqr, M.; Pope, N.; Tukiainen, M. A systematic literature review of empirical research on ChatGPT in education. Discov. Educ. 2024, 3, 60. [Google Scholar] [CrossRef]
- Díaz-Santos, S.; Cigala-Álvarez, Ó.; Gonzalez-Sosa, E.; Caballero-Gil, P.; Caballero-Gil, C. Driver identification and detection of drowsiness while driving. Appl. Sci. 2024, 14, 2603. [Google Scholar] [CrossRef]
- Raharjanti, N.W.; Wiguna, T.; Purwadianto, A.; Soemantri, D.; Indriatmi, W.; Poerwandari, E.K.; Mahajudin, M.S.; Nugrahadi, N.R.; Roekman, A.E.; Saroso, O.J.D.A.; et al. Translation, validity and reliability of decision style scale in forensic psychiatric setting in Indonesia. Heliyon 2022, 8, e09810. [Google Scholar] [CrossRef] [PubMed]
- Taber, K.S. The use of Cronbach’s alpha when developing and reporting research instruments in science education. Res. Sci. Educ. 2018, 48, 1273–1296. [Google Scholar] [CrossRef]
- Field, A. Discovering Statistics Using IBM SPSS Statistics; Sage: London, UK, 2013. [Google Scholar]
Type | Count | Type | Count |
---|---|---|---|
person | 704 | truck | 19 |
chair | 139 | handbag | 15 |
car | 70 | teddy bear | 14 |
bird | 63 | dog | 14 |
bottle | 62 | umbrella | 12 |
tie | 51 | suitcase | 11 |
kite | 43 | cell phone | 10 |
potted plant | 31 | horse | 9 |
vase | 29 | cake | 9 |
book | 29 | bench | 9 |
motorcycle | 28 | tv | 8 |
backpack | 27 | bowl | 7 |
dining table | 24 | sports ball | 6 |
sheep | 21 | traffic light | 6 |
cup | 20 | scissors | 5 |
boat | 20 | fire hydrant | 5 |
clock | 19 | cat | 5 |
Category | Q# | Question Text | Answer Type |
---|---|---|---|
Story text | Q1 | How did ChatGPT help you in crafting the introduction of your short story? | Text |
Q2 | Were you able to effectively communicate the setting and atmosphere of the story’s beginning? | No/Yes | |
Q6 | Did you encounter any challenges in maintaining coherence and flow between the introduction and the first scene? | No/Yes | |
Q9 | How many challenges did you face in maintaining consistency in the story’s tone and style during the second scene? | Scale 1–5 | |
Q10 | Did ChatGPT aid you in resolving the conflict or climax of the story in the third scene? | No/Yes | |
Q12 | Did you face challenges in ensuring a satisfying and coherent conclusion to the story’s narrative? | No/Yes | |
Q15 | Did you face any challenges in maintaining visual coherence and consistency across different parts of the story? | No/Yes | |
Q16 | How did ChatGPT assist you in crafting smooth transitions between different scenes of the short story? | Text | |
Images | Q3 | Did ChatGPT assist in generating descriptive visuals for the initial scene and characters? | No/Yes |
Q4 | How did you use ChatGPT to develop the interactions between the characters in the first scene? | Text | |
Q5 | Were you able to create a visually descriptive scene for the first part of your story using ChatGPT? | No/Yes | |
Q7 | How much did ChatGPT contribute to the development of the second scene and the introduction of new characters? | Scale 1–5 | |
Q8 | Were you able to use ChatGPT to visually describe the second scene and characters effectively? | No/Yes | |
Q11 | Were you successful in creating visually descriptive elements for the resolution using ChatGPT? | No/Yes | |
Q13 | How much did LLM applications assist in generating visual descriptions for characters and scenes throughout the story? | Scale 1–5 | |
Code | Q14 | Were you able to seamlessly incorporate the generated visual elements into the HTML and CSS design of the website? | No/Yes |
Q17 | Were you able to effectively visualize and represent the transitions between scenes on the website using ChatGPT? | No/Yes | |
Q18 | Did you encounter challenges in creating seamless transitions, and how did ChatGPT contribute to overcoming them? | Text | |
Q19 | How satisfied are you with the overall assistance provided by ChatGPT in creating your short story and website? | Scale 1–5 | |
Results | Q24 | Were there many instances where ChatGPT’s responses did not align with the tone or style you wanted for your short story? | Scale 1–5 |
Q25 | How did you handle situations where ChatGPT generated content that deviated from your intended plot or character development? | Text | |
Q26 | How close was the image generated to what you wanted to create? How effective were your prompts? | Scale 1–5 | |
Q27 | What was the genre of your story? (Or genres, if there were multiple.) | Text | |
Q30 | Did you use the default character names, or did you create your own? | Own/Default | |
Q31 | How helpful were the LLM tools to you in general, when you had to complete this task? Did you manage to work together successfully? | Scale 1–5 | |
Q32 | How difficult did you find it overall to get the correct responses while using LLM tools to complete this task? | Scale 1–5 | |
General | Q20.1 | Were there specific areas where you felt ChatGPT’s assistance was particularly helpful? | Text |
Q20.2 | Were there specific areas where you felt ChatGPT’s assistance was particularly lacking? | Text | |
Q21 | In retrospect, would you consider using ChatGPT for a similar creative writing and web design task in the future? | No/Yes/Maybe | |
Q22 | Do you need any improvements or additional features you would like to see in ChatGPT to enhance its support for creative tasks? | No/Yes | |
Q23 | Were there any specific difficulties or limitations you faced in using ChatGPT for this task that you’d like addressed? (such as moderated content or ineffective prompts) | Text | |
Q28 | After working together with LLMs to generate content and code, how likely are you to use them in the future in your education? | Scale 1–5 | |
Q29 | How much did working on this project increase your interest in Information systems and technologies? | Scale 1–5 |
Education Type | Age (in Years) | Group Name | Number Participants | Study Level | Gender Percentage | |
---|---|---|---|---|---|---|
M | F | |||||
High School of Mathematics | 18–19 | 12 Math | 21 | High school grade 12 | 45% | 55% |
English Language School | 17–18 | 11 ELS | 60 | High school grade 11 | 42% | 58% |
English Language School | 18–19 | 12 ELS | 34 | High school grade 12 | 46% | 54% |
Ruse University—Software engineering, 3rd year | 21–22 | 3 SE | 15 | University 3rd year | 92% | 8% |
Descriptive Statistics | ||||||||
---|---|---|---|---|---|---|---|---|
N | Mean | Std. Deviation | Minimum | Maximum | ||||
Q24 | 128 | 3.33 | 0.989 | 1 | 5 | |||
Q26 | 128 | 3.73 | 0.900 | 1 | 5 | |||
Q28 | 128 | 3.46 | 1.019 | 1 | 5 | |||
Q29 | 128 | 3.48 | 0.955 | 1 | 5 | |||
Q31 | 128 | 3.88 | 0.896 | 1 | 5 | |||
Q32 | 128 | 2.81 | 0.929 | 1 | 5 | |||
Q32. | Q31. | Q29. | Q28. | Q26. | Q24. | |||
N | 128 | 128 | 128 | 128 | 128 | 128 | ||
Normal Parameters a,b | Mean | 2.81 | 3.88 | 3.48 | 3.46 | 3.73 | 3.33 | |
Std. Deviation | 0.929 | 0.896 | 0.955 | 1.019 | 0.900 | 0.989 | ||
Most Extreme Differences | Absolute | 0.236 | 0.212 | 0.254 | 0.221 | 0.249 | 0.216 | |
Positive | 0.225 | 0.179 | 0.254 | 0.221 | 0.189 | 0.216 | ||
Negative | −0.236 | −0.212 | −0.200 | −0.193 | −0.249 | −0.198 | ||
Test Statistic | 0.236 | 0.212 | 0.254 | 0.221 | 0.249 | 0.216 | ||
Asymp. Sig. (2-tailed) | 0.000 c | 0.000 c | 0.000 c | 0.000 c | 0.000 c | 0.000 c |
Q24. Were There Many Instances Where ChatGPT’s Responses Did Not Align with the Tone or Style You Wanted for Your Short Story? (Many 1–None 5) | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
1—Many | 2—Rather a Lot | 3—Some | 4—Rather None | 5—None | Blank | |||||||
Group | n | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | % |
11 ELS | 60 | 3 | 5.00% | 5 | 8.33% | 31 | 51.67% | 15 | 25.00% | 6 | 10.00% | 0.00% |
12 ELS | 34 | 0 | 0.00% | 4 | 11.76% | 9 | 26.47% | 13 | 38.24% | 7 | 20.59% | 2.94% |
12 Math | 21 | 2 | 9.52% | 5 | 23.81% | 9 | 42.86% | 4 | 19.05% | 1 | 4.76% | 0.00% |
3 SE | 15 | 0 | 0.00% | 3 | 20.00% | 4 | 26.67% | 5 | 33.33% | 2 | 13.33% | 6.67% |
Q26. How Close Was the Image Generated to What You Wanted to Create? How Effective Were Your Prompts? (Not at All 1—Perfectly Alike 5) | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
1—Not at All | 2—Rather Not | 3—Almost | 4—Alike | 5—Perfectly Alike | Blank | |||||||
Group | n | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | % |
11 ELS | 60 | 1 | 1.67% | 4 | 6.67% | 15 | 25.00% | 29 | 48.33% | 11 | 18.33% | 0.00% |
12 ELS | 34 | 0 | 0.00% | 2 | 5.88% | 11 | 32.35% | 15 | 44.12% | 5 | 14.71% | 2.94% |
12 Math | 21 | 0 | 0.00% | 2 | 9.52% | 8 | 38.10% | 4 | 19.05% | 6 | 28.57% | 0.00% |
3 SE | 15 | 0 | 0.00% | 0 | 0.00% | 3 | 20.00% | 8 | 53.33% | 3 | 20.00% | 6.67% |
Q28. After Working Together with LLMs to Generate Content and Code, How Likely Are You to Use Them in the Future in Your Education? (No 1–Yes 5) | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
1—Not | 2—Rather Not | 3—Maybe | 4—Closer to Yes | 5—Yes | Blank | |||||||
Group | n | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | % |
11 ELS | 60 | 1 | 1.67% | 7 | 11.67% | 33 | 55.00% | 14 | 23.33% | 4 | 6.67% | 0.00% |
12 ELS | 34 | 2 | 5.88% | 4 | 11.76% | 11 | 32.35% | 12 | 35.29% | 4 | 11.76% | 2.94% |
12 Math | 21 | 2 | 9.52% | 1 | 4.76% | 5 | 23.81% | 6 | 28.57% | 7 | 33.33% | 0.00% |
3 SE | 15 | 0 | 0.00% | 0 | 0.00% | 3 | 20.00% | 3 | 20.00% | 8 | 53.33% | 6.67% |
Q29. How Much Did Working on This Project Increase Your Interest in Information Systems and Technologies? (Decrease 1–Increase 5) | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
1—Decreased | 2—Rather Decreased | 3—Unchanged | 4—Rather Increased | 5—Increased | Blank | |||||||
Group | n | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | % |
11 ELS | 60 | 0 | 0.00% | 6 | 10.00% | 31 | 51.67% | 15 | 25.00% | 8 | 13.33% | 0.00% |
12 ELS | 34 | 1 | 2.94% | 2 | 5.88% | 14 | 41.18% | 13 | 38.24% | 3 | 8.82% | 2.94% |
12 Math | 21 | 2 | 9.52% | 3 | 14.29% | 8 | 38.10% | 1 | 4.76% | 7 | 33.33% | 0.00% |
3 SE | 15 | 0 | 0.00% | 0 | 0.00% | 5 | 33.33% | 5 | 33.33% | 4 | 26.67% | 6.67% |
Q31. How Helpful Were the LLM Tools to You in General, When You Had to Complete this Task? Did You Manage to Work Together Successfully? (Not at All 1–Very Helpful 5) | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
1—Not at All | 2—Rather Not Helpful | 3—Rather Helpful | 4—Helpful | 5—Very Helpful | Blank | |||||||
Group | n | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | % |
11 ELS | 60 | 0 | 0.00% | 0 | 0.00% | 25 | 41.67% | 23 | 38.33% | 12 | 20.00% | 0.00% |
12 ELS | 34 | 0 | 0.00% | 3 | 8.82% | 6 | 17.65% | 16 | 47.06% | 8 | 23.53% | 2.94% |
12 Math | 21 | 2 | 9.52% | 0 | 0.00% | 4 | 19.05% | 7 | 33.33% | 8 | 38.10% | 0.00% |
3 SE | 15 | 0 | 0.00% | 0 | 0.00% | 4 | 26.67% | 3 | 20.00% | 7 | 46.67% | 6.67% |
Q32. How Difficult Did You Find It Overall to Get the Correct Responses While Using LLM Tools to Complete This Task? (Not 1–Very 5) | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
1—Not Difficult | 2—Closer to Not Difficult | 3—Neutral | 4—Closer to Very Difficult | 5—Very Difficult | Blank | |||||||
Group | n | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | Friq | Friq, % | % |
11 ELS | 60 | 3 | 5.00% | 13 | 21.67% | 35 | 58.33% | 7 | 11.67% | 2 | 3.33% | 0.00% |
12 ELS | 34 | 2 | 5.88% | 13 | 38.24% | 10 | 29.41% | 6 | 17.65% | 2 | 5.88% | 2.94% |
12 Math | 21 | 2 | 9.52% | 5 | 23.81% | 8 | 38.10% | 5 | 23.81% | 1 | 4.76% | 0.00% |
3 SE | 15 | 3 | 20.00% | 3 | 20.00% | 6 | 33.33% | 2 | 16.67% | 0 | 0.00% | 6.67% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Kozov, V.; Ivanova, B.; Shoylekova, K.; Andreeva, M. Analyzing the Impact of a Structured LLM Workshop in Different Education Levels. Appl. Sci. 2024, 14, 6280. https://doi.org/10.3390/app14146280
Kozov V, Ivanova B, Shoylekova K, Andreeva M. Analyzing the Impact of a Structured LLM Workshop in Different Education Levels. Applied Sciences. 2024; 14(14):6280. https://doi.org/10.3390/app14146280
Chicago/Turabian StyleKozov, Vasil, Boyana Ivanova, Kamelia Shoylekova, and Magdalena Andreeva. 2024. "Analyzing the Impact of a Structured LLM Workshop in Different Education Levels" Applied Sciences 14, no. 14: 6280. https://doi.org/10.3390/app14146280
APA StyleKozov, V., Ivanova, B., Shoylekova, K., & Andreeva, M. (2024). Analyzing the Impact of a Structured LLM Workshop in Different Education Levels. Applied Sciences, 14(14), 6280. https://doi.org/10.3390/app14146280