Interaction with LLM-Based Systems: A Structured Review and Taxonomy of Mechanisms and Autonomy
Abstract
1. Introduction
2. Background
2.1. Large Language Models
2.2. Formalizing LLM Prompting
- I(x) represents Instruction—the core directive that communicates the user’s intention;
- C(x) represents Context—additional information such as domain constraints, background knowledge, or system-level guidance;
- D(x) represents Data—user-provided inputs, examples or reference content relevant to the task;
- O(x) represents Output specification—requirements regarding structure, format, style, length, or reasoning transparency.
2.3. Human–AI Interaction
3. Review Methodology
3.1. Database and Search Strategy
3.2. Screening and Selection Procedure
3.2.1. Initial Search (July 2024)
- The most cited records were included to reflect established and influential contributions recognised by the research community.
- The most recent records were incorporated to mitigate citation lag and ensure coverage of emerging work in this fast-moving field.
- The most relevant records, as ranked by WoS based on term occurrence in the title, abstract, and keywords, were used to strengthen alignment with the search query.
3.2.2. Update Search (January 2026)
3.2.3. Final Corpus
4. Results
4.1. Cross-Domain Interaction Patterns
4.1.1. Prompting Strategies
4.1.2. Workflow Patterns
4.1.3. Agentic Architectures
4.2. Application Domains
4.2.1. Software Development
4.2.2. Robotics
4.2.3. Education
4.3. Comparative Analysis Across Domains
5. Taxonomy for Interaction with LLM-Based Systems
5.1. Interaction Mechanism
5.2. Level of Autonomy
5.3. Human-Centred AI Principles
- Control: Users should be able to influence what the system does and stop or reverse it when needed, especially as autonomy increases. This includes simple actions such as accepting or rejecting suggestions, as well as configuring goals and constraints, pausing execution, and reviewing actions before they affect external artefacts or environments.
- Transparency: Users need cues that make the system’s behaviour understandable, including its current goals, intermediate steps, and sources of information. As autonomy and tool use increase, transparency should include plans, tool calls, and intermediate outputs presented in a way that supports intervention without overwhelming the user.
- Error handling: LLM-based interaction can fail through misunderstandings, hallucinations, or misaligned inferences. Effective systems support detection and repair through clarification dialogue, iterative refinement, and structured feedback channels, enabling recovery without excessive user burden.
- User learning: Users learn how to prompt, interpret outputs, and anticipate system behaviour over time. Interaction designs should support this learning and help users maintain calibrated trust, for example by making patterns of success and failure visible.
5.4. Application of the Taxonomy to Representative Systems
5.4.1. PromptChainer
5.4.2. AutoGen
6. Discussion
6.1. Design Implications
6.2. Hybrid Systems
6.3. Ethical Compliance and Domain Adaptation
6.4. Limitations
6.5. Future Research
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Procko, T.T.; Elvira, T.; Ochoa, O. Dawn of the dialogue: AI’s leap from lab to living room. Front. Artif. Intell. 2024, 7, 1308156. [Google Scholar] [CrossRef] [PubMed]
- Amershi, S.; Weld, D.; Vorvoreanu, M.; Fourney, A.; Nushi, B.; Collisson, P.; Suh, J.; Iqbal, S.; Bennett, P.N.; Inkpen, K.; et al. Guidelines for human-AI interaction. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems; Association for Computing Machinery: New York, NY, USA, 2019. [Google Scholar]
- Khan, N.; Khan, Z.; Koubaa, A.; Khan, M.K.; Salleh, R.B. Global insights and the impact of generative AI-ChatGPT on multidisciplinary: A systematic review and bibliometric analysis. Connect. Sci. 2024, 36. [Google Scholar] [CrossRef]
- Al-Hasan, T.M.; Sayed, A.N.; Bensaali, F.; Himeur, Y.; Varlamis, I.; Dimitrakopoulos, G. From Traditional Recommender Systems to GPT-Based Chatbots: A Survey of Recent Developments and Future Directions. Big Data Cogn. Comput. 2024, 8, 36. [Google Scholar] [CrossRef]
- Theofanos, M.; Choong, Y.; Jensen, T. AI Use Taxonomy: A Human-Centered Approach; National Institute of Standards and Technology: Gaithersburg, MD, USA, 2024.
- Yenduri, G.; Ramalingam, M.; Selvi, G.C.; Supriya, Y.; Srivastava, G.; Maddikunta, P.K.R.; Raj, G.D.; Jhaveri, R.H.; Prabadevi, B.; Wang, W.; et al. GPT (Generative Pre-Trained Transformer)—A Comprehensive Review on Enabling Technologies, Potential Applications, Emerging Challenges, and Future Directions. IEEE Access 2024, 12, 54608–54649. [Google Scholar] [CrossRef]
- Douglas, M.R. Large Language Models. arXiv 2023, arXiv:2307.05782. [Google Scholar]
- Fui-Hoon Nah, F.; Zheng, R.; Cai, J.; Siau, K.; Chen, L. Generative AI and ChatGPT: Applications, challenges, and AI-human collaboration. J. Inf. Technol. Case Appl. Res. 2023, 25, 277–304. [Google Scholar] [CrossRef]
- DAIR.AI. Prompt Engineering Guide. 29 May 2024. Available online: https://www.promptingguide.ai (accessed on 29 May 2024).
- Beurer-Kellner, L.; Fischer, M.; Vechev, M. Prompting Is Programming: A Query Language for Large Language Models. Proc. ACM Program. Lang. 2023, 7, 1946–1969. [Google Scholar] [CrossRef]
- National Institute of Standards and Technology—NIST. Human-Centered AI. 23 April 2024. Available online: https://www.nist.gov/programs-projects/human-centered-ai (accessed on 14 July 2024).
- Interaction Design Foundation—IxDF. What is Human-Centered AI (HCAI)? Available online: https://www.interaction-design.org/literature/topics/human-centered-ai (accessed on 14 July 2024).
- Salikutluk, V.; Koert, D.; Jaekel, F. Interacting with Large Language Models: A Case Study on AI-Aided Brainstorming for Guesstimation Problems. In HHAI 2023: Augmenting Human Intellect; IOS Press: Amsterdam, Netherlands, 2023. [Google Scholar] [CrossRef]
- Capel, T.; Brereton, M. What is Human-Centered about Human-Centered AI? A Map of the Research Landscape. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems; Association for Computing Machinery: New York, NY, USA, 2023. [Google Scholar]
- Stephanidis, C.; Salvendy, G.; Antona, M.; Duffy, V.G.; Gao, Q.; Karwowski, W.; Konomi, S.; Nah, F.; Ntoa, S.; Rau, P.-L.P.; et al. Seven HCI Grand Challenges Revisited: Five-Year Progress. Int. J. Human–Computer Interact. 2025, 41, 11947–11995. [Google Scholar] [CrossRef]
- Moher, D.; Liberati, A.; Tetzlaff, J.; Altman, D.G. Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. Int. J. Surg. 2010, 8, 336–341. [Google Scholar] [CrossRef]
- Wei, J.; Bosma, M.; Zhao, V.; Guu, K.; Yu, A.W.; Lester, B.; Du, N.; Dai, A.M.; Le, Q.V. Finetuned Language Models Are Zero-Shot Learners. arXiv 2021, arXiv:2109.01652. [Google Scholar]
- Brown, T.; Mann, B.; Ryder, N.; Subbiah, M.; Kaplan, J.D.; Dhariwal, P.; Neelakantan, A.; Shyam, P.; Sastry, G.; Askell, A.; et al. Language Models are Few-Shot Learners. arXiv 2020, arXiv:2005.14165. [Google Scholar] [CrossRef]
- Wei, J.; Wang, X.; Schuurmans, D.; Bosma, M.; Chi, E.H.-H.; Xia, F.; Le, Q.; Zhou, D. Chain of Thought Prompting Elicits Reasoning in Large Language Models. arXiv 2022, arXiv:2201.11903. [Google Scholar]
- Wang, X.; Wei, J.; Schuurmans, D.; Le, Q.; Chi, E.H.-H.; Zhou, D. Self-Consistency Improves Chain of Thought Reasoning in Language Models. arXiv 2022, arXiv:2203.11171. [Google Scholar]
- Lewis, P.; Perez, E.; Piktus, A.; Petroni, F.; Karpukhin, V.; Goyal, N.; Kuttler, H.; Lewis, M.; Yih, W.-T.; Rocktäschel, T.; et al. Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. arXiv 2020, arXiv:2005.11401. [Google Scholar]
- Zhang, Z.; Zhang, A.; Li, M.; Zhao, H.; Karypis, G.; Smola, A.J. Multimodal Chain-of-Thought Reasoning in Language Models. arXiv 2023, arXiv:2302.00923. [Google Scholar]
- Vera-Amaro, G.; Rojano-Cáceres, J.R. Accessible Web Content Generation Using LLMs: An Empirical Study on Prompting Strategies and Template-Guided Remediation. IEEE Lat. Am. Trans. 2025, 23, 1230–1239. [Google Scholar] [CrossRef]
- Anthropic. Chain Prompts. Available online: https://docs.anthropic.com/en/docs/chain-prompts (accessed on 14 July 2024).
- Yao, S.; Yu, D.; Zhao, J.; Shafran, I.; Griffiths, T.L.; Cao, Y.; Narasimhan, K. Tree of Thoughts: Deliberate Problem Solving with Large Language Models. arXiv 2023, arXiv:2305.10601. [Google Scholar] [CrossRef]
- Paranjape, B.; Lundberg, S.M.; Singh, S.; Hajishirzi, H.; Zettlemoyer, L.; Ribeiro, M.T. ART: Automatic multi-step reasoning and tool-use for large language models. arXiv 2023, arXiv:2303.09014. [Google Scholar]
- Zhou, Y.; Muresanu, A.I.; Han, Z.; Paster, K.; Pitis, S.; Chan, H.; Ba, J. Large Language Models Are Human-Level Prompt Engineers. arXiv 2022, arXiv:2211.01910. [Google Scholar]
- Diao, S.; Wang, P.; Lin, Y.; Zhang, T. Active Prompting with Chain-of-Thought for Large Language Models. arXiv 2023, arXiv:2302.12246. [Google Scholar]
- Gao, L.; Madaan, A.; Zhou, S.; Alon, U.; Liu, P.; Yang, Y.; Callan, J.; Neubig, G. PAL: Program-aided Language Models. arXiv 2022, arXiv:2211.10435. [Google Scholar]
- Yao, S.; Zhao, J.; Yu, D.; Du, N.; Shafran, I.; Narasimhan, K.; Cao, Y. React: Synergizing reasoning and acting in language models. arXiv 2022, arXiv:2210.03629. [Google Scholar]
- Shinn, N.; Cassano, F.; Labash, B.; Gopinath, A.; Narasimhan, K.; Yao, S. Reflexion: Language agents with verbal reinforcement learning. arXiv 2023, arXiv:2303.11366. [Google Scholar] [CrossRef]
- Song, Y.; Lu, J.; Wong, R.C.-W. CoVis: Neural and LLM-Driven Multi-Turn Interactions for Conversational Text-to-Visualization Generation. VLDB J. 2025, 35, 3. [Google Scholar] [CrossRef]
- Wu, Q.; Bansal, G.; Zhang, J.; Wu, Y.; Zhang, S.; Zhu, E.; Li, B.; Jiang, L.; Zhang, X.; Wang, C. AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework. arXiv 2023, arXiv:2308.08155. [Google Scholar]
- Munley, C.; Jarmusch, A.; Chandrasekaran, S. LLM4VV: Developing LLM-driven testsuite for compiler validation. Futur. Gener. Comput. Syst.-Int. J. Escience 2024, 160, 1–13. [Google Scholar] [CrossRef]
- Zan, D.; Chen, B.; Zhang, F.; Lu, D.; Wu, B.; Guan, B.; Wang, Y.; Lou, J.-G. Large Language Models Meet NL2Code: A Survey. arXiv 2022, arXiv:2212.09420. [Google Scholar]
- Marques, N.; Silva, R.R.; Bernardino, J. Using ChatGPT in Software Requirements Engineering: A Comprehensive Review. Futur. Internet 2024, 16, 180. [Google Scholar] [CrossRef]
- Fakhoury, S.; Naik, A.; Sakkas, G.; Chakraborty, S.; Lahiri, S.K. LLM-Based Test-Driven Interactive Code Generation: User Study and Empirical Evaluation. IEEE Trans. Softw. Eng. 2024, 50, 2254–2268. [Google Scholar] [CrossRef]
- Kim, Y.; Kim, D.; Choi, J.; Park, J.; Oh, N.; Park, D. A survey on integration of large language models with intelligent robots. Intell. Serv. Robot. 2024, 17, 1091–1107. [Google Scholar] [CrossRef]
- Vemprala, S.H.; Bonatti, R.; Bucker, A.; Kapoor, A. ChatGPT for Robotics: Design Principles and Model Abilities. IEEE Access 2024, 12, 55682–55696. [Google Scholar] [CrossRef]
- Ye, Y.; You, H.; Du, J. Improved Trust in Human-Robot Collaboration With ChatGPT. IEEE Access 2023, 11, 55748–55754. [Google Scholar] [CrossRef]
- Zhao, X.; Li, M.; Weber, C.; Hafez, M.B.; Wermter, S. Chat with the Environment: Interactive Multimodal Perception Using Large Language Models. In Proceedings of the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Detroit, MI, USA, 1–5 October 2023. [Google Scholar]
- Lai, Y.; Yuan, S.; Nassar, Y.; Fan, M.; Gopal, A.; Yorita, A.; Kubota, N.; Rätsch, M. Natural Multimodal Fusion-Based Human–Robot Interaction: Application With Voice and Deictic Posture via Large Language Model. IEEE Robot. Autom. Mag. 2025, 2–11. [Google Scholar] [CrossRef]
- Frering, L.; Steinbauer-Wagner, G.; Holzinger, A. Integrating Belief-Desire-Intention agents with large language models for reliable human–robot interaction and explainable Artificial Intelligence. Eng. Appl. Artif. Intell. 2024, 141, 109771. [Google Scholar] [CrossRef]
- Pelaez-Sanchez, I.C.; Velarde-Camaqui, D.; Glasserman-Morales, L.D. The impact of large language models on higher education: Exploring the connection between AI and Education 4.0. Front. Educ. 2024, 9, 1392091. [Google Scholar] [CrossRef]
- Oppenheimer, D.M.; Cash, T.N.; Pensky, A.E.C. You’ve Got AI Friend in Me: LLMs as Collaborative Learning Partners. Int. J. Artif. Intell. Educ. 2025, 35, 3896–3921. [Google Scholar] [CrossRef]
- Mai, D.T.T.; Da, C.V.; Hanh, N.V. The use of ChatGPT in teaching and learning: A systematic review through SWOT analysis approach. Front. Educ. 2024, 9, 1328769. [Google Scholar] [CrossRef]
- Hwang, G.-J.; Chen, N.-S. Editorial Position Paper: Exploring the Potential of Generative Artificial Intelligence in Education: Applications, Challenges, and Future Research Directions. Educ. Technol. Soc. 2023, 26, 1–18. [Google Scholar]
- Bozkurt, A. Unleashing the Potential of Generative AI, Conversational Agents and Chatbots in Educational Praxis: A Systematic Review and Bibliometric Analysis of GenAI in Education. Open Prax. 2023, 15, 261–270. [Google Scholar] [CrossRef]
- Kurian, N. No, Alexa, no!’: Designing child-safe AI and protecting children from the risks of the ‘empathy gap’ in large language models. Learn. Media Technol. 2024, 50, 621–634. [Google Scholar] [CrossRef]
- Zou, R.; Ye, Z.; Ye, C. iTutor: A Generative Tutorial System for Teaching the Elders to Use Smartphone Applications. In Adjunct Proceedings of the 36th Annual ACM Symposium on User Interface Software & Technology, UIST 2023 Adjunct; Association for Computing Machinery: New York, NY, USA, 2023. [Google Scholar]
- Görer, B.; Aydemir, F.B. Generating Requirements Elicitation Interview Scripts with Large Language Models. In Proceedings of the 2023 IEEE 31st International Requirements Engineering Conference Workshops (REW), Hannover, Germany, 4–5 September 2023. [Google Scholar]
- Wu, T.; Jiang, E.; Donsbach, A.; Gray, J.; Molina, A.; Terry, M.; Cai, C.J. PromptChainer: Chaining Large Language Model Prompts through Visual Programming. In Extended Abstracts of the 2022 CHI Conference on Human Factors in Computing Systems, CHI 2022; Association for Computing Machinery: New York, NY, USA, 2022. [Google Scholar]
- Shu, Y.; Zhang, H.; Gu, H.; Zhang, P.; Lu, T.; Li, D.; Gu, N. RAH! RecSys–Assistant–Human: A Human-Centered Recommendation Framework With LLM Agents. IEEE Trans. Comput. Soc. Syst. 2024, 11, 6759–6770. [Google Scholar] [CrossRef]
- Lee, M.; Liang, P.; Yang, Q. CoAuthor: Designing a Human-AI Collaborative Writing Dataset for Exploring Language Model Capabilities. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (CHI’ 22); Association for Computing Machinery: New York, NY, USA, 2022. [Google Scholar]
- Wang, X.; Huey, S.L.; Sheng, R.; Mehta, S.; Wang, F. SciDaSynth: Interactive Structured Data Extraction From Scientific Literature With Large Language Model. Campbell Syst. Rev. 2025, 21, e70073. [Google Scholar] [CrossRef]
- Wang, B.; Li, G.; Li, Y. Enabling Conversational Interaction with Mobile UI using Large Language Models. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems; Association for Computing Machinery: New York, NY, USA, 2023. [Google Scholar]
- Tao, W.; Zhou, Y.; Zhang, W.; Cheng, Y. MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution. arXiv 2024, arXiv:2403.17927. [Google Scholar]
- Trimigno, G.; Lombardo, G.; Tomaiuolo, M.; Cagnoni, S.; Poggi, A. LLMs in Staging: An Orchestrated LLM Workflow for Structured Augmentation with Fact Scoring. Futur. Internet 2025, 17, 535. [Google Scholar] [CrossRef]
- Chen, J.; Liu, Z.; Huang, X.; Wu, C.; Liu, Q.; Jiang, G.; Pu, Y.; Lei, Y.; Chen, X.; Wang, X.; et al. When large language models meet personalization: Perspectives of challenges and opportunities. World Wide Web 2024, 27, 42. [Google Scholar] [CrossRef]
- Liu, H.; Zhu, Y.; Kato, K.; Tsukahara, A.; Kondo, I.; Aoyama, T.; Hasegawa, Y. Enhancing the LLM-Based Robot Manipulation Through Human-Robot Collaboration. IEEE Robot. Autom. Lett. 2024, 9, 6904–6911. [Google Scholar] [CrossRef]
- Liu, Y.; Sun, Q.; Kapadia, D.R. Integrating Large Language Models into Robotic Autonomy: A Review of Motion, Voice, and Training Pipelines. AI 2025, 6, 158. [Google Scholar] [CrossRef]
- Zamfrescu-Pereira, J.D.; Wong, R.; Hartmann, B.; Yang, Q. Why Johnny Can’t Prompt: How Non-AI Experts Try (and Fail) to Design LLM Prompts. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI 2023); Association for Computing Machinery: New York, NY, USA, 2023. [Google Scholar]
- Ngu, N.; Lee, N.; Shakarian, P. Diversity Measures: Domain-Independent Proxies for Failure in Language Model Queries. In Proceedings of the 2024 IEEE 18th International Conference on Semantic Computing (ICSC), Laguna Hills, CA, USA, 5–7 February 2024. [Google Scholar]
- Bridgelall, R. Unraveling the mysteries of AI chatbots. Artif. Intell. Rev. 2024, 57, 89. [Google Scholar] [CrossRef]
- Peykani, P.; Ramezanlou, F.; Tanasescu, C.; Ghanidel, S. Large Language Models: A Structured Taxonomy and Review of Challenges, Limitations, Solutions, and Future Directions. Appl. Sci. 2025, 15, 8103. [Google Scholar] [CrossRef]






| Query Field | Logic Operator | Field Value | |
|---|---|---|---|
| Title | (“LLM*” OR “large language model*” OR “GPT*” OR “ChatGPT”) AND (“interact*” OR “prompt*”) | ||
| Abstract | OR | (“LLM*” OR “large language model*” OR “GPT*” OR “ChatGPT”) AND (“interact*” OR “prompt*”) | |
| Topic | NOT | training OR finetuning OR “fine-tuning” OR benchmark* OR dataset OR corpus OR optimization | |
| Index Date | AND | Initial search 1 January 2021 TO 23 July 2024 | Update search 24 July 2024 TO 31 December 2025 |
| Retained Sampling Stratum | Initial Search (n) | Update Search (n) | Total Included (n) | Share of Final Corpus (%) |
|---|---|---|---|---|
| Most cited | 13 | 23 | 36 | 41.4 |
| Most relevant | 9 | 11 | 20 | 23.0 |
| Most recent | 22 | 9 | 31 | 35.6 |
| Total | 44 | 43 | 87 | 100.0 |
| Software Development | Robotics | Education | |
|---|---|---|---|
| Role of prompting | Artefact generation and verification | Grounding language in physical action | Developing prompting as a learner competence |
| Workflow patterns | Test-driven iteration, failures recoverable | Human approval before execution | Iterative feedback, learning-focused |
| Agentic architectures | Consequences bounded to digital artefacts | Errors can propagate into physical space | Limited evidence in reviewed studies |
| Dominant autonomy | Advisory to Delegated execution | Guided to Delegated execution | Advisory to Guided execution |
| Key risk | Over-reliance on incorrect outputs | Misinterpretation can cause physical harm | Uncalibrated trust, vulnerable users |
| Category | Classification Criteria | Boundary Rule | Example |
|---|---|---|---|
| Conversational exploration and reflection | The interaction is primarily used for exploration, sensemaking, explanation, or idea generation within the dialogue, without direct action outside the dialogue. | If the interaction is organised around producing a specified output with completion criteria, classify as task-oriented assistance. | [13] |
| Task-oriented assistance | The user has a defined task or deliverable, success is judged by completion, adequacy, or correctness of the output, and interaction remains primarily prompt-response based. | If the interaction is primarily structured by application features, interface controls, or tool/API calls, classify as tool-mediated interaction. | [50,51] |
| Tool-mediated interaction | The LLM is embedded in an application, and interaction is primarily structured by interface elements, built-in functions, or tool/API calls. | If the system itself plans and iteratively executes a sequence of actions toward the goal, classify as agentic workflows. | [32,52] |
| Agentic workflows | The system pursues a multi-step goal by planning and iteratively executing actions across tools, modules, or subprocesses. | If the system only proposes steps while the user remains responsible for carrying them out, classify as tool-mediated interaction, or as task-oriented assistance if no tools are involved. | [53] |
| Category | Classification Criteria | Boundary Rule | Example |
|---|---|---|---|
| Advisory systems | The system provides recommendations, suggestions, or explanations, but does not perform actions on external tools, artefacts, or environments. The user remains responsible for execution. | If the system guides the task through predefined stages or structured interaction steps, classify as guided execution. | [51] |
| Guided execution | The system structures the interaction through staged guidance or partial automation, but explicit user confirmation remains required for consequential actions. | If consequential actions do not require step-by-step user confirmation, classify as delegated execution. | [37,54] |
| Delegated execution | The system performs bounded actions through tools, APIs, code, or connected applications on the user’s behalf, while user oversight and intervention remain available. | If the system autonomously plans subgoals and adapts actions, classify as high-autonomy execution. | [55,56] |
| High-autonomy execution | The system pursues extended goals through self-directed planning and multi-step execution, potentially coordinating multiple agents or tools with minimal direct user intervention. | If autonomy remains limited to tasks without extended self-directed planning, classify as delegated execution. | [53] |
| HCAI Principle | Exemplary Operational Indicators |
|---|---|
| Control | Percentage of actions requiring user confirmation; availability of pause, stop, undo, or override functions; intervention success rate |
| Transparency | Proportion of visible intermediate steps; availability of plans, tool calls, sources, or uncertainty cues; user-rated understandability |
| Error handling | Error detection rate; recovery success rate; time needed to recover from failed or incorrect outputs |
| User learning | Improvement in prompt quality over repeated use; reduction in repeated user corrections; change in calibrated trust or self-reported confidence |
| System/Study | Interaction Mechanism | Autonomy Level | Basis for Classification |
|---|---|---|---|
| Salikutluk et al. [13] | Conversational exploration | Advisory | Brainstorming support; no external action |
| Görer & Aydemir [51] | Task-oriented assistance | Advisory | Generates interview-script outputs |
| Fakhoury et al. [37] | Task-oriented assistance | Guided execution | Test-driven code generation with iterative user review |
| iTutor [50] | Task-oriented assistance | Guided execution | Step-by-step tutorial guidance from UI context |
| CoAuthor [54] | Task-oriented assistance | Guided execution | User-controlled writing suggestions and revision |
| PromptChainer [52] | Tool-mediated interaction | Guided execution | Visual prompt-chain authoring and debugging |
| CoVis [32] | Tool-mediated interaction | Delegated execution | Generates data queries and visualizations through a system-mediated workflow |
| SciDaSynth [55] | Tool-mediated interaction | Delegated execution | Generates structured data tables; user validates and refines |
| Wang et al. [56] | Tool-mediated interaction | Delegated execution | Maps language instructions to bounded mobile UI actions |
| RAH [53] | Agentic workflows | High-autonomy execution | LLM-agent with learn–action–critic–reflection cycle |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Nejašmić, D.; Mladenović, S.; Granić, A. Interaction with LLM-Based Systems: A Structured Review and Taxonomy of Mechanisms and Autonomy. Appl. Sci. 2026, 16, 5001. https://doi.org/10.3390/app16105001
Nejašmić D, Mladenović S, Granić A. Interaction with LLM-Based Systems: A Structured Review and Taxonomy of Mechanisms and Autonomy. Applied Sciences. 2026; 16(10):5001. https://doi.org/10.3390/app16105001
Chicago/Turabian StyleNejašmić, Dino, Saša Mladenović, and Andrina Granić. 2026. "Interaction with LLM-Based Systems: A Structured Review and Taxonomy of Mechanisms and Autonomy" Applied Sciences 16, no. 10: 5001. https://doi.org/10.3390/app16105001
APA StyleNejašmić, D., Mladenović, S., & Granić, A. (2026). Interaction with LLM-Based Systems: A Structured Review and Taxonomy of Mechanisms and Autonomy. Applied Sciences, 16(10), 5001. https://doi.org/10.3390/app16105001

