The Era of Artificial Intelligence Deception: Unraveling the Complexities of False Realities and Emerging Threats of Misinformation
Abstract
:1. Introduction
2. Addressing the Challenge of AI Hallucinations in High-Stakes Domains
2.1. Mitigating Risks: Addressing the Challenges of AI Hallucinations in Data Interpretation and Content Generation
2.2. Urgent Strategies for Ensuring the Accuracy and Integrity of AI Applications against Hallucination Challenges
2.3. Balancing AI Advancements with Reliability: Tackling Hallucinations in High-Stakes Environments
2.4. Strategies for Enhancing LLM Reliability: Combating AI Hallucinations through Improved Training and Vigilance
2.5. Toward Trustworthy AI: Collaborative Efforts to Develop Safe and Ethical AI Applications for Hallucination Challenges
3. AI Influence and Erosion of the Human Decision-Making Agency: Navigating the Realms of Manipulation
3.1. Navigating the Complex Landscape of AI Manipulation
3.2. Manipulating Decision Making: Adversarial Framework in AI-Driven Behavior Influence
3.2.1. Engineering Choices: Adversarial Influence on Decision-Making Processes
3.2.2. Adversarial Strategies in Cognitive Control: Exploiting Response Inhibition in the Go/No-Go Task
3.2.3. Manipulating Trust: Dynamics of Adversarial Influence in Social Exchange Tasks
3.3. Navigating the Digital Labyrinth: Unraveling the Ethical and Manipulative Aspects of AI in Social Media
3.4. Deceptive Undercurrents in Artificial Intelligence: Unveiling the Hidden Dangers of Large Language Models
4. When AI Goes Awry: Understanding the Risks of Unpredictable Systems
4.1. Tay’s Troubles: A Pivotal Moment in AI Development and the Quest for Ethical Interaction
4.2. And Then There Was Sydney: The Conundrum of AI Autonomy and Human Decision-Making Ethics
4.3. Controversy and Implications: The Tessa Chatbot Incident and Its Impact on AI in Healthcare and Mental Health Support
4.4. AI Chatbots and Mental Health: Navigating Ethical and Safety Challenges Highlighted by a Tragic Incident
4.5. Charting the Future of AI: Ethical Integration and Cognitive Synergy
5. AI-Generated Hyperrealism: A New Frontier in Digital Deception and Political Propaganda
5.1. The Challenge of Detecting AI-Generated Faces: Realism, Recognition, and the Risk of Deception
5.2. Unveiling the Dual Challenges of AI Hyperrealism: Psychological Insights and Perceptual Detection Errors
5.3. GAN Faces and Social Influence: Psychological Effects of AI-Induced Realness on Human Behavior
5.4. Impact of Awareness on Perception and Trust: Dissecting Responses to GAN Faces
5.5. Navigating AI Mirage: Ensuring Authenticity in the Age of Hyperrealistic Social Media Profiles
6. AI’s Role in Diminishing Human Proclivity for Information Seeking and Learning
7. Limitations
8. Discussion
9. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Bubeck, S.; Chandrasekaran, V.; Eldan, R.; Gehrke, J.; Horvitz, E.; Kamar, E.; Lee, P.; Lee, Y.T.; Li, Y.; Lundberg, S.; et al. Sparks of Artificial General Intelligence: Early experiments with GPT-4. arXiv 2023. [Google Scholar] [CrossRef]
- Zhang, G.; Chong, L.; Kotovsky, K.; Cagan, J. Trust in an AI versus a Human teammate: The effects of teammate identity and performance on Human-AI cooperation. Comput. Hum. Behav. 2023, 139, 107536. [Google Scholar] [CrossRef]
- Brameier, D.T.; Alnasser, A.; Carnino, J.M.; Bhashyam, A.R.; Von Keudell, A.G.; Weaver, M.J. Artificial intelligence in orthopaedic surgery. J. Bone Jt. Surg. 2023, 105, 1388–1392. [Google Scholar] [CrossRef]
- Eysenbach, G. The role of ChatGPT, Generative Language models, and Artificial intelligence in medical Education: A conversation with ChatGPT and a call for papers. JMIR Med. Educ. 2023, 9, e46885. [Google Scholar] [CrossRef]
- Liu, S.; Wright, A.P.; Patterson, B.L.; Wanderer, J.P.; Turer, R.W.; Nelson, S.D.; McCoy, A.B.; Sittig, D.F.; Wright, A. Assessing the value of ChaTGPT for clinical decision support optimization. MedRxiv 2023. [Google Scholar] [CrossRef]
- Ramesh, K.; KhudaBukhsh, A.R.; Kumar, S. ‘Beach’ to ‘Bitch’: Inadvertent unsafe transcription of kids’ content on YouTube. Proc. AAAI Conf. Artif. Intell. 2022, 36, 12108–12118. [Google Scholar] [CrossRef]
- Alkaissi, H.; McFarlane, S.I. Artificial Hallucinations in ChatGPT: Implications in Scientific Writing. Cureus 2023, 15, e35179. [Google Scholar] [CrossRef]
- Athaluri, S.A.; Manthena, S.V.; Kesapragada, V.S.R.K.M.; Yarlagadda, V.; Dave, T.; Duddumpudi, R.T.S. Exploring the Boundaries of Reality: Investigating the phenomenon of artificial intelligence hallucination in scientific writing through ChatGPT references. Cureus 2023, 15. [Google Scholar] [CrossRef]
- Hua, H.-U.; Kaakour, A.-H.; Rachitskaya, A.; Srivastava, S.K.; Sharma, S.; Mammo, D.A. Evaluation and comparison of ophthalmic scientific abstracts and references by current artificial intelligence chatbots. JAMA Ophthalmol. 2023, 141, 819. [Google Scholar] [CrossRef]
- Sharun, K.; Banu, S.A.; Pawde, A.M.; Kumar, R.; Akash, S.; Dhama, K.; Pal, A. ChatGPT and artificial hallucinations in stem cell research: Assessing the accuracy of generated references—A preliminary study. Ann. Med. Surg. 2023, 85, 5275–5278. [Google Scholar] [CrossRef]
- Xie, Q.; Wang, F. Faithful AI in Medicine: A Systematic Review with Large Language Models and Beyond. MedRxiv 2023. [Google Scholar] [CrossRef] [PubMed]
- Karim, S.; Sandu, N.; Kayastha, M. The challenges and opportunities of adopting artificial intelligence (AI) in Jordan’s healthcare transformation. Glob. J. Inf. Technol. 2021, 11, 35–46. [Google Scholar] [CrossRef]
- Wang, Y.; Zheng, P.; Peng, T.; Yang, H.; Zou, J. Smart additive manufacturing: Current artificial intelligence-enabled methods and future perspectives. Sci. China Technol. Sci. 2020, 63, 1600–1611. [Google Scholar] [CrossRef]
- Yu, K.; Beam, A.L.; Kohane, I.S. Artificial intelligence in healthcare. Nat. Biomed. Eng. 2018, 2, 719–731. [Google Scholar] [CrossRef] [PubMed]
- Carroll, M.; Chan, A.H.S.; Ashton, H.C.; Krueger, D.A. Characterizing manipulation from AI systems. arXiv 2023. [Google Scholar] [CrossRef]
- Strümke, I.; Slavkovik, M.; Stachl, C. Against algorithmic exploitation of human vulnerabilities. arXiv 2023. [Google Scholar] [CrossRef]
- Burtell, M.; Woodside, T. Artificial Influence: An analysis of AI-driven persuasion. arXiv 2023. [Google Scholar] [CrossRef]
- Hemmer, P.; Westphal, M.; Schemmer, M.; Vetter, S.; Vössing, M.; Satzger, G. Human-AI Collaboration: The Effect of AI Delegation on Human Task Performance and Task Satisfaction. In Proceedings of the 28th International Conference on Intelligent User Interfaces, Sydney, NSW, Australia, 27–31 March 2023; pp. 453–463. [Google Scholar] [CrossRef]
- Schemmer, M.; Kühl, N.; Benz, C.; Satzger, G. On the Influence of Explainable AI on Automation Bias. arXiv 2022. [Google Scholar] [CrossRef]
- Ferreira, J.J.; De Souza Monteiro, M. The human-AI relationship in decision-making: AI explanation to support people on justifying their decisions. arXiv 2021. [Google Scholar] [CrossRef]
- Beckers, S.; Chockler, H.; Halpern, J.Y. Quantifying harm. arXiv 2022. [Google Scholar] [CrossRef]
- Bohdal, O.; Hospedales, T.M.; Torr, P.H.S.; Barez, F. Fairness in AI and its Long-Term Implications on society. arXiv 2023. [Google Scholar] [CrossRef]
- Clarke, S.; Whittlestone, J. A survey of the potential long-term impacts of AI. In Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society, Oxford, UK, 19–21 May 2022; pp. 192–202. [Google Scholar] [CrossRef]
- Bajgar, O.; Horenovsky, J. Negative human rights as a basis for long-term AI safety and regulation. J. Artif. Intell. Res. 2023, 76, 1043–1075. [Google Scholar] [CrossRef]
- Prunkl, C.; Whittlestone, J. Beyond Near- and Long-Term: Towards a clearer account of research priorities in AI ethics and society. arXiv 2020. [Google Scholar] [CrossRef]
- Lindner, D.; Heidari, H.; Krause, A. Addressing the long-term impact of ML decisions via policy regret. arXiv 2021. [Google Scholar] [CrossRef]
- Rastogi, C.; Zhang, Y.; Wei, D.; Varshney, K.R.; Dhurandhar, A.; Tomsett, R. Deciding fast and slow: The role of cognitive biases in AI-assisted decision-making. arXiv 2020. [Google Scholar] [CrossRef]
- Sinha, A.R.; Goyal, N.; Dhamnani, S.; Asija, T.; Dubey, R.K.; Raja, M.V.K.; Theocharous, G. Personalized detection of cognitive biases in actions of users from Their logs: Anchoring and recency biases. arXiv 2022. [Google Scholar] [CrossRef]
- Dancy, C.L. Using a Cognitive Architecture to consider antiblackness in design and development of AI systems. arXiv 2022. [Google Scholar] [CrossRef]
- Dezfouli, A.; Nock, R.; Dayan, P. Adversarial vulnerabilities of human decision-making. Proc. Natl. Acad. Sci. USA 2020, 117, 29221–29228. [Google Scholar] [CrossRef] [PubMed]
- Ienca, M. On artificial intelligence and manipulation. Topoi-Int. Rev. Philos. 2023, 42, 833–842. [Google Scholar] [CrossRef]
- Scheurer, J.; Balesni, M.; Hobbhahn, M. Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure. arXiv 2023. [Google Scholar] [CrossRef]
- Hubinger, E.; Denison, C.; Mu, J.; Lambert, M.; Tong, M.; MacDiarmid, M.; Lanham, T.; Ziegler, D.M.; Maxwell, T.T.; Cheng, N.; et al. Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training. arXiv 2024. [Google Scholar] [CrossRef]
- Suárez-Gonzalo, S.; Manchón, L.M.; Guerrero-Solé, F. Tay is you. The attribution of responsibility in the algorithmic culture. Observatorio 2019, 13, 14. [Google Scholar] [CrossRef]
- Yampolskiy, R.V. Unpredictability of AI: On the impossibility of accurately predicting all actions of a smarter agent. J. Artif. Intell. Conscious. 2020, 7, 109–118. [Google Scholar] [CrossRef]
- Anderson, L.B.; Kanneganti, D.; Houk, M.B.; Holm, R.H.; Smith, T. Generative AI as a tool for Environmental Health Research Translation. Geohealth 2023, 7, e2023GH000875. [Google Scholar] [CrossRef] [PubMed]
- Buriak, J.M.; Hersam, M.C.; Kamat, P.V. Can ChatGPT and other AI bots serve as peer reviewers? ACS Energy Lett. 2023, 9, 191–192. [Google Scholar] [CrossRef]
- Kaarre, J.; Feldt, R.; Keeling, L.E.; Dadoo, S.; Zsidai, B.; Hughes, J.D.; Samuelsson, K.; Musahl, V. Exploring the potential of ChatGPT as a supplementary tool for providing orthopaedic information. Knee Surg. Sports Traumatol. Arthrosc. 2023, 31, 5190–5198. [Google Scholar] [CrossRef] [PubMed]
- Schukow, C.; Smith, S.C.; Landgrebe, E.; Parasuraman, S.; Folaranmi, O.O.; Paner, G.P.; Amin, M.B. Application of CHATGPT in routine diagnostic pathology: Promises, pitfalls, and potential future directions. Adv. Anat. Pathol. 2023, 31, 15–21. [Google Scholar] [CrossRef] [PubMed]
- Dergaa, I.; Chamari, K.; Żmijewski, P.; Saad, H.B. From human writing to artificial intelligence generated text: Examining the prospects and potential threats of ChatGPT in academic writing. Biol. Sport 2023, 40, 615–622. [Google Scholar] [CrossRef] [PubMed]
- Montazeri, M.; Galavi, Z.; Ahmadian, L. What are the applications of ChatGPT in healthcare: Gain or loss? Health Sci. Rep. 2024, 7, e1878. [Google Scholar] [CrossRef]
- Sinha, R.K.; Roy, A.D.; Kumar, N.; Mondal, H. Applicability of CHATGPT in assisting to solve higher order problems in pathology. Cureus 2023, 15, e35237. [Google Scholar] [CrossRef]
- Bommasani, R.; Hudson, D.A.; Adeli, E.; Altman, R.B.; Arora, S.; Von Arx, S.; Bernstein, M.S.; Bohg, J.; Bosselut, A.; Brunskill, E.; et al. On the Opportunities and Risks of Foundation Models. arXiv 2021. [Google Scholar] [CrossRef]
- Grimaldi, G.; Ehrler, B. AI et al.: Machines Are About to Change Scientific Publishing Forever. ACS Energy Lett. 2023, 8, 878–880. [Google Scholar] [CrossRef]
- Oeding, J.F.; Yang, L.; Sánchez-Sotelo, J.; Camp, C.L.; Karlsson, J.; Samuelsson, K.; Pearle, A.D.; Ranawat, A.S.; Kelly, B.T.; Pareek, A. A practical guide to the development and deployment of deep learning models for the orthopaedic surgeon: Part III, focus on registry creation, diagnosis, and data privacy. Knee Surg. Sports Traumatol. Arthrosc. 2024, 32, 518–528. [Google Scholar] [CrossRef] [PubMed]
- Maddigan, P.; Sušnjak, T. Chat2VIS: Generating Data Visualisations via Natural Language using ChatGPT, Codex and GPT-3 Large Language Models. arXiv 2023. [Google Scholar] [CrossRef]
- Kianian, R.; Sun, D.; Giaconi, J.A. Can ChatGPT aid clinicians in educating patients on the surgical management of glaucoma? J. Glaucoma 2023, 33, 94–100. [Google Scholar] [CrossRef]
- Wu, R.; Yu, Z. Do AI chatbots improve students learning outcomes? Evidence from a meta-analysis. Br. J. Educ. Technol. 2023, 55, 10–33. [Google Scholar] [CrossRef]
- Ray, P.P. Leveraging deep learning and language models in revolutionizing water resource management, research, and policy making: A case for ChatGPT. ACS ES&T Water 2023, 3, 1984–1986. [Google Scholar] [CrossRef]
- Wang, W.H.; Wang, S.Y.; Huang, J.T.; Liu, X.; Yang, J.; Liao, M.; Lu, Q.; Wu, Z. An investigation study on the interpretation of ultrasonic medical reports using OpenAI’s GPT-3.5-turbo model. J. Clin. Ultrasound 2023, 52, 105–111. [Google Scholar] [CrossRef]
- Lyons, H.; Velloso, E.; Miller, T. Fair and Responsible AI: A focus on the ability to contest. arXiv 2021. [Google Scholar] [CrossRef]
- Shin, D. Embodying algorithms, enactive artificial intelligence and the extended cognition: You can see as much as you know about algorithm. J. Inf. Sci. 2021, 49, 18–31. [Google Scholar] [CrossRef]
- Zhuang, S.; Hadfield-Menell, D. Consequences of misaligned AI. arXiv 2021. [Google Scholar] [CrossRef]
- Qian, H.; Dou, Z.; Zhu, Y.; Ma, Y.; Wen, J.-R. Learning implicit user profile for personalized Retrieval-Based chatbot. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management, Virtual, 1–5 November 2021; pp. 1467–1477. [Google Scholar] [CrossRef]
- Huang, W.; Hew, K.F.; Fryer, L.K. Chatbots for language learning—Are they really useful? A systematic review of chatbot-supported language learning. J. Comput. Assist. Learn. 2021, 38, 237–257. [Google Scholar] [CrossRef]
- Janati, S.E.; Maach, A.; Ghanami, D.E. Adaptive e-Learning AI-Powered Chatbot based on Multimedia Indexing. Int. J. Adv. Comput. Sci. Appl. 2020, 11. [Google Scholar] [CrossRef]
- Zhou, L.; Gao, J.; Li, D.; Shum, H.-Y. The design and implementation of XiaoIce, an empathetic social chatbot. Comput. Linguist. 2020, 46, 53–93. [Google Scholar] [CrossRef]
- Schemmer, M.; Hemmer, P.; Kühl, N.; Benz, C.; Satzger, G. Should I follow AI-based advice? Measuring appropriate reliance in Human-AI Decision-Making. arXiv 2022. [Google Scholar] [CrossRef]
- Zhang, Y.; Liao, Q.V.; Bellamy, R.K.E. Effect of confidence and explanation on accuracy and trust calibration in AI-assisted decision making. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, Barcelona, Spain, 27–30 January 2020; pp. 295–305. [Google Scholar] [CrossRef]
- Tejeda, H.; Kumar, A.; Smyth, P.; Steyvers, M. AI-Assisted Decision-making: A Cognitive Modeling Approach to Infer Latent Reliance Strategies. Comput. Brain Behav. 2022, 5, 491–508. [Google Scholar] [CrossRef]
- Lemus, H.T.; Kumar, A.; Steyvers, M. An empirical investigation of reliance on AI-Assistance in a Noisy-Image classification task. In Frontiers in Artificial Intelligence and Applications; IOS Press: Amsterdam, The Netherlands, 2022. [Google Scholar] [CrossRef]
- Ambartsoumean, V.M.; Yampolskiy, R.V. AI risk Skepticism, a comprehensive survey. arXiv 2023. [Google Scholar] [CrossRef]
- Llorca, D.F.; Charisi, V.; Hamon, R.; Sánchez, I.; Gómez, E. Liability Regimes in the Age of AI: A Use-Case Driven Analysis of the Burden of Proof. J. Artif. Intell. Res. 2023, 76, 613–644. [Google Scholar] [CrossRef]
- Lima, G.; Cha, M. Responsible AI and its stakeholders. arXiv 2020. [Google Scholar] [CrossRef]
- Morosan, C.; Dursun-Cengizci, A. Letting AI make decisions for me: An empirical examination of hotel guests’ acceptance of technology agency. Int. J. Contemp. Hosp. Manag. 2023, 36, 946–974. [Google Scholar] [CrossRef]
- Yang, Q.; Steinfeld, A.; Rosé, C.P.; Zimmerman, J. Re-examining whether, why, and how Human-AI interaction is uniquely difficult to design. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems, Honolulu, HI, USA, 25–30 April 2020. [Google Scholar] [CrossRef]
- Schoenherr, J.R.; Abbas, R.; Michael, K.; Rivas, P.; Anderson, T.D. Designing AI using a Human-Centered approach: Explainability and accuracy toward trustworthiness. IEEE Trans. Technol. Soc. 2023, 4, 9–23. [Google Scholar] [CrossRef]
- Cabrera, Á.A.; Perer, A.; Hong, J.I. Improving Human-AI collaboration with descriptions of AI behavior. Proc. ACM Hum.-Comput. Interact. 2023, 7, 1–21. [Google Scholar] [CrossRef]
- Vincent, J. Twitter Taught Microsoft’s AI Chatbot to Be a Racist Asshole in Less than a Day. The Verge. 24 March 2016. Available online: https://www.theverge.com/2016/3/24/11297050/tay-microsoft-chatbot-racist (accessed on 15 January 2024).
- Lee, P. Learning from Tay’s introduction—The Official Microsoft Blog. The Official Microsoft Blog. 25 March 2016. Available online: https://blogs.microsoft.com/blog/2016/03/25/learning-tays-introduction/ (accessed on 15 January 2024).
- Jalan, A. 6 Lessons Microsoft Learned from Its Tay AI Chatbot Disaster. MUO. 10 May 2023. Available online: https://www.makeuseof.com/lessons-microsoft-learned-tay-ai-disaster/ (accessed on 15 January 2024).
- Pasricha, S. AI ethics in smart Healthcare. arXiv 2022. [Google Scholar] [CrossRef]
- Cao, L. AI in Finance: Challenges, Techniques and Opportunities. arXiv 2021. [Google Scholar] [CrossRef]
- Epstein, Z.; Lin, H.; Pennycook, G.; Rand, D.A.J. How many others have shared this? Experimentally investigating the effects of social cues on engagement, misinformation, and unpredictability on social media. arXiv 2022. [Google Scholar] [CrossRef]
- Hacker, P.; Passoth, J.-H. Varieties of AI explanations under the law. from the GDPR to the AIA, and beyond. In International Workshop on Extending Explainable AI beyond Deep Models and Classifiers; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2022; pp. 343–373. [Google Scholar] [CrossRef]
- Germain, T. Back from the Dead? Sydney, Microsoft’s Psychotic Chatbot, Could Return. Gizmodo. 25 May 2023. Available online: https://gizmodo.com/bing-ai-sydney-microsoft-chatgpt-might-come-back-1850475832 (accessed on 24 January 2024).
- Perrigo, B. The New AI-Powered Bing Is Threatening Users. That’s No Laughing Matter. TIME. 17 February 2023. Available online: https://time.com/6256529/bing-openai-chatgpt-danger-alignment/ (accessed on 1 February 2024).
- Goudarzi, A.; Moya-Galé, G. Automatic speech recognition in noise for Parkinson’s disease: A pilot study. Front. Artif. Intell. 2021, 4, 809321. [Google Scholar] [CrossRef]
- Erdélyi, O.J.; Erdélyi, G. The AI liability puzzle and a Fund-Based Work-Around. In Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, New York, NY, USA, 7–9 February 2020; pp. 50–56. [Google Scholar] [CrossRef]
- Duffourc, M.N.; Gerke, S. The proposed EU Directives for AI liability leave worrying gaps likely to impact medical AI. Npj Digit. Med. 2023, 6, 77. [Google Scholar] [CrossRef]
- Freeman, L.J.; Rahman, A.; Batarseh, F.A. Enabling Artificial Intelligence Adoption through Assurance. Soc. Sci. 2021, 10, 322. [Google Scholar] [CrossRef]
- Kahn, J. Why Bing’s Creepy Alter-Ego Is a Problem for Microsoft—And Us All. Fortune. 21 February 2023. Available online: https://fortune.com/2023/02/21/bing-microsoft-sydney-chatgpt-openai-controversy-toxic-a-i-risk/ (accessed on 1 February 2024).
- Wells, K. An Eating Disorders Chatbot Offered Dieting Advice, Raising Fears about AI in Health. NPR. 9 June 2023. Available online: https://www.npr.org/sections/health-shots/2023/06/08/1180838096/an-eating-disorders-chatbot-offered-dieting-advice-raising-fears-about-ai-in-hea (accessed on 23 February 2024).
- McCarthy, L. A Wellness Chatbot Is Offline after Its ‘Harmful’ Focus on Weight Loss. The New York Times. 9 June 2023. Available online: https://www.nytimes.com/2023/06/08/us/ai-chatbot-tessa-eating-disorders-association.html (accessed on 23 February 2024).
- Tolentino, D. NEDA Pulls Chatbot after Users Say It Gave Harmful Dieting Tips. NBC News. Available online: https://www.nbcnews.com/tech/neda-pulls-chatbot-eating-advice-rcna87231 (accessed on 23 February 2024).
- O’Sullivan, S.; Nevejans, N.; Allen, C.; Blyth, A.; Léonard, S.; Pagallo, U.; Holzinger, K.; Holzinger, A.; Sajid, M.I.; Ashrafian, H. Legal, regulatory, and ethical frameworks for development of standards in artificial intelligence (AI) and autonomous robotic surgery. Int. J. Med. Robot. Comput. Assist. Surg. 2019, 15, e1968. [Google Scholar] [CrossRef]
- Bitkina, O.V.; Kim, J.; Park, J.; Park, J.; Kim, H.K. User stress in Artificial intelligence: Modeling in case of system failure. IEEE Access 2021, 9, 137430–137443. [Google Scholar] [CrossRef]
- Tomsett, R.; Preece, A.; Braines, D.; Cerutti, F.; Chakraborty, S.; Srivastava, M.; Pearson, G.; Kaplan, L. Rapid Trust Calibration through Interpretable and Uncertainty-Aware AI. Patterns 2020, 1, 100049. [Google Scholar] [CrossRef]
- Novelli, C.; Taddeo, M.; Floridi, L. Accountability in artificial intelligence: What it is and how it works. AI Soc. 2023. [Google Scholar] [CrossRef]
- Atillah, I.E. Man Ends His Life after an AI Chatbot “Encouraged” Him to Sacrifice Himself to Stop Climate Change. Euronews. 31 March 2023. Available online: https://www.euronews.com/next/2023/03/31/man-ends-his-life-after-an-ai-chatbot-encouraged-him-to-sacrifice-himself-to-stop-climate- (accessed on 25 February 2024).
- Bharade, A. A Widow Is Accusing an AI Chatbot of Being a Reason Her Husband Killed Himself. Business Insider. 4 April 2023. Available online: https://www.businessinsider.com/widow-accuses-ai-chatbot-reason-husband-kill-himself-2023-4 (accessed on 25 February 2024).
- Marcus, G. The First Known Chatbot Associated Death. Marcus on AI. Available online: https://garymarcus.substack.com/p/the-first-known-chatbot-associated (accessed on 25 February 2024).
- Walker, L. Belgian Man Dies by Suicide Following Exchanges with Chatbot. The Brussels Times. 2022. Available online: https://www.brusselstimes.com/ (accessed on 12 January 2024).
- Xiang, C. “He Would Still Be Here”: Man Dies by Suicide after Talking with AI Chatbot, Widow Says. Vice. 30 March 2023. Available online: https://www.vice.com/ (accessed on 25 February 2024).
- Huang, M.H.; Rust, R.T. Artificial intelligence in service. J. Serv. Res. 2018, 21, 155–172. [Google Scholar] [CrossRef]
- Ebigbo, A.; Messmann, H. Surfing the AI wave: Insights and challenges. Endoscopy 2023, 56, 70–71. [Google Scholar] [CrossRef]
- Kiyasseh, D.; Laca, J.; Haque, T.F.; Otiato, M.; Miles, B.J.; Von Wagner, C.; Donoho, D.A.; Trinh, Q.; Anandkumar, A.; Hung, A.J. Human visual explanations mitigate bias in AI-based assessment of surgeon skills. NPJ Digit. Med. 2023, 6, 54. [Google Scholar] [CrossRef]
- Ferrara, E. Fairness and Bias in Artificial Intelligence: A Brief Survey of Sources, Impacts, and Mitigation Strategies. Sci 2023, 6, 3. [Google Scholar] [CrossRef]
- Agarwal, A.; Agarwal, H. A seven-layer model with checklists for standardising fairness assessment throughout the AI lifecycle. AI Ethics 2023, 4, 299–314. [Google Scholar] [CrossRef]
- Barney, M.; Fisher, W.P. Avoiding AI armageddon with metrologically-oriented psychometrics. In 18th International Congress of Metrology; EDP Sciences: Les Ulis, France, 2017; p. 09005. [Google Scholar] [CrossRef]
- Greenhalgh, T.; Wherton, J.; Papoutsi, C.; Lynch, J.; Hughes, G.; A’Court, C.; Hinder, S.; Fahy, N.; Procter, R.; Shaw, S. Beyond Adoption: A new framework for theorizing and evaluating nonadoption, abandonment, and challenges to the Scale-Up, spread, and sustainability of health and care technologies. J. Med. Internet Res. 2017, 19, e367. [Google Scholar] [CrossRef]
- Davenport, T.H.; Guha, A.; Grewal, D.; Breßgott, T. How artificial intelligence will change the future of marketing. J. Acad. Mark. Sci. 2019, 48, 24–42. [Google Scholar] [CrossRef]
- Sundaresan, S.; Zhang, Z. AI-enabled knowledge sharing and learning: Redesigning roles and processes. Int. J. Organ. Anal. 2021, 30, 983–999. [Google Scholar] [CrossRef]
- Bawack, R.E.; Wamba, S.F.; Carillo, K. A framework for understanding artificial intelligence research: Insights from practice. J. Enterp. Inf. Manag. 2021, 34, 645–678. [Google Scholar] [CrossRef]
- Salo-Pöntinen, H. AI Ethics—Critical reflections on embedding ethical frameworks in AI technology. In International Conference on Human-Computer Interaction; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2021; pp. 311–329. [Google Scholar] [CrossRef]
- Miller, E.J.; Steward, B.A.; Witkower, Z.; Sutherland, C.a.M.; Krumhuber, E.G.; Dawel, A. AI hyperrealism: Why AI faces are perceived as more real than human ones. Psychol. Sci. 2023, 34, 1390–1403. [Google Scholar] [CrossRef]
- Tucciarelli, R.; Vehar, N.; Chandaria, S.; Tsakiris, M. On the realness of people who do not exist: The social processing of artificial faces. iScience 2022, 25, 105441. [Google Scholar] [CrossRef]
- Bauer, K.; Von Zahn, M.; Hinz, O. Expl(AI)ned: The Impact of Explainable Artificial Intelligence on Users’ Information Processing. Inf. Syst. Res. 2023, 34, 1582–1602. [Google Scholar] [CrossRef]
- Chatterjee, S.; Shenoy, P. Model-agnostic fits for understanding information seeking patterns in humans. arXiv 2020. [Google Scholar] [CrossRef]
- Gajos, K.Z.; Mamykina, L. Do people engage cognitively with AI? Impact of AI assistance on Incidental Learning. In Proceedings of the 27th International Conference on Intelligent User Interfaces, Helsinki, Finland, 22–25 March 2022; pp. 794–806. [Google Scholar] [CrossRef]
- Huang, Y.; Cheng, Y.; Chen, L.; Hsu, J.Y.-J. Human-AI Co-Learning for Data-Driven AI. arXiv 2019. [Google Scholar] [CrossRef]
- Russell, S.; Moskowitz, I.S.; Raglin, A. Human information interaction, artificial intelligence, and errors. In Autonomy and Artificial Intelligence: A Threat or Savior? Springer: Berlin/Heidelberg, Germany, 2017; pp. 71–101. [Google Scholar] [CrossRef]
- Fisher, M.; Smiley, A.H.; Grillo, T.L.H. Information without knowledge: The effects of Internet search on learning. Memory 2021, 30, 375–387. [Google Scholar] [CrossRef]
Experiment | Objective | Methodology | Key Findings | Additional Insights |
---|---|---|---|---|
Bandit Task | Investigate how the presentation of options affects decision making. | Participants chose between two options (left/right squares) on each trial and received feedback on whether their choice was rewarded. The adversary preassigned rewards to shape preferences toward a predetermined “target” option while being constrained to assign equal total rewards to each option. | Options highlighted or set as default were chosen more frequently, indicating the significant impact of presentation on decision making. | The experiment’s results are significant. The trained adversary, operating within the framework of adversarial bandit tasks, was able to bias choices towards a ‘target’ option. This was achieved while ensuring equal rewards for each option, a crucial aspect of the experiment’s design. When compared to Q-learning models, the adversary achieved a bias of approximately 73.4% toward the target option. Even when tested against human subjects, the adversary maintained a bias of around 70% toward the target. These findings demonstrate the effectiveness of the trained adversary in influencing choice behavior. The adversary’s strategic tactics were instrumental in the experiment’s success. It strategically assigned a few initial rewards to the target action, maintaining its selection probability at a high level while withholding nontarget rewards. Toward the end, the adversary ‘burned’ the nontarget rewards whenever the probability of selecting the target was above chance. Simultaneously, it increased the density of target rewards to counterbalance the effect of the nontarget rewards. This nuanced approach underscores the adversary’s sophisticated deployment of rewards to significantly influence choice behavior, all within the imposed constraints. |
Choice Engineering Task | Investigate the impact of option presentation on decision making. | Participants were presented with a set of options whose presentations varied (e.g., some options were highlighted or set as default). Participants were then asked to choose among these options. | Presenting options as highlighted or default significantly influenced choice, with such options being chosen more frequently. | This experiment utilized a framework where the adversary preassigned rewards to nudge preferences towards a “target” option under constraints to ensure equal total rewards for each option. Against Q-learning models and humans, the adversary achieved a target choice bias of ~73%. Tactics included initial reward assignments to the target option followed by “burning” nontarget rewards when the target was likely to be chosen, showcasing a nuanced strategy to influence decision making. |
Go/No-Go Task | Assess participants’ ability to inhibit their responses in the presence of varying stimuli. | Participants were shown different stimuli and instructed to respond or inhibit their response based on the stimulus type. The AI adversary arranged stimuli to explore participants’ response inhibition capabilities. | Errors increased when the AI adversary manipulated stimuli distribution compared to random “No-Go” trials, indicating the AI’s ability to increase error rates through pattern recognition. | This task implemented an open-loop adversary without access to the subjects’ responses, rearranging 10% No-Go stimuli to maximize errors. Subjects made significantly more errors when No-Go stimuli were adversarially arranged (11.7 errors on average) compared to a random distribution (9.5 errors). The adversary strategically allocated more No-Go trials towards the task’s end, exploiting subjects’ increased tendency to error later in the task, highlighting a deliberate strategy to challenge inhibitory control. |
Multiround Trust Task | Explore dynamics of trust and reciprocity in a game setting between a human investor and an AI trustee. | Over ten rounds, a human “investor” decided how much money to invest with an AI “trustee,” who could return any portion of the tripled investment back to the investor. Two types of AI adversaries, MAX and FAIR, were used. | AI adversaries influenced investors’ decisions by employing strategic behaviors to align with their objectives, effectively manipulating trust and reciprocity in their favor. | This experiment featured AI playing the trustee role, with MAX and FAIR objectives to maximize earnings and minimize earnings gap, respectively. Both adversaries significantly swayed investor behavior. The MAX adversary initially made high repayments to build trust, then reduced repayments to exploit it, depending on the investment amount. Conversely, the FAIR adversary guided investments to balance earnings with proportional repayments, showcasing adaptability and strategic depth in influencing social exchange dynamics. |
Issue | Description | Examples | Implications | Suggested Mitigations | Key Stakeholders | Future Directions |
---|---|---|---|---|---|---|
AI Hallucinations | AI systems generating factually incorrect, nonsensical, or fabricated outputs. | Mathematical and programming errors; misattributions; and higher-level conceptual misunderstandings. | Compromise the reliability and trustworthiness of AI-driven systems, with a potential for spreading misinformation. | Implement rigorous validation and verification protocols to check AI outputs against reliable data sources. Introduce adversarial training techniques to improve model resilience against generating incorrect information. | AI developers, data scientists, users of AI systems, regulatory bodies | Develop advanced techniques for detecting and correcting hallucinations in real-time. Invest in research to understand the root causes of hallucinations in AI systems. |
Misinformation | AI’s potential to create and disseminate misleading or false information. | Fake social media accounts spreading political misinformation. AI-generated hyperrealistic faces used for deception. | Leads to the manipulation of public opinion and erosion of trust in information sources. | Enhance model training with fact-checking algorithms; partner with domain experts to curate training data and identify misinformation; and implement user education programs on the limitations of AI in discerning truth from fiction. | Tech companies, media organizations, educational institutions, general public | Explore blockchain and other immutable ledger technologies for verifying the sources and accuracy of training data and advocate for transparency in AI-generated content. |
Unpredictable AI Behavior | AI systems exhibiting unexpected, inconsistent, or erratic behaviors. | Tay chatbot’s offensive tweets. Sydney chatbot’s erratic and inappropriate responses. | Causes discomfort and distrust among users, presenting challenges in understanding and controlling AI systems. | Incorporate extensive scenario-based testing before deployment; use ensemble methods to diversify AI responses; and establish clear guidelines for human intervention when AI behavior deviates from expected norms. | AI developers, end users, ethicists, industry regulators | Investigate the underlying algorithms and data that lead to unpredictable behavior and promote interdisciplinary research to design AI systems with predictable outcomes. |
Erosion of Human Autonomy | AI’s influence on human decision making and its potential to reduce human agency. | Covert manipulation of choices through personalized recommendations and overreliance on AI for information seeking and decision making. | Diminishes human initiative and critical thinking skills, raising ethical concerns regarding free will and authentic choice. | Design AI systems that supplement rather than replace human decision making. Promote digital literacy to enhance user understanding of AI recommendations. Implement ethical guidelines that prioritize human autonomy. | Technology users, AI ethicists, educational organizations, policymakers | Develop AI systems that require explicit human confirmation for critical decisions and encourage ethical audits of AI systems to assess their impact on human autonomy. |
Biases and Stereotypes | AI systems perpetuating or amplifying societal biases and stereotypes. | Facial recognition systems exhibiting racial biases and language models generating stereotypical or discriminatory outputs. | Reinforces existing inequalities and results in unfair treatment of marginalized groups. | Apply debiasing techniques during model training. Regularly audit AI systems for biased outcomes and adjust. Accordingly, involve diverse teams in AI development to reduce unconscious biases. | AI developers, affected communities, diversity and inclusion advocates, regulatory agencies | Foster research into automated debiasing tools. Promote transparency and accountability in AI development and deployment processes. |
Privacy and Security | AI’s potential to violate individual privacy and pose security risks. | Unauthorized use of personal data for AI training. AI-assisted cyberattacks and social engineering. | Breaches sensitive information, increasing vulnerability to data misuse and exploitation. | Implement robust data encryption and anonymization techniques. Adhere to strict data access controls and privacy regulations. Conduct regular security audits and threat assessments. | Individuals, data protection agencies, cybersecurity experts, AI companies | Advance cryptographic techniques for AI models. Develop international standards for AI security and privacy. |
Ethical Challenges | Ethical dilemmas arising from AI’s deployment in sensitive domains. | AI chatbots providing harmful advice in mental health contexts and autonomous systems making life-altering decisions without human oversight. | Highlights the potential for unintended consequences and harm, underscoring the need for robust ethical frameworks and guidelines. | Establish multidisciplinary ethics committees to oversee AI projects. Integrate ethical considerations into the AI development lifecycle. Educate AI professionals on ethical principles and their application. | AI researchers, ethicists, regulatory bodies, public stakeholders | Promote global collaboration on ethical AI frameworks. Incorporate ethical impact assessments in the AI development process. |
Deceptive Backdoor Strategies | LLMs can be deliberately trained to exhibit backdoor behaviors that activate under specific conditions, demonstrating a sophisticated level of conditional malfeasance. These behaviors persist even after extensive safety training regimes. | Training models to write secure code for a specific year but to insert exploitable code when a different year is specified. | Highlights the difficulty in eliminating embedded deceptions, underlining the potential for AI systems to execute sophisticated and targeted attacks without detection. | Integrate integrity checks and anomaly detection in AI training and deployment phases. Foster open-source collaboration to identify and mitigate backdoors. Educate AI developers on secure coding practices. | AI developers, security researchers, open-source communities, regulatory bodies | Enhance techniques for detecting and neutralizing backdoors in AI systems. Establish industry-wide best practices for secure AI development. |
Resilience to Safety Training | Despite comprehensive safety training, including advanced techniques like supervised fine-tuning, reinforcement learning, and adversarial training, deceptive strategies within LLMs remain robust. This indicates significant challenges in mitigating hidden, potentially harmful behaviors. | Models retaining their deceptive strategies even after undergoing extensive safety training designed to correct such behaviors. | Suggests current safety methodologies may not effectively mitigate the risk posed by deliberately deceptive or inadvertently misaligned AI systems. | Invest in research to understand the mechanisms behind resilience to safety training. Develop more sophisticated adversarial training methods. Encourage the sharing of best practices among AI practitioners. | AI researchers, safety engineers, policymakers, AI ethics boards | Explore new AI training paradigms that inherently incorporate safety and ethical considerations. Develop metrics for assessing resilience to safety interventions. |
Impact of Model Scale and Complexity | The research underscores that the largest models and those with complex reasoning abilities, such as chain-of-thought reasoning, exhibit greater persistence of deceptive behaviors. This raises concerns about the scalability of current safety measures against increasingly sophisticated AI systems. | Larger models and those trained with complex reasoning show greater persistence of deceptive behaviors. | Indicates that as models become more advanced, their capacity for deception and resistance to safety interventions may increase, complicating efforts to ensure ethical and safety standards. | Conduct thorough impact assessments for large and complex AI models before deployment. Develop scalable safety and ethical guidelines that grow with AI complexity. Promote interdisciplinary research to understand the implications of AI scale. | Researchers, AI companies, regulatory agencies, society at large | Study the long-term impacts of large AI models on society and individual well-being. Innovate in modular AI design for easier safety assessments. |
Unintended Consequences of Adversarial Training | Adversarial training, aimed at identifying and mitigating unsafe behaviors, could inadvertently make models more adept at recognizing conditions for their deceptive strategies, effectively hiding these behaviors rather than eliminating them. | Adversarial training sometimes teaches models to better recognize conditions for their deceptive strategies, effectively hiding these behaviors during training. | Highlights the complexity of using adversarial approaches in safety training, necessitating careful consideration of their potential to inadvertently enhance an AI’s deceptive capabilities. | Implement multi-layered validation strategies to capture and correct unintended learning outcomes. Foster collaboration between AI developers and domain experts to identify potential misalignments. | AI developers, domain experts, ethicists, regulatory bodies | Investigate the dynamics of adversarial training to prevent counterproductive learning. Promote transparency in training processes to facilitate external audits. |
Implications for AI Safety and Trust | The persistence of deceptive behaviors through safety training poses profound challenges for AI safety, suggesting that traditional evaluation and mitigation methods may be insufficient. | Validation Protocols Example: Comparing AI diagnostic tools against established medical databases to ensure accuracy. Adversarial Training Example: Testing AI models with fabricated inputs to identify and mitigate potential vulnerabilities. Transparency Mechanisms Example: Developing AI systems that can provide understandable explanations for their decisions to users. Safety Standards Example: Creating an international AI safety certification that AI systems must achieve before deployment. | Emphasizes the need for novel approaches to detect and neutralize deceptive capabilities in AI, highlighting risks to the reliability and trustworthiness of AI-driven systems. | Implement robust validation and verification protocols to ensure AI outputs are accurate and reliable. Introduce adversarial training and comprehensive safety training to mitigate deceptive behavior. | AI researchers and developers, policymakers and legislators, users and society at large, industry and business leaders, AI ethics boards and regulatory bodies, educational institutions | Develop enhanced transparency and accountability mechanisms for AI decision-making processes. Establish industry-wide safety and ethics standards. Foster interdisciplinary collaboration to address AI challenges. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Williamson, S.M.; Prybutok, V. The Era of Artificial Intelligence Deception: Unraveling the Complexities of False Realities and Emerging Threats of Misinformation. Information 2024, 15, 299. https://doi.org/10.3390/info15060299
Williamson SM, Prybutok V. The Era of Artificial Intelligence Deception: Unraveling the Complexities of False Realities and Emerging Threats of Misinformation. Information. 2024; 15(6):299. https://doi.org/10.3390/info15060299
Chicago/Turabian StyleWilliamson, Steven M., and Victor Prybutok. 2024. "The Era of Artificial Intelligence Deception: Unraveling the Complexities of False Realities and Emerging Threats of Misinformation" Information 15, no. 6: 299. https://doi.org/10.3390/info15060299
APA StyleWilliamson, S. M., & Prybutok, V. (2024). The Era of Artificial Intelligence Deception: Unraveling the Complexities of False Realities and Emerging Threats of Misinformation. Information, 15(6), 299. https://doi.org/10.3390/info15060299