Search for Articles

Article

17 Citations

3,484 Views

11 Pages

Optimizing GPT-4 Turbo Diagnostic Accuracy in Neuroradiology through Prompt Engineering and Confidence Thresholds

Akihiko Wada,
Toshiaki Akashi,
George Shih,
Akifumi Hagiwara,
Mitsuo Nishizawa,
Yayoi Hayakawa,
Junko Kikuta,
Keigo Shimoji,
Katsuhiro Sano and
Koji Kamagata
+ 2 authors

Diagnostics2024, 14(14), 1541;https://doi.org/10.3390/diagnostics14141541

-

17 July 2024

Background and Objectives: Integrating large language models (LLMs) such as GPT-4 Turbo into diagnostic imaging faces a significant challenge, with current misdiagnosis rates ranging from 30–50%. This study evaluates how prompt engineering and...

Full Article

Article

3 Citations

2,955 Views

13 Pages

From Language Models to Medical Diagnoses: Assessing the Potential of GPT-4 and GPT-3.5-Turbo in Digital Health

Jonas Roos,
Theresa Isabelle Wilhelm,
Ron Martin and
Robert Kaczmarczyk

AI2024, 5(4), 2680-2692;https://doi.org/10.3390/ai5040128

-

2 December 2024

Background: Large language models (LLMs) like GPT-3.5-Turbo and GPT-4 show potential to transform medical diagnostics through their linguistic and analytical capabilities. This study evaluates their diagnostic proficiency using English and German med...

Full Article

Article

48 Citations

12,317 Views

24 Pages

Cognitive Network Science Reveals Bias in GPT-3, GPT-3.5 Turbo, and GPT-4 Mirroring Math Anxiety in High-School Students

Katherine Abramski,
Salvatore Citraro,
Luigi Lombardi,
Giulio Rossetti and
Massimo Stella

Big Data Cogn. Comput.2023, 7(3), 124;https://doi.org/10.3390/bdcc7030124

-

27 June 2023

Large Language Models (LLMs) are becoming increasingly integrated into our lives. Hence, it is important to understand the biases present in their outputs in order to avoid perpetuating harmful stereotypes, which originate in our own flawed ways of t...

Full Article

Article

1,950 Views

16 Pages

Automated Grading Method of Python Code Submissions Using Large Language Models and Machine Learning

Mariam Mahdaoui,
Said Nouh,
My Seddiq El Kasmi Alaoui and
Khalid Kandali

Information2025, 16(8), 674;https://doi.org/10.3390/info16080674

-

7 August 2025

Assessment is fundamental to programming education; however, it is a labour-intensive and complicated process, especially in extensive learning contexts where it relies significantly on human teachers. This paper presents an automated grading methodo...

Full Article

Article

14 Citations

5,916 Views

19 Pages

From Detection to Action: A Multimodal AI Framework for Traffic Incident Response

Afaq Ahmed,
Muhammad Farhan,
Hassan Eesaar,
Kil To Chong and
Hilal Tayara

Drones2024, 8(12), 741;https://doi.org/10.3390/drones8120741

-

9 December 2024

With the rising incidence of traffic accidents and growing environmental concerns, the demand for advanced systems to ensure traffic and environmental safety has become increasingly urgent. This paper introduces an automated highway safety management...

Full Article

Feature Paper
Article

25 Citations

11,884 Views

22 Pages

LLM-Informed Multi-Armed Bandit Strategies for Non-Stationary Environments

J. de Curtò,
I. de Zarzà,
Gemma Roig,
Juan Carlos Cano,
Pietro Manzoni and
Carlos T. Calafate

Electronics2023, 12(13), 2814;https://doi.org/10.3390/electronics12132814

-

25 June 2023

In this paper, we introduce an innovative approach to handling the multi-armed bandit (MAB) problem in non-stationary environments, harnessing the predictive power of large language models (LLMs). With the realization that traditional bandit strategi...

Full Article

Article

24 Citations

7,667 Views

16 Pages

Enhancing Software Code Vulnerability Detection Using GPT-4o and Claude-3.5 Sonnet: A Study on Prompt Engineering Techniques

Jaehyeon Bae,
Seoryeong Kwon and
Seunghwan Myeong

Electronics2024, 13(13), 2657;https://doi.org/10.3390/electronics13132657

-

6 July 2024

This study investigates the efficacy of advanced large language models, specifically GPT-4o, Claude-3.5 Sonnet, and GPT-3.5 Turbo, in detecting software vulnerabilities. Our experiment utilized vulnerable and secure code samples from the NIST Softwar...

Full Article

Article

2 Citations

2,571 Views

20 Pages

LLMs in Education: Evaluation GPT and BERT Models in Student Comment Classification

Anabel Pilicita and
Enrique Barra

Multimodal Technol. Interact.2025, 9(5), 44;https://doi.org/10.3390/mti9050044

-

12 May 2025

The incorporation of artificial intelligence in educational contexts has significantly transformed the support provided to students facing learning difficulties, facilitating both the management of their educational process and their emotions. Additi...

Full Article

Article

5 Citations

7,650 Views

22 Pages

Benchmarking Large Language Model (LLM) Performance for Game Playing via Tic-Tac-Toe

Oguzhan Topsakal and
Jackson B. Harper

Electronics2024, 13(8), 1532;https://doi.org/10.3390/electronics13081532

-

17 April 2024

This study investigates the strategic decision-making abilities of large language models (LLMs) via the game of Tic-Tac-Toe, renowned for its straightforward rules and definitive outcomes. We developed a mobile application coupled with web services,...

Full Article

Article

19 Citations

5,911 Views

20 Pages

Brainstorming Will Never Be the Same Again—A Human Group Supported by Artificial Intelligence

Franc Lavrič and
Andrej Škraba

Mach. Learn. Knowl. Extr.2023, 5(4), 1282-1301;https://doi.org/10.3390/make5040065

-

25 September 2023

A modification of the brainstorming process by the application of artificial intelligence (AI) was proposed. Here, we describe the design of the software system “kresilnik”, which enables hybrid work between a human group and AI. The prop...

Full Article

Article

1 Citations

4,082 Views

23 Pages

Performance Comparison of Large Language Models for Efficient Literature Screening

Maria Teresa Colangelo,
Stefano Guizzardi,
Marco Meleti,
Elena Calciolari and
Carlo Galli

BioMedInformatics2025, 5(2), 25;https://doi.org/10.3390/biomedinformatics5020025

-

7 May 2025

Background: Systematic reviewers face a growing body of biomedical literature, making early-stage article screening increasingly time-consuming. In this study, we assessed six large language models (LLMs)—OpenHermes, Flan T5, GPT-2, Claude 3 Ha...

Full Article

Article

908 Views

12 Pages

A Large Language Model-Based Approach for Coding Information from Free-Text Reported in Fall Risk Surveillance Systems: New Opportunities for In-Hospital Risk Management

Davide Rango,
Giulia Lorenzoni,
Henrique Salmazo Da Silva,
Vicente Paulo Alves and
Dario Gregori

J. Clin. Med.2025, 14(5), 1580;https://doi.org/10.3390/jcm14051580

-

26 February 2025

Background/Objectives: Falls are the most common adverse in-hospital event, resulting in a considerable social and economic burden on individuals, their families, and the healthcare system. This study aims to develop and implement an automatic coding...

Full Article

Article

4 Citations

2,137 Views

31 Pages

Automated Identification and Representation of System Requirements Based on Large Language Models and Knowledge Graphs

Lei Wang,
Ming-Chao Wang,
Yuan-Rong Zhang,
Jian Ma,
Hong-Yu Shao and
Zhi-Xing Chang

Appl. Sci.2025, 15(7), 3502;https://doi.org/10.3390/app15073502

-

23 March 2025

In the product design and manufacturing process, the effective management and representation of system requirements (SRs) are crucial for ensuring product quality and consistency. However, current methods are hindered by document ambiguity, weak requ...

Full Article

Article

1 Citations

3,020 Views

26 Pages

Reflective Dialogues with a Humanoid Robot Integrated with an LLM and a Curated NLU System for Positive Behavioral Change in Older Adults

Ryan Browne,
Mirza Mohtashim Alam,
Qasid Saleem,
Abrar Hyder,
Tatsuya Kudo,
Francesca D’Agresti,
Martino Maggio,
Keiko Homma,
Eerik-Juhanna Siitonen and
Naoko Kounosu
+ 8 authors

Electronics2024, 13(22), 4364;https://doi.org/10.3390/electronics13224364

-

7 November 2024

We developed an innovative system that combines Natural Language Understanding (NLU), a curated knowledge base, and the efficient management of a Large Language Model (LLM) to support motivational health coaching. Using Rasa as the core framework, we...

Full Article

Article

6 Citations

5,368 Views

40 Pages

Investigating the Predominance of Large Language Models in Low-Resource Bangla Language over Transformer Models for Hate Speech Detection: A Comparative Analysis

Fatema Tuj Johora Faria,
Laith H. Baniata and
Sangwoo Kang

Mathematics2024, 12(23), 3687;https://doi.org/10.3390/math12233687

-

25 November 2024

The rise in abusive language on social media is a significant threat to mental health and social cohesion. For Bengali speakers, the need for effective detection is critical. However, current methods fall short in addressing the massive volume of con...

Full Article

Article

3 Citations

5,371 Views

15 Pages

Towards Harnessing the Most of ChatGPT for Korean Grammatical Error Correction

Chanjun Park,
Seonmin Koo,
Gyeongmin Kim and
Heuiseok Lim

Appl. Sci.2024, 14(8), 3195;https://doi.org/10.3390/app14083195

-

10 April 2024

In this study, we conduct a pioneering and comprehensive examination of ChatGPT’s (GPT-3.5 Turbo) capabilities within the realm of Korean Grammatical Error Correction (K-GEC). Given the Korean language’s agglutinative nature and its rich...

Full Article

Article

7 Citations

5,261 Views

24 Pages

Evaluating Arabic Emotion Recognition Task Using ChatGPT Models: A Comparative Analysis between Emotional Stimuli Prompt, Fine-Tuning, and In-Context Learning

El Habib Nfaoui and
Hanane Elfaik

J. Theor. Appl. Electron. Commer. Res.2024, 19(2), 1118-1141;https://doi.org/10.3390/jtaer19020058

-

14 May 2024

Textual emotion recognition (TER) has significant commercial potential since it can be used as an excellent tool to monitor a brand/business reputation, understand customer satisfaction, and personalize recommendations. It is considered a natural lan...

Full Article

Article

1,530 Views

23 Pages

Zero-Shot Classification of Illicit Dark Web Content with Commercial LLMs: A Comparative Study on Accuracy, Human Consistency, and Inter-Model Agreement

Víctor-Pablo Prado-Sánchez,
Adrián Domínguez-Díaz,
Luis De-Marcos and
José-Javier Martínez-Herráiz

Electronics2025, 14(20), 4101;https://doi.org/10.3390/electronics14204101

-

19 October 2025

This study evaluates the zero-shot classification performance of eight commercial large language models (LLMs), GPT-4o, GPT-4o Mini, GPT-3.5 Turbo, Claude 3.5 Haiku, Gemini 2.0 Flash, DeepSeek Chat, DeepSeek Reasoner, and Grok, using the CoDA dataset...

Full Article

Article

2,767 Views

18 Pages

Using Large Language Models to Extract Structured Data from Health Coaching Dialogues: A Comparative Study of Code Generation Versus Direct Information Extraction

Sai Sangameswara Aadithya Kanduri,
Apoorv Prasad and
Susan McRoy

BioMedInformatics2025, 5(3), 50;https://doi.org/10.3390/biomedinformatics5030050

-

4 September 2025

Background: Virtual coaching can help people adopt new healthful behaviors by encouraging them to set specific goals and helping them review their progress. One challenge in creating such systems is analyzing clients’ statements about their act...

Full Article

Article

2 Citations

4,992 Views

20 Pages

Comparative Analysis of Generic and Fine-Tuned Large Language Models for Conversational Agent Systems

Laura Villa,
David Carneros-Prado,
Cosmin C. Dobrescu,
Adrián Sánchez-Miguel,
Guillermo Cubero and
Ramón Hervás

Robotics2024, 13(5), 68;https://doi.org/10.3390/robotics13050068

-

29 April 2024

In the rapidly evolving domain of conversational agents, the integration of Large Language Models (LLMs) into Chatbot Development Platforms (CDPs) is a significant innovation. This study compares the efficacy of employing generic and fine-tuned GPT-3...

Full Article

Article

1 Citations

789 Views

13 Pages

Artificial Intelligence Versus Professional Standards: A Cross-Sectional Comparative Study of GPT, Gemini, and ENT UK in Delivering Patient Information on ENT Conditions

Ali Alabdalhussein,
Nehal Singhania,
Shazaan Nadeem,
Mohammed Talib,
Derar Al-Domaidat,
Ibrahim Jimoh,
Waleed Khan and
Manish Mair

Diseases2025, 13(9), 286;https://doi.org/10.3390/diseases13090286

-

1 September 2025

Objective: Patient information materials are sensitive and, if poorly written, can cause misunderstanding. This study evaluated and compared the readability, actionability, and quality of patient education materials on laryngology topics generated by...

Full Article

Article

951 Views

14 Pages

Fine-Tuned Large Language Models for High-Accuracy Prediction of Band Gap and Stability in Transition Metal Sulfides

Zimo Zhao,
Lin Hu and
Honghui Wang

Materials2025, 18(16), 3793;https://doi.org/10.3390/ma18163793

-

13 August 2025

This study presents a fine-tuned Large Language Model approach for predicting band gap and stability of transition metal sulfides. Our method processes textual descriptions of crystal structures directly, eliminating the need for complex feature engi...

Full Article

Article

4 Citations

1,601 Views

19 Pages

Social Media Data-Based Rapid Hazard Assessment of Urban Waterlogging Event: A Case Study of Guilin 6.19 Waterlogging

Guiquan Mo,
Ziyao Xing,
Yueqin Zhu,
Lutong Huang,
Wenting Chen and
Jian Li

Water2025, 17(3), 354;https://doi.org/10.3390/w17030354

-

26 January 2025

Traditional methods for assessing urban waterlogging hazards lack real-time efficiency. This study develops a rapid hazard assessment method for urban waterlogging events using social media data and large language models, with the urban waterlogging...

Full Article

Article

1,894 Views

12 Pages

Artificial Intelligence vs. Human Cognition: A Comparative Analysis of ChatGPT and Candidates Sitting the European Board of Ophthalmology Diploma Examination

Anna P. Maino,
Jakub Klikowski,
Brendan Strong,
Wahid Ghaffari,
Michał Woźniak,
Tristan Bourcier and
Andrzej Grzybowski

Vision2025, 9(2), 31;https://doi.org/10.3390/vision9020031

-

9 April 2025

Background/Objectives: This paper aims to assess ChatGPT’s performance in answering European Board of Ophthalmology Diploma (EBOD) examination papers and to compare these results to pass benchmarks and candidate results. Methods: This cross-sec...

Full Article

Article

5,153 Views

17 Pages

Comparative Analysis of AI Models for Python Code Generation: A HumanEval Benchmark Study

Ali Bayram,
Gonca Gokce Menekse Dalveren and
Mohammad Derawi

Appl. Sci.2025, 15(18), 9907;https://doi.org/10.3390/app15189907

-

10 September 2025

This study conducts a comprehensive comparative analysis of six contemporary artificial intelligence models for Python code generation using the HumanEval benchmark. The evaluated models include GPT-3.5 Turbo, GPT-4 Omni, Claude 3.5 Sonnet, Claude 3....

Full Article

Article

15 Citations

5,218 Views

18 Pages

Improving Training Dataset Balance with ChatGPT Prompt Engineering

Mateusz Kochanek,
Igor Cichecki,
Oliwier Kaszyca,
Dominika Szydło,
Michał Madej,
Dawid Jędrzejewski,
Przemysław Kazienko and
Jan Kocoń

Electronics2024, 13(12), 2255;https://doi.org/10.3390/electronics13122255

-

8 June 2024

The rapid evolution of large language models, in particular OpenAI’s GPT-3.5-turbo and GPT-4, indicates a growing interest in advanced computational methodologies. This paper proposes a novel approach to synthetic data generation and knowledge...

Full Article

Article

11 Citations

2,131 Views

11 Pages

Exploring Multilingual Large Language Models for Enhanced TNM Classification of Radiology Report in Lung Cancer Staging

Hidetoshi Matsuo,
Mizuho Nishio,
Takaaki Matsunaga,
Koji Fujimoto and
Takamichi Murakami

Cancers2024, 16(21), 3621;https://doi.org/10.3390/cancers16213621

-

26 October 2024

Background/Objectives: This study aimed to investigate the accuracy of Tumor, Node, Metastasis (TNM) classification based on radiology reports using GPT3.5-turbo (GPT3.5) and the utility of multilingual large language models (LLMs) in both Japanese a...

Full Article

Article

1,805 Views

29 Pages

Large Language Model-Based Autonomous Agent for Prognostics and Health Management

Minhyeok Cha,
Sang-il Yoon,
Seongrae Kim,
Daeyoung Kang,
Keonwoo Nam,
Teakyong Lee and
Joon-Young Kim

Machines2025, 13(9), 831;https://doi.org/10.3390/machines13090831

-

9 September 2025

Prognostics and Health Management (PHM), including fault diagnosis and Remaining Useful Life (RUL) prediction, is critical for ensuring the reliability and efficiency of industrial equipment. However, traditional AI-based methods require extensive ex...

Full Article

Article

310 Views

14 Pages

Leveraging LLMs for User Rating Prediction from Textual Reviews: A Hospitality Data Annotation Case Study

Patricia Nnanna,
Olasoji Amujo,
Chinedu Pascal Ezenkwu and
Ebuka Ibeke

Information2025, 16(12), 1059;https://doi.org/10.3390/info16121059

-

2 December 2025

The proliferation of user-generated content in today’s digital landscape has further increased dependence on online reviews as a source for decision-making in the hospitality industry. There has been an increasing interest in automating this de...

Full Article

Article

5 Citations

4,638 Views

16 Pages

Investigating Models for the Transcription of Mathematical Formulas in Images

Christian Feichter and
Tim Schlippe

Appl. Sci.2024, 14(3), 1140;https://doi.org/10.3390/app14031140

-

29 January 2024

The automated transcription of mathematical formulas represents a complex challenge that is of great importance for digital processing and comprehensibility of mathematical content. Consequently, our goal was to analyze state-of-the-art approaches fo...

Full Article

Article

3 Citations

4,160 Views

20 Pages

A Systematic Approach for Assessing Large Language Models’ Test Case Generation Capability

Hung-Fu Chang and
Mohammad Shokrolah Shirazi

Software2025, 4(1), 5;https://doi.org/10.3390/software4010005

-

10 March 2025

Software testing ensures the quality and reliability of software products, but manual test case creation is labor-intensive. With the rise of Large Language Models (LLMs), there is growing interest in unit test creation with LLMs. However, effective...

Full Article

Article

2,230 Views

10 Pages

Predictive Language Processing in Humans and Large Language Models: A Comparative Study of Contextual Dependencies

Yifan Zhang and
Kuzma Strelnikov

Informatics2025, 12(3), 83;https://doi.org/10.3390/informatics12030083

-

15 August 2025

Human language comprehension relies on predictive processing; however, the computational mechanisms underlying this phenomenon remain unclear. This study investigates these mechanisms using large language models (LLMs), specifically GPT-3.5-turbo and...

Full Article

Article

1,321 Views

32 Pages

Siyasat: AI-Powered AI Governance Tool to Generate and Improve AI Policies According to Saudi AI Ethics Principles

Dabiah Alboaneen,
Shaikha Alhajri,
Khloud Alhajri,
Muneera Aljalal,
Noura Alalyani,
Hajer Alsaadan,
Zainab Al Thonayan and
Raja Alyafer

Computers2025, 14(11), 452;https://doi.org/10.3390/computers14110452

-

22 October 2025

The rapid development of artificial intelligence (AI) and growing reliance on generative AI (GenAI) tools such as ChatGPT and Bing Chat have raised concerns about risks, including privacy violations, bias, and discrimination. AI governance is viewed...

Full Article

Article

1 Citations

2,879 Views

15 Pages

Domain Adaptation for Arabic Machine Translation: Financial Texts as a Case Study

Emad A. Alghamdi,
Jezia Zakraoui and
Fares A. Abanmy

Appl. Sci.2024, 14(16), 7088;https://doi.org/10.3390/app14167088

-

13 August 2024

Neural machine translation (NMT) has shown impressive performance when trained on large-scale corpora. However, generic NMT systems have demonstrated poor performance on out-of-domain translation. To mitigate this issue, several domain adaptation met...

Full Article

Article

64 Citations

16,557 Views

18 Pages

Prompt Engineering or Fine-Tuning? A Case Study on Phishing Detection with Large Language Models

Fouad Trad and
Ali Chehab

Mach. Learn. Knowl. Extr.2024, 6(1), 367-384;https://doi.org/10.3390/make6010018

-

6 February 2024

Large Language Models (LLMs) are reshaping the landscape of Machine Learning (ML) application development. The emergence of versatile LLMs capable of undertaking a wide array of tasks has reduced the necessity for intensive human involvement in train...

Full Article

Article

1 Citations

3,870 Views

20 Pages

Extracting Implicit User Preferences in Conversational Recommender Systems Using Large Language Models

Woo-Seok Kim,
Seongho Lim,
Gun-Woo Kim and
Sang-Min Choi

Mathematics2025, 13(2), 221;https://doi.org/10.3390/math13020221

-

10 January 2025

Conversational recommender systems (CRSs) have garnered increasing attention for their ability to provide personalized recommendations through natural language interactions. Although large language models (LLMs) have shown potential in recommendation...

Full Article

Article

13 Citations

5,330 Views

21 Pages

Prediction of Arabic Legal Rulings Using Large Language Models

Adel Ammar,
Anis Koubaa,
Bilel Benjdira,
Omer Nacar and
Serry Sibaee

Electronics2024, 13(4), 764;https://doi.org/10.3390/electronics13040764

-

15 February 2024

In the intricate field of legal studies, the analysis of court decisions is a cornerstone for the effective functioning of the judicial system. The ability to predict court outcomes helps judges during the decision-making process and equips lawyers w...

Full Article

Article

6 Citations

3,796 Views

15 Pages

Research on Intelligent Grading of Physics Problems Based on Large Language Models

Yuhao Wei,
Rui Zhang,
Jianwei Zhang,
Dizhi Qi and
Wenqian Cui

Educ. Sci.2025, 15(2), 116;https://doi.org/10.3390/educsci15020116

-

21 January 2025

The automation of educational and instructional assessment plays a crucial role in enhancing the quality of teaching management. In physics education, calculation problems with intricate problem-solving ideas pose challenges to the intelligent gradin...

Full Article

Article

1,838 Views

12 Pages

A Comparative Study of Large Language Models in Programming Education: Accuracy, Efficiency, and Feedback in Student Assignment Grading

Andrija Bernik,
Danijel Radošević and
Andrej Čep

Appl. Sci.2025, 15(18), 10055;https://doi.org/10.3390/app151810055

-

15 September 2025

Programming education traditionally requires extensive manual assessment of student assignments, which is both time-consuming and resource-intensive for instructors. Recent advances in large language models (LLMs) open opportunities for automating th...

Full Article

Article

23 Citations

5,285 Views

15 Pages

LLM Adaptive PID Control for B5G Truck Platooning Systems

I. de Zarzà,
J. de Curtò,
Gemma Roig and
Carlos T. Calafate

Sensors2023, 23(13), 5899;https://doi.org/10.3390/s23135899

-

25 June 2023

This paper presents an exploration into the capabilities of an adaptive PID controller within the realm of truck platooning operations, situating the inquiry within the context of Cognitive Radio and AI-enhanced 5G and Beyond 5G (B5G) networks. We de...

Full Article

Article

786 Views

18 Pages

AI-Powered Chatbot for FDA Drug Labeling Information Retrieval: OpenAI GPT for Grounded Question Answering

Manasa Koppula,
Fnu Madhulika,
Navya Sreeramoju and
Praveen Kolimi

Analytics2025, 4(4), 33;https://doi.org/10.3390/analytics4040033

-

17 November 2025

This study presents the development of an AI-powered chatbot designed to facilitate accurate and efficient retrieval of information from the FDA drug labeling documents. Leveraging OpenAI’s GPT-3.5-turbo model within a controlled, document-grou...

Full Article

Article

1 Citations

1,471 Views

14 Pages

Performance Evaluation of Large Language Model Chatbots for Radiation Therapy Education

Jae-Hong Jung,
Daegun Kim,
Kyung-Bae Lee and
Youngjin Lee

Information2025, 16(7), 521;https://doi.org/10.3390/info16070521

-

22 June 2025

This study aimed to develop a large language model (LLM) chatbot for radiation therapy education and compare the performance of portable document format (PDF)- and webpage-based question-and-answer (Q&A) chatbots. An LLM chatbot was created using...

Full Article

Article

745 Views

27 Pages

Strengths and Weaknesses of Artificial Intelligence in Exploring Asbestos History and Regulations Across Countries

Alessandro Croce,
Francesca Ugo,
Annalisa Roveta,
Carlotta Bertolina,
Caterina Rinaudo,
Antonio Maconi and
Marinella Bertolotti

Geosciences2025, 15(10), 395;https://doi.org/10.3390/geosciences15100395

-

12 October 2025

Asbestos, consisting of six natural mineral fibrous silicate phases, was widely utilized in industrial development during the 20th century and has left a global legacy of health, environmental, and regulatory challenges. Its remarkable properties (e....

Full Article

Article

387 Views

32 Pages

Combining LLMs and Knowledge Graphs to Reduce Hallucinations in Biomedical Question Answering

Larissa Pusch and
Tim O. F. Conrad

BioMedInformatics2025, 5(4), 70;https://doi.org/10.3390/biomedinformatics5040070

-

9 December 2025

Advancements in natural language processing (NLP), particularly Large Language Models (LLMs), have greatly improved how we access knowledge. However, in critical domains like biomedicine, challenges like hallucinations—where language models gen...

Full Article

Article

9 Citations

4,208 Views

18 Pages

Automated Assessment of Comprehension Strategies from Self-Explanations Using LLMs

Bogdan Nicula,
Mihai Dascalu,
Tracy Arner,
Renu Balyan and
Danielle S. McNamara

Information2023, 14(10), 567;https://doi.org/10.3390/info14100567

-

14 October 2023

Text comprehension is an essential skill in today’s information-rich world, and self-explanation practice helps students improve their understanding of complex texts. This study was centered on leveraging open-source Large Language Models (LLMs...

Full Article

Article

7 Citations

3,950 Views

21 Pages

Large Language Model-Informed X-ray Photoelectron Spectroscopy Data Analysis

J. de Curtò,
I. de Zarzà,
Gemma Roig and
Carlos T. Calafate

Signals2024, 5(2), 181-201;https://doi.org/10.3390/signals5020010

-

27 March 2024

X-ray photoelectron spectroscopy (XPS) remains a fundamental technique in materials science, offering invaluable insights into the chemical states and electronic structure of a material. However, the interpretation of XPS spectra can be complex, requ...

Full Article

Article

385 Views

25 Pages

Chain-of-Thought Prompt Optimization via Adversarial Learning

Guang Yang,
Xiantao Cai,
Shaohe Wang and
Juhua Liu

Information2025, 16(12), 1092;https://doi.org/10.3390/info16121092

-

9 December 2025

Chain-of-Thought (CoT) prompting has demonstrated strong effectiveness in improving the reasoning capabilities of Large Language Models (LLMs). However, existing CoT optimization approaches still lack systematic mechanisms for evaluating and refining...

Full Article

Article

3,261 Views

15 Pages

Artificial Intelligence Outperforms Physicians in General Medical Knowledge, Except in the Paediatrics Domain: A Cross-Sectional Study

Joana Miranda,
Raquel Pereira-Silva,
João Guichard,
Jorge Meneses,
Andreia Neves Carreira and
Daniela Seixas

Bioengineering2025, 12(6), 653;https://doi.org/10.3390/bioengineering12060653

-

14 June 2025

Generative artificial intelligence (genAI) shows promising results in clinical practice. This study compared a GPT-4-turbo virtual assistant with physicians from Italy, France, Spain, and Portugal on medical knowledge derived from national exams whil...

Full Article

Article

2 Citations

1,556 Views

25 Pages

Unmasking Media Bias, Economic Resilience, and the Hidden Patterns of Global Catastrophes

Fahim Sufi and
Musleh Alsulami

Sustainability2025, 17(9), 3951;https://doi.org/10.3390/su17093951

-

28 April 2025

The increasing frequency and destructiveness of natural disasters necessitate scalable, transparent, and timely analytical frameworks for risk reduction. Traditional disaster datasets—curated by intergovernmental bodies such as EM-DAT and UNDRR...

Full Article

Article

5 Citations

5,014 Views

15 Pages

Optimizing Ingredient Substitution Using Large Language Models to Enhance Phytochemical Content in Recipes

Luís Rita,
Joshua Southern,
Ivan Laponogov,
Kyle Higgins and
Kirill Veselkov

Mach. Learn. Knowl. Extr.2024, 6(4), 2738-2752;https://doi.org/10.3390/make6040131

-

26 November 2024

In the emerging field of computational gastronomy, aligning culinary practices with scientifically supported nutritional goals is increasingly important. This study explores how large language models (LLMs) can be applied to optimize ingredient subst...

Full Article

66 Results Found

Optimizing GPT-4 Turbo Diagnostic Accuracy in Neuroradiology through Prompt Engineering and Confidence Thresholds

From Language Models to Medical Diagnoses: Assessing the Potential of GPT-4 and GPT-3.5-Turbo in Digital Health

Cognitive Network Science Reveals Bias in GPT-3, GPT-3.5 Turbo, and GPT-4 Mirroring Math Anxiety in High-School Students

Automated Grading Method of Python Code Submissions Using Large Language Models and Machine Learning

From Detection to Action: A Multimodal AI Framework for Traffic Incident Response

LLM-Informed Multi-Armed Bandit Strategies for Non-Stationary Environments

Enhancing Software Code Vulnerability Detection Using GPT-4o and Claude-3.5 Sonnet: A Study on Prompt Engineering Techniques

LLMs in Education: Evaluation GPT and BERT Models in Student Comment Classification

Benchmarking Large Language Model (LLM) Performance for Game Playing via Tic-Tac-Toe

Brainstorming Will Never Be the Same Again—A Human Group Supported by Artificial Intelligence

Performance Comparison of Large Language Models for Efficient Literature Screening

A Large Language Model-Based Approach for Coding Information from Free-Text Reported in Fall Risk Surveillance Systems: New Opportunities for In-Hospital Risk Management

Automated Identification and Representation of System Requirements Based on Large Language Models and Knowledge Graphs

Reflective Dialogues with a Humanoid Robot Integrated with an LLM and a Curated NLU System for Positive Behavioral Change in Older Adults

Investigating the Predominance of Large Language Models in Low-Resource Bangla Language over Transformer Models for Hate Speech Detection: A Comparative Analysis

Towards Harnessing the Most of ChatGPT for Korean Grammatical Error Correction

Evaluating Arabic Emotion Recognition Task Using ChatGPT Models: A Comparative Analysis between Emotional Stimuli Prompt, Fine-Tuning, and In-Context Learning

Zero-Shot Classification of Illicit Dark Web Content with Commercial LLMs: A Comparative Study on Accuracy, Human Consistency, and Inter-Model Agreement

Using Large Language Models to Extract Structured Data from Health Coaching Dialogues: A Comparative Study of Code Generation Versus Direct Information Extraction

Comparative Analysis of Generic and Fine-Tuned Large Language Models for Conversational Agent Systems

Artificial Intelligence Versus Professional Standards: A Cross-Sectional Comparative Study of GPT, Gemini, and ENT UK in Delivering Patient Information on ENT Conditions

Fine-Tuned Large Language Models for High-Accuracy Prediction of Band Gap and Stability in Transition Metal Sulfides

Social Media Data-Based Rapid Hazard Assessment of Urban Waterlogging Event: A Case Study of Guilin 6.19 Waterlogging

Artificial Intelligence vs. Human Cognition: A Comparative Analysis of ChatGPT and Candidates Sitting the European Board of Ophthalmology Diploma Examination

Comparative Analysis of AI Models for Python Code Generation: A HumanEval Benchmark Study

Improving Training Dataset Balance with ChatGPT Prompt Engineering

Exploring Multilingual Large Language Models for Enhanced TNM Classification of Radiology Report in Lung Cancer Staging

Large Language Model-Based Autonomous Agent for Prognostics and Health Management

Leveraging LLMs for User Rating Prediction from Textual Reviews: A Hospitality Data Annotation Case Study

Investigating Models for the Transcription of Mathematical Formulas in Images

A Systematic Approach for Assessing Large Language Models’ Test Case Generation Capability

Predictive Language Processing in Humans and Large Language Models: A Comparative Study of Contextual Dependencies

Siyasat: AI-Powered AI Governance Tool to Generate and Improve AI Policies According to Saudi AI Ethics Principles

Domain Adaptation for Arabic Machine Translation: Financial Texts as a Case Study

Prompt Engineering or Fine-Tuning? A Case Study on Phishing Detection with Large Language Models

Extracting Implicit User Preferences in Conversational Recommender Systems Using Large Language Models

Prediction of Arabic Legal Rulings Using Large Language Models

Research on Intelligent Grading of Physics Problems Based on Large Language Models

A Comparative Study of Large Language Models in Programming Education: Accuracy, Efficiency, and Feedback in Student Assignment Grading

LLM Adaptive PID Control for B5G Truck Platooning Systems

AI-Powered Chatbot for FDA Drug Labeling Information Retrieval: OpenAI GPT for Grounded Question Answering

Performance Evaluation of Large Language Model Chatbots for Radiation Therapy Education

Strengths and Weaknesses of Artificial Intelligence in Exploring Asbestos History and Regulations Across Countries

Combining LLMs and Knowledge Graphs to Reduce Hallucinations in Biomedical Question Answering

Automated Assessment of Comprehension Strategies from Self-Explanations Using LLMs

Large Language Model-Informed X-ray Photoelectron Spectroscopy Data Analysis

Chain-of-Thought Prompt Optimization via Adversarial Learning

Artificial Intelligence Outperforms Physicians in General Medical Knowledge, Except in the Paediatrics Domain: A Cross-Sectional Study

Unmasking Media Bias, Economic Resilience, and the Hidden Patterns of Global Catastrophes

Optimizing Ingredient Substitution Using Large Language Models to Enhance Phytochemical Content in Recipes