66 Results Found

  • Article
  • Open Access
17 Citations
3,484 Views
11 Pages

Optimizing GPT-4 Turbo Diagnostic Accuracy in Neuroradiology through Prompt Engineering and Confidence Thresholds

  • Akihiko Wada,
  • Toshiaki Akashi,
  • George Shih,
  • Akifumi Hagiwara,
  • Mitsuo Nishizawa,
  • Yayoi Hayakawa,
  • Junko Kikuta,
  • Keigo Shimoji,
  • Katsuhiro Sano and
  • Koji Kamagata
  • + 2 authors

Background and Objectives: Integrating large language models (LLMs) such as GPT-4 Turbo into diagnostic imaging faces a significant challenge, with current misdiagnosis rates ranging from 30–50%. This study evaluates how prompt engineering and...

  • Article
  • Open Access
3 Citations
2,955 Views
13 Pages

2 December 2024

Background: Large language models (LLMs) like GPT-3.5-Turbo and GPT-4 show potential to transform medical diagnostics through their linguistic and analytical capabilities. This study evaluates their diagnostic proficiency using English and German med...

  • Article
  • Open Access
48 Citations
12,317 Views
24 Pages

Cognitive Network Science Reveals Bias in GPT-3, GPT-3.5 Turbo, and GPT-4 Mirroring Math Anxiety in High-School Students

  • Katherine Abramski,
  • Salvatore Citraro,
  • Luigi Lombardi,
  • Giulio Rossetti and
  • Massimo Stella

Large Language Models (LLMs) are becoming increasingly integrated into our lives. Hence, it is important to understand the biases present in their outputs in order to avoid perpetuating harmful stereotypes, which originate in our own flawed ways of t...

  • Article
  • Open Access
1,950 Views
16 Pages

Automated Grading Method of Python Code Submissions Using Large Language Models and Machine Learning

  • Mariam Mahdaoui,
  • Said Nouh,
  • My Seddiq El Kasmi Alaoui and
  • Khalid Kandali

7 August 2025

Assessment is fundamental to programming education; however, it is a labour-intensive and complicated process, especially in extensive learning contexts where it relies significantly on human teachers. This paper presents an automated grading methodo...

  • Article
  • Open Access
14 Citations
5,916 Views
19 Pages

From Detection to Action: A Multimodal AI Framework for Traffic Incident Response

  • Afaq Ahmed,
  • Muhammad Farhan,
  • Hassan Eesaar,
  • Kil To Chong and
  • Hilal Tayara

9 December 2024

With the rising incidence of traffic accidents and growing environmental concerns, the demand for advanced systems to ensure traffic and environmental safety has become increasingly urgent. This paper introduces an automated highway safety management...

  • Feature Paper
  • Article
  • Open Access
25 Citations
11,884 Views
22 Pages

LLM-Informed Multi-Armed Bandit Strategies for Non-Stationary Environments

  • J. de Curtò,
  • I. de Zarzà,
  • Gemma Roig,
  • Juan Carlos Cano,
  • Pietro Manzoni and
  • Carlos T. Calafate

In this paper, we introduce an innovative approach to handling the multi-armed bandit (MAB) problem in non-stationary environments, harnessing the predictive power of large language models (LLMs). With the realization that traditional bandit strategi...

  • Article
  • Open Access
24 Citations
7,667 Views
16 Pages

This study investigates the efficacy of advanced large language models, specifically GPT-4o, Claude-3.5 Sonnet, and GPT-3.5 Turbo, in detecting software vulnerabilities. Our experiment utilized vulnerable and secure code samples from the NIST Softwar...

  • Article
  • Open Access
2 Citations
2,571 Views
20 Pages

The incorporation of artificial intelligence in educational contexts has significantly transformed the support provided to students facing learning difficulties, facilitating both the management of their educational process and their emotions. Additi...

  • Article
  • Open Access
5 Citations
7,650 Views
22 Pages

This study investigates the strategic decision-making abilities of large language models (LLMs) via the game of Tic-Tac-Toe, renowned for its straightforward rules and definitive outcomes. We developed a mobile application coupled with web services,...

  • Article
  • Open Access
19 Citations
5,911 Views
20 Pages

25 September 2023

A modification of the brainstorming process by the application of artificial intelligence (AI) was proposed. Here, we describe the design of the software system “kresilnik”, which enables hybrid work between a human group and AI. The prop...

  • Article
  • Open Access
1 Citation
4,082 Views
23 Pages

Performance Comparison of Large Language Models for Efficient Literature Screening

  • Maria Teresa Colangelo,
  • Stefano Guizzardi,
  • Marco Meleti,
  • Elena Calciolari and
  • Carlo Galli

Background: Systematic reviewers face a growing body of biomedical literature, making early-stage article screening increasingly time-consuming. In this study, we assessed six large language models (LLMs)—OpenHermes, Flan T5, GPT-2, Claude 3 Ha...

  • Article
  • Open Access
908 Views
12 Pages

26 February 2025

Background/Objectives: Falls are the most common adverse in-hospital event, resulting in a considerable social and economic burden on individuals, their families, and the healthcare system. This study aims to develop and implement an automatic coding...

  • Article
  • Open Access
4 Citations
2,137 Views
31 Pages

Automated Identification and Representation of System Requirements Based on Large Language Models and Knowledge Graphs

  • Lei Wang,
  • Ming-Chao Wang,
  • Yuan-Rong Zhang,
  • Jian Ma,
  • Hong-Yu Shao and
  • Zhi-Xing Chang

23 March 2025

In the product design and manufacturing process, the effective management and representation of system requirements (SRs) are crucial for ensuring product quality and consistency. However, current methods are hindered by document ambiguity, weak requ...

  • Article
  • Open Access
1 Citation
3,020 Views
26 Pages

Reflective Dialogues with a Humanoid Robot Integrated with an LLM and a Curated NLU System for Positive Behavioral Change in Older Adults

  • Ryan Browne,
  • Mirza Mohtashim Alam,
  • Qasid Saleem,
  • Abrar Hyder,
  • Tatsuya Kudo,
  • Francesca D’Agresti,
  • Martino Maggio,
  • Keiko Homma,
  • Eerik-Juhanna Siitonen and
  • Naoko Kounosu
  • + 8 authors

7 November 2024

We developed an innovative system that combines Natural Language Understanding (NLU), a curated knowledge base, and the efficient management of a Large Language Model (LLM) to support motivational health coaching. Using Rasa as the core framework, we...

  • Article
  • Open Access
6 Citations
5,368 Views
40 Pages

25 November 2024

The rise in abusive language on social media is a significant threat to mental health and social cohesion. For Bengali speakers, the need for effective detection is critical. However, current methods fall short in addressing the massive volume of con...

  • Article
  • Open Access
3 Citations
5,371 Views
15 Pages

Towards Harnessing the Most of ChatGPT for Korean Grammatical Error Correction

  • Chanjun Park,
  • Seonmin Koo,
  • Gyeongmin Kim and
  • Heuiseok Lim

10 April 2024

In this study, we conduct a pioneering and comprehensive examination of ChatGPT’s (GPT-3.5 Turbo) capabilities within the realm of Korean Grammatical Error Correction (K-GEC). Given the Korean language’s agglutinative nature and its rich...

  • Article
  • Open Access
7 Citations
5,261 Views
24 Pages

Textual emotion recognition (TER) has significant commercial potential since it can be used as an excellent tool to monitor a brand/business reputation, understand customer satisfaction, and personalize recommendations. It is considered a natural lan...

  • Article
  • Open Access
1,530 Views
23 Pages

Zero-Shot Classification of Illicit Dark Web Content with Commercial LLMs: A Comparative Study on Accuracy, Human Consistency, and Inter-Model Agreement

  • Víctor-Pablo Prado-Sánchez,
  • Adrián Domínguez-Díaz,
  • Luis De-Marcos and
  • José-Javier Martínez-Herráiz

19 October 2025

This study evaluates the zero-shot classification performance of eight commercial large language models (LLMs), GPT-4o, GPT-4o Mini, GPT-3.5 Turbo, Claude 3.5 Haiku, Gemini 2.0 Flash, DeepSeek Chat, DeepSeek Reasoner, and Grok, using the CoDA dataset...

  • Article
  • Open Access
2,767 Views
18 Pages

Background: Virtual coaching can help people adopt new healthful behaviors by encouraging them to set specific goals and helping them review their progress. One challenge in creating such systems is analyzing clients’ statements about their act...

  • Article
  • Open Access
2 Citations
4,992 Views
20 Pages

Comparative Analysis of Generic and Fine-Tuned Large Language Models for Conversational Agent Systems

  • Laura Villa,
  • David Carneros-Prado,
  • Cosmin C. Dobrescu,
  • Adrián Sánchez-Miguel,
  • Guillermo Cubero and
  • Ramón Hervás

29 April 2024

In the rapidly evolving domain of conversational agents, the integration of Large Language Models (LLMs) into Chatbot Development Platforms (CDPs) is a significant innovation. This study compares the efficacy of employing generic and fine-tuned GPT-3...

  • Article
  • Open Access
1 Citation
789 Views
13 Pages

Artificial Intelligence Versus Professional Standards: A Cross-Sectional Comparative Study of GPT, Gemini, and ENT UK in Delivering Patient Information on ENT Conditions

  • Ali Alabdalhussein,
  • Nehal Singhania,
  • Shazaan Nadeem,
  • Mohammed Talib,
  • Derar Al-Domaidat,
  • Ibrahim Jimoh,
  • Waleed Khan and
  • Manish Mair

1 September 2025

Objective: Patient information materials are sensitive and, if poorly written, can cause misunderstanding. This study evaluated and compared the readability, actionability, and quality of patient education materials on laryngology topics generated by...

  • Article
  • Open Access
951 Views
14 Pages

13 August 2025

This study presents a fine-tuned Large Language Model approach for predicting band gap and stability of transition metal sulfides. Our method processes textual descriptions of crystal structures directly, eliminating the need for complex feature engi...

  • Article
  • Open Access
4 Citations
1,601 Views
19 Pages

26 January 2025

Traditional methods for assessing urban waterlogging hazards lack real-time efficiency. This study develops a rapid hazard assessment method for urban waterlogging events using social media data and large language models, with the urban waterlogging...

  • Article
  • Open Access
1,894 Views
12 Pages

Artificial Intelligence vs. Human Cognition: A Comparative Analysis of ChatGPT and Candidates Sitting the European Board of Ophthalmology Diploma Examination

  • Anna P. Maino,
  • Jakub Klikowski,
  • Brendan Strong,
  • Wahid Ghaffari,
  • Michał Woźniak,
  • Tristan Bourcier and
  • Andrzej Grzybowski

9 April 2025

Background/Objectives: This paper aims to assess ChatGPT’s performance in answering European Board of Ophthalmology Diploma (EBOD) examination papers and to compare these results to pass benchmarks and candidate results. Methods: This cross-sec...

  • Article
  • Open Access
5,153 Views
17 Pages

Comparative Analysis of AI Models for Python Code Generation: A HumanEval Benchmark Study

  • Ali Bayram,
  • Gonca Gokce Menekse Dalveren and
  • Mohammad Derawi

10 September 2025

This study conducts a comprehensive comparative analysis of six contemporary artificial intelligence models for Python code generation using the HumanEval benchmark. The evaluated models include GPT-3.5 Turbo, GPT-4 Omni, Claude 3.5 Sonnet, Claude 3....

  • Article
  • Open Access
15 Citations
5,218 Views
18 Pages

Improving Training Dataset Balance with ChatGPT Prompt Engineering

  • Mateusz Kochanek,
  • Igor Cichecki,
  • Oliwier Kaszyca,
  • Dominika Szydło,
  • Michał Madej,
  • Dawid Jędrzejewski,
  • Przemysław Kazienko and
  • Jan Kocoń

The rapid evolution of large language models, in particular OpenAI’s GPT-3.5-turbo and GPT-4, indicates a growing interest in advanced computational methodologies. This paper proposes a novel approach to synthetic data generation and knowledge...

  • Article
  • Open Access
11 Citations
2,131 Views
11 Pages

Exploring Multilingual Large Language Models for Enhanced TNM Classification of Radiology Report in Lung Cancer Staging

  • Hidetoshi Matsuo,
  • Mizuho Nishio,
  • Takaaki Matsunaga,
  • Koji Fujimoto and
  • Takamichi Murakami

26 October 2024

Background/Objectives: This study aimed to investigate the accuracy of Tumor, Node, Metastasis (TNM) classification based on radiology reports using GPT3.5-turbo (GPT3.5) and the utility of multilingual large language models (LLMs) in both Japanese a...

  • Article
  • Open Access
1,805 Views
29 Pages

Large Language Model-Based Autonomous Agent for Prognostics and Health Management

  • Minhyeok Cha,
  • Sang-il Yoon,
  • Seongrae Kim,
  • Daeyoung Kang,
  • Keonwoo Nam,
  • Teakyong Lee and
  • Joon-Young Kim

9 September 2025

Prognostics and Health Management (PHM), including fault diagnosis and Remaining Useful Life (RUL) prediction, is critical for ensuring the reliability and efficiency of industrial equipment. However, traditional AI-based methods require extensive ex...

  • Article
  • Open Access
310 Views
14 Pages

Leveraging LLMs for User Rating Prediction from Textual Reviews: A Hospitality Data Annotation Case Study

  • Patricia Nnanna,
  • Olasoji Amujo,
  • Chinedu Pascal Ezenkwu and
  • Ebuka Ibeke

2 December 2025

The proliferation of user-generated content in today’s digital landscape has further increased dependence on online reviews as a source for decision-making in the hospitality industry. There has been an increasing interest in automating this de...

  • Article
  • Open Access
5 Citations
4,638 Views
16 Pages

29 January 2024

The automated transcription of mathematical formulas represents a complex challenge that is of great importance for digital processing and comprehensibility of mathematical content. Consequently, our goal was to analyze state-of-the-art approaches fo...

  • Article
  • Open Access
3 Citations
4,160 Views
20 Pages

Software testing ensures the quality and reliability of software products, but manual test case creation is labor-intensive. With the rise of Large Language Models (LLMs), there is growing interest in unit test creation with LLMs. However, effective...

  • Article
  • Open Access
2,230 Views
10 Pages

Human language comprehension relies on predictive processing; however, the computational mechanisms underlying this phenomenon remain unclear. This study investigates these mechanisms using large language models (LLMs), specifically GPT-3.5-turbo and...

  • Article
  • Open Access
1,321 Views
32 Pages

Siyasat: AI-Powered AI Governance Tool to Generate and Improve AI Policies According to Saudi AI Ethics Principles

  • Dabiah Alboaneen,
  • Shaikha Alhajri,
  • Khloud Alhajri,
  • Muneera Aljalal,
  • Noura Alalyani,
  • Hajer Alsaadan,
  • Zainab Al Thonayan and
  • Raja Alyafer

22 October 2025

The rapid development of artificial intelligence (AI) and growing reliance on generative AI (GenAI) tools such as ChatGPT and Bing Chat have raised concerns about risks, including privacy violations, bias, and discrimination. AI governance is viewed...

  • Article
  • Open Access
1 Citation
2,879 Views
15 Pages

13 August 2024

Neural machine translation (NMT) has shown impressive performance when trained on large-scale corpora. However, generic NMT systems have demonstrated poor performance on out-of-domain translation. To mitigate this issue, several domain adaptation met...

  • Article
  • Open Access
64 Citations
16,557 Views
18 Pages

Large Language Models (LLMs) are reshaping the landscape of Machine Learning (ML) application development. The emergence of versatile LLMs capable of undertaking a wide array of tasks has reduced the necessity for intensive human involvement in train...

  • Article
  • Open Access
1 Citation
3,870 Views
20 Pages

10 January 2025

Conversational recommender systems (CRSs) have garnered increasing attention for their ability to provide personalized recommendations through natural language interactions. Although large language models (LLMs) have shown potential in recommendation...

  • Article
  • Open Access
13 Citations
5,330 Views
21 Pages

Prediction of Arabic Legal Rulings Using Large Language Models

  • Adel Ammar,
  • Anis Koubaa,
  • Bilel Benjdira,
  • Omer Nacar and
  • Serry Sibaee

15 February 2024

In the intricate field of legal studies, the analysis of court decisions is a cornerstone for the effective functioning of the judicial system. The ability to predict court outcomes helps judges during the decision-making process and equips lawyers w...

  • Article
  • Open Access
6 Citations
3,796 Views
15 Pages

Research on Intelligent Grading of Physics Problems Based on Large Language Models

  • Yuhao Wei,
  • Rui Zhang,
  • Jianwei Zhang,
  • Dizhi Qi and
  • Wenqian Cui

21 January 2025

The automation of educational and instructional assessment plays a crucial role in enhancing the quality of teaching management. In physics education, calculation problems with intricate problem-solving ideas pose challenges to the intelligent gradin...

  • Article
  • Open Access
1,838 Views
12 Pages

15 September 2025

Programming education traditionally requires extensive manual assessment of student assignments, which is both time-consuming and resource-intensive for instructors. Recent advances in large language models (LLMs) open opportunities for automating th...

  • Article
  • Open Access
23 Citations
5,285 Views
15 Pages

LLM Adaptive PID Control for B5G Truck Platooning Systems

  • I. de Zarzà,
  • J. de Curtò,
  • Gemma Roig and
  • Carlos T. Calafate

25 June 2023

This paper presents an exploration into the capabilities of an adaptive PID controller within the realm of truck platooning operations, situating the inquiry within the context of Cognitive Radio and AI-enhanced 5G and Beyond 5G (B5G) networks. We de...

  • Article
  • Open Access
786 Views
18 Pages

17 November 2025

This study presents the development of an AI-powered chatbot designed to facilitate accurate and efficient retrieval of information from the FDA drug labeling documents. Leveraging OpenAI’s GPT-3.5-turbo model within a controlled, document-grou...

  • Article
  • Open Access
1 Citation
1,471 Views
14 Pages

22 June 2025

This study aimed to develop a large language model (LLM) chatbot for radiation therapy education and compare the performance of portable document format (PDF)- and webpage-based question-and-answer (Q&A) chatbots. An LLM chatbot was created using...

  • Article
  • Open Access
745 Views
27 Pages

Strengths and Weaknesses of Artificial Intelligence in Exploring Asbestos History and Regulations Across Countries

  • Alessandro Croce,
  • Francesca Ugo,
  • Annalisa Roveta,
  • Carlotta Bertolina,
  • Caterina Rinaudo,
  • Antonio Maconi and
  • Marinella Bertolotti

12 October 2025

Asbestos, consisting of six natural mineral fibrous silicate phases, was widely utilized in industrial development during the 20th century and has left a global legacy of health, environmental, and regulatory challenges. Its remarkable properties (e....

  • Article
  • Open Access
387 Views
32 Pages

Advancements in natural language processing (NLP), particularly Large Language Models (LLMs), have greatly improved how we access knowledge. However, in critical domains like biomedicine, challenges like hallucinations—where language models gen...

  • Article
  • Open Access
9 Citations
4,208 Views
18 Pages

Automated Assessment of Comprehension Strategies from Self-Explanations Using LLMs

  • Bogdan Nicula,
  • Mihai Dascalu,
  • Tracy Arner,
  • Renu Balyan and
  • Danielle S. McNamara

14 October 2023

Text comprehension is an essential skill in today’s information-rich world, and self-explanation practice helps students improve their understanding of complex texts. This study was centered on leveraging open-source Large Language Models (LLMs...

  • Article
  • Open Access
7 Citations
3,950 Views
21 Pages

Large Language Model-Informed X-ray Photoelectron Spectroscopy Data Analysis

  • J. de Curtò,
  • I. de Zarzà,
  • Gemma Roig and
  • Carlos T. Calafate

27 March 2024

X-ray photoelectron spectroscopy (XPS) remains a fundamental technique in materials science, offering invaluable insights into the chemical states and electronic structure of a material. However, the interpretation of XPS spectra can be complex, requ...

  • Article
  • Open Access
385 Views
25 Pages

Chain-of-Thought Prompt Optimization via Adversarial Learning

  • Guang Yang,
  • Xiantao Cai,
  • Shaohe Wang and
  • Juhua Liu

9 December 2025

Chain-of-Thought (CoT) prompting has demonstrated strong effectiveness in improving the reasoning capabilities of Large Language Models (LLMs). However, existing CoT optimization approaches still lack systematic mechanisms for evaluating and refining...

  • Article
  • Open Access
3,261 Views
15 Pages

Artificial Intelligence Outperforms Physicians in General Medical Knowledge, Except in the Paediatrics Domain: A Cross-Sectional Study

  • Joana Miranda,
  • Raquel Pereira-Silva,
  • João Guichard,
  • Jorge Meneses,
  • Andreia Neves Carreira and
  • Daniela Seixas

Generative artificial intelligence (genAI) shows promising results in clinical practice. This study compared a GPT-4-turbo virtual assistant with physicians from Italy, France, Spain, and Portugal on medical knowledge derived from national exams whil...

  • Article
  • Open Access
2 Citations
1,556 Views
25 Pages

28 April 2025

The increasing frequency and destructiveness of natural disasters necessitate scalable, transparent, and timely analytical frameworks for risk reduction. Traditional disaster datasets—curated by intergovernmental bodies such as EM-DAT and UNDRR...

  • Article
  • Open Access
5 Citations
5,014 Views
15 Pages

Optimizing Ingredient Substitution Using Large Language Models to Enhance Phytochemical Content in Recipes

  • Luís Rita,
  • Joshua Southern,
  • Ivan Laponogov,
  • Kyle Higgins and
  • Kirill Veselkov

26 November 2024

In the emerging field of computational gastronomy, aligning culinary practices with scientifically supported nutritional goals is increasingly important. This study explores how large language models (LLMs) can be applied to optimize ingredient subst...
