MDPI - Publisher of Open Access Journals

81 pages, 4442 KB

Open AccessSystematic Review

From Illusion to Insight: A Taxonomic Survey of Hallucination Mitigation Techniques in LLMs

by Ioannis Kazlaris, Efstathios Antoniou, Konstantinos Diamantaras and Charalampos Bratsas

AI 2025, 6(10), 260; https://doi.org/10.3390/ai6100260 - 3 Oct 2025

Large Language Models (LLMs) exhibit remarkable generative capabilities but remain vulnerable to hallucinations—outputs that are fluent yet inaccurate, ungrounded, or inconsistent with source material. To address the lack of methodologically grounded surveys, this paper introduces a novel method-oriented taxonomy of hallucination mitigation strategies [...] Read more.

Large Language Models (LLMs) exhibit remarkable generative capabilities but remain vulnerable to hallucinations—outputs that are fluent yet inaccurate, ungrounded, or inconsistent with source material. To address the lack of methodologically grounded surveys, this paper introduces a novel method-oriented taxonomy of hallucination mitigation strategies in text-based LLMs. The taxonomy organizes over 300 studies into six principled categories: Training and Learning Approaches, Architectural Modifications, Input/Prompt Optimization, Post-Generation Quality Control, Interpretability and Diagnostic Methods, and Agent-Based Orchestration. Beyond mapping the field, we identify persistent challenges such as the absence of standardized evaluation benchmarks, attribution difficulties in multi-method systems, and the fragility of retrieval-based methods when sources are noisy or outdated. We also highlight emerging directions, including knowledge-grounded fine-tuning and hybrid retrieval–generation pipelines integrated with self-reflective reasoning agents. This taxonomy provides a methodological framework for advancing reliable, context-sensitive LLM deployment in high-stakes domains such as healthcare, law, and defense. Full article

(This article belongs to the Section AI Systems: Theory and Applications)

8 pages, 658 KB

Open AccessBrief Report

Mechanistically Explainable AI Model for Predicting Synergistic Cancer Therapy Combinations

by Han Si, Sanyam Kumar, Sneh Lata, Arshad Ahmad, Saurav Ghosh, Karen Stephansen, Deepti Nagarkar, Eda Zhou and Brandon W. Higgs

Curr. Oncol. 2025, 32(10), 548; https://doi.org/10.3390/curroncol32100548 - 30 Sep 2025

Abstract

This study introduces a Large Language Model (LLM)-based framework that combines drug combination data with a knowledge graph to predict synergistic oncology drug combinations with mechanistic insights. Using a retrieval-augmented generation (RAG) approach, over 50,000 in vitro drug pair assay results and 1631 [...] Read more.

This study introduces a Large Language Model (LLM)-based framework that combines drug combination data with a knowledge graph to predict synergistic oncology drug combinations with mechanistic insights. Using a retrieval-augmented generation (RAG) approach, over 50,000 in vitro drug pair assay results and 1631 human clinical trial and preclinical test entries were integrated to enhance predictive accuracy and explainability. Validation achieved an F1 score of 0.80, demonstrating the framework’s potential to streamline drug discovery and improve translational strategies in cancer treatment. Full article

(This article belongs to the Special Issue Shaping the Future of Oncology: The Role of Generative AI in Clinical and Research Environments)

► Show Figures

Figure 1

32 pages, 9638 KB

Open AccessArticle

MSSA: A Multi-Scale Semantic-Aware Method for Remote Sensing Image–Text Retrieval

by Yun Liao, Zongxiao Hu, Fangwei Jin, Junhui Liu, Nan Chen, Jiayi Lv and Qing Duan

Remote Sens. 2025, 17(19), 3341; https://doi.org/10.3390/rs17193341 - 30 Sep 2025

Abstract

In recent years, the convenience and potential for information extraction offered by Remote Sensing Image–Text Retrieval (RSITR) have made it a significant focus of research in remote sensing (RS) knowledge services. Current mainstream methods for RSITR generally align fused image features at multiple [...] Read more.

In recent years, the convenience and potential for information extraction offered by Remote Sensing Image–Text Retrieval (RSITR) have made it a significant focus of research in remote sensing (RS) knowledge services. Current mainstream methods for RSITR generally align fused image features at multiple scales with textual features, primarily focusing on the local information of RS images while neglecting potential semantic information. This results in insufficient alignment in the cross-modal semantic space. To overcome this limitation, we propose a Multi-Scale Semantic-Aware Remote Sensing Image–Text Retrieval method (MSSA). This method introduces Progressive Spatial Channel Joint Attention (PSCJA), which enhances the expressive capability of multi-scale image features through Window-Region-Global Progressive Attention (WRGPA) and Segmented Channel Attention (SCA). Additionally, the Image-Guided Text Attention (IGTA) mechanism dynamically adjust textual attention weights based on visual context. Furthermore, the Cross-Modal Semantic Extraction Module (CMSE) incorporated learnable semantic tokens at each scale, enabling attention interaction between multi-scale features of different modalities and the capturing of hierarchical semantic associations. This multi-scale semantic-guided retrieval method ensures cross-modal semantic consistency, significantly improving the accuracy of cross-modal retrieval in RS. MSSA demonstrates superior retrieval accuracy in experiments across three baseline datasets, achieving a new state-of-the-art performance. Full article

(This article belongs to the Section Remote Sensing Image Processing)

► Show Figures

Figure 1

23 pages, 1167 KB

Open AccessArticle

Integrating RAG for Smarter Animal Certification Platforms

by Pedro Bilar Montero, Jonas Bulegon Gassen, Glênio Descovi, Tais Oltramari Barnasque, Gabriel Rodrigues da Silva, Felipe Amadori Machado, Gabriel Vieira Casanova, Vinícius Maran and Alencar Machado

Information 2025, 16(10), 843; https://doi.org/10.3390/info16100843 - 30 Sep 2025

Abstract

Large Language Models (LLMs) encounter significant challenges when applied in specialized domains that require precise and localized information. This problem is particularly critical in regulatory sectors, such as the animal health sector in Brazil, where professionals depend on complex and constantly updated legal [...] Read more.

Large Language Models (LLMs) encounter significant challenges when applied in specialized domains that require precise and localized information. This problem is particularly critical in regulatory sectors, such as the animal health sector in Brazil, where professionals depend on complex and constantly updated legal norms to perform their work. The generic knowledge encapsulated in traditional LLMs is often insufficient to provide reliable support in these contexts, which can lead to inaccurate or outdated responses. To address this gap, this work presents a practical implementation of a Retrieval-Augmented Generation (RAG) system. We detail the integration of this system with the Plataforma de Defesa Sanitária Animal do Rio Grande do Sul (PDSA-RS), a real platform used for animal production certification. Our solution connects an LLM to an external knowledge base containing specific Brazilian legislation, allowing the model to retrieve relevant legal texts in real time to generate its responses. The principal objective is to demonstrate how this approach can produce accurate and contextually grounded answers for professionals in the veterinary field, assisting in decision-making processes for sanitary certification. Full article

(This article belongs to the Section Artificial Intelligence)

► Show Figures

Graphical abstract

47 pages, 3137 KB

Open AccessArticle

DietQA: A Comprehensive Framework for Personalized Multi-Diet Recipe Retrieval Using Knowledge Graphs, Retrieval-Augmented Generation, and Large Language Models

by Ioannis Tsampos and Emmanouil Marakakis

Computers 2025, 14(10), 412; https://doi.org/10.3390/computers14100412 - 29 Sep 2025

Abstract

Recipes available on the web often lack nutritional transparency and clear indicators of dietary suitability. While searching by title is straightforward, exploring recipes that meet combined dietary needs, nutritional goals, and ingredient-level preferences remains challenging. Most existing recipe search systems do not effectively [...] Read more.

Recipes available on the web often lack nutritional transparency and clear indicators of dietary suitability. While searching by title is straightforward, exploring recipes that meet combined dietary needs, nutritional goals, and ingredient-level preferences remains challenging. Most existing recipe search systems do not effectively support flexible multi-dietary reasoning in combination with user preferences and restrictions. For example, users may seek gluten-free and dairy-free dinners with suitable substitutions, or compound goals such as vegan and low-fat desserts. Recent systematic reviews report that most food recommender systems are content-based and often non-personalized, with limited support for dietary restrictions, ingredient-level exclusions, and multi-criteria nutrition goals. This paper introduces DietQA, an end-to-end, language-adaptable chatbot system that integrates a Knowledge Graph (KG), Retrieval-Augmented Generation (RAG), and a Large Language Model (LLM) to support personalized, dietary-aware recipe search and question answering. DietQA crawls Greek-language recipe websites to extract structured information such as titles, ingredients, and quantities. Nutritional values are calculated using validated food composition databases, and dietary tags are inferred automatically based on ingredient composition. All information is stored in a Neo4j-based knowledge graph, enabling flexible querying via Cypher. Users interact with the system through a natural language chatbot friendly interface, where they can express preferences for ingredients, nutrients, dishes, and diets, and filter recipes based on multiple factors such as ingredient availability, exclusions, and nutritional goals. DietQA supports multi-diet recipe search by retrieving both compliant recipes and those adaptable via ingredient substitutions, explaining how each result aligns with user preferences and constraints. An LLM extracts intents and entities from user queries to support rule-based Cypher retrieval, while the RAG pipeline generates contextualized responses using the user query and preferences, retrieved recipes, statistical summaries, and substitution logic. The system integrates real-time updates of recipe and nutritional data, supporting up-to-date, relevant, and personalized recommendations. It is designed for language-adaptable deployment and has been developed and evaluated using Greek-language content. DietQA provides a scalable framework for transparent and adaptive dietary recommendation systems powered by conversational AI. Full article

(This article belongs to the Special Issue Natural Language Processing (NLP) and Large Language Modelling (2nd Edition))

► Show Figures

Graphical abstract

14 pages, 331 KB

Open AccessArticle

Flow Matching for Simulation-Based Inference: Design Choices and Implications

by Massimiliano Giordano Orsini, Alessio Ferone, Laura Inno, Angelo Casolaro and Antonio Maratea

Electronics 2025, 14(19), 3833; https://doi.org/10.3390/electronics14193833 - 27 Sep 2025

Abstract

Inverse problems are ubiquitous across many scientific fields, and involve the determination of the causes or parameters of a system from observations of its effects or outputs. These problems have been deeply studied through the use of simulated data, thereby under the lens [...] Read more.

Inverse problems are ubiquitous across many scientific fields, and involve the determination of the causes or parameters of a system from observations of its effects or outputs. These problems have been deeply studied through the use of simulated data, thereby under the lens of simulation-based inference. Recently, the natural combination of Continuous Normalizing Flows (CNFs) and Flow Matching Posterior Estimation (FMPE) has emerged as a novel, powerful, and scalable posterior estimator, capable of inferring the distribution of free parameters in a significantly reduced computational time compared to conventional techniques. While CNFs provide substantial flexibility in designing machine learning solutions, modeling decisions during their implementation can strongly influence predictive performance. To the best of our knowledge, no prior work has systematically analyzed how such modeling choices affect the robustness of posterior estimates in this framework. The aim of this work is to address this research gap by investigating the sensitivity of CNFs trained with FMPE under different modeling decisions, including data preprocessing, noise conditioning, and noisy observations. As a case study, we consider atmospheric retrieval of exoplanets and perform an extensive experimental campaign on the Ariel Data Challenge 2023 dataset. Through a comprehensive posterior evaluation framework, we demonstrate that (i) Z-score normalization outperforms min–max scaling across tasks; (ii) noise conditioning improves accuracy, coverage, and uncertainty estimation; and (iii) noisy observations significantly degrade predictive performance, thus underscoring reduced robustness under the assumed noise conditions. Full article

(This article belongs to the Special Issue Digital Signal and Image Processing for Multimedia Technology)

► Show Figures

Figure 1

21 pages, 4052 KB

Open AccessArticle

Enhancing Geological Knowledge Engineering with Retrieval-Augmented Generation: A Case Study of the Qin–Hang Metallogenic Belt

by Jianhua Ma, Yongzhang Zhou, Luhao He, Qianlong Zhang, Muhammad Atif Bilal and Yuqing Zhang

Minerals 2025, 15(10), 1023; https://doi.org/10.3390/min15101023 - 26 Sep 2025

Abstract

This study presents a domain-adapted retrieval-augmented generation (RAG) pipeline that integrates geological knowledge with large language models (LLMs) to support intelligent question answering in the metallogenic domain. Focusing on the Qin–Hang metallogenic belt in South China, we construct a bilingual question-answering (QA) corpus [...] Read more.

This study presents a domain-adapted retrieval-augmented generation (RAG) pipeline that integrates geological knowledge with large language models (LLMs) to support intelligent question answering in the metallogenic domain. Focusing on the Qin–Hang metallogenic belt in South China, we construct a bilingual question-answering (QA) corpus derived from 615 authoritative geological publications, covering topics such as regional tectonics, ore-forming processes, structural evolution, and mineral resources. Using the ChatGLM3-6B language model and LangChain framework, we embed the corpus into a semantic vector database via Sentence-BERT and FAISS, enabling dynamic retrieval and grounded response generation. The RAG-enhanced model significantly outperforms baseline LLMs—including ChatGPT-4, Bing, and Gemini—in a comparative evaluation using BLEU, precision, recall, and F1 metrics, achieving an F1 score of 0.8689. The approach demonstrates high domain adaptability and reproducibility. All datasets and codes are openly released to facilitate application in other metallogenic belts. This work illustrates the potential of LLM-based knowledge engineering to support digital geoscientific research and smart mining. Full article

(This article belongs to the Special Issue Application of Big Data Mining, Machine Learning and Artificial Intelligence in Geoscience, 2nd Edition)

► Show Figures

Figure 1

22 pages, 568 KB

Open AccessReview

To Treat or Not to Treat: A Scoping Review of Speech Treatment for Dysarthria in Amyotrophic Lateral Sclerosis (ALS)

by Brooke-Mai Whelan, Danielle Aldridge, Jessica Ruhle, Persephone Whitelock, Shana Taubert, Annette Collins, Elaine Kearney, Salma Charania, Robert D. Henderson, Sarah J. Wallace, Claire Mitchell, Kaila L. Stipancic, Mili Kuruvilla-Dugdale and Adam P. Vogel

Healthcare 2025, 13(19), 2434; https://doi.org/10.3390/healthcare13192434 - 25 Sep 2025

Abstract

Background: Speech loss is recognised as one of the most devastating outcomes for individuals with ALS, yet active speech intervention is rarely targeted in this population. Clinicians face significant challenges in managing dysarthria associated with ALS due to the rapidly progressive nature of [...] Read more.

Background: Speech loss is recognised as one of the most devastating outcomes for individuals with ALS, yet active speech intervention is rarely targeted in this population. Clinicians face significant challenges in managing dysarthria associated with ALS due to the rapidly progressive nature of the disease, historical concerns around intensive exercise accelerating decline, and an absence of direction on restorative and compensatory intervention strategies in current clinical care guidelines. This review evaluates the scope and quality of evidence for speech treatments in ALS to identify knowledge gaps and establish research priorities to guide clinical care. Methods: Studies were retrieved from six electronic databases (PubMed, CINAHL, Embase, Cochrane library, Web of Science, and PsycINFO). Results: Four studies met inclusion criteria. Treatment approaches included: music-based speech therapy; multisubsystem speech rehabilitation program, tongue strengthening and articulation training; and Lee Silverman Voice Treatment-LOUD^® combined with additional voice and articulation therapy. Sample sizes were small, with all studies demonstrating notable methodological weaknesses. The limited evidence base, marked by conflicting results and methodological flaws, prevents any reliable conclusions about treatment effectiveness. Conclusions: Despite the prevalence and impact of dysarthria in this population, evidence for speech treatment remains sparse, of generally low quality, and provides limited guidance for clinical practice. The changing perspective on exercise in ALS warrants rigorous investigation of tailored dysarthria interventions for this population that are minimally fatiguing and enhance speech by making use of residual physiologic support. Full article

(This article belongs to the Special Issue Improving Care for People Living with ALS/MND)

► Show Figures

Figure 1

23 pages, 1194 KB

Open AccessArticle

Enhancing Embodied Carbon Calculation in Buildings: A Retrieval-Augmented Generation Approach with Large Language Models

by Yushi Zou, Rengeng Zheng and Jun Xia

Buildings 2025, 15(19), 3449; https://doi.org/10.3390/buildings15193449 - 24 Sep 2025

Viewed by 109

Abstract

Accurate calculation of embodied carbon emissions in buildings (ECE) is crucial to achieving global carbon neutrality. However, fragmented data, inconsistent regional standards, and low computational efficiency have long hindered existing methods. This study innovatively integrates large language models (LLMs) with retrieval-enhanced generation (RAG) [...] Read more.

Accurate calculation of embodied carbon emissions in buildings (ECE) is crucial to achieving global carbon neutrality. However, fragmented data, inconsistent regional standards, and low computational efficiency have long hindered existing methods. This study innovatively integrates large language models (LLMs) with retrieval-enhanced generation (RAG) technology to establish a new intelligent accounting paradigm for embodied carbon in buildings. Through a systematic evaluation of three basic models—Kimi, Doubao, and DeepSeek-R1—in a five-level progressive input scenario, the study quantitatively reveals the “information sensitivity” patterns of LLMs. To address the illusion errors of general models in professional scenarios, an innovative three-stage closed-loop architecture of “knowledge retrieval—calculation embedding—trustworthy generation” is proposed. By dynamically invoking domain knowledge bases and embedded computing modules, zero-error verification of benchmark data is achieved. The core contributions include the following: (1) It has been clarified that the basic large model has application potential in calculating the implicit carbon emissions of buildings, but the reliability of the results is limited. (2) The influence of data elements on calculation accuracy is revealed. (3) The application path for integrating RAG with large models has been pioneered, and the results show that the RAG technology can enhance the performance of large models in calculating the implicit carbon emissions of buildings by approximately 25%. (4) The significant efficiency improvement of RAG technology is verified. (5) A supporting theoretical and application system is established. Full article

(This article belongs to the Section Building Energy, Physics, Environment, and Systems)

► Show Figures

Figure 1

16 pages, 2128 KB

Open AccessArticle

Secure Multifaceted-RAG: Hybrid Knowledge Retrieval with Security Filtering

by Grace Byun, Shinsun Lee, Nayoung Choi and Jinho D. Choi

Information 2025, 16(9), 804; https://doi.org/10.3390/info16090804 - 16 Sep 2025

Viewed by 384

Abstract

Existing Retrieval-Augmented Generation (RAG) systems face challenges in enterprise settings due to limited retrieval scope and data security risks. When relevant internal documents are unavailable, the system struggles to generate accurate and complete responses. Additionally, using closed-source Large Language Models (LLMs) raises concerns [...] Read more.

Existing Retrieval-Augmented Generation (RAG) systems face challenges in enterprise settings due to limited retrieval scope and data security risks. When relevant internal documents are unavailable, the system struggles to generate accurate and complete responses. Additionally, using closed-source Large Language Models (LLMs) raises concerns about exposing proprietary information. To address these issues, we propose the Secure Multifaceted-RAG (SecMulti-RAG) framework, which retrieves not only from internal documents but also from two supplementary sources: pre-generated expert knowledge for anticipated queries and on-demand external LLM-generated knowledge. To mitigate security risks, we adopt a local open-source generator and selectively utilize external LLMs only when prompts are deemed safe by a filtering mechanism. This approach enhances completeness, prevents data leakage, and reduces costs. In our evaluation on a report generation task in the automotive industry, SecMulti-RAG significantly outperforms traditional RAG—achieving 79.3–91.9% win rates across correctness, richness, and helpfulness in LLM-based evaluation and 56.3–70.4% in human evaluation. This highlights SecMulti-RAG as a practical and secure solution for enterprise RAG. Full article

(This article belongs to the Special Issue Advanced Retrieval-Augmented Generation Systems Based on Large Language Models)

► Show Figures

Figure 1

47 pages, 4491 KB

Open AccessSystematic Review

New Insights into Agriculture on Small Mediterranean Islands: A Systematic Review

by Mireille Ginésy and Rita Biasi

Land 2025, 14(9), 1874; https://doi.org/10.3390/land14091874 - 13 Sep 2025

Viewed by 439

Abstract

The numerous inhabited small islands of the Mediterranean basin are marginal geographic territories of high natural value. Historically, island communities have developed complex, poly-cultural agricultural systems, based on the use of native genetic resources and traditional ecological knowledge, to address the challenges linked [...] Read more.

The numerous inhabited small islands of the Mediterranean basin are marginal geographic territories of high natural value. Historically, island communities have developed complex, poly-cultural agricultural systems, based on the use of native genetic resources and traditional ecological knowledge, to address the challenges linked to unfavorable climate, geology, and topography. However, economic, socio-demographic, and climatic factors have caused farmland abandonment, leading to soil and land degradation and to a decline in biodiversity and ecosystem services. Following the PRISMA guidelines, we conducted a systematic review to assess the state of scientific research with regard to agriculture on small Mediterranean islands. After screening records retrieved on Scopus, Web of Science, CABI, and Google Scholar, 167 articles published before July 2025 were included in the analysis. The articles covered 6 countries and 126 islands, with Greek and Italian islands being the most represented. Key topics included trajectories, drivers, and consequences of land use change, agrobiodiversity, and water resources. To complete the systematic review, 30 relevant EU-funded projects were identified and analyzed. Overall, the scientific research aimed at supporting agriculture on Mediterranean small islands tends to focus on a single issue or very few issues. However, we suggest that given the complexity of the drivers and consequences of farmland abandonment, more integrated approaches could have a greater impact. By providing a systematic overview of the current state of the research on agriculture on small Mediterranean islands, this review offers a solid basis for guiding ongoing and future research, actions, and policies aimed at building resilience in these fragile and endangered lands. Full article

► Show Figures

Figure 1

21 pages, 3805 KB

Open AccessArticle

GraphTrace: A Modular Retrieval Framework Combining Knowledge Graphs and Large Language Models for Multi-Hop Question Answering

by Anna Osipjan, Hanieh Khorashadizadeh, Akasha-Leonie Kessel, Sven Groppe and Jinghua Groppe

Computers 2025, 14(9), 382; https://doi.org/10.3390/computers14090382 - 11 Sep 2025

Viewed by 490

Abstract

This paper introduces GraphTrace, a novel retrieval framework that integrates a domain-specific knowledge graph (KG) with a large language model (LLM) to improve information retrieval for complex, multi-hop queries. Built on structured economic data related to the COVID-19 pandemic, GraphTrace adopts a modular [...] Read more.

This paper introduces GraphTrace, a novel retrieval framework that integrates a domain-specific knowledge graph (KG) with a large language model (LLM) to improve information retrieval for complex, multi-hop queries. Built on structured economic data related to the COVID-19 pandemic, GraphTrace adopts a modular architecture comprising entity extraction, path finding, query decomposition, semantic path ranking, and context aggregation, followed by LLM-based answer generation. GraphTrace is compared against baseline retrieval-augmented generation (RAG) and graph-based RAG (GraphRAG) approaches in both retrieval and generation settings. Experimental results show that GraphTrace consistently outperforms the baselines across evaluation metrics, particularly in handling mid-complexity (5–6-hop) queries and achieving top scores in directness during the generation evaluation. These gains are attributed to GraphTrace’s alignment of semantic reasoning with structured KG traversal, combining modular components for more targeted and interpretable retrieval. Full article

► Show Figures

Figure 1

18 pages, 1061 KB

Open AccessArticle

HiPC-QR: Hierarchical Prompt Chaining for Query Reformulation

by Hua Yang, Hanyang Li and Teresa Gonçalves

Information 2025, 16(9), 790; https://doi.org/10.3390/info16090790 - 11 Sep 2025

Viewed by 332

Abstract

Query reformulation techniques optimize user queries to better align with documents, thus improving the performance of Information Retrieval (IR) systems. Previous methods have primarily focused on query expansion using techniques such as synonym replacement to improve recall. With the rapid advancement of Large [...] Read more.

Query reformulation techniques optimize user queries to better align with documents, thus improving the performance of Information Retrieval (IR) systems. Previous methods have primarily focused on query expansion using techniques such as synonym replacement to improve recall. With the rapid advancement of Large Language Models (LLMs), the knowledge embedded within these models has grown. Research in prompt engineering has introduced various methods, with prompt chaining proving particularly effective for complex tasks. Directly prompting LLMs to reformulate queries has become a viable approach. However, existing LLM-based prompt methods for query reformulation often introduce irrelevant content into reformulated queries, resulting in decreased retrieval precision and misalignment with user intent. We propose a novel approach called Hierarchical Prompt Chaining for Query Reformulation (HiPC-QR). HiPC-QR employs a two-step prompt chaining technique to extract keywords from the original query and refine its structure by filtering out non-essential keywords based on the user’s query intent. This process reduces the query’s restrictiveness while simultaneously expanding essential keywords to enhance retrieval effectiveness. We evaluated the effectiveness of HiPC-QR on two benchmark retrieval datasets, namely MS MARCO and TREC Deep Learning.The experimental results show that HiPC-QR outperforms existing query reformulation methods on large-scale datasets in terms of both recall@10 and MRR@10. Full article

(This article belongs to the Special Issue Machine Learning and Data Mining: Innovations in Big Data Analytics)

► Show Figures

Figure 1

29 pages, 651 KB

Open AccessSystematic Review

Retrieval-Augmented Generation (RAG) in Healthcare: A Comprehensive Review

by Fnu Neha, Deepshikha Bhati and Deepak Kumar Shukla

AI 2025, 6(9), 226; https://doi.org/10.3390/ai6090226 - 11 Sep 2025

Viewed by 1773

Abstract

Retrieval-Augmented Generation (RAG) enhances large language models (LLMs) by integrating external knowledge retrieval to improve factual consistency and reduce hallucinations. Despite growing interest, its use in healthcare remains fragmented. This paper presents a Systematic Literature Review (SLR) following PRISMA guidelines, synthesizing 30 peer-reviewed [...] Read more.

Retrieval-Augmented Generation (RAG) enhances large language models (LLMs) by integrating external knowledge retrieval to improve factual consistency and reduce hallucinations. Despite growing interest, its use in healthcare remains fragmented. This paper presents a Systematic Literature Review (SLR) following PRISMA guidelines, synthesizing 30 peer-reviewed studies on RAG in clinical domains, focusing on three of its most prevalent and promising applications in diagnostic support, electronic health record (EHR) summarization, and medical question answering. We synthesize the existing architectural variants (naïve, advanced, and modular) and examine their deployment across these applications. Persistent challenges are identified, including retrieval noise (irrelevant or low-quality retrieved information), domain shift (performance degradation when models are applied to data distributions different from their training set), generation latency, and limited explainability. Evaluation strategies are compared using both standard metrics and clinical-specific metrics, FactScore, RadGraph-F1, and MED-F1, which are particularly critical for ensuring factual accuracy, medical validity, and clinical relevance. This synthesis offers a domain-focused perspective to guide researchers, healthcare providers, and policymakers in developing reliable, interpretable, and clinically aligned AI systems, laying the groundwork for future innovation in RAG-based healthcare solutions. Full article

► Show Figures

Figure 1

35 pages, 8381 KB

Open AccessArticle

Bibliometric Analysis of Hospital Design: Knowledge Mapping Evolution and Research Trends

by Jingwen Liu and Youngho Yeo

Buildings 2025, 15(17), 3196; https://doi.org/10.3390/buildings15173196 - 4 Sep 2025

Viewed by 654

Abstract

Hospital design plays a pivotal role in improving patient outcomes, enhancing clinical efficiency, and strengthening infection control. Since the outbreak of COVID-19, research in this field has expanded significantly, showing a marked trend toward interdisciplinary integration. In this study, bibliometric analysis was conducted [...] Read more.

Hospital design plays a pivotal role in improving patient outcomes, enhancing clinical efficiency, and strengthening infection control. Since the outbreak of COVID-19, research in this field has expanded significantly, showing a marked trend toward interdisciplinary integration. In this study, bibliometric analysis was conducted using CiteSpace (version 6.2.R3) as the primary tool, with Excel and Tableau (version 2024.3) as supplementary software. A total of 877 documents on hospital design published between 1932 and 2025 were retrieved from the Web of Science Core Collection and analyzed from multiple perspectives. The analysis examined publication trends, collaborative networks, co-citation structures, disciplinary evolution, and keyword dynamics. The results indicate that the field has entered a phase of rapid development since 2019. Global collaboration networks are becoming increasingly multipolar; yet, institutional and author-level connections remain decentralized, with relatively low overall density. Evidence-based design (EBD) continues to serve as the theoretical foundation of the field, while emerging themes such as healing environments, biophilic design, and patient-centered spatial strategies have become major research hotspots. Increasingly, the field reflects deeper integration across disciplines, including architecture, medicine, nursing, and environmental science. This study provides a clearer picture of the developmental trajectory, knowledge base, and future directions of hospital design research, offering systematic insights and theoretical guidance for both scholars and practitioners. Full article

(This article belongs to the Special Issue Data Analytics Applications for Architecture and Construction)

► Show Figures

Figure 1

Search Results (741)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (741)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI