Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (845)

Search Parameters:
Keywords = multi-agent environment

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
22 pages, 1766 KB  
Article
A Leader-Assisted Decentralized Adaptive Formation Method for UAV Swarms Integrating a Pre-Trained Semantic Broadcast Communication Model
by Xing Xu, Bo Zhang and Rongpeng Li
Drones 2025, 9(10), 681; https://doi.org/10.3390/drones9100681 - 30 Sep 2025
Abstract
Multiple unmanned aerial vehicle (UAV) systems have attracted considerable research interest due to their broad applications, such as formation control. However, decentralized UAV formation faces challenges stemming from limited local observations, which may lead to consistency conflicts, and excessive communication. To address these [...] Read more.
Multiple unmanned aerial vehicle (UAV) systems have attracted considerable research interest due to their broad applications, such as formation control. However, decentralized UAV formation faces challenges stemming from limited local observations, which may lead to consistency conflicts, and excessive communication. To address these issues, this paper proposes SemanticBC-DecAF, a decentralized adaptive formation (DecAF) framework under a leader–follower architecture, incorporating a semantic broadcast communication (SemanticBC) mechanism. The framework consists of three modules: (1) a proximal policy optimization (PPO)-based semantic broadcast module, where the leader UAV transmits semantically encoded global obstacle images to followers to enhance their perception; (2) a YOLOv5-based detection and position estimation module, enabling followers to infer obstacle locations from recovered images; and (3) a multi-agent proximal policy optimization (MAPPO)-based formation module, which fuses global and local observations to achieve adaptive formation and obstacle avoidance. Experiments in the multi-agent simulation environment MPE show that the proposed framework significantly improves global perception and formation efficiency compared with methods that rely on local observations. Full article
(This article belongs to the Section Artificial Intelligence in Drones (AID))
25 pages, 5512 KB  
Review
Histone Deacetylases in Neurodegenerative Diseases and Their Potential Role as Therapeutic Targets: Shedding Light on Astrocytes
by Pedro de Sena Murteira Pinheiro, Luan Pereira Diniz, Lucas S. Franco, Michele Siqueira and Flávia Carvalho Alcantara Gomes
Pharmaceuticals 2025, 18(10), 1471; https://doi.org/10.3390/ph18101471 - 30 Sep 2025
Abstract
Histone deacetylases (HDACs) are crucial enzymes involved in the regulation of gene expression through chromatin remodeling, impacting numerous cellular processes, including cell proliferation, differentiation, and survival. In recent years, HDACs have emerged as therapeutic targets for neurodegenerative diseases (NDDs), such as Alzheimer’s disease, [...] Read more.
Histone deacetylases (HDACs) are crucial enzymes involved in the regulation of gene expression through chromatin remodeling, impacting numerous cellular processes, including cell proliferation, differentiation, and survival. In recent years, HDACs have emerged as therapeutic targets for neurodegenerative diseases (NDDs), such as Alzheimer’s disease, Parkinson’s disease, and Huntington’s disease, given their role in modulating neuronal plasticity, neuroinflammation, and neuronal survival. HDAC inhibitors (HDACi) are small molecules that prevent the deacetylation of histones, thereby promoting a more relaxed chromatin structure and enhancing gene expression associated with neuroprotective pathways. Preclinical and clinical studies have demonstrated that HDACi can mitigate neurodegeneration, reduce neuroinflammatory markers, and improve cognitive and motor functions, positioning them as promising therapeutic agents for NDDs. Given the complexity and multifactorial nature of NDDs, therapeutic success will likely depend on multi-target drugs as well as new cellular and molecular therapeutic targets. Emerging evidence suggests that HDACi can modulate the function of astrocytes, a glial cell type critically involved in neuroinflammation, synaptic regulation, and the progression of neurodegenerative diseases. Consequently, HDACi targeting astrocytic pathways represent a novel approach in NDDs therapy. By modulating HDAC activity specifically in astrocytes, these inhibitors may attenuate pathological inflammation and promote a neuroprotective environment, offering a complementary strategy to neuron-focused treatments. This review aims to provide an overview of HDACs and HDACi in the context of neurodegeneration, emphasizing their molecular mechanisms, therapeutic potential, and limitations. Additionally, it explores the emerging role of astrocytes as targets for HDACi, proposing that this glial cell type could enhance the efficacy of HDACs-targeted therapies in NDD management. Full article
Show Figures

Figure 1

23 pages, 671 KB  
Article
The Impact of the Organization on the Autonomy of Agents
by Zouheyr Tamrabet, Djamel Nessah, Toufik Marir, Varun Gupta and Farid Mokhati
Information 2025, 16(10), 838; https://doi.org/10.3390/info16100838 - 27 Sep 2025
Abstract
In multi-agent systems (MAS), autonomy is a fundamental characteristic that enables agents to operate independently and adaptively within complex environments. However, such characteristics may cause the system to fall into undesirable situations. On the one hand, purely autonomous agents are difficult to predict. [...] Read more.
In multi-agent systems (MAS), autonomy is a fundamental characteristic that enables agents to operate independently and adaptively within complex environments. However, such characteristics may cause the system to fall into undesirable situations. On the one hand, purely autonomous agents are difficult to predict. On the other hand, fully controlled agents lose many of their abilities. Therefore, control frameworks have been designed in the form of organizational architectures to help address the need for balance between purely autonomous and fully controlled agents. This paper investigates the impact of organization on the autonomy of the agents. To measure this impact, we propose a set of seven metrics (Behavioral Wealth (BW), Service Wealth (SW), Frequency of Service Searches per Time (FoSST), Frequency of Service Searches per Behavior (FoSSB), Number of Service Searches (NoSS), Number of Service Demands per Behavior (NoSDB), and Number of Provided Services per Demand (NoPSD)) and apply them to a case study implemented in two configurations: with and without organizational aspects. To model organizational aspects, we adopt the Agent–Group–Role (AGR) model, chosen for its structured approach to defining agent responsibilities and interactions. The findings of this study show that the organizational aspects reduce the communication load and enhance the effectiveness of agents. Full article
Show Figures

Figure 1

26 pages, 9360 KB  
Article
Multi-Agent Hierarchical Reinforcement Learning for PTZ Camera Control and Visual Enhancement
by Zhonglin Yang, Huanyu Liu, Hao Fang, Junbao Li and Yutong Jiang
Electronics 2025, 14(19), 3825; https://doi.org/10.3390/electronics14193825 - 26 Sep 2025
Abstract
Border surveillance, as a critical component of national security, places increasingly stringent demands on the target perception capabilities of video monitoring systems, especially in wide-area and complex environments. To address the limitations of existing systems in low-confidence target detection and multi-camera collaboration, this [...] Read more.
Border surveillance, as a critical component of national security, places increasingly stringent demands on the target perception capabilities of video monitoring systems, especially in wide-area and complex environments. To address the limitations of existing systems in low-confidence target detection and multi-camera collaboration, this paper proposes a novel visual enhancement method for cooperative control of multiple PTZ (Pan–Tilt–Zoom) cameras based on hierarchical reinforcement learning. The proposed approach establishes a hierarchical framework composed of a Global Planner Agent (GPA) and multiple Local Executor Agents (LEAs). The GPA is responsible for global target assignment, while the LEAs perform fine-grained visual enhancement operations based on the assigned targets. To effectively model the spatial relationships among multiple targets and the perceptual topology of the cameras, a graph-based joint state space is constructed. Furthermore, a graph neural network is employed to extract high-level features, enabling efficient information sharing and collaborative decision-making among cameras. Experimental results in simulation environments demonstrate the superiority of the proposed method in terms of target coverage and visual enhancement performance. Hardware experiments further validate the feasibility and robustness of the approach in real-world scenarios. This study provides an effective solution for multi-camera cooperative surveillance in complex environments. Full article
(This article belongs to the Section Artificial Intelligence)
Show Figures

Figure 1

25 pages, 2088 KB  
Systematic Review
A Systematic Review of Generative AI in K–12: Mapping Goals, Activities, Roles, and Outcomes via the 3P Model
by Xiaoling Lin and Hao Tan
Systems 2025, 13(10), 840; https://doi.org/10.3390/systems13100840 - 25 Sep 2025
Abstract
Generative AI is reshaping k–12 learning as a multi-agent system in which goals, activities, and roles co-evolve across formal and informal environments. Following PRISMA and appraising quality with MMAT, we synthesize 84 peer-reviewed empirical studies (2020–2025) involving learners aged 3–18. Using Biggs’s 3P [...] Read more.
Generative AI is reshaping k–12 learning as a multi-agent system in which goals, activities, and roles co-evolve across formal and informal environments. Following PRISMA and appraising quality with MMAT, we synthesize 84 peer-reviewed empirical studies (2020–2025) involving learners aged 3–18. Using Biggs’s 3P model as a systems lens and embedding CIMO logic, we code learning objectives, activity designs, AI role paradigms, and outcomes. Seven recurring objectives emerge (language/literacy; STEM; creativity; socioemotional skills; feedback literacy and self-regulation; motivation; AI literacy). Five dominant activity patterns are identified: dialogic tutoring and formative feedback, generative iterative co-creation, project-based problem-solving, simulation/game-based learning, and assessment support. Across studies, AI roles shift from AI-directed to AI-supported/empowered, re-allocating agency among students, teachers, and caregivers via feedback loops. Reported outcomes span three categories—epistemic, practice, and affective/identity—with opportunities of deeper knowledge, improved practice, and stronger engagement, and risks of hallucinations, reduced originality, over-reliance, motivational loss, and ethical concerns. We propose a goal–activity–role alignment heuristic for instructional design, plus safeguards around teacher professional development, feedback literacy, and ethics. We call for longitudinal and cross-cultural research to evaluate the impacts of GenAI in k–12. Full article
Show Figures

Figure 1

23 pages, 2219 KB  
Article
Research on Decision-Making Strategies for Multi-Agent UAVs in Island Missions Based on Rainbow Fusion MADDPG Algorithm
by Chaofan Yang, Bo Zhang, Meng Zhang, Qi Wang and Peican Zhu
Drones 2025, 9(10), 673; https://doi.org/10.3390/drones9100673 - 25 Sep 2025
Abstract
To address the limitations of the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm in autonomous control tasks including low convergence efficiency, poor training stability, inadequate adaptability of confrontation strategies, and challenges in handling sparse reward tasks—this paper proposes an enhanced algorithm by integrating [...] Read more.
To address the limitations of the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm in autonomous control tasks including low convergence efficiency, poor training stability, inadequate adaptability of confrontation strategies, and challenges in handling sparse reward tasks—this paper proposes an enhanced algorithm by integrating the Rainbow module. The proposed algorithm improves long-term reward optimization through prioritized experience replay (PER) and multi-step TD updating mechanisms. Additionally, a dynamic reward allocation strategy is introduced to enhance the collaborative and adaptive decision-making capabilities of agents in complex adversarial scenarios. Furthermore, behavioral cloning is employed to accelerate convergence during the initial training phase. Extensive experiments are conducted on the MaCA simulation platform for 5 vs. 5 to 10 vs. 10 UAV island capture missions. The results demonstrate that the Rainbow-MADDPG outperforms the original MADDPG in several key metrics: (1) The average reward value improves across all confrontation scales, with notable enhancements in 6 vs. 6 and 7 vs. 7 tasks, achieving reward values of 14, representing 6.05-fold and 2.5-fold improvements over the baseline, respectively. (2) The convergence speed increases by 40%. (3) The combat effectiveness preservation rate doubles that of the baseline. Moreover, the algorithm achieves the highest average reward value in quasi-rectangular island scenarios, demonstrating its strong adaptability to large-scale dynamic game environments. This study provides an innovative technical solution to address the challenges of strategy stability and efficiency imbalance in multi-agent autonomous control tasks, with significant application potential in UAV defense, cluster cooperative tasks, and related fields. Full article
(This article belongs to the Section Artificial Intelligence in Drones (AID))
Show Figures

Figure 1

11 pages, 695 KB  
Article
Group Attention Aware Coordination Graph
by Ziyan Fang, Wei Liu and Yu Zhang
Appl. Sci. 2025, 15(19), 10355; https://doi.org/10.3390/app151910355 - 24 Sep 2025
Viewed by 99
Abstract
Cooperative Multi-Agent Reinforcement Learning (MARL) relies on effective coordination among agents to maximize team performance in complex environments. However, existing coordination graph-based approaches often overlook dynamic group structures and struggle to accurately capture fine-grained inter-agent dependencies. In this paper, we introduce a novel [...] Read more.
Cooperative Multi-Agent Reinforcement Learning (MARL) relies on effective coordination among agents to maximize team performance in complex environments. However, existing coordination graph-based approaches often overlook dynamic group structures and struggle to accurately capture fine-grained inter-agent dependencies. In this paper, we introduce a novel method called the Group Attention Aware Coordination Graph (G2ACG), which builds upon the group modeling capabilities of the Group-Aware Coordination Graph (GACG). G2ACG incorporates a dynamic attention mechanism to dynamically compute edge weights in the coordination graph, enabling a more flexible and fine-grained representation of agent interactions. These learned edge weights guide a Graph Attention Network (GAT) to perform message passing and representation learning, and the resulting features are integrated into a global policy via QMIX for cooperative decision-making. Experimental results on the StarCraft II Multi-Agent Challenge (SMAC) benchmark show that G2ACG consistently outperforms strong baselines, including QMIX, DICG, and GACG, across various scenarios with diverse agent types and population sizes. Ablation studies further confirm the effectiveness of the proposed attention mechanism, demonstrating that both the number of attention heads and the number of GAT layers significantly affect performance, with a two-layer GAT and multi-head attention configuration yielding the best results. Full article
Show Figures

Figure 1

46 pages, 3090 KB  
Review
Toward Autonomous UAV Swarm Navigation: A Review of Trajectory Design Paradigms
by Kaleem Arshid, Ali Krayani, Lucio Marcenaro, David Martin Gomez and Carlo Regazzoni
Sensors 2025, 25(18), 5877; https://doi.org/10.3390/s25185877 - 19 Sep 2025
Viewed by 470
Abstract
The development of efficient and reliable trajectory-planning strategies for swarms of unmanned aerial vehicles (UAVs) is an increasingly important area of research, with applications in surveillance, search and rescue, smart agriculture, defence operations, and communication networks. This article provides a comprehensive and critical [...] Read more.
The development of efficient and reliable trajectory-planning strategies for swarms of unmanned aerial vehicles (UAVs) is an increasingly important area of research, with applications in surveillance, search and rescue, smart agriculture, defence operations, and communication networks. This article provides a comprehensive and critical review of the various techniques available for UAV swarm trajectory planning, which can be broadly categorised into three main groups: traditional algorithms, biologically inspired metaheuristics, and modern artificial intelligence (AI)-based methods. The study examines cutting-edge research, comparing key aspects of trajectory planning, including computational efficiency, scalability, inter-UAV coordination, energy consumption, and robustness in uncertain environments. The strengths and weaknesses of these algorithms are discussed in detail, particularly in the context of collision avoidance, adaptive decision making, and the balance between centralised and decentralised control. Additionally, the review highlights hybrid frameworks that combine the global optimisation power of bio-inspired algorithms with the real-time adaptability of AI-based approaches, aiming to achieve an effective exploration–exploitation trade-off in multi-agent environments. Lastly, the article addresses the major challenges in UAV swarm trajectory planning, including multidimensional trajectory spaces, nonlinear dynamics, and real-time adaptation. It also identifies promising directions for future research. This study serves as a valuable resource for researchers, engineers, and system designers working to develop UAV swarms for real-world, integrated, intelligent, and autonomous missions. Full article
(This article belongs to the Special Issue Intelligent Sensor Systems in Unmanned Aerial Vehicles)
Show Figures

Figure 1

23 pages, 11963 KB  
Article
CIRS: A Multi-Agent Machine Learning Framework for Real-Time Accident Detection and Emergency Response
by Sadaf Ayesha, Aqsa Aslam, Muhammad Hassan Zaheer and Muhammad Burhan Khan
Sensors 2025, 25(18), 5845; https://doi.org/10.3390/s25185845 - 19 Sep 2025
Viewed by 552
Abstract
Road traffic accidents remain a leading cause of fatalities worldwide, and the consequences are considerably worsened by delayed detection and emergency response. Although several machine learning-based approaches have been proposed, accident detection systems are not widely deployed, and most existing solutions fail to [...] Read more.
Road traffic accidents remain a leading cause of fatalities worldwide, and the consequences are considerably worsened by delayed detection and emergency response. Although several machine learning-based approaches have been proposed, accident detection systems are not widely deployed, and most existing solutions fail to handle the growing complexity of modern traffic environments. This study introduces Collaborative Intelligence for Road Safety (CIRS), a novel, multi-agent, machine-learning-based framework designed for real-time accident detection, semantic scene understanding, and coordinated emergency response. Each agent in CIRS is designed for a distinct role perception, classification, description, localization, and decision-making, working collaboratively to enhance situational awareness and response efficiency. These agents integrate advanced models: YOLOv11 for high-accuracy accident detection and VideoLLaMA3 for contextual-rich scene description. CIRS bridges the gap between low-level visual analysis and high-level situational awareness. Extensive evaluation on a custom dataset comprising (5200 accident, 4800 nonaccident) frames demonstrates the effectiveness of the proposed approach. YOLOv11 achieves a top-1 accuracy of 86.5% and a perfect top-5 accuracy of 100%, ensuring reliable real-time detection. VideoLLaMA3 outperforms other vision-language models with superior factual accuracy and fewer hallucinations, generating strong results in the metrics of BLEU (0.0755), METEOR (0.2258), and ROUGE-L (0.3625). The decentralized multi-agent architecture of CIRS enables scalability, reduced latency, and the timely dispatch of emergency services while minimizing false positives. Full article
(This article belongs to the Section Intelligent Sensors)
Show Figures

Figure 1

19 pages, 390 KB  
Review
Virulence Regulation in Borrelia burgdorferi
by Sierra George and Zhiming Ouyang
Microorganisms 2025, 13(9), 2183; https://doi.org/10.3390/microorganisms13092183 - 18 Sep 2025
Viewed by 548
Abstract
Borrelia burgdorferi, the causative agent of Lyme disease, is the most common vector-borne disease in the United States. Compared with other bacterial pathogens, B. burgdorferi has many unique features. For instance, its highly segmented genome was predicted to encode very few proteins directly [...] Read more.
Borrelia burgdorferi, the causative agent of Lyme disease, is the most common vector-borne disease in the United States. Compared with other bacterial pathogens, B. burgdorferi has many unique features. For instance, its highly segmented genome was predicted to encode very few proteins directly dedicated to gene expression regulation. Yet, the spirochete continuously reprograms its transcriptome and proteome to promote survival and pathogenesis as spirochetes traverse the enzootic lifecycle between ticks and mammals. Signal sensing systems, a unique alternative sigma factor cascade, and multi-functional regulators work in concert to coordinate virulence gene expression under different tick and mammal environments. In this review, we have summarized recent advances in gene regulation in B. burgdorferi. Full article
(This article belongs to the Special Issue Ticks, Tick Microbiome and Tick-Borne Diseases)
32 pages, 3609 KB  
Article
BPMN-Based Design of Multi-Agent Systems: Personalized Language Learning Workflow Automation with RAG-Enhanced Knowledge Access
by Hedi Tebourbi, Sana Nouzri, Yazan Mualla, Meryem El Fatimi, Amro Najjar, Abdeljalil Abbas-Turki and Mahjoub Dridi
Information 2025, 16(9), 809; https://doi.org/10.3390/info16090809 - 17 Sep 2025
Viewed by 350
Abstract
The intersection of Artificial Intelligence (AI) and education is revolutionizing learning and teaching in this digital era, with Generative AI and large language models (LLMs) providing even greater possibilities for the future. The digital transformation of language education demands innovative approaches that combine [...] Read more.
The intersection of Artificial Intelligence (AI) and education is revolutionizing learning and teaching in this digital era, with Generative AI and large language models (LLMs) providing even greater possibilities for the future. The digital transformation of language education demands innovative approaches that combine pedagogical rigor with explainable AI (XAI) principles, particularly for low-resource languages. This paper presents a novel methodology that integrates Business Process Model and Notation (BPMN) with Multi-Agent Systems (MAS) to create transparent, workflow-driven language tutors. Our approach uniquely embeds XAI through three mechanisms: (1) BPMN’s visual formalism that makes agent decision-making auditable, (2) Retrieval-Augmented Generation (RAG) with verifiable knowledge provenance from textbooks of the National Institute of Languages of Luxembourg, and (3) human-in-the-loop validation of both content and pedagogical sequencing. To ensure realism in learner interaction, we integrate speech-to-text and text-to-speech technologies, creating an immersive, human-like learning environment. The system simulates intelligent tutoring through agents’ collaboration and dynamic adaptation to learner progress. We demonstrate this framework through a Luxembourgish language learning platform where specialized agents (Conversational, Reading, Listening, QA, and Grammar) operate within BPMN-modeled workflows. The system achieves high response faithfulness (0.82) and relevance (0.85) according to RAGA metrics, while speech integration using Whisper STT and Coqui TTS enables immersive practice. Evaluation with learners showed 85.8% satisfaction with contextual responses and 71.4% engagement rates, confirming the effectiveness of our process-driven approach. This work advances AI-powered language education by showing how formal process modeling can create pedagogically coherent and explainable tutoring systems. The architecture’s modularity supports extension to other low-resource languages while maintaining the transparency critical for educational trust. Future work will expand curriculum coverage and develop teacher-facing dashboards to further improve explainability. Full article
(This article belongs to the Section Information Applications)
Show Figures

Figure 1

25 pages, 4653 KB  
Article
Research on Formation Recovery Strategy for UAV Swarms Based on IVYA-Nash Algorithm
by Junfang Li, Zexin Gu, Lei Zhang and Junchi Wang
Electronics 2025, 14(18), 3653; https://doi.org/10.3390/electronics14183653 - 15 Sep 2025
Viewed by 254
Abstract
Contemporary multi-UAV formations face dual challenges of obstacle avoidance and rapid formation recovery. To enable UAV swarms to efficiently restore their predefined configurations post-obstacle navigation, a formation recovery strategy grounded in Nash equilibrium game theory is proposed in this paper. By integrating the [...] Read more.
Contemporary multi-UAV formations face dual challenges of obstacle avoidance and rapid formation recovery. To enable UAV swarms to efficiently restore their predefined configurations post-obstacle navigation, a formation recovery strategy grounded in Nash equilibrium game theory is proposed in this paper. By integrating the IVY optimization algorithm, a collaborative control model that systematically balances individual UAV interests with swarm-level objectives through carefully designed optimization criteria is established. Comparative experimental results demonstrate that, compared to traditional formation obstacle-avoidance algorithms, Improved Particle Swarm Optimization (IPSO), Ant Colony Optimization (ACO), and Genetic Algorithm (GA), our method exhibits superior performance across multiple key metrics, including average path length, formation accuracy rate, recovery time, and total time consumption. Real-flight tests on a multi-UAV platform confirm IVYA-Nash surpasses improved APF in formation accuracy and aerodynamic disturbance resistance, proving robustness in dynamic multi-agent scenarios. The work provides an efficient and reliable solution for coordinated control of UAV formations in complex environments. Full article
Show Figures

Figure 1

30 pages, 2061 KB  
Article
A Feature-Aware Elite Imitation MARL for Multi-UAV Trajectory Optimization in Mountain Terrain Detection
by Quanxi Zhou, Ye Tao, Qianxiao Su and Manabu Tsukada
Drones 2025, 9(9), 645; https://doi.org/10.3390/drones9090645 - 15 Sep 2025
Viewed by 494
Abstract
With the advancement of UAV trajectory planning and sensing technologies, unmanned aerial vehicles (UAVs) are now capable of performing high-performance ground detection and search tasks. Mountainous regions, due to their complex terrain, have long been a focal point in the field of remote [...] Read more.
With the advancement of UAV trajectory planning and sensing technologies, unmanned aerial vehicles (UAVs) are now capable of performing high-performance ground detection and search tasks. Mountainous regions, due to their complex terrain, have long been a focal point in the field of remote sensing. Effective UAV search tasks in such areas must consider not only horizontal coverage but also variations in detection range and angle caused by changes in elevation. Conventional algorithms typically require complete prior knowledge of the environment for trajectory optimization and often depend on scenario-specific policy models, limiting their generalizability. To address these challenges, this paper proposes a Feature-Aware Elite Imitation Multi-Agent Reinforcement Learning (FA-EIMARL) algorithm that leverages partial terrain information to construct a feature extraction network. This approach enables batch training across diverse terrains without the need for full environmental maps. In addition, an elite imitation mechanism has been proposed for convergence acceleration and task performance enhancement. Simulation results demonstrate that the proposed method achieves superior reward performance, convergence rate, and computational efficiency while maintaining strong adaptability to varying terrains. Full article
Show Figures

Figure 1

22 pages, 1167 KB  
Article
CaST-MASAC: Integrating Causal Inference and Spatio-Temporal Attention for Multi-UAV Cooperative Task Planning
by Renjie Chen and Feng Hu
Drones 2025, 9(9), 644; https://doi.org/10.3390/drones9090644 - 14 Sep 2025
Viewed by 325
Abstract
The efficient coordination of multi-Unmanned Aerial Vehicle (UAV) systems in the increasingly complex domain of aerial tasks is hampered by significant challenges, including partial observability, low sample efficiency, and difficulties in inter-agent coordination. To address these issues, this paper introduces a novel Causal [...] Read more.
The efficient coordination of multi-Unmanned Aerial Vehicle (UAV) systems in the increasingly complex domain of aerial tasks is hampered by significant challenges, including partial observability, low sample efficiency, and difficulties in inter-agent coordination. To address these issues, this paper introduces a novel Causal Spatio-Temporal Multi-Agent Soft Actor–Critic (CaST-MASAC) framework. At its core, CaST-MASAC integrates two key innovations: (1) a spatio-temporal attention (STa) module that extracts features from historical observations to enable accurate target trajectory prediction and dynamic task assignment, thereby enhancing situational awareness and collaborative decision-making in highly dynamic and partially observable environments; and (2) a Causal Inference Experience Replay (CIER) mechanism that significantly improves sample efficiency and convergence speed by identifying and prioritizing experiences with a high causal impact on the task success. Evaluated in 4v4 and 2v2 multi-UAV aerial coordination simulation environments, CaST-MASAC demonstrates superior performance over state-of-the-art baselines such as MAPPO and QMIX in terms of task success rate, cumulative reward, and decision efficiency. Furthermore, extensive ablation studies validate the critical contributions of both the STa and CIER modules to the framework’s overall performance. Consequently, CaST-MASAC offers a novel and effective approach for developing robust and efficient multi-agent coordination strategies in complex dynamic environments. Full article
Show Figures

Figure 1

49 pages, 3594 KB  
Review
Model Predictive Control for Smart Buildings: Applications and Innovations in Energy Management
by Panagiotis Michailidis, Iakovos Michailidis, Federico Minelli, Hasan Huseyin Coban and Elias Kosmatopoulos
Buildings 2025, 15(18), 3298; https://doi.org/10.3390/buildings15183298 - 12 Sep 2025
Viewed by 805
Abstract
The integration of advanced control strategies into building energy management systems (BEMS) is essential for achieving energy efficiency and sustaining thermal comfort. Model predictive control (MPC) has gained significant traction as a model-based approach capable of optimizing control actions by predicting future system [...] Read more.
The integration of advanced control strategies into building energy management systems (BEMS) is essential for achieving energy efficiency and sustaining thermal comfort. Model predictive control (MPC) has gained significant traction as a model-based approach capable of optimizing control actions by predicting future system behavior under dynamic conditions. The current review offers an in-depth analysis of MPC, combining its core theoretical foundations with a broad survey of impactful applications in buildings, for extracting key breakthroughs and trends that have defined the field over the past decade. Emphasis is placed on multiverse MPC configurations and their application across various BEMS frameworks integrating HVACs, energy storage, renewable energy, domestic hot water, electric vehicle charging, and lighting systems. A detailed evaluation of MPC key attributes is then conducted, based on essential aspects of MPC, such as algorithms, optimization solvers, baselines, performance indexes, and building types, as well as simulation tools that support system modeling and real-time validation. The study concludes by outlining key research trends and proposing future directions, with a strong emphasis on addressing real-world deployment challenges and advancing scalable, interoperable solutions on smart building ecosystems. According to the evaluation, MPC research is shifting from simple white-box setups to gray- and black-box models paired with metaheuristic or hybrid solvers, leveraging machine learning for forecasting and multi-objective optimization, but still lacking robustness, benchmarks, and real-world validation. Consequently, next-generation MPC is anticipated to evolve into adaptive, hybrid, and multi-agent frameworks that integrate forecasting and control, embed occupant behavior, enable grid-interactive flexibility, and support lightweight, explainable deployment in real building environments. Full article
Show Figures

Figure 1

Back to TopTop