Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (553)

Search Parameters:
Keywords = 10-MDP

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
26 pages, 969 KB  
Article
Constructing Non-Markovian Decision Process via History Aggregator
by Yongyi Wang, Lingfeng Li and Wenxin Li
Appl. Sci. 2026, 16(2), 955; https://doi.org/10.3390/app16020955 - 16 Jan 2026
Viewed by 27
Abstract
In the domain of algorithmic decision-making, non-Markovian dynamics manifest as a significant impediment, especially for paradigms such as Reinforcement Learning (RL), thereby exerting far-reaching consequences on the advancement and effectiveness of the associated systems. Nevertheless, the existing benchmarks are deficient in comprehensively assessing [...] Read more.
In the domain of algorithmic decision-making, non-Markovian dynamics manifest as a significant impediment, especially for paradigms such as Reinforcement Learning (RL), thereby exerting far-reaching consequences on the advancement and effectiveness of the associated systems. Nevertheless, the existing benchmarks are deficient in comprehensively assessing the capacity of decision algorithms to handle non-Markovian dynamics. To address this deficiency, we have devised a generalized methodology grounded in category theory. Notably, we established the category of Markov Decision Processes (MDP) and the category of non-Markovian Decision Processes (NMDP), and proved the equivalence relationship between them. This theoretical foundation provides a novel perspective for understanding and addressing non-Markovian dynamics. We further introduced non-Markovianity into decision-making problem settings via the History Aggregator for State (HAS). With HAS, we can precisely control the state dependency structure of decision-making problems in the time series. Our analysis demonstrates the effectiveness of our method in representing a broad range of non-Markovian dynamics. This approach facilitates a more rigorous and flexible evaluation of decision algorithms by testing them in problem settings where non-Markovian dynamics are explicitly constructed. Full article
(This article belongs to the Special Issue Advances in Intelligent Decision-Making Systems)
19 pages, 6478 KB  
Article
An Intelligent Dynamic Cluster Partitioning and Regulation Strategy for Distribution Networks
by Keyan Liu, Kaiyuan He, Dongli Jia, Huiyu Zhan, Wanxing Sheng, Zukun Li, Yuxuan Huang, Sijia Hu and Yong Li
Energies 2026, 19(2), 384; https://doi.org/10.3390/en19020384 - 13 Jan 2026
Viewed by 150
Abstract
As distributed generators (DGs) and flexible adjustable loads (FALs) further penetrate distribution networks (DNs), to reduce regulation complexity compared with traditional centralized control frameworks, DGs and FALs in DNs should be packed in several clusters to enable their dispatch to become standard in [...] Read more.
As distributed generators (DGs) and flexible adjustable loads (FALs) further penetrate distribution networks (DNs), to reduce regulation complexity compared with traditional centralized control frameworks, DGs and FALs in DNs should be packed in several clusters to enable their dispatch to become standard in the industry. To mitigate the negative influence of DGs’ and FALs’ spatiotemporal distribution and uncertain output characteristics on dispatch, this paper proposes an intelligent dynamic cluster partitioning strategy for DNs, from which the DN’s resources and loads can be intelligently aggregated, organized, and regulated in a dynamic and optimal way with relatively high implementation efficiency. An environmental model based on the Markov decision process (MDP) technique is first developed for DN cluster partitioning, in which a continuous state space, a discrete action space, and a dispatching performance-oriented reward are designed. Then, a novel random forest Q-learning network (RF-QN) is developed to implement dynamic cluster partitioning by interacting with the proposed environmental model, from which the generalization and robust capability to estimate the Q-function can be improved by taking advantage of combining deep learning and decision trees. Finally, a modified IEEE-33-node system is adopted to verify the effectiveness of the proposed intelligent dynamic cluster partitioning and regulation strategy; the results also indicate that the proposed RF-QN is superior to the traditional deep Q-learning (DQN) model in terms of renewable energy accommodation rate, training efficiency, and portioning and regulation performance. Full article
(This article belongs to the Special Issue Advanced in Modeling, Analysis and Control of Microgrids)
Show Figures

Figure 1

19 pages, 38545 KB  
Article
Improving Dynamic Visual SLAM in Robotic Environments via Angle-Based Optical Flow Analysis
by Sedat Dikici and Fikret Arı
Electronics 2026, 15(1), 223; https://doi.org/10.3390/electronics15010223 - 3 Jan 2026
Viewed by 270
Abstract
Dynamic objects present a major challenge for visual simultaneous localization and mapping (Visual SLAM), as feature measurements originating from moving regions can corrupt camera pose estimation and lead to inaccurate maps. In this paper, we propose a lightweight, semantic-free front-end enhancement for ORB-SLAM [...] Read more.
Dynamic objects present a major challenge for visual simultaneous localization and mapping (Visual SLAM), as feature measurements originating from moving regions can corrupt camera pose estimation and lead to inaccurate maps. In this paper, we propose a lightweight, semantic-free front-end enhancement for ORB-SLAM that detects and suppresses dynamic features using optical flow geometry. The key idea is to estimate a global motion direction point (MDP) from optical flow vectors and to classify feature points based on their angular consistency with the camera-induced motion field. Unlike magnitude-based flow filtering, the proposed strategy exploits the geometric consistency of optical flow with respect to a motion direction point, providing robustness not only to depth variation and camera speed changes but also to different camera motion patterns, including pure translation and pure rotation. The method is integrated into the ORB-SLAM front-end without modifying the back-end optimization or cost function. Experiments on public dynamic-scene datasets demonstrate that the proposed approach reduces absolute trajectory error by up to approximately 45% compared to baseline ORB-SLAM, while maintaining real-time performance on a CPU-only platform. These results indicate that reliable dynamic feature suppression can be achieved without semantic priors or deep learning models. Full article
(This article belongs to the Section Computer Science & Engineering)
Show Figures

Figure 1

25 pages, 3099 KB  
Article
Research on Improved PPO-Based Unmanned Surface Vehicle Trajectory Tracking Control Integrated with Pure Pursuit Guidance
by Hongyu Li, Runyu Yang, Yu Zhang, Yicheng Wen, Qunhong Tian, Weizhuang Ma, Zongsheng Wang and Shaobo Yang
J. Mar. Sci. Eng. 2026, 14(1), 70; https://doi.org/10.3390/jmse14010070 - 30 Dec 2025
Viewed by 191
Abstract
To address the low trajectory tracking accuracy and limited robustness of conventional reinforcement learning algorithms under complex marine environments involving wind, wave, and current disturbances, this study proposes a proximal policy optimization (PPO) algorithm incorporating an intrinsic curiosity mechanism to solve the unmanned [...] Read more.
To address the low trajectory tracking accuracy and limited robustness of conventional reinforcement learning algorithms under complex marine environments involving wind, wave, and current disturbances, this study proposes a proximal policy optimization (PPO) algorithm incorporating an intrinsic curiosity mechanism to solve the unmanned surface vehicle (USV) trajectory tracking control problem. The proposed approach is developed on the basis of a three-degree-of-freedom (3-DOF) USV model and formulated within a Markov decision process (MDP) framework, where a multidimensional state space and a continuous action space are defined, and a multi-objective composite reward function is designed. By incorporating a pure pursuit guidance algorithm, the complexity of engineering implementation is reduced. Furthermore, an improved PPO algorithm integrated with an intrinsic curiosity mechanism is adopted as the trajectory tracking controller, in which the exploration incentives provided by the intrinsic curiosity module (ICM) guide the agent to explore the state space efficiently and converge rapidly to an optimal control policy. The final experimental results indicate that, compared with the conventional PPO algorithm, the improved PPO–ICM controller achieves a reduction of 54.2% in average lateral error and 47.1% in average heading error under simple trajectory conditions. Under the complex trajectory condition, the average lateral error and average heading error are reduced by 91.8% and 41.9%, respectively. These results effectively demonstrate that the proposed PPO–ICM algorithm attains high tracking accuracy and strong generalization capability across different trajectory scenarios, and can provide a valuable reference for the application of intelligent control algorithms in the USV domain. Full article
Show Figures

Figure 1

19 pages, 1927 KB  
Article
Plasma Metabolomics Reveals Systemic Metabolic Remodeling in Early-Lactation Dairy Cows Fed a Fusarium-Contaminated Diet and Supplemented with a Mycotoxin-Deactivating Product
by Gabriele Rocchetti, Alessandro Catellani, Marco Lapris, Nicole Reisinger, Johannes Faas, Ignacio Artavia, Silvia Labudova, Erminio Trevisi and Antonio Gallo
Toxins 2026, 18(1), 9; https://doi.org/10.3390/toxins18010009 - 22 Dec 2025
Viewed by 358
Abstract
This study investigated the systemic metabolic effects of feeding a Fusarium-contaminated diet to early-lactation Holstein cows, with or without a mycotoxin-deactivating product (MDP; Mycofix® Plus, BIOMIN Holding GmbH, Tulln, Austria). Thirty cows were divided into three dietary groups: a mildly contaminated [...] Read more.
This study investigated the systemic metabolic effects of feeding a Fusarium-contaminated diet to early-lactation Holstein cows, with or without a mycotoxin-deactivating product (MDP; Mycofix® Plus, BIOMIN Holding GmbH, Tulln, Austria). Thirty cows were divided into three dietary groups: a mildly contaminated control (CTR), a moderately contaminated diet containing zearalenone and deoxynivalenol (MTX), and the same contaminated diet supplemented with MDP. Plasma collected at 56 days in milk was analyzed by untargeted ultra-high-performance liquid chromatography (UHPLC) coupled with high-resolution mass spectrometry (HRMS), and multivariate models identified discriminant metabolites and pathways. MTX-fed cows showed alterations in sphingolipid metabolism, including accumulation of ceramide (t18:0/16:0), lactosylceramide, and sphinganine 1-phosphate, consistent with ceramide synthase inhibition and lipid remodeling stress. Increases in estradiol, estrone, and cholesterol sulfate suggested endocrine disruption, while elevated 8-oxo-dGMP indicated oxidative DNA damage. MDP supplementation mitigated these alterations, reducing sphingolipid intermediates, modulating tryptophan and glycerophospholipid pathways, and lowering oxidative stress markers. Metabolites such as riboflavin, pipecolic acid, and N-acetylserotonin could be likely associated with an improved mitochondrial function and redox homeostasis, although future studies are required to confirm this hypothesis. Additionally, MDP-fed cows exhibited distinct shifts in pyrimidine and nucleotide metabolism. Overall, MDP effectively counteracted Fusarium-related metabolic disturbances, supporting its protective role in maintaining lipid balance, hormonal stability, oxidative control, and metabolic resilience. Full article
(This article belongs to the Special Issue Strategies for Mitigating Mycotoxin Contamination in Food and Feed)
Show Figures

Figure 1

14 pages, 2217 KB  
Article
Utility of Quantitative and Semi-Quantitative SPECT/CT Metrics in Differentiating Mueller–Weiss Syndrome
by Yi-Ching Lin, Shih-Chuan Tsai, Chia-Hung Kao and Shun-Ping Wang
Diagnostics 2026, 16(1), 18; https://doi.org/10.3390/diagnostics16010018 - 20 Dec 2025
Viewed by 292
Abstract
Background/Objectives: Mueller–Weiss syndrome (MWS) is a rare condition characterized by spontaneous adult-onset osteonecrosis of the navicular bone. This study aimed to assess the diagnostic value of quantitative and semi-quantitative standardized uptake value (SUV) measurements on Tc-99m MDP SPECT/CT for differentiating MWS from other [...] Read more.
Background/Objectives: Mueller–Weiss syndrome (MWS) is a rare condition characterized by spontaneous adult-onset osteonecrosis of the navicular bone. This study aimed to assess the diagnostic value of quantitative and semi-quantitative standardized uptake value (SUV) measurements on Tc-99m MDP SPECT/CT for differentiating MWS from other foot pathologies. Methods: We retrospectively reviewed 21 MWS patients who underwent SPECT/CT and compared them with 10 feet from 5 non-MWS patients as controls. MWS severity was staged using the Maceira classification. Volumes of interest (VOIs) were defined in the lateral navicular and distal tibia. SUVmax values were measured for the navicular bone (N), tibial metaphysis (Tm), and diaphysis (Td). Uptake ratios (N/Tm and N/Td) were calculated for semi-quantitative comparison. Results: MWS patients showed significantly higher SUVmax in the navicular compared with controls (9.2 vs. 1.5, p < 0.001). Both N/Tm and N/Td ratios were also significantly elevated (p < 0.001). SUVmax and uptake ratios positively correlated with Maceira stage and visual navicular uptake intensity. Diagnostic thresholds of N SUVmax > 3.77 (AUC = 0.93), N/Tm > 1.139 (AUC = 0.95), and N/Td > 0.93 (AUC = 0.93) effectively distinguished MWS from non-MWS cases. Conclusions: Quantitative and semi-quantitative SUV analysis on SPECT/CT offers a reliable tool for diagnosing MWS and evaluating disease severity. Semi-quantitative ratios, by normalizing metabolic variability, provide a practical and reproducible alternative to absolute SUV measurements for early detection and treatment planning in MWS. Full article
(This article belongs to the Section Medical Imaging and Theranostics)
Show Figures

Figure 1

23 pages, 2132 KB  
Article
Novel Multistage Subunit Mycobacterium tuberculosis Nanoparticle Vaccine Confers Protection Against Experimental Infection in Prophylactic and Therapeutic Regimens
by Amir I. Tukhvatulin, Alina S. Dzharullaeva, Daria V. Vasina, Mikhail V. Fursov, Fatima M. Izhaeva, Denis A. Kleymenov, Dmitry N. Shcherbinin, Nikita B. Polyakov, Andrey I. Solovyev, Vladimir G. Zhukhovitsky, Alla S. Zhitkevich, Ilya V. Gordeychuk, Anna M. Litvinova, Victor A. Manuylov, Vasiliy D. Potapov, Artem P. Tkachuk, Vladimir A. Gushchin, Denis Y. Logunov and Alexander L. Gintsburg
Vaccines 2026, 14(1), 5; https://doi.org/10.3390/vaccines14010005 - 19 Dec 2025
Viewed by 505
Abstract
Background/Objectives: Tuberculosis (TB) remains the leading cause of death from a single infectious agent worldwide. In line with the World Health Organization’s (WHO) goal to end TB by 2035, the rapid development and clinical implementation of new, effective vaccines is urgently needed. [...] Read more.
Background/Objectives: Tuberculosis (TB) remains the leading cause of death from a single infectious agent worldwide. In line with the World Health Organization’s (WHO) goal to end TB by 2035, the rapid development and clinical implementation of new, effective vaccines is urgently needed. To support global TB control efforts, we developed a novel candidate subunit multistage vaccine. Methods: This vaccine incorporates multiple Mycobacterium tuberculosis antigens expressed during both dormant and active stages of infection, fused into a single recombinant protein (ESAT6-CFP10-Ag85A-Rv2660c-Rv1813c). The antigen was encapsulated in biodegradable poly(D,L-lactide-co-glycolide) (PLGA) nanoparticles along with the pattern recognition receptor (PRR) agonists monophosphoryl lipid A (MPLA) and muramyl dipeptide (MDP), which function as adjuvants. Results: Using a mixed intramuscular/nasal prime-boost regimen, the vaccine elicited a mixed Th1/Th17 cell-mediated immune response, as well as a robust humoral response characterized by sustained systemic IgG (lasting at least one year) and prominent local secretory IgA. The vaccine demonstrated protective efficacy as a prophylactic booster following BCG priming in both murine and guinea pig models and was also effective in a therapeutic setting in a murine infection model. Conclusions: The results of this study provide empirical evidence that multistage tuberculosis vaccines represent a promising strategy for achieving global TB control. Full article
Show Figures

Figure 1

25 pages, 821 KB  
Article
Enhancing Microservice Security Through Adaptive Moving Target Defense Policies to Mitigate DDoS Attacks in Cloud-Native Environments
by Yuyang Zhou, Guang Cheng and Kang Du
Future Internet 2025, 17(12), 580; https://doi.org/10.3390/fi17120580 - 16 Dec 2025
Viewed by 299
Abstract
Cloud-native microservice architectures offer scalability and resilience but introduce complex interdependencies and new attack surfaces, making them vulnerable to resource-exhaustion Distributed Denial-of-Service (DDoS) attacks. These attacks propagate along service call chains, closely mimic legitimate traffic, and evade traditional detection and mitigation techniques, resulting [...] Read more.
Cloud-native microservice architectures offer scalability and resilience but introduce complex interdependencies and new attack surfaces, making them vulnerable to resource-exhaustion Distributed Denial-of-Service (DDoS) attacks. These attacks propagate along service call chains, closely mimic legitimate traffic, and evade traditional detection and mitigation techniques, resulting in cascading bottlenecks and degraded Quality of Service (QoS). Existing Moving Target Defense (MTD) approaches lack adaptive, cost-aware policy guidance and are often ineffective against spatiotemporally adaptive adversaries. To address these challenges, this paper proposes ScaleShield, an adaptive MTD framework powered by Deep Reinforcement Learning (DRL) that learns coordinated, attack-aware defense policies for microservices. ScaleShield formulates defense as a Markov Decision Process (MDP) over multi-dimensional discrete actions, leveraging a Multi-Dimensional Double Deep Q-Network (MD3QN) to optimize service availability and minimize operational overhead. Experimental results demonstrate that ScaleShield achieves near 100% defense success rates and reduces compromised nodes to zero within approximately 5 steps, significantly outperforming state-of-the-art baselines. It lowers service latency by up to 72% under dynamic attacks while maintaining over 94% resource efficiency, providing robust and cost-effective protection against resource-exhaustion DDoS attacks in cloud-native environments. Full article
(This article belongs to the Special Issue DDoS Attack Detection for Cyber–Physical Systems)
Show Figures

Figure 1

27 pages, 5486 KB  
Article
Multi-Objective Optimal Scheduling of Park-Level Integrated Energy System Based on Trust Region Policy Optimization Algorithm
by Deyuan Lu, Chongxiao Kou, Shutong Wang, Li Wang, Yongbo Wang and Yingjun Lv
Electronics 2025, 14(24), 4900; https://doi.org/10.3390/electronics14244900 - 12 Dec 2025
Viewed by 331
Abstract
In the context of dual-carbon goals, Park-Level Integrated Energy Systems (PIES) are pivotal for enhancing renewable energy integration and promoting clean, efficient energy use. However, the inherent non-linearity from multi-energy coupling and the high dimensionality of operational data present substantial challenges for conventional [...] Read more.
In the context of dual-carbon goals, Park-Level Integrated Energy Systems (PIES) are pivotal for enhancing renewable energy integration and promoting clean, efficient energy use. However, the inherent non-linearity from multi-energy coupling and the high dimensionality of operational data present substantial challenges for conventional scheduling optimization methods. To overcome these obstacles, this paper introduces a novel multi-objective scheduling framework for PIES leveraging deep reinforcement learning. We innovatively formulate the scheduling task as a Markov Decision Process (MDP) and employ the Trust Region Policy Optimization (TRPO) algorithm, which is adept at handling continuous action spaces. The state and action spaces are meticulously designed according to system constraints and user demands. A comprehensive reward function is then established to concurrently pursue three objectives: minimum operating cost, minimum carbon emissions, and maximum exergy efficiency. Through comparative analyses against other AI-based algorithms, our results demonstrate that the proposed method significantly lowers operating costs and carbon footprint while enhancing overall exergy efficiency. This validates the model’s effectiveness and superiority in addressing the complex multi-objective scheduling challenges inherent in modern energy systems. Full article
Show Figures

Figure 1

37 pages, 3373 KB  
Article
A DQN-Based Intelligent Voltage Control Framework for Enhancing Renewable Integration and Energy Sustainability in Wind-Penetrated Distribution Networks
by Ramesh Kumar Behara and Akshay Kumar Saha
Sustainability 2025, 17(24), 11164; https://doi.org/10.3390/su172411164 - 12 Dec 2025
Viewed by 333
Abstract
The increasing penetration of renewable energy resources is central to global sustainability and decarbonisation goals, yet it introduces intermittency and voltage instability in modern distribution networks. Ensuring stable operation while maximising renewable utilisation is critical for achieving long-term energy sustainability, reduced carbon emissions, [...] Read more.
The increasing penetration of renewable energy resources is central to global sustainability and decarbonisation goals, yet it introduces intermittency and voltage instability in modern distribution networks. Ensuring stable operation while maximising renewable utilisation is critical for achieving long-term energy sustainability, reduced carbon emissions, and efficient grid performance. This study proposes a sustainability-oriented, Reinforcement Learning (RL)-driven voltage control framework that enables reliable and energy-efficient operation of wind-integrated distribution systems. A Deep Q-Network (DQN) agent formulates voltage regulation as a Markov Decision Process (MDP) and autonomously learns optimal control policies for on-load tap changers (OLTCs) and capacitor banks under highly variable wind and load conditions. Using the IEEE 33-bus test system with realistic stochastic wind and ZIP-load models, the results show that the proposed controller maintains voltages within statutory limits, reduces total active power losses by up to 18%, and enhances the network’s capacity to host renewable energy. These improvements translate to increased energy efficiency, reduced technical losses, and greater operational resilience, key enablers of sustainable energy distribution. The findings demonstrate that intelligent RL-based frameworks offer a scalable and model-free tool for advancing sustainable, low-carbon, and resilient power systems. Full article
Show Figures

Figure 1

15 pages, 2080 KB  
Article
Hydrolyzed Milk-Derived Peptides Promote Erythropoietin Pathways and Hematologic Recovery: A Cross-Species Analysis
by Liqing Zang, Akira Yokota, Misa Nakai, Kazutake Fukada, Norihiro Nishimura and Yasuhito Shimada
Molecules 2025, 30(24), 4739; https://doi.org/10.3390/molecules30244739 - 11 Dec 2025
Viewed by 479
Abstract
Anemia, characterized by reduced hemoglobin (Hb), remains a major health concern. Although iron and erythropoietin (EPO) therapies are effective, limitations in safety and accessibility have prompted interest in nutritional alternatives. Hydrolyzed milk-derived peptides (H-MDPs) contain bioactive sequences with diverse physiological effects, yet their [...] Read more.
Anemia, characterized by reduced hemoglobin (Hb), remains a major health concern. Although iron and erythropoietin (EPO) therapies are effective, limitations in safety and accessibility have prompted interest in nutritional alternatives. Hydrolyzed milk-derived peptides (H-MDPs) contain bioactive sequences with diverse physiological effects, yet their role in erythropoiesis remains poorly defined. This study investigated the hematopoietic actions of H-MDP using zebrafish and mouse models. Adult zebrafish underwent phlebotomy-induced anemia and received oral H-MDP for 3 weeks. Hb levels, erythrocyte morphology, and expression of erythropoiesis- and iron-metabolism genes were assessed. In healthy mice, renal Epo expression, circulating EPO, and serum cytokines were measured after 2 weeks of H-MDP administration. H-MDP significantly accelerated Hb recovery in anemic zebrafish (4.6 ± 0.64 g/dL vs. 3.4 ± 0.66 g/dL in untreated fish at week 1) and markedly improved erythrocyte maturation. These effects coincided with strong induction of epo, hif1aa/b, igf1, csf1a, and csf3b in the heart and liver, as well as normalization of anemia-induced hepatic iron-transport genes (tfa, fpn1, tfr2) and reactivation of hamp. In mice, H-MDP elevated renal Epo mRNA and circulating EPO (approximately 2.3-fold) without altering steady-state Hb, and cytokine profiling with IPA-predicted activation of the erythropoietin signaling pathway. Collectively, these findings indicate that H-MDPs modulate erythropoiesis by coordinating the activation of EPO-related and iron-regulatory networks, supporting their potential as functional food ingredients for hematologic recovery and anemia management. Full article
(This article belongs to the Special Issue Small Fish Models for Molecular-Ethnopharmacology and Drug Discovery)
Show Figures

Figure 1

26 pages, 2916 KB  
Article
An Integrated Strategy for Intelligent Evasion and Navigation of Unmanned Aerial Vehicles in Multi-Obstacle Environments
by Tianya Liu, Fengshuo Wang and Peng Li
Aerospace 2025, 12(12), 1092; https://doi.org/10.3390/aerospace12121092 - 8 Dec 2025
Viewed by 307
Abstract
To enhance the navigation accuracy and obstacle avoidance capability of Unmanned Aerial Vehicles (UAVs) operating in dynamic multi-obstacle environments, this paper proposes an intelligent navigation and avoidance strategy based on deep reinforcement learning. First, a performance index is formulated by integrating miss distances [...] Read more.
To enhance the navigation accuracy and obstacle avoidance capability of Unmanned Aerial Vehicles (UAVs) operating in dynamic multi-obstacle environments, this paper proposes an intelligent navigation and avoidance strategy based on deep reinforcement learning. First, a performance index is formulated by integrating miss distances from multiple obstacles with energy consumption. An optimal avoidance strategy is then derived as an expert policy through a solution of the Riccati equation. Subsequently, a Markov Decision Process (MDP) model is constructed for UAV navigation and obstacle avoidance, incorporating a multi-objective reward function that simultaneously optimizes avoidance success rate, navigation accuracy, and energy efficiency. Furthermore, a hybrid learning architecture combining Generative Adversarial Imitation Learning (GAIL) with Proximal Policy Optimization (PPO) is designed and trained. Simulation results demonstrate that the proposed method achieves high training efficiency and enables robust decision-making in complex navigation scenarios. Full article
(This article belongs to the Section Aeronautics)
Show Figures

Figure 1

18 pages, 5613 KB  
Article
Condition-Based Maintenance Decision-Making for Multi-Component Systems with Integrated Dynamic Bayesian Network and Proportional Hazards Model
by Shizheng Li, Canjiong Yao, Pengfei Xu, Jinyan Guo, Guoqing Wang and Jing Tang
Appl. Sci. 2025, 15(23), 12793; https://doi.org/10.3390/app152312793 - 3 Dec 2025
Viewed by 376
Abstract
A condition-based maintenance decision-making framework for multi-component systems is proposed in this work by integrating dynamic Bayesian network (DBN) with proportional hazards model (PHM). The framework is designed to address the challenge of handling mixed failure types and complex failure dependencies, which often [...] Read more.
A condition-based maintenance decision-making framework for multi-component systems is proposed in this work by integrating dynamic Bayesian network (DBN) with proportional hazards model (PHM). The framework is designed to address the challenge of handling mixed failure types and complex failure dependencies, which often lead to inaccurate maintenance decisions in existing methods. In this integrated model, the DBN captures the failure evolution and both dynamic and static dependencies among components, while the PHM enhances the capability to characterize mixed failure interactions, thereby enabling the coverage of three common types of failure dependencies in multi-component systems. The model is formulated and solved using a finite-horizon Markov decision process (MDP), with the optimal maintenance strategy obtained by maximizing the total expected reward. Numerical case studies demonstrate the framework’s flexibility in handling mixed failures and complex dependencies, showing its potential to effectively support condition-based maintenance decision-making for complex multi-component systems. Full article
(This article belongs to the Section Applied Industrial Technologies)
Show Figures

Figure 1

36 pages, 1682 KB  
Article
Structural Properties and a Revised Value Iteration Algorithm for Dynamic Capacity Expansion and Reduction
by Jazeem Abduljaleel and Mohammad M. AlDurgam
Mathematics 2025, 13(23), 3865; https://doi.org/10.3390/math13233865 - 2 Dec 2025
Viewed by 253
Abstract
This manuscript introduces a generalized Markov Decision Process (MDP) model for dynamic capacity planning in the presence of stochastic time-nonhomogeneous demand, wherein system capacity may be flexibly increased or decreased throughout a finite planning horizon. The model includes investment, disinvestment, maintenance, operational, and [...] Read more.
This manuscript introduces a generalized Markov Decision Process (MDP) model for dynamic capacity planning in the presence of stochastic time-nonhomogeneous demand, wherein system capacity may be flexibly increased or decreased throughout a finite planning horizon. The model includes investment, disinvestment, maintenance, operational, and shortage costs, in addition to a salvage value at the end of the planning horizon. Under very realistic conditions, we investigate the structural properties of the optimal policy and demonstrate its monotonic structure. By leveraging these properties, we propose a revised value iteration algorithm that capitalizes on the intrinsic structure of the problem, thereby achieving enhanced computational efficiency compared to traditional dynamic programming techniques. The proposed model is applicable across a range of sectors, including manufacturing systems, cloud-computing services, logistics systems, healthcare resource management, power capacity planning, and other intelligent infrastructures driven by Industry 4.0. Full article
Show Figures

Figure 1

30 pages, 4654 KB  
Article
A Non-Cooperative Game-Based Retail Pricing Model for Electricity Retailers Considering Low-Carbon Incentives and Multi-Player Competition
by Zhiyu Zhao, Bo Bo, Xuemei Li, Po Yang, Dafei Jiang, Ge Wang and Fei Wang
Electronics 2025, 14(23), 4713; https://doi.org/10.3390/electronics14234713 - 29 Nov 2025
Viewed by 263
Abstract
This paper addresses the retail pricing problem for electricity retailers who also act as virtual power plant (VPP) operators, aggregating distributed energy resources (DERs). In future power markets where multiple such retailers compete for customers, a key challenge is to design pricing strategies [...] Read more.
This paper addresses the retail pricing problem for electricity retailers who also act as virtual power plant (VPP) operators, aggregating distributed energy resources (DERs). In future power markets where multiple such retailers compete for customers, a key challenge is to design pricing strategies that balance economic profitability with low-carbon objectives. Existing research often overlooks the impact of retailers’ heterogeneous resource portfolios, particularly the share of low-carbon resources like photovoltaics (PVs), on their competitive advantage and pricing decisions. To bridge this gap, we propose a novel retail pricing model that integrates a non-cooperative game framework with Markov Decision Processes (MDPs). The model enables each retailer to formulate optimal real-time pricing strategies by anticipating competitors’ actions and customer responses, ultimately reaching a Nash equilibrium. A distinctive feature of our approach is the incorporation of spatially differentiated carbon emission factors, which are adjusted based on each retailer’s share of PV generation. This creates a tangible low-carbon incentive, allowing retailers with greener resource mixes to leverage their environmental advantage. The proposed framework is validated on a modified IEEE 30-bus system with six competing retailers. Simulation results demonstrate that our method effectively incentivizes optimal load distribution, alleviates network congestion, and improves branch loading indices. Critically, retailers with a higher share of PV resources achieved significantly higher profits, directly translating their low-carbon advantage into economic value. Notably, the Branch Load Index (BLI) was reduced by 12% and node voltage deviations were improved by 1.32% at Bus 12, demonstrating the model’s effectiveness in integrating economic and low-carbon objectives. Full article
Show Figures

Figure 1

Back to TopTop