MDPI - Publisher of Open Access Journals

29 pages, 3266 KiB

Open Access Feature Paper Article

Wavelet Multiresolution Analysis-Based Takagi–Sugeno–Kang Model, with a Projection Step and Surrogate Feature Selection for Spectral Wave Height Prediction

by Panagiotis Korkidis and Anastasios Dounis

Mathematics 2025, 13(15), 2517; https://doi.org/10.3390/math13152517 - 5 Aug 2025

Abstract

The accurate prediction of significant wave height presents a complex yet vital challenge in the field of ocean engineering. This capability is essential for disaster prevention, fostering sustainable development and deepening our understanding of various scientific phenomena. We explore the development of a comprehensive predictive methodology for wave height prediction by integrating novel Takagi–Sugeno–Kang fuzzy models within a multiresolution analysis framework. The multiresolution analysis emerges via wavelets, since they are prominent models characterised by their inherent multiresolution nature. The maximal overlap discrete wavelet transform is utilised to generate the detail and resolution components of the time series, resulting from this multiresolution analysis. The novelty of the proposed model lies in its hybrid training approach, which combines least squares with AdaBound, a gradient-based algorithm derived from the deep learning literature. Significant wave height prediction is studied as a time series problem; hence, the appropriate inputs to the model are selected by developing a surrogate-based wrapper algorithm. The developed wrapper-based algorithm employs Bayesian optimisation to deliver a fast and accurate method for feature selection. In addition, we introduce a projection step to further refine the approximation capabilities of the resulting predictive system. The proposed methodology is applied to a real-world time series pertaining to spectral wave height and obtained from the Poseidon operational oceanography system at the Institute of Oceanography, part of the Hellenic Center for Marine Research. Numerical studies showcase a high degree of approximation performance. The predictive scheme with the projection step yields a coefficient of determination of 0.9991, indicating a high level of accuracy. Furthermore, it outperforms the second-best comparative model by approximately 49% in terms of root mean squared error. Comparative evaluations against powerful artificial intelligence models, using regression metrics and hypothesis tests, underscore the effectiveness of the proposed methodology.

(This article belongs to the Special Issue Applications of Mathematics in Neural Networks and Machine Learning)
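
The two headline metrics in the abstract above, the coefficient of determination and the root mean squared error, can be sketched as follows. This is a minimal illustration, not the authors' evaluation code: the function names and the toy values are invented here, and reading "outperforms by approximately 49%" as a relative RMSE reduction is an assumption.

```python
import math

def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length sequences."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)

def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

def relative_rmse_gain(rmse_model, rmse_baseline):
    """Fractional RMSE reduction of a model relative to a baseline (assumed reading)."""
    return (rmse_baseline - rmse_model) / rmse_baseline

# Toy values, invented purely for illustration (not data from the paper).
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.1, 1.9, 3.2, 3.8]

if __name__ == "__main__":
    print("RMSE:", rmse(observed, predicted))
    print("R^2 :", r_squared(observed, predicted))
```

Under this reading, a model whose RMSE is 0.51 times that of the runner-up would show a gain of 0.49.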

48 pages, 1213 KiB

Open Access Article

Parameterized Fractal–Fractional Analysis of Ostrowski- and Simpson-Type Inequalities with Applications

by Saad Ihsan Butt, Muhammad Mehtab and Youngsoo Seol

Fractal Fract. 2025, 9(8), 494; https://doi.org/10.3390/fractalfract9080494 - 28 Jul 2025

Viewed by 223

Abstract

In this paper, we first introduce a parametric identity for generalized differentiable functions using generalized fractal–fractional integral operators. Based on this identity, we establish several variants of parameterized inequalities for functions whose local fractional derivatives in absolute value satisfy generalized convexity conditions. Furthermore, we demonstrate that our main results reduce to well-known Ostrowski- and Simpson-type inequalities by selecting suitable parameters. These inequalities contribute to finding tight bounds for various integrals over fractal spaces. By comparing the classical Hölder and power mean inequalities with their new generalized versions, we show that the improved forms yield sharper and more refined upper bounds. In particular, we illustrate that the generalizations of the Hölder and power mean inequalities provide better results when applied to fractal integrals, with their tighter bounds supported by graphical representations. Finally, a series of applications is discussed, including generalized special means, generalized probability density functions and generalized quadrature formulas, which highlight the practical significance of the proposed results in fractal analysis.

(This article belongs to the Section General Mathematics, Analysis)

30 pages, 795 KiB

Open Access Article

A Novel Heterogeneous Federated Edge Learning Framework Empowered with SWIPT

by Yinyin Fang, Sheng Shu, Yujun Zhu, Heju Li and Kunkun Rui

Symmetry 2025, 17(7), 1115; https://doi.org/10.3390/sym17071115 - 11 Jul 2025

Viewed by 221

Abstract

Federated edge learning (FEEL) is an innovative approach that facilitates collaborative training among numerous distributed edge devices while eliminating the need to transfer sensitive information. However, the practical deployment of FEEL faces significant constraints, owing to the limited and asymmetric computational and communication resources of these devices, along with their energy availability. To this end, we propose a novel asymmetry-tolerant training approach for FEEL, enabled via simultaneous wireless information and power transfer (SWIPT). This framework leverages SWIPT to offer sustainable energy support for devices while enabling them to train models with varying intensities. Given a limited energy budget, we highlight the critical trade-off between heterogeneous local training intensities and the quality of wireless transmission, suggesting that the design of local training and wireless transmission should be closely integrated, rather than treated as separate entities. To elucidate this perspective, we rigorously derive a new explicit upper bound that captures the combined impact of local training accuracy and the mean square error of wireless aggregation on the convergence performance of FEEL. To maximize overall system performance, we formulate two key optimization problems: the first aims to maximize the energy harvesting capability among all devices, while the second addresses the joint learning–communication optimization under the optimal energy harvesting solution. Comprehensive experiments demonstrate that our proposed framework achieves significant performance improvements compared to existing baselines.

(This article belongs to the Section Computer)

16 pages, 274 KiB

Open Access Article

Quantifying Social Benefits of Virtual Power Plants (VPPs) in South Korea: Contingent Valuation Method

by Dongnyok Shim

Energies 2025, 18(12), 3006; https://doi.org/10.3390/en18123006 - 6 Jun 2025

Viewed by 571

Abstract

This study is one of the first empirical attempts to quantify the social benefit of virtual power plants (VPPs) in South Korea using the contingent valuation method (CVM). As Korea pursues its ambitious carbon neutrality goal by 2050, VPPs have emerged as a critical technology for managing the intermittency of renewable energy sources and ensuring grid stability. Despite their recognized technical potential, the social and economic value of VPPs remains largely unexplored. Through a nationwide survey of 1105 households, we employed a double-bounded dichotomous choice spike model to estimate willingness to pay (WTP) for government-led VPP implementation. The analysis revealed two distinct dimensions influencing VPP valuation: electricity bill perceptions and electricity generation mix preferences. Results indicated that Korean households exhibited significant but heterogeneous WTP for VPP implementation, with unconditional mean annual WTP ranging from KRW 23,474 to KRW 26,545 per household. Notably, support for renewable energy transition showed stronger positive effects on WTP compared to nuclear expansion preferences, suggesting VPPs are primarily valued as renewable energy enablers. The substantial spike probability (32–34%) indicated that approximately one-third of the population has zero WTP, highlighting challenges in introducing novel energy technologies. Key determinants of positive WTP included perceived fairness of electricity pricing, support for market-based mechanisms, and preferences for transitioning from coal and nuclear to renewables. These findings provide critical policy insights for VPP deployment strategies, suggesting the need for phased implementation, targeted communication emphasizing renewable integration benefits, and coordination with broader electricity market reforms. The study contributes to energy transition economics literature by demonstrating how public preferences for emerging grid technologies are shaped by both economic considerations and environmental values.

(This article belongs to the Special Issue Energy and Environmental Economics for a Sustainable Future)
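
To make the link between the spike probability and the unconditional mean WTP in the abstract above concrete, the sketch below uses the textbook covariate-free spike model with a logistic form. It is an assumption that this simplified form is close to what the study estimates; the intercept `a`, the bid coefficient `b`, and all function names are hypothetical, and no parameter values are taken from the paper.

```python
import math

def spike_probability(a):
    """P(WTP = 0) in the simple spike model: 1 / (1 + exp(a))."""
    return 1.0 / (1.0 + math.exp(a))

def mean_wtp(a, b):
    """Unconditional mean WTP: ln(1 + exp(a)) / b, valid for b > 0."""
    return math.log1p(math.exp(a)) / b

def prob_wtp_at_most(bid, a, b):
    """CDF of WTP at a non-negative bid: 1 / (1 + exp(a - b * bid))."""
    return 1.0 / (1.0 + math.exp(a - b * bid))

def intercept_from_spike(p_zero):
    """Invert the spike formula: the intercept a that yields a given spike."""
    return math.log((1.0 - p_zero) / p_zero)

if __name__ == "__main__":
    # A spike of about one third, as reported in the abstract, corresponds to
    # this intercept under the simplified model (b is left unspecified here).
    a = intercept_from_spike(0.33)
    print("intercept a:", a)
    print("spike check:", spike_probability(a))
```

The design point is that the mass at zero and the mean are both driven by the same intercept, so a larger spike pulls the unconditional mean down for a fixed bid coefficient.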

20 pages, 4574 KiB

Open Access Article

Pavement-DETR: A High-Precision Real-Time Detection Transformer for Pavement Defect Detection

by Cuihua Zuo, Nengxin Huang, Cao Yuan and Yaqin Li

Sensors 2025, 25(8), 2426; https://doi.org/10.3390/s25082426 - 11 Apr 2025

Viewed by 1071

Abstract

The accurate detection of road defects is crucial for enhancing the safety and efficiency of road maintenance. This study focuses on six common types of pavement defects: transverse cracks, longitudinal cracks, alligator cracking, oblique cracks, potholes, and repair marks. In real-world scenarios, key challenges include effectively distinguishing between the foreground and background, as well as accurately identifying small-sized defects (e.g., fine cracks, dense alligator cracking, and clustered potholes) and overlapping defects (e.g., intersecting cracks or clustered damage areas where multiple defects appear close together). To address these issues, this paper proposes a Pavement-DETR model based on the Real-Time Detection Transformer (RT-DETR), aiming to optimize the overall accuracy of defect detection. Three main improvements are proposed: (1) the introduction of the Channel-Spatial Shuffle (CSS) attention mechanism in the third (S3) and fourth (S4) stages of the ResNet backbone, which correspond to mid-level and high-level feature layers, enabling the model to focus more precisely on road defect features; (2) the adoption of the Conv3XC structure for feature fusion, which enhances the model’s ability to differentiate between the foreground and background through multi-level convolutions, channel expansion, and skip connections that also contribute to improved gradient flow and training stability; (3) the proposal of a loss function that combines Powerful-IoU v2 (PIoU v2) and Normalized Wasserstein Distance (NWD) through weighted averaging, where PIoU v2 focuses on optimizing overlapping regions and NWD targets small object optimization. The combined loss function enables comprehensive optimization of the bounding boxes, improving the model’s accuracy and convergence speed. Experimental results show that on the UAV-PDD2023 dataset, Pavement-DETR improves the mean average precision (mAP) by 7.7% at IoU = 0.5, increases mAP by 8.9% at IoU = 0.5–0.95, and improves F1 Score by 7%. These results demonstrate that Pavement-DETR exhibits better performance in road defect detection, making it highly significant for road maintenance work.

(This article belongs to the Section Fault Diagnosis & Sensors)
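
The weighted combination of an overlap-based term and an NWD term described in improvement (3) above can be sketched roughly as follows. This is not the paper's loss: plain IoU stands in for PIoU v2, whose exact form is not given in the abstract, and the weight `weight` and the NWD constant `c` are assumed hyperparameters. The NWD part follows the commonly used formulation in which each box is modelled as a 2D Gaussian.

```python
import math

def iou_xyxy(a, b):
    """Plain IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def nwd_xyxy(a, b, c=12.8):
    """Normalized Wasserstein Distance similarity in (0, 1].

    Each box is treated as a Gaussian with mean (cx, cy) and covariance
    diag(w^2 / 4, h^2 / 4); c is a dataset-dependent constant (assumed value).
    """
    cxa, cya = (a[0] + a[2]) / 2.0, (a[1] + a[3]) / 2.0
    cxb, cyb = (b[0] + b[2]) / 2.0, (b[1] + b[3]) / 2.0
    wa, ha = a[2] - a[0], a[3] - a[1]
    wb, hb = b[2] - b[0], b[3] - b[1]
    # Squared 2-Wasserstein distance between the two Gaussians.
    w2_sq = ((cxa - cxb) ** 2 + (cya - cyb) ** 2
             + ((wa - wb) / 2.0) ** 2 + ((ha - hb) / 2.0) ** 2)
    return math.exp(-math.sqrt(w2_sq) / c)

def combined_box_loss(pred, target, weight=0.5, c=12.8):
    """Weighted average of an IoU loss (stand-in for PIoU v2) and an NWD loss."""
    iou_loss = 1.0 - iou_xyxy(pred, target)
    nwd_loss = 1.0 - nwd_xyxy(pred, target, c)
    return weight * iou_loss + (1.0 - weight) * nwd_loss
```

For two small boxes that do not overlap, the IoU term saturates at 1 regardless of how far apart they are, while the NWD term still varies smoothly with distance, which is the usual motivation for pairing the two on small objects.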

18 pages, 2152 KiB

Open Access Article

Insulator Defect Detection via a Residual Denoising Diffusion Mechanism

by Li Zhang, Mengyang Song, Huaping Guo, Yange Sun and Xinxia Wang

Materials 2025, 18(8), 1738; https://doi.org/10.3390/ma18081738 - 10 Apr 2025

Viewed by 519

Abstract

Insulators are critical components of transmission lines, and defective insulators pose a serious threat to the safety of power supply systems. Timely detection of these defects is crucial to prevent catastrophic consequences for human lives and property. However, insulator defects are often small and easily affected by the noise of rain, fog, sunlight, dirt, and other pollutants, making detection challenging. We observe that diffusion models learn data distribution by progressively introducing noise and subsequently performing denoising. The progressive denoising mechanism can naturally simulate the randomness of environmental noise. Based on this observation, we treat the localization of insulator defects as a denoising-based recovery process, where the true defect bounding boxes are progressively reconstructed from noisy representations. To this end, we propose a novel diffusion-based Insulator Defect Detector (IDDet) that is specifically designed to handle complex environmental noise. IDDet introduces noise to the true bounding boxes to generate noisy target boxes with random distributions and is then trained to recover the true bounding boxes from these noisy representations through a residual denoising diffusion mechanism. For the inference stage, IDDet refines the defect location from a random noise bounding box by gradually removing the noise, ultimately achieving the task of precisely locating the defect in the image. Experimental results show that IDDet significantly improves detection capability in noisy environments, achieving the best mean average precision (mAP) of 92.3%, confirming the feasibility and effectiveness of our approach.

(This article belongs to the Special Issue Advancements in Ultrasonic Testing for Metallurgical Materials)

21 pages, 1425 KiB

Open Access Article

Integrated Stochastic Approach for Instantaneous Energy Performance Analysis of Thermal Energy Systems

by Anthony Kpegele Le-ol, Sidum Adumene, Duabari Silas Aziaka, Mohammad Yazdi and Javad Mohammadpour

Energies 2025, 18(1), 160; https://doi.org/10.3390/en18010160 - 3 Jan 2025

Viewed by 676

Abstract

To ascertain energy availability and system performance, a comprehensive understanding of the systems’ degradation profile and impact on overall plant reliability is imperative. The current study presents an integrated Failure Mode and Effects Analysis (FMEA)–Markovian algorithm for reliability-based instantaneous energy performance prediction for thermal energy systems. The FMEA methodology is utilized to identify and categorize the various failure modes of the gas turbines, establishing a reliability pattern that informs overall system performance. Meanwhile, the Markovian algorithm discretizes the system into states based on its operational energy performance envelope. The algorithm predicts instantaneous energy performance according to upper and lower bounds criteria. This integrated methodology has been subjected to testing in three case studies, yielding results that demonstrate improved reliability and instantaneous energy performance prediction during system degradation. It was observed that after 14 years of operation, the likelihood of major failures increases to 79.6%, 88.7%, and 82.8%, with corresponding decreases in system performance reliability of 10.1%, 4.5%, and 7.8% for the Afam, Ibom, and Sapele gas turbine plants, respectively. Furthermore, the percentage of instantaneous mean power performance relative to the rated capacity is 37.9%, 35.1%, and 46.3% for the three gas turbine plants. These results indicate that the Sapele thermal power plant performs better relative to its rated capacity. Overall, this integrated methodology serves as a valuable tool for monitoring gas turbine engine health and predicting energy performance under varying operating conditions.

(This article belongs to the Section J: Thermal Management)
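
The idea of discretizing a plant into performance states and propagating state probabilities over time, as described in the abstract above, can be illustrated with a small discrete-time Markov chain. This is only a sketch under strong assumptions: the paper may use a different (for example continuous-time) formulation, and the three states, the yearly transition matrix, and the per-state output fractions below are entirely hypothetical.

```python
def step(dist, P):
    """One step of a discrete-time Markov chain: new_dist = dist @ P."""
    n = len(P)
    return [sum(dist[i] * P[i][j] for i in range(n)) for j in range(n)]

def propagate(dist, P, steps):
    """Apply the transition matrix a given number of times."""
    for _ in range(steps):
        dist = step(dist, P)
    return dist

def expected_output(dist, state_output):
    """Expected output fraction given state probabilities and per-state outputs."""
    return sum(p * o for p, o in zip(dist, state_output))

# Hypothetical yearly transitions between three states:
# 0 = nominal, 1 = degraded, 2 = major failure (absorbing).
P = [
    [0.90, 0.08, 0.02],
    [0.00, 0.85, 0.15],
    [0.00, 0.00, 1.00],
]
# Hypothetical fraction of rated capacity delivered in each state.
STATE_OUTPUT = [1.0, 0.6, 0.0]

if __name__ == "__main__":
    start = [1.0, 0.0, 0.0]  # plant begins in the nominal state
    after_14 = propagate(start, P, 14)
    print("state probabilities after 14 steps:", after_14)
    print("expected output fraction:", expected_output(after_14, STATE_OUTPUT))
```

With an absorbing failure state, the probability of major failure can only grow with each step, which mirrors the qualitative trend the abstract reports over years of operation.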

30 pages, 6897 KiB

Open Access Article

Research on UAV Autonomous Recognition and Approach Method for Linear Target Splicing Sleeves Based on Deep Learning and Active Stereo Vision

by Guocai Zhang, Guixiong Liu and Fei Zhong

Electronics 2024, 13(24), 4872; https://doi.org/10.3390/electronics13244872 (registering DOI) - 10 Dec 2024

Cited by 1 | Viewed by 1113

Abstract

This study proposes an autonomous recognition and approach method for unmanned aerial vehicles (UAVs) targeting linear splicing sleeves. By integrating deep learning and active stereo vision, this method addresses the navigation challenges faced by UAVs during the identification, localization, and docking of splicing sleeves on overhead power transmission lines. First, a two-stage localization strategy, LC (Local Clustering)-RB (Reparameterization Block)-YOLO (You Only Look Once)v8n (OBB (Oriented Bounding Box)), is developed for linear target splicing sleeves. This strategy ensures rapid, accurate, and reliable recognition and localization while generating precise waypoints for UAV docking with splicing sleeves. Next, virtual reality technology is utilized to expand the splicing sleeve dataset, creating the D_SS dataset tailored to diverse scenarios. This enhancement improves the robustness and generalization capability of the recognition model. Finally, a UAV approach splicing sleeve (UAV-ASS) visual navigation simulation platform is developed using the Robot Operating System (ROS), the PX4 open-source flight control system, and the GAZEBO 3D robotics simulator. This platform simulates the UAV’s final approach to the splicing sleeves. Experimental results demonstrate that, on the D_SS dataset, the RB-YOLOv8n(OBB) model achieves a mean average precision (mAP0.5) of 96.4%, with an image inference speed of 86.41 frames per second. By incorporating the LC-based fine localization method, the five rotational bounding box parameters (x, y, w, h, and angle) of the splicing sleeve achieve a mean relative error (MRE) ranging from 3.39% to 4.21%. Additionally, the correlation coefficients (ρ) with manually annotated positions improve to 0.99, 0.99, 0.98, 0.95, and 0.98, respectively. These improvements significantly enhance the accuracy and stability of splicing sleeve localization. Moreover, the developed UAV-ASS visual navigation simulation platform effectively validates high-risk algorithms for UAV autonomous recognition and docking with splicing sleeves on power transmission lines, reducing testing costs and associated safety risks.

(This article belongs to the Section Computer Science & Engineering)

80 pages, 858 KiB

Open Access Article

Uniform in Number of Neighbor Consistency and Weak Convergence of k-Nearest Neighbor Single Index Conditional Processes and k-Nearest Neighbor Single Index Conditional U-Processes Involving Functional Mixing Data

by Salim Bouzebda

Symmetry 2024, 16(12), 1576; https://doi.org/10.3390/sym16121576 - 25 Nov 2024

Cited by 5 | Viewed by 1395

Abstract

U-statistics are fundamental in modeling statistical measures that involve responses from multiple subjects. They generalize the concept of the empirical mean of a random variable X to include summations over each m-tuple of distinct observations of X. W. Stute introduced conditional U-statistics, extending the Nadaraya–Watson estimates for regression functions. Stute demonstrated their strong pointwise consistency with the conditional expectation $r^{(m)}(\varphi, t)$, defined as $E[\varphi(Y_{1}, \dots, Y_{m}) \mid (X_{1}, \dots, X_{m}) = t]$ for $t \in X^{m}$. This paper focuses on estimating functional single index (FSI) conditional U-processes for regular time series data. We propose a novel, automatic, and location-adaptive procedure for estimating these processes based on k-Nearest Neighbor (kNN) principles. Our asymptotic analysis includes data-driven neighbor selection, making the method highly practical. The local nature of the kNN approach improves predictive power compared to traditional kernel estimates. Additionally, we establish new uniform results in bandwidth selection for kernel estimates in FSI conditional U-processes, including almost complete convergence rates and weak convergence under general conditions. These results apply to both bounded and unbounded function classes, satisfying certain moment conditions, and are proven under standard Vapnik–Chervonenkis structural conditions and mild model assumptions. Furthermore, we demonstrate uniform consistency for the nonparametric inverse probability of censoring weighted (I.P.C.W.) estimators of the regression function under random censorship. This result is independently valuable and has potential applications in areas such as set-indexed conditional U-statistics, the Kendall rank correlation coefficient, and discrimination problems.

(This article belongs to the Section Mathematics)

16 pages, 6173 KiB

Open Access Article

Control Power in Continuous Variable Controlled Quantum Teleportation

by Yuehan Tian, Dunbo Cai, Nengfei Gong, Yining Li, Ling Qian, Runqing Zhang, Zhiguo Huang and Tiejun Wang

Entropy 2024, 26(12), 1017; https://doi.org/10.3390/e26121017 - 25 Nov 2024

Viewed by 842

Abstract

Controlled quantum teleportation is an important extension of multipartite quantum teleportation, which plays an indispensable role in building quantum networks. Compared with its discrete variable counterparts, continuous variable controlled quantum teleportation can generate entanglement deterministically and gives the supervisor stronger authority. Here, we define a measure to quantify the control power in continuous variable controlled quantum teleportation via Greenberger–Horne–Zeilinger-type entangled coherent state channels. Our results show that control power in continuous variable controlled quantum teleportation increases with the mean photon number of coherent states. Its upper bound is 1/2, which exceeds the upper bound in discrete variable controlled quantum teleportation (1/3). The robustness of the protocol is analyzed under photon absorption. The results show that the achievable improvement in control power decreases as photon loss increases, while the upper bound remains unchanged. Our results illuminate the role of control power in multipartite continuous variable quantum information processing and provide a criterion for evaluating the quality of quantum communication networks.

(This article belongs to the Special Issue Quantum Information: Working Towards Applications)

10 pages, 355 KiB

Open Access Article

Partial Path Overlapping Mitigation: An Initial Stage for Joint Detection and Decoding in Multipath Channels Using the Sum–Product Algorithm

by Anoush Mirbadin and Abolfazl Zaraki

Appl. Sci. 2024, 14(20), 9175; https://doi.org/10.3390/app14209175 - 10 Oct 2024

Viewed by 1304

Abstract

This paper addresses the problem of mitigating unknown partial path overlaps in communication systems. This study demonstrates that by utilizing the front-end insight of communication systems along with the sum–product algorithm applied to factor graphs, it is possible not only to track these overlapping components accurately, but also to detect all multipath channel impairments simultaneously. The proposed methodology involves discretizing channel parameters, such as channel paths and attenuation coefficients, to ensure the most accurate computation of means of Gaussian observations. These parameters are modeled as Bernoulli random variables with priors set to 0.5. A notable aspect of the algorithm is its integration of the received signal power into the calculation of noise variance, which is critical for its performance. To further reduce the receiver complexity, a novel implementation strategy, based on providing pre-defined lookup tables (LOTs) to the receiver, is introduced. The simulation results, covering both distributed and concentrated pilot scenarios, reveal that the algorithm performs almost equally under both conditions and surpasses the established upper bound in performance.

(This article belongs to the Special Issue Advances in Wireless Communication Technologies)

25 pages, 728 KiB

Open Access Article

On Extended Class of Totally Ordered Interval-Valued Convex Stochastic Processes and Applications

by Muhammad Zakria Javed, Muhammad Uzair Awan, Loredana Ciurdariu, Silvestru Sever Dragomir and Yahya Almalki

Fractal Fract. 2024, 8(10), 577; https://doi.org/10.3390/fractalfract8100577 - 30 Sep 2024

Cited by 7 | Viewed by 1074

Abstract

The intent of the current study is to explore convex stochastic processes within a broader context. We introduce the concept of unified stochastic processes to analyze both convex and non-convex stochastic processes simultaneously. We employ weighted quasi-mean, non-negative mapping γ, and center-radius ordering relations to establish a class of extended cr-interval-valued convex stochastic processes. This class yields a combination of innovative convex and non-convex stochastic processes. We characterize our class by illustrating its relationships with other classes as well as certain key attributes and sufficient conditions for this class of processes. Additionally, leveraging Riemann–Liouville stochastic fractional operators and our proposed class, we prove parametric fractional variants of Jensen’s inequality, the Hermite–Hadamard inequality, Fejér’s inequality, and a product Hermite–Hadamard-like inequality. We establish an interesting relation between means via the Hermite–Hadamard inequality. We use numerical and graphical approaches to showcase the significance and effectiveness of the primary findings. In addition, the proposed results are powerful tools for evaluating the bounds of stochastic Riemann–Liouville fractional operators in different scenarios for a larger space of processes.

(This article belongs to the Special Issue New Trends on Generalized Fractional Calculus, 2nd Edition)

17 pages, 3554 KiB

Open Access Article

Robot Operating Systems–You Only Look Once Version 5–Fleet Efficient Multi-Scale Attention: An Improved You Only Look Once Version 5-Lite Object Detection Algorithm Based on Efficient Multi-Scale Attention and Bounding Box Regression Combined with Robot Operating Systems

by Haiyan Wang, Zhan Shi, Guiyuan Gao, Chuang Li, Jian Zhao and Zhiwei Xu

Appl. Sci. 2024, 14(17), 7591; https://doi.org/10.3390/app14177591 - 28 Aug 2024

Viewed by 1498

Abstract

This paper primarily investigates enhanced object detection techniques for indoor service mobile robots. Robot operating systems (ROS) supply rich sensor data, which boost the models’ ability to generalize. However, the model’s performance might be hindered by constraints in the processing power, memory capacity, and communication capabilities of robotic devices. To address these issues, this paper proposes an improved you only look once version 5 (YOLOv5)-Lite object detection algorithm based on efficient multi-scale attention and bounding box regression combined with ROS. The algorithm incorporates efficient multi-scale attention (EMA) into the traditional YOLOv5-Lite model and replaces the C3 module with a lightweight C3Ghost module to reduce computation and model size during the convolution process. To enhance bounding box localization accuracy, modified precision-defined intersection over union (MPDIoU) is employed to optimize the model, resulting in the ROS–YOLOv5–FleetEMA model. The results indicated that, relative to the conventional YOLOv5-Lite model, the ROS–YOLOv5–FleetEMA model enhanced the mean average precision (mAP) by 2.7% post-training, reduced giga floating-point operations per second (GFLOPS) by 13.2%, and decreased the number of parameters by 15.1%. In light of these experimental findings, the model was incorporated into ROS, leading to the development of a ROS-based object detection platform that offers rapid and precise object detection capabilities.

(This article belongs to the Special Issue Object Detection and Image Classification)

15 pages, 5521 KiB

Open AccessArticle

A Historical Handwritten French Manuscripts Text Detection Method in Full Pages

by Rui Sang, Shili Zhao, Yan Meng, Mingxian Zhang, Xuefei Li, Huijie Xia and Ran Zhao

Information 2024, 15(8), 483; https://doi.org/10.3390/info15080483 - 14 Aug 2024

Viewed by 1423

Abstract

Historical handwritten manuscripts pose challenges to automated recognition techniques due to their unique handwriting styles and cultural backgrounds. To address the misdetection and omission of complex text words and the insufficient detection of wide-pitch curved text, this study proposes a high-precision text detection method based on an improved YOLOv8s. Firstly, the Swin Transformer replaces C2f at the end of the backbone network to address fine-grained information loss and insufficient feature learning in text word detection. Secondly, the Dysample (Dynamic Upsampling Operator) method is used to retain more detailed features of the target and to overcome the information loss of traditional upsampling, enabling text detection for dense targets. Then, the LSK (Large Selective Kernel) module is added to the detection head to dynamically adjust the feature extraction receptive field, which handles words with extreme aspect ratios, unfocused small text, and text with complex shapes. Finally, to overcome the shortcomings of the CIOU (Complete Intersection Over Union) loss in target box regression, namely its unclear aspect ratio term, insensitivity to size change, and insufficient correlation between target coordinates, Gaussian Wasserstein Distance (GWD) is introduced to modify the regression loss so that it measures the similarity between the two bounding boxes and yields high-quality bounding boxes. Compared with state-of-the-art methods, the proposed method achieves optimal performance in text detection, with the precision and mAP@0.5 reaching 86.3% and 82.4%, which are 8.1% and 6.7% higher than the original method, respectively. The contribution of each module is verified by ablation experiments. The experimental results show that the proposed method can effectively realize complex text detection and provide a powerful technical means for historical manuscript reproduction.
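
As an illustration of the distance mentioned in the abstract above, the following is a minimal sketch of the squared 2-Wasserstein distance between two axis-aligned boxes modelled as 2D Gaussians. The box format `(cx, cy, w, h)`, the function name `gwd_squared`, and the convention of taking half the width and height as standard deviations are assumptions made for this example; the abstract does not say how the distance is converted into a loss, so that step is not shown.

```python
def gwd_squared(box_a, box_b):
    """Squared 2-Wasserstein distance between two axis-aligned boxes.

    Each box is (cx, cy, w, h) and is modelled as a Gaussian with mean
    (cx, cy) and diagonal covariance diag((w/2)^2, (h/2)^2). This
    modelling convention is an assumption of this sketch.
    """
    cxa, cya, wa, ha = box_a
    cxb, cyb, wb, hb = box_b
    # Squared Euclidean distance between the two means (box centres).
    center_term = (cxa - cxb) ** 2 + (cya - cyb) ** 2
    # Squared Frobenius distance between the covariance square roots,
    # which for diagonal covariances reduces to half-size differences.
    shape_term = ((wa - wb) ** 2 + (ha - hb) ** 2) / 4.0
    return center_term + shape_term


if __name__ == "__main__":
    # Same size, centres offset by (3, 4).
    print(gwd_squared((0, 0, 2, 2), (3, 4, 2, 2)))
```

Unlike an overlap ratio, this quantity stays informative when the two boxes do not intersect, since it still grows with the distance between their centres.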

22 pages, 7602 KiB

Open AccessArticle

Research on the Method of Foreign Object Detection for Railway Tracks Based on Deep Learning

by Shanping Ning, Feng Ding and Bangbang Chen

Sensors 2024, 24(14), 4483; https://doi.org/10.3390/s24144483 - 11 Jul 2024

Cited by 8 | Viewed by 3137

Abstract

Addressing the limitations of current railway track foreign object detection techniques, which suffer from inadequate real-time performance and diminished accuracy in detecting small objects, this paper introduces an innovative vision-based perception methodology harnessing the power of deep learning. Central to this approach is the construction of a railway boundary model utilizing a sophisticated track detection method, along with an enhanced UNet semantic segmentation network to achieve autonomous segmentation of diverse track categories. By employing equal interval division and row-by-row traversal, critical track feature points are precisely extracted, and the track linear equation is derived through the least squares method, thus establishing an accurate railway boundary model. We optimized the YOLOv5s detection model in four aspects: incorporating the SE attention mechanism into the Neck network layer to enhance the model’s feature extraction capabilities, adding a prediction layer to improve the detection performance for small objects, proposing a linear size scaling method to obtain suitable anchor boxes, and utilizing Inner-IoU to refine the boundary regression loss function, thereby increasing the positioning accuracy of the bounding boxes. We conducted a detection accuracy validation for railway track foreign object intrusion using a self-constructed image dataset. The results indicate that the proposed semantic segmentation model achieved an MIoU of 91.8%, representing a 3.9% improvement over the previous model, effectively segmenting railway tracks. Additionally, the optimized detection model could effectively detect foreign object intrusions on the tracks, reducing missed and false alarms and achieving a 7.4% increase in the mean average precision (IoU = 0.5) compared to the original YOLOv5s model. The model exhibits strong generalization capabilities in scenarios involving small objects. This proposed approach represents an effective exploration of deep learning techniques for railway track foreign object intrusion detection, suitable for use in complex environments to ensure the operational safety of rail lines.
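
The least squares step mentioned in the abstract above can be sketched as follows. This is only an illustration under stated assumptions: the function name `fit_track_line` is hypothetical, the feature points are assumed to be `(x, y)` pixel coordinates, and the line is parameterised as x = a*y + b because rails tend to run close to vertical in a forward-facing image. The paper's actual parameterisation is not given in the abstract.

```python
def fit_track_line(points):
    """Fit x = a*y + b to (x, y) feature points by ordinary least squares.

    Returns the pair (a, b). The parameterisation in terms of y is an
    assumption of this sketch, chosen so near-vertical rails stay
    well conditioned.
    """
    n = len(points)
    if n < 2:
        raise ValueError("at least two points are required")
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    # Spread of y and co-variation of x with y around their means.
    s_yy = sum((y - mean_y) ** 2 for _, y in points)
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    if s_yy == 0:
        raise ValueError("all points lie on the same image row")
    a = s_xy / s_yy
    b = mean_x - a * mean_y
    return a, b


if __name__ == "__main__":
    # Hypothetical feature points sampled at equally spaced image rows.
    rail_points = [(10.0, 0.0), (60.0, 100.0), (110.0, 200.0), (160.0, 300.0)]
    a, b = fit_track_line(rail_points)
    print(a, b)
```

Fitting one such line per rail gives left and right boundary equations, which could then be used to decide whether a detected object lies inside the track region.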

(This article belongs to the Section Vehicular Sensing)

Search Results (132)
