MDPI - Publisher of Open Access Journals

26 pages, 32734 KB

Open AccessArticle

Contextual-Semantic Interactive Perception Network for Small Object Detection in UAV Aerial Images

by Yiming Xu and Hongbing Ji

Remote Sens. 2025, 17(21), 3581; https://doi.org/10.3390/rs17213581 - 29 Oct 2025

Unmanned Aerial Vehicle (UAV)-based aerial object detection has been widely applied in various fields, including logistics, public security, disaster response, and smart agriculture. However, numerous small objects in UAV aerial images are often overwhelmed by large-scale complex backgrounds, making their appearance difficult to [...] Read more.

Unmanned Aerial Vehicle (UAV)-based aerial object detection has been widely applied in various fields, including logistics, public security, disaster response, and smart agriculture. However, numerous small objects in UAV aerial images are often overwhelmed by large-scale complex backgrounds, making their appearance difficult to distinguish and thereby prone to being missed by detectors. To tackle these issues, we propose a novel Contextual-Semantic Interactive Perception Network (CSIPN) for small object detection in UAV aerial scenarios, which enhances detection performance through scene interaction modeling, dynamic context modeling, and dynamic feature fusion. The core components of the CSIPN include the Scene Interaction Modeling Module (SIMM), the Dynamic Context Modeling Module (DCMM), and the Semantic-Context Dynamic Fusion Module (SCDFM). Specifically, the SIMM introduces a lightweight self-attention mechanism to generate a global scene semantic embedding vector, which then interacts with shallow spatial descriptors to explicitly depict the latent relationships between small objects and complex background, thereby selectively activating key spatial responses. The DCMM employs two dynamically adjustable receptive-field branches to adaptively model contextual cues and effectively supplement the contextual information required for detecting various small objects. The SCDFM utilizes a dual-weighting strategy to dynamically fuse deep semantic information with shallow contextual details, highlighting features relevant to small object detection while suppressing irrelevant background. Our method achieves mAPs of 37.2%, 93.4%, 50.8%, and 48.3% on the TinyPerson dataset, the WAID dataset, the VisDrone-DET dataset, and our self-built WildDrone dataset, respectively, while using only 25.3M parameters, surpassing existing state-of-the-art detectors and demonstrating its superiority and robustness. Full article

(This article belongs to the Special Issue Deep Learning-Driven Remote Sensing Image Processing for Object Detection and Localization)

► Show Figures

Figure 1

17 pages, 294 KB

Open AccessFeature PaperArticle

Approximate Fiber Products of Schemes and Their Étale Homotopical Invariants

by Dongfang Zhao

Mathematics 2025, 13(21), 3448; https://doi.org/10.3390/math13213448 - 29 Oct 2025

Abstract

The classical fiber product in algebraic geometry provides a powerful tool for studying loci where two morphisms to a base scheme,

ϕ : X \to S

and

ψ : Y \to S

, coincide exactly. This condition of strict equality, however, is insufficient [...] Read more.

The classical fiber product in algebraic geometry provides a powerful tool for studying loci where two morphisms to a base scheme,

ϕ : X \to S

and

ψ : Y \to S

, coincide exactly. This condition of strict equality, however, is insufficient for describing many real-world applications, such as the geometric structure of semantic spaces in modern large language models whose foundational architecture is the Transformer neural network: The token spaces of these models are fundamentally approximate, and recent work has revealed complex geometric singularities, challenging the classical manifold hypothesis. This paper develops a new framework to study and quantify the nature of approximate alignment between morphisms in the context of arithmetic geometry, using the tools of étale homotopy theory. We introduce the central object of our work, the étale mismatch torsor, which is a sheaf of torsors over the product scheme

X \times_{S} Y

. The structure of this sheaf serves as a rich, intrinsic, and purely algebraic object amenable to both qualitative classification and quantitative analysis of the global relationship between the two morphisms. Our main results are twofold. First, we provide a complete classification of these structures, establishing a bijection between their isomorphism classes and the first étale cohomology group

H_{é t}^{1} (X \times_{S} Y, \underset{̲}{π_{1}^{é t} (S)})

. Second, we construct a canonical filtration on this classifying cohomology group based on the theory of infinitesimal neighborhoods. This filtration induces a new invariant, which we term the order of mismatch, providing a hierarchical, algebraic measure for the degree of approximation between the morphisms. We apply this framework to the concrete case of generalized Howe curves over finite fields, demonstrating how both the characteristic class and its order reveal subtle arithmetic properties. Full article

(This article belongs to the Section B: Geometry and Topology)

9 pages, 236 KB

Open AccessArticle

The Dialectics of Energy: From the Concept to Actuality, from Actuality to Virtuality, from Virtuality to…

by Michael Marder

Religions 2025, 16(11), 1370; https://doi.org/10.3390/rel16111370 - 29 Oct 2025

Viewed by 59

Abstract

I will consider, first, the positive charge of energeia in Aristotle, who identified it with the actuality of the actual. Then, I will pay attention to the negative charge that re-signifies the term, bestowing on it the exact opposite sense of potentiality. Rather [...] Read more.

I will consider, first, the positive charge of energeia in Aristotle, who identified it with the actuality of the actual. Then, I will pay attention to the negative charge that re-signifies the term, bestowing on it the exact opposite sense of potentiality. Rather than a radical correction of Aristotle, this polarizing modern signification unfolds in the field prepared in, if also rejected by, Greek Antiquity and unblocks the electric current of the concept of energy, a directional flow of charged particles of meaning from the positive to the negative pole. Still, the flow does not just happen by itself: the equivalent of electromotive force (EMF) is a fresh glance at the history of philosophy, not as a field dotted with static monuments to past intellectual achievements, but as an electric, or electromagnetic, semantic field. Only by grasping the conceptual circuitry of energy as a whole is it possible to appreciate the complex relation of this concept to the history of philosophical and theological thought and to the present. Full article

(This article belongs to the Special Issue Energy and Religion)

23 pages, 3485 KB

Open AccessArticle

MMA-Net: A Semantic Segmentation Network for High-Resolution Remote Sensing Images Based on Multimodal Fusion and Multi-Scale Multi-Attention Mechanisms

by Xuanxuan Huang, Xuejie Zhang, Longbao Wang, Dandan Yuan, Shufang Xu, Fengguang Zhou and Zhijun Zhou

Remote Sens. 2025, 17(21), 3572; https://doi.org/10.3390/rs17213572 - 28 Oct 2025

Viewed by 214

Abstract

Semantic segmentation of high-resolution remote sensing images is of great application value in fields like natural disaster monitoring. Current multimodal semantic segmentation methods have improved the model’s ability to recognize different ground objects and complex scenes by integrating multi-source remote sensing data. However, [...] Read more.

Semantic segmentation of high-resolution remote sensing images is of great application value in fields like natural disaster monitoring. Current multimodal semantic segmentation methods have improved the model’s ability to recognize different ground objects and complex scenes by integrating multi-source remote sensing data. However, these methods still face challenges such as blurred boundary segmentation and insufficient perception of multi-scale ground objects when achieving high-precision classification. To address these issues, this paper proposes MMA-Net, a semantic segmentation network enhanced by two key modules: cross-layer multimodal fusion module and multi-scale multi-attention module. These modules effectively improve the model’s ability to capture detailed features and model multi-scale ground objects, thereby enhancing boundary segmentation accuracy, detail feature preservation, and consistency in multi-scale object segmentation. Specifically, the cross-layer multimodal fusion module adopts a staged fusion strategy to integrate detailed information and multimodal features, realizing detail preservation and modal synergy enhancement. The multi-scale multi-attention module combines cross-attention and self-attention to leverage long-range dependencies and inter-modal complementary relationships, strengthening the model’s feature representation for multi-scale ground objects. Experimental results show that MMA-Net outperforms state-of-the-art methods on the Potsdam and Vaihingen datasets. Its mIoU reaches 88.74% and 84.92% on the two datasets, respectively. Ablation experiments further verify that each proposed module contributes to the final performance. Full article

► Show Figures

Graphical abstract

21 pages, 9130 KB

Open AccessArticle

Feature-Differentiated Perception with Dynamic Mixed Convolution and Spatial Orthogonal Attention for Faster Aerial Object Detection

by Yiming Ma, Noridayu Manshor and Fatimah binti Khalid

Algorithms 2025, 18(11), 684; https://doi.org/10.3390/a18110684 - 28 Oct 2025

Viewed by 119

Abstract

In the field of remote sensing (RS) object detection, efficient and accurate target recognition is crucial for applications such as national defense and maritime monitoring. However, existing detection methods either have high computational complexity, making them unsuitable for real-time applications, or suffer from [...] Read more.

In the field of remote sensing (RS) object detection, efficient and accurate target recognition is crucial for applications such as national defense and maritime monitoring. However, existing detection methods either have high computational complexity, making them unsuitable for real-time applications, or suffer from feature redundancy issues that affect detection accuracy. To address these challenges, this paper proposes a Feature-Differentiated Perception (FDP) lightweight remote sensing object detection method, which optimizes computational efficiency while maintaining high detection accuracy. The proposed method introduces two critical innovations: (1) Dynamic mixed convolution (DM-Conv), which uses linear mapping to efficiently generate redundant feature maps, reducing convolutional computation. It combines features from different intermediate layers through weighted fusion, effectively reducing the number of channels and improving feature utilization. Channel refers to a single feature map in the multi-dimensional feature representation, where each channel corresponds to a specific feature pattern (e.g., edges, textures, or semantic information) learned by the network. (2) The Spatial Orthogonal Attention (SOA) mechanism, which enhances the ability to model long-range dependencies between distant pixels, thereby improving feature representation capability. Experiments on public remote sensing object detection datasets, including DOTA, HRSC2016, and UCMerced-LandUse, demonstrate that the proposed model achieves a significant reduction in computational complexity while maintaining nearly lossless detection accuracy. On the DOTA dataset, the proposed method achieves an mAP (mean Average Precision) of 79.37%, outperforming existing lightweight models in terms of both speed and accuracy. This study provides new insights and practical solutions for efficient remote sensing object detection in embedded and edge computing environments. Full article

(This article belongs to the Section Algorithms for Multidisciplinary Applications)

► Show Figures

Figure 1

22 pages, 979 KB

Open AccessArticle

Multi-Modal Semantic Fusion for Smart Contract Vulnerability Detection in Cloud-Based Blockchain Analytics Platforms

by Xingyu Zeng, Qiaoyan Wen and Sujuan Qin

Electronics 2025, 14(21), 4188; https://doi.org/10.3390/electronics14214188 - 27 Oct 2025

Viewed by 227

Abstract

With the growth of trusted computing demand for big data analysis, cloud computing platforms are reshaping trusted data infrastructure by integrating Blockchain as a Service (BaaS), which uses elastic resource scheduling and heterogeneous hardware acceleration to support petabyte level multi-institution data security exchange [...] Read more.

With the growth of trusted computing demand for big data analysis, cloud computing platforms are reshaping trusted data infrastructure by integrating Blockchain as a Service (BaaS), which uses elastic resource scheduling and heterogeneous hardware acceleration to support petabyte level multi-institution data security exchange in medical, financial, and other fields. As the core hub of data-intensive scenarios, the BaaS platform has the dual capabilities of privacy computing and process automation. However, its deep dependence on smart contracts generates new code layer vulnerabilities, resulting in malicious contamination of analysis results. The existing detection schemes are limited to the perspective of single-source data, which makes it difficult to capture both global semantic associations and local structural details in a cloud computing environment, leading to a performance bottleneck in terms of scalability and detection accuracy. To address these challenges, this paper proposes a smart contract vulnerability detection method based on multi-modal semantic fusion for the blockchain analysis platform of cloud computing. Firstly, the contract source code is parsed into an abstract syntax tree, and the key code is accurately located based on the predefined vulnerability feature set. Then, the text features and graph structure features of key codes are extracted in parallel to realize the deep fusion of them. Finally, with the help of attention enhancement, the vulnerability probability is output through the fully connected network. The experiments on Ethereum benchmark datasets show that the detection accuracy of our method for re-entrancy vulnerability, timestamp vulnerability, overflow/underflow vulnerability, and delegatecall vulnerability can reach 92.2%, 96.3%, 91.4%, and 89.5%, surpassing previous methods. Additionally, our method has the potential for practical deployment in cloud-based blockchain service environments. Full article

(This article belongs to the Special Issue New Trends in Cloud Computing for Big Data Analytics)

► Show Figures

Figure 1

40 pages, 4303 KB

Open AccessSystematic Review

The Road to Autonomy: A Systematic Review Through AI in Autonomous Vehicles

by Adrian Domenteanu, Paul Diaconu, Margareta-Stela Florescu and Camelia Delcea

Electronics 2025, 14(21), 4174; https://doi.org/10.3390/electronics14214174 - 25 Oct 2025

Viewed by 351

Abstract

In the last decade, the incorporation of Artificial Intelligence (AI) with autonomous vehicles (AVs) has transformed transportation, mobility, and smart mobility systems. The present study provides a systematic review of global trends, applications, and challenges at the intersection of AI, including Machine Learning [...] Read more.

In the last decade, the incorporation of Artificial Intelligence (AI) with autonomous vehicles (AVs) has transformed transportation, mobility, and smart mobility systems. The present study provides a systematic review of global trends, applications, and challenges at the intersection of AI, including Machine Learning (ML), Deep Learning (DL), and autonomous vehicle technologies. Using data extracted from Clarivate Analytics’ Web of Science Core Collection and a set of specific keywords related to both AI and autonomous (electric) vehicles, this paper identifies the themes presented in the scientific literature using thematic maps and thematic map evolution analysis. Furthermore, the research topics are identified using both thematic maps, as well as Latent Dirichlet Allocation (LDA) and BERTopic, offering a more faceted insight into the research field as LDA enables the probabilistic discovery of high-level research themes, while BERTopic, based on transformer-based language models, captures deeper semantic patterns and emerging topics over time. This approach offers richer insights into the systematic review analysis, while comparison in the results obtained through the various methods considered leads to a better overview of the themes associated with the field of AI in autonomous vehicles. As a result, a strong correspondence can be observed between core topics, such as object detection, driving models, control, safety, cybersecurity and system vulnerabilities. The findings offer a roadmap for researchers and industry practitioners, by outlining critical gaps and discussing the opportunities for future exploration. Full article

► Show Figures

Figure 1

27 pages, 4104 KB

Open AccessArticle

CropCLR-Wheat: A Label-Efficient Contrastive Learning Architecture for Lightweight Wheat Pest Detection

by Yan Wang, Chengze Li, Chenlu Jiang, Mingyu Liu, Shengzhe Xu, Binghua Yang and Min Dong

Insects 2025, 16(11), 1096; https://doi.org/10.3390/insects16111096 - 25 Oct 2025

Viewed by 808

Abstract

To address prevalent challenges in field-based wheat pest recognition—namely, viewpoint perturbations, sample scarcity, and heterogeneous data distributions—a pest identification framework named CropCLR-Wheat is proposed, which integrates self-supervised contrastive learning with an attention-enhanced mechanism. By incorporating a viewpoint-invariant feature encoder and a diffusion-based feature [...] Read more.

To address prevalent challenges in field-based wheat pest recognition—namely, viewpoint perturbations, sample scarcity, and heterogeneous data distributions—a pest identification framework named CropCLR-Wheat is proposed, which integrates self-supervised contrastive learning with an attention-enhanced mechanism. By incorporating a viewpoint-invariant feature encoder and a diffusion-based feature filtering module, the model significantly enhances pest damage localization and feature consistency, enabling high-accuracy recognition under limited-sample conditions. In 5-shot classification tasks, CropCLR-Wheat achieves a precision of 89.4%, a recall of 87.1%, and an accuracy of 88.2%; these metrics further improve to 92.3%, 90.5%, and 91.2%, respectively, under the 10-shot setting. In the semantic segmentation of wheat pest damage regions, the model attains a mean intersection over union (mIoU) of 82.7%, with precision and recall reaching 85.2% and 82.4%, respectively, markedly outperforming advanced models such as SegFormer and Mask R-CNN. In robustness evaluation under viewpoint disturbances, a prediction consistency rate of 88.7%, a confidence variation of only 7.8%, and a prediction consistency score (PCS) of 0.914 are recorded, indicating strong stability and adaptability. Deployment results further demonstrate the framework’s practical viability: on the Jetson Nano device, an inference latency of 84 ms, a frame rate of 11.9 FPS, and an accuracy of 88.2% are achieved. These results confirm the efficiency of the proposed approach in edge computing environments. By balancing generalization performance with deployability, the proposed method provides robust support for intelligent agricultural terminal systems and holds substantial potential for wide-scale application. Full article

(This article belongs to the Special Issue Artificial Intelligence (AI) and Insect Pests Management: Securing Food Security, Human Health, and Natural Resources)

► Show Figures

Figure 1

23 pages, 11997 KB

Open AccessArticle

Deep Learning-Driven Automatic Segmentation of Weeds and Crops in UAV Imagery

by Jianghan Tao, Qian Qiao, Jian Song, Shan Sun, Yijia Chen, Qingyang Wu, Yongying Liu, Feng Xue, Hao Wu and Fan Zhao

Sensors 2025, 25(21), 6576; https://doi.org/10.3390/s25216576 - 25 Oct 2025

Viewed by 221

Abstract

Accurate segmentation of crops and weeds is essential for enhancing crop yield, optimizing herbicide usage, and mitigating environmental impacts. Traditional weed management practices, such as manual weeding or broad-spectrum herbicide application, are labor-intensive, environmentally harmful, and economically inefficient. In response, this study introduces [...] Read more.

Accurate segmentation of crops and weeds is essential for enhancing crop yield, optimizing herbicide usage, and mitigating environmental impacts. Traditional weed management practices, such as manual weeding or broad-spectrum herbicide application, are labor-intensive, environmentally harmful, and economically inefficient. In response, this study introduces a novel precision agriculture framework integrating Unmanned Aerial Vehicle (UAV)-based remote sensing with advanced deep learning techniques, combining Super-Resolution Reconstruction (SRR) and semantic segmentation. This study is the first to integrate UAV-based SRR and semantic segmentation for tobacco fields, systematically evaluate recent Transformer and Mamba-based models alongside traditional CNNs, and release an annotated dataset that not only ensures reproducibility but also provides a resource for the research community to develop and benchmark future models. Initially, SRR enhanced the resolution of low-quality UAV imagery, significantly improving detailed feature extraction. Subsequently, to identify the optimal segmentation model for the proposed framework, semantic segmentation models incorporating CNN, Transformer, and Mamba architectures were used to differentiate crops from weeds. Among evaluated SRR methods, RCAN achieved the optimal reconstruction performance, reaching a Peak Signal-to-Noise Ratio (PSNR) of 24.98 dB and a Structural Similarity Index (SSIM) of 69.48%. In semantic segmentation, the ensemble model integrating Transformer (DPT with DINOv2) and Mamba-based architectures achieved the highest mean Intersection over Union (mIoU) of 90.75%, demonstrating superior robustness across diverse field conditions. Additionally, comprehensive experiments quantified the impact of magnification factors, Gaussian blur, and Gaussian noise, identifying an optimal magnification factor of 4

\times

, proving that the method was robust to common environmental disturbances at optimal parameters. Overall, this research established an efficient, precise framework for crop cultivation management, offering valuable insights for precision agriculture and sustainable farming practices. Full article

(This article belongs to the Special Issue Smart Sensing and Control for Autonomous Intelligent Unmanned Systems)

► Show Figures

Figure 1

21 pages, 3381 KB

Open AccessArticle

Aero-Engine Ablation Defect Detection with Improved CLR-YOLOv11 Algorithm

by Yi Liu, Jiatian Liu, Yaxi Xu, Qiang Fu, Jide Qian and Xin Wang

Sensors 2025, 25(21), 6574; https://doi.org/10.3390/s25216574 - 25 Oct 2025

Viewed by 430

Abstract

Aero-engine ablation detection is a critical task in aircraft health management, yet existing rotation-based object detection methods often face challenges of high computational complexity and insufficient local feature extraction. This paper proposes an improved YOLOv11 algorithm incorporating Context-guided Large-kernel attention and Rotated detection [...] Read more.

Aero-engine ablation detection is a critical task in aircraft health management, yet existing rotation-based object detection methods often face challenges of high computational complexity and insufficient local feature extraction. This paper proposes an improved YOLOv11 algorithm incorporating Context-guided Large-kernel attention and Rotated detection head, called CLR-YOLOv11. The model achieves synergistic improvement in both detection efficiency and accuracy through dual structural optimization, with its innovations primarily embodied in the following three tightly coupled strategies: (1) Targeted Data Preprocessing Pipeline Design: To address challenges such as limited sample size, low overall image brightness, and noise interference, we designed an ordered data augmentation and normalization pipeline. This pipeline is not a mere stacking of techniques but strategically enhances sample diversity through geometric transformations (random flipping, rotation), hybrid augmentations (Mixup, Mosaic), and pixel-value transformations (histogram equalization, Gaussian filtering). All processed images subsequently undergo Z-Score normalization. This order-aware pipeline design effectively improves the quality, diversity, and consistency of the input data. (2) Context-Guided Feature Fusion Mechanism: To overcome the limitations of traditional Convolutional Neural Networks in modeling long-range contextual dependencies between ablation areas and surrounding structures, we replaced the original C3k2 layer with the C3K2CG module. This module adaptively fuses local textural details with global semantic information through a context-guided mechanism, enabling the model to more accurately understand the gradual boundaries and spatial context of ablation regions. (3) Efficiency-Oriented Large-Kernel Attention Optimization: To expand the receptive field while strictly controlling the additional computational overhead introduced by rotated detection, we replaced the C2PSA module with the C2PSLA module. By employing large-kernel decomposition and a spatial selective focusing strategy, this module significantly reduces computational load while maintaining multi-scale feature perception capability, ensuring the model meets the demands of high real-time applications. Experiments on a self-built aero-engine ablation dataset demonstrate that the improved model achieves 78.5% mAP@0.5:0.95, representing a 4.2% improvement over the YOLOv11-obb which model without the specialized data augmentation. This study provides an effective solution for high-precision real-time aviation inspection tasks. Full article

(This article belongs to the Special Issue Advanced Neural Architectures for Anomaly Detection in Sensory Data)

► Show Figures

Figure 1

40 pages, 33004 KB

Open AccessArticle

Sampling-Based Path Planning and Semantic Navigation for Complex Large-Scale Environments

by Shakeeb Ahmad and James Sean Humbert

Robotics 2025, 14(11), 149; https://doi.org/10.3390/robotics14110149 - 24 Oct 2025

Viewed by 191

Abstract

This article proposes a multi-agent path planning and decision-making solution for high-tempo field robotic operations, such as search-and-rescue, in large-scale unstructured environments. As a representative example, the subterranean environments can span many kilometers and are loaded with challenges such as limited to no [...] Read more.

This article proposes a multi-agent path planning and decision-making solution for high-tempo field robotic operations, such as search-and-rescue, in large-scale unstructured environments. As a representative example, the subterranean environments can span many kilometers and are loaded with challenges such as limited to no communication, hazardous terrain, blocked passages due to collapses, and vertical structures. The time-sensitive nature of these operations inherently requires solutions that are reliably deployable in practice. Moreover, a human-supervised multi-robot team is required to ensure that mobility and cognitive capabilities of various agents are leveraged for efficiency of the mission. Therefore, this article attempts to propose a solution that is suited for both air and ground vehicles and is adapted well for information sharing between different agents. This article first details a sampling-based autonomous exploration solution that brings significant improvements with respect to the current state of the art. These improvements include relying on an occupancy grid-based sample-and-project solution to terrain assessment and formulating the solution-search problem as a constraint-satisfaction problem to further enhance the computational efficiency of the planner. In addition, the demonstration of the exploration planner by team MARBLE at the DARPA Subterranean Challenge finals is presented. The inevitable interaction of heterogeneous autonomous robots with human operators demands the use of common semantics for reasoning across the robot and human teams making use of different geometric map capabilities suited for their mobility and computational resources. To this end, the path planner is further extended to include semantic mapping and decision-making into the framework. Firstly, the proposed solution generates a semantic map of the exploration environment by labeling position history of a robot in the form of probability distributions of observations. The semantic reasoning solution uses higher-level cues from a semantic map in order to bias exploration behaviors toward a semantic of interest. This objective is achieved by using a particle filter to localize a robot on a given semantic map followed by a Partially Observable Markov Decision Process (POMDP)-based controller to guide the exploration direction of the sampling-based exploration planner. Hence, this article aims to bridge an understanding gap between human and a heterogeneous robotic team not just through a common-sense semantic map transfer among the agents but by also enabling a robot to make use of such information to guide its lower-level reasoning in case such abstract information is transferred to it. Full article

(This article belongs to the Special Issue Autonomous Robotics for Exploration)

► Show Figures

Figure 1

23 pages, 16607 KB

Open AccessArticle

Few-Shot Class-Incremental SAR Target Recognition with a Forward-Compatible Prototype Classifier

by Dongdong Guan, Rui Feng, Yuzhen Xie, Xiaolong Zheng, Bangjie Li and Deliang Xiang

Remote Sens. 2025, 17(21), 3518; https://doi.org/10.3390/rs17213518 - 23 Oct 2025

Viewed by 290

Abstract

In practical Synthetic Aperture Radar (SAR) applications, new-class objects can appear at any time as the rapid accumulation of large-scale and high-quantity SAR imagery and are usually supported by limited instances in most cooperative scenarios. Hence, powering advanced deep-learning (DL)-based SAR Automatic Target [...] Read more.

In practical Synthetic Aperture Radar (SAR) applications, new-class objects can appear at any time as the rapid accumulation of large-scale and high-quantity SAR imagery and are usually supported by limited instances in most cooperative scenarios. Hence, powering advanced deep-learning (DL)-based SAR Automatic Target Recognition (SAR ATR) systems with the ability to continuously learn new concepts from few-shot samples without forgetting the old ones is important. In this paper, we tackle the Few-Shot Class-Incremental Learning (FSCIL) problem in the SAR ATR field and propose a Forward-Compatible Prototype Classifier (FCPC) by emphasizing the model’s forward compatibility to incoming targets before and after deployment. Specifically, the classifier’s sensitivity to diversified cues of emerging targets is improved in advance by a Virtual-class Semantic Synthesizer (VSS), considering the class-agnostic scattering parts of targets in SAR imagery and semantic patterns of the DL paradigm. After deploying the classifier in dynamic worlds, since novel target patterns from few-shot samples are highly biased and unstable, the model’s representability to general patterns and its adaptability to class-discriminative ones are balanced by a Decoupled Margin Adaptation (DMA) strategy, in which only the model’s high-level semantic parameters are timely tuned by improving the similarity of few-shot boundary samples to class prototypes and the dissimilarity to interclass ones. For inference, a Nearest-Class-Mean (NCM) classifier is adopted for prediction by comparing the semantics of unknown targets with prototypes of all classes based on the cosine criterion. In experiments, contributions of the proposed modules are verified by ablation studies, and our method achieves considerable performance on three FSCIL of SAR ATR datasets, i.e., SAR-AIRcraft-FSCIL, MSTAR-FSCIL, and FUSAR-FSCIL, compared with numerous benchmarks, demonstrating its superiority and effectiveness in dealing with the FSCIL of SAR ATR. Full article

► Show Figures

Figure 1

28 pages, 2038 KB

Open AccessArticle

Cognitive-Inspired Multimodal Learning Framework for Hazard Identification in Highway Construction with BIM–GIS Integration

by Jibiao Zhou, Zewei Li, Zhan Shi, Xinhua Mao and Chao Gao

Sustainability 2025, 17(21), 9395; https://doi.org/10.3390/su17219395 - 22 Oct 2025

Viewed by 390

Abstract

Highway construction remains one of the most hazardous sectors in the infrastructure domain, where persistent accident rates challenge the vision of sustainable and safe development. Traditional hazard identification methods rely on manual inspections that are often slow, error-prone, and unable to cope with [...] Read more.

Highway construction remains one of the most hazardous sectors in the infrastructure domain, where persistent accident rates challenge the vision of sustainable and safe development. Traditional hazard identification methods rely on manual inspections that are often slow, error-prone, and unable to cope with complex and dynamic site conditions. To address these limitations, this study develops a cognitive-inspired multimodal learning framework integrated with BIM–GIS-enabled digital twins to advance intelligent hazard identification and digital management for highway construction safety. The framework introduces three key innovations: a biologically grounded attention mechanism that simulates inspector search behavior, an adaptive multimodal fusion strategy that integrates visual, textual, and sensor information, and a closed-loop digital twin platform that synchronizes physical and virtual environments in real time. The system was validated across five highway construction projects over an 18-month period. Results show that the framework achieved a hazard detection accuracy of 91.7% with an average response time of 147 ms. Compared with conventional computer vision methods, accuracy improved by 18.2%, while gains over commercial safety systems reached 24.8%. Field deployment demonstrated a 34% reduction in accidents and a 42% increase in inspection efficiency, delivering a positive return on investment within 8.7 months. By linking predictive safety analytics with BIM–GIS semantics and site telemetry, the framework enhances construction safety, reduces delays and rework, and supports more resource-efficient, low-disruption project delivery, highlighting its potential as a sustainable pathway toward zero-accident highway construction. Full article

(This article belongs to the Special Issue Sustainable Transport: Logistic, Optimization, Traffic Flow and Road Safety)

► Show Figures

Figure 1

23 pages, 4442 KB

Open AccessArticle

Efficient and Lightweight LD-SAGE Model for High-Accuracy Leaf Disease Segmentation in Understory Ginseng

by Yanlei Xu, Ziyuan Yu, Dongze Wang, Chao Liu, Zhen Lu, Chen Zhao and Yang Zhou

Agronomy 2025, 15(11), 2450; https://doi.org/10.3390/agronomy15112450 - 22 Oct 2025

Viewed by 197

Abstract

Understory ginseng, with superior quality compared to field-cultivated varieties, is highly susceptible to diseases, which negatively impact both its yield and quality. Therefore, this paper proposes a lightweight, high-precision leaf spot segmentation model, Lightweight DeepLabv3+ with a StarNet Backbone and Attention-guided Gaussian Edge [...] Read more.

Understory ginseng, with superior quality compared to field-cultivated varieties, is highly susceptible to diseases, which negatively impact both its yield and quality. Therefore, this paper proposes a lightweight, high-precision leaf spot segmentation model, Lightweight DeepLabv3+ with a StarNet Backbone and Attention-guided Gaussian Edge Enhancement (LD-SAGE). This study first introduces StarNet into the DeepLabv3+ framework to replace the Xception backbone, reducing the parameter count and computational complexity. Secondly, the Gaussian-Edge Channel Fusion module uses multi-scale Gaussian convolutions to smooth blurry areas, combining Scharr edge-enhanced features with a lightweight channel attention mechanism for efficient edge and semantic feature integration. Finally, the proposed Multi-scale Attention-guided Context Modulation module replaces the traditional Atrous Spatial Pyramid Pooling. It integrates Multi-scale Grouped Dilated Convolution, Convolutional Multi-Head Self-Attention, and dynamic modulation fusion. This reduces computational costs and improves the model’s ability to capture contextual information and texture details in disease areas. Experimental results show that the LD-SAGE model achieves an mIoU of 92.48%, outperforming other models in terms of precision and recall. The model’s parameter count is only 4.6% of the original, with GFLOPs reduced to 22.1% of the baseline model. Practical deployment experiments on the Jetson Orin Nano device further confirm the advantage of the proposed method in the real-time frame rate, providing support for the diagnosis of leaf diseases in understory ginseng. Full article

(This article belongs to the Section Pest and Disease Management)

► Show Figures

Figure 1

30 pages, 3604 KB

Open AccessArticle

Integrated Systems Ontology (ISOnto): Integrating Engineering Design and Operational Feedback for Dependable Systems

by Haytham Younus, Felician Campean, Sohag Kabir, Pascal Bonnaud and David Delaux

Computers 2025, 14(11), 451; https://doi.org/10.3390/computers14110451 - 22 Oct 2025

Viewed by 314

Abstract

This paper proposes an integrated ontological framework, Integrated Systems Ontology (ISOnto), for dependable systems engineering by semantically linking design models with real-world operational failure data. Building upon the recently proposed Function–Behaviour–Structure–Failure Modes (FBSFM) framework, ISOnto integrates early-stage design information with field-level evidence to [...] Read more.

This paper proposes an integrated ontological framework, Integrated Systems Ontology (ISOnto), for dependable systems engineering by semantically linking design models with real-world operational failure data. Building upon the recently proposed Function–Behaviour–Structure–Failure Modes (FBSFM) framework, ISOnto integrates early-stage design information with field-level evidence to support more informed, traceable, and dependable failure analysis. This extends the semantic scope of the FBSFM ontology to include operational/field feedback from warranty claims and technical inspections, enabling two-way traceability between design-phase assumptions (functions, behaviours, structures, and failure modes) and field-reported failures, causes, and effects. As a theoretical contribution, ISOnto introduces a formal semantic bridge between design and operational phases, strengthening the validation of known failure modes and the discovery of previously undocumented ones. Developed using established ontology engineering practices and formalised in OWL with Protégé, it incorporates domain-specific extensions to represent field data with structured mappings to design entities. A real-world automotive case study conducted with a global manufacturer demonstrates ISOnto’s ability to consolidate multisource lifecycle data into a coherent, machine-readable repository. The framework supports advanced reasoning, structured querying, and system-level traceability, thereby facilitating continuous improvement, data-driven validation, and more reliable decision-making across product development and reliability engineering. Full article

(This article belongs to the Special Issue Recent Trends in Dependable and High Availability Systems)

► Show Figures

Figure 1

Search Results (1,734)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (1,734)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI