Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (21)

Search Parameters:
Keywords = multi-scale attention synergy

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
28 pages, 7608 KiB  
Article
A Forecasting Method for COVID-19 Epidemic Trends Using VMD and TSMixer-BiKSA Network
by Yuhong Li, Guihong Bi, Taonan Tong and Shirui Li
Computers 2025, 14(7), 290; https://doi.org/10.3390/computers14070290 - 18 Jul 2025
Viewed by 174
Abstract
The spread of COVID-19 is influenced by multiple factors, including control policies, virus characteristics, individual behaviors, and environmental conditions, exhibiting highly complex nonlinear dynamic features. The time series of new confirmed cases shows significant nonlinearity and non-stationarity. Traditional prediction methods that rely solely [...] Read more.
The spread of COVID-19 is influenced by multiple factors, including control policies, virus characteristics, individual behaviors, and environmental conditions, exhibiting highly complex nonlinear dynamic features. The time series of new confirmed cases shows significant nonlinearity and non-stationarity. Traditional prediction methods that rely solely on one-dimensional case data struggle to capture the multi-dimensional features of the data and are limited in handling nonlinear and non-stationary characteristics. Their prediction accuracy and generalization capabilities remain insufficient, and most existing studies focus on single-step forecasting, with limited attention to multi-step prediction. To address these challenges, this paper proposes a multi-module fusion prediction model—TSMixer-BiKSA network—that integrates multi-feature inputs, Variational Mode Decomposition (VMD), and a dual-branch parallel architecture for 1- to 3-day-ahead multi-step forecasting of new COVID-19 cases. First, variables highly correlated with the target sequence are selected through correlation analysis to construct a feature matrix, which serves as one input branch. Simultaneously, the case sequence is decomposed using VMD to extract low-complexity, highly regular multi-scale modal components as the other input branch, enhancing the model’s ability to perceive and represent multi-source information. The two input branches are then processed in parallel by the TSMixer-BiKSA network model. Specifically, the TSMixer module employs a multilayer perceptron (MLP) structure to alternately model along the temporal and feature dimensions, capturing cross-time and cross-variable dependencies. The BiGRU module extracts bidirectional dynamic features of the sequence, improving long-term dependency modeling. The KAN module introduces hierarchical nonlinear transformations to enhance high-order feature interactions. Finally, the SA attention mechanism enables the adaptive weighted fusion of multi-source information, reinforcing inter-module synergy and enhancing the overall feature extraction and representation capability. Experimental results based on COVID-19 case data from Italy and the United States demonstrate that the proposed model significantly outperforms existing mainstream methods across various error metrics, achieving higher prediction accuracy and robustness. Full article
Show Figures

Figure 1

21 pages, 21215 KiB  
Article
ES-Net Empowers Forest Disturbance Monitoring: Edge–Semantic Collaborative Network for Canopy Gap Mapping
by Yutong Wang, Zhang Zhang, Jisheng Xia, Fei Zhao and Pinliang Dong
Remote Sens. 2025, 17(14), 2427; https://doi.org/10.3390/rs17142427 - 12 Jul 2025
Viewed by 386
Abstract
Canopy gaps are vital microhabitats for forest carbon cycling and species regeneration, whose accurate extraction is crucial for ecological modeling and smart forestry. However, traditional monitoring methods have notable limitations: ground-based measurements are inefficient; remote-sensing interpretation is susceptible to terrain and spectral interference; [...] Read more.
Canopy gaps are vital microhabitats for forest carbon cycling and species regeneration, whose accurate extraction is crucial for ecological modeling and smart forestry. However, traditional monitoring methods have notable limitations: ground-based measurements are inefficient; remote-sensing interpretation is susceptible to terrain and spectral interference; and traditional algorithms exhibit an insufficient feature representation capability. Aiming at overcoming the bottleneck issues of canopy gap identification in mountainous forest regions, we constructed a multi-task deep learning model (ES-Net) integrating an edge–semantic collaborative perception mechanism. First, a refined sample library containing multi-scale interference features was constructed, which included 2808 annotated UAV images. Based on this, a dual-branch feature interaction architecture was designed. A cross-layer attention mechanism was embedded in the semantic segmentation module (SSM) to enhance the discriminative ability for heterogeneous features. Meanwhile, an edge detection module (EDM) was built to strengthen geometric constraints. Results from selected areas in Yunnan Province (China) demonstrate that ES-Net outperforms U-Net, boosting the Intersection over Union (IoU) by 0.86% (95.41% vs. 94.55%), improving the edge coverage rate by 3.14% (85.32% vs. 82.18%), and reducing the Hausdorff Distance by 38.6% (28.26 pixels vs. 46.02 pixels). Ablation studies further verify that the synergy between SSM and EDM yields a 13.0% IoU gain over the baseline, highlighting the effectiveness of joint semantic–edge optimization. This study provides a terrain-adaptive intelligent interpretation method for forest disturbance monitoring and holds significant practical value for advancing smart forestry construction and ecosystem sustainable management. Full article
Show Figures

Graphical abstract

24 pages, 3937 KiB  
Article
HyperTransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Hyperspectral Image Classification
by Xin Dai, Zexi Li, Lin Li, Shuihua Xue, Xiaohui Huang and Xiaofei Yang
Remote Sens. 2025, 17(14), 2361; https://doi.org/10.3390/rs17142361 - 9 Jul 2025
Viewed by 346
Abstract
Recent advances in hyperspectral image (HSI) classification have demonstrated the effectiveness of hybrid architectures that integrate convolutional neural networks (CNNs) and Transformers, leveraging CNNs for local feature extraction and Transformers for global dependency modeling. However, existing fusion approaches face three critical challenges: (1) [...] Read more.
Recent advances in hyperspectral image (HSI) classification have demonstrated the effectiveness of hybrid architectures that integrate convolutional neural networks (CNNs) and Transformers, leveraging CNNs for local feature extraction and Transformers for global dependency modeling. However, existing fusion approaches face three critical challenges: (1) insufficient synergy between spectral and spatial feature learning due to rigid coupling mechanisms; (2) high computational complexity resulting from redundant attention calculations; and (3) limited adaptability to spectral redundancy and noise in small-sample scenarios. To address these limitations, we propose HyperTransXNet, a novel CNN-Transformer hybrid architecture that incorporates adaptive spectral-spatial fusion. Specifically, the proposed HyperTransXNet comprises three key modules: (1) a Hybrid Spatial-Spectral Module (HSSM) that captures the refined local spectral-spatial features and models global spectral correlations by combining depth-wise dynamic convolution with frequency-domain attention; (2) a Mixture-of-Experts Routing (MoE-R) module that adaptively fuses multi-scale features by dynamically selecting optimal experts via Top-K sparse weights; and (3) a Spatial-Spectral Tokens Enhancer (SSTE) module that ensures causality-preserving interactions between spectral bands and spatial contexts. Extensive experiments on the Indian Pines, Houston 2013, and WHU-Hi-LongKou datasets demonstrate the superiority of HyperTransXNet. Full article
(This article belongs to the Special Issue AI-Driven Hyperspectral Remote Sensing of Atmosphere and Land)
Show Figures

Figure 1

17 pages, 2200 KiB  
Article
Visual Place Recognition Based on Dynamic Difference and Dual-Path Feature Enhancement
by Guogang Wang, Yizhen Lv, Lijie Zhao and Yunpeng Liu
Sensors 2025, 25(13), 3947; https://doi.org/10.3390/s25133947 - 25 Jun 2025
Viewed by 375
Abstract
Aiming at the problem of appearance drift and susceptibility to noise interference in visual place recognition (VPR), we propose DD–DPFE: a Dynamic Difference and Dual-Path Feature Enhancement method. Embedding differential attention mechanisms in the DINOv2 model to mitigate the effects of process interference [...] Read more.
Aiming at the problem of appearance drift and susceptibility to noise interference in visual place recognition (VPR), we propose DD–DPFE: a Dynamic Difference and Dual-Path Feature Enhancement method. Embedding differential attention mechanisms in the DINOv2 model to mitigate the effects of process interference and adding serial-parallel adapters allows efficient model parameter migration and task adaptation. Our method constructs a two-way feature enhancement module with global–local branching synergy. The global branch employs a dynamic fusion mechanism with a multi-layer Transformer encoder to strengthen the structured spatial representation to cope with appearance changes, while the local branch suppresses the over-response of redundant noise through an adaptive weighting mechanism and fuses the contextual information from the multi-scale feature aggregation module to enhance the robustness of the scene. The experimental results show that the model architecture proposed in this paper is an obvious improvement in different environmental tests. This is most obvious in the simulation test of a night scene, verifying that the proposed method can effectively enhance the discriminative power of the system and its anti-jamming ability in complex scenes. Full article
(This article belongs to the Section Electronic Sensors)
Show Figures

Figure 1

18 pages, 3936 KiB  
Article
BSE-YOLO: An Enhanced Lightweight Multi-Scale Underwater Object Detection Model
by Yuhang Wang, Hua Ye and Xin Shu
Sensors 2025, 25(13), 3890; https://doi.org/10.3390/s25133890 - 22 Jun 2025
Viewed by 623
Abstract
Underwater images often exhibit characteristics such as low contrast, blurred and small targets, object clustering, and considerable variations in object morphology. Traditional detection methods tend to be susceptible to omission and false positives under these circumstances. Furthermore, owing to the constrained memory and [...] Read more.
Underwater images often exhibit characteristics such as low contrast, blurred and small targets, object clustering, and considerable variations in object morphology. Traditional detection methods tend to be susceptible to omission and false positives under these circumstances. Furthermore, owing to the constrained memory and limited computing power of underwater robots, there is a significant demand for lightweight models in underwater object detection tasks. Therefore, we propose an enhanced lightweight YOLOv10n-based model, BSE-YOLO. Firstly, we replace the original neck with an improved Bidirectional Feature Pyramid Network (Bi-FPN) to reduce parameters. Secondly, we propose a Multi-Scale Attention Synergy Module (MASM) to enhance the model’s perception of difficult features and make it focus on the important regions. Finally, we integrate Efficient Multi-Scale Attention (EMA) into the backbone and neck to improve feature extraction and fusion. The experiment results demonstrate that the proposed BSE-YOLO reaches 83.7% mAP@0.5 on URPC2020 and 83.9% mAP@0.5 on DUO, with the parameters reducing 2.47 M. Compared to the baseline model YOLOv10n, our BSE-YOLO improves mAP@0.5 by 2.2% and 3.0%, respectively, while reducing the number of parameters by approximately 0.2 M. The BSE-YOLO achieves a good balance between accuracy and lightweight, providing an effective solution for underwater object detection. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

16 pages, 1058 KiB  
Article
Multi-Scale Context Enhancement Network with Local–Global Synergy Modeling Strategy for Semantic Segmentation on Remote Sensing Images
by Qibing Ma, Hongning Liu, Yifan Jin and Xinyue Liu
Electronics 2025, 14(13), 2526; https://doi.org/10.3390/electronics14132526 - 21 Jun 2025
Viewed by 310
Abstract
Semantic segmentation of remote sensing images is a fundamental task in geospatial analysis and Earth observation research, and has a wide range of applications in urban planning, land cover classification, and ecological monitoring. In complex geographic scenes, low target-background discriminability in overhead views [...] Read more.
Semantic segmentation of remote sensing images is a fundamental task in geospatial analysis and Earth observation research, and has a wide range of applications in urban planning, land cover classification, and ecological monitoring. In complex geographic scenes, low target-background discriminability in overhead views (e.g., indistinct boundaries, ambiguous textures, and low contrast) significantly complicates local–global information modeling and results in blurred boundaries and classification errors in model predictions. To address this issue, in this paper, we proposed a novel Multi-Scale Local–Global Mamba Feature Pyramid Network (MLMFPN) through designing a local–global information synergy modeling strategy, and guided and enhanced the cross-scale contextual information interaction in the feature fusion process to obtain quality semantic features to be used as cues for precise semantic reasoning. The proposed MLMFPN comprises two core components: Local–Global Align Mamba Fusion (LGAMF) and Context-Aware Cross-attention Interaction Module (CCIM). Specifically, LGAMF designs a local-enhanced global information modeling through asymmetric convolution for synergistic modeling of the receptive fields in vertical and horizontal directions, and further introduces the Vision Mamba structure to facilitate local–global information fusion. CCIM introduces positional encoding and cross-attention mechanisms to enrich the global-spatial semantics representation during multi-scale context information interaction, thereby achieving refined segmentation. The proposed methods are evaluated on the ISPRS Potsdam and Vaihingen datasets and the outperformance in the results verifies the effectiveness of the proposed method. Full article
Show Figures

Figure 1

16 pages, 2807 KiB  
Review
Research on the Rapid Curing Mechanism and Technology of Chinese Lacquer
by Jiangyan Hou, Tianyi Wang, Yao Wang, Xinhao Feng and Xinyou Liu
Polymers 2025, 17(12), 1596; https://doi.org/10.3390/polym17121596 - 7 Jun 2025
Viewed by 587
Abstract
Chinese lacquer, a historically significant bio-based coating, has garnered increasing attention in sustainable materials research due to its outstanding corrosion resistance, thermal stability, and environmental friendliness. Its curing process relies on the laccase-catalyzed oxidation and polymerization of urushiol to form a dense lacquer [...] Read more.
Chinese lacquer, a historically significant bio-based coating, has garnered increasing attention in sustainable materials research due to its outstanding corrosion resistance, thermal stability, and environmental friendliness. Its curing process relies on the laccase-catalyzed oxidation and polymerization of urushiol to form a dense lacquer film. However, the stringent temperature and humidity requirements (20–30 °C, 70–80% humidity) and a curing period that can extend over several weeks severely constrain its industrial application. Recent studies have significantly enhanced the curing efficiency through strategies such as pre-polymerization control, metal ion catalysis (e.g., Cu2+ reducing drying time to just one day), and nanomaterial modification (e.g., nano-Al2O3 increasing film hardness to 6H). Nevertheless, challenges remain, including the sensitivity of laccase activity to environmental fluctuations, the trade-off between accelerated curing and film performance, and issues related to toxic pigments and VOC emissions. Future developments should integrate enzyme engineering (e.g., directed evolution to broaden laccase tolerance), intelligent catalytic systems (e.g., photo-enzyme synergy), and green technologies (e.g., UV curing), complemented by multiscale modeling and circular design strategies, to drive the innovative applications of Chinese lacquer in high-end fields such as aerospace sealing and cultural heritage preservation. Full article
(This article belongs to the Section Polymer Composites and Nanocomposites)
Show Figures

Figure 1

19 pages, 12185 KiB  
Article
Dual-Domain Adaptive Synergy GAN for Enhancing Low-Light Underwater Images
by Dechuan Kong, Jinglong Mao, Yandi Zhang, Xiaohu Zhao, Yanyan Wang and Shungang Wang
J. Mar. Sci. Eng. 2025, 13(6), 1092; https://doi.org/10.3390/jmse13061092 - 30 May 2025
Viewed by 662
Abstract
The increasing application of underwater robotic systems in deep-sea exploration, inspection, and resource extraction has created a strong demand for reliable visual perception under challenging conditions. However, image quality is severely degraded in low-light underwater environments due to the combined effects of light [...] Read more.
The increasing application of underwater robotic systems in deep-sea exploration, inspection, and resource extraction has created a strong demand for reliable visual perception under challenging conditions. However, image quality is severely degraded in low-light underwater environments due to the combined effects of light absorption and scattering, resulting in color imbalance, low contrast, and illumination instability. These factors limit the effectiveness of visual-based autonomous operations. We propose ATS-UGAN, a Dual-domain Adaptive Synergy Generative Adversarial Network for low-light underwater image enhancement to confront the above issues. The network integrates Multi-scale Hybrid Attention (MHA) that synergizes spatial and frequency domain representations to capture key image features adaptively. An Adaptive Parameterized Convolution (AP-Conv) module is introduced to handle non-uniform scattering by dynamically adjusting convolution kernels through a multi-branch design. In addition, a Dynamic Content-aware Markovian Discriminator (DCMD) is employed to perceive the dual-domain information synergistically, enhancing image texture realism and improving color correction. Extensive experiments on benchmark underwater datasets demonstrate that ATS-UGAN surpasses state-of-the-art approaches, achieving 28.7/0.92 PSNR/SSIM on EUVP and 28.2/0.91 on UFO-120. Additional reference and no-reference metrics further confirm the improved visual quality and realism of the enhanced images. Full article
(This article belongs to the Section Ocean Engineering)
Show Figures

Figure 1

26 pages, 5390 KiB  
Article
DLF-YOLO: A Dynamic Synergy Attention-Guided Lightweight Framework for Few-Shot Clothing Trademark Defect Detection
by Kefeng Chen, Xinpiao Zhou and Jia Ren
Electronics 2025, 14(11), 2113; https://doi.org/10.3390/electronics14112113 - 22 May 2025
Viewed by 635
Abstract
To address key challenges in clothing trademark quality inspection—namely, insufficient defect samples, unstable performance in complex industrial environments, and low detection efficiency—this paper proposes DLF-YOLO, an enhanced YOLOv11-based model optimized for industrial deployment. To mitigate the problem of limited annotated data, an unsupervised [...] Read more.
To address key challenges in clothing trademark quality inspection—namely, insufficient defect samples, unstable performance in complex industrial environments, and low detection efficiency—this paper proposes DLF-YOLO, an enhanced YOLOv11-based model optimized for industrial deployment. To mitigate the problem of limited annotated data, an unsupervised generative network, CycleGAN, is employed to synthesize rare defect patterns and simulate diverse environmental conditions (e.g., rotation, noise, and contrast variations), thereby improving data diversity and model generalization. To reduce the impact of industrial noise, a novel multi-scale dynamic synergy attention (MDSA) attention mechanism is introduced, which utilizes dual attention in both channel and spatial dimensions to focus more accurately on key regions of the trademark, effectively suppressing false detections caused by lighting variations and fabric textures. Furthermore, the high-level selective feature pyramid network (HS-FPN) module is adopted to make the neck structure more lightweight, where the feature selection sub-module enhances the perception of fine edge defects, while the feature fusion sub-module achieves a balance between model lightweighting and detection accuracy through the aggregation of hierarchical multi-scale context information. In the backbone, DWConv replaces standard convolutions before the C3k2 module to reduce computational complexity, and HetConv is integrated into the C3k2 module to simultaneously reduce computational cost and enhance feature extraction capabilities, achieving the goal of maintaining model accuracy. Experimental results on a custom-built dataset demonstrate that DLF-YOLO achieves an mAP@0.5:0.95 of 80.2%, with a 49.6% reduction in parameters and a 25.6% reduction in computational load compared to the original YOLOv11. These results highlight the potential of DLF-YOLO as a scalable and efficient solution for lightweight, industrial-grade defect detection in clothing trademarks. Full article
Show Figures

Figure 1

19 pages, 2059 KiB  
Article
Synergistic Multi-Granularity Rough Attention UNet for Polyp Segmentation
by Jing Wang and Chia S. Lim
J. Imaging 2025, 11(4), 92; https://doi.org/10.3390/jimaging11040092 - 21 Mar 2025
Viewed by 562
Abstract
Automatic polyp segmentation in colonoscopic images is crucial for the early detection and treatment of colorectal cancer. However, complex backgrounds, diverse polyp morphologies, and ambiguous boundaries make this task difficult. To address these issues, we propose the Synergistic Multi-Granularity Rough Attention U-Net (S-MGRAUNet), [...] Read more.
Automatic polyp segmentation in colonoscopic images is crucial for the early detection and treatment of colorectal cancer. However, complex backgrounds, diverse polyp morphologies, and ambiguous boundaries make this task difficult. To address these issues, we propose the Synergistic Multi-Granularity Rough Attention U-Net (S-MGRAUNet), which integrates three key modules: the Multi-Granularity Hybrid Filtering (MGHF) module for extracting multi-scale contextual information, the Dynamic Granularity Partition Synergy (DGPS) module for enhancing polyp-background differentiation through adaptive feature interaction, and the Multi-Granularity Rough Attention (MGRA) mechanism for further optimizing boundary recognition. Extensive experiments on the ColonDB and CVC-300 datasets demonstrate that S-MGRAUNet significantly outperforms existing methods while achieving competitive results on the Kvasir-SEG and ClinicDB datasets, validating its segmentation accuracy, robustness, and generalization capability, all while effectively reducing computational complexity. This study highlights the value of multi-granularity feature extraction and attention mechanisms, providing new insights and practical guidance for advancing multi-granularity theories in medical image segmentation. Full article
Show Figures

Figure 1

35 pages, 2572 KiB  
Review
A Review of Condition Monitoring of Permanent Magnet Synchronous Machines: Techniques, Challenges and Future Directions
by Alexandros Sergakis, Marios Salinas, Nikolaos Gkiolekas and Konstantinos N. Gyftakis
Energies 2025, 18(5), 1177; https://doi.org/10.3390/en18051177 - 27 Feb 2025
Cited by 6 | Viewed by 2074
Abstract
This paper focuses on the latest advancements in diagnosing faults in Permanent Magnet Synchronous Machines (PMSMs), with particular attention paid to demagnetization, inter-turn short circuits (ITSCs), and eccentricity faults. As PMSMs play an important role in electric vehicles, renewable energy systems and aerospace [...] Read more.
This paper focuses on the latest advancements in diagnosing faults in Permanent Magnet Synchronous Machines (PMSMs), with particular attention paid to demagnetization, inter-turn short circuits (ITSCs), and eccentricity faults. As PMSMs play an important role in electric vehicles, renewable energy systems and aerospace applications, ensuring their reliability is more important than ever. This work examines widely applied methods like Motor Current Signature Analysis (MCSA) and flux monitoring, alongside more recent approaches such as time-frequency analysis, observer-based techniques and machine learning strategies. These methods are discussed in terms of strengths/weaknesses, challenges and suitability for different operating conditions. The review also highlights the importance of experimental validations to connect theoretical research with real-world applications. By exploring potential synergies between these diagnostic methods, the paper outlines ways to improve fault detection accuracy and machine reliability. It concludes by identifying future research directions, such as developing real-time diagnostics, enhancing predictive maintenance and refining sensor and computational technologies, aiming to make PMSMs more robust and fault-tolerant in demanding environments. In addition, the discussion highlights how partial demagnetization or ITSC faults may propagate if not diagnosed promptly, necessitating scalable and efficient multi-physics approaches. Finally, emphasis is placed on bridging theoretical advancements with industrial-scale implementations to ensure seamless integration into existing machine drive systems. Full article
(This article belongs to the Section A: Sustainable Energy)
Show Figures

Figure 1

30 pages, 13159 KiB  
Article
GLMAFuse: A Dual-Stream Infrared and Visible Image Fusion Framework Integrating Local and Global Features with Multi-Scale Attention
by Fu Li, Yanghai Gu, Ming Zhao, Deji Chen and Quan Wang
Electronics 2024, 13(24), 5002; https://doi.org/10.3390/electronics13245002 - 19 Dec 2024
Viewed by 1010
Abstract
Integrating infrared and visible-light images facilitates a more comprehensive understanding of scenes by amalgamating dual-sensor data derived from identical environments. Traditional CNN-based fusion techniques are predominantly confined to local feature emphasis due to their inherently limited receptive fields. Conversely, Transformer-based models tend to [...] Read more.
Integrating infrared and visible-light images facilitates a more comprehensive understanding of scenes by amalgamating dual-sensor data derived from identical environments. Traditional CNN-based fusion techniques are predominantly confined to local feature emphasis due to their inherently limited receptive fields. Conversely, Transformer-based models tend to prioritize global information, which can lead to a deficiency in feature diversity and detail retention. Furthermore, methods reliant on single-scale feature extraction are inadequate for capturing extensive scene information. To address these limitations, this study presents GLMAFuse, an innovative dual-stream encoder–decoder network, which utilizes a multi-scale attention mechanism to harmoniously integrate global and local features. This framework is designed to maximize the extraction of multi-scale features from source images while effectively synthesizing local and global information across all layers. We introduce the global-aware and local embedding (GALE) module to adeptly capture and merge global structural attributes and localized details from infrared and visible imagery via a parallel dual-branch architecture. Additionally, the multi-scale attention fusion (MSAF) module is engineered to optimize attention weights at the channel level, facilitating an enhanced synergy between high-frequency edge details and global backgrounds. This promotes effective interaction and fusion of dual-modal features. Extensive evaluations using standard datasets demonstrate that GLMAFuse surpasses the existing leading methods in both qualitative and quantitative assessments, highlighting its superior capability in infrared and visible image fusion. On the TNO and MSRS datasets, our method achieves outstanding performance across multiple metrics, including EN (7.15, 6.75), SD (46.72, 47.55), SF (12.79, 12.56), MI (2.21, 3.22), SCD (1.75, 1.80), VIF (0.79, 1.08), Qbaf (0.58, 0.71), and SSIM (0.99, 1.00). These results underscore its exceptional proficiency in infrared and visible image fusion. Full article
(This article belongs to the Special Issue Artificial Intelligence Innovations in Image Processing)
Show Figures

Figure 1

17 pages, 3307 KiB  
Article
MCADNet: A Multi-Scale Cross-Attention Network for Remote Sensing Image Dehazing
by Tao Tao, Haoran Xu, Xin Guan and Hao Zhou
Mathematics 2024, 12(23), 3650; https://doi.org/10.3390/math12233650 - 21 Nov 2024
Viewed by 1402
Abstract
Remote sensing image dehazing (RSID) aims to remove haze from remote sensing images to enhance their quality. Although existing deep learning-based dehazing methods have made significant progress, it is still difficult to completely remove the uneven haze, which often leads to color or [...] Read more.
Remote sensing image dehazing (RSID) aims to remove haze from remote sensing images to enhance their quality. Although existing deep learning-based dehazing methods have made significant progress, it is still difficult to completely remove the uneven haze, which often leads to color or structural differences between the dehazed image and the original image. In order to overcome this difficulty, we propose the multi-scale cross-attention dehazing network (MCADNet), which offers a powerful solution for RSID. MCADNet integrates multi-kernel convolution and a multi-head attention mechanism into the U-Net architecture, enabling effective multi-scale information extraction. Additionally, we replace traditional skip connections with a cross-attention-based gating module, enhancing feature extraction and fusion across different scales. This synergy enables the network to maximize the overall similarity between the restored image and the real image while also restoring the details of the complex texture areas in the image. We evaluate MCADNet on two benchmark datasets, Haze1K and RICE, demonstrating its superior performance. Ablation experiments further verify the importance of our key design choices in enhancing dehazing effectiveness. Full article
(This article belongs to the Special Issue Image Processing and Machine Learning with Applications)
Show Figures

Figure 1

15 pages, 2391 KiB  
Article
Multispectral Pedestrian Detection Based on Prior-Saliency Attention and Image Fusion
by Jiaren Guo, Zihao Huang and Yanyun Tao
Electronics 2024, 13(9), 1770; https://doi.org/10.3390/electronics13091770 - 3 May 2024
Cited by 2 | Viewed by 1621
Abstract
Detecting pedestrians in varying illumination conditions poses a significant challenge, necessitating the development of innovative solutions. In response to this, we introduce Prior-AttentionNet, a pedestrian detection model featuring a Prior-Attention mechanism. This model leverages the stark contrast between thermal objects and their backgrounds [...] Read more.
Detecting pedestrians in varying illumination conditions poses a significant challenge, necessitating the development of innovative solutions. In response to this, we introduce Prior-AttentionNet, a pedestrian detection model featuring a Prior-Attention mechanism. This model leverages the stark contrast between thermal objects and their backgrounds in far-infrared (FIR) images by employing saliency attention derived from FIR images via UNet. However, extracting salient regions of diverse scales from FIR images poses a challenge for saliency attention. To address this, we integrate Simple Linear Iterative Clustering (SLIC) superpixel segmentation, embedding the segmentation feature map as prior knowledge into UNet’s decoding stage for comprehensive end-to-end training and detection. This integration enhances the extraction of focused attention regions, with the synergy of segmentation prior and saliency attention forming the core of Prior-AttentionNet. Moreover, to enrich pedestrian details and contour visibility in low-light conditions, we implement multispectral image fusion. Experimental evaluations were conducted on the KAIST and OTCBVS datasets. Applying Prior-Attention mode to FIR-RGB images significantly improves the delineation and focus on multi-scale pedestrians. Prior-AttentionNet’s general detector demonstrates the capability of detecting pedestrians with minimal computational resources. The ablation studies indicate that the FIR-RGB+ Prior-Attention mode markedly enhances detection robustness over other modes. When compared to conventional multispectral pedestrian detection models, Prior-AttentionNet consistently surpasses them by achieving higher mean average precision and lower miss rates in diverse scenarios, during both day and night. Full article
(This article belongs to the Section Computer Science & Engineering)
Show Figures

Figure 1

19 pages, 4252 KiB  
Review
Functionalization of Fabrics with Graphene-Based Coatings: Mechanisms, Approaches, and Functions
by Yang Liu, Bin Fei and John H. Xin
Coatings 2023, 13(9), 1580; https://doi.org/10.3390/coatings13091580 - 11 Sep 2023
Cited by 5 | Viewed by 3584
Abstract
Due to their unique surface-active functionalities, graphene and its derivatives, i.e., graphene oxide (GO) and reduced graphene oxide (rGO), have received enormous research attention in recent decades. One of the most intriguing research hot spots is the integration of GO and rGO coatings [...] Read more.
Due to their unique surface-active functionalities, graphene and its derivatives, i.e., graphene oxide (GO) and reduced graphene oxide (rGO), have received enormous research attention in recent decades. One of the most intriguing research hot spots is the integration of GO and rGO coatings on textiles through dyeing methods, e.g., dip-pad-dry. In general, the GO sheets can quickly diffuse into the fabric matrix and deposit onto the surface of the fibers through hydrogen bonding. The GO sheets can be conformally coated on the fiber surface, forming strong adhesion as a result of the high flakiness ratio, mechanical strength, and deformability. Moreover, multiple functions with application significance, e.g., anti-bacteria, UV protection, conductivity, and wetting control, can be achieved on the GO and rGO-coated fabrics as a result of the intrinsic chemical, physical, electronic, and amphiphilic properties of GO and rGO. On the other hand, extrinsic functions, including self-cleaning, self-healing, directional water transport, and oil/water separation, can be achieved for the GO and rGO coatings by the integration of other functional materials. Therefore, multi-scale, multifunctional, smart fabrics with programmable functions and functional synergy can be achieved by the design and preparation of the hybrid GO and rGO coatings, while advanced applications, e.g., healthcare clothing, E-textiles, anti-fouling ultrafiltration membranes, can be realized. In this review, we aim to provide an in-depth overview of the existing methods for functionalizing fabrics with graphene-based coatings while the corresponding functional performance, underlying mechanisms and applications are highlighted and discussed, which may provide useful insights for the design and fabrication of functional textiles and fabrics for different applications. Full article
(This article belongs to the Special Issue Advanced Nanocomposite Coatings for Biomedical Engineering)
Show Figures

Figure 1

Back to TopTop