Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (156)

Search Parameters:
Keywords = cosine loss

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
15 pages, 3599 KB  
Article
High-Fidelity rPPG Waveform Reconstruction from Palm Videos Using GANs
by Tao Li and Yuliang Liu
Sensors 2026, 26(2), 563; https://doi.org/10.3390/s26020563 - 14 Jan 2026
Abstract
Remote photoplethysmography (rPPG) enables non-contact acquisition of human physiological parameters using ordinary cameras, and has been widely applied in medical monitoring, human–computer interaction, and health management. However, most existing studies focus on estimating specific physiological metrics, such as heart rate and heart rate [...] Read more.
Remote photoplethysmography (rPPG) enables non-contact acquisition of human physiological parameters using ordinary cameras, and has been widely applied in medical monitoring, human–computer interaction, and health management. However, most existing studies focus on estimating specific physiological metrics, such as heart rate and heart rate variability, while paying insufficient attention to reconstructing the underlying rPPG waveform. In addition, publicly available datasets typically record facial videos accompanied by fingertip PPG signals as reference labels. Since fingertip PPG waveforms differ substantially from the true photoplethysmography (PPG) signals obtained from the face, deep learning models trained on such datasets often struggle to recover high-quality rPPG waveforms. To address this issue, we collected a new dataset consisting of palm-region videos paired with wrist-based PPG signals as reference labels, and experimentally validated its effectiveness for training neural network models aimed at rPPG waveform reconstruction. Furthermore, we propose a generative adversarial network (GAN)-based pulse-wave synthesis framework that produces high-quality rPPG waveforms by denoising the mean green-channel signal. By incorporating time-domain peak-aware loss, frequency-domain loss, and adversarial loss, our method achieves promising performance, with an RMSE (Root Mean Square Error) of 0.102, an MAPE (Mean Absolute Percentage Error) of 0.028, a Pearson correlation of 0.987, and a cosine similarity of 0.989. These results demonstrate the capability of the proposed approach to reconstruct high-fidelity rPPG waveforms with improved morphological accuracy compared to noisy raw rPPG signals, rather than directly validating health monitoring performance. This study presents a high-quality rPPG waveform reconstruction approach from both data and model perspectives, providing a reliable foundation for subsequent physiological signal analysis, waveform-based studies, and potential health-related applications. Full article
(This article belongs to the Special Issue Systems for Contactless Monitoring of Vital Signs)
Show Figures

Figure 1

23 pages, 3153 KB  
Article
SSCW-YOLO: A Lightweight and High-Precision Model for Small Object Detection in UAV Scenarios
by Zhuolun He, Rui She, Bo Tan, Jiajian Li and Xiaolong Lei
Drones 2026, 10(1), 41; https://doi.org/10.3390/drones10010041 - 7 Jan 2026
Viewed by 360
Abstract
To address the problems of missed and false detections caused by insufficient feature quality in small object detection from UAV perspectives, this paper proposes a UAV small object detection algorithm based on YOLOv8 feature optimization. A spatial cosine convolution module is introduced into [...] Read more.
To address the problems of missed and false detections caused by insufficient feature quality in small object detection from UAV perspectives, this paper proposes a UAV small object detection algorithm based on YOLOv8 feature optimization. A spatial cosine convolution module is introduced into the backbone network to optimize spatial features, thereby alleviating the problem of small object feature loss and improving the detection accuracy and speed of the model. An improved C2f_SCConv feature fusion module is employed for feature integration, which effectively reduces feature redundancy in spatial and channel dimensions, thereby lowering model complexity and computational cost. Meanwhile, the WIoU loss function is used to replace the original CIoU loss function, reducing the interference of geometric factors in anchor box regression, enabling the model to focus more on low-quality anchor boxes, and enhancing its small object detection capability. Ablation and comparative experiments on the VisDrone dataset validate the effectiveness of the proposed algorithm for small object detection from UAV perspectives, while generalization experiments on the DOTA and SSDD datasets demonstrate that the algorithm possesses strong generalization performance. Full article
Show Figures

Figure 1

19 pages, 5899 KB  
Article
Small-Signal Modeling of Asymmetric PWM Control-Based Parallel Resonant Converter
by Na-Yeon Kim and Kui-Jun Lee
Electronics 2025, 14(24), 4970; https://doi.org/10.3390/electronics14244970 - 18 Dec 2025
Viewed by 238
Abstract
This paper proposes a small-signal model of a DC–DC parallel resonant converter operating in continuous conduction mode based on asymmetric pulse-width modulation (APWM) under light-load conditions. The parallel resonant converter enables soft switching and no-load control over a wide load range because the [...] Read more.
This paper proposes a small-signal model of a DC–DC parallel resonant converter operating in continuous conduction mode based on asymmetric pulse-width modulation (APWM) under light-load conditions. The parallel resonant converter enables soft switching and no-load control over a wide load range because the resonant capacitor is connected in parallel with the load. However, the resonant energy required for soft switching is already sufficient, and the current flowing through the resonant tank is independent of the load magnitude; therefore, as the load decreases, the energy that is not delivered to the load and instead circulates meaninglessly inside the resonant tank increases. This results in conduction loss and reduced efficiency. To address this issue, APWM with a fixed switching frequency is required, which reduces circulating energy and improves efficiency under light-load conditions. Precise small-signal modeling is required to optimize the APWM controller. Unlike PFM or PSFB, APWM includes not only sine components but also DC and cosine components in the control signal due to its asymmetric switching characteristics, and this study proposes a small-signal model that can relatively accurately reflect these multi-harmonic characteristics. The proposed model is derived based on the Extended Describing Function (EDF) concept, and the derived transfer function is useful for systematically analyzing the dynamic characteristics of the APWM-based parallel resonant converter. In addition, it provides information that can systematically analyze the dynamic characteristics of various APWM-based resonant converters and control signals that reflect various harmonic characteristics, and it can be widely applied to future control design and analysis studies. The validity of the model is verified through MATLAB (R2025b) and PLECS (4.7.5) switching-model simulations and experimental results, confirming its high accuracy and practicality. Full article
(This article belongs to the Special Issue New Insights in Power Electronics: Prospects and Challenges)
Show Figures

Figure 1

21 pages, 1505 KB  
Article
WaveletHSI: Direct HSI Classification from Compressed Wavelet Coefficients via Sub-Band Feature Extraction and Fusion
by Xin Li and Baile Sun
J. Imaging 2025, 11(12), 441; https://doi.org/10.3390/jimaging11120441 - 10 Dec 2025
Viewed by 385
Abstract
A major computational bottleneck in classifying large-scale hyperspectral images (HSI) is the mandatory data decompression prior to processing. Compressed-domain computing offers a solution by enabling deep learning on partially compressed data. However, existing compressed-domain methods are predominantly tailored for the Discrete Cosine Transform [...] Read more.
A major computational bottleneck in classifying large-scale hyperspectral images (HSI) is the mandatory data decompression prior to processing. Compressed-domain computing offers a solution by enabling deep learning on partially compressed data. However, existing compressed-domain methods are predominantly tailored for the Discrete Cosine Transform (DCT) used in natural images, while HSIs are typically compressed using the Discrete Wavelet Transform (DWT). The fundamental structural mismatch between the block-based DCT and the hierarchical DWT sub-bands presents two core challenges: how to extract features from multiple wavelet sub-bands, and how to fuse these features effectively? To address these issues, we propose a novel framework that extracts and fuses features from different DWT sub-bands directly. We design a multi-branch feature extractor with sub-band feature alignment loss that processes functionally different sub-bands in parallel, preserving the independence of each frequency feature. We then employ a sub-band cross-attention mechanism that inverts the typical attention paradigm by using the sparse, high-frequency detail sub-bands as queries to adaptively select and enhance salient features from the dense, information-rich low-frequency sub-bands. This enables a targeted fusion of global context and fine-grained structural information without data reconstruction. Experiments on three benchmark datasets demonstrate that our method achieves classification accuracy comparable to state-of-the-art spatial-domain approaches while eliminating at least 56% of the decompression overhead. Full article
(This article belongs to the Special Issue Multispectral and Hyperspectral Imaging: Progress and Challenges)
Show Figures

Figure 1

27 pages, 2900 KB  
Article
Graph-SENet: An Unsupervised Learning-Based Graph Neural Network for Skeleton Extraction from Point Cloud
by Jie Li, Wei Guo and Wenli Zhang
Future Internet 2025, 17(12), 558; https://doi.org/10.3390/fi17120558 - 3 Dec 2025
Viewed by 401
Abstract
Extracting 3D skeletons from point clouds is a challenging task in computer vision. Most existing deep learning methods rely heavily on supervised data requiring extensive manual annotation. Consequently, re-labeling is often necessary for cross-category applications, while the process of 3D point cloud annotation [...] Read more.
Extracting 3D skeletons from point clouds is a challenging task in computer vision. Most existing deep learning methods rely heavily on supervised data requiring extensive manual annotation. Consequently, re-labeling is often necessary for cross-category applications, while the process of 3D point cloud annotation is inherently time-consuming and expensive. Simultaneously, existing unsupervised methods often suffer from significant skeleton point deviations due to limited capabilities in modeling local structures. To address these limitations, we propose Graph-SENet, an unsupervised learning-based graph neural network method for skeleton extraction. This method integrates dynamic graph convolution with a multi-level feature fusion mechanism to more comprehensively capture local geometric relationships. Through a multi-dimensional unsupervised feature loss, it learns the structural representation of skeleton points, significantly improving the precision and stability of skeleton point localization under annotation-free conditions. Furthermore, we propose a graph autoencoder structure optimized by cosine similarity to predict topological connections between skeleton points, thereby recovering semantically consistent and structurally complete 3D skeleton representations in an end-to-end manner. Experimental results on multiple datasets, including ShapeNet, ITOP, and Soybean-MVS, demonstrate that Graph-SENet outperforms existing mainstream unsupervised methods in terms of Chamfer Distance and F1-score. It exhibits superior accuracy, robustness, and cross-category generalization capabilities, effectively reducing manual annotation costs while enhancing the completeness and semantic consistency of skeleton recovery. These results validate the application potential and practical value of Graph-SENet in 3D structure understanding and downstream 3D analysis tasks. Full article
(This article belongs to the Special Issue Algorithms and Models for Next-Generation Vision Systems)
Show Figures

Figure 1

20 pages, 1536 KB  
Article
Contrastive Learning-Based One-Class Classification for Intelligent Manufacturing System
by Seunghwan Song
Machines 2025, 13(12), 1109; https://doi.org/10.3390/machines13121109 - 1 Dec 2025
Viewed by 435
Abstract
Time-series anomaly detection is imperative for ensuring reliability and safety in intelligent manufacturing systems. However, real-world environments typically provide only normal operating data and exhibit significant periodicity, noise, imbalance, and domain variability. The present study proposes CL-OCC, a contrastive learning-based one-class framework that [...] Read more.
Time-series anomaly detection is imperative for ensuring reliability and safety in intelligent manufacturing systems. However, real-world environments typically provide only normal operating data and exhibit significant periodicity, noise, imbalance, and domain variability. The present study proposes CL-OCC, a contrastive learning-based one-class framework that integrates seasonal-trend decomposition using loess (STL) for structure-preserving temporal augmentation, a cosine-regularized soft boundary for compact normal-region formation, and variance-preserving regularization to prevent latent collapse. A convolutional recurrent encoder is first pretrained via an autoencoder objective and subsequently optimized through a unified loss that balances contrastive invariance, soft-boundary constraint, and variance dispersion. Experiments on semiconductor equipment data and three public benchmarks demonstrate that CL-OCC provides competitive or superior performance relative to reconstruction-, prediction-, and contrastive-based baselines. CL-OCC exhibits smoother anomaly trajectories, earlier detection of gradual drifts, and strong robustness to noise, window-length variation, and extreme class imbalance. A study of the effects of ablation and interaction on the stability of representations indicates that STL-based augmentation, boundary shaping, and variance regularization contribute complementary benefits to this stability. While the qualitative results indicate limited sensitivity to extremely short impulsive disturbances, the proposed framework delivers a generalizable and stable solution for unsupervised industrial monitoring, with promising potential for extension to multi-resolution analysis and online prognostics and health management (PHM) applications. Full article
(This article belongs to the Special Issue Smart Manufacturing and Industrial Automation, 2nd Edition)
Show Figures

Figure 1

17 pages, 1121 KB  
Article
TASA: Text-Anchored State–Space Alignment for Long-Tailed Image Classification
by Long Li, Tinglei Jia, Huaizhi Yue, Huize Cheng, Yongfeng Bu and Zhaoyang Zhang
J. Imaging 2025, 11(11), 410; https://doi.org/10.3390/jimaging11110410 - 13 Nov 2025
Viewed by 560
Abstract
Long-tailed image classification remains challenging for vision–language models. Head classes dominate training while tail classes are underrepresented and noisy, and short prompts with weak text supervision further amplify head bias. This paper presents TASA, an end-to-end framework that stabilizes textual supervision and enhances [...] Read more.
Long-tailed image classification remains challenging for vision–language models. Head classes dominate training while tail classes are underrepresented and noisy, and short prompts with weak text supervision further amplify head bias. This paper presents TASA, an end-to-end framework that stabilizes textual supervision and enhances cross-modal fusion. A Semantic Distribution Modulation (SDM) module constructs class-specific text prototypes by cosine-weighted fusion of multiple LLM-generated descriptions with a canonical template, providing stable and diverse semantic anchors without training text parameters. Dual-Space Cross-Modal Fusion (DCF) module incorporates selective-scan state–space blocks into both image and text branches, enabling bidirectional conditioning and efficient feature fusion through a lightweight multilayer perceptron. Together with a margin-aware alignment loss, TASA aligns images with class prototypes for classification without requiring paired image–text data or per-class prompt tuning. Experiments on CIFAR-10/100-LT, ImageNet-LT, and Places-LT demonstrate consistent improvements across many-, medium-, and few-shot groups. Ablation studies confirm that DCF yields the largest single-module gain, while SDM and DCF combined provide the most robust and balanced performance. These results highlight the effectiveness of integrating text-driven prototypes with state–space fusion for long-tailed classification. Full article
(This article belongs to the Section Image and Video Processing)
Show Figures

Figure 1

24 pages, 1470 KB  
Article
Integrating Ecological Semantic Encoding and Distribution-Aligned Loss for Multimodal Forest Ecosystem
by Jing Peng, Zhengjie Fu, Huachen Zhou, Yibin Liu, Yang Zhang, Rui Shi, Jiangfeng Li and Min Dong
Forests 2025, 16(11), 1697; https://doi.org/10.3390/f16111697 - 7 Nov 2025
Viewed by 573
Abstract
In this study, a cross-hierarchical intelligent modeling framework integrating an ecological semantic encoder, a distribution-aligned contrastive loss, and a disturbance-aware attention mechanism was developed to address the semantic alignment challenge between aboveground vegetation and belowground seed banks within forest ecosystems. The proposed framework [...] Read more.
In this study, a cross-hierarchical intelligent modeling framework integrating an ecological semantic encoder, a distribution-aligned contrastive loss, and a disturbance-aware attention mechanism was developed to address the semantic alignment challenge between aboveground vegetation and belowground seed banks within forest ecosystems. The proposed framework leverages artificial intelligence and deep learning to characterize the structural and functional coupling between vegetation and soil communities, thereby elucidating the ecological mechanisms that underlie forest regeneration and stability. Experiments using representative forest ecological plot datasets demonstrated that the model achieved a top-1 accuracy of 78.6%, a top-5 accuracy of 89.3%, a mean cosine similarity of 0.784, and a reduced Kullback–Leibler divergence of 0.128, while the Jaccard index increased to 0.512—surpassing traditional statistical and machine-learning baselines such as RDA, CCA, Procrustes, Siamese, and SimCLR. The model also reduced NMDS stress to 0.094 and improved the Sørensen coefficient to 0.713, reflecting high robustness and precision in reconstructing community structure and ecological distributions. Additionally, the integration of distribution alignment and disturbance-aware mechanisms allows the model to capture dynamic vegetation–soil feedbacks across environmental gradients and disturbance regimes. This enables more accurate identification of regeneration potential, resilience thresholds, and restoration trajectories in degraded forests. Overall, the framework provides a novel theoretical foundation and a data-driven pathway for applying artificial intelligence to forest ecosystem monitoring, degradation diagnosis, and adaptive management for sustainable recovery. Full article
Show Figures

Figure 1

22 pages, 10561 KB  
Article
FSCA-EUNet: Lightweight Classification of Stacked Jasmine Bloom-Stages via Frequency–Spatial Cross-Attention for Industrial Scenting Automation
by Zhiwei Chen, Zhengrui Tian, Haowen Zhang, Xingmin Zhang, Xuesong Zhu and Chunwang Dong
Foods 2025, 14(21), 3780; https://doi.org/10.3390/foods14213780 - 4 Nov 2025
Viewed by 615
Abstract
To address the challenge of monitoring the postharvest jasmine bloom stages during industrial tea scenting processes, this study proposes an efficient U-shaped Network (U-Net) model with frequency–spatial cross-attention (FSCA-EUNet) to resolve critical bottlenecks, including repetitive backgrounds and small interclass differences, caused by stacked [...] Read more.
To address the challenge of monitoring the postharvest jasmine bloom stages during industrial tea scenting processes, this study proposes an efficient U-shaped Network (U-Net) model with frequency–spatial cross-attention (FSCA-EUNet) to resolve critical bottlenecks, including repetitive backgrounds and small interclass differences, caused by stacked jasmine flowers during factory production. High-resolution images of stacked jasmine flowers were first preprocessed and input into FSCA-EUNet, where the encoder extracted multi-scale spatial features and the FSCA module incorporated frequency-domain textures. The decoder then fused and refined these features, and the final classification layer output the predicted bloom stage for each image. The proposed model was designed as a “U-Net”-like structure to preserve multiscale details and employed a frequency–spatial cross-attention module to extract high-frequency texture features via a discrete cosine transform. Long-range dependencies were established by NonLocalBlook, located after the encoders in the model. Finally, a momentum-updated center loss function was introduced to constrain the feature space distribution and enhance intraclass compactness. According to the experimental results, the proposed model achieved the best metrics, including 95.52% precision, 95.42% recall, 95.40% F1-score, and 97.24% mean average precision, on our constructed dataset with only 878.851 K parameters and 15.445 G Floating Point Operations (FLOPs), and enabled real-time deployment at 22.33 FPS on Jetson Orin NX edge devices. The ablation experiments validated the improvements contributed by each module, which significantly improved the fine-grained classification capability of the proposed network. In conclusion, FSCA-EUNet effectively addresses the challenges of stacked flower backgrounds and subtle interclass differences, offering a lightweight yet accurate framework that enables real-time deployment for industrial jasmine tea scenting automation. Full article
Show Figures

Figure 1

45 pages, 2671 KB  
Article
Mathematical Model for Economic Optimization of Tower-Type Solar Thermal Power Generation Systems via Coupled Monte Carlo Ray-Tracing and Multi-Mechanism Heat Loss Equations
by Juanen Li, Yao Chen and Huanhao Su
Mathematics 2025, 13(19), 3132; https://doi.org/10.3390/math13193132 - 30 Sep 2025
Viewed by 795
Abstract
With the global energy transition and decarbonization goals, tower-type solar thermal power generation is increasingly important for dispatchable clean energy due to its high efficiency, thermal storage capacity, and regulation performance. However, current research focuses on ideal conditions, ignoring real geographical constraints on [...] Read more.
With the global energy transition and decarbonization goals, tower-type solar thermal power generation is increasingly important for dispatchable clean energy due to its high efficiency, thermal storage capacity, and regulation performance. However, current research focuses on ideal conditions, ignoring real geographical constraints on heliostat layout and environmental impacts on receiver performance. More practical scene modeling and performance evaluation methods are urgently needed. To address these issues, we propose a heliostat field simulation algorithm based on heat loss mechanisms and real site characteristics. The algorithm includes optical performance evaluation (cosine efficiency, shading, truncation, atmospheric transmittance) and heat loss mechanisms (radiation, convection, conduction) for realistic net heat output estimation. Experiments revealed the following: (1) higher central towers improve optical efficiency by increasing solar elevation angle; (2) radiation losses dominate at high power and tower height, while convection losses dominate at low power and tower height. Using the Economic-Integrated Score (EIS) optimization algorithm, we achieved optimal tower and receiver configuration with 40.22% average improvement over other configurations (maximum 3.9× improvement). This provides a scientific design basis for improving tower-type solar thermal systems’ adaptability and economy in different geographical environments. Full article
(This article belongs to the Special Issue Advances and Applications in Intelligent Computing)
Show Figures

Figure 1

17 pages, 1472 KB  
Article
Active Distribution Network Bi-Level Programming Model Based on Hybrid Whale Optimization Algorithm
by Hao Guo and Yanbo Che
Sustainability 2025, 17(19), 8560; https://doi.org/10.3390/su17198560 - 24 Sep 2025
Viewed by 453
Abstract
In recent years, the integration of flexible resources into active distribution networks (ADNs) has been significantly enhanced. By coordinating a variety of such resources, the economic efficiency, operational security, and overall stability of ADNs can be improved. In this study, a bi-level planning [...] Read more.
In recent years, the integration of flexible resources into active distribution networks (ADNs) has been significantly enhanced. By coordinating a variety of such resources, the economic efficiency, operational security, and overall stability of ADNs can be improved. In this study, a bi-level planning model is proposed for active distribution networks. The upper-level model aims to minimize the annual comprehensive cost, while the lower-level model focuses on reducing network losses. To solve the upper-level problem, a hybrid whale optimization algorithm (HWOA) is developed. The algorithm integrates adaptive mutation based on Gaussian–Cauchy distributions, a nonlinear cosine-based control strategy, and a dual-population co-evolution mechanism. These enhancements allow HWOA to achieve faster convergence, higher accuracy, and stronger global search capabilities, thereby reducing the risk of falling into local optima. The lower-level problem is addressed using the interior point method due to its nonlinear and continuous nature. The proposed model and algorithm are validated through simulations on the IEEE 33-bus system. The results show that DG consumption increases by 88.77 MWh, network losses decrease by 6.8 MWh, and the total system cost is reduced by CNY 3.62 million over the entire project lifecycle. These improvements contribute to both the economic and operational performance of the ADN. Compared with the polar fox optimization algorithm (PFA), HWOA improves algorithmic efficiency by 18.92%, lowers network loss costs by 6.22%, and reduces the total system costs by 0.71%, demonstrating its superior effectiveness in solving complex bi-level optimization problems in active distribution networks. These findings not only demonstrate the technical efficiency of the proposed method but also contribute to the long-term goals of sustainable energy systems by improving renewable energy utilization, reducing operational losses, and supporting carbon reduction targets in active distribution networks. Full article
Show Figures

Figure 1

39 pages, 1281 KB  
Article
Sustainable Metaheuristic-Based Planning of Rural Medium- Voltage Grids: A Comparative Study of Spanning and Steiner Tree Topologies for Cost-Efficient Electrification
by Lina María Riaño-Enciso, Brandon Cortés-Caicedo, Oscar Danilo Montoya, Luis Fernando Grisales-Noreña and Jesús C. Hernández
Sustainability 2025, 17(18), 8145; https://doi.org/10.3390/su17188145 - 10 Sep 2025
Cited by 1 | Viewed by 656
Abstract
This paper presents a heuristic methodology for the optimal expansion of unbalanced three-phase distribution systems in rural areas, simultaneously addressing feeder routing and conductor sizing to minimize the total annualized cost—defined as the sum of investments in conductors and operational energy losses. The [...] Read more.
This paper presents a heuristic methodology for the optimal expansion of unbalanced three-phase distribution systems in rural areas, simultaneously addressing feeder routing and conductor sizing to minimize the total annualized cost—defined as the sum of investments in conductors and operational energy losses. The planning strategy explores two radial topological models: the Minimum Spanning Tree (MST) and the Steiner Tree (ST). The latter incorporates auxiliary nodes to reduce the total line length. For each topology, an initial conductor sizing is performed based on three-phase power flow calculations using Broyden’s method, capturing the unbalanced nature of the rural networks. These initial solutions are refined via four metaheuristic algorithms—the Chu–Beasley Genetic Algorithm (CBGA), Particle Swarm Optimization (PSO), the Sine–Cosine Algorithm (SCA), and the Grey Wolf Optimizer (GWO)—under a master–slave optimization framework. Numerical experiments on 15-, 25- and 50-node rural test systems show that the ST combined with GWO consistently achieves the lowest total costs—reducing expenditures by up to 70.63% compared to MST configurations—and exhibits superior robustness across all performance metrics, including best-, average-, and worst-case solutions, as well as standard deviation. Beyond its technical contributions, the proposed methodology supports the United Nations Sustainable Development Goals by promoting universal energy access (SDG 7), fostering cost-effective rural infrastructure (SDG 9), and contributing to reductions in urban–rural inequalities in electricity access (SDG 10). All simulations were implemented in MATLAB 2024a, demonstrating the practical viability and scalability of the method for planning rural distribution networks under unbalanced load conditions. Full article
Show Figures

Figure 1

21 pages, 33617 KB  
Article
CycloneWind: A Dynamics-Constrained Deep Learning Model for Tropical Cyclone Wind Field Downscaling Using Satellite Observations
by Yuxiang Hu, Kefeng Deng, Qingguo Su, Di Zhang, Xinjie Shi and Kaijun Ren
Remote Sens. 2025, 17(18), 3134; https://doi.org/10.3390/rs17183134 - 10 Sep 2025
Viewed by 1116
Abstract
Tropical cyclones (TCs) rank among the most destructive natural hazards globally, with core damaging potential originating from regions of intense wind shear and steep wind speed gradients within the eyewall and spiral rainbands. Accurately characterizing these fine-scale structural features is therefore critical for [...] Read more.
Tropical cyclones (TCs) rank among the most destructive natural hazards globally, with core damaging potential originating from regions of intense wind shear and steep wind speed gradients within the eyewall and spiral rainbands. Accurately characterizing these fine-scale structural features is therefore critical for understanding TC intensity evolution, wind hazard distribution, and disaster mitigation. Recently, the deep learning-based downscaling methods have shown significant advantages in efficiently obtaining high-resolution wind field distributions. However, existing methods are mainly used to downscale general wind fields, and research on downscaling extreme wind field events remains limited. There are two main difficulties in downscaling TC wind fields. The first one is that high-quality datasets for TC wind fields are scarce; the other is that general deep learning frameworks lack the ability to capture the dynamic characteristics of TCs. Consequently, this study proposes a novel deep learning framework, CycloneWind, for downscaling TC surface wind fields: (1) a high-quality dataset is constructed by integrating Cyclobs satellite observations with ERA5 reanalysis data, incorporating auxiliary variables like low cloud cover, surface pressure, and top-of-atmosphere incident solar radiation; (2) we propose CycloneWind, a dynamically constrained Transformer-based architecture incorporating three wind field dynamical operators, along with a wind dynamics-constrained loss function formulated to enforce consistency in wind divergence and vorticity; (3) an Adaptive Dynamics-Guided Block (ADGB) is designed to explicitly encode TC rotational dynamics using wind shear detection and wind vortex diffusion operators; (4) Filtering Transformer Layers (FTLs) with high-frequency filtering operators are used for modeling wind field small-scale details. Experimental results demonstrate that CycloneWind successfully achieves an 8-fold spatial resolution reconstruction in TC regions. Compared to the best-performing baseline model, CycloneWind reduces the Root Mean Square Error (RMSE) for the U and V wind components by 9.6% and 4.9%, respectively. More significantly, it achieves substantial improvements of 23.0%, 22.6%, and 20.5% in key dynamical metrics such as divergence difference, vorticity difference, and direction cosine dissimilarity. Full article
Show Figures

Figure 1

26 pages, 3998 KB  
Article
Graph-Symmetry Cognitive Learning for Multi-Scale Cloud Imaging: An Uncertainty-Quantified Geometric Paradigm via Hierarchical Graph Networks
by Qing Xu, Zichen Zhang, Guanfang Wang and Yunjie Chen
Symmetry 2025, 17(9), 1477; https://doi.org/10.3390/sym17091477 - 7 Sep 2025
Viewed by 662
Abstract
Cloud imagery analysis from terrestrial observation points represents a fundamental capability within contemporary atmospheric monitoring infrastructure, serving essential functions in meteorological prediction, climatic surveillance, and hazard alert systems. However, traditional ground-based cloud image segmentation methods have fundamental limitations, particularly their inability to effectively [...] Read more.
Cloud imagery analysis from terrestrial observation points represents a fundamental capability within contemporary atmospheric monitoring infrastructure, serving essential functions in meteorological prediction, climatic surveillance, and hazard alert systems. However, traditional ground-based cloud image segmentation methods have fundamental limitations, particularly their inability to effectively model the graph structure and symmetry in cloud data. To address this, we propose G-CLIP, a ground-based cloud image segmentation method based on graph symmetry. G-CLIP synergistically integrates four innovative modules. First, the Prototype-Driven Asymmetric Attention (PDAA) module is designed to reduce complexity and enhance feature learning by leveraging permutation invariance and graph symmetry principles. Second, the Symmetry-Adaptive Graph Convolution Layer (SAGCL) is constructed, modeling pixels as graph nodes, using cosine similarity to build a sparse discriminative structure, and ensuring stability through symmetry and degree normalization. Third, the Multi-Scale Directional Edge Optimizer (MSDER) is developed to explicitly model complex symmetric relationships in cloud features from a graph theory perspective. Finally, the Uncertainty-Driven Loss Optimizer (UDLO) is proposed to dynamically adjust weights to address foreground–background imbalance and provide uncertainty quantification. Extensive experiments on four benchmark datasets demonstrate that our method achieves state-of-the-art performance across all evaluation metrics. Our work provides a novel theoretical framework and practical solution for applying graph neural networks (GNNs) to meteorology, particularly by integrating graph properties with uncertainty and leveraging symmetries from graph theory for complex spatial modeling. Full article
(This article belongs to the Special Issue Symmetry and Asymmetry Study in Graph Theory)
Show Figures

Figure 1

20 pages, 17025 KB  
Article
SODE-Net: A Slender Rotating Object Detection Network Based on Spatial Orthogonality and Decoupled Encoding
by Xiaozhi Yu, Wei Xiang, Lu Yu, Kang Han and Yuan Yang
Remote Sens. 2025, 17(17), 3042; https://doi.org/10.3390/rs17173042 - 1 Sep 2025
Viewed by 1354
Abstract
Remote sensing objects often exhibit significant scale variations, high aspect ratios, and diverse orientations. The anisotropic spatial distribution of such objects’ features leads to the conflict between feature representation and boundary regression caused by the coupling of different attribute parameters: previous detection methods [...] Read more.
Remote sensing objects often exhibit significant scale variations, high aspect ratios, and diverse orientations. The anisotropic spatial distribution of such objects’ features leads to the conflict between feature representation and boundary regression caused by the coupling of different attribute parameters: previous detection methods based on square-kernel convolution lack the overall perception of large-scale or slender objects due to the limited receptive field; if the receptive field is simply expanded, although more context information can be captured to help object perception, a large amount of background noise will be introduced, resulting in inaccurate feature extraction of remote sensing objects. Additionally, the extracted features face issues of feature conflict and discontinuous loss during parameter regression. Existing methods often neglect the holistic optimization of these aspects. To address these challenges, this paper proposes SODE-Net as a systematic solution. Specifically, we first design a multi-scale fusion and spatially orthogonal convolution (MSSO) module in the backbone network. Its multiple shapes of receptive fields can naturally capture the long-range dependence of the object without introducing too much background noise, thereby extracting more accurate target features. Secondly, we design a multi-level decoupled detection head, which decouples target classification, bounding-box position regression and bounding-box angle regression into three subtasks, effectively avoiding the coupling problem in parameter regression. At the same time, the phase-continuous encoding module is used in the angle regression branch, which converts the periodic angle value into a continuous cosine value, thus ensuring the stability of the loss value. Extensive experiments demonstrate that, compared to existing detection networks, our method achieves superior performance on four widely used remote sensing object datasets: DOTAv1.0, HRSC2016, UCAS-AOD, and DIOR-R. Full article
Show Figures

Figure 1

Back to TopTop