Search Results (688)

Search Parameters:
Keywords = multi-level attention mechanisms

23 pages, 6440 KiB  
Article
A Gravity Data Denoising Method Based on Multi-Scale Attention Mechanism and Physical Constraints Using U-Net
by Bing Liu, Houpu Li, Shaofeng Bian, Chaoliang Zhang, Bing Ji and Yujie Zhang
Appl. Sci. 2025, 15(14), 7956; https://doi.org/10.3390/app15147956 (registering DOI) - 17 Jul 2025
Abstract
Gravity and gravity gradient data serve as fundamental inputs for geophysical resource exploration and geological structure analysis. However, traditional denoising methods—including wavelet transforms, moving averages, and low-pass filtering—exhibit signal loss and limited adaptability under complex, non-stationary noise conditions. To address these challenges, this study proposes an improved U-Net deep learning framework that integrates multi-scale feature extraction and attention mechanisms. Furthermore, a Laplace consistency constraint is introduced into the loss function to enhance denoising performance and physical interpretability. Notably, the datasets used in this study are generated by the authors, involving simulations of subsurface prism distributions with realistic density perturbations (±20% of typical rock densities) and the addition of controlled Gaussian noise (5%, 10%, 15%, and 30%) to simulate field-like conditions, ensuring the diversity and physical relevance of training samples. Experimental validation on these synthetic datasets and real field datasets demonstrates the superiority of the proposed method over conventional techniques. For noise levels of 5%, 10%, 15%, and 30% in test sets, the improved U-Net achieves Peak Signal-to-Noise Ratios (PSNR) of 59.13 dB, 52.03 dB, 48.62 dB, and 48.81 dB, respectively, outperforming wavelet transforms, moving averages, and low-pass filtering by 10–30 dB. In multi-component gravity gradient denoising, our method excels in detail preservation and noise suppression, improving Structural Similarity Index (SSIM) by 15–25%. Field data tests further confirm enhanced identification of key geological anomalies and overall data quality improvement. In summary, the improved U-Net not only delivers quantitative advancements in gravity data denoising but also provides a novel approach for high-precision geophysical data preprocessing. Full article
(This article belongs to the Special Issue Applications of Machine Learning in Earth Sciences—2nd Edition)
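The two quantities this abstract relies on, PSNR and a Laplace consistency term, can be sketched as follows. This is a minimal illustration and not the authors' implementation: the function names, the choice of the largest absolute reference value as the PSNR peak, and the use of the gradient-tensor trace as the Laplace residual are all assumptions. The trace form relies on the fact that, in source-free regions, the gravity potential satisfies Laplace's equation, so the diagonal gradient components sum to zero.

```python
import math


def psnr(reference, estimate, peak=None):
    """Peak signal-to-noise ratio in dB between two equal-length sequences."""
    if len(reference) != len(estimate):
        raise ValueError("inputs must have the same length")
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf
    if peak is None:
        # Assumption: take the largest absolute reference value as the peak.
        peak = max(abs(r) for r in reference)
    return 10.0 * math.log10(peak ** 2 / mse)


def laplace_penalty(txx, tyy, tzz):
    """Mean squared trace of the gradient tensor (hypothetical loss term).

    The value is zero when Txx + Tyy + Tzz = 0 at every sample, i.e. when
    the denoised components are consistent with Laplace's equation.
    """
    return sum((a + b + c) ** 2 for a, b, c in zip(txx, tyy, tzz)) / len(txx)
```

In a training loop, a term like `laplace_penalty` would typically be added to the reconstruction loss with a small weight; the weight is a tuning choice the abstract does not specify.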

24 pages, 1991 KiB  
Article
A Multi-Feature Semantic Fusion Machine Learning Architecture for Detecting Encrypted Malicious Traffic
by Shiyu Tang, Fei Du, Zulong Diao and Wenjun Fan
J. Cybersecur. Priv. 2025, 5(3), 47; https://doi.org/10.3390/jcp5030047 (registering DOI) - 17 Jul 2025
Abstract
With the increasing sophistication of network attacks, machine learning (ML)-based methods have showcased promising performance in attack detection. However, ML-based methods often suffer from high false rates when tackling encrypted malicious traffic. To break through these bottlenecks, we propose EFTransformer, an encrypted flow transformer framework which inherits semantic perception and multi-scale feature fusion, can robustly and efficiently detect encrypted malicious traffic, and make up for the shortcomings of ML in the context of modeling ability and feature adequacy. EFTransformer introduces a channel-level extraction mechanism based on quintuples and a noise-aware clustering strategy to enhance the recognition ability of traffic patterns; adopts a dual-channel embedding method, using Word2Vec and FastText to capture global semantics and subword-level changes; and uses a Transformer-based classifier and attention pooling module to achieve dynamic feature-weighted fusion, thereby improving the robustness and accuracy of malicious traffic detection. Our systematic experiments on the ISCX2012 dataset demonstrate that EFTransformer achieves the best detection performance, with an accuracy of up to 95.26%, a false positive rate (FPR) of 6.19%, and a false negative rate (FNR) of only 5.85%. These results show that EFTransformer achieves high detection performance against encrypted malicious traffic. Full article
(This article belongs to the Section Security Engineering & Applications)

24 pages, 20337 KiB  
Article
MEAC: A Multi-Scale Edge-Aware Convolution Module for Robust Infrared Small-Target Detection
by Jinlong Hu, Tian Zhang and Ming Zhao
Sensors 2025, 25(14), 4442; https://doi.org/10.3390/s25144442 - 16 Jul 2025
Abstract
Infrared small-target detection remains a critical challenge in military reconnaissance, environmental monitoring, forest-fire prevention, and search-and-rescue operations, owing to the targets’ extremely small size, sparse texture, low signal-to-noise ratio, and complex background interference. Traditional convolutional neural networks (CNNs) struggle to detect such weak, low-contrast objects due to their limited receptive fields and insufficient feature extraction capabilities. To overcome these limitations, we propose a Multi-Scale Edge-Aware Convolution (MEAC) module that enhances feature representation for small infrared targets without increasing parameter count or computational cost. Specifically, MEAC fuses (1) original local features, (2) multi-scale context captured via dilated convolutions, and (3) high-contrast edge cues derived from differential Gaussian filters. After fusing these branches, channel and spatial attention mechanisms are applied to adaptively emphasize critical regions, further improving feature discrimination. The MEAC module is fully compatible with standard convolutional layers and can be seamlessly embedded into various network architectures. Extensive experiments on three public infrared small-target datasets (SIRSTD-UAVB, IRSTDv1, and IRSTD-1K) demonstrate that networks augmented with MEAC significantly outperform baseline models using standard convolutions. When compared to eleven mainstream convolution modules (ACmix, AKConv, DRConv, DSConv, LSKConv, MixConv, PConv, ODConv, GConv, and Involution), our method consistently achieves the highest detection accuracy and robustness. Experiments conducted across multiple versions, including YOLOv10, YOLOv11, and YOLOv12, as well as various network levels, demonstrate that the MEAC module achieves stable improvements in performance metrics while slightly increasing computational and parameter complexity. 
These results validate MEAC’s effectiveness in enhancing the detection of small, weak targets and suppressing interference from complex backgrounds, and they highlight its strong generalization ability and practical application potential.
(This article belongs to the Section Sensing and Imaging)

23 pages, 6348 KiB  
Article
A Framework for Predicting Winter Wheat Yield in Northern China with Triple Cross-Attention and Multi-Source Data Fusion
by Shuyan Pan and Liqun Liu
Plants 2025, 14(14), 2206; https://doi.org/10.3390/plants14142206 - 16 Jul 2025
Abstract
Existing yield prediction methods do not fully capture the interaction between multiple factors. To address this, we propose a winter wheat yield prediction framework with triple cross-attention for multi-source data fusion. The framework consists of three modules: a multi-source data processing module, a multi-source feature fusion module, and a yield prediction module. The multi-source data processing module collects satellite, climate, and soil data based on the winter wheat planting range, and constructs a multi-source feature sequence set by combining statistical data. The multi-source feature fusion module first extracts deeper-level feature information based on the characteristics of each data type, and then fuses the multi-source features through a triple cross-attention mechanism. The encoder in the yield prediction module adds a graph attention mechanism, forming a dual branch with the original multi-head self-attention mechanism so that global dependencies are captured while local feature information is better preserved. The decoder generates the final predicted output. The results show that: (1) Using 2021 and 2022 as test sets, the mean absolute error of our method is 385.99 kg/hm² and the root mean squared error is 501.94 kg/hm², both lower than those of other methods. (2) The jointing-heading stage (March to April) is the most crucial period affecting winter wheat production. (3) The model can predict the final winter wheat yield nearly a month in advance.
(This article belongs to the Section Plant Modeling)
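The cross-attention operation that this fusion step builds on can be sketched as below. This shows a single generic scaled dot-product cross-attention pass, where queries come from one data source and keys and values from another; it is not the paper's triple cross-attention design, whose exact wiring the abstract does not describe. The function names and the omission of learned projection matrices are simplifying assumptions.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Scaled dot-product attention with queries from source A and
    keys/values from source B (e.g. satellite features attending to
    climate features). Learned projections are omitted for brevity."""
    d = len(keys[0])
    n_out = len(values[0])
    fused = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        weights = softmax(scores)
        # Weighted average of the value vectors.
        fused.append([sum(w * v[j] for w, v in zip(weights, values)) for j in range(n_out)])
    return fused
```

Each output row is a convex combination of the value vectors, so the result stays within the range of the second source's features while being selected by the first source's queries.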

21 pages, 12122 KiB  
Article
RA3T: An Innovative Region-Aligned 3D Transformer for Self-Supervised Sim-to-Real Adaptation in Low-Altitude UAV Vision
by Xingrao Ma, Jie Xie, Di Shao, Aiting Yao and Chengzu Dong
Electronics 2025, 14(14), 2797; https://doi.org/10.3390/electronics14142797 - 11 Jul 2025
Viewed by 144
Abstract
Low-altitude unmanned aerial vehicle (UAV) vision is critically hindered by the Sim-to-Real Gap, where models trained exclusively on simulation data degrade under real-world variations in lighting, texture, and weather. To address this problem, we propose RA3T (Region-Aligned 3D Transformer), a novel self-supervised framework that enables robust Sim-to-Real adaptation. Specifically, we first develop a dual-branch strategy for self-supervised feature learning, integrating Masked Autoencoders and contrastive learning. This approach extracts domain-invariant representations from unlabeled simulated imagery to enhance robustness against occlusion while reducing annotation dependency. Leveraging these learned features, we then introduce a 3D Transformer fusion module that unifies multi-view RGB and LiDAR point clouds through cross-modal attention. By explicitly modeling spatial layouts and height differentials, this component significantly improves recognition of small and occluded targets in complex low-altitude environments. To address persistent fine-grained domain shifts, we finally design region-level adversarial calibration that deploys local discriminators on partitioned feature maps. This mechanism directly aligns texture, shadow, and illumination discrepancies which challenge conventional global alignment methods. Extensive experiments on UAV benchmarks VisDrone and DOTA demonstrate the effectiveness of RA3T. The framework achieves +5.1% mAP on VisDrone and +7.4% mAP on DOTA over the 2D adversarial baseline, particularly on small objects and sparse occlusions, while maintaining real-time performance of 17 FPS at 1024 × 1024 resolution on an RTX 4080 GPU. Visual analysis confirms that the synergistic integration of 3D geometric encoding and local adversarial alignment effectively mitigates domain gaps caused by uneven illumination and perspective variations, establishing an efficient pathway for simulation-to-reality UAV perception. Full article
(This article belongs to the Special Issue Innovative Technologies and Services for Unmanned Aerial Vehicles)

26 pages, 2178 KiB  
Article
Cross-Modal Fake News Detection Method Based on Multi-Level Fusion Without Evidence
by Ping He, Hanxue Zhang, Shufu Cao and Yali Wu
Algorithms 2025, 18(7), 426; https://doi.org/10.3390/a18070426 - 10 Jul 2025
Viewed by 206
Abstract
Multimodal feature fusion in fake news detection can integrate complementary information from different modalities, but semantic inconsistency between multimodal features makes fusion difficult, and a single fusion step loses information. External evidence can improve detection, yet it arrives with a lag, the reliability and completeness of its sources are hard to guarantee, and it may introduce additional noise that interferes with the model's judgment. We therefore propose a cross-modal fake news detection method (CM-MLF) based on evidence-free multilevel fusion. The method resolves semantic inconsistency through cross-modal alignment, and uses an attention mechanism to fuse text and image features at multiple levels, without other evidential features, to further strengthen the expressive power of the features. Experiments show that the method achieves better detection results on multiple benchmark datasets, improving the accuracy and robustness of cross-modal fake news detection.
(This article belongs to the Special Issue Algorithms for Feature Selection (3rd Edition))

22 pages, 6857 KiB  
Article
Spatio-Temporal Coupling and Forecasting of Construction Industry High-Quality Development and Human Settlements Environmental Suitability in Southern China: Evidence from 15 Provincial Panel Data
by Keliang Chen, Bo Chen and Wanqing Chen
Buildings 2025, 15(14), 2425; https://doi.org/10.3390/buildings15142425 - 10 Jul 2025
Viewed by 127
Abstract
High-quality growth of the construction industry and an improved human settlements environment are essential to sustainable urbanization. Existing studies have paid limited systematic attention to the spatial and temporal dynamics of coordinated development between the construction industry and human settlements, or to the underlying factors driving regional disparities. This gap restricts the formulation of precise, differentiated sustainability policies tailored to regions at different development stages and with varying resource endowments. Southern China, characterized by pronounced spatial heterogeneity and unique development trends, offers a natural laboratory for examining the spatio-temporal interaction between these two dimensions. Using panel data for 15 southern provinces (2013–2022), we applied the entropy method, coupling coordination model, Dagum Gini coefficient, spatial trend surface analysis, gravity model, and grey forecasting to evaluate current conditions and predict future trends. The main findings are as follows. (1) The coupling coordination degree rose steadily, forming a stepped spatial pattern from the southwest through the center to the southeast. (2) The coupling coordination degree shows a clear polarization effect, with a spatial linkage pattern organized around three core clusters: Jiangsu-Shanghai-Zhejiang, Hubei-Hunan-Jiangxi, and Sichuan-Chongqing. (3) The overall Dagum Gini coefficient declined, but intra-regional disparities persisted: values were highest in the southeast, moderate in the center, and lowest in the southwest; inter-regional differences dominated the total inequality. (4) Forecasts for 2023–2027 suggest further improvement in the coupling coordination degree, yet spatial divergence will widen, creating a configuration of “eastern leadership, central catch-up acceleration, and differentiated southwestern development.” This study provides an evidence base for policies that foster high-quality construction sector growth and enhance the living environment. The findings indicate that policymaking should prioritize promoting synergistic regional development, enhancing the radiating and driving role of core regions, and establishing a multi-level coordinated governance mechanism to bridge regional disparities and foster more balanced and sustainable development.
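For readers unfamiliar with the coupling coordination model named in this abstract, the commonly used two-system form can be sketched as follows. The abstract does not state which variant the authors use, so the formula, the function name, and the equal weights `alpha = beta = 0.5` are assumptions; `u1` and `u2` stand for normalized composite scores in [0, 1], such as those an entropy-weighted index would produce.

```python
import math


def coupling_coordination(u1, u2, alpha=0.5, beta=0.5):
    """Two-system coupling coordination degree D = sqrt(C * T).

    C = 2 * sqrt(u1 * u2) / (u1 + u2) measures how balanced the two
    subsystem scores are; T = alpha * u1 + beta * u2 measures their
    overall level. Equal weights are an assumption.
    """
    if u1 < 0 or u2 < 0 or u1 + u2 == 0:
        raise ValueError("scores must be non-negative and not both zero")
    c = 2.0 * math.sqrt(u1 * u2) / (u1 + u2)
    t = alpha * u1 + beta * u2
    return math.sqrt(c * t)
```

With the same average level, a balanced pair such as (0.5, 0.5) scores higher than an unbalanced pair such as (0.2, 0.8), which is the property that makes the measure useful for comparing regions.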

28 pages, 14588 KiB  
Article
CAU2DNet: A Dual-Branch Deep Learning Network and a Dataset for Slum Recognition with Multi-Source Remote Sensing Data
by Xi Lyu, Chenyu Zhang, Lizhi Miao, Xiying Sun, Xinxin Zhou, Xinyi Yue, Zhongchang Sun and Yueyong Pang
Remote Sens. 2025, 17(14), 2359; https://doi.org/10.3390/rs17142359 - 9 Jul 2025
Viewed by 158
Abstract
The efficient and precise identification of urban slums is a significant challenge for urban planning and sustainable development, as their morphological diversity and complex spatial distribution make it difficult to use traditional remote sensing inversion methods. Current deep learning (DL) methods mainly face challenges such as limited receptive fields and insufficient sensitivity to spatial locations when integrating multi-source remote sensing data, and high-quality datasets that integrate multi-spectral and geoscientific indicators to support them are scarce. In response to these issues, this study proposes a DL model (coordinate-attentive U2-DeepLab network [CAU2DNet]) that integrates multi-source remote sensing data. The model integrates the multi-scale feature extraction capability of U2-Net with the global receptive field advantage of DeepLabV3+ through a dual-branch architecture. Thereafter, the spatial semantic perception capability is enhanced by introducing the CoordAttention mechanism, and ConvNextV2 is adopted to optimize the backbone network of the DeepLabV3+ branch, thereby improving the modeling capability of low-resolution geoscientific features. The two branches adopt a decision-level fusion mechanism for feature fusion, which means that the results of each are weighted and summed using learnable weights to obtain the final output feature map. Furthermore, this study constructs the São Paulo slums dataset for model training due to the lack of a multi-spectral slum dataset. This dataset covers 7978 samples of 512 × 512 pixels, integrating high-resolution RGB images, Normalized Difference Vegetation Index (NDVI)/Modified Normalized Difference Water Index (MNDWI) geoscientific indicators, and POI infrastructure data, which can significantly enrich multi-source slum remote sensing data. 
Experiments have shown that CAU2DNet achieves an intersection over union (IoU) of 0.6372 and an F1 score of 77.97% on the São Paulo slums dataset, indicating a significant improvement in accuracy over the baseline model. The ablation experiments verify that the improvements made in this study have resulted in a 16.12% increase in precision. Moreover, CAU2DNet also achieved the best results in all metrics during the cross-domain testing on the WHU building dataset, further confirming the model’s generalizability. Full article
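The decision-level fusion and the two reported metrics can be illustrated with a small sketch. It is not the CAU2DNet code: the function names are hypothetical, and normalizing the two learnable weights with a softmax is an assumption, since the abstract only says the branch outputs are weighted and summed with learnable weights.

```python
import math


def fuse(prob_a, prob_b, logit_a=0.0, logit_b=0.0):
    """Weighted sum of two branches' per-pixel probabilities.

    logit_a and logit_b play the role of learnable parameters; the
    softmax keeps the two weights positive and summing to one.
    """
    ea, eb = math.exp(logit_a), math.exp(logit_b)
    wa, wb = ea / (ea + eb), eb / (ea + eb)
    return [wa * a + wb * b for a, b in zip(prob_a, prob_b)]


def iou_and_f1(pred, truth):
    """Intersection over union and F1 score for binary masks."""
    tp = sum(1 for p, t in zip(pred, truth) if p and t)
    fp = sum(1 for p, t in zip(pred, truth) if p and not t)
    fn = sum(1 for p, t in zip(pred, truth) if not p and t)
    if tp + fp + fn == 0:
        return 1.0, 1.0  # both masks empty: treat as perfect agreement
    return tp / (tp + fp + fn), 2 * tp / (2 * tp + fp + fn)


# Usage: fuse two branch outputs, threshold at 0.5, then score.
fused = fuse([0.9, 0.2, 0.6], [0.7, 0.4, 0.2])
mask = [p >= 0.5 for p in fused]
iou, f1 = iou_and_f1(mask, [True, True, False])
```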

30 pages, 907 KiB  
Article
Evaluating the Impact of Green Manufacturing on Corporate Resilience: A Quasi-Natural Experiment Based on Chinese Green Factories
by Li Long and Hanhan Wang
Sustainability 2025, 17(14), 6281; https://doi.org/10.3390/su17146281 - 9 Jul 2025
Viewed by 188
Abstract
Corporate resilience, a critical metric assessing firms’ capacity to withstand risks, recover rapidly, and maintain growth in dynamic environments, has garnered increasing attention from academia and industry. This study employs China’s Green Factory certification policy within its green manufacturing system as a quasi-natural experiment, utilizing a multi-period difference-in-differences (DID) model to evaluate the impact of green manufacturing implementation on corporate resilience. Results confirm that Green Factory certification significantly enhances firms’ resilience. Mechanism analyses identify three reinforcing pathways: alleviating financing constraints, optimizing resource allocation efficiency, and fostering green technological innovation. Heterogeneity analyses reveal more pronounced effects among heavily polluting industries, firms with low reputations, and those with higher levels of managerial myopia. Furthermore, the certification exhibits significant spillover effects, transmitting resilience improvements to industry peers and geographic clusters. This research expands the theoretical boundaries of corporate resilience literature while offering practical implications and empirical evidence for enterprises undergoing green manufacturing transitions. Full article
(This article belongs to the Special Issue Advances in Business Model Innovation and Corporate Sustainability)

17 pages, 7786 KiB  
Article
Video Coding Based on Ladder Subband Recovery and ResGroup Module
by Libo Wei, Aolin Zhang, Lei Liu, Jun Wang and Shuai Wang
Entropy 2025, 27(7), 734; https://doi.org/10.3390/e27070734 - 8 Jul 2025
Viewed by 240
Abstract
With the rapid development of video encoding technology in the field of computer vision, the demand for tasks such as video frame reconstruction, denoising, and super-resolution has been continuously increasing. However, traditional video encoding methods typically focus on extracting spatial or temporal domain information, often facing challenges of insufficient accuracy and information loss when reconstructing high-frequency details, edges, and textures of images. To address this issue, this paper proposes an innovative LadderConv framework, which combines discrete wavelet transform (DWT) with spatial and channel attention mechanisms. By progressively recovering wavelet subbands, it effectively enhances the video frame encoding quality. Specifically, the LadderConv framework adopts a stepwise recovery approach for wavelet subbands, first processing high-frequency detail subbands with relatively less information, then enhancing the interaction between these subbands, and ultimately synthesizing a high-quality reconstructed image through inverse wavelet transform. Moreover, the framework introduces spatial and channel attention mechanisms, which further strengthen the focus on key regions and channel features, leading to notable improvements in detail restoration and image reconstruction accuracy. To optimize the performance of the LadderConv framework, particularly in detail recovery and high-frequency information extraction tasks, this paper designs an innovative ResGroup module. By using multi-layer convolution operations along with feature map compression and recovery, the ResGroup module enhances the network’s expressive capability and effectively reduces computational complexity. The ResGroup module captures multi-level features from low level to high level and retains rich feature information through residual connections, thus improving the overall reconstruction performance of the model. 
In experiments, the combination of the LadderConv framework and the ResGroup module demonstrates superior performance in video frame reconstruction tasks, particularly in recovering high-frequency information, image clarity, and detail representation. Full article
(This article belongs to the Special Issue Rethinking Representation Learning in the Age of Large Models)

27 pages, 19258 KiB  
Article
A Lightweight Multi-Frequency Feature Fusion Network with Efficient Attention for Breast Tumor Classification in Pathology Images
by Hailong Chen, Qingqing Song and Guantong Chen
Information 2025, 16(7), 579; https://doi.org/10.3390/info16070579 - 6 Jul 2025
Viewed by 308
Abstract
The intricate and complex tumor cell morphology in breast pathology images is a key factor for tumor classification. This paper proposes a lightweight breast tumor classification model with multi-frequency feature fusion (LMFM) to tackle the problem of inadequate feature extraction and poor classification performance. The LMFM utilizes wavelet transform (WT) for multi-frequency feature fusion, integrating high-frequency (HF) tumor details with high-level semantic features to enhance feature representation. The network’s ability to extract irregular tumor characteristics is further reinforced by dynamic adaptive deformable convolution (DADC). The introduction of the token-based Region Focus Module (TRFM) reduces interference from irrelevant background information. At the same time, the incorporation of a linear attention (LA) mechanism lowers the model’s computational complexity and further enhances its global feature extraction capability. The experimental results demonstrate that the proposed model achieves classification accuracies of 98.23% and 97.81% on the BreaKHis and BACH datasets, with only 9.66 M parameters. Full article
(This article belongs to the Section Biomedical Information and Health)

25 pages, 4568 KiB  
Article
Lithium-Ion Battery State of Health Estimation Based on CNN-LSTM-Attention-FVIM Algorithm and Fusion of Multiple Health Features
by Guoju Liu, Zhihui Deng, Yonghong Xu, Lianfeng Lai, Guoqing Gong, Liang Tong, Hongguang Zhang, Yiyang Li, Minghui Gong, Mengxiang Yan and Zheng Ye
Appl. Sci. 2025, 15(13), 7555; https://doi.org/10.3390/app15137555 - 5 Jul 2025
Viewed by 347
Abstract
Lithium-ion batteries play a vital role in human society, so reliably predicting how their State of Health (SOH) degrades, with high accuracy and stability, is of critical significance. This paper proposes a novel SOH prediction method that combines the four-vector intelligent metaheuristic (FVIM) with a CNN-LSTM-Attention base model. The model adopts a collaborative architecture of a convolutional neural network and a time series module, strengthens cross-level feature interaction by introducing a multi-level attention mechanism, and then uses the FVIM algorithm to optimize the key parameters of the overall architecture. By analyzing the charging voltage curve of lithium-ion batteries, highly correlated health factors are extracted, and the correlation between these health factors and battery capacity is verified using two correlation coefficients. After verification on a single NASA battery aging dataset, the model is compared with other models under the same parameters and environmental settings to confirm its prediction precision. In this comparison, CNN-LSTM-Attention-FVIM fitted battery SOH closely, with the mean absolute error (MAE) and root mean square error (RMSE) within 0.99% and 1.33%, respectively, reflecting the model's high generalization ability and prediction performance.

19 pages, 51503 KiB  
Article
LSANet: Lightweight Super Resolution via Large Separable Kernel Attention for Edge Remote Sensing
by Tingting Yong and Xiaofang Liu
Appl. Sci. 2025, 15(13), 7497; https://doi.org/10.3390/app15137497 - 3 Jul 2025
Viewed by 258
Abstract
In recent years, remote sensing imagery has become indispensable for applications such as environmental monitoring, land use classification, and urban planning. However, the physical constraints of satellite imaging systems frequently limit the spatial resolution of these images, impeding the extraction of fine-grained information critical to downstream tasks. Super-resolution (SR) techniques have therefore emerged as a key computational means of enhancing the spatial fidelity of remote sensing images. While deep learning-based SR methods have advanced reconstruction accuracy, their high computational complexity and large parameter counts restrict practical deployment in real-world remote sensing scenarios, particularly on edge or low-power devices. To address this gap, we propose LSANet, a lightweight SR network customized for remote sensing imagery. The core of LSANet is the large separable kernel attention mechanism, which efficiently expands the receptive field while retaining low computational overhead. By integrating this mechanism into an enhanced residual feature distillation module, the network captures long-range dependencies more effectively than traditional shallow residual blocks. Additionally, a residual feature enhancement module, leveraging contrast-aware channel attention and hierarchical skip connections, strengthens the extraction and integration of multi-level discriminative features. This design preserves fine textures and ensures smooth information propagation across the network. Extensive experiments on public datasets such as UC Merced Land Use and NWPU-RESISC45 demonstrate LSANet’s competitive or superior performance compared to state-of-the-art methods. On the UC Merced Land Use dataset, LSANet achieves a PSNR of 34.33 dB, 0.10 dB above the strongest baseline, HSENet (34.23 dB). For SSIM, LSANet reaches 0.9328, closely matching HSENet’s 0.9332, so the two metrics remain well balanced. On the NWPU-RESISC45 dataset, LSANet attains a PSNR of 35.02 dB, marking a significant improvement over prior methods, and an SSIM of 0.9305, maintaining strong competitiveness. These results, combined with the notable reduction in parameters and floating-point operations, highlight the superiority of LSANet in remote sensing image super-resolution tasks.
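The PSNR figures quoted in this abstract follow the standard definition, 10·log10(MAX² / MSE). The sketch below is only an illustration of that definition, not the authors' evaluation code; the function name `psnr`, the `data_range` parameter, and the toy arrays are assumptions introduced here.

```python
import numpy as np

def psnr(reference, estimate, data_range=1.0):
    """Peak signal-to-noise ratio in dB for two equally shaped arrays.

    data_range is the assumed maximum possible pixel value
    (1.0 for images scaled to [0, 1], 255 for 8-bit images).
    """
    reference = np.asarray(reference, dtype=np.float64)
    estimate = np.asarray(estimate, dtype=np.float64)
    mse = np.mean((reference - estimate) ** 2)
    if mse == 0:
        return float("inf")  # identical images: PSNR is unbounded
    return float(10.0 * np.log10(data_range ** 2 / mse))

# Toy check: a constant error of 0.1 on a [0, 1] image gives MSE = 0.01.
reference = np.zeros((8, 8))
estimate = np.full((8, 8), 0.1)
toy_psnr = psnr(reference, estimate)

# Gap between the two UC Merced PSNR values quoted in the abstract.
reported_gap_db = round(34.33 - 34.23, 2)
```

Because PSNR is logarithmic, a 0.10 dB gap corresponds to a reduction in mean squared error of roughly 2%, which is why small dB differences are usually reported alongside SSIM and model size.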

22 pages, 4465 KiB  
Article
Urban Expansion Scenario Prediction Model: Combining Multi-Source Big Data, a Graph Attention Network, a Vector Cellular Automata, and an Agent-Based Model
by Yunqi Gao, Dongya Liu, Xinqi Zheng, Xiaoli Wang and Gang Ai
Remote Sens. 2025, 17(13), 2272; https://doi.org/10.3390/rs17132272 - 2 Jul 2025
Viewed by 241
Abstract
Constructing transition rules is the central difficulty of the cellular automata (CA) model, and mining those rules dynamically allows urban land use change to be simulated more accurately. Introducing a graph attention network (GAT) to mine CA transition rules, on top of a graph structure that is rebuilt dynamically in real time, strengthens the temporal and spatial dynamics of the model. At the same time, adding an agent-based model (ABM) to the CA model allows the evolution of different human decision-making behaviors to be simulated. On this basis, an urban expansion scenario prediction (UESP) model is proposed: (1) the UESP model employs a multi-head attention mechanism to dynamically capture high-order spatial dependencies, supporting the efficient processing of large-scale datasets with over 50,000 points of interest (POIs); (2) it incorporates the behaviors of agents such as residents, governments, and transportation systems to more realistically reflect human micro-level decision-making; and (3) by integrating macro-structural learning with micro-behavioral modeling, it effectively addresses the existing limitations in representing high-order spatial relationships and human decision-making processes in urban expansion simulations. Based on the policy context of the Outline of the Beijing–Tianjin–Hebei (BTH) Coordinated Development Plan, four development scenarios were designed to simulate construction land change by 2030. The results show that (1) the UESP model achieved an overall accuracy of 0.925, a Kappa coefficient of 0.878, and a FoM index of 0.048, outperforming traditional models, with the FoM being 3.5% higher; (2) multi-scenario simulation shows that, under the ecological conservation and farmland protection scenario, forest and grassland increase by 3142 km², cultivated land increases by 896 km², and construction land grows in a concentrated pattern; and (3) the expansion of construction land will mainly occur at the expense of farmland, concentrated around Beijing, Tianjin, Tangshan, Shijiazhuang, and southern core cities in Hebei, forming a “core-driven, axis-extended, and cluster-expanded” spatial pattern.
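For readers less familiar with the accuracy measures reported above, overall accuracy and Cohen's Kappa can both be derived from a confusion matrix of simulated versus observed land use classes. The following is a minimal sketch of the standard formulas only; the function name `accuracy_and_kappa` and the 2×2 toy matrix are hypothetical and are not taken from the article.

```python
import numpy as np

def accuracy_and_kappa(confusion):
    """Return (overall accuracy, Cohen's kappa) for a square confusion matrix.

    Rows are assumed to be observed classes and columns simulated classes;
    kappa is symmetric in that choice.
    """
    confusion = np.asarray(confusion, dtype=np.float64)
    total = confusion.sum()
    # Observed agreement: share of cells on the diagonal.
    observed = np.trace(confusion) / total
    # Chance agreement: product of matching row and column marginals.
    expected = (confusion.sum(axis=0) * confusion.sum(axis=1)).sum() / total ** 2
    kappa = (observed - expected) / (1.0 - expected)
    return float(observed), float(kappa)

# Hypothetical two-class example (e.g. construction land vs. other).
toy_confusion = [[45, 5],
                 [5, 45]]
toy_oa, toy_kappa = accuracy_and_kappa(toy_confusion)
```

Kappa is lower than overall accuracy whenever some agreement is expected by chance, which is consistent with the reported pair of 0.925 and 0.878.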

20 pages, 3602 KiB  
Article
Dust Aerosol Classification in Northwest China Using CALIPSO Data and an Enhanced 1D U-Net Network
by Xin Gong, Delong Xiu, Xiaoling Sun, Ruizhao Zhang, Jiandong Mao, Hu Zhao and Zhimin Rao
Atmosphere 2025, 16(7), 812; https://doi.org/10.3390/atmos16070812 - 2 Jul 2025
Viewed by 230
Abstract
Dust aerosols significantly affect climate and air quality in Northwest China (30–50° N, 70–110° E), where frequent dust storms complicate accurate aerosol classification when using CALIPSO satellite data. This study introduces an Enhanced 1D U-Net model to improve dust aerosol retrieval, incorporating Inception modules for multi-scale feature extraction, Transformer blocks for global contextual modeling, CBAM attention mechanisms for improved feature selection, and residual connections for training stability. Using CALIPSO Level 1B and Level 2 Vertical Feature Mask (VFM) data from 2015 to 2020, the model processed backscatter coefficients, polarization characteristics, and color ratios at 532 nm and 1064 nm to classify aerosol types. The model achieved a precision of 94.11%, recall of 99.88%, and F1 score of 96.91% for dust aerosols, outperforming baseline models. Dust aerosols were predominantly detected between 0.44 and 4 km, consistent with observations from CALIPSO. These results highlight the model’s potential to improve climate modeling and air quality monitoring, providing a scalable framework for future atmospheric research.
(This article belongs to the Section Aerosols)
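The dust-aerosol F1 score reported in the abstract above is the harmonic mean of the reported precision and recall, and the three figures can be checked against each other. The short sketch below does only that arithmetic; the helper name `f1_from_precision_recall` is an assumption introduced for illustration.

```python
def f1_from_precision_recall(precision, recall):
    """Harmonic mean of precision and recall (both given as fractions)."""
    if precision + recall == 0:
        return 0.0  # convention when there are no true positives
    return 2 * precision * recall / (precision + recall)

# Values quoted in the abstract, converted from percentages.
reported_precision = 0.9411
reported_recall = 0.9988
derived_f1 = f1_from_precision_recall(reported_precision, reported_recall)
derived_f1_percent = round(derived_f1 * 100, 2)  # 96.91
```

The derived value matches the reported 96.91%, and the gap between recall and precision indicates that the model misses very few dust layers while labeling some non-dust samples as dust.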