MDPI - Publisher of Open Access Journals

26 pages, 1459 KiB

Open AccessArticle

Sparse Attention-Based Residual Joint Network for Aspect-Category-Based Sentiment Analysis

by Jooan Kim and Hyunyoung Kil

Mathematics 2025, 13(15), 2437; https://doi.org/10.3390/math13152437 - 29 Jul 2025

Viewed by 191

Aspect-based sentiment analysis (ABSA) aims at identifying the sentiment polarity for a particular aspect in a review. ABSA studies based on deep learning models have exploited the attention mechanism to detect aspect-related parts. Conventional softmax-based attention mechanisms generate dense distributions, which may limit [...] Read more.

Aspect-based sentiment analysis (ABSA) aims at identifying the sentiment polarity for a particular aspect in a review. ABSA studies based on deep learning models have exploited the attention mechanism to detect aspect-related parts. Conventional softmax-based attention mechanisms generate dense distributions, which may limit performance in tasks that inherently require sparsity. Recent studies on sparse attention transformation functions have demonstrated their effectiveness over the conventional softmax function. However, these studies primarily focus on highly sparse tasks based on self-attention architectures, leaving their applicability to the ABSA domain unexplored. In addition, most ABSA research has focused on leveraging aspect terms despite the usefulness of aspect categories. To address these issues, we propose a sparse-attention-based residual joint network (SPA-RJ Net) for the aspect-category-based sentiment analysis (ACSA) task. SPA-RJ Net incorporates two aspect-guided sparse attentions—sparse aspect-category attention and sparse aspect-sentiment attention—that introduce sparsity in attention via a sparse distribution transformation function, enabling the model to selectively focus on aspect-related information. In addition, it employs a residual joint learning framework that connects the aspect category detection (ACD) task module and the ACSA task module via residual connections, enabling the ACSA module to receive explicit guidance on relevant aspect categories from the ACD module. Our experiment validates that SPA-RJ Net consistently outperforms existing models, demonstrating the effectiveness of sparse attention and residual joint learning for aspect category-based sentiment classification. Full article

(This article belongs to the Section E1: Mathematics and Computer Science)

► Show Figures

Figure 1

34 pages, 10519 KiB

Open AccessArticle

A Remote Sensing Image Object Detection Model Based on Improved YOLOv11

by Aili Wang, Zhijia Fu, Yanran Zhao and Haisong Chen

Electronics 2025, 14(13), 2607; https://doi.org/10.3390/electronics14132607 - 27 Jun 2025

Viewed by 449

Abstract

Due to the challenges posed by high resolution, substantial background noise, significant object scale variation, and long-tailed data distribution in remote sensing images, traditional techniques often struggle to maintain both high accuracy and low latency. This paper proposes YOLO11-FSDAT, an advanced object detection [...] Read more.

Due to the challenges posed by high resolution, substantial background noise, significant object scale variation, and long-tailed data distribution in remote sensing images, traditional techniques often struggle to maintain both high accuracy and low latency. This paper proposes YOLO11-FSDAT, an advanced object detection framework tailored for remote sensing imagery, which integrates not only modular enhancements but also theoretical and architectural innovations to address these limitations. First, we propose the frequency–spatial feature extraction fusion module (Freq-SpaFEFM), which breaks the conventional paradigm of spatial-domain-dominated feature learning by introducing a multi-branch architecture that fuses frequency- and spatial-domain features in parallel. This design provides a new processing paradigm for multi-scale object detection, particularly enhancing the model’s capability in handling dense and small-object scenarios with complex backgrounds. Second, we introduce the deformable attention-based global–local fusion module (DAGLF), which combines fine-grained local features with global context through deformable attention and residual connections. This enables the model to adaptively capture irregularly oriented objects (e.g., tilted aircraft) and effectively mitigates the issue of information dilution in deep networks. Third, we develop the adaptive threshold focal loss (ATFL), which is the first loss function to systematically address the long-tailed distribution in remote sensing datasets by dynamically adjusting focus based on sample difficulty. Unlike traditional focal loss with fixed hyperparameters, ATFL decouples hard and easy samples and automatically adapts to varying class distributions. Experimental results on the public DOTAv1, SIMD, and DIOR datasets demonstrated that YOLO11-FSDAT achieved 75.22%, 82.79%, and 88.01% mAP, respectively, outperforming baseline YOLOv11n by up to 4.11%. These results confirm the effectiveness, robustness, and broader theoretical value of the proposed framework in addressing key challenges in remote sensing object detection. Full article

(This article belongs to the Special Issue Machine Learning and Computational Intelligence in Remote Sensing)

► Show Figures

Figure 1

21 pages, 7615 KiB

Open AccessArticle

A Glacier Ice Thickness Estimation Method Based on Deep Convolutional Neural Networks

by Zhiqiang Li, Jia Li, Xuyan Ma, Lei Guo, Long Li, Jiahao Dian, Lingshuai Kong and Huiguo Ye

Geosciences 2025, 15(7), 242; https://doi.org/10.3390/geosciences15070242 - 27 Jun 2025

Viewed by 386

Abstract

Ice thickness is a key parameter for glacier mass estimations and glacier dynamics simulations. Multiple physical models have been developed by glaciologists to estimate glacier ice thickness. However, obtaining internal and basal glacier parameters required by physical models is challenging, often leading to [...] Read more.

Ice thickness is a key parameter for glacier mass estimations and glacier dynamics simulations. Multiple physical models have been developed by glaciologists to estimate glacier ice thickness. However, obtaining internal and basal glacier parameters required by physical models is challenging, often leading to simplified models that struggle to capture the nonlinear characteristics of ice flow and resulting in significant uncertainties. To address this, this study proposes a convolutional neural network (CNN)-based deep learning model for glacier ice thickness estimation, named the Coordinate-Attentive Dense Glacier Ice Thickness Estimate Model (CADGITE). Based on in situ ice thickness measurements in the Swiss Alps, a CNN is designed to estimate glacier ice thickness by incorporating a new architecture that includes a Residual Coordinate Attention Block together with a Dense Connected Block, using the distance to glacier boundaries as a complement to inputs that include surface velocity, slope, and hypsometry. Taking ground-penetrating radar (GPR) measurements as a reference, the proposed model achieves a mean absolute deviation (MAD) of 24.28 m and a root mean square error (RMSE) of 37.95 m in Switzerland, outperforming mainstream physical models. When applied to 14 glaciers in High Mountain Asia, the model achieves an MAD of 20.91 m and an RMSE of 27.26 m compared to reference measurements, also exhibiting better performance than mainstream physical models. These comparisons demonstrate the good accuracy and cross-regional transferability of our approach, highlighting the potential of using deep learning-based methods for larger-scale glacier ice thickness estimation. Full article

(This article belongs to the Section Climate and Environment)

► Show Figures

Figure 1

15 pages, 4420 KiB

Open AccessArticle

Single-Pixel Imaging Reconstruction Network with Hybrid Attention and Enhanced U-Net

by Bingrui Xiao, Huibin Wang and Yang Bu

Photonics 2025, 12(6), 607; https://doi.org/10.3390/photonics12060607 - 12 Jun 2025

Viewed by 707

Abstract

Single-pixel imaging has the characteristics of a simple structure and low cost, which means it has potential applications in many fields. This paper proposes an image reconstruction method for single-pixel imaging (SPI) based on deep learning. This method takes the Generative Adversarial Network [...] Read more.

Single-pixel imaging has the characteristics of a simple structure and low cost, which means it has potential applications in many fields. This paper proposes an image reconstruction method for single-pixel imaging (SPI) based on deep learning. This method takes the Generative Adversarial Network (GAN) as the basic architecture, combines the dense residual structure and the deep separable attention mechanism, and reduces the parameters while ensuring the diversity of feature extraction. It also reduces the amount of computation and improves the computational efficiency. In addition, dual-skip connections between the encoder and decoder parts are used to combine the original detailed information with the overall information processed by the network structure. This approach enables a more comprehensive and efficient reconstruction of the target image. Both simulations and experiments have confirmed that the proposed method can effectively reconstruct images at low sampling rates and also achieve good reconstruction results on natural images not seen during training, demonstrating a strong generalization capability. Full article

► Show Figures

Figure 1

19 pages, 9732 KiB

Open AccessArticle

YOLO-MARS: An Enhanced YOLOv8n for Small Object Detection in UAV Aerial Imagery

by Guofeng Zhang, Yanfei Peng and Jincheng Li

Sensors 2025, 25(8), 2534; https://doi.org/10.3390/s25082534 - 17 Apr 2025

Cited by 2 | Viewed by 1033

Abstract

In unmanned aerial vehicle (UAV) aerial imagery scenarios, challenges such as small target size, compact distribution, and mutual occlusion often result in missed detections and false alarms. To address these challenges, this paper introduces YOLO-MARS, a small target recognition model that incorporates a [...] Read more.

In unmanned aerial vehicle (UAV) aerial imagery scenarios, challenges such as small target size, compact distribution, and mutual occlusion often result in missed detections and false alarms. To address these challenges, this paper introduces YOLO-MARS, a small target recognition model that incorporates a multi-level attention residual mechanism. Firstly, an ERAC module is designed to enhance the ability to capture small targets by expanding the feature perception range, incorporating channel attention weight allocation strategies to strengthen the extraction capability for small targets and introducing a residual connection mechanism to improve gradient propagation stability. Secondly, a PD-ASPP structure is proposed, utilizing parallel paths for differentiated feature extraction and incorporating depthwise separable convolutions to reduce computational redundancy, thereby enabling the effective identification of targets at various scales under complex backgrounds. Thirdly, a multi-scale SGCS-FPN fusion architecture is proposed, adding a shallow feature guidance branch to establish cross-level semantic associations, thereby effectively addressing the issue of small target loss in deep networks. Finally, a dynamic WIoU evaluation function is implemented, constructing adaptive penalty terms based on the spatial distribution characteristics of predicted and ground-truth bounding boxes, thereby optimizing the boundary localization accuracy of densely packed small targets from the UAV viewpoint. Experiments conducted on the VisDrone2019 dataset demonstrate that the YOLO-MARS method achieves 40.9% and 23.4% in the mAP50 and mAP50:95 metrics, respectively, representing improvements of 8.1% and 4.3% in detection accuracy compared to the benchmark model YOLOv8n, thus demonstrating its advantages in UAV aerial target detection. Full article

(This article belongs to the Section Sensing and Imaging)

► Show Figures

Figure 1

19 pages, 6402 KiB

Open AccessArticle

The Elitist Non-Dominated Sorting Crisscross Algorithm (Elitist NSCA): Crisscross-Based Multi-Objective Neural Architecture Search

by Zhihui Chen, Ting Lan, Dan He and Zhanchuan Cai

Mathematics 2025, 13(8), 1258; https://doi.org/10.3390/math13081258 - 11 Apr 2025

Viewed by 446

Abstract

In recent years, neural architecture search (NAS) has been proposed for automatically designing neural network architectures, which searches for network architectures that outperform novel human-designed convolutional neural network (CNN) architectures. Related research has always been a hot topic. This paper proposes a multi-objective [...] Read more.

In recent years, neural architecture search (NAS) has been proposed for automatically designing neural network architectures, which searches for network architectures that outperform novel human-designed convolutional neural network (CNN) architectures. Related research has always been a hot topic. This paper proposes a multi-objective evolutionary algorithm called the elitist non-dominated sorting crisscross algorithm (elitist NSCA) and applies it to neural architecture search, which considers two optimization objectives: the accuracy and network parameters. In the algorithm, an innovative search space borrowed from the latest residual block and dense connection is proposed to ensure the quality of the compact architectures. A variable-length crisscross optimization strategy, which creatively iterates the evolution through inter-individual horizontal crossovers and intra-individual vertical crossovers, is employed to simultaneously optimize the microstructure parameters and macroscopic architecture of the CNN. In addition, a corresponding mutation operator is added pertinently based on the performance of the proxy model, and the elitist strategy is improved through pruning to reduce the impact of abnormal fitnesses. The experimental results on multiple datasets show that the proposed algorithm has a higher accuracy and robustness than those of certain state-of-the-art algorithms. Full article

(This article belongs to the Section E1: Mathematics and Computer Science)

► Show Figures

Figure 1

33 pages, 8279 KiB

Open AccessArticle

A Dense Pyramidal Residual Network with a Tandem Spectral–Spatial Attention Mechanism for Hyperspectral Image Classification

by Yunlan Guan, Zixuan Li and Nan Wang

Sensors 2025, 25(6), 1858; https://doi.org/10.3390/s25061858 - 17 Mar 2025

Viewed by 577

Abstract

In recent years, convolutional neural networks (CNNs) have become a potent tool for hyperspectral image classification (HSIC), where classification accuracy, computational cost, and generalization ability are the main focuses. In this study, a novel approach to hyperspectral image classification is proposed. A tandem [...] Read more.

In recent years, convolutional neural networks (CNNs) have become a potent tool for hyperspectral image classification (HSIC), where classification accuracy, computational cost, and generalization ability are the main focuses. In this study, a novel approach to hyperspectral image classification is proposed. A tandem spectral–spatial attention module (TAM) was designed to select significant spectral and spatial features automatically. At the same time, a dense pyramidal residual module (DPRM) with three residual units (RUs) was constructed, where feature maps exhibit linear growth; a dense connection structure was employed between each RU, and a TAM was embedded in each RU. Dilated convolution structures were used in the last two layers of the pyramid network, which can enhance the network’s perception of fine textures and features, improving information transfer efficiency. Tests on four public datasets, namely, the Pavia University, Salinas, TeaFarm, and WHU-Hi-HongHu datasets, were carried out, and the classification accuracies of our method were 99.60%, 99.95%, 99.81%, and 99.84%, respectively. Moreover, the method enhanced the processing speed, especially for large datasets such as WHU-Hi-HongHu. The training time and testing time of one epoch were 53 s and 1.28 s, respectively. Comparative experiments with five methods showed the correctness and high efficiency of our method. Full article

(This article belongs to the Section Remote Sensors)

► Show Figures

Figure 1

17 pages, 7100 KiB

Open AccessArticle

Remote Sensing Techniques with the Use of Deep Learning in the Determining Dynamics of the Illegal Occupation of Rivers and Lakes: A Case Study in the Jinshui River Basin, Wuhan, China

by Laiyin Shen, Yuhong Huang, Chi Zhou and Lihui Wang

Sustainability 2025, 17(3), 996; https://doi.org/10.3390/su17030996 - 26 Jan 2025

Viewed by 739

Abstract

The “Four Illegal Activities”, which involve occupation, extraction, and construction along shorelines, have become significant challenges in river and lake management in recent years. Due to the diverse and scattered nature of monitoring targets, coupled with the large volumes of data involved, traditional [...] Read more.

The “Four Illegal Activities”, which involve occupation, extraction, and construction along shorelines, have become significant challenges in river and lake management in recent years. Due to the diverse and scattered nature of monitoring targets, coupled with the large volumes of data involved, traditional manual inspection methods are no longer sufficient to meet regulatory demands. Late fusion change detection methods in deep learning are particularly effective for monitoring river and lake occupation due to their straightforward principles and processes. However, research on this topic remains limited. To fill this gap, we selected eight popular deep learning networks—VGGNet, ResNet, MobileNet, EfficientNet, DenseNet, Inception-ResNet, SeNet, and DPN—and used the Jinshui River Basin in Wuhan as a case study to explore the application of Siamese networks to monitor river and lake occupation. Our results indicate that the Siamese network based on EfficientNet outperforms all other models. It can be reasonably concluded that the combination of the SE module and residual connections provides an effective approach for improving the performance of deep learning models in monitoring river and lake occupation. Our findings contribute to improving the efficiency of monitoring river and lake occupation, thereby enhancing the effectiveness of water resource and ecological environment protection. In addition, they aid in the development and implementation of efficient strategies for promoting sustainable development. Full article

(This article belongs to the Special Issue Research on Ecological and Environmental Sustainability Based on Remote Sensing and Geographic Information Systems)

► Show Figures

Figure 1

20 pages, 6712 KiB

Open AccessArticle

A Parallel Image Denoising Network Based on Nonparametric Attention and Multiscale Feature Fusion

by Jing Mao, Lianming Sun, Jie Chen and Shunyuan Yu

Sensors 2025, 25(2), 317; https://doi.org/10.3390/s25020317 - 7 Jan 2025

Viewed by 943

Abstract

Convolutional neural networks have achieved excellent results in image denoising; however, there are still some problems: (1) The majority of single-branch models cannot fully exploit the image features and often suffer from the loss of information. (2) Most of the deep CNNs have [...] Read more.

Convolutional neural networks have achieved excellent results in image denoising; however, there are still some problems: (1) The majority of single-branch models cannot fully exploit the image features and often suffer from the loss of information. (2) Most of the deep CNNs have inadequate edge feature extraction and saturated performance problems. To solve these problems, this paper proposes a two-branch convolutional image denoising network based on nonparametric attention and multiscale feature fusion, aiming to improve the denoising performance while better recovering the image edge and texture information. Firstly, ordinary convolutional layers were used to extract shallow features of noise in the image. Then, a combination of two-branch networks with different and complementary structures was used to extract deep features from the noise information in the image to solve the problem of insufficient feature extraction by the single-branch network model. The upper branch network used densely connected blocks to extract local features of the noise in the image. The lower branch network used multiple dilation convolution residual blocks with different dilation rates to increase the receptive field and extend more contextual information to obtain the global features of the noise in the image. It not only solved the problem of insufficient edge feature extraction but also solved the problem of the saturation of deep CNN performance. In this paper, a nonparametric attention mechanism is introduced in the two-branch feature extraction module, which enabled the network to pay attention to and learn the key information in the feature map, and improved the learning performance of the network. The enhanced features were then processed through the multiscale feature fusion module to obtain multiscale image feature information at different depths to obtain more robust fused features. Finally, the shallow features and deep features were summed using a long jump join and were processed through an ordinary convolutional layer and output to obtain a residual image. In this paper, Set12, BSD68, Set5, CBSD68, and SIDD are used as a test dataset to which different intensities of Gaussian white noise were added for testing and compared with several mainstream denoising methods currently available. The experimental results showed that this paper’s algorithm had better objective indexes on all test sets and outperformed the comparison algorithms. The method in this paper not only achieved a good denoising effect but also effectively retained the edge and texture information of the original image. The proposed method provided a new idea for the study of deep neural networks in the field of image denoising. Full article

(This article belongs to the Section Sensing and Imaging)

► Show Figures

Figure 1

20 pages, 3238 KiB

Open AccessArticle

Enhanced Disc Herniation Classification Using Grey Wolf Optimization Based on Hybrid Feature Extraction and Deep Learning Methods

by Yasemin Sarı and Nesrin Aydın Atasoy

Tomography 2025, 11(1), 1; https://doi.org/10.3390/tomography11010001 - 26 Dec 2024

Viewed by 1246

Abstract

Due to the increasing number of people working at computers in professional settings, the incidence of lumbar disc herniation is increasing. Background/Objectives: The early diagnosis and treatment of lumbar disc herniation is much more likely to yield favorable results, allowing the hernia to [...] Read more.

Due to the increasing number of people working at computers in professional settings, the incidence of lumbar disc herniation is increasing. Background/Objectives: The early diagnosis and treatment of lumbar disc herniation is much more likely to yield favorable results, allowing the hernia to be treated before it develops further. The aim of this study was to classify lumbar disc herniations in a computer-aided, fully automated manner using magnetic resonance images (MRIs). Methods: This study presents a hybrid method integrating residual network (ResNet50), grey wolf optimization (GWO), and machine learning classifiers such as multi-layer perceptron (MLP) and support vector machine (SVM) to improve classification performance. The proposed approach begins with feature extraction using ResNet50, a deep convolutional neural network known for its robust feature representation capabilities. ResNet50’s residual connections allow for effective training and high-quality feature extraction from input images. Following feature extraction, the GWO algorithm, inspired by the social hierarchy and hunting behavior of grey wolves, is employed to optimize the feature set by selecting the most relevant features. Finally, the optimized feature set is fed into machine learning classifiers (MLP and SVM) for classification. The use of various activation functions (e.g., ReLU, identity, logistic, and tanh) in MLP and various kernel functions (e.g., linear, rbf, sigmoid, and polynomial) in SVM allows for a thorough evaluation of the classifiers’ performance. Results: The proposed methodology demonstrates significant improvements in metrics such as accuracy, precision, recall, and F1 score, outperforming traditional approaches in several cases. These results highlight the effectiveness of combining deep learning-based feature extraction with optimization and machine learning classifiers. Conclusions: Compared to other methods, such as capsule networks (CapsNet), EfficientNetB6, and DenseNet169, the proposed ResNet50-GWO-SVM approach achieved superior performance across all metrics, including accuracy, precision, recall, and F1 score, demonstrating its robustness and effectiveness in classification tasks. Full article

(This article belongs to the Topic Deep Learning for Medical Image Analysis and Medical Natural Language Processing)

► Show Figures

Figure 1

14 pages, 16241 KiB

Open AccessArticle

Seismic Random Noise Attenuation Using DARE U-Net

by Tara P. Banjade, Cong Zhou, Hui Chen, Hongxing Li, Juzhi Deng, Feng Zhou and Rajan Adhikari

Remote Sens. 2024, 16(21), 4051; https://doi.org/10.3390/rs16214051 - 30 Oct 2024

Cited by 6 | Viewed by 1477

Abstract

Seismic data processing plays a pivotal role in extracting valuable subsurface information for various geophysical applications. However, seismic records often suffer from inherent random noise, which obscures meaningful geological features and reduces the reliability of interpretations. In recent years, deep learning methodologies have [...] Read more.

Seismic data processing plays a pivotal role in extracting valuable subsurface information for various geophysical applications. However, seismic records often suffer from inherent random noise, which obscures meaningful geological features and reduces the reliability of interpretations. In recent years, deep learning methodologies have shown promising results in performing noise attenuation tasks on seismic data. In this research, we propose modifications to the standard U-Net structure by integrating dense and residual connections, which serve as the foundation of our approach named the dense and residual (DARE U-Net) network. Dense connections enhance the receptive field and ensure that information from different scales is considered during the denoising process. Our model implements local residual connections between layers within the encoder, which allows earlier layers to directly connect with deep layers. This promotes the flow of information, allowing the network to utilize filtered and unfiltered input. The combined network mechanisms preserve the spatial information loss during the contraction process so that the decoder can locate the features more accurately by retaining the high-resolution features, enabling precise location in seismic image denoising. We evaluate this adapted architecture by applying synthetic and real data sets and calculating the peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM). The effectiveness of this method is well noted. Full article

(This article belongs to the Special Issue Deep Learning and Multi-modal Data Processing for Geological Environment Remote Sensing Interpretation: Methods, Techniques and Applications)

► Show Figures

Graphical abstract

30 pages, 30880 KiB

Open AccessArticle

Development of a New Non-Destructive Analysis Method in Cultural Heritage with Artificial Intelligence

by Bengin Bilici Genc, Erkan Bostanci, Bekir Eskici, Hakan Erten, Berna Caglar Eryurt, Koray Acici, Didem Ketenoglu and Tunc Asuroglu

Electronics 2024, 13(20), 4039; https://doi.org/10.3390/electronics13204039 - 14 Oct 2024

Cited by 1 | Viewed by 1584

Abstract

Cultural assets are all movable and immovable assets that have been the subject of social life in historical periods, have unique scientific and cultural value, and are located above ground, underground or underwater. Today, the fact that most of the analyses conducted to [...] Read more.

Cultural assets are all movable and immovable assets that have been the subject of social life in historical periods, have unique scientific and cultural value, and are located above ground, underground or underwater. Today, the fact that most of the analyses conducted to understand the technologies of these assets require sampling and that non-destructive methods that allow analysis without taking samples are costly is a problem for cultural heritage workers. In this study, which was prepared to find solutions to national and international problems, it is aimed to develop a non-destructive, cost-minimizing and easy-to-use analysis method. Since this article aimed to develop methodology, the materials were prepared for preliminary research purposes. Therefore, it was limited to four primary colors. These four primary colors were red and yellow ochre, green earth, Egyptian blue and ultramarine blue. These pigments were used with different binders. The produced paints were photographed in natural and artificial light at different light intensities and brought to a 256 × 256 pixel size, and then trained on support vector machine, convolutional neural network, densely connected convolutional network, residual network 50 and visual geometry group 19 models. It was asked whether the trained VGG19 model could classify the paints used in archaeological and artistic works analyzed with instrumental methods in the literature with their real identities. As a result of the test, the model was able to classify paints in artworks from photographs non-destructively with a 99% success rate, similar to the result of the McNemar test. Full article

(This article belongs to the Special Issue Electronics and Computer Science for Cultural Heritage: Advancements, Preservation, and Applications)

► Show Figures

Figure 1

15 pages, 4276 KiB

Open AccessArticle

Spectrum Sensing Method Based on STFT-RADN in Cognitive Radio Networks

by Anyi Wang, Tao Zhu and Qifeng Meng

Sensors 2024, 24(17), 5792; https://doi.org/10.3390/s24175792 - 6 Sep 2024

Cited by 5 | Viewed by 1417

Abstract

To address the common issues in traditional convolutional neural network (CNN)-based spectrum sensing algorithms in cognitive radio networks (CRNs), including inadequate signal feature representation, inefficient utilization of feature map information, and limited feature extraction capabilities due to shallow network structures, this paper proposes [...] Read more.

To address the common issues in traditional convolutional neural network (CNN)-based spectrum sensing algorithms in cognitive radio networks (CRNs), including inadequate signal feature representation, inefficient utilization of feature map information, and limited feature extraction capabilities due to shallow network structures, this paper proposes a spectrum sensing algorithm based on a short-time Fourier transform (STFT) and residual attention dense network (RADN). Specifically, the RADN model improves the basic residual block and introduces the convolutional block attention module (CBAM), combining residual connections and dense connections to form a powerful deep feature extraction structure known as residual in dense (RID). This significantly enhances the network’s feature extraction capabilities. By performing STFT on the received signals and normalizing them, the signals are converted into time–frequency spectrograms as network inputs, better capturing signal features. The RADN is trained to extract abstract features from the time–frequency images, and the trained RADN serves as the final classifier for spectrum sensing. Experimental results demonstrate that the STFT-RADN spectrum sensing method significantly improves performance under low signal-to-noise ratio (SNR) conditions compared to traditional deep-learning-based methods. This method not only adapts to various modulation schemes but also exhibits high detection probability and strong robustness. Full article

(This article belongs to the Special Issue Sensors for Enabling Wireless Spectrum Access)

► Show Figures

Figure 1

17 pages, 8056 KiB

Open AccessArticle

A Multi-Organ Segmentation Network Based on Densely Connected RL-Unet

by Qirui Zhang, Bing Xu, Hu Liu, Yu Zhang and Zhiqiang Yu

Appl. Sci. 2024, 14(17), 7953; https://doi.org/10.3390/app14177953 - 6 Sep 2024

Cited by 1 | Viewed by 1615

Abstract

The convolutional neural network (CNN) has been widely applied in medical image segmentation due to its outstanding nonlinear expression ability. However, applications of CNN are often limited by the receptive field, preventing it from modeling global dependencies. The recently proposed transformer architecture, which [...] Read more.

The convolutional neural network (CNN) has been widely applied in medical image segmentation due to its outstanding nonlinear expression ability. However, applications of CNN are often limited by the receptive field, preventing it from modeling global dependencies. The recently proposed transformer architecture, which uses a self-attention mechanism to model global context relationships, has achieved promising results. Swin-Unet is a Unet-like simple transformer semantic segmentation network that combines the dominant feature of both the transformer and Unet. Even so, Swin-Unet has some limitations, such as only learning single-scale contextual features, and it lacks inductive bias and effective multi-scale feature selection for processing local information. To solve these problems, the Residual Local induction bias-Unet (RL-Unet) algorithm is proposed in this paper. First, the algorithm introduces a local induction bias module into the RLSwin-Transformer module and changes the multi-layer perceptron (MLP) into a residual multi-layer perceptron (Res-MLP) module to model local and remote dependencies more effectively and reduce feature loss. Second, a new densely connected double up-sampling module is designed, which can further integrate multi-scale features and improve the segmentation accuracy of the target region. Third, a novel loss function is proposed that can significantly enhance the performance of multiple scales segmentation and the segmentation results for small targets. Finally, experiments were conducted using four datasets: Synapse, BraTS2021, ACDC, and BUSI. The results show that the performance of RL-Unet is better than that of Unet, Swin-Unet, R2U-Net, Attention-Unet, and other algorithms. Compared with them, RL-Unet produces significantly a lower Hausdorff Distance at 95% threshold (HD95) and comparable Dice Similarity Coefficient (DSC) results. Additionally, it exhibits higher accuracy in segmenting small targets. Full article

► Show Figures

Figure 1

18 pages, 5155 KiB

Open AccessArticle

Evaluation of Fracturing Effect of Tight Reservoirs Based on Deep Learning

by Ankang Feng, Yuxin Ke and Chuang Hei

Sensors 2024, 24(17), 5775; https://doi.org/10.3390/s24175775 - 5 Sep 2024

Cited by 1 | Viewed by 1174

Abstract

The utilization of hydraulic fracturing technology is indispensable for unlocking the potential of tight oil and gas reservoirs. Understanding and accurately evaluating the impact of fracturing is pivotal in maximizing oil and gas production and optimizing wellbore performance. Currently, evaluation methods based on [...] Read more.

The utilization of hydraulic fracturing technology is indispensable for unlocking the potential of tight oil and gas reservoirs. Understanding and accurately evaluating the impact of fracturing is pivotal in maximizing oil and gas production and optimizing wellbore performance. Currently, evaluation methods based on acoustic logging, such as orthogonal dipole anisotropy and radial tomography imaging, are widely used. However, when the fractures generated by hydraulic fracturing form a network-like pattern, orthogonal dipole anisotropy fails to accurately assess the fracturing effects. Radial tomography imaging can address this issue, but it is challenged by high manpower and time costs. This study aims to develop a more efficient and accurate method for evaluating fracturing effects in tight reservoirs using deep learning techniques. Specifically, the method utilizes dipole array acoustic logging curves recorded before and after fracturing. Manual labeling was conducted by integrating logging data interpretation results. An improved WGAN-GP was employed to generate adversarial samples for data augmentation, and fracturing effect evaluation was implemented using SE-ResNet, ResNet, and DenseNet. The experimental results demonstrated that ResNet with residual connections is more suitable for the dataset in this study, achieving higher accuracy in fracturing effect evaluation. The inclusion of the SE module further enhanced model accuracy by adaptively adjusting the weights of feature map channels, with the highest accuracy reaching 99.75%. Full article

(This article belongs to the Section Sensing and Imaging)

► Show Figures

Figure 1

Search Results (99)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (99)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI