Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (1,841)

Search Parameters:
Keywords = convolution product

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
23 pages, 7288 KB  
Article
ECA-RepNet: A Lightweight Coal–Rock Recognition Network Using Recurrence Plot Transformation
by Jianping Zhou, Zhixin Jin, Hongwei Wang, Wenyan Cao, Xipeng Gu, Qingyu Kong, Jianzhong Li and Zeping Liu
Information 2026, 17(2), 140; https://doi.org/10.3390/info17020140 (registering DOI) - 1 Feb 2026
Abstract
Coal and rock recognition is one of the key technologies in mining production, but traditional methods have limitations such as single-feature representation dimension, insufficient robustness, and unbalanced performance in lightweight design under noise interference and complex feature conditions. To address these issues, an [...] Read more.
Coal and rock recognition is one of the key technologies in mining production, but traditional methods have limitations such as single-feature representation dimension, insufficient robustness, and unbalanced performance in lightweight design under noise interference and complex feature conditions. To address these issues, an Efficient Channel Attention Reparameterized Network (ECA-RepNet) based on recurrence plot and Efficient Channel Attention mechanism is proposed. The one-dimensional vibration signal is mapped to the two-dimensional image space through a recurrence plot (RP), which retains the dynamic characteristics of the time series while capturing the complex patterns in the signal. Multi-scale feature extraction and lightweight design are achieved through the reparameterized large kernel block (RepLK Block) and the depthwise separable convolution (DSConv) module. The ECA module is introduced to embed multiple convolutional layers. Through global average pooling, one-dimensional convolution, and dynamic weight allocation, the modeling ability of inter-channel dependencies is enhanced, the model robustness is improved, and the computational overhead is reduced. Experimental results demonstrate that the ECA-RepNet model achieves 97.33% accuracy, outperforming classic models including ResNet, CNN, and MobileNet in parameter efficiency, training time, and inference speed. Full article
Show Figures

Graphical abstract

28 pages, 7980 KB  
Article
Smart Predictive Maintenance: A TCN-Based System for Early Fault Detection in Industrial Machinery
by Abuzar Khan, Ahmad Junaid, Muhammad Farooq Siddique, Abid Iqbal, Husam S. Samkari, Mohammed F. Allehyani and Ghassan Husnain
Machines 2026, 14(2), 164; https://doi.org/10.3390/machines14020164 (registering DOI) - 1 Feb 2026
Abstract
Modern factories still struggle with unexpected machine failures because traditional maintenance systems depend on fixed rules and threshold-based alerts. These older approaches often overlook subtle or complex patterns in multimodal sensor data, causing them to miss early signs of wear and leading to [...] Read more.
Modern factories still struggle with unexpected machine failures because traditional maintenance systems depend on fixed rules and threshold-based alerts. These older approaches often overlook subtle or complex patterns in multimodal sensor data, causing them to miss early signs of wear and leading to late or incorrect maintenance decisions. As a result, production can slow down, costs increase and equipment reliability suffers. To address this challenge, this study introduces a smart and interpretable fault diagnosis and predictive maintenance framework designed to detect wear, degradation and potential failures before they disrupt operations. The proposed framework integrates multiscale feature extraction, multimodal sensor fusion and cross-sensor correlation analysis with advanced temporal modeling using a Temporal Convolutional Network (TCN). By jointly performing tool-health classification and Remaining Useful Life (RUL) estimation, the framework provides a comprehensive assessment of machine condition. When evaluated on the NASA Ames milling dataset, the model achieved an overall accuracy of 86%, correctly classifying healthy and failed tools in more than 88% of cases and worn tools in over 75%, demonstrating consistent performance across different stages of tool wear. Explainable artificial intelligence (XAI) techniques, including attention-based visualizations and SHAP-based feature attribution, reveal that electrical and vibration signals are the most influential early indicators of tool degradation. The proposed framework exhibits low computational latency and minimal memory requirements, making it suitable for real-time fault diagnosis and deployment on industrial edge devices. Overall, the framework balances predictive accuracy, interpretability and practical applicability, enabling proactive and reliable maintenance decisions that enhance machine uptime and support efficient smart manufacturing operations. Full article
Show Figures

Figure 1

22 pages, 7120 KB  
Article
Enhancing Cross-Species Prediction of Leaf Mass per Area from Hyperspectral Remote Sensing Using Fractional Order Derivatives and 1D-CNNs
by Shijie Shan, Qiaozhen Guo, Lu Xu, Weiguo Jiang, Shuo Shi and Yiyun Chen
Remote Sens. 2026, 18(3), 444; https://doi.org/10.3390/rs18030444 (registering DOI) - 1 Feb 2026
Abstract
Leaf mass per area (LMA) plays an important role in vegetation productivity, carbon cycling, and remote sensing-based ecosystem monitoring. However, remotely predicting LMA from hyperspectral reflectance remains challenging due to the weak and strongly overlapping spectral response of LMA and spectral variability across [...] Read more.
Leaf mass per area (LMA) plays an important role in vegetation productivity, carbon cycling, and remote sensing-based ecosystem monitoring. However, remotely predicting LMA from hyperspectral reflectance remains challenging due to the weak and strongly overlapping spectral response of LMA and spectral variability across species. To address these limitations, this study proposed an integrated framework that combines a fractional-order spectral derivative (FOD) with a one-dimensional convolutional neural network (1D-CNN) to enhance LMA prediction accuracy and cross-species generalization. Leaf hyperspectral reflectance was processed using FOD with 0–2 orders, and the relationship between FOD-enhanced spectra and LMA was analyzed. Model performance was assessed using (i) overall prediction accuracy by an 8:2 random split between training and test sets, and (ii) cross-species generalization through leave-one-species-out validation. The results demonstrated that the 1D-CNN using a 1.5-order derivative achieved the best performance (R2 = 0.85; RMSE = 11.57 g/m2), outperforming common machine-learning models including partial least squares regression (PLSR), random forest (RF), and support vector regression (SVR). The proposed method also demonstrated great generalization in cross-species prediction. These results indicate that integrating FOD with 1D-CNN effectively enhances LMA-related spectral information and improves LMA prediction across various species. It provides a promising pathway for applying airborne and satellite hyperspectral images in vegetation biochemical parameter mapping, crop monitoring, and ecological assessment. Full article
Show Figures

Figure 1

18 pages, 4025 KB  
Article
A Pre-Activated Residual Parallel Convolutional Block-Based BiGRU Model for Remaining Useful Life Prediction
by Yifan Sun, Qiuyang Zhou and Yu Xia
Machines 2026, 14(2), 159; https://doi.org/10.3390/machines14020159 - 30 Jan 2026
Abstract
The accurate prediction of the Remaining Useful Life (RUL) of key mechanical equipment in modern industry is crucial for reducing production risks and optimizing maintenance decisions. However, existing Convolutional Neural Network (CNN)-based models lack an inherent memory mechanism, and single convolutional kernel-based CNN [...] Read more.
The accurate prediction of the Remaining Useful Life (RUL) of key mechanical equipment in modern industry is crucial for reducing production risks and optimizing maintenance decisions. However, existing Convolutional Neural Network (CNN)-based models lack an inherent memory mechanism, and single convolutional kernel-based CNN models fail to capture multi-scale temporal features effectively. Moreover, some existing methods fail to account for the stability of the model training process, which tends to result in prolonged training time and an elevated risk of overfitting. To overcome these problems, a pre-activated residual parallel convolutional block-based BiGRU model (PRPC-BiGRU) is proposed in this study. First, the residual parallel convolutional block (RPCB) is constructed to simultaneously extract multi-scale temporal features. Subsequently, the pre-activated convolutional structure, which applies normalization and activation function prior to convolution operations, is utilized to improve gradient propagation and training stability. Finally, experimental results using the aero-engine benchmark datasets to verify the effectiveness and superior prediction performance of the proposed PRPC-BiGRU model. Full article
(This article belongs to the Special Issue Intelligent Predictive Maintenance and Machine Condition Monitoring)
27 pages, 7975 KB  
Article
Identification and Prediction of the Invasion Pattern of the Mikania micrantha with WaveEdgeNet Model Using UAV-Based Images in Shenzhen
by Hui Lin, Yang Yin, Xiaofen He, Jiangping Long, Tingchen Zhang, Zilin Ye and Xiaojia Deng
Remote Sens. 2026, 18(3), 437; https://doi.org/10.3390/rs18030437 - 30 Jan 2026
Viewed by 30
Abstract
Mikania micrantha is one of the most detrimental invasive plant species in the southeastern coastal region of China. To accurately predict the invasion pattern of Mikania micrantha and offer guidance for production practices, it is essential to determine its precise location and the [...] Read more.
Mikania micrantha is one of the most detrimental invasive plant species in the southeastern coastal region of China. To accurately predict the invasion pattern of Mikania micrantha and offer guidance for production practices, it is essential to determine its precise location and the driving factors. Therefore, a design of the wavelet convolution and dynamic feature fusion module was studied, and WaveEdgeNet was proposed. This model has the abilities to deeply extract image semantic features, retain features, perform multi-scale segmentation, and conduct fusion. Moreover, to quantify the impact of human and natural factors, we developed a novel proximity factor based on land use data. Additionally, a new feature selection framework was applied to identify driving factors by analyzing the relationships between environmental variables and Mikania micrantha. Finally, the MaxEnt model was utilized to forecast its potential future habitats. The results demonstrate that WaveEdgeNet effectively extracts image features and improves model performance, attaining an MIoU of 85% and an overall accuracy of 98.62%, outperforming existing models. Spatial analysis shows that the invaded area in 2024 was smaller than that in 2023, indicating that human intervention measures have achieved some success. Furthermore, the feature selection framework not only enhances MaxEnt’s accuracy but also cuts down computational time by 82.61%. According to MaxEnt modeling, human disturbance, proximity to forests, distance from roads, and elevation are recognized as the primary factors. In the future, we will concentrate on overcoming the seasonal limitations and attaining the objective of predicting the growth and reproduction of kudzu before they happen, which can offer a foundation for manual intervention. This study lays a solid technical foundation and offers comprehensive data support for comprehending the species’ dispersal patterns and driving factors and for guiding environmental conservation. Full article
(This article belongs to the Section Forest Remote Sensing)
Show Figures

Figure 1

15 pages, 6693 KB  
Article
Bridging the Time-Space Scale Gap: A Physics-Informed UAV Upscaling Framework for Radiometric Validation of Microsatellite Constellations in Heterogeneous Built Environments
by Seung-Hwan Go, Dong-Ho Lee, Won-Ki Jo and Jong-Hwa Park
Drones 2026, 10(2), 99; https://doi.org/10.3390/drones10020099 - 30 Jan 2026
Viewed by 26
Abstract
The exponential rise in microsatellite constellations offers unprecedented temporal resolution for urban monitoring. However, ensuring the radiometric integrity of these sensors over heterogeneous built environments remains a critical challenge due to low signal-to-noise ratios and spectral uncertainties. Traditional vicarious calibration relies on homogeneous [...] Read more.
The exponential rise in microsatellite constellations offers unprecedented temporal resolution for urban monitoring. However, ensuring the radiometric integrity of these sensors over heterogeneous built environments remains a critical challenge due to low signal-to-noise ratios and spectral uncertainties. Traditional vicarious calibration relies on homogeneous pseudo-invariant calibration sites (PICS) in deserts, which fail to represent the spectral complexity and adjacency effects of urban landscapes. This study presents a novel triple-platform validation framework integrating ground (Hyperspectral), UAV (Multispectral), and satellite (Sentinel-2) data to bridge the “Point-to-Pixel” scale gap. We introduce a physics-informed “Double Calibration” protocol—combining the empirical line method with spectral response function convolution—and a block kriging spatial upscaling technique to mathematically model intra-pixel heterogeneity. Results from a 2025 campaign in a complex urban environment (Cheongju, Republic of Korea) demonstrate that simple point-averaging introduces significant representation errors (R20.46 with time lag). In contrast, our UAV-based block kriging approach recovered high correlations even with a 1-day time lag and dramatically improved the coefficient of determination (R2) under simultaneous acquisition conditions: from 0.68 to 0.92 in the blue band and to 0.96 in the NIR band. Furthermore, quantitative spatial analysis identified artificial grass as the most stable “Urban PICS” (σ0.020), whereas asphalt exhibited unexpected high spatial heterogeneity (σ> 0.09) due to surface aging and challenging conventional assumptions. This framework establishes a rigorous, scalable standard for validating “New Space” data products in complex urban domains. Full article
Show Figures

Figure 1

24 pages, 1520 KB  
Article
A Lightweight YOLO-PEGA-Based Method for Quantifying Fish Feeding Intensity
by Xinyu Ai, Shengmao Zhang, Shenglong Yang, Ai Guo, Zuli Wu, Xiumei Fan, Yumei Wu and Yongchuang Shi
Animals 2026, 16(3), 432; https://doi.org/10.3390/ani16030432 - 29 Jan 2026
Viewed by 99
Abstract
In aquaculture production, manual or fixed-schedule feeding often fails to match the real-time feeding level of fish schools, and overfeeding can lead to feed wastage and water-quality deterioration, which has become a major bottleneck for both large-scale farming efficiency and environmental sustainability. During [...] Read more.
In aquaculture production, manual or fixed-schedule feeding often fails to match the real-time feeding level of fish schools, and overfeeding can lead to feed wastage and water-quality deterioration, which has become a major bottleneck for both large-scale farming efficiency and environmental sustainability. During feeding, intense competition and jumping behaviors generate splashes of varying magnitudes, which can serve as an indirect visual proxy for hunger intensity. In this study, we constructed a frame-level splash-annotated dataset and performed data preprocessing. Building upon YOLO11 pretrained weights, we introduced a P2–P5 four-scale detection head to enhance small-splash recognition, injected EGMA into the backbone C3k2 blocks, and replaced stride-2 downsampling convolutions with a three-branch ADown operator. On the validation set, the proposed YOLO11-PEGA achieved a precision of 0.86 and a recall of 0.80, with mAP@0.5 exceeding 0.80 and mAP@0.5–0.95 exceeding 0.30. Compared with the baseline model, the parameter count was reduced by 72.3%. The results demonstrate that the proposed model maintains stable detection and evaluation performance under complex environmental conditions, providing actionable decision support for feeding-threshold setting, feeding-time determination, and feed-amount adjustment. Full article
21 pages, 3624 KB  
Article
Multi-Scale Feature Fusion and Attention-Enhanced R2U-Net for Dynamic Weight Monitoring of Chicken Carcasses
by Tian Hua, Pengfei Zou, Ao Zhang, Runhao Chen, Hao Bai, Wenming Zhao, Qian Fan and Guobin Chang
Animals 2026, 16(3), 410; https://doi.org/10.3390/ani16030410 - 28 Jan 2026
Viewed by 151
Abstract
In recent years, real-time monitoring of broiler chicken weight has become crucial for assessing growth and health status. Currently, obtaining weight data often relies on manual collection. However, this process is cumbersome, labor-intensive, and inefficient. This paper proposes a broiler carcass weight detection [...] Read more.
In recent years, real-time monitoring of broiler chicken weight has become crucial for assessing growth and health status. Currently, obtaining weight data often relies on manual collection. However, this process is cumbersome, labor-intensive, and inefficient. This paper proposes a broiler carcass weight detection model based on deep learning image segmentation and regression to address these issues. The model first segments broiler carcasses and then uses the pixel area of the segmented region as a key feature for a regression model to predict weight. A custom dataset comprising 2709 images from 301 Taihu yellow chickens was established for this study. A novel segmentation network, AR2U-AtNet, derived from R2U-Net, is proposed. To mitigate the interference of background color and texture on target carcasses in slaughterhouse production lines, the Convolutional Block Attention Module (CBAM) is introduced to enable the network to focus on areas containing carcasses. Furthermore, broilers exhibit significant variations in size, morphology, and posture, which impose high demands on the model’s scale adaptability. Selective Kernel Attention (SKAttention) is therefore integrated to flexibly handle broiler images with diverse body conditions. The model achieved an average Intersection over Union (mIoU) score of 90.45%, and Dice and F1 scores of 95.18%. The regression-based weight prediction achieved an R2 value of 0.9324. The results demonstrate that the proposed method can quickly and accurately determine individual broiler carcass weights, thereby alleviating the burden of traditional weighing methods and ultimately improving the production efficiency of yellow-feather broilers. Full article
(This article belongs to the Section Poultry)
Show Figures

Figure 1

24 pages, 17827 KB  
Article
Region-Based Hyperspectral Imaging and Lightweight CNN Model for Nondestructive Prediction of Soluble Solid Content in Strawberries
by Seung-Woo Chun, Hong-Gu Lee, Jeong-Eun Lee, Woo-Hyeong Yu, In Geun Hwang and Changyeun Mo
Agriculture 2026, 16(3), 321; https://doi.org/10.3390/agriculture16030321 - 28 Jan 2026
Viewed by 107
Abstract
Soluble solid content (SSC) is a key indicator of strawberry quality. Conventional SSC measurement methods are destructive and impractical for large-scale applications. Therefore, this study developed a region-based hyperspectral imaging (HSI) and lightweight one-dimensional convolutional neural network (1D CNN) framework for nondestructive SSC [...] Read more.
Soluble solid content (SSC) is a key indicator of strawberry quality. Conventional SSC measurement methods are destructive and impractical for large-scale applications. Therefore, this study developed a region-based hyperspectral imaging (HSI) and lightweight one-dimensional convolutional neural network (1D CNN) framework for nondestructive SSC prediction in strawberries. To evaluate spatial effects on predictive accuracy, the fruit surface was segmented into five groups (G1–G5). Three spectral preprocessing methods were applied with partial least squares regression and five convolutional neural network (CNN) architectures, including a simplified VGG-CNN. Larger regions generally improved prediction performance; however, the 50% region (G2) and 75% region (G3) achieved comparable performance to the full region, reducing data requirements. The simplified VGG-CNN model with SNV outperformed other models, exhibiting high accuracy with reduced computational cost, supporting its potential integration into portable and real-time sensing systems. The proposed approach can contribute to improved postharvest quality control and enhanced consumer confidence in strawberry products. Full article
(This article belongs to the Section Agricultural Product Quality and Safety)
Show Figures

Figure 1

23 pages, 2976 KB  
Article
Transfer Learning-Based Piezoelectric Actuators Feedforward Control with GRU-CNN
by Yaqian Hu, Herong Jin, Xiangcheng Chu and Yali Yi
Appl. Sci. 2026, 16(3), 1305; https://doi.org/10.3390/app16031305 - 27 Jan 2026
Viewed by 140
Abstract
To compensate for hysteresis, low damping vibration, and their coupling effects, this paper proposes a gated recurrent unit and convolutional neural network (GRU-CNN) model as a feedforward control model that maps desired displacement trajectories to driving voltages. The GRU-CNN integrates a gated recurrent [...] Read more.
To compensate for hysteresis, low damping vibration, and their coupling effects, this paper proposes a gated recurrent unit and convolutional neural network (GRU-CNN) model as a feedforward control model that maps desired displacement trajectories to driving voltages. The GRU-CNN integrates a gated recurrent unit (GRU) layer to capture long-term temporal dependencies, a multi-layer convolutional neural network (CNN) to extract local data features, and residual connections to mitigate information distortion. The GRU-CNN is then combined with transfer learning (TL) for feedforward control of cross-batch and cross-type piezoelectric actuators (PEAs), so as to reduce reliance on training datasets. The analysis focuses on the impacts of target PEA data volume and source-target similarity on transfer learning strategies. The GRU-CNN trained on PEA #1 achieves high control accuracy, with a mean absolute error (MAE) of 0.077, a root mean square error (RMSE) of 0.129, and a coefficient of determination (R2) of 0.997. When transferred to cross-batch PEA #2 and cross-type PEA #3, the GRU-CNN feedforward controller still delivers favorable performance; R2 values all exceed 0.98, representing at least a 27% improvement compared to training from scratch. These results indicate that the proposed transfer learning-based feedforward control method can effectively reduce retraining effort, suggesting its potential applicability to batch production scenarios. Full article
19 pages, 1811 KB  
Article
Defective Wheat Kernel Recognition Using EfficientNet with Attention Mechanism and Multi-Binary Classification
by Duolin Wang, Jizhong Li, Han Gong and Jianyi Chen
Appl. Sci. 2026, 16(3), 1247; https://doi.org/10.3390/app16031247 - 26 Jan 2026
Viewed by 108
Abstract
As a globally significant food crop, the assessment of wheat quality is essential for ensuring food security and enhancing the processing quality of agricultural products. Conventional methods for assessing wheat kernel quality are often inefficient and markedly subjective, which hampers their ability to [...] Read more.
As a globally significant food crop, the assessment of wheat quality is essential for ensuring food security and enhancing the processing quality of agricultural products. Conventional methods for assessing wheat kernel quality are often inefficient and markedly subjective, which hampers their ability to accurately distinguish the complex and diverse phenotypic characteristics of wheat kernels. To tackle the aforementioned issues, this study presents an enhanced recognition method for defective wheat kernels, based on the EfficientNet-B1 architecture. Building upon the original EfficientNet-B1 network structure, this approach incorporates the lightweight attention mechanism known as CBAM (Convolutional Block Attention Module) to augment the model’s capacity to discern features in critical regions. Simultaneously, it modifies the classification head structure to facilitate better alignment with the data, thereby enhancing accuracy. The experiment employs a self-constructed dataset comprising five categories of wheat kernels—perfect wheat kernels, insect-damaged wheat kernels, scab-damaged wheat kernels, moldy wheat kernels, and black germ wheat kernels—which are utilized for training and validation purposes. The results indicate that the enhanced model attains a classification accuracy of 99.80% on the test set, reflecting an increase of 2.6% compared to its performance prior to the enhancement. Furthermore, the Precision, Recall, and F1-score all demonstrated significant improvements. The proposed model achieves near-perfect performance on several categories under controlled experimental conditions, with particularly high precision and recall for scab-damaged and insect-damaged kernels. This study demonstrates the efficacy of the enhanced EfficientNet-B1 model in the recognition of defective wheat kernels and offers novel technical insights and methodological references for intelligent wheat quality assessment. Full article
(This article belongs to the Section Agricultural Science and Technology)
Show Figures

Figure 1

18 pages, 2599 KB  
Article
C-ViT: An Improved ViT Model for Multi-Label Classification of Bamboo Chopstick Defects
by Waizhong Wang, Wei Peng, Liancheng Zeng, Yue Shen, Chaoyun Zhu and Yingchun Kuang
Sensors 2026, 26(3), 812; https://doi.org/10.3390/s26030812 - 26 Jan 2026
Viewed by 208
Abstract
The quality of disposable bamboo chopsticks directly affects consumers’ usage experience and health safety. Therefore, quality inspection is particularly important, and multi-label classification of defects can better meet the refined demands of actual production. While ViT has made significant progress in visual tasks, [...] Read more.
The quality of disposable bamboo chopsticks directly affects consumers’ usage experience and health safety. Therefore, quality inspection is particularly important, and multi-label classification of defects can better meet the refined demands of actual production. While ViT has made significant progress in visual tasks, it has limitations when dealing with extreme aspect ratios like bamboo chopsticks. To address this, this paper proposes an improved ViT model, C-ViT, introducing a convolutional neural network feature extraction module (CFE) to replace traditional patch embedding, making the input features more suitable for the ViT model. Moreover, existing loss functions in multi-label classification tasks focus on label prediction optimization, making hard labels difficult to learn due to their low gradient contribution. Therefore, this paper proposes a Hard Examples Contrastive Loss (HCL) function, dynamically selecting hard examples and combining label and feature correlation to construct a contrastive learning mechanism, enhancing the model’s ability to model hard examples. Experimental results show that on the self-built bamboo chopstick defect dataset (BCDD), C-ViT improves the mAP by 1.2% to 92.8% compared to the ViTS model, and can reach 94.3% after adding HCL. In addition, we further verified the effectiveness of the proposed HCL function in multi-label classification tasks on the VOC2012 public dataset. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

20 pages, 3656 KB  
Article
Efficient Model for Detecting Steel Surface Defects Utilizing Dual-Branch Feature Enhancement and Downsampling
by Quan Lu, Minsheng Gong and Linfei Yin
Appl. Sci. 2026, 16(3), 1181; https://doi.org/10.3390/app16031181 - 23 Jan 2026
Viewed by 96
Abstract
Surface defect evaluation in steel production demands both high inference speed and accuracy for efficient production. However, existing methods face two critical challenges: (1) the diverse dimensions and irregular morphologies of surface defects reduce detection accuracy, and (2) computationally intensive feature extraction slows [...] Read more.
Surface defect evaluation in steel production demands both high inference speed and accuracy for efficient production. However, existing methods face two critical challenges: (1) the diverse dimensions and irregular morphologies of surface defects reduce detection accuracy, and (2) computationally intensive feature extraction slows inference. In response to these challenges, this study proposes an innovative network based on dual-branch feature enhancement and downsampling (DFED-Net). First, an atrous convolution and multi-scale dilated attention fusion module (AMFM) is developed, incorporating local–global feature representation. By emphasizing local details and global semantics, the module suppresses noise interference and enhances the capability of the model to separate small-object features from complex backgrounds. Additionally, a dual-branch downsampling module (DBDM) is developed to preserve the fine details related to scale that are typically lost during downsampling. The DBDM efficiently fuses semantic and detailed information, improving consistency across feature maps at different scales. A lightweight dynamic upsampling (DySample) is introduced to supplant traditional fixed methods with a learnable, adaptive approach, which retains critical feature information more flexibly while reducing redundant computation. Experimental evaluation shows a mean average precision (mAP) of 81.5% on the Northeastern University surface defect detection (NEU-DET) dataset, a 5.2% increase compared to the baseline, while maintaining a real-time inference speed of 120 FPS compared to the 118 FPS of the baseline. The proposed DFED-Net provides strong support for the development of automated visual inspection systems for detecting defects on steel surfaces. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

21 pages, 4363 KB  
Article
LESSDD-Net: A Lightweight and Efficient Steel Surface Defect Detection Network Based on Feature Segmentation and Partially Connected Structures
by Jiayu Wu, Longxin Zhang and Xinyi Pu
Sensors 2026, 26(3), 753; https://doi.org/10.3390/s26030753 - 23 Jan 2026
Viewed by 121
Abstract
Steel surface defect detection is essential for maintaining industrial production quality and operational safety. However, existing deep learning-based methods often encounter high computational costs, hindering their deployment on mobile devices. To effectively address this challenge, we propose a lightweight and efficient steel surface [...] Read more.
Steel surface defect detection is essential for maintaining industrial production quality and operational safety. However, existing deep learning-based methods often encounter high computational costs, hindering their deployment on mobile devices. To effectively address this challenge, we propose a lightweight and efficient steel surface defect detection network based on feature segmentation and partially connected structures, termed LESSDD-Net. In LESSDD-Net, we first introduce a lightweight downsampling module called the cross-stage partial-based dual-branch downsampling module (CSPDDM). This module significantly reduces the number of model parameters and computational costs while facilitating more efficient downsampling operations. Next, we present a lightweight attention mechanism known as coupled channel attention (CCAttention), which enhances the model’s capability to capture essential information in feature maps. Finally, we improve the faster implementation of cross-stage partial bottleneck with two convolutions (C2f) and design a lightweight version called the lightweight and partial faster implementation of cross-stage partial bottleneck with two convolutions (LP-C2f). This module not only enhances detection accuracy but also further diminishes the model’s size. Experimental results on the data-augmented Northeastern University surface defect detection (NEU-DET) dataset indicate that the mean average precision (mAP) of LESSDD-Net improves by 3.19% compared to the baseline model YOLO11n. Additionally, in terms of model complexity, LESSDD-Net reduces the number of parameters and computational costs by 39.92% and 20.63%, respectively, compared to YOLO11n. When compared with other mainstream object detection models, LESSDD-Net achieves top detection accuracy with the highest mAP value and demonstrates significant advantages in model complexity, characterized by the lowest number of parameters and computational costs. Full article
(This article belongs to the Section Physical Sensors)
Show Figures

Figure 1

17 pages, 4725 KB  
Article
Hyperspectral Inversion of Soil Organic Carbon in Daylily Cultivation Areas of Yunzhou District
by Zelong Yao, Xiuping Ran, Chenbo Yang, Ping Li and Rutian Bi
Sensors 2026, 26(2), 740; https://doi.org/10.3390/s26020740 - 22 Jan 2026
Viewed by 81
Abstract
Accurate determination of Soil Organic Carbon (SOC), which is the foundation of soil health and safeguards ecological and food security, is crucial in local agricultural production. We aimed to investigate the influence of soil texture on hyperspectral models for predicting SOC content and [...] Read more.
Accurate determination of Soil Organic Carbon (SOC), which is the foundation of soil health and safeguards ecological and food security, is crucial in local agricultural production. We aimed to investigate the influence of soil texture on hyperspectral models for predicting SOC content and to evaluate the role of different preprocessing methods and feature band selection algorithms in improving modeling efficiency. Laboratory-determined SOC content and hyperspectral reflectance data were obtained using soil samples from daylily cultivation areas in Yunzhou District, Datong City. Mathematical transformations, including Savitzky–Golay smoothing (SG), First Derivative (FD), Second Derivative (SD), Multiplicative Scatter Correction (MSC), and Standard Normal Variate (SNV), were applied to the spectral reflectance data. Feature bands extracted based on the successive projection algorithm (SPA) and Competitive Adaptive Reweighted Sampling (CARS) were used to establish SOC content inversion models employing four algorithms: partial least-squares regression (PLSR), Random Forest (RF), Backpropagation Neural Network (BP), and Convolutional Neural Network (CNN). The results indicate the following: (1) Preprocessing can effectively increase the correlation between the soil spectral reflectance process and SOC content. (2) SPA and CARS effectively screened the characteristic bands of SOC in daylily cultivated soil from the spectral curves. The SPA algorithm and CARS selected 4–11 and 9–122 bands, respectively, and both algorithms facilitated model construction. (3) Among all the constructed models, the FD-CARS-PLSR performed most prominently, with coefficients of determination (R2) for the training and validation sets reaching 0.93 and 0.83, respectively, demonstrating high model stability and reliability. (4) Incorporating soil texture as an auxiliary variable into the PLSR inversion model improved the inversion accuracy, with accuracy gains ranging between 0.01 and 0.05. Full article
(This article belongs to the Special Issue Spectroscopy and Sensing Technologies for Smart Agriculture)
Show Figures

Figure 1

Back to TopTop