1. Introduction
The rapid development of the global textile industry, coupled with the widespread adoption of fast fashion consumption patterns, has led to a continuous reduction in the lifecycle of textiles. This trend has resulted in the generation of substantial amounts of textile waste, which poses a significant challenge for solid waste management [
1]. Statistics reveal that China alone produces over 20 million tons of textile waste each year, yet the overall recycling rate remains below 20% [
2]. This figure is markedly lower than the 25% target set for 2025 in the “Implementation Opinions on Accelerating the Recycling of Waste Textiles” [
3]. A considerable volume of unused textiles is relegated to landfills or incinerators, leading to considerable resource wastage and environmental pollution. Therefore, achieving efficient and precise sorting of waste textiles—especially blended fabrics such as cotton and polyester—constitutes a crucial prerequisite for enhancing their resource value and promoting a circular economy within the textile sector [
4].
In contemporary industrial contexts, the mainstream methods for sorting waste textiles still rely on manual sorting guided by experience or mechanical sorting that utilizes basic optical properties. Although commercial textiles are generally labeled with clear fiber composition ratios, such label information is completely inadequate for sorting in actual waste textile sorting scenarios, and is even inaccessible in many cases [
5]. Post-consumer waste textiles, including used garments and bed linens, have labels that are prone to blurring, abrasion or complete loss after long-term use, washing and wearing. In contrast, industrial waste textiles like production leftovers and defective products, unified and standardized labeling is usually lacking, and even existing labels are easily mixed into the waste stream and difficult to trace. Manual label verification is not only time-consuming and labor-intensive but also inefficient, resulting in a high error rate that hinders the large-scale resource utilization of waste textiles [
6]. This issue is particularly prominent for cotton/polyester blended waste fabrics with similar content ratios, because the difference in cotton-polyester blend ratios directly determines the recycling pathways, processing efficiency, and quality of recycled products. This challenge represents a fundamental research difficulty, however, with the increasing demand field of waste textile recycling. From the perspective of recycling processes, the pathways for pure cotton and pure polyester fabrics are well established and efficient. Pure cotton fabrics can be regenerated through mechanical crushing, biodegradation and other processes; pure polyester fabrics can be recycled through melt spinning. However, cotton-polyester blended fabrics with different ratios require tailored separation and purification processes; improper classification may result in the failure of subsequent procedures [
7,
8]. Due to the similar cotton and polyester content in blended waste fabrics, their chemical properties have slight differences, making it difficult for traditional sorting methods to achieve accurate composition identification and classification. Consequently, the development of automated and intelligent identification technologies is not only the key to breaking the bottleneck of waste textile sorting, but also an inevitable trend [
9]. In this context, NIR hyperspectral imaging technology exhibits great potential in the non-destructive and rapid identification of material composition. This technology can simultaneously capture spatial and spectral information of the target object, enabling the detection of subtle spectral differences between materials and facilitating high-precision classification [
10], thus providing a novel and reliable technical solution for the intelligent sorting of cotton-polyester blended waste fabrics and other solid wastes.
In recent years, component identification schemes combining spectral detection technology with algorithms have been extensively investigated in the field of resource recovery from waste textiles, owing to their non-destructive nature, efficiency, and cost-effectiveness. Within machine learning methodologies, Partial Least Squares (PLS) has emerged as a prominent algorithm for quantitative spectral analysis and is widely utilized for detecting blended content in textiles. Paz and Sousa [
11] utilized Partial Least Squares (PLS) to quantitatively predict cotton/polyester blend ratios. However, this approach demonstrates limited capacity for fitting nonlinear features in NIR hyperspectral data, leading to increased errors among samples with similar fiber compositions. Liu et al. [
12] and Rashed et al. [
13] applied Support Vector Machine (SVM) and Random Forest (RF) algorithms, respectively. Although they attempted qualitative classification and robust optimization, they encountered difficulties in accurately extracting subtle differential features from NIR hyperspectral data, ultimately failing to satisfy the requirements for industrial-level content classification. Deep learning has demonstrated excellent feature learning and nonlinear fitting capabilities in various research fields. For instance, in image dehazing, Zhang et al. proposed a wavelet-based physics guided normalization network (WBPGNDN), which demonstrated excellent haze removal performance in real-time traffic [
14]. In medical image analysis, segmentation technologies utilizing deep convolutional neural networks have emerged as a focal point in computer vision, significantly advancing the field of smart healthcare [
15]. In intelligent detection, U-Net and YOLOv4 models based on deep neural networks have been effectively employed for detecting steel surface defects, thereby enhancing the accuracy and efficiency of industrial assessments [
16]. As a typical representative of deep learning, Convolutional Neural Networks (CNNs) are feedforward neural networks with specific depths that optimize parameters through gradient descent algorithms [
17]. Traditional CNNs are mostly used for processing two-dimensional and three-dimensional image data. However, with the integration of deep learning and spectral technology, 1D-CNNs have gained prominence in the analysis of one-dimensional NIR hyperspectral data [
18]. By leveraging their local receptive fields and weight-sharing properties, 1D-CNNs efficiently extract spectral features and process nonlinear NIR hyperspectral data, showcasing significant application potential in agriculture, medicine, food, and other fields. In textile inspection, various models reveal considerable differences in design objectives and technical approaches, resulting in varying degrees of adaptation to the recycling and classification requirements of blended waste fabrics. Huang et al. [
19] utilized a traditional 1D CNN for the regression prediction of blended compositions and assessed the influence of various preprocessing methods on the 1D CNN model. Their primary aim was to accurately determine blend ratios, which contrasts sharply with the classification and sorting focus of the present study. In a different approach, Chen et al. [
20] developed a dual-input multi-scale 1D CNN model for predicting cashmere-wool fiber content. They constructed dual input features from both raw spectral data and dimension-reduced data to validate multi-channel feature fusion. However, their reliance on only raw spectral data limited the full utilization of spectral information. Qiu et al. [
21] sought to introduce a residual module into the 1D CNN to address the vanishing gradient problem. Nonetheless, their single-channel input architecture was restricted to extracting features from preprocessed NIR hyperspectral data, resulting in a limited feature representation dimension. Additionally, it did not effectively enhance key spectral features sensitive to variations in cotton/polyester content, leading to inadequate model discrimination capability for cotton/polyester blended waste fabrics with similar content gradients.
This paper integrates NIR hyperspectral imaging technology with deep learning techniques to propose an enhanced dual-input 1DCNN model (Dual-1DCNN-Residual-SE). The model generates input features by concatenating dimensions of raw NIR hyperspectral data with SG-smoothed NIR hyperspectral data. It addresses gradient vanishing issues during deep network training through a residual module and improves the weighting of key spectral features using the SE attention mechanism, ultimately achieving high-precision classification of five categories of cotton/polyester blended waste fabrics. To further assess the optimization effects of the attention mechanism, comparative experiments involving various attention mechanisms (e.g., CBAM, ECA) were conducted to systematically evaluate the influence of different attention modules on spectral feature extraction. Ablation experiments examined the individual contributions of dual-channel input, the residual module, and the SE attention mechanism, thereby providing technical support for the intelligent sorting of industrial waste fabrics.
3. Materials and Methods
3.1. Experimental Instruments and Samples
The experimental platform includes a NIR hyperspectral camera, a light source, a conveyor belt (
Figure 3d), and an industrial control computer (
Figure 3b), as illustrated in
Figure 3, which depicts the NIR hyperspectral acquisition and sorting system. The HX-17S NIR hyperspectral camera, manufactured by Hangzhou Gaopu Imaging Technology Co., Ltd., Hangzhou, China (
Figure 3a), was employed to capture NIR hyperspectral images of waste fabrics with varying cotton/polyester content. The distance between the lens and the fabric was maintained at 70 cm, while the platform moved at a speed of 1 m/s. Detailed parameters, including the spectral range, are provided in
Table 5. The light source consisted of two rows of eight 100 W adjustable-power halogen bulbs (
Figure 3c).
The experiment utilized 80 samples of waste cotton/polyester blended fabrics with varying content ratios, provided by the Zhejiang Institute of Quality Sciences (ZIQS). Each fabric sample was labeled to indicate its quantitative composition. Before the experiment commenced, all samples were documented. The fabrics were then categorized into five groups based on their cotton/polyester content ratios, as detailed in
Table 6 (where C represents cotton and T represents polyester). The interval classification of cotton content simulates the actual composition gradient of industrial cotton/polyester blended waste fabrics, and the sample quantity is determined based on the actual proportion of waste fabrics in industrial recycling. The total number of experimental samples is 80.
3.2. Black-and-White Correction
In the RGB color system, a standard photograph consists of three channels, each representing a grayscale image. Each pixel is associated with three brightness values, typically ranging from 0 to 255. In contrast, NIR hyperspectral images comprise 512 bands, with each band serving as a grayscale image that contains brightness information. Each pixel within these bands corresponds to a brightness value, which reflects the radiation captured by the camera’s internal sensor. In the context of NIR hyperspectral data, this brightness value is designated as a Digital Number (DN). These DN values are obtained using a NIR hyperspectral camera.
To mitigate the effects of environmental factors, such as dark current and unstable light sources, on NIR hyperspectral images during data acquisition, it is essential to apply black-and-white correction when converting DN values into more physically meaningful reflectance values. Initially, raw NIR hyperspectral data of the sample are obtained using the camera’s built-in software. Following this, raw NIR hyperspectral data of a standard white plate and a black background are collected to facilitate the black-and-white calibration. The calibration formula is represented as Equation (1):
In the equation, represents the corrected reflectance value of the sample; denotes the corrected DN value of the sample; indicates the DN value of the black background; signifies the DN value of the corrected whiteboard; ⨀ denotes bitwise multiplication; represents the reflectance value of the corrected whiteboard.
3.3. Data Analysis
Figure 4 shows the average NIR hyperspectral curves of the five sample groups. Notably, pure cotton (C100), pure polyester (T100), and cotton/polyester blended fabrics with different ratios (C40, C60, and C80) exhibit distinct reflectance characteristics in the 1400–1500 nm wavelength range. Pure cotton shows the most pronounced reflectance trough in this band, with a significantly lower reflectance intensity than the other groups. In contrast, pure polyester has the shallowest trough and maintains a relatively high reflectance level. The blended fabrics (C40, C60, C80) display intermediate reflectance values, forming a clear gradient with increasing cotton content. As the proportion of cotton rises, the reflectance trough deepens and the overall reflectance intensity decreases gradually. This difference is mainly caused by the distinct molecular structures and hydrophilic/hydrophobic properties of cotton and polyester. Cotton’s main component, cellulose, contains numerous polar hydroxyl groups (-OH) that readily form hydrogen bonds with water molecules in the 1400–1500 nm region. This interaction leads to strong near-infrared absorption and thus low reflectance. Conversely, polyester exhibits weak polarity and strong hydrophobicity, which limits its interaction with water molecules. Consequently, its absorption in this wavelength range is weak, resulting in higher reflectance.
In general, higher cotton content strengthens the absorption of near-infrared light through hydroxyl groups, leading to deeper reflectance troughs. This trend forms a clear spectral differentiation feature in the 1400–1500 nm band that is closely related to the cotton/polyester blending ratio.
3.4. Preprocessing of Spectral Data
Preprocessing of NIR hyperspectral data is essential before model training to minimize noise interference on experimental results. Common preprocessing methods for NIR hyperspectral data include Savitzky-Golay (SG) smoothing [
27], Multiplicative Scatter Correction (MSC) [
28], Standard Normal Variate Transformation (SNV) [
29], as well as first-order and second-order derivatives [
30].
Different preprocessing methods are suited to specific application scenarios: MSC and SNV are particularly effective for eliminating baseline drift resulting from sample particle size and uneven surface scattering. In contrast, first and second derivatives enhance spectral feature resolution but may also amplify high-frequency noise. The primary objective of this study is to effectively suppress random noise while preserving the spectral details related to fibers in cotton/polyester blended fabrics. This goal aligns well with the benefits of Savitzky-Golay (SG) smoothing. Consequently, SG smoothing was employed to preprocess five sets of NIR hyperspectral data. The resulting preprocessed hyperspectral data maps are presented in
Figure 5. SG smoothing, a widely utilized preprocessing technique, reduces noise while maintaining overall spectral trends and characteristic details. Its fundamental approach involves a sliding window strategy that performs local polynomial fitting on NIR hyperspectral data. When compared to the original NIR hyperspectral data curves, the SG-smoothed data clearly demonstrate diminished high-frequency fluctuations while retaining reflectance trends and differences across the entire wavelength range for each group. The overall smoother curves effectively suppress various interference signals.
3.5. Model Establishment
When analyzing NIR hyperspectral data from cotton/polyester blended waste fabrics, traditional 1D-CNN models often face challenges such as vanishing gradients in deep networks, uniform treatment of all spectral channels, and limitations from single-channel input. These issues make it difficult to accurately distinguish subtle spectral differences between categories with similar compositions. To address these problems, this paper proposes a dual-channel spectral feature fusion one-dimensional convolutional neural network (Dual-1DCNN-Residual-SE) based on the conventional 1DCNN framework. The network structure is illustrated in
Figure 6.
First, the raw NIR hyperspectral data and the SG-smoothed NIR hyperspectral data are concatenated along the channel dimension to from a dual-channel spectral feature matrix of size (2, 196). Here, 2 denotes the two channels—raw NIR hyperspectral data and SG-smoothed NIR hyperspectral data—while 196 indicates the number of spectral wavelength points. Subsequently, this feature matrix is fed into the network’s initial convolutional module. This module increases the number of channels to 32 by employing 32 one-dimensional convolutional kernels of size 7 × 1. It then reduces the dimensionality further through a 3 × 1 max-pooling layer, ultimately producing a feature map of size (32, 49). This procedure facilitates the preliminary extraction of local spectral band features while maintaining a balance between feature retention and computational efficiency.
Second, to mitigate the vanishing gradient problem and to enhance feature reuse, the (32, 49) feature map is input into the residual module, as depicted in
Figure 7. This module comprises two stacked one-dimensional convolutional layers, each with a kernel size of 3 × 1, each followed by batch Normalization and a ReLU activation function are applied. A shortcut connection is incorporated to directly sum the input and output features of the module, maintaining the output size at (32, 49). This residual structure allows the network to learn both the original shallow features and the transformed deep features concurrently, thereby effectively capturing subtle spectral differences among similar categories.
Based on the residual features, the residual module’s feature maps are utilized to pinpoint crucial spectral channel characteristics for the adaptive focus classification of cotton/polyester blended waste fabrics while suppressing redundant noise channels. To achieve this, the feature maps are inputted into the channel attention module, which is structured based on the SE (Squeeze-and-Excitation) attention mechanism [
31] as depicted in
Figure 8. Initially, adaptive global average pooling compresses the 49-dimensional local features of each channel into 1-dimensional global features. Subsequently, a fully connected layer with a dimension reduction factor of 16 (dimension transformation: 32 → 2 → 32) learns dependencies between channels, producing channel weight coefficients ranging from 0 to 1. By matching these coefficients to the original feature map’s dimensions and conducting per-channel multiplication, the mechanism enhances spectral feature bands based on their weights. The resulting attention-weighted feature map maintains a size of (32, 49), allowing the model to selectively amplify core spectral features and improve its capability to differentiate between similar categories of cotton/polyester blended waste fabrics.
The attention-weighted feature map (32, 49) is subsequently processed by the convolution and attention fusion module. A 3 × 1 one-dimensional convolution layer increases the number of channels from 32 to 64 while reducing the spectral length to 25, resulting in a high-dimensional feature map of size (64, 25). To emphasize critical information within these deep, high-dimensional features, the feature map is further processed through a second SE channel attention module, which shares the same structure as the preceding attention module. This step generates channel weight coefficients and performs weighted enhancement, keeping the output size at (64, 25).
Finally, adaptive average pooling converts the feature map into a 128-dimensional global feature vector. After Dropout regularization with a probability of 0.3, the vector is fed into a fully connected classification layer to complete the five-category classification of cotton/polyester blended waste fabrics.
3.6. Model Evaluation
To evaluate model performance, accuracy and F1-score serve as the primary evaluation metrics. The formulas for calculating these metrics are as follows:
In these formulas, TP represents the number of samples predicted as positive that are indeed positive; TN indicates the number of samples predicted as negative that are actually negative; FP refers to the number of samples predicted as positive but are actually negative; and FN denotes the number of samples predicted as negative that are, in fact, positive.
4. Conclusions
This paper focuses on the sorting and recycling of industrial cotton/polyester blended waste fabrics, and solves the key problems including the low efficiency of traditional sorting methods, difficulty in distinguishing cotton/polyester blended waste fabrics with similar blending ratios due to their similar chemical compositions, and limited feature extraction capabilities of existing models. Different from most existing studies that focus on quantitative composition prediction, this work realizes high-precision classification identification of cotton-polyester blended fabrics with five typical content gradients, which is more in line with the actual needs of industrial waste textile sorting. By combining near-infrared hyperspectral imaging technology with deep learning, we propose an enhanced 1DCNN model (Dual-1DCNN-Residual-SE) with dual-channel input, residual structure, and SE attention mechanism. This model accurately classifies five cotton/polyester blending ratios. Compared with methods based on visual or physical/chemical sensing, our approach is less affected by fabric appearance and achieves non-destructive, high-precision recognition, making it more suitable for industrial recycling scenarios.
Experimental results indicate that the Dual-1DCNN-Residual-SE model achieves a classification accuracy of 95.94%, significantly surpassing traditional machine learning algorithms including K-Nearest Neighbors (KNN), Random Forest (RF), Support Vector Machine (SVM), and Partial Least Squares (PLS). Comparison of three common attention mechanisms—SE, ECA, and CBAM—reveals that the SE attention mechanism exhibits superior performance and is better suited for this model. Ablation experiments confirmed the performance advantage of dual-channel NIR hyperspectral input compared to single-channel input, as well as the optimization effects of the residual and attention modules on the model. Additionally, t-SNE dimensionality reduction visualization analysis elucidates the reasons behind the model’s misclassification of categories with similar cotton/polyester blend ratios.
In the future, we plan to further integrate the method proposed in this study into the intelligent sorting assembly line for industrial waste textiles. Following near-infrared hyperspectral acquisition, real-time classification output will be achieved through model inference and be seamlessly incorporated into subsequent recycling processes to provide practical support for efficient industrial sorting. Meanwhile, future work will expand the dataset size and explore the use of Generative Adversarial Networks (GANs) to expand spectral samples, thereby enhancing the model’s generalization capability in complex scenarios. In addition, the model will be expanded to classify and identify other common blended textile materials, thereby further improving the technical system for the resource recycling of waste textiles.