Next Article in Journal
Comparative Analysis of AWJM Performance in FFF-Printed PLA and PLA–CF: Influence of Process Parameters and Cutting Regions
Next Article in Special Issue
A Statistically Grounded and Physics-Aware Vision Framework for Detecting Barely Visible Impact Damage (BVID) in Heterogeneous Polymer-Matrix Composites
Previous Article in Journal
pH and Glucose Dual-Responsive Hybrid Polymeric Smart Insulin Carrier for Diabetes Treatment
Previous Article in Special Issue
Manufacturing-Induced Defect Taxonomy and Visual Detection in UD Tapes with Carbon and Glass Fiber Reinforcements
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

AI-Based Polymer Classification Using Ensemble Deep Learning and Heuristic Optimization: Implications for Recycling Applications

by
Mohammad Anwar Parvez
Department of Chemical Engineering, College of Engineering, King Faisal University, P.O. Box 380, Al-Ahsa 31982, Saudi Arabia
Polymers 2026, 18(10), 1208; https://doi.org/10.3390/polym18101208
Submission received: 1 April 2026 / Revised: 11 May 2026 / Accepted: 13 May 2026 / Published: 15 May 2026
(This article belongs to the Special Issue Artificial Intelligence in Polymers)

Abstract

Polymer-based product use is rapidly increasing worldwide, resulting in critical social, environmental, ecological, economic, and health effects. Worldwide efforts have increasingly focused on solutions to the equilibrium consumption, production, and disposal of plastics to tackle these issues. The frontiers of biodegradable and bio-based polymers are continually advancing in pursuit of sustainability. Therefore, designing ecological bioplastics made of both biodegradable and bio-based polymers reveals chances to overcome plastic pollution and resource depletion. Polymeric materials are mainly used to manufacture different products at the beginning of their lifespans and which become waste after usage. Numerous sustainability strategies and polymer recycling methods are described and mostly classified into chemical, mechanical, and thermal recycling processes. This manuscript presents a New Polymers Frontier in Recycling and Sustainability Using an Ensemble of Deep Learning with a Heuristic Search Algorithm (NPFRS-EDLHSA). This work is devoted to computational polymer typology, which is based on machine learning algorithms applied to data on physicochemical properties. Although polymer classification can facilitate downstream materials research, the present study does not directly simulate recycling, environmental impacts, or sustainability. The main contributions made by this work include (i) an exploratory analysis of ensemble deep learning models to classify polymers by type on a small and unbalanced dataset; (ii) an evaluation of the effect of feature selection with a heuristic optimization methodology; and (iii) a comparison of the effects on classification performance under limited data conditions. This research sets out to provide a methodological explanation, not arguments for industrial-scale applicability. For the polymer-type classification process, the proposed NPFRS-EDLHSA model designs an ensemble of deep learning techniques, namely a bidirectional recurrent neural network (BiRNN) model, a bidirectional gated recurrent unit (BiGRU) method, and a graph autoencoder (GAE) technique. Finally, the grasshopper optimization algorithm (GOA) adjusts the hyperparameter values of the ensemble models optimally and results in an improved classification performance. A wide-ranging set of experiments was conducted to validate the performance of the NPFRS-EDLHSA method. The experimental results indicated that the NPFRS-EDLHSA technique achieved a better performance than an existing model.

1. Introduction

Polymeric materials are essential in the modern world because of their durability and high strength-to-weight ratio [1]. In recent years, the production and use of plastics have expanded to include many other man-made materials, such as steel and cement, and millions of tons of plastics are produced worldwide. This number is expected to rise exponentially over the next 30 years across several production sectors [2]. Consequently, the rapid development of plastics has had a negative global effect stemming from the linear economy strategy, in which raw materials are gathered, converted into products, and ultimately discarded as waste [3]. Furthermore, the majority of plastic production utilizes non-renewable natural resources, and the handling of post-consumer products results in waste accumulation, environmental plastic contamination, and the depletion of additional resources. To address this issue, there is a need to create a more circular, closed-loop plastics consumption method. Plastic items and polymers owe their dominance and significant dependence to their remarkable physicochemical properties that guarantee affordability, durability, and light weight [4]. Due to their versatility, plastics are widely used materials with applications across multiple industrial sectors, like automotive vehicles, packaging, electronic gadgets, and construction [5].
Nowadays, recycling is the most significant activity available to decrease these effects and represents a popular and dynamic field in the plastics sector [6]. Reusing provides opportunities to decrease carbon dioxide emissions, the utilization of oil, and the quantities of waste requiring disposal [7]. Polymers are employed for many applications, including automotive, aerospace, sports, consumer goods, and medical; nevertheless, being non-biodegradable by nature, they extensively pollute the environment [8]. The waste disposal industries gather the material in general waste or in separated form, based on the geographical position [9]. Discharging waste into the environment and landfilling have led to enormous difficulties and pose a serious threat to life on Earth. Furthermore, raw material utilization and extraction might be decreased [10]. The application of artificial intelligence in polymer science is growing, but there are several problems to overcome. First, there is a limited availability of well-curated and labeled polymer datasets for the development and generalization of data-driven models. Datasets are often small, skewed, or missing standardized physicochemical descriptors, limiting their use. Second, it is inherently difficult to establish reliable relationships between polymer structure, physicochemical properties, and classification results: these relationships are complex and nonlinear due to complex molecular interactions. This makes it difficult for traditional machine learning approaches to identify important features. Third, although techniques such as deep learning or heuristic search have been used, there is a need for multi-strategy approaches in polymer informatics that can make use of multiple ways of feature engineering, machine learning, and optimization. Addressing these issues is essential to building intelligent systems for polymer classification, which are required to aid in downstream tasks such as material selection, recycling, and sustainable design.
This manuscript presents a New Polymers Frontier in Recycling and Sustainability Using an Ensemble of Deep Learning with Heuristic Search Algorithm (NPFRS-EDLHSA). The proposed NPFRS-EDLHSA model mainly focuses on providing an attractive solution for polymer recycling and sustainability using state-of-the-art techniques. The min–max normalization method is used in the pre-processing step to transform the feature values to a desired range, often [0, 1]. It allows all input variables to contribute equally to the model training and avoids features with large ranges from having more influence on the training process. It is worth noting that normalization does not eliminate noise from the data; rather, it scales the features and enhances numerical stability, especially in gradient-based optimization algorithms commonly employed in deep learning models. Following this, the binary dwarf mongoose optimizer (BDMO) algorithm has been implemented for the process of feature selection. For the polymer type classification process, the proposed NPFRS-EDLHSA model designs an ensemble of deep learning techniques, namely a bidirectional recurrent neural network (BiRNN) model, a bidirectional gated recurrent unit (BiGRU) method, and a graph autoencoder (GAE) technique. At last, the grasshopper optimization algorithm (GOA) adjusts the hyperparameter values of the ensemble models optimally and results in greater classification performance. A wide-ranging experimentation was performed to validate the performance of the NPFRS-EDLHSA method. The current research paper aims to compute the types of polymers by their physicochemical characteristics. Although polymer classification may be useful in materials research at the downstream level, this model does not explicitly simulate a recycling process or environmental impact.

2. Related Works

The integration of artificial intelligence (AI) into polymer recycling has gained significant attention in recent years, as the environmental and economic challenges posed by plastic waste become more pressing. Several studies have explored various AI techniques to enhance recycling processes, improve sorting accuracy, and optimize resource usage. Machine learning (ML) and deep learning (DL) methods have become central to addressing the challenges of polymer recycling. Kannan et al. [11] emphasize the application of ML and AI in multiple waste streams. These methods utilize AI-powered image detection and categorization to enhance distinct materials, assisting in improving the effectiveness of chemical recycling technologies; simultaneously, ML models allow cleaner processes for dealing with chemicals from material recovery. Higher accuracy is achieved in the elimination of beneficial elements from electronic waste through predictive analytics and automated disassembly. Despite the promising applications of AI, the recycling industry still faces significant hurdles. One such challenge is the inefficient classification of mixed plastic waste. Atasi et al. [12] proposed an AI-driven approach for developing innovative solutions in polymer synthesis and recycling processes. This model utilizes GA, which intends novel monomers and employs VFS to create nearly a million ROP polymers. ML techniques to forecast thermodynamic, thermal, and mechanical assets are vital for recyclability, and application-specific performance is employed to organize GA to optimize polymers. This model also projected possible additional polymers for polystyrene (PS) that attain each property, aiming to lower the estimated synthetic complexity. To overcome the limitations of traditional machine learning models, researchers have started integrating heuristic optimization techniques. The use of algorithms such as Genetic Algorithms (GAs), Particle Swarm Optimization (PSO), and Grasshopper Optimization Algorithm (GOA) has proven effective in improving the accuracy and efficiency of recycling processes. Duan et al. [13] elucidate the linkage between polymerization phenomena and catalyst deactivation in HG-AOP utilizing carbon materials. This technique shows that the removal of polymerization products results in a self-inhibition impact by sustaining a reliable polymerization energy barrier; however, to enhance the degree of polymerization (DP). These adherent layers contend with oxidants for active sites, obstructing electron transfer, impeding oxidant adsorption, and eventually hindering added catalytic activity.
Sundaralingam and Ramanathan [14] aimed to advance an automated method able to categorize plastic waste depending on its visual attributes. In real-world operation, an overhead camera places the positions of either the gripper or the waste items. By assessing the positional variance among gripper and waste, this method attains a higher degree of segregation precision, similar to human-like hand–eye coordination. Liang et al. [15] developed a method to incorporate the experimental process with AI to improve the purification of Se(IV)-affected water. In particular, an innovative approach is advanced for Se(IV) elimination by utilizing chitosan to decrease aqueous outputs of Se (IV).
Recent studies have shown that nanomaterial-reinforced polymer composites have become an important research direction because nanofillers can significantly modify the electrical, thermal, mechanical, and multifunctional behavior of polymeric materials. Carbon-based nanomaterials, particularly carbon nanotubes, graphene, graphene nanoplatelets, and carbon black, are widely investigated because their high aspect ratio, large surface area, and intrinsic conductivity allow them to form conductive networks inside polymer matrices. Shchegolkov et al. [16] reviewed the use of carbon nanotubes and graphene in polymer composites and highlighted the importance of synthesis, functionalization, and application strategies for improving functional composite performance. Similarly, Zare et al. [17] investigated the electrical conductivity of polymer composites containing carbon black nanoparticles and emphasized the roles of interphase depth, tunneling characteristics, and conductive network formation. From the thermal-property perspective, Li et al. [18] reviewed polymer-based thermally conductive functional materials and explained that suitable fillers can form thermally conductive networks or oriented structures within polymer matrices, thereby improving heat transfer performance. Musa et al. [19] further summarized recent advances in nano-enhanced polymer composite materials and showed that nanomaterials, especially carbon-based nanofillers, can improve electrical conductivity, thermal stability, mechanical strength, and multifunctional performance. In addition, VijayKashimatt et al. [20] reviewed conductive polymer composites with a focus on electrical conductivity and electromagnetic shielding, including conductivity mechanisms, thermal conductivity behavior, measurement techniques, and modeling approaches. These studies confirm that polymer properties are strongly influenced by nanofiller type, filler concentration, dispersion quality, polymer–filler interaction, interfacial characteristics, and conductive-network formation. However, unlike these studies, which mainly focus on experimental or theoretical analysis of nanofiller-enhanced electrical and thermal conductivity, the present work focuses on computational polymer typology classification using ensemble deep learning and heuristic optimization under limited and imbalanced data conditions.
Kern et al. [21] utilize digital reactions employing virtual forward synthesis (VFS) and ML models to quickly forecast mechanical, thermodynamic, and thermal assets. This work projected the structure, approaches, and primary outcome of the paper, emphasizing the possibility of ML and VFS. Lee et al. [22] focus on improving the rate of material recycling by advancing a separation method employing DL-based object detection. To enhance image data, labeling competence is attained for every type of material, and the execution of diverse labeling models is assessed. The average precision (AP) values for the single-material learning technique employing hybrid labeling showed better performance in contrast with other models. An integrated learning method is also advanced for composite materials. While significant progress has been made, many existing models still face challenges in terms of scalability, real-time processing, and environmental adaptability. Most approaches focus on isolated techniques, either applying deep learning or optimization algorithms, without fully exploring the synergies between these methods. The NPFRS-EDLHSA model presented in this study addresses these gaps by integrating deep learning techniques with heuristic search algorithms, providing a more robust, accurate, and scalable solution for polymer recycling and sustainability.

3. Proposed Method

In this manuscript, we have presented an NPFRS-EDLHSA methodology. The proposed NPFRS-EDLHSA model mainly focuses on providing an attractive solution for polymer recycling and sustainability using state-of-the-art techniques. Figure 1 depicts the entire flow of the NPFRS-EDLHSA method.

3.1. Pre-Processing

The data pre-processing stage uses min–max normalization to scale the features as well as bring about numerical stability when training the models. This method normalizes each feature to an absolute range or [0, 1], depending on the lowest and highest values of the feature. There is a need to scale the features to ensure that attributes with higher numbers do not overpower the learning process, mainly through gradient-based optimization techniques. It is necessary to note that normalization does not eliminate noise in the data; on the contrary, it standardizes the magnitudes of features to ease the convergence and comparison of effective models across attributes. Initially, the min–max normalization is applied in the data pre-processing stage to clean and convert raw data into an appropriate format. Min–max normalization is an approach applied for scaling data inside a particular range, normally between 0 and 1, by converting the values according to the maximum and minimum values of the dataset [23]. In terms of sustainability and recycling of polymers, it is used to normalize different environmental indicators, like material efficiency, energy consumption, and waste reduction amounts, permitting simpler comparison through dissimilar polymer materials or recycling processes. By converting data to a steady range, this method assists in recognizing patterns and tendencies that are important in enhancing recycling methods. It furthermore helps the growth of methods that enhance resource sustainability and utilization efforts in polymer recycling. This method guarantees that changing data scales does not bias study or decision-making in sustainability advantages.

3.2. BDMO-Based Feature Selection

The BDMO algorithm has been implemented for the process of feature selection. The BDMO is derived from the constant variation that is DMO [24]. The model is influenced by the dwarf mongoose’s behavior and its migratory lifestyle, which frequently requires the transfer from one place to another in pursuit of food. Therefore, the mongoose’s location is the primary focus of the change or mutation in the optimizer procedure. In contrast to the BPSO and BEOSA, which utilize a few unique transfer functions for normalizing the animal’s population, the model depends on calculating the optimal candidate food location as exposed in Equation (1) and then states the mutation among 0   s and 1   s applied in 0.5 < b e s t i + 1 f p 0.5 Equation (2). x i + 1 f p refers to the present optimal calculated food place, and the peep annotation is a sound produced in the search for this food locality. The calculation for the location of all DM within the population is obtained using Equation (3).
b e s t i + 1 f p = f p i + p h i × p e e p
where phi represents the random scaling factor, and peep denotes a stochastic perturbation factor inspired by the communication behavior of dwarf mongooses during foraging.
x d = s e l e c t e d   f e a t u r e   x i + 1 f p > 0.5 f e a t u r e   n o t   s e l e c t e d   x i + 1 f p 0.5
x i + 1 f p = b e s t i f p C F × r a n d × x i f p M   i f   φ i + 1 > ϕ i   e x p l o r a t i o n b e s t i f p + C F × r a n d × x i f p M   e l s e   e x p l o i t a t i o n
C F , which represents a parameter to direct the aggregate velocities of the mongooses’ group movement, in this equation, is inferred as F = 1 i t e r M a x i t e r 2 i t e r M a x i t e r , and M is a vector monitoring the group movement to sleep in other sleeping mounds ( s m ), which can be signified by M = i = 1 n x i × s m i x i . Here, M a x i t e r and i t e r indicate the maximal iteration counts for the optimizer procedure and the present point of iteration within the optimizer procedure. Meanwhile, the φ annotation aids in calculating the average s m as specified by φ = i = 1 n s m i n .
The fitness function (FF) reflects the classification precision and the selected feature counts. It maximizes the classification precision and reduces the set size of the selected features. As a result, the following FF is implemented to assess individual solutions, as presented in Equation (4).
F i t n e s s = α × E r r o r R a t e + 1 α × # S F # A l l _ F
where E r r o r R a t e represents the classification rate of error utilizing the designated features. # S F denotes selected feature counts and # A l l _ F signifies total attribute counts in the new dataset. α has been applied to regulate the prominence of subset length and classification quality. In this test, α is set to 0.9.

3.3. Ensemble Classification Process

For the polymer type classification process, the proposed NPFRS-EDLHSA model designs an ensemble of deep learning models such as the BiRNN model, BiGRU method, and GAE technique.

3.3.1. BiRNN Model

The entire sequence of features experiences an encoder utilizing a Bi-RNN network that stores the contextual vector for all output vectors [25]. The Bi-RNN’s forward and backward network iterate from time step t = 1 to T and t = T to 1. This includes concurrent calculation of the forward and backward hidden sequences h f and h b . The dual sequences are added to upgrade the sequence of output y , succeeding the patterns in Equations (5)–(7).
h f = W x h f x t + W h f h f t 1 + b h f
h b = W x h b x t + W h b h b t 1 + b h b
where W x h f and W x h b characterize the weighting matrices, which convert the input x t into the hidden layer (HL) area for the forward and reverse systems, respectively. W h f and W h b refer to weighted matrices, which link the HLs from the preceding time step to the present HL. The bias vectors b h f and b h b are added to the HLs to permit the method of learning an offset for improved data fitting.
At all time steps t , the forward and backward HLs, h f t and h b t , are joined to yield the output sequence y t , as demonstrated.
y t = W h f h f t + W h b h b t + b y
W h b and W h f signify weighted matrices, which estimate the backward and forward HLs into the output area, and b γ represents the bias vector used for the output. The last output y t is therefore weighting mixtures of the backward and forward HLs, fine-tuned by the bias. Figure 2 illustrates the infrastructure of BiRNN.

3.3.2. BiGRU Method

Bi-GRU is a multivariable time-series model derived from bidirectional GRU [26]. It may adjust to further composite sequence designs, and might successfully seize the dual-manner dependences in time-series data and the communication amongst numerous variables. One network processes time-series data from forward to reverse, while the other handles it from reverse to forward. In particular,
h f t = G R U h f t 1 ,   x t
h b t = G R U h b t + 1 ,   x t
where h f t and h b t characterize the hidden layer (HL) from left to right and vice versa, correspondingly. GRU characterizes the GRU component, and x t embodies the t t h component within the sequence of input. The last HL is the joining of HLs in dual directions.
h t = h f t ; h b t
Here, [;] signifies the joining process of vectors. At last, the HL is given to a fully connected (FC) layer to produce the result y t . The model of y t is stated as
y t = s o f t m a x W h t + b
Now, b and W illustrate the bias and weight of the FC layer, correspondingly, and s o f t m a x signifies the activation function of SoftMax.

3.3.3. GAE Classifier

The graph structure utilized in the research is the polymer samples, which are represented as the nodes, with the edge being defined by similarity in the normalized physicochemical feature space using a distance-based threshold. A graph autoencoder is used as an experimental representation learning aspect to determine whether relational statistics between samples can be used to improve classification results. Since the data is tabular, graph-based learning is used in an exploratory mode, and its usefulness is not presupposed but instead is assessed relative to each other. The effective application of graph data became a pressing concern, resulting in the need for investigators to explore the area of graph neural networks (GNNs) [27]. They proposed a new model for message aggregation and effectively utilized it in the conventional field of GNNs. Presently, the most commonly applied graph convolutional neural network (GCN) is the null field-based GCN. The theory of GCN originated from the model of the CNN in processing grid data, based on the basic concept of combining node-level data from localities to more effectively and flexibly gain the node’s feature representation. In conventional GCNs, the method of upgrading node features H ( l + 1 ) is characterized by Equation (12):
H ( l + 1 ) = σ ( D ~ 1 2 A ~ D ~ 1 2 H ( l ) )
where A ~ = A + I , I denotes the unit matrix, D ~ 1 2 refers to the normalization process of the matrix of degree D ,   H ( l ) stands for node feature representation at the present node layers by combining the characteristics of the nearby adjacent nodes, σ signifies a nonlinear activation function, and H ( l + 1 ) symbolizes node representation at the following layer.
GAE is the unsupervised learning method derived from GCN, tailored for aggregating important locality data and excavating the topological characteristics of graph structures. Nevertheless, conventional GCN models concentrate simply on particular local network architectures and require inadequate attention to higher-level data like node correlations and flow of information, leading to low performance in composite network studies. Still, these techniques concentrate only on sector-related strategies, regulating their flexibility and generalizability. Regarding the recent difficulties, this paper projects a universal incorporated algorithmic method; G A E accepts a multiview model to enlarge the receptive area of the model and makes a novel node embedding matrix by incorporating three dissimilar graph autoencoders to successfully remove important attributes. This combined mechanism is constructed to improve the universality of the model by removing main feature designs in some topologies of graph structures.

3.4. Parameter Tuning Using GOA

At last, the GOA adjusts the hyperparameter values of the ensemble models optimally, and the outcomes result in greater classification performance. The GOA is a bio-inspired swarm intelligence model, based on the collective and foraging swarming behavior detected in grasshoppers [28]. Extensively accepted by investigators, GOA has proved to be efficient in solving different optimization issues. Grasshoppers’ predatory strategies and unique social interactions, comprising constant location upgrades and encouraged configuration zones, allow them to balance local and global search, effectively recognizing the best solutions. The grasshopper swarming behavior is mathematically represented in Equation (13):
P i = S i + G i + A i
where P i characterizes the position of the i t h grasshopper, S i refers to the group behavior between grasshoppers within the swarm, G i specifies the gravitational force functioning on the i t h grasshopper, and A i has been applied for modeling the influence of wind on the movement of the grasshopper. To combine this, the grasshopper’s random behavior can be restructured as (14):
X i = r 1 S i + r 2 G i + r 3 A i
where r 1 ,   r 2 , and r 3 mean numbers dispersed in the interval of ( 0,1 ) .
The description of social interaction S i is given as (15):
S i = j = 1 j 1 N s d i j d ^ i j
Here, d i j denotes the distance from the i t h to the j t h grasshopper, computed as d i j = | x j x i | . s has been applied to represent the social force intensity; d ^ i j signifies a unit vector originating from the i t h to the j t h grasshopper, measured as (16):
d ^ i j = x j x i d i j
The mathematical representation of social forces between grasshoppers, signified by the function s , was calculated as (17):
s r = f e r l e r
Here, the variable f implies the attraction intensity, while l designates the scope of the captivating range. The group behavior between grasshoppers is defined by forces of repulsion and attraction. These are subject to the distance between the grasshoppers, which can be reflected in the interval of [ 0 , 15 ] . The force of gravity G i is computed as (18):
G i = g e ^
Now, the variable g characterizes the gravitational constant, and e ^ g means the unit vector focused on the center of the Earth.
Equation (19) displays the wind advection A i :
A i = u e ^ w
Here, the drift constant is u and the unit vector of the wind in the direction is e ^ w . By substituting the values of G and A , the resultant equation turns out to be
X i = j = 1 j 1 N s x j x i x j x i d i j g e ^ g + u e ^ w
Now, the total locust count within the population is N .
In Equation (20), it cannot be straightforwardly used for optimization functions owing to the particular restriction. During this expression, the grasshoppers quickly reach their secure position, which leads to the swarm’s incapacity to efficiently move towards the preferred target point. To deal with this problem, investigators have established an improved version of the equation, which can be given in Equation (21):
X i d = c j = 1 j 1 N c u b d l b d 2 s x j d x i d x j x i d i j + T ^ d
Here, l b d and u b d specify the lower and upper limits in the d t h dimension, correspondingly. T ^ d implies the best solution established to date inside the d t h dimensional area, and c represents a control coefficient that regulates the balance between exploration and exploitation during the optimization process. The fitness choice is the extensive aspect that manipulates the performance of the GOA. The hyperparameter selection process comprises using the solution encoder model to compute the efficiency of candidate solutions. Also, the GOA utilizes precision as the primary criterion to define the fitness function (FF), as shown in the expression.
F i t n e s s = m a x P
P = T P T P + F P
Here, the true positive value symbolizes TP and the false positive value signifies FP.
To more comprehensively assess classification performance, other metrics, such as the Matthews Correlation Coefficient (MCC), are included alongside accuracy, precision, recall, and F1-score. MCC is suitable for imbalanced data as it takes into account true and false negatives and positives.
The MCC is defined as
M C C = T P × T N F P × F N T P + F P ) ( T P + F N ) ( T N + F P ) ( T N + F N
Moreover, to prevent data leakage and ensure the experiments’ validity, the data folder is partitioned using k-fold cross-validation. The data is split into training and testing sets using k-fold cross-validation to ensure that each fold is disjointed. Both feature selection and feature normalization are also performed within the training folds to avoid the model training being affected by the test set.

3.5. Polymer Data Representation and Feature Encoding

In the presented research, how the polymer data is represented is essential for the learning and classification process. A series of physicochemical properties is used to describe each polymer sample and is used as input features to the proposed model. The extracted features include important parameters such as glass transition temperature (Tg), melting point, viscosity, rheology, elasticity, Young’s modulus, and resilient modulus. These features are well-established as key characteristics that represent the thermal, mechanical, and structural behavior of polymers, thus offering valuable insights to distinguish among different groups of polymers. These features are tabulated as a vector format, with each polymer represented as a vector of features. These features are normalized before training to standardize the scales and enhance the numerical stability of the model during training. This format enables efficient processing of structured data by the deep learning models, as well as the interactions between multiple physicochemical features. To incorporate the graph-based aspect of the model, particularly the Graph Autoencoder (GAE), the tabular features are also converted into a graph. Here, polymer samples are represented as nodes, and edges are determined by similarity in the feature space after normalization. Using a distance threshold (e.g., Euclidean distance) to decide whether two nodes should be connected, an adjacency matrix representing the graph structure is created. This graph structure allows the model to learn latent relationships and neighborhood information among polymer samples, thereby enhancing the feature-based learning of the sequential models.
In summary, this bidirectional representation, which combines both tabular feature vectors and graph structures, allows for a more holistic approach in understanding polymer data by considering both individual property features and relationships among polymer samples.

4. Experimental Result and Analysis

The performance validation of the NPFRS-EDLHSA approach is verified under the dataset [29]. The computational implementation was carried out using Python, Python Software Foundation, Wilmington, DE, USA, available online: https://www.python.org/ (accessed on 14 December 2025). This dataset contains 42 instances under nine classes, as depicted in Table 1. An initial set of multiple physicochemical features is considered for polymer representation. Key parameters such as glass transition temperature (ranging from −70 °C to 150 °C), melting points (from 100 °C to 350 °C), viscosity (from 0.01 to 1000 Pa·s), rheology (between 0.1 and 10 Pa·s), elasticity (ranging from 1 to 10 GPa), Young’s modulus (ranging from 0.5 to 4 GPa), and resilient modulus (between 1 and 5 GPa) are integrated. These features provide critical insights into the physical properties and behavior of the polymers. By including these parameters, the model is better equipped to capture the associations among diverse polymer types, resulting in more accurate classification of the nine class labels. The inclusion of these chosen features significantly improves the capability of the technique to distinguish between classes and improve overall classification performance. The data used in this paper will be a sample of 42 polymer samples in nine polymer classes, where there is an imbalance of classes and very few samples on some classes. This is not a large enough dataset to train deep learning models that have a high guarantee of generalization. Therefore, the outcomes of the experiment are to be understood as exploratory and proof-of-concept instead of being statistically conclusive. This research is aimed at discussing methodological behavior in limited data conditions instead of creating an implementable classification system. Larger and publicly verifiable datasets with an equal content of classes will be necessary in the future to support the strong validation. In order to minimize bias due to a single train–test split, the evaluation of the models is conducted through the cross-validation with k-fold. The performance measures are reported in the form of an average within the folds. Class imbalance leads to precision, recall, and F1-score being evaluated together to prevent the interpretation of performance in a misguided way. The high scores in classifications that have been observed should be viewed with a lot of caution because small sample sizes tend to inflate performance estimates.
Figure 3 shows the classifier performances of the NPFRS-EDLHSA model. Figure 3a represents the entire confusion matrix through the precise detection and classification of all classes. Figure 3b illustrates the PR study, which indicates an improved performance over all classes. Finally, Figure 3c demonstrates the ROC outcome, which signifies capable solutions with great ROC values for dissimilar class labels.
Table 2 and Figure 4 deliver the classifier result of the NPFRS-EDLHSA method below several metrics. The performances indicate that the NPFRS-EDLHSA method accurately detected and classified all the classes. The proposed NPFRS-EDLHSA algorithm obtains an average a c c u y of 98.77%, p r e c n of 98.89%, r e c a l of 98.77%, and F 1 s c o r e of 98.76%.
Figure 5 depicts the accuracy and loss study of the NPFRS-EDLHSA system. The accuracy outcome of the NPFRS-EDLHSA technique gains cumulative values across growing epochs. Moreover, the rising validation through training indicates that the NPFRS-EDLHSA model performs proficiently on the test dataset. At last, the loss curve implies that the NPFRS-EDLHSA algorithm attains closer values. The NPFRS-EDLHSA model understands the test dataset competently.
Table 3 offers a comparative study of the NPFRS-EDLHSA method with existing methodologies below several metrics [30].
Figure 6 examines the a c c u y result of the NPFRS-EDLHSA model with existing approaches. The figure shows that the proposed NPFRS-EDLHSA technique has reached a greater a c c u y value of 98.77%. In the meantime, the existing algorithms, such as Inception, SVM, RF, NB, LSTM, KNN, and VGG16 models, have attained diminishing returns of a c c u y values of 92.54%, 96.70%, 96.53%, 97.79%, 96.28%, 96.52%, and 94.74%.
Figure 7 studies the p r e c n , r e c a l , and F 1 s c o r e performance of the NPFRS-EDLHSA system with existing methodologies. According to p r e c n , the NPFRS-EDLHSA model has a superior p r e c n of 98.89%, whereas the Inception, SVM, RF, NB, LSTM, KNN, and VGG16 algorithms have a minimum p r e c n of 90.99%, 92.98%, 96.99%, 93.74%, 94.14%, 96.44%, and 97.08%, correspondingly. Moreover, according to r e c a l , the NPFRS-EDLHSA model has a superior r e c a l of 98.77%, whereas the Inception, SVM, RF, NB, LSTM, KNN, and VGG16 algorithms have a minimum r e c a l of 94.06%, 90.02%, 94.61%, 95.95%, 93.84%, 96.98%, and 90.25%, correspondingly. In addition, according to F 1 s c o r e , the NPFRS-EDLHSA model has a superior F 1 s c o r e of 98.76%, whereas the Inception, SVM, RF, NB, LSTM, KNN, and VGG16 algorithms have a minimum F 1 s c o r e of 95.83%, 97.63%, 93.36%, 89.86%, 92.72%, 96.88%, and 96.28%, correspondingly.
In Table 4 and Figure 8, the execution time of the NPFRS-EDLHSA method with existing techniques is showcased. The performances imply that the NPFRS-EDLHSA algorithm produces a minimal ET value of 11 min, while the Inception, SVM, RF, NB, LSTM, KNN, and VGG16 models obtain better ET values of 73 min, 57 min, 59 min, 30 min, 56 min, 74 min, and 73 min, respectively. Benchmarks on execution time are documented with the same hardware and training settings, but the architectures of different models and their convergence characteristics differ, so they are only indicative and not absolute benchmarks.
The size of the dataset used in this work is small and is suitable only for initial explorations. Although the proposed model shows a good classification accuracy, it is worth noting that the small sample size is associated with an increased risk of overfitting, where the model may fit the training data but not capture general patterns. While k-fold cross-validation has been used to reduce this effect, it is impossible to completely rule out optimistic bias in the performance estimates due to the limited sample size and class imbalance. Moreover, the current research has not involved external validation on independent or large-scale polymer datasets, which is crucial to validate the performance and generalization of the proposed model. This limits the claim of general applicability of the method in polymer classification and recycling applications. Hence, the findings in this work should be regarded as proof-of-concept and, in future studies, the proposed model will be tested using larger datasets, publicly available data, and more diverse data to improve the reliability and reproducibility.
In addition to the numerical evaluation, the observed classification performance can be related to the underlying physicochemical behavior of polymers. The selected features, such as glass transition temperature, melting point, viscosity, and elastic modulus, are directly associated with molecular structure, chain mobility, and intermolecular interactions. For instance, polymers with higher glass transition temperatures and modulus values typically exhibit stronger intermolecular forces and restricted chain movement, making them structurally distinct from more flexible polymers. Similarly, variations in viscosity and rheological behavior reflect differences in molecular weight distribution and chain entanglement. The proposed model effectively captures these nonlinear relationships between structural properties and polymer types, which contributes to its improved classification performance. Thus, the results are not only computationally significant but also consistent with established principles of polymer chemistry and material behavior.
The ablation study is performed with the presence and absence of each level in the proposed model, and the performance is presented in Table 5.
To better understand the contribution of each component in the proposed NPFRS-EDLHSA model, an ablation study was carried out by removing key modules one at a time. It was observed that the performance gradually decreased whenever a component such as BiGRU, GAE, feature selection (BDMO), or hyperparameter tuning (GOA) was excluded. This clearly indicates that each part of the model contributes to improving the overall classification performance. In addition, a paired t-test was performed, and the results showed that the improvements achieved by the proposed model are statistically significant (p < 0.05). However, since the dataset used in this study is relatively small, these results should be interpreted within the context of the current experimental setup.

5. Conclusions

In this manuscript, we have presented an NPFRS-EDLHSA. The proposed NPFRS-EDLHSA model mainly focuses on providing an attractive solution for polymer recycling and sustainability using state-of-the-art techniques. Initially, the min–max normalization is applied in the data pre-processing stage for noise removal. Following that, the BDMO algorithm has been implemented for the process of feature selection. For the polymer type classification process, the proposed NPFRS-EDLHSA model designs an ensemble of deep learning models, such as the BiRNN model, BiGRU method, and GAE technique. At last, the GOA adjusts the hyperparameter values of the ensemble models optimally, and the outcomes result in greater classification performance. The model was tested experimentally and showed an average accuracy of 98.77 and precision, recall, and F1-score values of 98.89, 98.77, and 98.76, respectively. Although these findings indicate that the model is capable of differentiating the types of polymers in the dataset under investigation, it is important to recognize that the sample size used is rather small to make the results statistically sound. A wide-ranging experimentation was performed to validate the performance of the NPFRS-EDLHSA method. The experimental results specified that the NPFRS-EDLHSA technique emphasized betterment over another existing model. The next stage of this exploration framework is to incorporate larger polymer datasets by incorporating more explicit models of recyclability, degradability, and properties related to processing and use, which would allow for developing a more substantial relationship between polymer classification and sustainability-oriented decision-making.

Funding

This work was supported by the Deanship of Scientific Research, Vice Presidency for Graduate Studies and Scientific Research, King Faisal University, Saudi Arabia [Grant No. KFU252182].

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding authors.

Acknowledgments

The author would like to express their gratitude to King Faisal University for its support in providing a conducive research and learning environment, which facilitated the successful completion of this research work.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Mangold, H.; von Vacano, B. The frontier of plastics recycling: Rethinking waste as a resource for high-value applications. Macromol. Chem. Phys. 2022, 223, 2100488. [Google Scholar] [CrossRef]
  2. Akovali, G.; Bernardo, C.A.; Leidner, J.; Utracki, L.A.; Xanthos, M. (Eds.) Frontiers in the Science and Technology of Polymer Recycling; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2013; Volume 351. [Google Scholar]
  3. Beghetto, V.; Sole, R.; Buranello, C.; Al-Abkal, M.; Facchin, M. Recent advancements in plastic packaging recycling: A mini-review. Materials 2021, 14, 4782. [Google Scholar] [CrossRef] [PubMed]
  4. Lino, V.; Castaldo, R.; Gentile, G.; Manini, P. Reusable melanin-based biosorbents for efficient methylene blue removal: The new frontier of fungi-inspired allomelanin coatings for sustainable water remediation processes. Mater. Today Sustain. 2023, 21, 100283. [Google Scholar] [CrossRef]
  5. Paraye, P.; Sarviya, R.M. Advances in polymer composites, manufacturing, recycling, and sustainable practices. Polym. Plast. Technol. Mater. 2024, 63, 1474–1497. [Google Scholar] [CrossRef]
  6. Sheldon, R.A.; Basso, A.; Brady, D. New frontiers in enzyme immobilisation: Robust biocatalysts for a circular bio-based economy. Chem. Soc. Rev. 2021, 50, 5850–5862. [Google Scholar] [CrossRef]
  7. Hartmann, F.; Baumgartner, M.; Kaltenbrunner, M. Becoming sustainable, the new frontier in soft robotics. Adv. Mater. 2021, 33, 2004413. [Google Scholar] [CrossRef]
  8. Marino, T.; Galiano, F.; Molino, A.; Figoli, A. New frontiers in sustainable membrane preparation: Cyrene™ as green bioderived solvent. J. Membr. Sci. 2019, 580, 224–234. [Google Scholar] [CrossRef]
  9. Al Rashid, A.; Koç, M. Additive manufacturing for sustainability and circular economy: Needs, challenges, and opportunities for 3D printing of recycled polymeric waste. Mater. Today Sustain. 2023, 24, 100529. [Google Scholar] [CrossRef]
  10. Mostafa, A.L.; Mohamed, M.A.; Ahmed, S.; Youssef, W.M.M. Application of Artificial Intelligence Tools with BIM Technology in Construction Management: Literature Review. Int. J. BIM Eng. Sci. 2023, 6, 39–54. [Google Scholar] [CrossRef]
  11. Kannan, S.; Prabakaran, D.; Dhenesh Kumar, S.; Sivaram, S. A Deep Learning-Based Convolution Neural Networks to Forecast Wind Energy. In Proceedings of the 2023 International Conference on Recent Trends in Electronics and Communication (ICRTEC), Mysore, India, 10–11 February 2023; pp. 1–6. [Google Scholar] [CrossRef]
  12. Atasi, C.; Kern, J.; Ramprasad, R. Design of Recyclable Plastics with Machine Learning and Genetic Algorithm. J. Chem. Inf. Model. 2024, 64, 9249–9259. [Google Scholar] [CrossRef] [PubMed]
  13. Duan, P.J.; Liu, J.Y.; Chen, L.; Li, M.X.; Pan, J.W.; Zhang, Z.Q.; Bai, C.W.; Chen, X.J.; Yu, H.Q.; Chen, F. Polymeric products deactivate carbon-based catalysts in catalytic oxidation reactions. Nat. Water 2025, 3, 178–190. [Google Scholar] [CrossRef]
  14. Sundaralingam, S.; Ramanathan, N. Recyclable plastic waste segregation with deep learning based hand-eye coordination. Environ. Res. Commun. 2024, 6, 045007. [Google Scholar] [CrossRef]
  15. Liang, Y.; Yin, Y.; Deng, Q.; Jiao, S.; Liang, X.; Huo, C.; Luo, Y. Efficient and easily recyclable photocatalytic reduction of Se (IV) from wastewater using stable TiO2/BiOBr/cloth: Mechanism insight and machine learning modeling. Sep. Purif. Technol. 2025, 352, 128021. [Google Scholar] [CrossRef]
  16. Shchegolkov, A.V.; Shchegolkov, A.V.; Kaminskii, V.V. Carbon Nanotubes and Graphene in Polymer Composites for Strain Sensors: Synthesis, Functionalization, and Application. J. Compos. Sci. 2026, 10, 43. [Google Scholar] [CrossRef]
  17. Zare, Y.; Munir, M.T.; Choi, J.H.; Rhee, K.Y. Estimation of electrical conductivity for polymer composites with carbon black nanoparticles by interphase depth, tunneling characteristics and network percentage. Sci. Rep. 2026, 16, 11023. [Google Scholar] [CrossRef]
  18. Li, Z.; Sun, Y.; Hu, F.; Liu, D.; Zhang, X.; Ren, J.; Guo, H.; Shalash, M.; He, M.; Hou, H.; et al. An overview of polymer-based thermally conductive functional materials. J. Mater. Sci. Technol. 2025, 218, 191–210. [Google Scholar] [CrossRef]
  19. Musa, A.A.; Bello, A.; Adams, S.M.; Onwualu, A.P.; Anye, V.C.; Bello, K.A.; Obianyo, I.I. Nano-enhanced polymer composite materials: A review of current advancements and challenges. Polymers 2025, 17, 893. [Google Scholar] [CrossRef] [PubMed]
  20. M. G., V.; Vatnalmath, M.; Auradi, V. A review on conductive polymer composites focusing on advancements in electrical conductivity, and electromagnetic shielding capabilities. Polym. Compos. 2025, 46, 16507–16537. [Google Scholar] [CrossRef]
  21. Kern, J.; Su, Y.; Gutekunst, W.; Ramprasad, R. An Informatics Framework for the Design of Sustainable, Chemically Recyclable, Synthetically-Accessible and Durable Polymers. arXiv 2024, arXiv:2409.15354. [Google Scholar] [CrossRef]
  22. Lee, H.; Park, Y.; Kim, K.; Lee, H. Development of a deep learning-based object recognition system for pre-stage separation to improve the recycling rate of major general-purpose plastics. Sep. Sci. Technol. 2025, 60, 121–132. [Google Scholar] [CrossRef]
  23. Islam, M.J.; Ahmad, S.; Haque, F.; Reaz, M.B.I.; Bhuiyan, M.A.S.; Islam, M.R. Application of min-max normalization on subject-invariant EMG pattern recognition. IEEE Trans. Instrum. Meas. 2022, 71, 2521612. [Google Scholar] [CrossRef]
  24. Oyelade, O.N.; Aminu, E.F.; Wang, H.; Rafferty, K. An adaptation of hybrid binary optimization algorithms for medical image feature selection in neural network for classification of breast cancer. Neurocomputing 2025, 617, 129018. [Google Scholar] [CrossRef]
  25. Ge, M.; Yuan, Y. Evaluation model design of project construction safety level based on bidirectional recurrent neural network (BiRNN) and bidirectional long short-term memory (BiLSTM). PeerJ Comput. Sci. 2024, 10, e2351. [Google Scholar] [CrossRef]
  26. Hong, Y. Boosting the Accuracy of Stock Market Prediction via Multi-Layer Hybrid MTL Structure. arXiv 2025, arXiv:2501.09760. [Google Scholar] [CrossRef]
  27. Xie, C.; Huang, J.; Shi, Y.; Pang, H.; Gao, L.; Wen, X. Ensemble graph auto-encoders for clustering and link prediction. PeerJ Comput. Sci. 2025, 11, e2648. [Google Scholar] [CrossRef] [PubMed]
  28. Nguyen, K.D.; Tran, T.T.; Vo, D.N. Optimizing turbine location in upgraded wind farm using grasshopper optimization algorithm. Bull. Electr. Eng. Inform. 2025, 14, 1553–1569. [Google Scholar] [CrossRef]
  29. Available online: https://khazana.gatech.edu/dataset/ (accessed on 10 December 2025).
  30. Yılmaz, V.S.; Eseller, K.E.; Aslan, O.; Bayraktar, E. Classification of Different Recycled Rubber-Epoxy Composite Based on their Hardness Using Laser-Induced Breakdown Spectroscopy (LIBS) with Comparison Machine Learning Algorithms. Inventions 2023, 8, 54. [Google Scholar] [CrossRef]
Figure 1. Overall flow of the NPFRS-EDLHSA method.
Figure 1. Overall flow of the NPFRS-EDLHSA method.
Polymers 18 01208 g001
Figure 2. BiRNN structure.
Figure 2. BiRNN structure.
Polymers 18 01208 g002
Figure 3. Classifier outcome of NPFRS-EDLHSA model: (a) entire confusion matrix, (b) curve of PR, and (c) curve of ROC.
Figure 3. Classifier outcome of NPFRS-EDLHSA model: (a) entire confusion matrix, (b) curve of PR, and (c) curve of ROC.
Polymers 18 01208 g003
Figure 4. Average of NPFRS-EDLHSA approach under various metrics.
Figure 4. Average of NPFRS-EDLHSA approach under various metrics.
Polymers 18 01208 g004
Figure 5. Accuracy and loss analysis of NPFRS-EDLHSA approach.
Figure 5. Accuracy and loss analysis of NPFRS-EDLHSA approach.
Polymers 18 01208 g005
Figure 6. A c c u y outcome of NPFRS-EDLHSA technique with existing models.
Figure 6. A c c u y outcome of NPFRS-EDLHSA technique with existing models.
Polymers 18 01208 g006
Figure 7. P r e c n , r e c a l , and F 1 s c o r e outcome of NPFRS-EDLHSA algorithm with existing models.
Figure 7. P r e c n , r e c a l , and F 1 s c o r e outcome of NPFRS-EDLHSA algorithm with existing models.
Polymers 18 01208 g007
Figure 8. ET outcome of NPFRS-EDLHSA method with existing models.
Figure 8. ET outcome of NPFRS-EDLHSA method with existing models.
Polymers 18 01208 g008
Table 1. Dataset specification.
Table 1. Dataset specification.
ClassSample Number
Cycloalkanes6
Cycloalkenes1
Methylcycloalkanes4
Ethers1
Polyurethanes3
Lactams4
Lactones9
Thiolactones5
Others9
Total Instances42
Table 2. Classifier output of NPFRS-EDLHSA technique under various metrics.
Table 2. Classifier output of NPFRS-EDLHSA technique under various metrics.
Class A c c u y P r e c n R e c a l F 1 s c o r e
Cycloalkanes100.00100.00100.00100.00
Cycloalkenes100.00100.00100.00100.00
Methylcycloalkanes100.00100.00100.00100.00
Ethers100.00100.00100.00100.00
Polyurethanes100.00100.00100.00100.00
Lactams100.00100.00100.00100.00
Lactones100.0090.00100.0094.74
Thiolactones100.00100.00100.00100.00
Others88.89100.0088.8994.12
Average98.7798.8998.7798.76
Table 3. Comparison evaluation of the NPFRS-EDLHSA model with existing techniques.
Table 3. Comparison evaluation of the NPFRS-EDLHSA model with existing techniques.
Methods A c c u y P r e c n R e c a l F 1 s c o r e
Inception Model92.5490.9994.0695.83
SVM Classifier96.7092.9890.0297.63
Random Forest96.5396.9994.6193.36
Naïve Bayes97.7993.7495.9589.86
LSTM Model96.2894.1493.8492.72
KNN Algorithm96.5296.4496.9896.88
VGG16 Method94.7497.0890.2596.28
NPFRS-EDLHSA98.7798.8998.7798.76
Table 4. Execution time comparison of the NPFRS-EDLHSA method and existing models.
Table 4. Execution time comparison of the NPFRS-EDLHSA method and existing models.
MethodExecution Time (min)
Inception Model73
SVM Classifier57
Random Forest59
Naïve Bayes30
LSTM Model56
KNN Algorithm74
VGG16 Method73
NPFRS-EDLHSA11
Table 5. Ablation study and performance of the proposed model.
Table 5. Ablation study and performance of the proposed model.
Model VariantAccuracy (%)Precision (%)Recall (%)F1-Score (%)
BiRNN Only94.2194.6593.9894.1
BiRNN + BiGRU95.8796.1295.7495.92
Without GAE96.4596.896.2196.5
Without BDMO (Feature Selection)96.9297.196.7596.88
Without GOA (Hyperparameter Tuning)97.3597.697.297.32
Proposed NPFRS-EDLHSA98.7798.8998.7798.76
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Parvez, M.A. AI-Based Polymer Classification Using Ensemble Deep Learning and Heuristic Optimization: Implications for Recycling Applications. Polymers 2026, 18, 1208. https://doi.org/10.3390/polym18101208

AMA Style

Parvez MA. AI-Based Polymer Classification Using Ensemble Deep Learning and Heuristic Optimization: Implications for Recycling Applications. Polymers. 2026; 18(10):1208. https://doi.org/10.3390/polym18101208

Chicago/Turabian Style

Parvez, Mohammad Anwar. 2026. "AI-Based Polymer Classification Using Ensemble Deep Learning and Heuristic Optimization: Implications for Recycling Applications" Polymers 18, no. 10: 1208. https://doi.org/10.3390/polym18101208

APA Style

Parvez, M. A. (2026). AI-Based Polymer Classification Using Ensemble Deep Learning and Heuristic Optimization: Implications for Recycling Applications. Polymers, 18(10), 1208. https://doi.org/10.3390/polym18101208

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop