1. Introduction
Glass artworks represent a unique intersection of artistic expression and material science, whose preservation is of paramount importance for cultural heritage [1]. These objects, ranging from historical stained glass windows to intricate decorative pieces, are valued not only for their aesthetic qualities but also for the historical and technological information they embody [2]. Glass artifacts not only reflect artistic and aesthetic intentions but also preserve information about historical production techniques, raw material sources, and technological innovations, providing valuable insights for both art historians and materials scientists [3,4,5]. The surface integrity of glass artworks is highly sensitive to environmental conditions, including fluctuations in humidity, temperature, exposure to atmospheric pollutants, and variations in light intensity [6]. These environmental factors often interact in complex, nonlinear ways, influencing weathering processes at multiple scales, from atomic-level compositional changes to micrometer-scale cracking and macroscopic surface corrosion [3]. Over time, these factors induce complex physicochemical weathering processes, resulting in microstructural deterioration, compositional alterations, and visible changes in color and texture. The susceptibility and manifestation of such weathering processes often depend strongly on the glass composition. For example, lead-barium glass, commonly used in decorative and optical objects, exhibits distinct surface corrosion patterns due to the leaching of lead and barium ions [7,8]. Understanding these composition-dependent mechanisms is essential for evaluating the state of preservation, predicting potential risks, and designing targeted conservation strategies. Traditional approaches, however, often fall short in capturing the complex, multidimensional interactions among compositional parameters, environmental conditions, and observable deterioration features [9]. The wide variety of glass formulations and environmental exposures further complicates the establishment of universal criteria for evaluating deterioration. These challenges underscore the need for systematic, data-driven frameworks that integrate chemical, physical, and visual characteristics, enabling more comprehensive, predictive, and interpretable analyses of glass weathering patterns.
Traditional methods for investigating glass weathering primarily rely on laboratory-based analytical techniques such as X-ray diffraction (XRD), scanning electron microscopy (SEM), Raman spectroscopy, and inductively coupled plasma (ICP) analysis [10]. These methods provide detailed insights into surface morphology, crystallographic changes, and elemental composition, offering high-resolution data essential for understanding specific weathering mechanisms [7]. However, while such analytical measurements remain fundamental for obtaining reliable compositional information, the subsequent interpretation and classification of glass types or weathering states often depend on manual analysis and conventional statistical approaches. Furthermore, conventional studies often rely on single-variable correlations, which limits their ability to capture the complex, nonlinear interactions among chemical composition, environmental exposure, and observable physical changes in glass artifacts [11]. This complexity poses significant challenges to traditional analytical and statistical frameworks. These limitations highlight the growing need for systematic, data-driven approaches that can integrate chemical, physical, and visual features in a unified framework, enabling more accurate classification, prediction, and mechanistic understanding of glass weathering processes while improving scalability, reproducibility, and interpretability.
Recent advancements in machine learning (ML) offer a promising avenue to overcome these limitations by enabling the extraction of meaningful patterns and predictive insights from complex, heterogeneous datasets [12]. In materials science, ML techniques have been increasingly employed to model composition–property relationships, predict degradation or corrosion behavior, and guide the design of novel materials with enhanced stability [13,14,15]. In the context of heritage science, data-driven approaches have demonstrated success in predicting pigment degradation, assessing metal corrosion, and evaluating stone weathering under environmental stressors [16,17]. However, existing research has yet to establish a systematic and integrated framework for analyzing glass weathering, and most studies address the issue only through isolated case analyses. Several independent research groups have approached this problem using diverse machine learning methods. Li's research group [18] employed a joint Daen-LR, ARIMA-LSTM, and MLR architecture (JMLA) to analyze the chemical composition of ancient glass, demonstrating improved classification accuracy and efficiency in glass type identification. Rahman and colleagues [19] developed a deep learning-based glass classification model using a convolutional neural network (CNN) that extracts hierarchical features from oxide content data, capturing complex patterns to accurately identify glass types. Chen and colleagues [20] applied random forest and BP neural networks to successfully recognize eight major and minor sub-classes of glass artifacts, highlighting the potential of neural network-based supervised learning for subclass identification. Meng's team [21] conducted principal component analysis on chemical composition data and developed a case-specific clustering algorithm (K-means++) to categorize glass relics, achieving robust clustering validated by inertia and silhouette scores. Tang's research group [22] proposed a stacking integration classification combined with Gaussian mixture clustering (SIC-GMC) for component correction and category identification of weathered silicate glass, integrating ensemble learning with probabilistic clustering to handle small datasets. Xu's group [23] developed a support vector machine classification model, training it on known samples and applying it to predict the classification of unknown ancient glass artifacts based on their chemical composition. Chen and collaborators [24] constructed a classification framework using decision trees, support vector machines, and logistic regression based on glass patterns, colors, surface weathering, types, and composition ratios, complemented by K-means clustering for subclassification of high-potassium and lead-barium glass. Cai's research team [25] utilized a generalized Shapley function based on fuzzy measurements to analyze the correlation between chemical composition indicators across different glass categories, revealing systematic changes in correlations from unweathered to weathered lead-barium and high-potassium glasses, thus assisting archaeological classification. Despite these promising developments, the systematic application of ML to glass weathering remains limited, particularly when considering multi-dimensional datasets that combine chemical composition, physical properties, and observable visual characteristics. Integrating such approaches holds significant potential for developing scalable, non-destructive, and reproducible frameworks for assessing and predicting surface weathering in glass artworks, ultimately supporting more informed preservation strategies and deepening our understanding of material-specific degradation processes over time.
In this study, we construct a machine learning-based analytical framework using a dataset of weathered glass samples that includes both high-potassium and lead-barium glass, together with their associated chemical compounds. By integrating statistical analysis with methods such as the Prototypical Network, Gaussian Mixture Model (GMM), Orthogonal Partial Least Squares Discriminant Analysis (OPLS-DA), and Mutual Information, the proposed system aims to identify key compositional indicators, reveal intrinsic clustering patterns, and enhance the interpretability of glass weathering behavior. This work establishes a systematic, data-driven approach for understanding and predicting surface deterioration in glass artworks. Its significance lies in bridging materials science and cultural heritage research, providing a reproducible and non-destructive means to analyze complex degradation processes. The framework not only deepens the scientific understanding of composition-dependent weathering mechanisms but also offers practical insights for the long-term preservation and sustainable management of historical glass artifacts.
Despite the strengths of this integrated analytical framework, several limitations should be acknowledged. First, the dataset used in this study may not cover the full spectrum of glass types and weathering conditions, which could restrict the generalizability of the resulting models when applied to broader archaeological or historical contexts. Second, potential biases and noise introduced during data collection, including variations in sample preparation and analytical instrumentation, may affect the stability and accuracy of the predictions. Third, the current work primarily focuses on the influence of chemical composition on glass classification and weathering behavior, while external environmental factors such as soil characteristics, humidity, temperature, and long-term burial conditions are not explicitly incorporated. These omissions may limit the framework’s ability to capture the complete range of mechanisms that drive surface deterioration in glass materials. Future studies incorporating more diverse datasets and additional environmental parameters would help mitigate these constraints and strengthen the robustness of the analytical approach.
Taken together, these considerations outline both the potential and the current boundaries of data driven approaches in cultural heritage materials research. By recognizing these limitations, this study provides a basis for future work that expands the dataset, incorporates environmental factors, and validates the framework across diverse glass contexts. The following sections present the methodology and empirical findings that demonstrate the framework’s ability to enhance the predictive understanding and interpretability of glass weathering.
2. Methods
2.1. Workflow
In this study, we designed a machine learning-based analytical system for the study of weathered glass, aiming to establish a comprehensive and data-driven framework for understanding, classifying, and interpreting glass weathering phenomena. The overall pipeline is shown in Figure 1. The workflow follows a progressive, four-stage approach, moving from macroscopic correlation analysis to detailed chemical interpretation.
Stage 1: Macro-level Correlation Analysis
This initial step employs Pearson correlation analysis to identify potential associations between observable weathering conditions and categorical glass attributes, including glass type, color, and surface pattern. This stage serves as a global screening step, highlighting which broad categorical factors may be influenced by weathering. The results provide a statistical anchor for the subsequent modeling stages and help narrow the analytical focus.
Stage 2: Primary Glass Type Classification
Building on the correlation findings, a Prototypical Network is adopted for the primary classification task, distinguishing between high-potassium and lead-barium glass. The PN is particularly suitable for few-shot learning scenarios, which is essential given the limited sample size typical of heritage glass datasets. The model learns an embedding function that maps chemical composition features into a metric space, where each class is represented by a prototype (i.e., the mean embedding of support samples). Classification is performed by measuring the distance between a query sample and these class prototypes.
Stage 3: Fine-grained Subclass Analysis and Validation
After identifying the major glass types, Gaussian Mixture Models (GMMs) are applied within each type to detect latent compositional subclusters. This probabilistic clustering approach captures subtle chemical heterogeneity and enables fine-grained subclass discovery. To validate the statistical robustness of these subclasses, Orthogonal Partial Least Squares Discriminant Analysis (OPLS-DA) is subsequently employed. OPLS-DA separates predictive (between-class) variation from orthogonal (within-class) variation, thereby providing a rigorous evaluation of subclass separability.
Stage 4: Mechanistic Chemical Correlation Analysis
In the final stage, Mutual Information (MI)-based network analysis is conducted to quantify nonlinear statistical dependencies among chemical components. By constructing an information-theoretic network of elemental interactions, this analysis reveals co-varying chemical elements and potential structural or weathering-related relationships. This stage provides mechanistic insight into the compositional patterns observed in the classification and clustering results.
2.2. Data Overview
The data were acquired from a private archaeology database containing the chemical composition detection results of 58 glass cultural relics, comprising a total of 66 measurement points. For each glass relic, basic attributes including ornamentation, color, and glass type (high-potassium or lead-barium) were recorded. At each detection point, the degree of weathering and the contents of 14 chemical components were measured. These components included SiO2, Na2O, K2O, CaO, MgO, Al2O3, Fe2O3, CuO, PbO, BaO, P2O5, SrO, SnO, and SO2.
Overall, the dataset describes glass artifacts using four categories of information: glass type, ornamentation, color, and chemical composition. The glass type was classified into two groups: high-potassium and lead-barium. Ornamentation was simplified into three patterns (A, B, and C). Color was categorized into eight shades: light green, light blue, dark green, dark blue, purple, green, teal, and black.
Since the chemical composition variables represent relative proportions of oxides, the dataset constitutes compositional data. Direct application of conventional statistical methods to such data may lead to spurious correlations caused by the closure effect. Therefore, the Centered Log-Ratio (CLR) transformation [26] was applied to the chemical composition variables prior to subsequent analysis. For a composition vector $\mathbf{x} = (x_1, x_2, \ldots, x_D)$ with strictly positive components, the CLR transformation is defined as:

$$\operatorname{clr}(\mathbf{x}) = \left( \ln \frac{x_1}{g(\mathbf{x})}, \ln \frac{x_2}{g(\mathbf{x})}, \ldots, \ln \frac{x_D}{g(\mathbf{x})} \right),$$

where $g(\mathbf{x}) = \left( \prod_{i=1}^{D} x_i \right)^{1/D}$ denotes the geometric mean of all components. This transformation maps the compositional data from the simplex space to real space, thereby mitigating the closure effect and enabling the application of conventional statistical methods in subsequent analyses.
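As a minimal sketch (not the study's actual preprocessing code), the CLR transformation above can be implemented in a few lines of NumPy; the small epsilon added to guard against zero components is an assumption, since the text does not state how below-detection values were handled:

```python
import numpy as np

def clr_transform(x, eps=1e-9):
    """Centered log-ratio transform of a composition vector.

    Subtracting the mean of the logs is equivalent to dividing each
    component by the geometric mean before taking the log. The eps
    guard for zero components is an illustrative assumption.
    """
    x = np.asarray(x, dtype=float) + eps
    log_x = np.log(x)
    return log_x - log_x.mean()

# Example: a hypothetical 4-part composition (percentages summing to 100)
composition = [69.3, 9.9, 6.3, 14.5]
z = clr_transform(composition)
print(z.sum())  # CLR vectors sum to (numerically) zero
```

A useful sanity check is that every CLR-transformed vector sums to zero, which reflects the one-dimensional constraint inherited from the simplex.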
2.3. Chi-Squared Test
The Chi-squared test is a statistical technique used to compare observed data with expected distributions based on a specific hypothesis. Its main purpose is to determine whether the differences between observed and expected values can be attributed to chance or if they suggest a significant relationship between variables [27,28,29]. The formula for Pearson's chi-squared test is

$$\chi^2 = \sum_{i} \sum_{j} \frac{\left( O_{ij} - E_{ij} \right)^2}{E_{ij}}.$$
The Yates' corrected chi-square test is appropriate when analyzing a 2 × 2 contingency table formed by two binary categorical variables, particularly when one or more cells have expected frequencies between 1 and 5 [30,31,32]. This correction reduces the risk of Type I errors by making the test more conservative. For our dataset, as certain expected cell frequencies fall within this range, we employed the Yates' continuity correction to adjust the chi-square test:

$$\chi^2_{\text{Yates}} = \sum_{i} \sum_{j} \frac{\left( \left| O_{ij} - E_{ij} \right| - 0.5 \right)^2}{E_{ij}},$$

where $O_{ij}$ and $E_{ij}$ represent the observed and expected frequencies for the cell in the $i$-th row and $j$-th column, respectively. The degrees of freedom ($df$) for a contingency table are defined as $df = (r - 1)(c - 1)$, where $r$ is the number of rows and $c$ is the number of columns.
However, while the chi-square test determines whether a statistically significant association exists between variables, the test statistic itself is highly sensitive to sample size and does not directly reflect the strength of the relationship. To address this, we calculated Cramér's V as a measure of effect size to assess the practical significance of the observed associations [29]. Cramér's V scales the chi-square statistic to a range between 0 and 1, where 0 indicates no association and 1 indicates a perfect relationship. For a 2 × 2 table, it is calculated as:

$$V = \sqrt{\frac{\chi^2}{N}},$$

where $N$ is the total number of observations. To evaluate the practical significance of the findings, the magnitude of the effect size was interpreted as follows [33]: weak ($V < 0.3$), moderate ($0.3 \le V < 0.5$), and strong ($V \ge 0.5$). This classification ensures that even when statistical significance is achieved, the actual strength of the association is rigorously assessed.
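The Yates-corrected statistic and Cramér's V defined above can be computed directly from a contingency table. The following NumPy sketch uses a hypothetical 2 × 2 table, not the study's data; by convention, Cramér's V is computed from the uncorrected statistic:

```python
import numpy as np

def yates_chi2_and_cramers_v(table):
    """Yates-corrected chi-squared statistic and Cramér's V for a 2x2 table."""
    obs = np.asarray(table, dtype=float)
    n = obs.sum()
    # Expected counts under independence: outer product of margins / n
    expected = np.outer(obs.sum(axis=1), obs.sum(axis=0)) / n
    chi2_plain = ((obs - expected) ** 2 / expected).sum()
    chi2_yates = ((np.abs(obs - expected) - 0.5) ** 2 / expected).sum()
    v = np.sqrt(chi2_plain / n)  # min(r-1, c-1) = 1 for a 2x2 table
    return chi2_yates, v

# Hypothetical table: rows = glass type, columns = weathered / unweathered
chi2_c, v = yates_chi2_and_cramers_v([[20, 5], [10, 15]])
print(round(chi2_c, 2), round(v, 3))  # 6.75 0.408
```

Equivalent results can be obtained from `scipy.stats.chi2_contingency` with `correction=True`; the explicit version above makes the correction term visible.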
2.4. Prototypical Network
Prototypical Networks are popular few-shot solvers that aim at establishing a feature metric generalizable to novel few-shot classification (FSC) tasks using deep neural networks [34,35]. Their simplicity and computational efficiency make them an appealing alternative to more complex meta-learning algorithms for few-shot and zero-shot learning. As shown in Figure 2, the framework is composed of an embedding network that maps input samples into a low-dimensional feature space and a prototype metric classifier that performs classification based on the Euclidean distance between embedded samples and class prototypes.
The basic principle is as follows [36,37]: given a sample set $S$ and a query set $Q$, PN is achieved by first constructing the prototypes of all sample classes, then measuring the distance between the query data features and the class prototypes using a fixed function. The prototype for the $k$-th class, $\mathbf{c}_k$, is calculated as

$$\mathbf{c}_k = \frac{1}{|S_k|} \sum_{(\mathbf{x}_i, y_i) \in S_k} f_\phi(\mathbf{x}_i),$$

where $\mathbf{x}_i$ is the $i$-th sample of class $k$ and $f_\phi$ denotes the feature extraction module that extracts the data into a feature vector. Then, for a query sample $\mathbf{x}$ from $Q$, the probability that it belongs to the $k$-th class is calculated by a non-parametric softmax classifier:

$$p(y = k \mid \mathbf{x}) = \frac{\exp\left( -d\left( f_\phi(\mathbf{x}), \mathbf{c}_k \right) \right)}{\sum_{k'} \exp\left( -d\left( f_\phi(\mathbf{x}), \mathbf{c}_{k'} \right) \right)},$$

where $d(\cdot, \cdot)$ is a metric function. Taking the negative logarithm of the probability, the cross-entropy loss function can be computed as follows:

$$\mathcal{L} = -\log p\left( y = k \mid \mathbf{x} \right).$$
To implement the proposed Prototypical Network framework in this study, a lightweight embedding network based on a multilayer perceptron (MLP) was constructed to accommodate the small tabular dataset. The embedding network consists of two fully connected layers, where the 14-dimensional input features are first projected to a 16-dimensional hidden representation followed by a ReLU activation function, and then further transformed into an 8-dimensional embedding space used for metric-based classification. The model parameters were initialized using the default Kaiming uniform initialization in PyTorch (version 2.5.1). Training was performed using the Adam optimizer with a learning rate of 1 × 10−3 for 30 epochs and a batch size of 16, without applying dropout or weight decay. Class prototypes were computed as the mean embedding of all samples belonging to each class in the training set, and classification was carried out by measuring the Euclidean distance between the query embeddings and these prototypes. The architecture and hyperparameters were selected to match the limited scale of the dataset and to ensure stable and efficient convergence without extensive hyperparameter tuning.
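The prototype construction and nearest-prototype classification steps can be illustrated without the learned MLP embedding; in this hedged sketch, raw feature vectors stand in for the embeddings produced by f_phi, which is an assumption made purely for illustration:

```python
import numpy as np

def build_prototypes(X, y):
    """Class prototype = mean feature vector of each class's support samples."""
    return {label: X[y == label].mean(axis=0) for label in np.unique(y)}

def classify(query, prototypes):
    """Assign the query to the class with the nearest Euclidean prototype.

    This is the argmax of the softmax-over-negative-distances rule,
    since the largest probability corresponds to the smallest distance.
    """
    labels = list(prototypes)
    dists = [np.linalg.norm(query - prototypes[c]) for c in labels]
    return labels[int(np.argmin(dists))]

# Toy support set: two classes in a 2-D "embedding" space
X = np.array([[0.0, 0.1], [0.1, 0.0], [5.0, 5.1], [5.1, 4.9]])
y = np.array([0, 0, 1, 1])
protos = build_prototypes(X, y)
print(classify(np.array([4.8, 5.2]), protos))  # 1
```

In the full model, `X` would hold the 8-dimensional embeddings produced by the trained two-layer MLP rather than raw features.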
2.5. Gaussian Mixture Model
Gaussian Mixture Modeling (GMM) is performed using the sklearn.mixture package in Python 3.10 to identify potential sub-clusters within the glass samples. To account for the varying scales of different chemical components, the multivariate chemical composition data (comprising 14 features) are first normalized using Z-score standardization [38,39]. Our approach treats GMM as a multivariate clustering method. The probability density of the data $\mathbf{x}$ is defined as a weighted sum of $K$ Gaussian distributions:

$$p(\mathbf{x}) = \sum_{k=1}^{K} \pi_k \, \mathcal{N}\left( \mathbf{x} \mid \boldsymbol{\mu}_k, \boldsymbol{\Sigma}_k \right),$$

where $\pi_k$, $\boldsymbol{\mu}_k$, and $\boldsymbol{\Sigma}_k$ represent the mixing proportions, mean vectors, and full covariance matrices for each component, respectively. The model parameters $\theta = \{ \pi_k, \boldsymbol{\mu}_k, \boldsymbol{\Sigma}_k \}_{k=1}^{K}$ are determined through the Expectation-Maximization (EM) algorithm, initialized via the k-means strategy [40].
Model selection, specifically the determination of the optimal number of Gaussian components, is based on minimizing the Bayesian Information Criterion (BIC), defined as:

$$\mathrm{BIC} = -2 \ln \hat{L}\left( M_K \right) + p \ln n,$$

where $\hat{L}(M_K)$ is the maximized likelihood function of model $M_K$ with $K$ components, with maximizing parameters $\hat{\theta}$ determined through the EM algorithm, $n$ is the sample size, and $p$ is the number of estimated parameters [41]. The BIC is a log-likelihood-based metric that includes a penalty term for model complexity, helping to prevent overfitting, and has been widely validated across diverse applications [42].
To further validate the clustering quality and ensure the mathematical validity of the identified subclasses, the Silhouette Coefficient and the Davies–Bouldin Index (DBI) are employed as quantitative internal evaluation metrics. The Silhouette Coefficient measures how similar an individual sample is to its assigned cluster relative to other clusters, with values ranging from −1 to 1, where a higher average value indicates superior cluster separation [43]. The Davies–Bouldin Index evaluates the average similarity between each cluster and its most similar one, where a lower value signifies a more distinct and compact clustering structure [44]. By identifying the number of components that simultaneously achieve a minimum BIC and a minimum DBI while maintaining a high Silhouette Coefficient, the most statistically robust classification of glass artifacts is determined.
To facilitate visualization and exploratory analysis, Principal Component Analysis (PCA) is applied to reduce the dimensionality of the multivariate chemical composition data. PCA projects the original high-dimensional feature space onto a lower-dimensional subspace spanned by the leading principal components, which capture the majority of data variance [45,46]. In this study, the first two principal components are retained and used for visualizing the distribution patterns and clustering tendencies of glass samples, providing an intuitive representation that complements the subsequent GMM analysis.
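The model-selection loop described above can be reproduced in miniature with scikit-learn; the synthetic two-cluster data below is an assumption standing in for the standardized 14-feature composition matrix:

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.metrics import silhouette_score, davies_bouldin_score
from sklearn.mixture import GaussianMixture
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in: two well-separated groups of 4-feature samples
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 0.5, (40, 4)), rng.normal(5, 0.5, (40, 4))])
X = StandardScaler().fit_transform(X)  # Z-score standardization

# Choose the number of components by minimum BIC
bics = {}
for k in range(1, 5):
    gmm = GaussianMixture(n_components=k, covariance_type="full",
                          init_params="kmeans", random_state=0).fit(X)
    bics[k] = gmm.bic(X)
best_k = min(bics, key=bics.get)

# Validate the chosen solution with internal cluster-quality metrics
labels = GaussianMixture(n_components=best_k, random_state=0).fit_predict(X)
print(best_k, round(silhouette_score(X, labels), 2),
      round(davies_bouldin_score(X, labels), 2))

# First two principal components for visualization, as in the study
X2 = PCA(n_components=2).fit_transform(X)
```

On real data the BIC, DBI, and Silhouette curves would be inspected jointly, as the text describes, rather than trusting the BIC minimum alone.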
2.6. OPLS-DA
Orthogonal projection to latent structures discriminant analysis (OPLS-DA) is a widely used statistical method in multivariate data analysis, particularly in class pattern recognition [47,48]. In OPLS-DA, the response $\mathbf{Y}$ is a dummy matrix that contains the information about class membership for each observation. OPLS separates the variation described by the model into two different parts: predictive and orthogonal. The predictive part is the variation in $\mathbf{X}$ that is used to model the variation in $\mathbf{Y}$. The orthogonal part contains variation in $\mathbf{X}$ that is unrelated to the response $\mathbf{Y}$. In the OPLS-DA context, the predictive part contains between-class variation while the orthogonal part contains within-class variation. By dividing the variation into two parts, the interpretation of the model becomes easier [49].
The OPLS-DA method can establish correlation models between different indices and samples, and subsequently screen indices that reflect sample differences, as represented by the variable importance in projection (VIP) values [50]. In addition, to evaluate the accuracy and reliability of the OPLS-DA model, permutation testing is commonly employed.
To ensure the robustness of the OPLS-DA model and to mitigate the potential risk of overfitting given the constraints of the sample size, a rigorous internal validation protocol was implemented. The predictive performance of the model was first assessed using a 7-fold cross-validation procedure. During this process, the dataset was partitioned into seven subsets, where each subset was systematically excluded and predicted by the remaining six subsets. This iterative process yielded the cumulative parameters $R^2Y(\mathrm{cum})$, which represents the fraction of the variation of $\mathbf{Y}$ explained by the model, and $Q^2(\mathrm{cum})$, which represents the fraction of the variation of $\mathbf{Y}$ that can be predicted by the model according to the cross-validation [51,52].
Furthermore, a permutation test with 200 iterations was conducted to evaluate the statistical significance of the classification. In each iteration, the class labels in the $\mathbf{Y}$ matrix were randomly shuffled while the chemical composition data in the $\mathbf{X}$ matrix remained constant, followed by the recalculation of the corresponding OPLS-DA model. The resulting $R^2Y$ and $Q^2$ values from the permuted models were compared against those of the original model. The model is considered statistically valid if the permuted $Q^2$ values are consistently lower than the original value and the intercept of the $Q^2$ regression line is below zero [52,53].
2.7. Mutual Information Network
Mutual information (MI) inherently addresses the challenge of fairly measuring statistical associations between paired variables. It serves as a fundamental measure of statistical dependence between two random variables, where higher MI values indicate stronger dependence. Previous studies have characterized MI as an objective function that promotes model fairness by maximizing the entropy of cluster proportions, while simultaneously enhancing firmness by minimizing conditional entropy [54]. Together, these properties demonstrate that mutual information provides a robust and intrinsically meaningful framework for interpreting the increasingly large and complex datasets encountered across a wide range of scientific and industrial applications [54,55,56].
Formally, MI quantifies the reduction in uncertainty of one random variable given knowledge of another. For two random variables $X$ and $Y$, mutual information is defined as:

$$I(X; Y) = \sum_{x \in X} \sum_{y \in Y} p(x, y) \log \frac{p(x, y)}{p(x)\, p(y)},$$

where $p(x, y)$, $p(x)$, and $p(y)$ are the joint and marginal probability distributions, respectively.
MI was estimated using the k-nearest neighbors (k-NN) estimator [57] implemented in scikit-learn's mutual_info_regression, with k set to 3. This non-parametric estimator avoids binning artifacts and is suitable for continuous variables with small sample sizes.
To enable fair per-class comparison, each class-specific MI matrix was independently min-max normalized to [0, 1] based on its own upper-triangular values (excluding diagonal). The differential score was computed to highlight class-specific differences.
Statistical robustness was assessed via bias-corrected and accelerated (BCa) bootstrap confidence intervals [58] (1000 resamples; a 90% confidence level was used to account for the low statistical power at n = 18). Pairs were considered significant if the 90% BCa CI strictly excluded zero.
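The k-NN MI estimation step can be reproduced with scikit-learn's `mutual_info_regression`; the oxide variables below are synthetic stand-ins used only to show that the estimator separates a dependent pair from an independent one:

```python
import numpy as np
from sklearn.feature_selection import mutual_info_regression

rng = np.random.default_rng(0)
n = 200
sio2 = rng.normal(size=n)                    # hypothetical oxide content
k2o = 0.8 * sio2 + 0.2 * rng.normal(size=n)  # strongly dependent variable
noise = rng.normal(size=n)                   # independent variable

# k-NN MI estimator with k = 3 neighbors, as described above
mi = mutual_info_regression(np.column_stack([k2o, noise]), sio2,
                            n_neighbors=3, random_state=0)
print(mi[0] > mi[1])  # dependence detected: MI(k2o) >> MI(noise)
```

In the study, pairwise MI values like these populate the class-specific matrices that are then min-max normalized and compared between classes.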
2.8. Evaluation Metrics
To quantitatively evaluate the performance of the proposed method, we adopt four widely used evaluation metrics: Accuracy, Precision, Recall, and F1-score. These metrics are computed based on the confusion matrix, which consists of true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN) [59,60].
Accuracy measures the overall correctness of the model by calculating the proportion of correctly classified samples among all samples. It is defined as:

$$\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}.$$

Precision reflects the reliability of positive predictions, indicating the proportion of correctly predicted positive samples among all samples predicted as positive:

$$\text{Precision} = \frac{TP}{TP + FP}.$$

Recall, also known as sensitivity, measures the ability of the model to correctly identify positive samples. It is defined as the ratio of true positive samples to all actual positive samples:

$$\text{Recall} = \frac{TP}{TP + FN}.$$

F1-score is the harmonic mean of Precision and Recall, providing a balanced evaluation when there is a trade-off between the two metrics:

$$\text{F1} = \frac{2 \times \text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}}.$$
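The four metrics follow directly from the confusion-matrix counts; a small self-contained sketch with a hypothetical confusion matrix:

```python
def classification_metrics(tp, tn, fp, fn):
    """Accuracy, Precision, Recall, and F1-score from confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)  # harmonic mean
    return accuracy, precision, recall, f1

# Hypothetical counts: TP=8, TN=9, FP=2, FN=1 (20 samples total)
acc, prec, rec, f1 = classification_metrics(tp=8, tn=9, fp=2, fn=1)
print(round(acc, 3), round(prec, 3), round(rec, 3), round(f1, 3))
# 0.85 0.8 0.889 0.842
```

Note that the harmonic-mean form of F1 penalizes a large gap between Precision and Recall more strongly than an arithmetic mean would.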
3. Results and Discussion
3.1. Correlation Analysis
Chi-square tests were performed to evaluate the associations between weathering conditions and three categorical variables: glass type, color, and surface pattern. This analysis enabled us to determine whether observable deterioration features showed statistically significant relationships with specific compositional categories or visual attributes, thereby providing an initial quantitative assessment of variation in weathering behavior among different classes of glass.
The results (Table 1) indicate a statistically significant association between glass type and weathering, demonstrating that the occurrence of weathering differs significantly between glass types. To assess the practical significance of this relationship, Cramér's V was calculated as 0.344, indicating a moderate association between the variables. Specifically, lead-barium glass exhibits a markedly higher tendency to undergo weathering compared with high-potassium glass, suggesting that compositional differences may influence the susceptibility of glass to deterioration.
In contrast, glass color (Table 2) was not significantly associated with weathering based on the Yates-corrected chi-squared test. However, this non-significant result should be interpreted with caution, as the statistical power of the test may be limited. Therefore, while no strong association between glass color and weathering is detected in this dataset, the possibility of a true relationship cannot be excluded.
The association between surface pattern and weathering (Table 3) was marginal but not statistically significant, suggesting a possible but inconclusive relationship between decorative style and weathering behavior. Given the low expected frequencies in some categories and the limited statistical power of the analysis, this result should be interpreted as suggestive rather than definitive evidence against an association. Furthermore, the analysis is based on a relatively small dataset, which may not fully represent the broader population of ancient glass artifacts. Consequently, the observed pattern should be interpreted with caution.
Within the scope of the present dataset, the findings suggest that chemical composition appears to play a dominant role in the weathering processes of ancient glass, whereas visual attributes such as color and decorative patterns exhibit comparatively weaker or indirect associations. Further investigations based on larger and more comprehensive datasets are required to validate and generalize these observations.
3.2. Classification of Glass Types
Based on the chi-squared test results, which indicated that chemical composition plays a dominant role in the weathering behavior of ancient glass, the Prototypical Network was further evaluated for classifying lead-barium glass and high-potassium glass using 14 chemical components as input features. During training, the model exhibited rapid and stable convergence (Figure 3). The initial classification accuracy reached 81.25% with a loss of 0.5502 in the first epoch, and accuracy improved steadily to 100% by the 13th epoch, accompanied by a continuous decrease in training loss to 0.0475. These results demonstrate that chemically informed feature representations enable effective discrimination between the two glass types.
On the test set, the model achieved 100% classification accuracy (Figure 4), demonstrating that it effectively captures the characteristic chemical feature distributions of each glass type. These results indicate that the Prototypical Network is highly effective for few-shot classification of materials, achieving both fast convergence and robust generalization on limited multicomponent chemical data.
To mitigate the potential influence of data partitioning on model performance and stability, different random seeds were used to generate multiple train–test splits of the dataset. This strategy reduces the potential bias associated with a single random partition and provides a more robust evaluation of the model. For each randomized split, the dataset was first divided into training and test subsets. Feature scaling was then performed using a StandardScaler (scikit-learn 1.3.2) fitted exclusively on the training data, and the resulting parameters were subsequently applied to transform both the training and the corresponding test set, thereby preventing any potential data leakage. Based on this procedure, a total of 10,000 independent training and evaluation runs were conducted. The final model performance was summarized by reporting the average values of four evaluation metrics (accuracy, precision, recall, and F1-score), as presented in Table 4.
The experimental results demonstrate that the proposed Prototypical Network consistently achieves strong classification performance. Specifically, the model attains an average accuracy of 0.9674, an average precision of 0.9837, an average recall of 0.9720, and an average F1-score of 0.9766, indicating stable and reliable behavior across diverse data partitions.
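The leakage-free evaluation protocol described above can be sketched as follows. This is a minimal illustration with synthetic stand-in data (the study uses the 14-oxide composition table), scikit-learn's NearestCentroid substituting for the learned-embedding Prototypical Network, and 100 rather than 10,000 runs; the key point is that the StandardScaler is fitted on the training split only.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.neighbors import NearestCentroid
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

# Synthetic stand-in for the 14-component composition data (two separable classes)
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, (30, 14)), rng.normal(3, 1, (30, 14))])
y = np.array([0] * 30 + [1] * 30)

scores = []
for seed in range(100):                                  # the study performs 10,000 runs
    X_tr, X_te, y_tr, y_te = train_test_split(
        X, y, test_size=0.25, random_state=seed, stratify=y)
    scaler = StandardScaler().fit(X_tr)                  # fitted on training data only
    X_tr_s, X_te_s = scaler.transform(X_tr), scaler.transform(X_te)  # no leakage
    y_hat = NearestCentroid().fit(X_tr_s, y_tr).predict(X_te_s)
    scores.append([accuracy_score(y_te, y_hat), precision_score(y_te, y_hat),
                   recall_score(y_te, y_hat), f1_score(y_te, y_hat)])
acc, prec, rec, f1 = np.mean(scores, axis=0)
print(f"accuracy={acc:.4f} precision={prec:.4f} recall={rec:.4f} f1={f1:.4f}")
```

Averaging the four metrics over seeds, as done here, is what yields the Table 4 summary statistics.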
To ensure a fair comparison, the widely adopted CART (Classification and Regression Tree) decision tree [61,62,63] was evaluated using the same experimental protocol, with identical data preprocessing procedures, random resampling strategies, and evaluation metrics. The average performance of the Prototypical Network was found to be highly comparable to that of CART. Considering that the validation set contains only 14 samples, the observed differences in average performance between the two models are small enough to be regarded as negligible.
Importantly, beyond average performance, we further examine the models under worst-case data partition scenarios. Under these unfavorable conditions, the Prototypical Network demonstrates a consistent performance advantage of approximately 10% over the decision tree model across the evaluated metrics. This result indicates a markedly improved level of robustness and stability with respect to adverse sample distributions, which is particularly desirable in small-sample and data-sensitive classification tasks.
To further evaluate the robustness of the Prototypical Network, a sensitivity analysis was conducted using the pre-trained model, which achieved 100% accuracy on the original test set. Random perturbations of varying magnitudes (±0, 0.01, 0.02, 0.05, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, and 1) were independently applied to each chemical component in the test data, and the model was evaluated over 100 repeated runs at each noise level. The analysis revealed that only perturbations in SiO2 resulted in observable changes in classification outcomes, whereas variations in the other chemical components had negligible effects. The corresponding frequencies of classification changes across the different perturbation levels are reported in Table 5.
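A generic version of this sensitivity procedure can be sketched as below. The noise model (additive uniform noise applied to one feature at a time) and the toy threshold classifier are illustrative assumptions; the study applies the perturbations to the trained Prototypical Network.

```python
import numpy as np

def perturbation_sensitivity(model_predict, X_test, noise_levels, n_runs=100, rng=None):
    """Count, per (feature, noise level), how many of n_runs perturbed
    evaluations change at least one predicted label."""
    if rng is None:
        rng = np.random.default_rng(0)
    base = model_predict(X_test)                    # reference predictions
    changes = {}
    for j in range(X_test.shape[1]):
        for eps in noise_levels:
            flips = 0
            for _ in range(n_runs):
                Xp = X_test.copy()
                # Assumed noise model: additive uniform noise on feature j only
                Xp[:, j] += rng.uniform(-eps, eps, size=len(Xp))
                if np.any(model_predict(Xp) != base):
                    flips += 1
            changes[(j, eps)] = flips
    return changes

# Toy threshold classifier standing in for the trained Prototypical Network
predict = lambda X: (X[:, 0] > 0.5).astype(int)
X_test = np.array([[0.45], [0.60]])
changes = perturbation_sensitivity(predict, X_test, noise_levels=[0.01, 0.2])
print(changes[(0, 0.01)], changes[(0, 0.2)])  # small noise never flips; large noise does
```

The flip counts per feature and noise level correspond to the change frequencies reported in Table 5.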
The near-exclusive sensitivity to SiO2 perturbations highlights its dominant discriminative role in the embedding space, consistent with its position as the primary network former in silicate glasses. Importantly, this does not imply that the model reduces to a univariate SiO2 threshold; the embedding is learned jointly across all oxides, enabling robust multivariate separation even under adverse data partitions (Table 4).
3.3. Subclass Analysis and Validation
Using lead-barium glass as a case study, Gaussian Mixture Models (GMM) were employed to investigate potential subclass structures. As summarized in Table 6, model selection was performed rigorously by evaluating cluster numbers from n = 1 to 6. The Bayesian Information Criterion (BIC) reached its global minimum at n = 3 (958.11), providing strong statistical evidence for a three-component model. This selection was further validated by quantitative internal metrics: the tri-cluster configuration achieved a Silhouette Coefficient of 0.2062 and a Davies–Bouldin Index (DBI) of 1.5901. Although higher-order models showed slight improvements in geometric separation, their markedly elevated BIC values indicated a high risk of over-parameterization. Consequently, the n = 3 model was identified as the most robust and parsimonious representation, effectively balancing goodness-of-fit with model complexity.
Subsequent Principal Component Analysis (PCA) was performed for visualization (Figure 5), revealing clear separation among the three clusters in the reduced feature space. These results demonstrate that GMM, combined with dimensionality reduction, can effectively uncover latent subclass structures in lead-barium glass based on chemical composition.
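The model-selection loop described above can be reproduced schematically with scikit-learn. This sketch uses synthetic three-group data in place of the standardized lead-barium compositions, so the metric values differ from Table 6; the workflow (BIC minimization across n = 1–6, internal validation indices, then a 2-D PCA projection) is the same.

```python
import numpy as np
from sklearn.mixture import GaussianMixture
from sklearn.metrics import silhouette_score, davies_bouldin_score
from sklearn.decomposition import PCA

# Synthetic stand-in for the standardized lead-barium compositions (3 latent groups)
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(m, 0.3, (40, 5)) for m in (0.0, 2.0, 4.0)])

results = {}
for n in range(1, 7):                                # evaluate n = 1 .. 6 components
    gmm = GaussianMixture(n_components=n, random_state=0).fit(X)
    row = {"bic": gmm.bic(X)}
    if n > 1:                                        # internal indices need >= 2 clusters
        labels = gmm.predict(X)
        row["silhouette"] = silhouette_score(X, labels)
        row["dbi"] = davies_bouldin_score(X, labels)
    results[n] = row

best_n = min(results, key=lambda n: results[n]["bic"])   # global BIC minimum
print("BIC-optimal number of components:", best_n)

# 2-D PCA projection of the selected clustering, as used for Figure-5-style plots
labels = GaussianMixture(n_components=best_n, random_state=0).fit_predict(X)
X2 = PCA(n_components=2).fit_transform(X)
```

Choosing the BIC minimum rather than the best silhouette value is what guards against the over-parameterization noted for the higher-order models.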
The OPLS-DA model achieved clear separation among the three subclasses of lead-barium glass. The score plot (Figure 6a) revealed distinct clustering patterns, indicating that the chemical compositions of the subclasses are systematically different [64,65]. The model showed strong explanatory and predictive performance, with high R²X, R²Y, and Q² values, suggesting good model fitness and reliability. The permutation test (Figure 6b) confirmed the robustness of the model [65]: all permuted R² and Q² values were lower than the original values, and the Q² regression intercept was −0.333, demonstrating the absence of overfitting. The VIP plot (Figure 6c) identified the main variables contributing to class discrimination. Components such as BaO, SiO2, and PbO exhibited VIP > 1, indicating their significant influence on the differentiation of subclasses. These compositional differences likely reflect variations in raw material sources or production techniques within the lead-barium glass group.
Based on the classification results and compositional characteristics, the lead-barium glass samples can be divided into three subclasses:
Subclass 1, characterized by high PbO content;
Subclass 2, with high SiO2 and low PbO;
Subclass 3, with high BaO and low SiO2.
To quantify the tolerance of the lead-barium glass subclassification to numerical variation in individual compositional variables, a feature-wise perturbation analysis was conducted by introducing bidirectional multiplicative fluctuations in the original data space. For a given compositional feature with original value x, the perturbed value x′ was generated according to
x′ = x · (1 + u), u ∼ U(−δ, δ),
where δ denotes the noise amplitude. The noise level δ was progressively increased from small values (0.01 and 0.02), through intermediate values (0.05, 0.1, and 0.2), to larger amplitudes spanning 0.3–0.9 and ultimately reaching 1.0, thereby covering a wide range of symmetric positive and negative numerical fluctuations.
For each feature and each noise level δ, 100 independent perturbation tests were performed. After perturbation, the data were standardized using the scaler derived from the original dataset and reclustered using a fixed three-component GMM. Subclass assignments were compared with the original clustering after label alignment. The acceptable noise range was conservatively defined as the maximum value of δ for which the subclassification remained completely unchanged across all repeated tests; once at least one sample exhibited a different subclass label, the model was regarded as unstable for that feature.
As a result, the feature-specific acceptable noise ranges derived under this criterion are summarized in Table 7. The tolerable fluctuation amplitudes vary substantially among chemical components, reflecting pronounced differences in their influence on the stability of lead-barium glass subclassification. SO2 exhibits the most restricted tolerance (±0.05), indicating that even minimal bidirectional numerical fluctuations in this component may change subclass assignments. Several components, including BaO, Na2O, PbO, SiO2, and SnO2, also display low acceptable noise ranges (±0.1), suggesting that these variables exert strong control on subclass boundaries and that relatively small positive or negative deviations can affect the clustering outcome.
In comparison, Al2O3 and CaO show moderate tolerance (±0.2), while P2O5 exhibits a slightly higher threshold (±0.3), indicating a more balanced contribution to the clustering structure. Components such as CuO, Fe2O3, K2O, MgO, and SrO maintain stable subclass assignments under substantially larger perturbations (±0.4–0.5), implying a comparatively weaker influence on subclass differentiation within the explored fluctuation range.
From an analytical perspective, components characterized by low acceptable noise thresholds—particularly SO2, PbO, Na2O, BaO, SiO2, and SnO2—warrant special attention in measurement strategies, as their numerical variability can substantially influence compositional interpretation. Improving the analytical reliability of these components, for example through optimized calibration procedures, enhanced signal stability, or refined quantification approaches, would contribute to a more accurate characterization of lead-barium glass and thereby strengthen the overall reliability of related compositional analyses and subclassification studies.
3.4. Chemical Correlation Analysis
A more detailed examination of the internal compositional structure of lead-barium glass revealed several pronounced and systematically recurring elemental correlations that shed light on the material’s chemical behavior during both formation and weathering. As shown in Figure 7, SiO2 exhibited strong negative correlations with PbO, P2O5, and SrO, suggesting that an increase in silica content is typically accompanied by a decrease in these oxides. In contrast, CuO showed a strong positive correlation with BaO, and BaO was also positively correlated with SO2, indicating possible co-variation in their raw material sources or melting behavior. Additionally, CaO and P2O5 displayed a strong positive correlation, implying a possible association in the glass network structure.
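A correlation matrix of this kind can be computed in a few lines; assuming Pearson coefficients (the exact method behind Figure 7 is not stated here), the sketch below uses synthetic compositions constructed only to reproduce the reported signs, not the actual values.

```python
import numpy as np

# Illustrative synthetic compositions reproducing the reported correlation signs
rng = np.random.default_rng(4)
sio2 = rng.uniform(30, 60, 50)
pbo = 70 - sio2 + rng.normal(0, 2, 50)         # SiO2 vs PbO: strongly negative
cao = rng.uniform(1, 5, 50)
p2o5 = 0.5 * cao + rng.normal(0, 0.2, 50)      # CaO vs P2O5: strongly positive

# Pairwise Pearson correlation matrix over the four oxides
names = ["SiO2", "PbO", "CaO", "P2O5"]
corr = np.corrcoef(np.vstack([sio2, pbo, cao, p2o5]))
print({"SiO2-PbO": round(corr[0, 1], 2), "CaO-P2O5": round(corr[2, 3], 2)})
```

The full oxide-by-oxide matrix, rendered as a heatmap, yields a Figure-7-style visualization.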
To further investigate the inter-element relationships in the two types of ancient glass, a mutual information (MI)-based differential network analysis was performed. Separate MI matrices were first computed for the two glass categories, and their normalized difference matrix was used to construct the network. Edges were retained only when the bootstrap confidence interval of the differential score excluded zero, indicating statistically significant differences in association strength between the two glass types.
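The differential-network construction can be sketched with scikit-learn's kNN-based mutual information estimator. The max-normalization of each MI matrix and the per-class bootstrap resampling shown here are plausible readings of the procedure rather than the authors' exact implementation; the toy data couple two features in one class only.

```python
import numpy as np
from sklearn.feature_selection import mutual_info_regression

def mi_matrix(X, seed=0):
    """Symmetric pairwise mutual-information matrix (kNN estimator)."""
    p = X.shape[1]
    M = np.zeros((p, p))
    for j in range(p):
        M[:, j] = mutual_info_regression(X, X[:, j], random_state=seed)
    return (M + M.T) / 2

def differential_edges(X0, X1, n_boot=20, alpha=0.05, seed=0):
    """Retain edges whose bootstrap CI of the normalized MI difference excludes zero."""
    rng = np.random.default_rng(seed)
    diffs = []
    for _ in range(n_boot):
        b0 = X0[rng.integers(0, len(X0), len(X0))]   # resample each class separately
        b1 = X1[rng.integers(0, len(X1), len(X1))]
        M0, M1 = mi_matrix(b0), mi_matrix(b1)
        # Normalize each matrix by its maximum before differencing (assumed choice)
        diffs.append(M1 / M1.max() - M0 / M0.max())
    lo, hi = np.percentile(diffs, [100 * alpha / 2, 100 * (1 - alpha / 2)], axis=0)
    return (lo > 0) | (hi < 0)                       # True where the CI excludes zero

# Toy data: features 0 and 1 are tightly coupled in class 1 only
rng = np.random.default_rng(5)
X0 = rng.normal(0, 1, (80, 3))                       # independent features
X1 = rng.normal(0, 1, (80, 3))
X1[:, 1] = X1[:, 0] + rng.normal(0, 0.1, 80)         # strong 0-1 dependence in class 1
edges = differential_edges(X0, X1)
print(edges[0, 1])  # the 0-1 edge differs significantly between classes
```

Edges surviving this confidence-interval filter are the ones drawn in the differential network.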
In the lead-barium glass (Class 1), the network is dominated by PbO and SiO2, which show the strongest positive differential association. Significant associations also emerged between SiO2 and SrO, as well as between Fe2O3 and PbO. These high positive differential scores indicate that the lead-silicate framework in Class 1 is highly cohesive, with PbO acting as the primary flux that strongly co-varies with the glass-forming SiO2 and the stabilizing SrO [66].
In contrast, the high-potassium glass (Class 0) exhibits a more complex, multi-component dependency network. The glass former SiO2 emerged as a central hub, showing significantly stronger associations (negative differential scores) with Al2O3, CaO, and K2O. Additionally, strong dependencies were observed between Fe2O3 and P2O5, and between K2O and CaO. This pattern suggests that in the potassium-based system the chemical structure depends more on an integrated aluminosilicate and calcium-potassium-silicate framework, potentially reflecting the use of plant ash or specific mineral fluxes in which multiple oxides are introduced simultaneously [67,68].
Overall, the MI network analysis (Figure 8) highlights a fundamental transition in chemical integration: the lead-barium glass is characterized by a focused lead-silicate interaction, whereas the high-potassium glass relies on a more distributed and interdependent alkali-calcium-aluminosilicate network.
While the present framework successfully discriminates weathering patterns based on oxide composition alone, glass deterioration is inherently an interaction between material intrinsic properties and extrinsic environmental conditions. Prior studies have shown that environmental factors strongly influence the kinetics and extent of alteration, whereas composition primarily governs the qualitative nature of weathering products and relative durability. The absence of site-specific environmental metadata in our dataset precludes modeling these interactions directly. Future extensions incorporating environmental context variables would enable more holistic predictive modeling of degradation risks.