
Structural Fingerprinting of Crystalline Materials from XRD Patterns Using Atomic Cluster Expansion Neural Network and Atomic Cluster Expansion

Xiao Zhang, Xitao Wang and Shunbo Hu

1 Institute for the Conservation of Cultural Heritage, School of Cultural Heritage and Information Management, Shanghai University, Shanghai 200444, China
2 Key Laboratory of Silicate Cultural Relics Conservation, Ministry of Education, Shanghai University, Shanghai 200444, China
3 Materials Genome Institute, Shanghai University, Shanghai 200444, China
4 Institute for Quantum Science and Technology, Shanghai University, Shanghai 200444, China
* Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(11), 5851; https://doi.org/10.3390/app15115851
Submission received: 18 April 2025 / Revised: 14 May 2025 / Accepted: 16 May 2025 / Published: 23 May 2025

Abstract

This study introduces a novel contrastive learning-based X-ray diffraction (XRD) analysis framework, the SE(3)-equivariant graph neural network (E3NN)-based Atomic Cluster Expansion Neural Network (EACNN), which reduces the strong dependency of traditional methods on databases and initial models. By integrating E3NN with atomic cluster expansion (ACE) techniques, a dual-tower contrastive learning model is developed that maps crystal structures and XRD patterns into a continuous joint embedding space. The EACNN model retains hierarchical features of crystal systems through symmetry-sensitive encoding mechanisms and replaces rigid classification boundaries with relationship mining via contrastive learning. This approach reveals gradual symmetry-breaking patterns between the monoclinic and orthorhombic crystal systems in the latent space, effectively addressing the recognition challenges associated with low-symmetry systems and sparsely represented space groups. We further explore the potential for transferring the model to experimental data and for multimodal extensions, laying the theoretical foundation for a universal structure–property mapping.

1. Introduction

Since the first observation of crystal X-ray diffraction by Max von Laue in 1912 [1], diffraction patterns have been a core analytical tool in materials science, serving as fingerprints of material structure. The Bragg equation $n\lambda = 2d\sin\theta$, proposed by W.H. and W.L. Bragg [2], laid the foundation for diffraction theory, and the invention of the Debye–Scherrer powder diffraction technique [3] made rapid characterization of polycrystalline and amorphous materials possible. With innovations in synchrotron radiation sources, two-dimensional pixel detectors, and computational methods [4], modern diffraction techniques have evolved from simple lattice constant measurements into multidimensional probes for analyzing complex structures such as defects, nanodomains, and superlattices. Throughout this development, pattern analysis and refinement have been the key bridge connecting experimental data to real structures. Traditional Rietveld refinement [5] iteratively optimizes structural parameters by minimizing the residual between experimental and theoretical model patterns, but its effectiveness depends heavily on the accuracy of the initial model, and the computational cost for complex systems (such as mixed phases and low-symmetry materials) grows exponentially. In recent years, machine learning techniques [6,7] have brought a paradigm shift to diffraction analysis: by automatically extracting features such as peak positions, peak widths, and intensity distributions, combined with high-throughput computing, they have significantly improved the efficiency of structure analysis. The bridging role of X-ray diffraction (XRD) in this process has become increasingly prominent: experimental XRD patterns are the physical response of real material structures, while simulated XRD patterns of atomic structures provide a gold standard for theoretical verification [8]. Accurate comparison between the two not only verifies the correctness of structural models but also reveals how experimental conditions affect the data.
Traditional XRD phase identification matches a sample by comparing its diffraction peak positions, relative intensities, and interplanar spacings against the three corresponding core parameters stored in PDF cards [9]. In the unconditional search mode of the well-known Jade software (version 4.6.0), even without a specified chemical composition, matching still relies on the “three strong peaks” characteristic data stored in the PDF card database for similarity ranking. Consequently, when a sample contains unknown phases or structures not included in the database, the system cannot generate effective matching results. With the rapid development of new functional materials (such as MOFs and two-dimensional heterojunctions), the growth of existing databases still fails to cover all possible crystal variants [10]. When the figure of merit (FOM) between the diffraction peak positions of an unknown structure and existing cards falls below the matching threshold, traditional methods can only classify it as an “unknown phase” and cannot conduct in-depth analysis. Even advanced methods like Rietveld refinement face convergence difficulties, because refinement essentially adjusts variables such as the unit cell parameters and atomic coordinates of an initial structure and therefore depends heavily on a known crystal structure as the starting point [5]. However, the main difference between search/match analysis and Rietveld analysis is that Rietveld analysis is based on whole-profile fitting, while search/match analysis relies on integrated peak intensities. This discrepancy can be significant for nanocrystalline or anisotropic materials, such as smectites, and both approaches can fail in the common case of preferred orientation or texture. This situation highlights the dual paradox of modern XRD analysis: on one hand, phase identification based on PDF card databases remains an important standard for routine laboratory analysis [11]; on the other hand, the inability to identify substances outside the database severely constrains the efficiency of new material development.
The application of machine learning to XRD analysis has alleviated two critical constraints inherent to conventional approaches: the heavy reliance on well-curated PDF card databases and the prerequisite knowledge of initial structural models. Rather than relying on manually preset features such as peak positions and intensities (like the “three strongest peaks” rule in PDF cards), machine learning methods automatically learn higher-order features by training convolutional operators or attention mechanisms on pattern information. In recent scientific research, various machine learning algorithms have demonstrated powerful data-driven processing capabilities [12]. Davel et al. [13] proposed a machine learning framework for materials discovery and characterization using X-ray scattering data. Prasianakis et al. [14] proposed an AI-enhanced framework for real-time mineral phase identification and quantification using X-ray diffraction analysis. Surdu et al. [15] reviewed in detail a series of advancements in the application of machine learning to XRD data analysis. Specifically, support vector machines (SVMs), convolutional neural networks (CNNs), random forests (RFs), k-nearest neighbors (KNN), decision trees (DTs), gradient boosting, and naive Bayes (NB) have shown advantages in scenarios such as feature extraction from XRD images (e.g., synchrotron radiation data stream analysis [16]), anomaly detection [17], crystal system classification (recognition of the seven crystal systems [18,19]), identification of perovskite and non-perovskite materials [20], and classification of geothermal rock minerals [21]. In the field of atomic structure modeling of crystals, machine learning methods are being widely applied to crystal structure analysis, including cubic lattice structure classification, space group and crystal symmetry prediction, and phase transition analysis. Techniques such as dynamic time warping (DTW), autoencoders, extremely randomized trees (ERTs), and deep neural networks (DNNs) have shown unique advantages in these tasks. For example, Vecsei et al. [22] proposed a neural network-based method for classifying crystal symmetry from X-ray diffraction patterns that can effectively identify cubic lattice structures. Similarly, Suzuki et al. [23] successfully predicted crystal space groups and symmetries from X-ray diffraction data using an interpretable machine learning method. These studies indicate that machine learning methods have significant potential for handling complex crystallographic data. Moreover, Venderley et al. [24] utilized unsupervised machine learning to address the big-data processing issues in modern X-ray diffraction, providing new tools for phase transition analysis. Utimula et al. [25] successfully distinguished the compositions of ThMn12-type alloys through machine learning clustering techniques. For feature extraction, autoencoders have proven to be an effective tool, capable of constructing feature spaces from XRD patterns [26].
In recent years, the paradigm of self-supervised learning, particularly contrastive learning, has developed rapidly in materials science, with its multimodal mutual learning and strong zero-shot capabilities significantly enhancing the perception of physical information [27]. Combining contrastive learning with XRD structural analysis makes it possible to move beyond “hard classification” toward exploring the correlation between structural and spectral information. In this task setting, common direct classifier models map simulated patterns to discrete crystal system or space group labels, which is essentially a lossy compression: while easy to train, it discards the complex physical correlations between patterns and structures, such as atomic arrangement symmetry and the response patterns of the Bragg condition [28,29]. Contrastive learning, by contrast, builds a joint embedding space for structures and patterns, directly modeling the correspondence between the two and thereby preserving multidimensional physical features. In fact, patterns and structures inherently have a one-to-many ambiguity (for example, the XRD patterns of four different structural models can be highly similar, as reported in [30]). The same pattern may come from different samples, and since the pattern itself is a statistical representation after dimensionality reduction and averaging, resolving this ambiguity from the pattern alone is very challenging [30]. However, contrastive training is bidirectional: the model must learn the deterministic generation of the pattern corresponding to a given structure, and it must also learn the probabilistic inverse mapping constraint from pattern to structure. This physically consistent learning on crystal–diffraction-pattern pairs is more intrinsic than merely classifying patterns. Contrastive learning requires only weak supervision signals such as “pattern A′ of structure A should be similar to other augmented versions of structure A (such as rotations and noise perturbations) but different from the pattern of structure B” [29,31]. This characteristic allows unlabeled experimental data to be used to enhance training. Therefore, when facing the challenge of a long-tail distribution (space groups represented by only a few samples), direct classification of patterns fails for rare space groups due to the lack of samples, whereas contrastive learning, by emphasizing relationships between samples rather than absolute categories, can use a small number of samples to establish boundaries of difference from common categories. This paradigm shift redefines the mathematical formulation of the structure–spectrum mapping problem from discrimination in a closed category space to open relationship manifold learning, yielding a continuous diffraction-pattern–crystal fingerprint. In the continuous embedding space, crystal symmetry breaking appears as a gradual change in the curvature of the manifold rather than a discrete category jump [29,31].
In our work, inspired by the Contrastive Language–Image Pre-training (CLIP) model [32], we integrated an SE(3)-equivariant [33] network architecture with atomic cluster expansion (ACE) [34,35] technology to construct a dual-tower contrastive learning model: the E3NN-based Atomic Cluster Expansion Neural Network (EACNN). As illustrated in Figure 1, crystal structures are processed through a structure encoding tower based on graph neural networks, which interacts in the hidden layer with pattern information processed through a 1D ResNet. In our tests, the Top-1, Top-3, and Top-5 accuracies for retrieving structures from XRD patterns were 95.96%, 99.95%, and 99.98%, respectively. Moreover, even for space groups with very few instances, our model maintained robust performance. The remainder of this paper is organized as follows: Section 2 introduces the EACNN framework, detailing its SE(3)-equivariant graph neural network architecture combined with atomic cluster expansion for encoding crystal structures and XRD patterns [33]. Section 3 demonstrates the model’s effectiveness in structure retrieval, particularly in handling symmetry variations and rare space groups. Section 4 concludes by highlighting the method’s potential for XRD analysis while addressing current limitations and future directions, including experimental data adaptation and multimodal extension.

2. Materials and Methods

The main framework of this work is a dual-encoder contrastive learning model. As illustrated in Figure 1, crystal structures are processed through a structure encoding tower based on SE(3)-equivariant graph neural networks, which are widely used in current crystal machine learning frameworks [36,37,38,39], while pattern information is encoded with a simple one-dimensional convolutional network. The dual-tower structure maps the data from the two modalities into the same space for contrastive learning modality fusion. The crystal structure illustrations in this work were produced with VESTA [40,41].

2.1. Calculation of the Simulated Spectrum

For any input crystal, given the cell parameters ($a$, $b$, $c$, $\alpha$, $\beta$, $\gamma$), the space group number, and the atomic coordinates and occupancies, the equivalent atomic positions are automatically generated through Wyckoff position analysis to ensure compliance with the space group symmetry. All possible lattice plane indices $(hkl)$ are traversed, and the corresponding diffraction angle $\theta$ is calculated from Bragg’s equation $n\lambda = 2 d_{hkl} \sin\theta$, where the interplanar spacing $d_{hkl}$ follows directly from the cell parameters and plane indices. Based on the atomic scattering factor $f_j$ and the Debye–Waller factor $B_j$, the structure factor for each $(hkl)$ plane is calculated:
$$F_{hkl} = \sum_{j} f_j \, e^{-B_j (\sin\theta/\lambda)^2} \, e^{2\pi i (h x_j + k y_j + l z_j)} \qquad (1)$$
Efficiency can be improved by computing only the symmetry-independent reflections, as described in Fredericks et al. [42]. The squared modulus of the structure factor, $|F_{hkl}|^2$, is multiplied by the Lorentz–polarization factor, and an instrumental broadening function (such as a mixed Gaussian/Lorentzian peak shape) is overlaid to obtain the continuous diffraction pattern.
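As a concrete illustration of this pipeline, the sketch below simulates a powder pattern with pymatgen as a stand-in for the authors’ own implementation: Wyckoff expansion from the space group, Bragg peak positions with $|F_{hkl}|^2$ intensities and the Lorentz–polarization correction, then Gaussian broadening onto a fixed grid. The 140-point grid matches the encoder input described in Section 2.4; the NaCl example and the broadening width are illustrative assumptions.

```python
# A minimal sketch of the simulated-pattern pipeline, using pymatgen rather
# than the authors' exact code; grid length and peak width are assumptions.
import numpy as np
from pymatgen.core import Lattice, Structure
from pymatgen.analysis.diffraction.xrd import XRDCalculator

# Build a structure from cell parameters + space group + asymmetric unit;
# expansion to symmetry-equivalent positions happens inside from_spacegroup.
nacl = Structure.from_spacegroup(
    "Fm-3m", Lattice.cubic(5.64), ["Na", "Cl"],
    [[0.0, 0.0, 0.0], [0.5, 0.5, 0.5]],
)

# Discrete Bragg reflections: positions from Bragg's law, intensities from
# |F_hkl|^2 times the Lorentz-polarization factor (Cu K-alpha radiation).
calc = XRDCalculator(wavelength="CuKa")
pattern = calc.get_pattern(nacl, two_theta_range=(10, 80))

# Overlay Gaussian instrumental broadening to obtain a continuous profile
# sampled on a fixed grid (here 140 points, matching the encoder input).
grid = np.linspace(10, 80, 140)
sigma = 0.3  # degrees 2-theta; assumed peak width
profile = np.zeros_like(grid)
for two_theta, intensity in zip(pattern.x, pattern.y):
    profile += intensity * np.exp(-0.5 * ((grid - two_theta) / sigma) ** 2)
profile /= profile.max()  # normalize to unit maximum
```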

2.2. Data Source

All crystal structures are drawn from the Materials Project (MP) database, comprising 154,714 samples distributed across 228 space groups (the two missing space groups have no structures in the Materials Project database used). The frequency distribution of the space group numbers of the samples is shown in Figure 2.
In traditional classification models, direct classification of space groups may be challenging due to sample imbalance, leading to coupling bias between the symmetry hierarchy and the feature distribution. For example, high-symmetry space groups of the cubic crystal system (such as $Fm\bar{3}m$) are significantly more prevalent in the MP database than low-symmetry space groups (such as $P1$ of the triclinic crystal system). This long-tail distribution can cause the model to classify ambiguous samples into high-symmetry categories. More seriously, the symmetry-breaking paths of different crystal systems are nonuniformly separable in the feature space: decision boundaries of high-symmetry crystal systems often dominate, while the fine-grained features of low-symmetry crystal systems are easily overwhelmed by the mainstream distribution in the high-dimensional manifold. Classifiers based on handcrafted features (such as cell parameters and space group numbers) cannot decouple lattice symmetry from atomic-motif features. For example, this issue becomes significant when the monoclinic angle approaches 90°: the unit cell parameters and certain diffraction peak positions of a $P2_1/c$ structure may then effectively overlap with those of orthorhombic structures such as $Pbam$. This overlap is one of the primary obstacles in crystal structure analysis using powder XRD, because the two space groups possess fundamentally different microsymmetry features, including screw axes and glide planes. More generally, similar geometric ambiguities can arise in situations like the near-equivalence between orthorhombic $Cmcm$ and hexagonal unit cells, where subtle differences in symmetry can lead to significant structural misinterpretation. The hard classification boundaries of traditional models blur the gradual features of such symmetry breaking.

2.3. Crystal Structure Encoder

This work decouples the multiscale features of more than $10^5$ crystal structures in the MP database using a deep neural network, preserving the sensitivity of the cluster expansion to short-range order while further integrating dynamic correlation patterns across lattices.
$$E_i(\sigma) = \sum_{K,n,l} c_{n,l}^{(K)} \, B_{i,n,l}^{(K)} \qquad (2)$$
in which $E_i$ is the energy of atom $i$ and $\sigma$ represents the set of relative position vectors between the central atom $i$ and all its neighboring atoms. Specifically, $\sigma = (\mathbf{r}_{1i}, \mathbf{r}_{2i}, \ldots, \mathbf{r}_{Ni})$, where $\mathbf{r}_{ji} = \mathbf{r}_j - \mathbf{r}_i$ denotes the vector from neighboring atom $j$ to the central atom $i$; this set fully describes the local atomic environment of atom $i$. $K$ is the order of the cluster, $n$ and $l$ index the radial and angular basis functions, respectively, $B_{i,n,l}^{(K)}$ are rotationally invariant basis functions constructed from products of reduced spherical harmonics, and $c_{n,l}^{(K)}$ are the expansion coefficients to be determined.
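The following toy sketch evaluates Equation (2) for a single site, using Gaussian radial basis functions and a spherical-harmonic density projection whose power spectrum supplies rotation-invariant basis functions $B$. Basis sizes, cutoffs, and coefficients are made-up assumptions, not the parameterization used by EACNN.

```python
# Toy single-site ACE energy: density projection A_{nlm}, rotation-invariant
# contraction B (power spectrum), then the weighted sum of Eq. (2).
import numpy as np
from scipy.special import sph_harm

rng = np.random.default_rng(0)
neighbors = rng.normal(size=(8, 3))                 # relative vectors r_j - r_i
r = np.linalg.norm(neighbors, axis=1)
theta = np.arctan2(neighbors[:, 1], neighbors[:, 0]) % (2 * np.pi)  # azimuth
phi = np.arccos(neighbors[:, 2] / r)                                # polar angle

n_max, l_max = 3, 2
centers = np.linspace(0.5, 3.0, n_max)

def radial(n, r):
    """Gaussian radial basis R_n(r) (an illustrative choice)."""
    return np.exp(-((r - centers[n]) ** 2))

energy = 0.0
for n in range(n_max):
    for l in range(l_max + 1):
        # Density projection A_{nlm} = sum_j R_n(r_ji) * Y_lm(r_hat_ji)
        A = np.array([
            np.sum(radial(n, r) * sph_harm(m, l, theta, phi))
            for m in range(-l, l + 1)
        ])
        # Rotation-invariant contraction: sum_m |A_{nlm}|^2 (power spectrum)
        B = np.real(np.sum(A * A.conj()))
        c = rng.normal()  # expansion coefficient c_{n,l}; random stand-in
        energy += c * B
print(f"toy ACE site energy: {energy:.4f}")
```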
On the network side, this work follows the theoretical framework of MACE [35], which combines the atomic cluster expansion with the many-body interaction framework of MPNNs (Message Passing Neural Networks [43]), forming an equivariant encoder for crystal structures.
At the primary feature layer, the convolutional module captures geometric features closely related to the cluster expansion parameters, such as atomic distances and coordination numbers; at the advanced semantic layer, the graph neural network dynamically models the long-range correlation effects between clusters through message passing, breaking the limitation of fixed cutoff radii in traditional cluster expansions.
$$m_i^{(t)} = \sum_{j} u_1\big(\sigma_i^{(t)}; \sigma_j^{(t)}\big) + \sum_{j_1,j_2} u_2\big(\sigma_i^{(t)}; \sigma_{j_1}^{(t)}, \sigma_{j_2}^{(t)}\big) + \cdots + \sum_{j_1,\ldots,j_\nu} u_\nu\big(\sigma_i^{(t)}; \sigma_{j_1}^{(t)}, \ldots, \sigma_{j_\nu}^{(t)}\big) \qquad (3)$$
where in Equation (3), $m_i^{(t)}$ denotes the message received by node $i$ in the $t$-th layer of the MPNN; the message aggregates information from neighboring nodes and is used to update the features of node $i$. $\sigma_i^{(t)}$ represents the state of node $i$ at the $t$-th layer, composed of the position $\mathbf{r}_i$, the chemical element $z_i$, and the learnable features $\mathbf{h}_i^{(t)}$; the functions $u_1, \ldots, u_\nu$ are learnable; and $\nu$ is the maximum correlation order of the state.
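A minimal PyTorch sketch of the two-body ($u_1$) term of Equation (3) is shown below. The higher-order terms $u_2, \ldots, u_\nu$, which MACE constructs efficiently via tensor products of the atomic basis, are omitted, and only invariant scalar features are used rather than the full SE(3)-equivariant features of the actual encoder; all dimensions are assumptions.

```python
# Two-body message passing in the spirit of Eq. (3): node i sums
# u_1(state_i; state_j) over its neighbors, then updates its features.
import torch
import torch.nn as nn

class TwoBodyMessage(nn.Module):
    def __init__(self, dim: int = 64):
        super().__init__()
        # u_1 acts on the concatenated receiver/sender node states
        self.u1 = nn.Sequential(
            nn.Linear(2 * dim, dim), nn.SiLU(), nn.Linear(dim, dim)
        )
        self.update = nn.GRUCell(dim, dim)  # h_i^(t+1) from message m_i^(t)

    def forward(self, h: torch.Tensor, edge_index: torch.Tensor) -> torch.Tensor:
        src, dst = edge_index                       # edges j -> i
        pair = torch.cat([h[dst], h[src]], dim=-1)  # (E, 2*dim)
        msg = torch.zeros_like(h).index_add_(0, dst, self.u1(pair))  # sum over j
        return self.update(msg, h)

# Usage on a toy 4-atom graph
h = torch.randn(4, 64)
edge_index = torch.tensor([[1, 2, 3, 0],   # source nodes j
                           [0, 0, 1, 2]])  # destination nodes i
h_next = TwoBodyMessage()(h, edge_index)   # -> torch.Size([4, 64])
```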

2.4. Diffraction Pattern Encoder

This work feeds the diffraction pattern into the model as a one-dimensional vector of length 140, aggregates the information into a 384-channel one-dimensional vector through two 1D-ResNet layers [44], and uses this vector as the embedding of the diffraction pattern.
The core advantage of 1D convolutional networks for XRD pattern encoding stems from their deep alignment with the characteristics of diffraction signals. As a one-dimensional angle–intensity sequence, the key information in an XRD pattern is embedded in local peak-shape structures: a 1D-CNN can accurately capture the position, intensity, and half-width of diffraction peaks through the local receptive fields of its convolutional kernels. Hierarchical convolutional structures abstract peak-shape combination patterns step by step, achieving an efficient mapping from atomic-level diffraction features to macroscopic phase recognition.
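A sketch of this pattern tower under stated assumptions: only the input length (140) and the embedding width (384) follow the text, while kernel sizes, the channel schedule, and the pooling strategy are illustrative choices.

```python
# Minimal 1D-ResNet encoder: length-140 profile -> 384-dim embedding.
import torch
import torch.nn as nn

class ResBlock1D(nn.Module):
    def __init__(self, c_in: int, c_out: int):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv1d(c_in, c_out, kernel_size=5, padding=2),
            nn.BatchNorm1d(c_out), nn.ReLU(),
            nn.Conv1d(c_out, c_out, kernel_size=5, padding=2),
            nn.BatchNorm1d(c_out),
        )
        # 1x1 projection on the skip path when channel counts differ
        self.skip = nn.Conv1d(c_in, c_out, 1) if c_in != c_out else nn.Identity()

    def forward(self, x):
        return torch.relu(self.conv(x) + self.skip(x))

class XRDEncoder(nn.Module):
    def __init__(self, emb_dim: int = 384):
        super().__init__()
        self.blocks = nn.Sequential(ResBlock1D(1, 128), ResBlock1D(128, emb_dim))
        self.pool = nn.AdaptiveAvgPool1d(1)  # aggregate over the angle axis

    def forward(self, x):                 # x: (batch, 140)
        z = self.blocks(x.unsqueeze(1))   # (batch, 384, 140)
        return self.pool(z).squeeze(-1)   # (batch, 384)

emb = XRDEncoder()(torch.randn(8, 140))  # -> torch.Size([8, 384])
```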

2.5. Loss Function

The loss function in this work follows the InfoNCE loss of SimCLR [45], using all sample pairs within a batch, except the matched pair, as negative samples. However, as a multimodal contrastive learning task, this work adopts the mean of the mutual InfoNCE losses between the two modalities as the final loss of the model, consistent with the final loss design of CLIP [32]:
$$\mathcal{L}_{\text{SimCLR}} = -\frac{1}{2N} \sum_{i=1}^{2N} \log \frac{\exp\!\big(\mathbf{z}_i \cdot \mathbf{z}_j / (\tau \lVert\mathbf{z}_i\rVert \lVert\mathbf{z}_j\rVert)\big)}{\sum_{k=1}^{2N} \mathbb{1}_{k \neq i} \exp\!\big(\mathbf{z}_i \cdot \mathbf{z}_k / (\tau \lVert\mathbf{z}_i\rVert \lVert\mathbf{z}_k\rVert)\big)} \qquad (4)$$
where in Equation (4), $\mathcal{L}_{\text{SimCLR}}$ is the loss for this task; $\mathbf{z}_i$ and $\mathbf{z}_j$ are feature vectors of the same original sample under different data augmentations; $\tau$ is the temperature parameter; $N$ is the batch size; and $\mathbb{1}_{k \neq i}$ is an indicator that excludes self-comparison.
$$\mathcal{L}_{\text{EACNN}} = \tfrac{1}{2}\big(\mathcal{L}_{\text{crystal}\to\text{xrd}} + \mathcal{L}_{\text{xrd}\to\text{crystal}}\big) \qquad (5)$$

$$\mathcal{L}_{\text{crystal}\to\text{xrd}} = -\frac{1}{B} \sum_{i=1}^{B} \log \frac{\exp\!\big(\mathbf{C}_i \cdot \mathbf{X}_i / (\lambda \lVert\mathbf{C}_i\rVert \lVert\mathbf{X}_i\rVert)\big)}{\sum_{j=1}^{B} \exp\!\big(\mathbf{C}_i \cdot \mathbf{X}_j / (\lambda \lVert\mathbf{C}_i\rVert \lVert\mathbf{X}_j\rVert)\big)} \qquad (6)$$

$$\mathcal{L}_{\text{xrd}\to\text{crystal}} = -\frac{1}{B} \sum_{i=1}^{B} \log \frac{\exp\!\big(\mathbf{X}_i \cdot \mathbf{C}_i / (\lambda \lVert\mathbf{X}_i\rVert \lVert\mathbf{C}_i\rVert)\big)}{\sum_{j=1}^{B} \exp\!\big(\mathbf{X}_i \cdot \mathbf{C}_j / (\lambda \lVert\mathbf{X}_i\rVert \lVert\mathbf{C}_j\rVert)\big)} \qquad (7)$$
where in Equations (6) and (7), $\mathbf{C}_i$ and $\mathbf{X}_i$ are the embeddings of the $i$-th crystal structure and XRD pattern in the batch, $\lambda$ is a trainable temperature parameter initialized to 0.07, and $B$ is the batch size.
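In code, the symmetric loss of Equations (5)–(7) reduces to cross-entropy over a cosine-similarity matrix whose diagonal holds the matched crystal–pattern pairs. The sketch below is a generic CLIP-style implementation consistent with these equations, not the authors’ released code.

```python
# Bidirectional InfoNCE with a trainable temperature initialized to 0.07.
import torch
import torch.nn.functional as F

class EACNNLoss(torch.nn.Module):
    def __init__(self, init_temp: float = 0.07):
        super().__init__()
        self.log_temp = torch.nn.Parameter(torch.log(torch.tensor(init_temp)))

    def forward(self, crystal_emb: torch.Tensor, xrd_emb: torch.Tensor):
        c = F.normalize(crystal_emb, dim=-1)       # cosine similarity via
        x = F.normalize(xrd_emb, dim=-1)           # normalized dot products
        logits = c @ x.t() / self.log_temp.exp()   # (B, B) similarity matrix
        targets = torch.arange(c.size(0), device=c.device)  # pairs on diagonal
        loss_c2x = F.cross_entropy(logits, targets)         # Eq. (6)
        loss_x2c = F.cross_entropy(logits.t(), targets)     # Eq. (7)
        return 0.5 * (loss_c2x + loss_x2c)                  # Eq. (5)

loss = EACNNLoss()(torch.randn(32, 384), torch.randn(32, 384))
```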

3. Results and Discussion

We trained EACNN on a dataset based on the MP database and tested its performance in retrieving structures from patterns on an independent test set. For contrastive learning, Top-N accuracy is commonly used to characterize performance. The specific Top-N values are presented in Table 1, with the Top-1, Top-3, and Top-5 accuracies being 95.96%, 99.95%, and 99.98%, respectively, even though these matches are based entirely on the model’s automatic learning rather than special annotations. From Table 1, it is evident that the model’s performance nearly converges at Top-3.
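For reference, Top-N retrieval accuracy of this kind can be computed directly from the two embedding sets by ranking all candidate structures by cosine similarity for each query pattern. The sketch below is a generic evaluation routine, not the authors’ script.

```python
# Top-N accuracy: does the true structure appear among the N most similar
# candidates for each XRD query embedding?
import torch
import torch.nn.functional as F

def top_n_accuracy(xrd_emb, crystal_emb, ns=(1, 3, 5)):
    x = F.normalize(xrd_emb, dim=-1)
    c = F.normalize(crystal_emb, dim=-1)
    sims = x @ c.t()                              # (num_queries, num_candidates)
    ranks = sims.argsort(dim=1, descending=True)  # candidates by similarity
    truth = torch.arange(x.size(0)).unsqueeze(1)  # pair i <-> i is ground truth
    return {n: (ranks[:, :n] == truth).any(dim=1).float().mean().item()
            for n in ns}

scores = top_n_accuracy(torch.randn(1000, 384), torch.randn(1000, 384))
```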
From the perspective of feature learning, the high Top-N accuracy indicates that the model has established a robust mapping in the latent space. Through contrastive training on positive and negative sample pairs, the model effectively captures the nonlinear correspondence between atomic arrangements in crystal structures and X-ray diffraction patterns. A further analysis examines the ranking of crystals retrieved for similar patterns in the test set, primarily to observe the similarities and differences among the crystals paired with a detected pattern. As shown in Figure 3, the detected pattern is labeled True XRD, and the first structure retrieved is the true structure. The model is then asked for the second-ranked (most similar) structure, and its corresponding XRD pattern is extracted; the same is done for the third-ranked structure and its pattern.
From the model’s inference results shown in Figure 3, the space group of the crystal structure corresponding to the detected XRD pattern is $P2_1/c$ (monoclinic, Bravais lattice type P). The space group of the Top-2 inferred structure is $Pbam$ (orthorhombic, Bravais lattice type P), and that of the Top-3 inferred structure extends to $C2/m$ (monoclinic, Bravais lattice type C). From the perspective of symmetry evolution, the model’s prediction ranking of $P2_1/c$, $Pbam$, and $C2/m$ reveals a deep connection between the hierarchy of symmetry breaking and diffraction characteristics, despite the significant overlap of the XRD patterns in specific angular ranges, as shown by the strong peak clusters. This symmetry-sensitive prediction mechanism may originate from E(3) invariance and the strong locality constraints of the cluster expansion: (1) during data augmentation, applying symmetry transformations allowed by specific space groups (such as glide plane reflections and screw axis translations) to atomic positions imposes an invariance constraint under symmetry operations, forcing the model to learn intrinsic symmetry features rather than specific atomic coordinate arrangements. For example, the screw axis operation of $P2_1/c$ generates a periodic phase shift in the diffraction intensity distribution, and the model filters out such nonessential differences through an invariance loss. (2) Lattice symmetry components (related to Bravais types) and atomic-motif components (related to Wyckoff positions) are learned automatically and undergo distance relaxation during the aggregation of local representations (the essence of contrastive learning is to readjust the distances between samples); for instance, $P2_1/c$ and $C2/m$ are clearly separated in the lattice symmetry subspace (the Bravais types are explicitly separated) but remain similar in the atomic-motif subspace. This differs fundamentally from traditional database queries, which typically rely on preset structural descriptors (such as space group numbers and cell parameter thresholds) or empirical peak position similarity (such as Euclidean distance) and cannot distinguish symmetry breaking due to differences in Bravais lattice type (such as P→C) from pseudo-symmetry caused by atomic-motif perturbations. In addition, the deep learning method compensates for the lack of local–global feature coupling: traditional methods usually handle lattice parameters and atomic positions independently, while deep models achieve multiscale aggregation, establishing dynamic associations between local chemical environment features from the cluster expansion and global symmetry features.
To further analyze the physical information learned by the model, t-SNE [46] was used to reduce the dimensionality of the model’s hidden-layer embeddings for visualization. From Figure 4, it can be seen that while some local clustering by crystal system is retained during contrastive learning, the overall clustering is not strongly constrained by crystal system.
In traditional methods, the feature distributions of the seven crystal systems exhibit discrete hard boundaries (such as the sharp division between cubic and hexagonal systems), which stem from the linear dependence of handcrafted features on cell parameters (such as the strong constraint $a = b = c$ in cubic systems). Under the soft clustering mechanism of contrastive learning, high-symmetry crystal systems (such as cubic) still maintain local clustering, indicating that their features remain strongly distinguishable in the model’s latent space, while low-symmetry crystal systems (monoclinic, triclinic) form a gradual transition zone. For example, the distributions of latent vectors for the monoclinic and orthorhombic systems partially overlap, because the lower symmetry of monoclinic and triclinic systems allows a larger range of cell parameters and atomic arrangements, making them susceptible to lattice distortions and blurring the feature distributions.
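A Figure-4-style view can be reproduced as sketched below: t-SNE projects the hidden-layer embeddings to two dimensions, colored by crystal system. The perplexity value and the random stand-in data are assumptions; in practice the learned embeddings and true labels would be used.

```python
# t-SNE projection of embeddings, colored by crystal system (0..6).
import numpy as np
from sklearn.manifold import TSNE
import matplotlib.pyplot as plt

embeddings = np.random.randn(2000, 384)         # stand-in for learned embeddings
crystal_system = np.random.randint(0, 7, 2000)  # stand-in labels, seven systems

xy = TSNE(n_components=2, perplexity=30, init="pca",
          random_state=0).fit_transform(embeddings)
plt.scatter(xy[:, 0], xy[:, 1], c=crystal_system, s=4, cmap="tab10")
plt.colorbar(label="crystal system")
plt.tight_layout()
plt.show()
```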
Owing to this instance-level recognition approach, contrastive learning is not constrained by strict hard classification categories. We randomly selected several space groups with very small sample counts, together with the space group $Fm\bar{3}m$, which has a large number of test samples, and measured the space group accuracy of Top-1 retrieval, as shown in Table 2.
From Table 2, it can be seen that the individual characteristics of small-sample space groups are well preserved. The complementarity between the E(3)-invariant cluster expansion and contrastive training in the feature space provides a dual guarantee of model performance: the former ensures feature interpretability through an explicit physical parameterization, while the latter mines implicit cross-scale correlation patterns in a data-driven manner. This hybrid paradigm retains prior knowledge from materials science while fully leveraging the nonlinear fitting power of deep learning, providing new insights for establishing a universal structure–property mapping.
By redefining XRD analysis as a continuous manifold learning problem, our contrastive learning framework transcends the limitations of conventional classification paradigms. It not only advances structure identification for database-excluded materials but also provides a foundation for autonomous materials discovery systems. The synergy between equivariant neural networks and physics-driven constraints opens new avenues for decoding complex structure–property relationships across materials science. In the Supplementary Materials, Table S1 systematically summarizes representative applications of machine learning in X-ray diffraction (XRD) analysis, detailing six exemplary methodologies along with their research domains, specific contributions, and technical characteristics. Table S2 documents the crystallographic parameters of the three samples of Figure 3 in standard CIF format, including precise lattice constants, fractional atomic coordinates, and site occupancy factors: critical structural information serving as a benchmark for reproducing the crystalline models presented in this study.
While our approach excels in simulated XRD data, experimental complexities—such as preferred orientation effects, instrumental broadening, and amorphous background signals—require further adaptation. Future work should incorporate domain adaptation techniques to align simulated and experimental feature distributions. Additionally, extending the framework to multimodal data (e.g., pairing XRD with PDF analysis or spectroscopy) could enhance structural resolution for disordered systems. The current model’s reliance on Materials Project data also necessitates validation against experimentally synthesized novel materials, particularly metastable phases absent in computational databases.

4. Conclusions

This paper presents a novel study combining contrastive learning with crystal structure analysis, marking a paradigm shift in XRD-based material characterization. Notable Top-1 (95.96%), Top-3 (99.95%), and Top-5 (99.98%) accuracies were achieved in the structure retrieval task. However, experimental challenges such as preferred orientation effects still require further work to obtain more realistic results. Future research should focus on domain adaptation techniques and on extending the framework to multimodal data to improve structural resolution, further advancing intelligent and efficient materials science. We hope this paradigm will be expanded to a broader range of material systems and characterization techniques, laying the foundation for a generalized material analysis platform.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/app15115851/s1, Table S1: Supporting Evidence for the Stated Claims in the Text; Table S2: The three Crystallographic Information File (CIF) entries corresponding to the three structures shown in Figure 3.

Author Contributions

Conceptualization, X.Z. and S.H.; methodology, X.W. and S.H.; validation, X.Z. and X.W.; formal analysis, X.W.; investigation, X.Z.; resources, S.H.; data curation, X.Z.; writing—original draft preparation, X.Z.; writing—review and editing, S.H.; visualization, X.W.; supervision, S.H.; project administration, S.H.; funding acquisition, S.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key R&D Program of China (Grant No. 2023YFB4402600), the National Natural Science Foundation of China (Grant Nos. 52271007, 12074241, 22173058, 12274278, 12274279), the Major Science and Technology Projects of Shanxi Province (No. 202201150501024), and the Shanghai Technical Service Center of Science and Engineering Computing, Shanghai University. The APC was funded by the National Natural Science Foundation of China (Grant No. 52271007).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets used and analyzed during the current study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Friedrich, W.; Knipping, P.; Laue, M. Interferenzerscheinungen bei Röntgenstrahlen. Ann. Phys. 1913, 346, 971–988. [Google Scholar] [CrossRef]
  2. Bragg, W.H.; Bragg, W.L. The reflection of X-rays by crystals. Proc. R. Soc. Lond. Ser. A 1913, 88, 428–438. [Google Scholar] [CrossRef]
  3. Debye, P.; Scherrer, P. Interferenzen an regellos orientierten Teilchen im Röntgenlicht. I. Nachrichten Ges. Wiss. Göttingen Math.-Phys. Kl. 1916, 1916, 1–15. [Google Scholar]
  4. Toby, B.H.; Von Dreele, R.B. GSAS-II: The genesis of a modern open-source all purpose crystallography software package. J. Appl. Crystallogr. 2013, 46, 544–549. [Google Scholar] [CrossRef]
  5. Rietveld, H.M. A profile refinement method for nuclear and magnetic structures. J. Appl. Crystallogr. 1969, 2, 65–71. [Google Scholar] [CrossRef]
  6. Park, W.B.; Chung, J.; Jung, J.; Sohn, K.; Singh, S.P.; Pyo, M.; Shin, N.; Sohn, K.S. Classification of crystal structure using a convolutional neural network. IUCrJ 2017, 4, 486–494. [Google Scholar] [CrossRef]
  7. Oviedo, F.; Ren, Z.; Sun, S.; Settens, C.; Liu, Z.; Hartono, N.T.P.; Ramasamy, S.; DeCost, B.L.; Tian, S.I.; Romano, G.; et al. Fast and interpretable classification of small X-ray diffraction datasets using data augmentation and deep neural networks. NPJ Comput. Mater. 2019, 5, 60. [Google Scholar] [CrossRef]
  8. Billinge, S.J.; Levin, I. The problem with determining atomic structure at the nanoscale. Science 2007, 316, 561–565. [Google Scholar] [CrossRef]
  9. Jenkins, R.; Snyder, R.L. Introduction to X-Ray Powder Diffractometry; Wiley Online Library: Hoboken, NJ, USA, 1996; Volume 138. [Google Scholar]
  10. Furukawa, H.; Cordova, K.E.; O’Keeffe, M.; Yaghi, O.M. The chemistry and applications of metal-organic frameworks. Science 2013, 341, 1230444. [Google Scholar] [CrossRef]
  11. O’Keeffe, M.; Peskov, M.A.; Ramsden, S.J.; Yaghi, O.M. The reticular chemistry structure resource (RCSR) database of, and symbols for, crystal nets. Accounts Chem. Res. 2008, 41, 1782–1789. [Google Scholar] [CrossRef]
  12. Su, T.; Cao, B.; Hu, S.; Li, M.; Zhang, T.Y. CGWGAN: Crystal generative framework based on Wyckoff generative adversarial network. J. Mater. Inform. 2024, 4, 20. [Google Scholar] [CrossRef]
  13. Davel, C.; Bassiri-Gharb, N.; Correa-Baena, J.P. Machine Learning in X-ray Scattering for Materials Discovery and Characterization. ChemRxiv 2024. [Google Scholar] [CrossRef]
  14. Prasianakis, N.I. AI-enhanced X-ray diffraction analysis: Towards real-time mineral phase identification and quantification. IUCrJ 2024, 11, 647–648. [Google Scholar] [CrossRef]
  15. Surdu, V.A.; Győrgy, R. X-ray diffraction data analysis by machine learning methods—A review. Appl. Sci. 2023, 13, 9992. [Google Scholar] [CrossRef]
  16. Wang, B.; Guan, Z.; Yao, S.; Qin, H.; Nguyen, M.H.; Yager, K.; Yu, D. Deep learning for analysing synchrotron data streams. In Proceedings of the 2016 New York Scientific Data Summit (NYSDS), New York, NY, USA, 14–17 August 2016; pp. 1–5. [Google Scholar]
  17. Czyzewski, A.; Krawiec, F.; Brzezinski, D.; Porebski, P.J.; Minor, W. Detecting anomalies in X-ray diffraction images using convolutional neural networks. Expert Syst. Appl. 2021, 174, 114740. [Google Scholar] [CrossRef]
  18. Chakraborty, A.; Sharma, R. See deeper: Identifying crystal structure from x-ray diffraction patterns. In Proceedings of the 2020 International Conference on Cyberworlds (CW), Caen, France, 29 September–1 October 2020; pp. 49–54. [Google Scholar]
  19. Chakraborty, A.; Sharma, R. A deep crystal structure identification system for X-ray diffraction patterns. Vis. Comput. 2022, 38, 1275–1282. [Google Scholar] [CrossRef]
  20. Massuyeau, F.; Broux, T.; Coulet, F.; Demessence, A.; Mesbah, A.; Gautier, R. Perovskite or Not Perovskite? A Deep-Learning Approach to Automatically Identify New Hybrid Perovskites from X-ray Diffraction Patterns. Adv. Mater. 2022, 34, 2203879. [Google Scholar] [CrossRef]
  21. Ishitsuka, K.; Ojima, H.; Mogi, T.; Kajiwara, T.; Sugimoto, T.; Asanuma, H. Characterization of hydrothermal alteration along geothermal wells using unsupervised machine-learning analysis of X-ray powder diffraction data. Earth Sci. Inform. 2022, 15, 73–87. [Google Scholar] [CrossRef]
  22. Vecsei, P.M.; Choo, K.; Chang, J.; Neupert, T. Neural network based classification of crystal symmetries from x-ray diffraction patterns. Phys. Rev. B 2019, 99, 245120. [Google Scholar] [CrossRef]
  23. Suzuki, Y.; Hino, H.; Hawai, T.; Saito, K.; Kotsugi, M.; Ono, K. Symmetry prediction and knowledge discovery from X-ray diffraction patterns using an interpretable machine learning approach. Sci. Rep. 2020, 10, 21790. [Google Scholar] [CrossRef]
  24. Venderley, J.; Mallayya, K.; Matty, M.; Krogstad, M.; Ruff, J.; Pleiss, G.; Kishore, V.; Mandrus, D.; Phelan, D.; Poudel, L.; et al. Harnessing interpretable and unsupervised machine learning to address big data from modern X-ray diffraction. Proc. Natl. Acad. Sci. USA 2022, 119, e2109665119. [Google Scholar] [CrossRef] [PubMed]
  25. Utimula, K.; Hunkao, R.; Yano, M.; Kimoto, H.; Hongo, K.; Kawaguchi, S.; Suwanna, S.; Maezono, R. Machine-Learning Clustering Technique Applied to Powder X-Ray Diffraction Patterns to Distinguish Compositions of ThMn12-Type Alloys. Adv. Theory Simulations 2020, 3, 2000039. [Google Scholar] [CrossRef]
  26. Utimula, K.; Yano, M.; Kimoto, H.; Hongo, K.; Nakano, K.; Maezono, R. Feature space of XRD patterns constructed by an autoencoder. Adv. Theory Simulations 2023, 6, 2200613. [Google Scholar] [CrossRef]
  27. Wu, Y.; Su, T.; Du, B.; Hu, S.; Xiong, J.; Pan, D. Kolmogorov–Arnold Network Made Learning Physics Laws Simple. J. Phys. Chem. Lett. 2024, 15, 12393–12400. [Google Scholar] [CrossRef]
  28. Lai, Q.; Xu, F.; Yao, L.; Gao, Z.; Liu, S.; Wang, H.; Lu, S.; He, D.; Wang, L.; Zhang, L.; et al. End-to-End Crystal Structure Prediction from Powder X-Ray Diffraction. Adv. Sci. 2025, 12, 2410722. [Google Scholar] [CrossRef]
  29. Guo, G.; Goldfeder, J.; Lan, L.; Ray, A.; Yang, A.H.; Chen, B.; Billinge, S.J.L.; Lipson, H. Towards end-to-end structure determination from X-ray diffraction data using deep learning. NPJ Comput. Mater. 2024, 10, 209. [Google Scholar] [CrossRef]
  30. Schlesinger, C.; Fitterer, A.; Buchsbaum, C.; Habermehl, S.; Chierotti, M.R.; Nervi, C.; Schmidt, M.U. Ambiguous structure determination from powder data: Four different structural models of 4, 11-difluoroquinacridone with similar X-ray powder patterns, fit to the PDF, SSNMR and DFT-D. IUCrJ 2022, 9, 406–424. [Google Scholar] [CrossRef]
  31. Parackal, A.S.; Goodall, R.E.; Faber, F.A.; Armiento, R. Identifying crystal structures beyond known prototypes from x-ray powder diffraction spectra. Phys. Rev. Mater. 2024, 8, 103801. [Google Scholar] [CrossRef]
  32. Radford, A.; Kim, J.W.; Hallacy, C.; Ramesh, A.; Goh, G.; Agarwal, S.; Sastry, G.; Askell, A.; Mishkin, P.; Clark, J.; et al. Learning transferable visual models from natural language supervision. In Proceedings of the International Conference on Machine Learning, Virtual, 18–24 July 2021; pp. 8748–8763. [Google Scholar]
  33. Du, W.; Zhang, H.; Du, Y.; Meng, Q.; Chen, W.; Zheng, N.; Shao, B.; Liu, T.Y. SE(3) equivariant graph neural networks with complete local frames. In Proceedings of the International Conference on Machine Learning, Baltimore, MD, USA, 17–23 July 2022; pp. 5583–5608. [Google Scholar]
  34. Drautz, R. Atomic cluster expansion for accurate and transferable interatomic potentials. Phys. Rev. B 2019, 99, 014104. [Google Scholar] [CrossRef]
  35. Batatia, I.; Kovacs, D.P.; Simm, G.; Ortner, C.; Csányi, G. MACE: Higher order equivariant message passing neural networks for fast and accurate force fields. Adv. Neural Inf. Process. Syst. 2022, 35, 11423–11436. [Google Scholar]
  36. Willman, J.T.; Perriot, R.; Ticknor, C. Accurate and efficient parameterization of an atomic cluster expansion (ACE) potential for ammonia under extreme conditions. J. Chem. Phys. 2025, 162, 144316. [Google Scholar] [CrossRef] [PubMed]
  37. Guo, L.; Liu, Y.; Yang, L.; Cao, B. Lattice dynamics modeling of thermal transport in solids using machine-learned atomic cluster expansion potentials: A tutorial. J. Appl. Phys. 2025, 137, 081101. [Google Scholar] [CrossRef]
  38. Rinaldi, M.; Bochkarev, A.; Lysogorskiy, Y.; Drautz, R. Charge-constrained atomic cluster expansion. Phys. Rev. Mater. 2025, 9, 033802. [Google Scholar] [CrossRef]
  39. Zhang, B.; Chen, E.; Asta, M. Oxygen grain-boundary segregation in HCP Ti—Computational investigations using an atomic cluster expansion potential. Comput. Mater. Sci. 2025, 248, 113577. [Google Scholar]
  40. Momma, K.; Izumi, F. VESTA: A three-dimensional visualization system for electronic and structural analysis. J. Appl. Crystallogr. 2008, 41, 653–658. [Google Scholar] [CrossRef]
  41. Momma, K.; Izumi, F. VESTA 3 for three-dimensional visualization of crystal, volumetric and morphology data. J. Appl. Crystallogr. 2011, 44, 1272–1276. [Google Scholar] [CrossRef]
  42. Fredericks, S.; Parrish, K.; Sayre, D.; Zhu, Q. PyXtal: A Python library for crystal structure generation and symmetry analysis. Comput. Phys. Commun. 2021, 261, 107810. [Google Scholar] [CrossRef]
  43. Gilmer, J.; Schoenholz, S.S.; Riley, P.F.; Vinyals, O.; Dahl, G.E. Neural message passing for quantum chemistry. In Proceedings of the International Conference on Machine Learning, Sydney, Australia, 6–11 August 2017; pp. 1263–1272. [Google Scholar]
  44. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
  45. Chen, T.; Kornblith, S.; Norouzi, M.; Hinton, G. A simple framework for contrastive learning of visual representations. In Proceedings of the International Conference on Machine Learning, Virtual, 13–18 July 2020; pp. 1597–1607. [Google Scholar]
  46. Su, T.; Cui, Y.; Lian, Z.; Hu, M.; Li, M.; Lu, W.; Ren, W. Physics-Based Feature Makes Machine Learning Cognizing Crystal Properties Simple. J. Phys. Chem. Lett. 2021, 12, 8521–8527. [Google Scholar] [CrossRef]
Figure 1. Schematic diagram of the model architecture.
Figure 2. Histogram of the frequency distribution of different space groups in the dataset (the numbers behind the bars represent the frequency values in descending order from left to right and from bottom to top). The statistics of space groups are divided into five subfigures (a–e), with each subfigure containing values of similar magnitude.
Figure 3. Structures inferred from the detected XRD pattern and their corresponding patterns. From bottom to top: the detected XRD pattern (which is also the pattern of the Top-1 recommended structure); the Top-2 candidate structure and its pattern; the Top-3 candidate structure and its pattern.
Figure 4. Visualization of hidden layers in contrastive learning vs. seven crystal systems.
Table 1. Performance comparison of structure retrieval from XRD patterns.

Model     Top-1 (%)    Top-3 (%)    Top-5 (%)
EACNN     95.96        99.95        99.98
Table 2. Space group frequency and accuracy.

Space Group                                                     Frequency    Accuracy
225                                                             808          95.92%
3, 24, 34, 37, 39, 41, 48, 50, 95, 97, 112, 116, 120,           1 (each)     96.00%
132, 138, 143, 157, 159, 180, 192, 195, 197, 202, 203, 214