AI Design for High Entropy Alloys: Progress, Challenges and Future Prospects

Xie, Enzhi; Yang, Chao

doi:10.3390/met15091012

Open AccessReview

AI Design for High Entropy Alloys: Progress, Challenges and Future Prospects

by

Enzhi Xie

¹ and

Chao Yang

^1,2,*

¹

Shanghai Key Laboratory of Advanced High-Temperature Materials and Precision Forming, School of Materials Science and Engineering, Shanghai Jiao Tong University, Shanghai 200240, China

²

Inner Mongolia Research Institute, Shanghai Jiao Tong University, Hohhot 010010, China

^*

Author to whom correspondence should be addressed.

Metals 2025, 15(9), 1012; https://doi.org/10.3390/met15091012

Submission received: 16 August 2025 / Revised: 4 September 2025 / Accepted: 8 September 2025 / Published: 11 September 2025

(This article belongs to the Special Issue Advances in High-Entropy Alloys’ Microstructure, Properties and Preparation)

Download

Browse Figures

Versions Notes

Abstract

High-entropy alloys have demonstrated significant application potential in many industrial fields due to their outstanding comprehensive properties. However, the complex multi-component compositions pose challenges for traditional design approaches. In recent years, artificial intelligence (AI) technology, with its powerful capabilities in data analysis, prediction, and optimization, has provided new pathways for rapid discovery and performance modulation of high-entropy alloys. This paper systematically reviews the latest advancements in AI applications for high-entropy alloy design, covering key technologies such as machine learning models (e.g., active learning, generative models, transfer learning), high-throughput computing and experimental data processing, phase structure and property prediction. It also presents typical application cases, including compositional optimization, phase structure prediction, performance synergistic regulation, and novel material discovery. Although AI has significantly improved design efficiency and accuracy, challenges remain, such as the scarcity of high-quality data, insufficient model interpretability, and interdisciplinary integration. Future efforts should focus on building a more robust data ecosystem, enhancing model transparency, and strengthening closed-loop validation between AI and experimental science to advance intelligent design and engineering applications of high-entropy alloys.

Keywords:

high entropy alloys; artificial intelligence; alloy design

1. Introduction

1.1. High Entropy Alloys

High Entropy Alloys (HEAs) are novel multi-component materials composed of five or more elements in stoichiometric or near-stoichiometric ratios [1], characterized by high mixed entropy [2,3]. Mixing entropy (ΔS_mix) refers to the increase in system entropy when different substances are mixed, reflecting an increase in microscopic disorder. It includes compositional complexity and configurational entropy (ΔS_conf). Compositional complexity (n): Geometric and electronic complexity caused by differences in element count, atomic size, electronegativity, etc., leading to localized chemical heterogeneity. Configurational entropy (ΔS_conf): Ideal mixing entropy

Δ S c o n f = - R \sum_{i} x_{i} \ln x_{i}

, which is only related to molar fraction and not element type differences. For an ideal system, the formula is:

Δ S m i x = - n R \sum_{i} x_{i} \ln x_{i}

, where xi represents the molar fraction of component i. Mixing entropy is always positive; the more homogeneous the mixture, the greater the entropy increase, as shown in Figure 1. High mixing entropy is only one of the necessary thermodynamic conditions for the formation of a single-phase solid solution, and its realization depends on enthalpy-entropy competition, dynamic freezing and lattice distortion. This entropy-plus-kinetics synergy stabilizes single solid solution phase structures [4,5] and enhances mechanical properties, corrosion resistance, high-temperature stability, and controllable performance through mechanisms including hysteresis diffusion, lattice distortion, and the “cocktail effect” [6,7], as shown in Figure 2. Breaking away from traditional alloy design paradigms, these materials have found extensive applications in energy systems, catalysis, and high-temperature structural engineering, demonstrating significant practical potential and research value [5,8,9].

1.2. The Thermodynamic-Dynamic Controversy

Notably, the current academic debate centers on whether the formation of high-entropy single phases is a thermodynamically entropy-driven equilibrium or dynamically trapped metastable states. Early “entropy-stability” models assumed ΔG = ΔH − TΔS < 0 to stabilize single phases. However, many high-entropy materials are fabricated through high-temperature sintering or rapid quenching. During these non-equilibrium processes, systems may fail to reach thermodynamic equilibrium and instead be “frozen” in a high-entropy single phase. This state remains metastable at room temperature (ΔG > 0), but phase transitions are dynamically suppressed due to extremely slow atomic diffusion rates (“diffusion barrier effect”), thus preserving it. Research by Miao et al. [11] demonstrates that the complex interplay between configurational entropy, thermodynamics, and kinetics significantly influences nanostructures and chemical environments in high-entropy oxide (HEO) films. Sharma et al. [12] found that the presence of multiple principal elements accelerates amorphization through severe lattice distortion and increased configurational entropy. The atomic size differences of hetero-elements in alloys ensure high stability of formed amorphous phases, effectively preventing transition to ordered lattice structures even after annealing. Therefore, “high-entropy single phases” are not purely thermodynamically inevitable but result from a tripartite interplay of entropy, enthalpy, and kinetics. Therefore, this paper argues that both “thermodynamic descriptors” (ΔH_mix, Ω parameters) and “dynamic descriptors” (diffusion activation energy Q, cold velocity) should be introduced into the AI model, rather than just ΔS_conf.

1.3. Material Design

Current design approaches for high-entropy materials primarily include stoichiometric ratio design, non-stoichiometric ratio design, computational aid design, phase diagram calculations (CALPHAD), experimental screening, gradient material design, and nanostructure design [13,14,15]. Stoichiometric ratio design is straightforward and suitable for initial screening, though it may overlook the unique roles of certain elements [16]. Non-stoichiometric ratio design allows flexible performance tuning but involves higher complexity [17,18]. Computational aid design significantly accelerates material development while reducing experimental costs, though it heavily relies on model accuracy [19,20,21]. Phase diagram calculations provide phase stability and transition information, yet depend on database precision [22,23,24]. Experimental screening yields direct, reliable results but is time-consuming and labor-intensive [25,26,27]. Gradient material design and nanostructure design enable multifunctional integration and performance enhancement, though they require complex manufacturing processes [28,29,30]. In practice, these methods each have distinct advantages and disadvantages, and multiple approaches are often combined to optimize material properties.

As shown in Figure 3, in the field of material design, the integration of AI technology has created both new opportunities and challenges for HEA development [31,32,33]. By leveraging its robust data analysis and predictive capabilities, AI can swiftly extract valuable insights from massive datasets, accelerating the discovery and optimization of novel materials [20,34,35]. The technology excels at handling complex multivariate relationships, establishing precise input-output mappings to enable accurate performance prediction and optimization [36,37].

The application of AI technology in material design offers multiple significant advantages [39,40,41]. First, it can significantly accelerate R&D processes by optimizing material combinations through machine learning, thereby shortening development cycles and enhancing efficiency [42,43]. Second, AI technology reduces R&D costs by replacing substantial exploratory experiments with computational methods, enabling the screening of “higher-probability” material structures for validation experiments, thus cutting down on expensive trial-and-error costs [44,45,46]. Additionally, AI technology improves material performance through deep analysis and predictive modeling, assisting in designing materials with superior properties [40,47]. Furthermore, it demonstrates innovative capabilities by extracting key elements from vast data to generate novel material design solutions, driving advancements in materials science [20,39].

In practical applications, AI technology has yielded numerous exemplary cases in material design. For instance, AI algorithms can rapidly screen and analyze vast amounts of material data to identify high-entropy material candidates, significantly boosting screening efficiency [31,48,49]. Material Genome Engineering leverages AI for digital material design by building genomic databases and models, accelerating the development and application of new materials [50,51]. Furthermore, generative AI models like MatterGen can generate material structures based on target properties, offering innovative approaches to material design [34,52,53,54,55]. These examples vividly demonstrate the immense potential and broad prospects of AI technology in material design.

2. AI Technology in HEA Design

2.1. Algorithm Principle and Applicability Analysis

To help readers in the metallurgical field quickly grasp the physical principles and operational essentials of various models, this section uses the traditional “phase diagram calculation —experimental verification” process as an analogy. The algorithm is broken down into three interconnected stages: ① characteristic smelting (input), ② phase region determination (learning), and ③ performance prediction (output). Table 1 provides a metallurgical interpretation of the core algorithm.

2.1.1. Random Forest and Gradient Boosting—Algorithmic “Multi-Burn-In”

Random forest employs parallel training of numerous decision trees, where each tree utilizes only partial “furnace batches” (sample data) and “features” (attributes), ultimately voting collectively or averaging results. Metallurgically speaking, this approach can be understood as cross-validation using experimental data from different production teams over the years, effectively reducing errors caused by anomalies in individual furnace batches. Gradient boosting, on the other hand, sequentially trains weak learners, continuously optimizing residual errors from previous iterations—much like refining impurities through a process of gradual reduction.

2.1.2. Deep Neural Network—“Diffusion Channel” Perspective

The three-layer fully connected network architecture (input → hidden → output layers) corresponds to the metallurgical process of “composition → micro-structure → properties”. The weight matrix W updates according to an exponential relationship similar to temperature-dependent diffusion coefficients, with gradient descent effectively lowering the system’s free energy. To prevent over-fitting (similar to grain abnormal growth), we implement Dropout = 0.2 after each layer, which acts as a second-phase particle anchoring mechanism.

2.1.3. Conditions Generate Adversarial Networks—“Reverse Design Casting”

Traditional mold casting determines the final shape. The generator G of CGAN outputs components most likely to meet the “target performance” condition vector based on this criterion. The discriminator D distinguishes between the virtual samples generated by G and real experimental data, with both adversarial networks competing until D can no longer differentiate them.

2.1.4. Active Learning—“Sampling Strategy”

Traditional experimental design commonly uses orthogonal tables or uniform designs; active learning replaces manual experience with uncertainty estimates (such as the BALD index) to automatically select alloys with “maximum information” for the next round of smelting. Active learning can reduce hardness prediction errors, which is equivalent to saving experimental costs.

2.1.5. Transfer Learning—“Experience Transfer”

When experimental data are insufficient for new alloy systems (e.g., Nb-Ta-Zr-Hf-Mo), we first pre-train the network on well-documented alloy systems (e.g., Al-Co-Cr-Cu-Fe-Ni). We then freeze weights near the input layer (corresponding to elemental physical properties) while fine-tuning weights near the output layer (corresponding to performance variations). This approach is analogous to transplanting a low-carbon steel dislocation strengthening model to high-entropy steel, requiring only modifications to the dislocation energy-related terms.

2.2. Machine Learning Model

Building on the algorithms, machine learning—ranging from shallow RF to deep CGAN—stands as one of the most prevalent AI technologies in materials design [35]. By developing appropriate models, researchers can effectively predict and optimize the performance of high-entropy materials [56,57]. Active Learning (AL) algorithms, for instance, demonstrate exceptional efficiency in reducing data labeling costs while performing remarkably well on small datasets [58]. Furthermore, methodologies such as generative modeling, data augmentation, and transfer learning have been implemented in high-entropy material design to enhance model performance and generalization capabilities [59,60,61].

Active learning algorithms demonstrate unique advantages in high-entropy material design [62]. By intelligently selecting the most informative data points for labeling within limited resources, they maximize model training efficiency [31,57]. For instance, in designing high-entropy alloys, active learning can precisely identify the most promising alloy compositions from extensive combinations for experimental validation [62]. This approach avoids costly trial-and-error testing of all possible configurations, significantly reducing annotation costs [62]. Moreover, even with small datasets, active learning algorithms continuously improve predictive accuracy and generalization capabilities through iterative optimization, enabling them to perform exceptionally well on small sample sets [58].

Generative models have revolutionized the design of high-entropy materials. These models can generate novel material structures and compositional combinations with potential value by leveraging existing data distributions [63,64]. For instance, in designing high-entropy oxides, generative models can create new oxide materials with similar properties by analyzing the structural characteristics and elemental composition patterns of known stable oxides [31,65]. Not only do these generated materials theoretically exist, but they may also possess unique performance characteristics [31,65]. This approach provides material scientists with abundant research subjects and innovative approaches, significantly expanding the design possibilities for high-entropy materials.

The role of data augmentation techniques in high-entropy material design cannot be underestimated [60]. By reasonably transforming and expanding existing material data through operations such as adding noise, rotating, or flipping the data, we can enhance its diversity and complexity [66,67]. This approach helps improve machine learning models ‘adaptability and robustness to various scenarios, enabling them to perform more reliably across diverse practical applications [68,69,70]. Consequently, it allows for more accurate prediction and optimization of high-entropy materials’ properties [60,71,72,73].

Transfer learning offers cross-domain knowledge transfer advantages for high-entropy material design [57,61]. In materials science, different types of materials often share similarities and commonalities [74]. Transfer learning enables the application of knowledge and model parameters learned from related material fields to high-entropy material design [57,75]. For instance, extracting specific features from solid solution strengthening physical models and transferring them to high-entropy alloy prediction models can reduce reliance on large-scale annotated data in material design [57,59,76]. This approach accelerates model convergence speed, enhances performance and generalization capabilities, thereby allowing rapid breakthroughs in high-entropy material design by leveraging existing research achievements [31,76,77].

2.3. Data Processing and Analysis

AI technology can process and analyze massive amounts of data from experiments and simulations, mining for potential information and patterns [34]. For example, by combining high-throughput computing and experimental screening with machine learning models, the composition and structure of high-entropy materials can be rapidly determined and their properties predicted [78,79,80,81,82].

High-throughput computing plays a pivotal role in high-entropy material design [78,83,84]. Leveraging powerful computational resources, it enables simultaneous theoretical calculations and simulations of large-scale material systems [44,85,86]. For instance, in high-throughput density functional theory (DFT) computations, researchers can predict multiple aspects, including electronic structures, mechanical properties, and thermodynamic characteristics of high-entropy compounds with diverse compositions and architectures [87,88,89]. This approach allows rapid screening of potentially valuable material combinations within short timelines, significantly accelerating material discovery [87,90]. Moreover, the massive datasets generated through high-throughput computing serve as training foundations for machine learning models, providing rich and accurate data support that enhances model reliability and predictive capabilities [87,91].

Experimental screening is an indispensable component in high-entropy material design [81,92]. By integrating high-throughput experimental techniques, researchers can rapidly synthesize and characterize large-scale material samples in the laboratory [93,94,95]. For instance, high-throughput melting technology enables the simultaneous preparation of multiple high-entropy alloy specimens with varying compositions, followed by performance evaluations through rapid testing methods such as hardness measurement and tensile testing [84,96]. Integrating these experimental data with machine learning models allows for further optimization of model parameters and architectures, thereby enhancing the accuracy of material property predictions [76,97]. This approach enables the precise determination of high-entropy materials’ composition and structure, providing reliable candidate materials for practical applications [79,83,87].

AI technology, when processing and analyzing high-entropy material data, can uncover hidden information and patterns beneath the surface [31,98]. Through multivariate statistical methods like cluster analysis and principal component analysis applied to massive experimental and computational datasets, researchers can reveal intrinsic connections between material compositions, structures, and properties [99,100]. For instance, clustering analysis groups high-entropy materials with similar performance into clusters, thereby identifying potential correlations between specific elemental combinations or structural features and material characteristics [101,102]. These insights are crucial for guiding further design and optimization of high-entropy materials, enabling materials scientists to gain a deeper understanding of material fundamentals and develop targeted high-entropy materials more efficiently.

2.4. Performance Prediction

An AI model can establish the complex nonlinear relationship between the structure and properties of materials and realize the accurate prediction of the mechanical and physical properties of HEAs [40,103]. This helps to quickly screen out HEAs with potential application value, reducing the cost and time of experiments [104].

AI models demonstrate unique advantages in establishing complex nonlinear relationships between the structure and properties of HEAs [31,57]. Traditional material performance prediction methods, often based on simple linear models or empirical formulas, struggle to accurately describe the characteristics of multi-component systems like HEAs [105]. However, as shown in Figure 4, AI models such as neural networks can automatically learn complex mapping relationships between inputs (material composition, structure, etc.) and outputs (performance metrics) from massive datasets, even when these relationships are nonlinear and highly coupled [31,57]. For instance, when predicting yield strength in high-entropy alloys, AI models can comprehensively consider factors like elemental content, crystal structure, and grain size that influence strength. They also capture interactions and synergistic effects among these elements, thereby establishing accurate predictive models [106].

Accurate performance prediction plays a vital role in the rapid screening of high-entropy materials [107]. In practical applications, high-entropy materials often need to meet specific performance requirements. Through AI model predictions, researchers can efficiently identify the most promising candidates among numerous material options that best satisfy target performance criteria [108,109]. For instance, in aerospace engineering, there is a critical demand for high-entropy alloys that combine high strength, low density, and high-temperature resistance [110,111]. By leveraging AI models to predict properties of various alloy compositions and structures, scientists can quickly pinpoint several alloy systems with potential applications. Subsequent experimental verification and optimization of these systems significantly reduce the number and scope of required experiments, thereby substantially cutting experimental costs and time consumption [104].

The application of AI technology in predicting the performance of HEAs not only accelerates material development but also provides robust support for optimized design [112,113]. By analyzing and providing feedback on prediction results, it guides adjustments to material composition and structure, further enhancing performance [114,115]. For instance, if predictions indicate that a particular HEA has relatively low electrical conductivity with potential improvement, targeted adjustments can be made based on sensitivity analysis results from the model [3,89]. Subsequent re-prediction through the model enables gradual optimization of material properties until satisfactory outcomes are achieved.

2.5. Limitations of DFT and MD in HEA Modeling

Although density functional theory (DFT) and classical/machine learning molecular dynamics (MD) have become the core means of high entropy alloy (HEA) high-throughput screening and phase stability evaluation, their computational feasibility and prediction accuracy in multi-component systems face new challenges and need to be critically examined.

2.5.1. Calculate the Expansion Law of Cost with the Number of Master Elements

DFT: For a supercell containing M principal elements and N atoms, the number of electronic step self-consistent iterations increases linearly with M. To maintain equivalent accuracy, the k-point grid requires densification, resulting in a total CPU time ∝

M \cdot N^{3} \cdot N k^{3}

. Taking a 500-atom Co-Cr-Fe-Mn-Ni model as an example, single-point energy calculations already require approximately 10⁵ CPU-h. If the principal elements are increased to 7, the computational cost would at least triple or quadruple.

MD: The empirical potential parameter increases with the binary combination number

M (M - 1) / 2

. The machine learning potential training set must cover all

M (M - 1) (M - 2) / 6

ternary sub-spaces, with data generation costs ∝ M³. A single run of the million-atom GPU-MD still requires 10⁴–10⁵ GPU-hours.

Limitations of special quasi-random structure (SQS): In order to ensure randomness, the super-cell needs to be magnified by

\sqrt{M}

times, resulting in the growth of N with M², further exacerbating the exponential expansion.

2.5.2. The Accuracy of Phase Stability Prediction

Exchange correlation functions and magnetic entropy: Magnetic dipoles and phonon entropy exhibit reversible relative stability at high temperatures, yet are neglected in 0 K DFT calculations.

Short-range order (SRO) deficiency: When SRO length exceeds 6 Å, SQS fails to capture genuine chemical correlations, resulting in mixed enthalpy errors of 10–20 meV/atom and phase boundary temperature deviations of 100–200 K.

MD time scale: Experimental annealing requires 10²–10⁴ s, while classical MD can only simulate 10⁻⁷ s, failing to capture slow amplitude decomposition or σ phase separation. Potential function migration errors cause dislocation energy deviations up to 40%, directly affecting the driving force of the FCC-HCP phase transition.

In summary, DFT and MD face dual “precision-cost” bottlenecks in high-entropy alloys (HEA) scenarios. Future efforts should integrate CALPHAD, machine learning-based potential active learning, and experimental feedback to construct a multi-scale hybrid framework that balances accuracy and efficiency.

3. Application Cases of AI in HEA Design

3.1. Component Design

3.1.1. Application of the Generation Model in Refractory HEA Design

A research team from Pennsylvania State University published in the Journal of Materials Informatics demonstrates that generative models represent a highly promising new approach for HEA design [116]. By employing Conditional Generative Adversarial Networks (CGAN) with additional conditional vectors to control outputs, they successfully established a mapping relationship between latent space and desired performance metrics [116], as shown in Figure 5. The generator learns probability distributions of alloy compositions and properties to generate samples meeting specific requirements [116], as shown in Figure 6. The study pioneers a novel pathway for high-entropy alloy composition design, showcasing AI’s immense potential in material innovation [116].

3.1.2. Design of Multi-Objective Optimization Framework for Refractory HEAs

The research team led by Shu Yanjing at Beijing University of Science and Technology has developed a multi-objective optimization (MOO) framework that integrates machine learning, genetic search, cluster analysis, and experimental feedback to design refractory high-entropy alloys (RHEAs) with superior high-temperature strength and room-temperature ductility [117]. The team synthesized 24 distinct RHEAs and experimentally validated the exceptional performance of the Zr-Nb-Mo-Hf-Ta alloy under high-temperature conditions [117]. This study not only demonstrates AI’s application in material design but also validates the reliability of AI models through experimental verification, providing crucial references for RHEA development [117].

3.1.3. Comparison and Discussion

While generative models (e.g., CGAN) and multi-objective optimization frameworks (e.g., MOO) demonstrate significant potential in component design, they differ markedly in methodology and application scenarios. The Penn State team’s CGAN model focuses on implicit mapping between components and performance metrics, making it suitable for reverse engineering in high-dimensional spaces—particularly effective when lacking explicit physical models. In contrast, the MOO framework developed by the University of Science and Technology Beijing emphasizes balancing multiple objectives with experimental feedback, exhibiting strong engineering-oriented characteristics. Notably, both approaches rely on high-quality data inputs, and the “black box” nature of generative models still limits their application in mechanistic interpretation. Future research could explore integrating generative models with MOO to establish a closed-loop design process that transitions from “generating candidates” to “multi-objective optimization.”

3.2. Phase Structure Prediction

3.2.1. Classification and Prediction of HEA Phase Composition by Deep Learning Algorithm

The team from the University of Science and Technology Beijing (USTB) has developed a novel strategy, as shown in Figure 7, which uses genetic algorithms (GA) to automatically generate element numerical descriptors for high-entropy alloys (HEAs). This breakthrough method overcomes limitations of traditional empirical features, significantly improving prediction accuracy in phase structure analysis [118]. As shown in Figure 8, the system achieved 90.2%, 88.1%, and 82.7% accuracy rates in face-centered cubic (FCC), body-centered cubic (BCC), and bimodal classification tasks, respectively [118]. By dynamically optimizing the element descriptor space, they established a closed-loop system integrating “feature generation, model training, and experimental feedback,” providing a universal framework for data-driven material design [118].

3.2.2. Combination of Conditional Generation Adversarial Network and Active Learning

A machine learning model combining Conditional Generative Adversarial Networks (CGAN) and Active Learning (AL) has been developed to predict the body-centered cubic BCC phase, face-centered cubic FCC phase, and BCC + FCC phase [119] in high-entropy alloys. The model first employs domain knowledge embedding for feature selection, then utilizes CGAN to expand the data-set from 1016 samples to 1616. Machine learning training is conducted using the augmented data, followed by active learning techniques to enhance prediction accuracy. The final model achieved a precision rate of 96.08% [119].

3.2.3. Element Feature Transfer Adversarial Network

As shown in Figure 9, a research team from Tsinghua University has developed an algorithm framework combining Information Maximization Generative Adversarial Networks (InfoGAN) and Elementar Convolutional Graph Neural Networks (ECGNN)—known as the Elementar Feature Transfer Adversarial Network (EFTGAN)—for predicting properties of high-entropy alloys [61]. This network extracts elemental features from crystal atomic and structural information while generating new feature representations for prediction targets. By avoiding computationally intensive structural calculations and employing iterative methods, it significantly enhances prediction accuracy [61], as shown in Figure 10.

3.2.4. Comparison and Discussion

In phase structure prediction, genetic algorithms demonstrate high accuracy in classification tasks through element descriptor generation, particularly showing innovation in addressing the limitations of traditional feature representation. The CGAN-active learning strategy significantly improves small-sample prediction performance through data augmentation and active sampling. EFTGAN further integrates graph neural networks with feature transfer mechanisms, maintaining high precision while reducing reliance on first-principles calculations. These three approaches advance phase prediction accuracy and efficiency from three distinct perspectives: feature engineering, data augmentation, and feature learning. However, each faces challenges such as high model complexity, poor interpretability, and excessive computational demands. Future research could explore hybrid models combining multiple advantages to balance precision, efficiency, and explainability.

3.2.5. The Gap Between Machine Learning Predictions and Real Synthetic Dynamics

Although existing artificial intelligence prediction models demonstrate high accuracy (≥90%, see Section 3.2.1, Section 3.2.2 and Section 3.2.3) in distinguishing FCC, BCC, and duplex structures, most of these criteria are based on equilibrium or quasi-equilibrium thermodynamic assumptions, failing to adequately reflect the transient dynamic behaviors observed in actual powder metallurgy or additive manufacturing processes. Sharma et al. [12], in their study on intermetallic crystallization in magnesium-based high-entropy alloy powders, employed a two-step process combining high-energy ball milling and discharge plasma sintering (SPS) and observed the following phenomena: Continued ball milling beyond 5 h resulted in sharp peaks being fully incorporated into a broadened “mantle peak”, indicating that the intermetallic phase had undergone type-III crystallization via dislocation-induced and nanocrystalline boundary pathways. TEM dark-field images further revealed grain sizes < 5 nm, confirming that high-density defects drove the phase transformation. Notably, even after annealing at 500 °C/6 h, no recrystallization signs were observed in the amorphous phase (XRD mantel peaks remained essentially unchanged), demonstrating that under the combined effects of high cooling rates (~10⁶ K s⁻¹) and high defect density, the system had fallen into a deep potential well far beyond the FCC/BCC/HCP stable regions predicted by equilibrium phase diagrams. This result directly indicates that AI models failing to incorporate the “cooling rate-amorphous formation capability” dynamic parameter into their feature space will significantly deviate from experimental observations in predicting phase stability. Therefore, future models must integrate three critical components: (1) transient phase transition pathways obtained through in-situ synchrotron radiation XRD; (2) process descriptors characterizing cooling rates and defect densities (including local shear rate, dislocation density, and lattice mismatch degree); (3) non-equilibrium thermodynamic/dynamic databases such as Kinetic—CALPHAD. Crucially, incorporating formation energy of dislocations, excess energy at nanocrystalline boundaries, and multi-component lattice distortion barriers is essential for reliably predicting phase stability in non-equilibrium processes like additive manufacturing or high-energy ball milling.

3.3. Performance Optimization

3.3.1. Synergistic Optimization of High Temperature Strength and Room Temperature Toughness of HEAs

A research team from Beijing University of Science and Technology developed a multi-objective optimization (MOO) framework integrating machine learning, genetic search, cluster analysis, and experimental feedback. This framework was designed to identify the optimal alloy composition [117] for refractory high-entropy alloys (RHEAs) that achieve both high-temperature strength and room-temperature ductility. The study concluded that the Zr-Nb-Mo-Hf-Ta alloy system demonstrated exceptional high-temperature application potential. Specifically, the Zr_0.13Nb_0.27Mo_0.26Hf_0.13Ta_0.21 alloy exhibited a yield strength approaching 940 MPa at 1200 °C and a room-temperature fracture strain of 17.2% [117]. Its remarkable heat resistance and excellent structural stability indicate significant potential for structural applications in high-temperature environments [117].

3.3.2. Hardness Optimization of Al-Co-Cr-Cu-Fe-Ni HEAs

Using machine learning models, researchers screened and prepared 42 high-entropy alloys [117] from a compositional space containing nearly 2 million alloy types. Among these, 35 alloys demonstrated hardness exceeding the highest values in the training samples, achieving 83.3% performance optimization. Notably, 17 alloys saw hardness improvements surpassing 10%, with the most significant enhancement reaching 14% [117].

3.3.3. Comparison and Discussion

In terms of performance optimization, the MOO framework achieved synergistic optimization of high-temperature strength and room-temperature toughness in refractory high-entropy alloys, demonstrating the value of multi-objective optimization in balancing complex performance parameters. Meanwhile, hardness optimization in the Al-Co-Cr-Cu-Fe-Ni system showcased machine learning’s efficiency advantages in high-throughput screening. These examples highlight the importance of integrating multi-objective search algorithms with experimental validation to ensure practical applicability. Notably, the former relies more on experimental feedback and domain knowledge guidance, while the latter emphasizes data-driven exploration of compositional space. Both cases indicate that performance optimization must be closely integrated with practical application scenarios and experimental verification, as purely data-driven approaches or theoretical predictions cannot fully replace experimental validation. In addition, the current approaches are often limited to specific alloy systems and properties. Future research should expand to more complex performance targets, such as fatigue resistance, corrosion behavior, and thermal stability, and incorporate multi-scale modeling to bridge micro-structural features with macroscopic properties.

3.4. Material Screening and Discovery

3.4.1. Cu-Ni-Co-Si HEA System

In a study, Pan et al. [120] developed a data set for the Cu-Ni-Co-Si high-entropy alloy system, evaluating hardness and electrical conductivity as performance metrics. They trained three models: ordinary least squares (OLS), artificial neural networks (ANN), and random forest (RF) [120]. Comparative analysis demonstrated that the RF model exhibited superior predictive accuracy, making it the preferred tool for final predictions [120], as shown in Figure 11. Through predictive analysis of 38,880 potential alloy compositions and process combinations using this RF model, four optimal combinations were identified [120]. Experimental validation successfully produced an alloy composition with low cobalt content (Cu-2.3Ni-0.7Co-0.7Si) while maintaining excellent overall performance [120].

3.4.2. Low Thermal Expansion Coefficient HEA

Rao [121] and his collaborators developed an active learning-based strategy for efficiently screening high-entropy alloys with low thermal expansion coefficients (TEC) [121]. The approach first constructs a potential alloy space using a generative model (GM), then samples 1000 candidate compositions likely to exhibit low TEC [121] through Markov Chain Monte Carlo (MCMC) methods. After further screening the top 10–30 candidates, density functional theory (DFT) and thermodynamic calculations are employed to obtain supplementary input such as magnetization characteristics [121]. An ensemble model evaluates these candidate alloys, selecting the top three for experimental synthesis and testing. If experimental results fail to meet requirements, the new data is fed back into the initial dataset to initiate iterative optimization [121]. Through six rounds of iterations, 18 alloys were synthesized (17 novel components and 1 known component), ultimately identifying two high-entropy alloys demonstrating ultra-low thermal expansion coefficients (approximately

2 \times 10^{- 6} K^{- 1}

) at 300 K [121].

3.4.3. Nb-Ta-Zr-Hf-Mo Refractory HEA System

To explore the optimal balance between high-temperature strength and room-temperature ductility in the Nb-Ta-Zr-Hf-Mo refractory high-entropy alloy system, Wen et al. [122] proposed a machine learning strategy integrating prediction uncertainty analysis with clustering algorithms [122]. This approach employs Expected Improvement (EI) values as the key metric, which simultaneously considers both predicted performance metrics (high-temperature strength and room-temperature ductility) and their uncertainties [122]. By ranking the two EI values for each alloy in the search space, candidates on the Pareto frontier were identified [122]. Subsequent cluster analysis guided the selection of optimal solutions [122]. Ultimately, this strategy successfully led researchers to discover and synthesize four novel alloys that exhibit excellent high-temperature strength and good room-temperature ductility [122].

3.4.4. Single-Phase Refractory HEA

Yan et al. [123] researchers focused on discovering novel single-phase refractory high-entropy alloys. They constructed a dataset containing eight key characteristics and 1807 records, training nine different classification models for prediction [123]. By comparing evaluation metrics such as the F1 score, the Gradient Boost (GB) model demonstrated optimal performance with a classification accuracy rate of 97.37% [123]. Using this GB model, the researchers successfully predicted over 100 potential single-phase, oxidation-resistant refractory high-entropy alloy compositions [123]. Subsequently, they selected and synthesized 10 alloys for experimental verification [123]. X-ray diffraction (XRD) analysis confirmed that all synthesized alloys exhibited single-phase structures, strongly validating the reliability of machine learning predictions [123].

3.4.5. Comparison and Discussion

In material screening and discovery, research teams have employed diverse strategies, including random forest (RF), active learning (AL), ensemble models, and cluster analysis. These approaches have significantly accelerated the identification of Heads-Up Alloys (HEAs) with desirable properties. Pan et al. successfully predicted hardness and electrical conductivity in Cu-Ni-Co-Si systems using RF models, demonstrating the stability of tree-based models in regression tasks. Rao et al. achieved efficient discovery of alloys with low thermal expansion coefficients through a combination of generative models and active learning. Wen and Yan’s team identified novel high-performance materials in refractory high-entropy alloys using uncertainty-guided clustering analysis and gradient boosting classification models, respectively.

However, the reliance on pre-defined feature sets and the limited diversity of training data remain major constraints. Moreover, the transition from prediction to experimental synthesis is not always straightforward, often requiring iterative refinement. Future efforts should prioritize the development of unified, open-source platforms that integrate prediction, synthesis, and characterization into a closed-loop system, enabling more efficient and reproducible material discovery.

3.4.6. Limitations and Mitigation Strategies of Combinatorial Synthesis

While combinatorial synthesis has significantly accelerated the screening efficiency of Heterogeneous Eutectic Alloys (HEAs), its effectiveness remains constrained by three major challenges: compositional gradients, phase separation, and equilibrium conditions. Taking magnetron sputtering of thin films as an example, significant compositional gradients at the 1 μm scale can lead to localized enriched phases that deviate from the single-phase structures predicted by machine learning models. Additionally, laser-melted RHEAs retain metastable FCC phases due to excessively high cooling rates (~10⁴ K/s), though their volume fraction can be optimized through coupled phase-field simulations and active learning algorithms to refine annealing processes. These findings indicate that future combinatorial synthesis must be deeply integrated with real-time characterization techniques (such as synchrotron radiation XRD) and dynamic process simulations to achieve closed-loop optimization through the “design-synthesis-verification” cycle.

4. Challenges of AI Technology in HEA Design

Although AI technology has made significant progress in the design of HEAs, there are still some challenges.

4.1. Data Related Issues

4.1.1. Scarcity of High-Quality Data

High-quality data is crucial for training and optimizing AI models, yet experimental data on HEAs remains relatively scarce [31,57]. Due to their complex multi-component nature, HEAs present significant challenges in experimental preparation and characterization, leading to higher costs [32,124]. This inherently results in less comprehensive data compared to traditional materials. Moreover, the limited data available still requires improvement in quality and consistency. The varying experimental methods, testing conditions, and data recording approaches used by different research teams further undermine comparability and consistency [125,126]. For instance, some studies employ different strain rates and test temperatures when evaluating the mechanical properties of HEAs, making direct data comparison and integration difficult. This scarcity of high-quality data restricts both the quantity and diversity of training samples for AI models, ultimately affecting their performance and generalization capabilities [127].

4.1.2. Data Skew and Lack of Representativeness

Current HEA data often remain concentrated within specific compositional systems or performance ranges, leading to significant data bias [73,128,129]. For instance, extensively studied HEAs predominantly focus on common metal-element combinations, while data on alloys containing rare or special elements remain relatively scarce [73,128]. This imbalance may cause AI models to over-fit specific material types during training while under-performing in predicting other underrepresented systems. Furthermore, insufficient data representativeness increases the risk of model errors when encountering unknown scenarios in practical applications, ultimately diminishing both the reliability and practical utility of these models [49].

4.1.3. Negative-Sample Deficit

In the current published data set of high entropy alloys (HEA), most entries are alloys that have been successfully synthesized by experiments, while the “unsynthetic” examples (negative samples) that have been explicitly disproven by experiments only account for a minority. This imbalance exposes models to insufficient negative examples during training, causing systematic deviations in decision boundaries regarding synthesis feasibility—potentially underestimating the actual feasible region. Without adequate negative sample guidance, models struggle to accurately characterize the critical features distinguishing “synthesizable from unsynthetic” alloys, leading to unreliable predictions when handling compositional fine-tuning or extreme process conditions. Such boundary cognitive bias not only escalates experimental verification costs but also delays the discovery of novel alloy systems, ultimately diminishing the practical value of AI-assisted material design in real-world R&D processes.

4.2. Insufficient Model Interpretation

The lack of interpretability in AI models poses another critical challenge, hindering deep understanding of the physical-chemical mechanisms underlying materials [40,130]. While advanced AI models like deep neural networks excel at predicting HEA properties, their internal decision-making processes and predictive foundations often resemble a “black box”—difficult to explain clearly [131,132]. In HEA design, understanding these physical-chemical mechanisms is crucial for optimizing performance and developing novel materials. For instance, when an AI model predicts excellent mechanical properties in a HEA, researchers seek to identify specific microscopic structural features or inter-atomic interactions that contribute to this performance. However, due to insufficient model interpretability, key information cannot be directly extracted from the models [131,132]. This not only limits researchers’ mastery of traditional material science knowledge but also further restricts its application in material design.

4.3. Cross-Domain Transferability

Current machine learning potential models or performance models are often confined to single crystal lattice types and narrow compositional-valence electron space ranges, resulting in significant cross-system migration gaps. For instance, while extensive research focuses on FCC alloys, training datasets for BCC systems and alloys with varying valence electron concentrations remain relatively scarce. This imbalance causes models to overfit specific crystal lattices and electronic structures during training. Consequently, when applied to target systems with significant differences in crystal types or valence electron concentrations, the model’s reliability plummets, thereby diminishing its practical value in unexplored compositional spaces.

4.4. Extrapolation Risk When Far from the Training Distribution

Current machine learning potential functions (ML-PFs) demonstrate significant limitations in extrapolation performance when operating beyond their original training distribution regions, creating substantial computational bottlenecks. Specifically, most training datasets only cover limited temperature-pressure windows and typical crystal defect configurations. This results in dramatic amplification of energy and force prediction errors when models encounter unexplored configurational spaces such as extremely high pressures, metastable phases, or non-equilibrium defect structures. In these out-of-distribution scenarios, ML-PFs may generate physically inconsistent outputs like negative binding energies or erroneous potential surfaces, leading to dynamic simulation failures or distorted thermodynamic calculations. As the extrapolation distance increases, errors accumulate nonlinearly, ultimately compromising the reliability and practical applicability of ML-PFs in high-value applications such as material design and extreme condition predictions.

4.5. Interdisciplinary Integration Issues

The integration of AI technology with traditional materials science knowledge remains insufficient, necessitating enhanced interdisciplinary collaboration and communication [133,134]. Currently, the application of AI in HEA design is primarily driven by researchers in materials science, while professionals from computer science, mathematics, and related fields participate relatively infrequently [133,134]. This shortage of cross-disciplinary talent often results in researchers struggling to fully leverage cutting-edge AI technologies and advanced methodologies. For instance, some materials scientists may lack sufficient understanding of complex machine learning algorithms and data processing techniques, leading to difficulties in building and optimizing AI models. Conversely, computer scientists might not have deep expertise in HEA’s specialized requirements and practical needs, making it challenging to design AI solutions that fully meet material design demands. Furthermore, differences in terminology, research methodologies, and cognitive frameworks across disciplines increase communication barriers and collaboration costs [135,136], undermining the efficiency and quality of interdisciplinary cooperation. These factors collectively constrain the development and application of AI technology in HEA design.

4.6. Typical Case: “Predictive-Synthetic” Bias

Sharma et al. [12] found in the study of Mg-based high entropy alloys that when the cooling rate reached 10⁴ K/s, the alloy predicted by machine learning as “stable BCC phase” actually underwent amorphization rather than forming a solid solution, resulting in the failure of prediction.

4.6.1. Problem Causes

The reasons for the bias are (1) Incomplete dynamic parameters: Current AI models solely rely on thermodynamic descriptors (e.g., ΔHmix and Ω parameters), neglecting cooling rate (CR) and amorphous formation capability (GFA). (2) Insufficient negative samples: The training set is mostly “synthetic” data, and there is a lack of “failure to synthesize” cases (such as amorphization or phase separation), which leads the model to overestimate the stability boundary.

4.6.2. Solutions

The solution is as follows:

(1): Introduction of Dynamic Descriptors

New Feature: Incorporate cooling rates (CR, unit K/s) and GFA parameters (e.g.,

Δ T ₓ = T_{l} - T ₓ

, where T_l is the liquidus temperature and Tₓ is the amorphization critical temperature) into the model input.

Implementation Method: Obtain transient phase transition pathways under different CRs through synchrotron radiation in-situ XRD, establishing a CR-ΔTₓ-phase structure correlation database.

(2): Negative Sample Augmentation and Active Learning Closed-loop

Based on 500 “synthesis failure” records generated by Kinetic-CALPHAD, supplementary data were added, and high uncertainty samples were prioritized for experimental verification through active learning (BALD index). Process is as follows:

Step 1: Retrained the Gradient Boosting (GB) model with an augmented dataset,

Step 2: During experimental verification, if the synthesis results deviate from the prediction (such as amorphization), they are fed back to the negative sample library and iterated for three rounds to reduce the error.

(3): Cross-scale verification—Digital twin

The influence of cooling rate on FCC/BCC ratio is predicted by phase field simulation, and the comparison with laser melting experimental data verifies the reliability of the model in the process parameter space.

5. Future Development Direction

In the future, the application of AI in the design of HEAs will move from “tool assistance” to “intelligent co-creation”, and its development will be systematically upgraded around four dimensions: algorithm, data, experimental closed-loop and industrialization.

At the algorithmic level, there is a pressing need to develop specialized models for HEA complex feature spaces. Current mainstream approaches predominantly adopt frameworks from image processing and natural language processing. Future efforts should focus on constructing “physics-guided neural networks” that integrate physical constraints, embedding thermodynamic stability, electronic structure characteristics, and diffusion kinetics into loss functions or network architectures to enhance model extrapolation capabilities and physical interpretability. Simultaneously, developing meta-learning and Bayesian optimization frameworks for small-sample scenarios is crucial. By leveraging prior knowledge transfer and uncertainty quantification, these frameworks can mitigate over-fitting risks caused by data scarcity, provide confidence intervals for predictions, and offer risk metrics for experimental decision-making. Furthermore, generative AI will evolve toward multi-modal and multi-objective systems. Through integration of diffusion models and reinforcement learning, this approach enables full-chain reverse design spanning “components—process—micro-structure—performance,” allowing users to input natural language requirements and receive experimentally verifiable candidate material solutions.

Secondly, at the data level, it is essential to establish a high-quality HEAs database covering the “computational-experimental-literature” trinity. On one hand, we should promote standardization of high-throughput computing by creating unified computational protocols and error evaluation systems to ensure the reliability and comparability of simulation data from DFT and molecular dynamics. On the other hand, intelligent experimental data platforms must be developed using IOT sensors, automated lab robots, and blockchain certification technology to enable real-time collection, cleaning, and sharing of experimental data. Simultaneously, large language models (LLMs) should be utilized to extract latent information from massive literature databases, addressing gaps in the “gray literature” and “negative samples” while establishing a dual-loop system integrating literature and databases. Ultimately, this will create a comprehensive knowledge graph for HEAs spanning composition, processing techniques, micro-structures, and service performance, supporting continuous evolution through active learning and generative models.

Thirdly, at the experimental closed-loop level, it is essential to establish a rapid iteration system of “AI prediction-experiment verification-model iteration”. Future laboratories will evolve into AI-experiment collaborative “autonomous material discovery platforms,” achieving full-process automation of material synthesis, testing, and characterization through robotic experimental platforms and online characterization technologies (such as in-situ radiation characterization and atomic probe tomography). AI models will receive real-time experimental feedback, executing the “experiment-characterization-feedback” closed loop 24/7. By leveraging online learning and proactive learning strategies, these models dynamically adjust experimental protocols to maximize information gain per experiment. Simultaneously, digital twin technology should be developed to create virtual experimental environments. This virtual-real interaction approach reduces trial-and-error costs and R&D cycles while enabling predictions of material behavior under extreme conditions (such as ultra-high temperatures and high pressures).

Finally, at the industrial implementation stage, it is crucial to promote large-scale application of AI-designed HEAs in key sectors such as energy, aerospace, and electronics. On one hand, we should develop low-code/no-code AI design platforms tailored for engineers, encapsulating expert knowledge and model templates to enable material engineers to perform composition optimization and process window exploration through drag-and-drop operations. On the other hand, establishing a “materials-process-equipment” collaborative optimization platform is essential. This platform should deeply integrate AI design outcomes with additive manufacturing and powder metallurgy process parameters, effectively addressing performance degradation issues during the transition from laboratory achievements to engineering applications. Additionally, collaborating with equipment manufacturers to establish interface standards for AI design, additive manufacturing/powder metallurgy/spray coating processes will bridge the “last mile” gap between virtual designs and physical components.

Taking Figure 12 as an example, it demonstrates a complete high-throughput material screening workflow comprising five key components: rapid synthesis, sample preparation, mechanical testing, rapid characterization, and the integration of big data with AI/ML models. However, this process still faces three often-overlooked bottlenecks in the HEA field: (1) Data quality and consistency issues. The “rapid characterization” phase in Figure 12 relies on automated equipment, but discrepancies in testing standards across laboratories (e.g., strain rate, temperature control precision) can lead to data bias. Such deviations significantly reduce the generalization capability of AI models, necessitating the establishment of cross-platform calibration protocols. (2) Risk of failure in the active learning closed-loop. While the cycle between “AI/ML models” and “experimental verification” appears efficient in Figure 12, practical applications may fall into local optima due to “negative sample deficiency,” resulting in overly optimistic model predictions. We recommend incorporating a “failed synthesis case recovery mechanism” to incorporate unsuccessful synthesized components into subsequent training rounds. (3) Breakdown in process-performance mapping. The workflow fails to explicitly demonstrate how process parameters (e.g., cooling rates) influence final phase structures, potentially leading to deviations from predicted outcomes. Future improvements should introduce real-time in-situ characterization (such as synchronous radiation XRD) during the “rapid synthesis” phase to input process parameters as dynamic features into the model. In summary, Figure 12’s high-throughput paradigm requires three enhancements to unlock its full potential: ① Establishing cross-platform data calibration standards; ② Enforcing negative sample feedback; ③ Integrating process parameters with real-time characterization. These changes will upgrade the AI experiment loop from an “efficiency tool” to a “reliable discovery engine”.

6. Conclusions

In summary, artificial intelligence is reshaping the design paradigm of HEAs with unprecedented depth and breadth. From iterative theoretical models to explosive growth in experimental data, and through cross-scale and interdisciplinary collaborative innovation, AI has evolved from an “optional tool” to a “core infrastructure”. This paper systematically reviews the latest advancements in AI applications for HEA composition design, phase structure prediction, performance optimization, and new material discovery. Through a series of case studies, it validates the feasibility and superiority of AI methodologies, driving a paradigm shift from “experience-driven” to “data-driven” material design. However, it must be clearly recognized that the current stage remains merely the prelude to “AI + HEA” integrated innovation, with significant gaps remaining before achieving truly intelligent material engineering.

First, data remains the primary productive force. While generative models and transfer learning demonstrate remarkable performance in small-sample scenarios, the multi-component and multi-scale nature of HEAs inherently leads to exponentially expanding performance landscapes. No model can sustain long-term development without continuous feeding from high-quality, large-scale datasets. Moreover, the high cost and heterogeneity of experimental data exacerbate the “small samples, low quality, strong bias” issue, which becomes a critical constraint on model generalization capabilities. In addition, it must be emphasized that AI is still mainly applicable to “interpolative search in the known element space” rather than “global extrapolative invention”; the fundamental limitations that have not been solved are the missing negative samples, cross-system drift and the blind area of extrapolation away from the training distribution.

Secondly, the black box dilemma. The unexplainability of deep models hinders the deep integration of “data-physical” knowledge, making it difficult for researchers to extract scientific laws and physical mechanisms that can guide experimental design from the prediction results, and makes key fields (such as nuclear use and medical use) doubt the reliability of AI-designed materials.

Finally, disciplinary barriers pose significant constraints. The design of HEAs requires interdisciplinary expertise spanning physics, chemistry, metallurgy, computer science, and automation. Teams with single-discipline backgrounds struggle to manage the entire process. The “language barrier” between materials science and AI technologies has prevented advanced algorithms like graph neural networks and reinforcement learning from fully unlocking their potential in material research. Moreover, the shortage of cross-disciplinary talent creates critical gaps in the innovation chain.

To advance future development, AI-driven design of HEAs requires establishing a four-dimensional collaborative paradigm integrating “data-algorithm-experiment-industry”. At the algorithmic level, specialized models for HEAs must be developed, including physics-guided neural networks with physical constraints, meta-learning frameworks, and Bayesian optimization systems. Generative AI should be employed to enable full-chain reverse engineering. The data dimension demands building a high-quality database of HEAs, standardizing high-throughput computing, developing intelligent experimental platforms, and leveraging large language models to extract the literature data for constructing a “knowledge graph” of HEAs. For closed-loop experimentation, an AI-experiment collaborative platform should automate the entire process while implementing digital twin technology to reduce trial-and-error costs and support extreme condition predictions. In industrial implementation, key applications of AI-designed materials should be promoted through low-code platforms and collaborative optimization mechanisms linking “materials-process-equipment”, bridging the gap between virtual designs and physical components.

In summary, the integration of AI with HEAs represents not merely a technological breakthrough but a profound paradigm shift in scientific research. This transformation requires collaborative efforts from materials scientists, computer engineers, industry stakeholders, and policymakers. Through open collaboration, we can establish an innovative ecosystem of “AI + materials” that ultimately enables the vision of designing materials with precision, efficiency, and on-demand adaptability.

Author Contributions

Conceptualization; C.Y.; Methodology; E.X.; Validation; E.X. and C.Y.; Formal analysis; E.X.; Investigation; E.X.; Resources; C.Y.; Data curation; E.X.; Writing—original draft preparation; E.X.; Writing—review; editing; E.X. and C.Y.; Visualization; E.X.; Supervision; C.Y.; Project administration; C.Y.; Funding acquisition; C.Y. All authors have read and agreed to the published version of the manuscript.

Funding

The authors gratefully acknowledge the financial support of Shanghai Natural Science Foundation (25ZR1401430), and Science and Technology Cooperation Program of Shanghai Jiao Tong University in Inner Mongolia Autonomous Region-Action Plan of Shanghai Jiao Tong University for “Revitalizing Inner Mongolia through Science and Technology” (2023XYJG0001-01-01).

Data Availability Statement

No new data were created or analyzed in this study.

Conflicts of Interest

The authors declare no conflict of interest.

References

Yeh, J.-W. Overview of high-entropy alloys. In High-Entropy Alloys: Fundamentals and Applications; Springer: Berlin/Heidelberg, Germany, 2016; pp. 1–19. [Google Scholar]
Wang, X.; Liu, Q.; Wang, X. High-Entropy Materials: From Bulk to Sub-nano. Adv. Funct. Mater. 2025, 35, 2504275. [Google Scholar] [CrossRef]
Schweidler, S.; Botros, M.; Strauss, F.; Wang, Q.; Ma, Y.; Velasco, L.; Cadilha Marques, G.; Sarkar, A.; Kübel, C.; Hahn, H. High-entropy materials for energy and electronic applications. Nat. Rev. Mater. 2024, 9, 266–281. [Google Scholar] [CrossRef]
Sun, J.; Liu, W.; Liang, F.; Liu, R.; Yao, Y.; Zou, Q.; Dong, P.; Wang, S. Insight Into High Entropy Compounds: Advances, Challenges and Energy Applications. Adv. Funct. Mater. 2025, e10855. [Google Scholar] [CrossRef]
Ren, J.-T.; Chen, L.; Wang, H.-Y.; Yuan, Z.-Y. High-entropy alloys in electrocatalysis: From fundamentals to applications. Chem. Soc. Rev. 2023, 52, 8319–8373. [Google Scholar] [CrossRef]
Ren, J.; Kumkale, V.Y.; Hou, H.; Kadam, V.S.; Jagtap, C.V.; Lokhande, P.E.; Pathan, H.M.; Pereira, A.; Lei, H.; Liu, T.X. A review of high-entropy materials with their unique applications. Adv. Compos. Hybrid Mater. 2025, 8, 195. [Google Scholar] [CrossRef] [PubMed]
Li, L.Y.; Zhang, M.; Jiang, M.; Gao, L.H.; Ma, Z.; Cao, M.S. High entropy ceramics for electromagnetic functional materials. Adv. Funct. Mater. 2025, 35, 2416673. [Google Scholar] [CrossRef]
Gu, X.; Guo, X.-B.; Li, W.-H.; Jiang, Y.-P.; Liu, Q.-X.; Tang, X.-G. High-entropy materials for application: Electricity, magnetism, and optics. ACS Appl. Mater. Interfaces 2024, 16, 53372–53392. [Google Scholar] [CrossRef]
Chen, S.; Wang, Y.; Pu, G.; Xue, Y.; Zhang, K.; Huang, Y. High-entropy materials: Controllable synthesis, deep characterization, electrochemical energy application, and outlook. Energy Fuels 2022, 37, 36–57. [Google Scholar] [CrossRef]
Odetola, P.I.; Babalola, B.J.; Afolabi, A.E.; Anamu, U.S.; Olorundaisi, E.; Umba, M.C.; Phahlane, T.; Ayodele, O.O.; Olubambi, P.A. Exploring high entropy alloys: A review on thermodynamic design and computational modeling strategies for advanced materials applications. Heliyon 2024, 10, e39660. [Google Scholar] [CrossRef]
Miao, L.; Sivak, J.T.; Kotsonis, G.; Ciston, J.; Ophus, C.L.; Dabo, I.; Maria, J.P.; Sinnott, S.B.; Alem, N. Chemical Environment and Structural Variations in High Entropy Oxide Thin Film Probed with Electron Microscopy. ACS Nano 2024, 18, 14968–14977. [Google Scholar] [CrossRef]
Sharma, P.; Gandhi, P.M.; Chintersingh, K.L.; Schoenitz, M.; Dreizin, E.L.; Liou, S.C.; Balasubramanian, G. Accelerated intermetallic phase amorphization in a Mg-based high-entropy alloy powder. J. Magnes. Alloys 2024, 12, 1792–1798. [Google Scholar] [CrossRef]
Wang, H.; He, Q.-F.; Yang, Y. High-entropy intermetallics: From alloy design to structural and functional properties. Rare Met. 2022, 41, 1989–2001. [Google Scholar] [CrossRef]
Marques, F.; Balcerzak, M.; Winkelmann, F.; Zepon, G.; Felderhoff, M. Review and outlook on high-entropy alloys for hydrogen storage. Energy Environ. Sci. 2021, 14, 5191–5227. [Google Scholar] [CrossRef]
Karpov, S. Application of high-entropy alloys in hydrogen storage technology. Probl. At. Sci. Technol. 2024, 2, 48–61. [Google Scholar] [CrossRef]
Davies, D.W.; Butler, K.T.; Jackson, A.J.; Morris, A.; Frost, J.M.; Skelton, J.M.; Walsh, A. Computational screening of all stoichiometric inorganic materials. Chem 2016, 1, 617–627. [Google Scholar] [CrossRef] [PubMed]
Wang, P.; Li, J.; Yang, L.; Mo, P.; Wu, Y. Preparation of high-entropy nitride ceramics (TiVCrNbZr1-x) Ny by introducing nitrogen vacancies. J. Asian Ceram. Soc. 2024, 12, 249–256. [Google Scholar] [CrossRef]
Aruchamy, K.; Balasankar, A.; Ramasundaram, S.; Oh, T.H. Recent design and synthesis strategies for high-performance supercapacitors utilizing ZnCo₂O₄-based electrode materials. Energies 2023, 16, 5604. [Google Scholar] [CrossRef]
Kuehmann, C.; Olson, G. Computational materials design and engineering. Mater. Sci. Technol. 2009, 25, 472–478. [Google Scholar] [CrossRef]
Pyzer-Knapp, E.O.; Pitera, J.W.; Staar, P.W.; Takeda, S.; Laino, T.; Sanders, D.P.; Sexton, J.; Smith, J.R.; Curioni, A. Accelerating materials discovery using artificial intelligence, high performance computing and robotics. npj Comput. Mater. 2022, 8, 84. [Google Scholar] [CrossRef]
Panchal, J.H.; Kalidindi, S.R.; McDowell, D.L. Key computational modeling issues in integrated computational materials engineering. Comput.-Aided Des. 2013, 45, 4–25. [Google Scholar] [CrossRef]
Chen, H.-L.; Mao, H.; Chen, Q. Database development and Calphad calculations for high entropy alloys: Challenges, strategies, and tips. Mater. Chem. Phys. 2018, 210, 279–290. [Google Scholar] [CrossRef]
Schmid-Fetzer, R.; Andersson, D.; Chevalier, P.-Y.; Eleno, L.; Fabrichnaya, O.; Kattner, U.; Sundman, B.; Wang, C.; Watson, A.; Zabdyr, L. Assessment techniques, database design and software facilities for thermodynamics and diffusion. Calphad 2007, 31, 38–52. [Google Scholar] [CrossRef]
Chang, Y.A.; Chen, S.; Zhang, F.; Yan, X.; Xie, F.; Schmid-Fetzer, R.; Oates, W.A. Phase diagram calculation: Past, present and future. Prog. Mater. Sci. 2004, 49, 313–345. [Google Scholar] [CrossRef]
Greenaway, R.L.; Jelfs, K.E. Integrating computational and experimental workflows for accelerated organic materials discovery. Adv. Mater. 2021, 33, 2004831. [Google Scholar] [CrossRef]
Talapatra, A.; Boluki, S.; Honarmandi, P.; Solomou, A.; Zhao, G.; Ghoreishi, S.F.; Molkeri, A.; Allaire, D.; Srivastava, A.; Qian, X. Experiment design frameworks for accelerated discovery of targeted materials across scales. Front. Mater. 2019, 6, 82. [Google Scholar] [CrossRef]
Gao, L.; Lin, J.; Wang, L.; Du, L. Machine learning-assisted design of advanced polymeric materials. Acc. Mater. Res. 2024, 5, 571–584. [Google Scholar] [CrossRef]
Li, X.; Lu, L.; Li, J.; Zhang, X.; Gao, H. Mechanical properties and deformation mechanisms of gradient nanostructured metals and alloys. Nat. Rev. Mater. 2020, 5, 706–723. [Google Scholar] [CrossRef]
Gu, D.; Shi, X.; Poprawe, R.; Bourell, D.L.; Setchi, R.; Zhu, J. Material-structure-performance integrated laser-metal additive manufacturing. Science 2021, 372, eabg1487. [Google Scholar] [CrossRef]
Pragya, A.; Ghosh, T.K. Soft functionally gradient materials and structures–natural and manmade: A review. Adv. Mater. 2023, 35, 2300912. [Google Scholar] [CrossRef] [PubMed]
Wan, X.; Li, Z.; Yu, W.; Wang, A.; Ke, X.; Guo, H.; Su, J.; Li, L.; Gui, Q.; Zhao, S. Machine learning paves the way for high entropy compounds exploration: Challenges, progress, and outlook. Adv. Mater. 2023, 37, 2305192. [Google Scholar] [CrossRef] [PubMed]
Yang, Z.; Xiang, X.; Yang, J.; Zhao, Z.-Y. High-entropy oxides as energy materials: From complexity to rational design. Mater. Futures 2024, 3, 042103. [Google Scholar] [CrossRef]
Katiyar, N.K.; Goel, G.; Goel, S. Emergence of machine learning in the development of high entropy alloy and their prospects in advanced engineering applications. Emergent Mater. 2021, 4, 1635–1648. [Google Scholar] [CrossRef]
Wang, H.; Fu, T.; Du, Y.; Gao, W.; Huang, K.; Liu, Z.; Chandak, P.; Liu, S.; Van Katwyk, P.; Deac, A. Scientific discovery in the age of artificial intelligence. Nature 2023, 620, 47–60. [Google Scholar] [CrossRef] [PubMed]
Batra, R.; Song, L.; Ramprasad, R. Emerging materials intelligence ecosystems propelled by machine learning. Nat. Rev. Mater. 2021, 6, 655–678. [Google Scholar] [CrossRef]
Xu, L.; Lux, T.; Chang, T.; Li, B.; Hong, Y.; Watson, L.; Butt, A.; Yao, D.; Cameron, K. Prediction of high-performance computing input/output variability and its application to optimization for system configurations. Qual. Eng. 2021, 33, 318–334. [Google Scholar] [CrossRef]
Ansari, T.S.; Taqvi, S.A.A. State-of-the-art review on the applications of nonlinear and artificial intelligence-based controllers in petrochemical processes. ChemBioEng Rev. 2023, 10, 884–906. [Google Scholar] [CrossRef]
Wang, J.; Zhang, Y. Artificial intelligence in high-entropy materials. Next Mater. 2025, 9, 100993. [Google Scholar] [CrossRef]
Guo, K.; Yang, Z.; Yu, C.-H.; Buehler, M.J. Artificial intelligence and machine learning in design of mechanical materials. Mater. Horiz. 2021, 8, 1153–1172. [Google Scholar] [CrossRef] [PubMed]
Badini, S.; Regondi, S.; Pugliese, R. Unleashing the Power of Artificial Intelligence in Materials Design. Materials 2023, 16, 5927. [Google Scholar] [CrossRef]
Huang, J.; Liew, J.; Ademiloye, A.; Liew, K.M. Artificial intelligence in materials modeling and design. Arch. Comput. Methods Eng. 2021, 28, 3399–3413. [Google Scholar] [CrossRef]
Correa-Baena, J.-P.; Hippalgaonkar, K.; van Duren, J.; Jaffer, S.; Chandrasekhar, V.R.; Stevanovic, V.; Wadia, C.; Guha, S.; Buonassisi, T. Accelerating materials development via automation, machine learning, and high-performance computing. Joule 2018, 2, 1410–1420. [Google Scholar] [CrossRef]
Xie, J.; Su, Y.; Xue, D.; Jiang, X.; Fu, H.; Huang, H. Machine learning for materials research and development. Acta Metall. Sin. 2021, 57, 1343–1361. [Google Scholar]
Xu, D.; Zhang, Q.; Huo, X.; Wang, Y.; Yang, M. Advances in data-assisted high-throughput computations for material design. Mater. Genome Eng. Adv. 2023, 1, e11. [Google Scholar] [CrossRef]
Montoya, J.H.; Aykol, M.; Anapolsky, A.; Gopal, C.B.; Herring, P.K.; Hummelshøj, J.S.; Hung, L.; Kwon, H.-K.; Schweigert, D.; Sun, S. Toward autonomous materials research: Recent progress and future challenges. Appl. Phys. Rev. 2022, 9, e11. [Google Scholar] [CrossRef]
Morgan, D.; Jacobs, R. Opportunities and challenges for machine learning in materials science. Annu. Rev. Mater. Res. 2020, 50, 71–103. [Google Scholar] [CrossRef]
Ng, W.L.; Goh, G.L.; Goh, G.D.; Ten, J.S.J.; Yeong, W.Y. Progress and opportunities for machine learning in materials and processes of additive manufacturing. Adv. Mater. 2024, 36, 2310006. [Google Scholar] [CrossRef]
Xie, T.; Li, W.; Velisa, G.; Chen, S.; Meng, F.; Liaw, P.K.; Tong, Y. An overview of high-throughput synthesis for advanced high-entropy alloys. Mater. Genome Eng. Adv. 2025, 3, e87. [Google Scholar] [CrossRef]
Wang, Q.; Velasco, L.; Breitung, B.; Presser, V. High-entropy energy materials in the age of big data: A critical guide to next-generation synthesis and applications. Adv. Energy Mater. 2021, 11, 2102355. [Google Scholar] [CrossRef]
Shang, Y.; Xiong, Z.; An, K.; Hauch, J.A.; Brabec, C.J.; Li, N. Materials genome engineering accelerates the research and development of organic and perovskite photovoltaics. Mater. Genome Eng. Adv. 2024, 2, e28. [Google Scholar] [CrossRef]
Stier, S.P.; Kreisbeck, C.; Ihssen, H.; Popp, M.A.; Hauch, J.; Malek, K.; Reynaud, M.; Goumans, T.; Carlsson, J.; Todorov, I. Materials acceleration platforms (MAPs): Accelerating materials research and development to meet urgent societal challenges. Adv. Mater. 2024, 36, 2407791. [Google Scholar] [CrossRef]
Zeni, C.; Pinsler, R.; Zügner, D.; Fowler, A.; Horton, M.; Fu, X.; Wang, Z.; Shysheya, A.; Crabbé, J.; Ueda, S.; et al. A generative model for inorganic materials design. Nature 2025, 639, 624–632. [Google Scholar] [CrossRef]
Han, N.; Su, B.-L. AI-driven material discovery for energy, catalysis and sustainability. Natl. Sci. Rev. 2025, 12, nwaf110. [Google Scholar] [CrossRef]
Park, H.; Li, Z.; Walsh, A. Has generative artificial intelligence solved inverse materials design? Matter 2024, 7, 2355–2367. [Google Scholar] [CrossRef]
Zeni, C.; Pinsler, R.; Zügner, D.; Fowler, A.; Horton, M.; Fu, X.; Shysheya, S.; Crabbé, J.; Sun, L.; Smith, J. Mattergen: A generative model for inorganic materials design. arXiv 2023, arXiv:2312.03687. [Google Scholar]
Zhang, Y.; Wen, C.; Wang, C.; Antonov, S.; Xue, D.; Bai, Y.; Su, Y. Phase prediction in high entropy alloys with a rational selection of materials descriptors and machine learning models. Acta Mater. 2020, 185, 528–539. [Google Scholar] [CrossRef]
Liu, S.; Yang, C. Machine learning design for high-entropy alloys: Models and algorithms. Metals 2024, 14, 235. [Google Scholar] [CrossRef]
Ren, P.; Xiao, Y.; Chang, X.; Huang, P.-Y.; Li, Z.; Gupta, B.B.; Chen, X.; Wang, X. A survey of deep active learning. ACM Comput. Surv. (CSUR) 2021, 54, 1–40. [Google Scholar] [CrossRef]
Sun, Y.; Ni, J. Machine learning advances in high-entropy alloys: A mini-review. Entropy 2024, 26, 1119. [Google Scholar] [CrossRef]
Wu, S.; Song, Z.; Wang, J.; Niu, X.; Chen, H. Enhanced phase prediction of high-entropy alloys through machine learning and data augmentation. Phys. Chem. Chem. Phys. 2025, 27, 717–729. [Google Scholar] [CrossRef]
Sun, Y.; Hou, C.; Tran, N.-D.; Lu, Y.; Li, Z.; Chen, Y.; Ni, J. EFTGAN: Elemental features and transferring corrected data augmentation for the study of high-entropy alloys. npj Comput. Mater. 2025, 11, 54. [Google Scholar] [CrossRef]
Sulley, G.A.; Raush, J.; Montemore, M.M.; Hamm, J. Accelerating high-entropy alloy discovery: Efficient exploration via active learning. Scr. Mater. 2024, 249, 116180. [Google Scholar] [CrossRef]
Liu, Y.; Yang, Z.; Yu, Z.; Liu, Z.; Liu, D.; Lin, H.; Li, M.; Ma, S.; Avdeev, M.; Shi, S. Generative artificial intelligence and its applications in materials science: Current situation and future perspectives. J. Mater. 2023, 9, 798–816. [Google Scholar] [CrossRef]
Fuhr, A.S.; Sumpter, B.G. Deep generative models for materials discovery and machine learning-accelerated innovation. Front. Mater. 2022, 9, 865270. [Google Scholar] [CrossRef]
Madika, B.; Saha, A.; Kang, C.; Buyantogtokh, B.; Agar, J.; Wolverton, C.M.; Voorhees, P.; Littlewood, P.; Kalinin, S.; Hong, S. Artificial Intelligence for Materials Discovery, Development, and Optimization. ACS Nano 2025, 19, 27116–27158. [Google Scholar] [CrossRef]
Choudhary, K.; DeCost, B.; Chen, C.; Jain, A.; Tavazza, F.; Cohn, R.; Park, C.W.; Choudhary, A.; Agrawal, A.; Billinge, S.J. Recent advances and applications of deep learning methods in materials science. npj Comput. Mater. 2022, 8, 59. [Google Scholar] [CrossRef]
Maharana, K.; Mondal, S.; Nemade, B. A review: Data pre-processing and data augmentation techniques. Glob. Transit. Proc. 2022, 3, 91–99. [Google Scholar] [CrossRef]
Shorten, C.; Khoshgoftaar, T.M.; Furht, B. Text data augmentation for deep learning. J. Big Data 2021, 8, 101. [Google Scholar] [CrossRef]
Kumar, T.; Brennan, R.; Mileo, A.; Bendechache, M. Image data augmentation approaches: A comprehensive survey and future directions. IEEE Access 2024, 12, 187536–187571. [Google Scholar] [CrossRef]
Kininis, P. Robustness and Domain Generalization in Computer Vision by Using Adversarial Data Augmentation. 2025. Available online: https://dspace.lib.ntua.gr/xmlui/bitstream/handle/123456789/60642/Thesis%20(1).pdf?sequence=1 (accessed on 9 July 2024).
Zhou, C.; Zhang, Y.; Xin, H.; Li, X.; Chen, X. Complex multiphase predicting of additive manufactured high entropy alloys based on data augmentation deep learning. J. Mater. Res. Technol. 2024, 28, 2388–2401. [Google Scholar] [CrossRef]
Yang, Z.; Li, S.; Li, S.; Yang, J.; Liu, D. A two-step data augmentation method based on generative adversarial network for hardness prediction of high entropy alloy. Comput. Mater. Sci. 2023, 220, 112064. [Google Scholar] [CrossRef]
Peivaste, I.; Jossou, E.; Tiamiyu, A.A. Data-driven analysis and prediction of stable phases for high-entropy alloy design. Sci. Rep. 2023, 13, 22556. [Google Scholar] [CrossRef] [PubMed]
Callister Jr, W.D.; Rethwisch, D.G. Materials Science and Engineering: An Introduction; John Wiley & Sons: Hoboken, NJ, USA, 2020. [Google Scholar]
Feng, S.; Zhou, H.; Dong, H. Application of deep transfer learning to predicting crystal structures of inorganic substances. Comput. Mater. Sci. 2021, 195, 110476. [Google Scholar] [CrossRef]
Golbabaei, M.H.; Zohrevand, M.; Zhang, N. Applications of Machine Learning in High-Entropy Alloys: A Review of Recent Advances in Design, Discovery, and Characterization. Nanoscale 2025. Online ahead of print. [Google Scholar] [CrossRef]
Cheng, H.; Wang, C.-L.; Li, X.-D.; Pan, L.; Liang, C.-J.; Liu, W.-J. Machine Learning-Based High Entropy Alloys-Algorithms and Workflow: A Review. Acta Metall. Sin. (Engl. Lett.) 2025, 38, 1453–1480. [Google Scholar] [CrossRef]
Kaufmann, K.; Maryanovsky, D.; Mellor, W.M.; Zhu, C.; Rosengarten, A.S.; Harrington, T.J.; Oses, C.; Toher, C.; Curtarolo, S.; Vecchio, K.S. Discovery of high-entropy ceramics via machine learning. npj Comput. Mater. 2020, 6, 42. [Google Scholar] [CrossRef]
Li, R.; Xie, L.; Wang, W.Y.; Liaw, P.K.; Zhang, Y. High-throughput calculations for high-entropy alloys: A brief review. Front. Mater. 2020, 7, 290. [Google Scholar] [CrossRef]
Li, J.; Xie, B.; Fang, Q.; Liu, B.; Liu, Y.; Liaw, P.K. High-throughput simulation combined machine learning search for optimum elemental composition in medium entropy alloy. J. Mater. Sci. Technol. 2021, 68, 70–75. [Google Scholar] [CrossRef]
Rittiruam, M.; Noppakhun, J.; Setasuban, S.; Aumnongpho, N.; Sriwattana, A.; Boonchuay, S.; Saelee, T.; Wangphon, C.; Ektarawong, A.; Chammingkwan, P. High-throughput materials screening algorithm based on first-principles density functional theory and artificial neural network for high-entropy alloys. Sci. Rep. 2022, 12, 16653. [Google Scholar] [CrossRef]
Conway, P.L.; Klaver, T.; Steggo, J.; Ghassemali, E. High entropy alloys towards industrial applications: High-throughput screening and experimental investigation. Mater. Sci. Eng. A 2022, 830, 142297. [Google Scholar] [CrossRef]
Mooraj, S.; Chen, W. A review on high-throughput development of high-entropy alloys by combinatorial methods. J. Mater. Inform. 2023, 3, 4. [Google Scholar] [CrossRef]
Liu, Y.; Wang, J.; Xiao, B.; Shu, J. Accelerated development of hard high-entropy alloys with data-driven high-throughput experiments. J. Mater. Inform. 2022, 2, 3. [Google Scholar] [CrossRef]
Ong, S.P. Accelerating materials science with high-throughput computations and machine learning. Comput. Mater. Sci. 2019, 161, 143–150. [Google Scholar] [CrossRef]
Chen, C.; Nguyen, D.T.; Lee, S.J.; Baker, N.A.; Karakoti, A.S.; Lauw, L.; Owen, C.; Mueller, K.T.; Bilodeau, B.A.; Murugesan, V. Accelerating computational materials discovery with machine learning and cloud high-performance computing: From large-scale screening to experimental validation. J. Am. Chem. Soc. 2024, 146, 20009–20018. [Google Scholar] [CrossRef]
Shahzad, K.; Mardare, A.I.; Hassel, A.W. Accelerating materials discovery: Combinatorial synthesis, high-throughput characterization, and computational advances. Sci. Technol. Adv. Mater. Methods 2024, 4, 2292486. [Google Scholar] [CrossRef]
Naghdi, A.H.; Massa, D.; Karimi, K.; Papanikolaou, S. High Entropy Alloy Composition Design for Mechanical Properties. 2024. Available online: https://www.intechopen.com/online-first/1172734 (accessed on 8 September 2025).
Huo, W.; Wang, S.; Dominguez-Gutierrez, F.J.; Ren, K.; Kurpaska, Ł.; Fang, F.; Papanikolaou, S.; Kim, H.S.; Jiang, J. High-entropy materials for electrocatalytic applications: A review of first principles modeling and simulations. Mater. Res. Lett. 2023, 11, 713–732. [Google Scholar] [CrossRef]
Sun, Y.; Dai, S. High-entropy materials for catalysis: A new frontier. Sci. Adv. 2021, 7, eabg1600. [Google Scholar] [CrossRef]
Shi, X.; Zhang, G.; Lu, Y.; Pang, H. Applications of machine learning in electrochemistry. Renewables 2023, 1, 668–693. [Google Scholar] [CrossRef]
Feng, R.; Zhang, C.; Gao, M.C.; Pei, Z.; Zhang, F.; Chen, Y.; Ma, D.; An, K.; Poplawsky, J.D.; Ouyang, L. High-throughput design of high-performance lightweight high-entropy alloys. Nat. Commun. 2021, 12, 4329. [Google Scholar] [CrossRef] [PubMed]
Miracle, D.B.; Li, M.; Zhang, Z.; Mishra, R.; Flores, K.M. Emerging capabilities for the high-throughput characterization of structural materials. Annu. Rev. Mater. Res. 2021, 51, 131–164. [Google Scholar] [CrossRef]
Liu, X.; Liu, B.; Ding, J.; Deng, Y.; Han, X.; Zhong, C.; Hu, W. Building a library for catalysts research using high-throughput approaches. Adv. Funct. Mater. 2022, 32, 2107862. [Google Scholar] [CrossRef]
Green, M.L.; Choi, C.; Hattrick-Simpers, J.; Joshi, A.; Takeuchi, I.; Barron, S.; Campo, E.; Chiang, T.; Empedocles, S.; Gregoire, J. Fulfilling the promise of the materials genome initiative with high-throughput experimental methodologies. Appl. Phys. Rev. 2017, 4, 011105. [Google Scholar] [CrossRef]
Wang, D.; Jiang, W.; Li, S.; Yan, X.; Wu, S.; Qiu, H.; Guo, S.; Zhu, B. A comprehensive review on combinatorial film via high-throughput techniques. Materials 2023, 16, 6696. [Google Scholar] [CrossRef]
Jain, S.; Jain, R.; Kumar, V.; Samal, S. Data-driven design of high bulk modulus high entropy alloys using machine learning. J. Alloys Metall. Syst. 2024, 8, 100128. [Google Scholar] [CrossRef]
Eldabah, N.M.; Pratap, A.; Pandey, A.; Sardana, N.; Sidhu, S.S.; Gepreel, M.A.H. Design Approaches of High-Entropy Alloys Using Artificial Intelligence: A Review. Adv. Eng. Mater. 2025, 27, 2402504. [Google Scholar] [CrossRef]
Shenai, P.M.; Xu, Z.; Zhao, Y. Applications of principal component analysis (PCA) in materials science. Princ. Compon. Anal. Appl. 2012, 25–40. [Google Scholar]
Rajan, K. Materials informatics. Mater. Today 2005, 8, 38–45. [Google Scholar] [CrossRef]
Vazquez, G.; Sauceda, D.; Arróyave, R. Deciphering chemical ordering in High Entropy Materials: A machine learning-accelerated high-throughput cluster expansion approach. Acta Mater. 2024, 276, 120137. [Google Scholar] [CrossRef]
Lee, K.; Ayyasamy, M.V.; Delsa, P.; Hartnett, T.Q.; Balachandran, P.V. Phase classification of multi-principal element alloys via interpretable machine learning. npj Comput. Mater. 2022, 8, 25. [Google Scholar] [CrossRef]
Chanda, B.; Jana, P.P.; Das, J. A tool to predict the evolution of phase and Young’s modulus in high entropy alloys using artificial neural network. Comput. Mater. Sci. 2021, 197, 110619. [Google Scholar] [CrossRef]
Yao, Y.; Dong, Q.; Brozena, A.; Luo, J.; Miao, J.; Chi, M.; Wang, C.; Kevrekidis, I.G.; Ren, Z.J.; Greeley, J. High-entropy nanoparticles: Synthesis-structure-property relationships and data-driven discovery. Science 2022, 376, eabn3103. [Google Scholar] [CrossRef]
Anand, A.; Liu, S.-J.; Singh, C.V. Recent advances in computational design of structural multi-principal element alloys. Iscience 2023, 26, 107751. [Google Scholar] [CrossRef] [PubMed]
Rahman, A.; Hossain, M.S.; Siddique, A.-B. Review: Machine learning approaches for diverse alloy systems. J. Mater. Sci. 2025, 60, 12189–12221. [Google Scholar] [CrossRef]
Risal, S.; Zhu, W.; Guillen, P.; Sun, L. Improving phase prediction accuracy for high entropy alloys with machine learning. Comput. Mater. Sci. 2021, 192, 110389. [Google Scholar] [CrossRef]
Zhou, T.; Song, Z.; Sundmacher, K. Big data creates new opportunities for materials research: A review on methods and applications of machine learning for materials design. Engineering 2019, 5, 1017–1026. [Google Scholar] [CrossRef]
Ramprasad, R.; Batra, R.; Pilania, G.; Mannodi-Kanakkithodi, A.; Kim, C. Machine learning in materials informatics: Recent applications and prospects. npj Comput. Mater. 2017, 3, 54. [Google Scholar] [CrossRef]
Dixit, S.; Rodriguez, S.; Jones, M.R.; Buzby, P.; Dixit, R.; Argibay, N.; DelRio, F.W.; Lim, H.H.; Fleming, D. Refractory High-Entropy Alloy Coatings for High-Temperature Aerospace and Energy Applications. J. Therm. Spray Technol. 2022, 31, 1021–1031. [Google Scholar] [CrossRef]
Ubaidy, S.K.A.; Bouraoui, C. High-Entropy Alloys: Advantages and Applications in Challenging Environments. Ann. Chim.—Sci. Des Matériaux 2024, 48, 125–136. [Google Scholar] [CrossRef]
Li, J.; Xiong, W.; Zhang, T.; Cheng, H.; Shen, K.; He, M.; Zhang, Y.; Song, J.; Deng, Y.; Chen, Q. Machine Learning and Explainable AI-Guided Design and Optimization of High-Entropy Alloys as Binder Phases for WC-Based Cemented Carbides. Comput. Mater. Contin. 2025, 84, 2189–2216. [Google Scholar] [CrossRef]
Yuan, J.; Li, Z.; Yang, Y.; Yin, A.; Li, W.; Sun, D.; Wang, Q. Applications of machine learning method in high-performance materials design: A review. J. Mater. Inform. 2024, 4, 14. [Google Scholar] [CrossRef]
Thike, P.H.; Zhao, Z.; Shi, P.; Jin, Y. Significance of artificial neural network analytical models in materials’ performance prediction. Bull. Mater. Sci. 2020, 43, 211. [Google Scholar] [CrossRef]
Fu, Z.; Liu, W.; Huang, C.; Mei, T. A Review of Performance Prediction Based on Machine Learning in Materials Science. Nanomaterials 2022, 12, 2957. [Google Scholar] [CrossRef] [PubMed]
Debnath, A.; Krajewski, A.M.; Sun, H.; Lin, S.; Ahn, M.; Li, W.; Priya, S.; Singh, J.; Shang, S.; Beese, A.M.; et al. Generative deep learning as a tool for inverse design of high entropy refractory alloys. J. Mater. Inform. 2021, 1, 3. [Google Scholar] [CrossRef]
Wen, C.; Zhang, Y.; Wang, C.; Huang, H.; Wu, Y.; Lookman, T.; Su, Y. Machine-Learning-Assisted Compositional Design of Refractory High-Entropy Alloys with Optimal Strength and Ductility. Engineering 2025, 46, 214–223. [Google Scholar] [CrossRef]
Zhang, Y.; Wen, C.; Dang, P.; Jiang, X.; Xue, D.; Su, Y. Elemental numerical descriptions to enhance classification and regression model performance for high-entropy alloys. npj Comput. Mater. 2025, 11, 75. [Google Scholar] [CrossRef]
Chen, C.; Zhou, H.; Long, W.; Wang, G.; Ren, J. Phase prediction for high-entropy alloys using generative adversarial network and active learning based on small datasets. Sci. China Technol. Sci. 2023, 66, 3615–3627. [Google Scholar] [CrossRef]
Pan, S.; Wang, Y.; Yu, J.; Yang, M.; Zhang, Y.; Wei, H.; Chen, Y.; Wu, J.; Han, J.; Wang, C.; et al. Accelerated discovery of high-performance Cu-Ni-Co-Si alloys through machine learning. Mater. Des. 2021, 209, 109929. [Google Scholar] [CrossRef]
Rao, Z.; Tung, P.-Y.; Xie, R.; Wei, Y.; Zhang, H.; Ferrari, A.; Klaver, T.; Körmann, F.; Sukumar, P.T.; da Silva, A.K.; et al. Machine learning-enabled high-entropy alloy discovery. Science 2022, 378, 78–85. [Google Scholar] [CrossRef]
Wen, C.; Shen, H.; Tian, Y.; Lou, G.; Wang, N.; Su, Y. Accelerated discovery of refractory high-entropy alloys for strength-ductility co-optimization: An exploration in NbTaZrHfMo system by machine learning. Scr. Mater. 2024, 252, 116240. [Google Scholar] [CrossRef]
Yonggang, Y.; Dan, L.; Kun, W. Accelerated discovery of single-phase refractory high entropy alloys assisted by machine learning. Comput. Mater. Sci. 2021, 199, 110723. [Google Scholar] [CrossRef]
Yu, F.; Wang, Y.; Zhang, K.; Shen, H.; Zhang, Y.; Yang, Z.; Zeng, G.; Cui, D.; Xia, J.; Liu, J. High entropy MXenes in energy storage: Structural design, characterization, and applications. J. Mater. Chem. A 2025. [Google Scholar] [CrossRef]
Hauser, D.J.; Moss, A.J.; Rosenzweig, C.; Jaffe, S.N.; Robinson, J.; Litman, L. Evaluating CloudResearch’s Approved Group as a solution for problematic data quality on MTurk. Behav. Res. Methods 2023, 55, 3953–3964. [Google Scholar] [CrossRef]
Wuest, T.; Tinscher, R.; Porzel, R.; Thoben, K.-D. Experimental research data quality in materials science. arXiv 2015, arXiv:1501.01149. [Google Scholar] [CrossRef]
Abdalla, H.B.; Kumar, Y.; Marchena, J.; Guzman, S.; Awlla, A.; Gheisari, M.; Cheraghy, M. The Future of Artificial Intelligence in the Face of Data Scarcity. Comput. Mater. Contin. 2025, 84, 1073–1099. [Google Scholar] [CrossRef]
Han, L.; Zhu, S.; Rao, Z.; Scheu, C.; Ponge, D.; Ludwig, A.; Zhang, H.; Gutfleisch, O.; Hahn, H.; Li, Z. Multifunctional high-entropy materials. Nat. Rev. Mater. 2024, 9, 846–865. [Google Scholar] [CrossRef]
Oses, C.; Toher, C.; Curtarolo, S. High-entropy ceramics. Nat. Rev. Mater. 2020, 5, 295–309. [Google Scholar] [CrossRef]
Bai, X.; Zhang, X. Artificial intelligence-powered materials science. Nano-Micro Lett. 2025, 17, 135. [Google Scholar] [CrossRef]
Kibrete, F.; Trzepieciński, T.; Gebremedhen, H.S.; Woldemichael, D.E. Artificial intelligence in predicting mechanical properties of composite materials. J. Compos. Sci. 2023, 7, 364. [Google Scholar] [CrossRef]
Su, Y.; Wang, X.; Ye, Y.; Xie, Y.; Xu, Y.; Jiang, Y.; Wang, C. Automation and machine learning augmented by large language models in a catalysis study. Chem. Sci. 2024, 15, 12200–12233. [Google Scholar] [CrossRef]
DeCost, B.L.; Hattrick-Simpers, J.R.; Trautt, Z.; Kusne, A.G.; Campo, E.; Green, M. Scientific AI in materials science: A path to a sustainable and scalable paradigm. Mach. Learn. Sci. Technol. 2020, 1, 033001. [Google Scholar] [CrossRef]
Sha, W.; Guo, Y.; Yuan, Q.; Tang, S.; Zhang, X.; Lu, S.; Guo, X.; Cao, Y.-C.; Cheng, S. Artificial intelligence to power the future of materials science and engineering. Adv. Intell. Syst. 2020, 2, 1900143. [Google Scholar] [CrossRef]
Lowe, P.; Phillipson, J. Barriers to research collaboration across disciplines: Scientific paradigms and institutional practices. Environ. Plan. A 2009, 41, 1171–1184. [Google Scholar] [CrossRef]
Dalieva, M. Interoperation of Language, Scientific Terminology, and Interdisciplinary Collaboration. West. Eur. J. Linguist. Educ. 2024, 2, 1–4. [Google Scholar]
Gianola, D.S.; della Ventura, N.M.; Balbus, G.H.; Ziemke, P.; Echlin, M.P.; Begley, M.R. Advances and opportunities in high-throughput small-scale mechanical testing. Curr. Opin. Solid State Mater. Sci. 2023, 27, 101090. [Google Scholar] [CrossRef]

Figure 1. Classifying high-entropy alloys according to mixed entropy, Adapted from Ref. [10].

Figure 2. Four core effects of HEAs, Adapted from Ref. [10].

Figure 3. Key challenges in the development of HEMs design, Adapted from Ref. [38].

Figure 4. A machine learning program loop used to predict the hardness of light alloys, Adapted from Ref. [10].

Figure 5. Schematic diagram of the material inverse design generation model based on conditional generative adversarial networks, Reprinted from Ref. [116]. (A) Adversarial training process: G (Generator) learns to map random latent vectors to alloy compositions with target properties, while D (Discriminator) learns to distinguish between generated and real compositions. The generator and discriminator compete against each other to pursue better performance. (B) After training, G is employed for inverse design by sampling the latent space to generate candidate alloys that satisfy predefined property criteria.

Figure 6. Comparison between real (top row) and generated (bottom row) works, Reprinted from Ref. [116]. (A) Element pair correlation. Red values indicate a higher likelihood of element pairs appearing in high-entropy alloy (HEA) compositions, while blue values suggest lower occurrence probability. (B) Quantities of different elements in each alloy. (C) Sample composition illustration. Each column represents an alloy, arranged by elemental abundance density. Blue intensity reflects the atomic proportion of each element in the alloy.

Figure 7. Strategies for generating element numerical descriptors for specific problems, Reprinted from Ref. [118]. (a) The common approach combines elemental composition with numerical descriptors to construct material characteristics. (b) Established numerical descriptors (e.g., radius and valence electron concentration) only occupy a small portion of the optimization space. Generating more precise numerical descriptors can enhance model performance and scalability. (c) Given the enormous size of the optimization space, a customized genetic algorithm framework is employed to produce higher-quality element numerical descriptors.

Figure 8. Comparison of classifier performance based on the element numerical description feature proposed by us, the selected features from correlation analysis above and traditional empirical features, Reprinted from Ref. [118]. VEC (valence electron concentration) is calculated as the weighted average of valence electrons per atom. δR (atomic radius difference) and δER (electronegativity difference) are computed using Goldschmidt radii and Pauling electronegativity, respectively. * marks that the features in the elements numerical description of generated are superior to those selected by correlation analysis and traditional empirical features.

Figure 9. Schematic diagram of the EFTGAN model architecture. The green module represents the ECNet model, Reprinted from Ref. [61], which extracts material element features through element convolution operations. Principal component analysis (PCA) is applied to reduce feature dimensionality to enhance generator performance. The purple module contains the reduced element features, while the blue module corresponds to the InfoGAN model. During iterative training, a multi-layer perceptron predicts the generated features from InfoGAN and feeds the predictions back into the InfoGAN training process.

Figure 10. Machine learning predictions of

E_{form}

and

m_{s}

for

{({Cr}_{0.25} {Pd}_{0.75})}_{0.45} {({Fe}_{x} {Co}_{y} {Ni}_{z})}_{0.55}

systems, Reprinted from Ref. [61]. (a) displays the

E_{form}

of the FCC single-phase solid solutions (SPSS) system, (b) displays the

E_{form}

of the BCC SPSS system and (c) displays the

E_{form}

difference between the FCC phase and the BCC phase for the same composition; (d) displays the ms of the FCC system, (e) displays the ms of the BCC system and (f) displays the magnetic moments of relatively stable phases in FCC and BCC systems for the same composition; (g) displays the free energies of FCC system in 400 K and 800 K, (h) displays the free energies of BCC system in 400 K and 800 K. DFT-calculated

E_{form}

(formation energy in eV/atom relative to elemental references) and

m_{s}

(magnetic moment in μB/atom) for

{({Cr}_{0.25} {Pd}_{0.75})}_{0.45} {({Fe}_{x} {Co}_{y} {Ni}_{z})}_{0.55}

systems. The color band to the right of each figure displays the values represented by the colors displayed in each figure. The color map indicates stability (red: more stable) and magnetization intensity (blue: higher ms).

Figure 10. Machine learning predictions of

E_{form}

and

m_{s}

for

{({Cr}_{0.25} {Pd}_{0.75})}_{0.45} {({Fe}_{x} {Co}_{y} {Ni}_{z})}_{0.55}

systems, Reprinted from Ref. [61]. (a) displays the

E_{form}

of the FCC single-phase solid solutions (SPSS) system, (b) displays the

E_{form}

of the BCC SPSS system and (c) displays the

E_{form}

difference between the FCC phase and the BCC phase for the same composition; (d) displays the ms of the FCC system, (e) displays the ms of the BCC system and (f) displays the magnetic moments of relatively stable phases in FCC and BCC systems for the same composition; (g) displays the free energies of FCC system in 400 K and 800 K, (h) displays the free energies of BCC system in 400 K and 800 K. DFT-calculated

E_{form}

(formation energy in eV/atom relative to elemental references) and

m_{s}

(magnetic moment in μB/atom) for

{({Cr}_{0.25} {Pd}_{0.75})}_{0.45} {({Fe}_{x} {Co}_{y} {Ni}_{z})}_{0.55}

systems. The color band to the right of each figure displays the values represented by the colors displayed in each figure. The color map indicates stability (red: more stable) and magnetization intensity (blue: higher ms).

Figure 11. Prediction effects of hardness and electrical conductivity by each model, Reprinted from Ref. [120]. (a) random forest (RF)-hardness, (b) artificial neural network (ANN)-hardness, (c) ordinary least square (OLS)-hardness, (d) RF-electrical conductivity, (e) ANN-electrical conductivity, (f) OLS-electrical conductivity.

Figure 12. A high-throughput workflow for material selection includes: rapid synthesis method, test sample manufacturing, mechanical testing, rapid material characterization, and processing of big data streams combined with AI/ML models, Reprinted from Ref. [137].

Table 1. Metallurgical translation of core algorithms.

Category of Algorithm	Metallurgical Analogy	The Role in the Study
Random Forest (RF)	The median is taken after tensile tests with multiple furnace cycles and sampling points	Robust regression or classification baseline
Gradient Boosting (GB)	Continuous refining: each round of remelting for residual error	Single phase/multi-phase classification, F1 highest
Deep neural network(DNN)	High temperature diffusion: inter-layer weights, such as diffusion channels	Mechanical performance end-to-end mapping
Conditions generate adversarial networks(CGAN)	Oriented solidification: Generator = mold, discriminator = quality control	Generate alloy composition on demand
Active learning(AL)	Additional sampling at key experimental points	Pick the alloy with the most information for the experiment under a small sample
Transfer learning(TL)	The strengthening mechanism of low-carbon steel is transferred to high-entropy steel	Accelerate the modeling of new systems using known alloy knowledge

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Xie, E.; Yang, C. AI Design for High Entropy Alloys: Progress, Challenges and Future Prospects. Metals 2025, 15, 1012. https://doi.org/10.3390/met15091012

AMA Style

Xie E, Yang C. AI Design for High Entropy Alloys: Progress, Challenges and Future Prospects. Metals. 2025; 15(9):1012. https://doi.org/10.3390/met15091012

Chicago/Turabian Style

Xie, Enzhi, and Chao Yang. 2025. "AI Design for High Entropy Alloys: Progress, Challenges and Future Prospects" Metals 15, no. 9: 1012. https://doi.org/10.3390/met15091012

APA Style

Xie, E., & Yang, C. (2025). AI Design for High Entropy Alloys: Progress, Challenges and Future Prospects. Metals, 15(9), 1012. https://doi.org/10.3390/met15091012

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

AI Design for High Entropy Alloys: Progress, Challenges and Future Prospects

Abstract

1. Introduction

1.1. High Entropy Alloys

1.2. The Thermodynamic-Dynamic Controversy

1.3. Material Design

2. AI Technology in HEA Design

2.1. Algorithm Principle and Applicability Analysis

2.1.1. Random Forest and Gradient Boosting—Algorithmic “Multi-Burn-In”

2.1.2. Deep Neural Network—“Diffusion Channel” Perspective

2.1.3. Conditions Generate Adversarial Networks—“Reverse Design Casting”

2.1.4. Active Learning—“Sampling Strategy”

2.1.5. Transfer Learning—“Experience Transfer”

2.2. Machine Learning Model

2.3. Data Processing and Analysis

2.4. Performance Prediction

2.5. Limitations of DFT and MD in HEA Modeling

2.5.1. Calculate the Expansion Law of Cost with the Number of Master Elements

2.5.2. The Accuracy of Phase Stability Prediction

3. Application Cases of AI in HEA Design

3.1. Component Design

3.1.1. Application of the Generation Model in Refractory HEA Design

3.1.2. Design of Multi-Objective Optimization Framework for Refractory HEAs

3.1.3. Comparison and Discussion

3.2. Phase Structure Prediction

3.2.1. Classification and Prediction of HEA Phase Composition by Deep Learning Algorithm

3.2.2. Combination of Conditional Generation Adversarial Network and Active Learning

3.2.3. Element Feature Transfer Adversarial Network

3.2.4. Comparison and Discussion

3.2.5. The Gap Between Machine Learning Predictions and Real Synthetic Dynamics

3.3. Performance Optimization

3.3.1. Synergistic Optimization of High Temperature Strength and Room Temperature Toughness of HEAs

3.3.2. Hardness Optimization of Al-Co-Cr-Cu-Fe-Ni HEAs

3.3.3. Comparison and Discussion

3.4. Material Screening and Discovery

3.4.1. Cu-Ni-Co-Si HEA System

3.4.2. Low Thermal Expansion Coefficient HEA

3.4.3. Nb-Ta-Zr-Hf-Mo Refractory HEA System

3.4.4. Single-Phase Refractory HEA

3.4.5. Comparison and Discussion

3.4.6. Limitations and Mitigation Strategies of Combinatorial Synthesis

4. Challenges of AI Technology in HEA Design

4.1. Data Related Issues

4.1.1. Scarcity of High-Quality Data

4.1.2. Data Skew and Lack of Representativeness

4.1.3. Negative-Sample Deficit

4.2. Insufficient Model Interpretation

4.3. Cross-Domain Transferability

4.4. Extrapolation Risk When Far from the Training Distribution

4.5. Interdisciplinary Integration Issues

4.6. Typical Case: “Predictive-Synthetic” Bias

4.6.1. Problem Causes

4.6.2. Solutions

5. Future Development Direction

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI