1. Introduction
In recent years, machine learning (ML) technologies have demonstrated growing value and potential in earth sciences, particularly in large-scale mineral prediction [
1]. This trend arises from the global increase in mineral resource demand and the challenges faced by traditional exploration methods when handling complex, multi-source geological data [
2]. Although conventional mineral prediction approaches, like geostatistics and weights of evidence, have achieved remarkable results in the past, their efficiency and accuracy may decline when confronted with massive, high-dimensional, and nonlinear geological data [
3]. Against this backdrop, machine learning emerges as a powerful data-driven tool. It can automatically identify complex patterns, optimize target area selection, and predict mineral resource potential, excelling even when dealing with intricate, multidimensional datasets [
4].
Traditional large-scale mineral prediction methods, such as geological analogy and empirical qualitative analysis, encounter technical bottlenecks when facing multi-source heterogeneous geological data [
5]. Firstly, data from geology, geochemistry, geophysics, and remote sensing vary in format and quality, with significant missing values and noise. Traditional methods struggle to integrate these complex data effectively, leading to the isolation of information [
6]. Secondly, mineralization is a highly nonlinear process involving multiple factors. The relationships between ore-controlling factors are intricate and hidden [
7]. Traditional methods, such as linear regression models, have difficulty capturing complex nonlinear relationships. However, there are other methods, such as geostatistical methods (for example, the Kriging method) and simulation methods (for example, Monte Carlo simulation), which do not rely on linear regression but still have limitations when dealing with complex geological data [
8,
9]. For instance, predicting iron ore geological bodies requires complex processing of magnetic anomaly data using nonlinear inversion models, with initial model accuracy highly dependent on prior geological knowledge [
10]. Moreover, traditional methods are inefficient in processing large datasets and lack objective quantification and uncertainty assessment of predictions, restricting their application in large-scale, high-throughput exploration [
11]. While some traditional methods, such as Sequential Gaussian Simulation (SGS), can provide uncertainty assessments for predictions, they often struggle with the complexity and scale of modern geological datasets. In contrast, machine learning methods, especially deep learning techniques, have shown significant potential in handling these challenges and providing more robust uncertainty assessments. The application of machine learning technology in the field of earth science has gradually become a research hotspot, especially in mineral prediction. Zhou Yongzhang et al. (2018) systematically expounded the theoretical basis of machine learning methods and their applications in geology in “Big Data Mining and Machine Learning in Earth Sciences”, laying a solid foundation for subsequent research [
12].
The advancement of multi-source data fusion technology has also brought about a revolutionary change in mineral prediction. The integration of multi-source geological data such as remote sensing, geophysical, and geochemical data has significantly improved the success rate of mineral prediction. Meanwhile, intelligent feature selection algorithms have greatly enhanced the efficiency of data processing. In the aspect of mineral regularity modeling, Long Short-Term Memory (LSTM) models have been used to handle time-series geochemical data, and deep learning models also show good adaptability in dealing with complex geological conditions [
13].
Despite significant progress in mineral prediction using machine learning, many challenges remain:
(1): The lack of standardized geological datasets makes it difficult to effectively compare research results.
(2): Acquiring labeled data within actual geological circumstances entails significant costs and demands a substantial amount of time.
(3): There is a need to develop models that can adapt to mineral prediction in different geological environments.
In an attempt to tackle these difficulties, researchers are progressively concentrating on unsupervised learning techniques (for example, autoencoders and clustering), transfer learning, and few-shot learning [
5,
14]. The interaction between machine learning and geological exploration is growing in significance and represents one of the most hopeful research avenues in mineral resource exploration. A multitude of studies have validated the efficacy of machine learning approaches in both laboratory settings and real-world applications, demonstrating robust abilities in regional mineral potential evaluation and the demarcation of specific target areas [
15].
This review aims to answer the following key research questions:
Which machine learning algorithm families currently dominate large-scale mineral prediction?
What geological tasks, such as mineral information extraction, target area selection, mineral regularity modeling, or resource potential evaluation, are most commonly addressed in these studies?
Have there been observable shifts in research methods between 2016 and 2025, including in terms of article types, methods used, or application fields?
The objective of this review is to comprehensively outline the current state of machine learning applications in large-scale mineral prediction analysis, with a particular focus on modern solutions developed between 2016 and 2025. This encompasses traditional approaches like support vector machines, k-nearest neighbors, and decision trees, along with innovative deep learning methods such as convolutional neural networks, long short-term memory networks, and autoencoders. It is worth noting that while autoencoders have been recently applied as a deep learning technique in mineral exploration and modelling, they are not new—this method was introduced approximately forty years ago.
This review examines 255 scientific publications indexed in the Web of Science database from 2016 to 2025, which include keywords related to “mineral prediction” and “machine learning.” Throughout the analysis timeframe, there has been a substantial surge in the quantity of publications within this domain. This notable uptick vividly demonstrates a burgeoning interest among the scientific and industrial sectors in leveraging artificial intelligence for mineral exploration and forecasting. This review also reveals research gaps and potential future directions, including developing mineral prediction methods adaptable to different geological environments, utilizing few-shot learning for transfer learning with small sample data, and creating open and diverse benchmark datasets. An important direction is the comprehensive integration of machine learning methods with geological exploration, mineral resource development, and management systems to align with the concept of smart mines and a digital Earth.
This paper is structured in the following manner.
Section 2 sets out the principles and procedures used to identify and screen the literature.
Section 3 offers a thematic integration of the examination of machine-learning techniques and relevant research. The outcomes of quantitative and bibliometric analyses are presented in
Section 4.
Section 5 delineates the future research directions and the limitations of the study. Finally,
Section 6 sums up the ultimate conclusions of this review.
2. Materials and Methods
This paper conducts a comprehensive literature review regarding the utilization of machine-learning approaches in large-scale mineralization forecasting. The main goal is to gather, arrange, and assess the research outcomes from the last ten years (2016–2025). It encompasses the practical applications of both traditional machine-learning algorithms and cutting-edge deep-learning techniques in the realm of mineral exploration. In terms of data handling, a literature database was established with PostgreSQL 16.2 (PostgreSQL Global Development Group, Berkeley, CA, USA). All data manipulation, model categorization, and trend assessment were carried out within the Python 3.11.2 (Python Software Foundation, Wilmington, DE, USA) programming environment.
The chosen period reflects significant advancements in machine learning technology, particularly in handling complex geological data. From the perspective of machine learning technology, around 2016 marked a turning point when machine learning techniques began to gain widespread attention and initial application in the field of mineral exploration and prediction. For instance, deep learning demonstrated significant potential in processing geophysical data, bringing innovations to ore body localization and mineralization information extraction. Deep learning algorithms enabled efficient processing of multi-source data such as remote sensing data and geochemical data, mining hidden ore-indicating anomalies within them. Furthermore, deep learning models could segment and classify complex geological images, helping geologists gain a clearer understanding of subsurface geological structures and thereby predicting ore body locations more accurately [
16]. Since then, deep learning algorithms have been more widely applied in geoscience, used to process sequence data and complex graph-structured data, enhancing the ability of models to capture multi-dimensional and nonlinear geological features. Particularly after 2020, with the Graphics Processing Unit (GPU) and the improvement of open-source frameworks, the application of deep learning models in geoscience entered an explosive phase, enabling the processing of larger-scale, higher-dimensional data and more complex pattern recognition and prediction [
17]. By 2025, various new intelligent algorithms such as Graph Neural Networks (GNNs) and causal inference models have shown great potential in the geological field, indicating that the application of machine learning in the geological and mineral sector will enter a more mature and diversified stage.
Meanwhile, substantial progress has been made in geological data acquisition and processing technologies, providing high-quality data support for the training and application of machine learning models. The popularization of high-resolution remote sensing technologies (such as multispectral, hyperspectral, and synthetic aperture radar data) has enabled more precise capture of key information like surface biodiversity monitoring and vegetation structure, laying the foundation for broader geoscience applications [
18]. Three-dimensional geological modeling technology has also become increasingly mature, capable of presenting complex subsurface geological bodies in a 3D visualized form and providing 3D spatial training samples for machine learning [
19]. Geophysical exploration technologies (such as high-density resistivity method and transient electromagnetic method) and geochemical detection methods have also achieved explosive growth in data volume and improved precision, offering abundant subsurface information for machine learning models [
20,
21]. For example, 3D digital outcrop modeling technology based on UAV oblique photography can convert massive geological data into intuitive 3D geological models, and when combined with deep learning algorithms, it has significantly improved the accuracy of lithology identification [
22].
These geological data, which are sourced from multiple origins, exhibit heterogeneity, and possess high resolution, precisely fulfill the requirement of deep learning models for substantial quantities of data. This, in turn, facilitates the profound application of machine learning methods in the geological domain. Therefore, the period 2016–2025 is not only a golden decade for the rapid development of machine learning technology itself but also a critical period for the qualitative leap in geological data acquisition and processing capabilities. Their synergistic development has jointly driven the paradigm shift of large-scale mineralization prediction from traditional methods to intelligent algorithms, making this period an ideal window to review the evolutionary process of machine learning in mineralization prediction.
2.1. Search and Select Documents
This study utilizes a standardized literature search approach to methodically identify research regarding the utilization of machine learning in large-scale mineralization forecasting. The process involves formulating a search strategy in the Web of Science database, conducting structured processing of bibliographic data, and implementing content screening based on the thematic characteristics of mineralization prediction and methodological criteria. The selection of literature is based on multi-dimensional query logic, aiming to accurately capture application cases of machine learning techniques in the geological field. The core search formula includes:
Title (“mineral prospectivity” OR “ore prediction” OR “mineralization mapping”) AND Title (“machine learning” OR “deep learning”)
The research results are limited to final publications issued between 2016 and 2025. To guarantee a thematic concentration, only the literature that employs particular machine-learning methods is incorporated. These methods encompass Convolutional Neural Networks, Recurrent Neural Networks, Long Short-Term Memory networks, Autoencoders, Support Vector Machines, Decision Trees, k-Nearest Neighbors, Random Forests, and K-means clustering. These nine algorithms form the core classification framework of this review, ensuring consistency between literature screening and subsequent analysis.
In addition, to ensure the focus on geological and mineral-related applications, publications from non-geological and mineral-related fields (e.g., medical diagnosis, mechanical engineering, social sciences, pure mathematical theory, etc.) were excluded. Based on the conducted query, a total of 430 relevant documents were obtained and subjected to systematic screening.
To ensure comprehensive geographical representativeness and relevance of the included studies, we conducted a preliminary bibliometric analysis of the initial query results. This analysis revealed that fifteen countries—China, Canada, Australia, the United States, Russia, South Africa, Brazil, Finland, India, Germany, France, the United Kingdom, Chile, Peru, and Iran—account for a significant proportion of all English-language publications in the field of machine learning applications for mineral prediction. These countries were selected for inclusion in the review based on their substantial contributions to the field, as evidenced by the high volume of relevant publications.
By focusing on these fifteen countries, the review aims to capture the majority of influential research while maintaining broad geographical diversity. This approach ensures that the study remains relevant to the primary research areas in machine learning applications for mineral prediction. However, we acknowledge that this decision may exclude some important contributions from other regions. To mitigate this potential limitation, we have also included a discussion on the potential for future research to expand the geographical scope and incorporate additional international contributions.
A preliminary search retrieved 430 documents. After screening for thematic relevance, 175 additional documents from low-impact-factor and duplicate journals were excluded. A total of 255 documents were finally included in the analysis. The workflow of data collection and screening is presented in
Figure 1.
The metadata of the screened literature were imported into a PostgreSQL relational database. Fields included title, author, affiliation, publication year, document type, keywords, DOI, abstract, citations, and publication stage. These structured fields support SQL query construction for aggregating and classifying literature features. The data analysis was carried out utilizing Python 3.12.2 along with libraries like pandas for structuring the data, and matplotlib and seaborn for visualizing it. All the data from the literature were coded and arranged in tables in accordance with pre-established classification standards.
While machine learning techniques are usually classified according to the learning paradigm (such as supervised learning, unsupervised learning, and deep learning), this review adopts a unified categorization framework founded on application scenarios. The grouping is not strictly based on the theoretical affiliation or architectural depth of the algorithms, but rather on their practical application in mineral prediction.
This method of classification stems from the finding that in geological applications, the demarcations between traditional techniques and deep-learning methodologies frequently intersect. For instance, in study [
23], a convolutional neural network (CNN) was used for feature extraction of mineralization alteration from remote sensing images, while the final mineral potential classification was performed using a traditional support vector machine (SVM) model, forming a hybrid method combining deep learning feature extraction with a classical classifier. Similarly, in [
24], random forests and SVM models were applied to data preprocessed by principal component analysis (PCA) in the fusion analysis of multi-source geological data (geochemical + geophysical), with traditional statistical methods playing a key role in feature engineering. In study [
25], an autoencoder, which is an unsupervised deep-learning model, was employed to reconstruct the missing geochemical data. Subsequently, the output was subjected to further analysis for predicting the probability of mineralization. Similarly, [
26] applied support vector machines (SVMs) and k-nearest neighbors (kNNs) to tectonic mineral-controlling element data that had been manually interpreted, with these data also serving as input features for a convolutional neural network (CNN), forming a multi-method comparative study. Even within a single study, deep learning and classical methods often complement each other. For example, [
27] used both a Convolutional Neural Network and Long Short-Term Memory (DCNN-LSTM) hybrid deep learning model and traditional algorithms such as decision trees and kNN to predict mineral potential in the same area, comparing the effectiveness of different methods via Receiver Operating Characteristic Curve (ROC) curves.
Given this methodological hybridization, classification based solely on algorithm architecture would fail to reflect the practical application scenarios of mineral prediction. The classification system of this review focuses on the combination of methods in geological practice, aiming to reveal the diversity of techniques and their contribution mechanisms to the accuracy of mineral prediction.
To guarantee the trustworthiness and solidity of the outcomes, our review adopted a rigorously structured workflow: we crafted a comprehensive search plan, imposed stringent date and language constraints, and applied a stepwise literature screening protocol. These combined measures guarantee that the 255 papers ultimately retained are both highly pertinent and broadly representative, offering a clear panorama of how the technology evolved between 2016 and 2025. Moreover, the application of a classification framework based on usage scenarios, which acknowledges the integration of classical and deep learning techniques in geological practice, lays a methodological foundation for subsequent comparative analyses.
To ensure the transparency and replicability of this review, it is crucial to offer a comprehensive account of the literature search approach. A structured query carried out in the Web of Science database, which is a fundamental step in the search procedure, is described as follows:
Basic search formula: TITLE ((“mineral prospectivity mapping” OR “ore deposit prediction”) AND (“machine learning” OR “deep learning”))
Extended search formula: Built upon the basic search, the extended search is refined by specifying particular algorithms, as illustrated below:
(TS = (machine learn* OR “machine learning” OR ml OR “neural network*” OR “deep learn*” OR “support vector machine*” OR “random forest*” OR “decision tree*” OR “bayesian network*” OR “ensemble method*” OR “reinforcement learn*” OR “gradient boost*” OR “clustering algorithm*”)) AND (TS = (mineral deposit* OR ore deposit* OR mineralization OR “deposit formation” OR “mineral prediction” OR “ore prediction” OR “mineral prospect*” OR “geological prediction” OR “mineral exploration” OR “ore genesis” OR “mineral resource*”)) AND (PY = 2016–2025).
In this context, the TS field represents the title or subject of the publications, while the PY field specifies the range of publication years.
2.2. Classification Criteria
All included literature has undergone an in-depth content analysis relevant to the research tasks and methodology of mineral prediction. The classification process is based on the metadata and full-text content of the literature, including:
The title, the abstract, the key terms, the author identification details, and, if required, an assessment grounded in the complete text.
Document type differentiation: journal articles, conference papers, review articles, etc.
The core classification criterion is to identify the machine learning algorithms used in each document, specifically including Recurrent Neural Networks (RNN), Convolutional Neural Networks (CNN), Long Short-Term Memory networks (LSTM), Autoencoders, Decision Trees (DT), Support Vector Machines (SVM), Neural Network Systems (NNS), K-means Clustering, and Random Forests (RF). In addition, the classification system also takes into account the types of input data, including geological mapping data, geochemical data, geophysical data, and remote sensing image data. Based on the above characteristics, this review classifies machine learning techniques into three major categories:
Traditional machine learning techniques encompass Support Vector Machines (SVM), Decision Trees (DT), k-Nearest Neighbors (kNN), and Random Forests (RF). These approaches depend on manually crafted geological characteristics (for example, anomaly cut-off values and structural buffer areas). The primary benefits of these methods are the interpretability of the models and computational effectiveness.
Advanced deep learning techniques, including Recurrent Neural Networks (RNN), Convolutional Neural Networks (CNN), Long Short-Term Memory networks (LSTM), and Autoencoders, are capable of directly extracting hierarchical feature representations from unprocessed geological data. This characteristic renders them highly appropriate for the interpretation of remote sensing images and the integration of multi-source geological data.
Hybrid approaches: combining the strengths of the above two categories of methods. For example, using CNN to automatically extract alteration mineral features from hyperspectral data, and then performing mineralization potential classification through SVM or RF. This aims to balance feature learning capabilities and model simplicity.
Key classification criteria also include specific application scenarios of machine learning in mineral prediction. Based on content analysis, the literature is categorized into the following application categories: identification of mineralization anomalies (such as geochemical anomaly extraction), assessment of mineral potential (probabilistic mineral resource prediction), exploration target selection (multi-criteria decision analysis), extraction of structural mineral-controlling elements, and estimation of mineral resource quantities. Most studies have cross-category characteristics, reflecting the multidisciplinary nature of mineral prediction. Types of research methods are also included in the classification system, encompassing experimental studies (including field data validation), case studies (specific types of deposits), conceptual models (innovations in theoretical methods), and literature reviews.
Figure 2 illustrates the association network between machine learning algorithms and mineral prediction application scenarios.
This network diagram, with machine learning techniques at its core and various application scenarios distributed peripherally, illustrates the connections between typical algorithms—such as CNN, SVM, and RNN—and application contexts like mineralization anomaly identification and target area selection. For instance, CNN is predominantly used for alteration zone recognition in remote sensing images and structural interpretation, whereas RF and XGBoost are extensively applied in mineral potential assessment through multi-source data fusion. This visual representation of associations not only reveals the multidimensional nature of mineral prediction methods but also reflects the differences in the applicability of various algorithms.
In addition to the systematic literature selection, a key strength of the methodology in this review lies in its practical classification framework for machine learning techniques. Unlike theoretical paradigm-based classifications, this framework is grounded in the actual application and combination patterns of algorithms in mineral prediction. This approach stems from observations in geological practice where the boundary between classical and deep learning methods in mineral exploration is often blurred, prompting the emergence of hybrid methods as an effective means to address complex geological problems. This review aligns the categorization criteria with the technical procedures of mineral prediction, like anomaly detection and target zone selection. By doing so, it guarantees that the analytical outcomes directly address the practical requirements of mineral exploration. This effectively fills the void between the theoretical progress in machine-learning and geological applications.
2.3. Data Processing and Analysis
All of the literary data underwent structural processing through SQL queries to create statistical reports that showcase the following features: annual distribution of publications, country contributions, major machine learning methods applied, and mineral prediction application scenarios. Meanwhile, qualitative analyses were conducted through abstract reviews and full-text readings when necessary to ensure accurate classification of the literature into the preset research categories. In the course of the analysis, multiple research scenarios were pinpointed, including cases that combined the identification of anomalies and the selection of target areas. A multi-label classification method was adopted for these cases to support cross-tabulation analysis and methodological comparisons. The third chapter will provide a detailed presentation of the visualization of the analysis results and the association patterns between various categories.
2.4. Review Protocol and Quality Assessment
To ensure transparency and reproducibility of the study, this review was designed in accordance with the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines, which include four key stages: literature identification, initial screening, eligibility assessment, and final inclusion (
Figure 3).
In the identification stage, a grand total of 430 research papers were obtained from the Web of Science database by utilizing the pre-defined search terms. The search scope was restricted to English-language final manuscripts that were published within the time frame from 2016 to 2025. Subsequently, in the screening phase, redundant entries were eliminated. After that, the remaining records were evaluated for their thematic relevance, and this assessment was carried out by examining their titles and abstracts. Studies that clearly did not address the application of machine learning in mineral prediction were excluded.
During the eligibility evaluation stage, a comprehensive full-text examination was conducted. Articles were incorporated into the study if they satisfied the subsequent criteria:
Explicitly applied machine learning techniques to large-scale (≥1:100,000) mineral prediction;
Provided a complete methodological description, including data sources, model parameters, and validation methods;
Peer-reviewed journal or conference papers;
The authors are from one of nine selected countries active in this research area.
A total of 255 articles were ultimately included for in-depth analysis. Exclusion criteria include non-geological mineral field research, purely theoretical method discussion (without actual geological data verification), small scale mineral point prediction (<1:100,000) and studies that do not provide model performance evaluation. It is especially important to point out that the term “absence of experimental verification” pertained specifically to research that failed to showcase the application outcomes of machine learning models using actual geological data. Conceptual studies proposing innovative methodological frameworks or performance evaluation metrics directly related to mineral prediction were still retained.
To evaluate the academic quality of the included literature, this study analyzed the following indicators: journal impact factors (JCR Q1/Q2/Q3), number of article citations, inclusion of independent test set validation, and public availability of geological data. For high-potential research published recently (2023–2025), even with low citation counts, studies with significant methodological innovation or large-scale data were prioritized for inclusion. This structured review process ensured the methodological rigor of the review, providing a reliable data foundation for subsequent technical trend analysis.
4. Results and Discussion
Table 2 summarizes the scientific literature on machine learning techniques for large-scale mineral prediction from 2016 to 2020 and from 2021 to 2025. During the initial five-year timeframe, there were 96 publications, and in the subsequent five years, this number grew to 159. Of the total 255 studies analyzed, nearly two-thirds (62.35%) were published in the latter five-year period. This notable disparity unmistakably demonstrates the burgeoning research enthusiasm in this domain and the escalating focus of the academic circle on the utilization of machine learning methodologies in large-scale mineral forecasting. In terms of document types, the analysis shows that journal articles dominate, followed by conference papers, with these being relatively few. This distribution pattern reflects the maturity of research in this field, the active academic exchange, and the preferred channels for knowledge dissemination. Journal articles, as the primary vehicle for publishing academic research findings, far exceed other types in number. This indicates that researchers in the field of mineral prediction using machine learning tend to publish rigorously peer-reviewed, highly academic, and innovative results in professional journals. This not only helps ensure the quality and reliability of the research but also promotes the systematic accumulation and widespread dissemination of knowledge. The high proportion of journal articles also implies that the research methods, theoretical frameworks, and application practices in this field are gradually forming a consensus and standards, providing a solid foundation for future research.
In terms of specific machine learning methods, decision trees were the most frequently used, appearing in 114 publications (44.71% of all cases). Random forests and SVMs were the next most common, appearing in 107 and 60 studies, respectively. Notably, all 43 publications related to CNNs in this research were from the 2021–2025 period, indicating a significant increase in interest in this technology [
23,
34]. Overall analysis shows that the popularity of machine learning methods has changed significantly over time, reflecting shifts in technical trends in research. In addition, there has been a significant change in the types of machine learning methods. Specifically, in recent times, deep learning models, including Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs), have grown more widespread. They have supplanted earlier conventional methods such as Support Vector Machines (SVMs) and decision trees. This shift might be a consequence of the broader utilization of neural architectures, the accessibility of computational resources, and the escalating intricacy of geological datasets [
5,
25].
In terms of application fields, the most frequently discussed topic was mineral potential assessment, which appeared in as many as 175 publications (68.63% of all publications). Exploration target selection appeared in over half (53.73%) of the studies. No statistically significant association was found between application fields and time. This means that key areas such as mineralization anomaly identification, mineral potential assessment, and exploration target selection have remained consistent over the decade, regardless of the machine learning methods used. This indicates a stable research interest in key mineral exploration challenges.
In terms of research methods, conceptual research dominated, accounting for 63.33% of the 161 publications. Case studies appeared in 32.75% of the research, while literature analysis was relatively rare at 3.92%. The research method structure showed no significant changes between the two periods (2016–2020 and 2021–2025), indicating a stable distribution of methods despite the increase in research volume. The temporal distribution of research methods also showed no significant differences, with literature analysis, conceptual and case study methods being used similarly in both periods, reflecting a sustained methodological balance [
81].
An annual statistical analysis of the 255 papers published between 2016 and 2025 clearly revealed the flourishing trend of machine learning techniques in large-scale mineral prediction research.
Figure 4 indicates that there has been a remarkable increase in the quantity of publications within this domain. Notably, after the year 2020, there was a substantial upsurge in the output of research in this area. This reflects the growing academic interest in this research direction and indicates that the potential of machine learning techniques in solving complex geological problems is gradually being recognized and put into practice.
As depicted in
Figure 4, research activity in this field has shown significant growth. From the relatively low publication volume in 2016, it reached a peak in 2021 and grew again in 2024, indicating the continuous rise of machine learning in mineral prediction. The following is a specific data analysis:
Early Exploration Phase (2016–2017): During this period, the number of publications was small, with an annual average of about 10–15 papers. This reflected the initial introduction and exploration of machine learning techniques in the Earth sciences. Research mainly focused on proof-of-concept and the application of basic methods, with researchers beginning to apply classic machine learning algorithms (such as SVM and RF) to simple geological classification and prediction tasks.
Rapid Growth Phase (2018–2021): Starting in 2018, the number of publications grew explosively, with an annual average of 20–40 papers. It reached its highest point in 2021, nearing 50 papers. This growth was primarily driven by the maturity and popularization of deep learning technologies and the rise of the geological big data concept. Research began to focus on more complex nonlinear problems and attempted to handle multi-source heterogeneous data. Additionally, enhanced computing capabilities and the user-friendly nature of open-source machine learning frameworks accelerated research progress.
Stable Development and Deepening Stage (2022–2025): Despite slight fluctuations in the number of studies in 2022 and 2023, the overall level remained high, with an annual publication rate between 30 and 45 papers. In 2024, growth occurred again, indicating that the field will continue to maintain strong momentum. During this stage, research focused more on model robustness, interpretability, generalization ability, and deep integration with geological expertise. Advanced technologies like transfer learning, few-shot learning, and reinforcement learning were also introduced to address practical application challenges.
This growth reflects the global surge of interest in artificial intelligence and the expanding availability of high-quality geoscience data. It mirrors the increasing enthusiasm and capacity of the research community to adopt and test machine-learning approaches. The continuous increase in literature is not just a simple accumulation of research output but also represents an expansion in research depth and breadth, indicating that machine learning will play a more central role in future mineral prediction.
Specifically, the number of studies in this field was relatively small in 2016 and 2017, which may be related to the limited popularity and depth of application of machine learning technology in earth science at that time. However, with the rapid development of cutting-edge technologies like deep learning and their breakthroughs in image recognition and natural language processing, researchers began to introduce these technologies into geological exploration. From 2018 onwards, the number of publications showed a steady upward trend, indicating growing research interest in this interdisciplinary field. In 2021, a peak was reached, likely linked to the increasing global demand for mineral resources and the deeper application of AI technologies across industries [
2,
3]. Despite minor fluctuations in 2022 and 2023, the overall level remained high, and significant growth reappeared in 2024, suggesting strong development momentum for the coming years. The 2025 data also shows a continuation of this positive trend.
Behind this growth are several driving factors. First, the rapid accumulation of geological big data has provided an unprecedented data foundation for machine learning applications. The richness of traditional and emerging data sources, such as remote sensing, geophysical exploration, geochemical analysis, and drilling data, has made data-driven mineral prediction possible. Second, the significant improvement in computing power, especially the popularization of high-performance computing and cloud computing, has provided essential support for training and deploying complex machine learning models. Third, the maturity and ease of use of open-source machine learning frameworks have greatly lowered the barrier to entry for researchers. Finally, global mineral resource exploration faces challenges like deep mineral discovery, hidden mineral prediction, and identification of complex deposit types, which urgently require new technological approaches to enhance efficiency and success rates, providing broad application prospects for machine learning.
From 2016 to 2019, classic machine learning algorithms dominated mineral prediction applications. As shown in
Figure 5, their applications far exceeded those of deep learning algorithms during this period. The main reasons are as follows:
Technological Maturity and Interpretability: Classical algorithms such as Support Vector Machines (SVMs), Random Forests (RFs), Logistic Regression (LR), Decision Trees (DTs), and K-Nearest Neighbors (KNNs) were relatively mature with solid theoretical foundations and numerous application cases. Their good interpretability was crucial for geologists to understand model decision-making and verify prediction rationality. Model transparency was often key for acceptance in geological surveys.
Data Availability and Scale: Geological data for model training was limited and of varying quality in early studies. Classical machine learning algorithms, with lower data requirements and the ability to handle small sample data, were better suited to the data environment at the time.
Computational Resource Constraints: Deep learning models require substantial computational resources, such as high-performance GPUs, which were not easily accessible to all research teams around 2016. In contrast, classical machine learning algorithms demanded fewer computational resources and were easier to deploy and run in conventional computing environments.
Research Paradigm Inertia: The geological community’s acceptance of machine learning developed gradually. Early researchers tended to start with familiar classical algorithms, incrementally exploring their applicability in geology.
During this phase, classical machine learning algorithms were primarily used to process structured geological data, such as geochemical element concentrations, geophysical anomalies, and drilling data, for anomaly identification, classification prediction, and potential area delineation. For example, Random Forests, with their ability to handle high-dimensional data and nonlinear relationships and their robustness to outliers and noise, were widely applied in mineral prediction. SVMs, known for their excellent performance with small samples and high-dimensional data, were also effectively used in mineral potential evaluation and exploration target selection.
From 2020 to 2025, there has been a significant change in the application pattern of machine learning algorithms. As shown in
Figure 4, the number of applications of deep learning algorithms has grown exponentially, almost reaching the same level as that of classical machine learning algorithms. This marks the transition of deep learning from an exploratory phase to a stage of widespread application in geological exploration, particularly in large-scale mineral prediction. The drivers of this shift are as follows:
The maturation and popularization of deep learning technologies: Deep learning models such as CNN, RNN, LSTM, and GAN have achieved remarkable success in fields like image recognition and natural language processing, attracting the attention of geologists. These models demonstrate unparalleled advantages in processing unstructured data (e.g., remote sensing images, geophysical profiles, geological maps) and complex spatiotemporal sequence data.
The development of geological big data: With advances in high-resolution remote sensing, 3D geophysical exploration, and high-throughput geochemical analysis, geological data has experienced explosive growth in volume and complexity. Deep learning models can automatically learn complex feature representations from massive datasets and effectively handle high-dimensional, nonlinear, and multimodal geological big data, which poses a challenge for classical machine learning algorithms.
Enhanced computing power and open-source framework support: The widespread availability of GPU computing power and the maturity of open-source deep learning frameworks like TensorFlow and PyTorch have significantly reduced the development and training thresholds for deep learning models, enabling more research teams to utilize these advanced tools.
The demand for higher predictive accuracy and automated feature extraction: Deep learning model can automatically learn the deep features of data through multi-layer neural network, which avoids the tedious and expert-dependent feature engineering process in traditional methods, thus improving the prediction accuracy and automation level. For instance, CNNs have shown strong capabilities in extracting mineralization alteration information from remote sensing images and identifying geophysical anomalies, while RNNs and LSTMs have unique advantages in processing sequence data such as drilling data and geochemical profiles.
In this stage, the application scope of deep learning algorithms has been continuously expanding. They are not only used for traditional classification and regression tasks but also extended to cutting-edge directions such as data generation (e.g., GANs for generating synthetic geological data), anomaly detection, transfer learning (applying pre-trained models to new geological areas), and few-shot learning (conducting effective predictions in data-sparse regions). Meanwhile, classical machine learning algorithms have not been completely replaced. They still play an important role in scenarios with relatively small data volumes, high requirements for model interpretability, or limited computational resources. Many studies have begun to explore the combination of classical and deep learning to leverage their respective advantages in constructing hybrid models, aiming to achieve better predictive performance.
It is worth noting that the growth in literature is not merely quantitative but also qualitative. Early research mainly focused on the initial exploration and verification of classical machine learning algorithms. Over time, more complex models such as deep learning, transfer learning, and reinforcement learning have been introduced. The depth and breadth of research have also expanded. Studies have evolved from single data sources to multi-source heterogeneous data integration, from regional predictions to global-scale ones, and from qualitative analysis to quantitative prediction. This progression reflects researchers’ deeper understanding and more effective utilization of machine learning’s potential.
Future research will likely continue to focus on building more robust and interpretable models, effectively handling uncertainties and sparsity in geological data, and closely integrating machine learning models with geological expert knowledge to form intelligent mineral prediction systems with human–machine collaboration. The sustained growth in literature indicates that machine learning will play an increasingly important role in mineral prediction, becoming a key driver of mineral exploration technology advancement.
A national affiliation analysis of the 255 studies included in this review was conducted. The results are shown in
Figure 6, clearly displaying the research landscape and major contributing forces of machine learning in large-scale mineral prediction globally. The analysis indicates that a small number of countries show significant research activity in this interdisciplinary field, while others have relatively less involvement.
As observed in
Figure 6 and
Table 3, China has dominated the research on machine learning applications in mineral prediction, with a number of publications far exceeding those of other countries. This dominance is closely related to China’s immense demand for mineral resource exploration, strategic national investments in geological science and artificial intelligence, and its large base of research personnel. Research institutions and universities such as the Chinese Geological Survey, the Chinese Academy of Sciences, and China University of Geosciences have played a key role in advancing research in this field, producing a large number of high-quality academic outcomes. In addition, China’s rich data resources and diverse geological background in the earth sciences provide a unique advantage for training and validating machine learning models.
Following China are traditional mining giants such as Canada and Australia. These countries, with their long history of mineral exploration and abundant mineral resources, have an urgent need to improve mineral exploration efficiency and reduce costs. Consequently, they actively introduce and develop advanced technologies, including machine learning, to address the challenges of deep mineral exploration and hidden mineral prediction. Canada has strong capabilities in earth science data processing and modeling, while Australia is at the forefront of mining technology innovation and application. Both countries have also made significant progress in combining machine learning with mineral prediction.
The rationality of the “relatively low number of papers from the United States” can be analyzed as follows. Regarding the “low number of papers from the United States,” the following analysis can be made from the perspective of the real research ecosystem: The United States has a deep accumulation in the field of geological research. However, mineral prediction research is significantly influenced by resource demand drivers. If the research focuses on the emerging interdisciplinary field of “machine learning + mineral prediction,” the United States may have a relatively low number of publications in this niche direction compared to countries like China due to the following factors: the maturity of mineral resource development phases (e.g., some traditional minerals in the United States have reached a mature development stage, and resource demand priorities have shifted) and research investment focus (e.g., greater attention to frontier fields such as deep space and marine exploration). Meanwhile, India has a strong demand for mineral resources (such as iron ore and coal, which support industrial development) and has complex geological structures (e.g., the Himalayan orogenic belt and the Deccan Traps). There is a practical need for “improving traditional exploration efficiency and urgently requiring intelligent technology breakthroughs.” In recent years, institutions such as the Geological Survey of India and the Indian Institutes of Technology have intensified their efforts in directions like “intelligent processing of low-cost geochemical exploration data and machine learning analysis of structural mineralization patterns,” driven by resource needs to propel scientific research.
Russia is also a significant contributor in the field of machine learning for mineral prediction. Russia’s vast geological territories and complex geological formations provide a rich database for machine learning applications. The country’s expertise in geoscience, combined with its computational resources, enables the development of sophisticated models that can handle the intricacies of mineral exploration. Russia’s research often emphasizes the integration of traditional geological knowledge with cutting-edge machine learning techniques, enhancing the accuracy of mineral prediction in challenging environments. The nation’s strategic interest in resource development further drives investment in this domain, fostering innovation and collaboration among academic institutions and industry players.
Germany, the UK, and France together form a powerful trio in advancing machine learning for mineral prediction in Europe. Germany drives innovation with its strong geoscience infrastructure, creating hybrid models that merge machine learning with physical simulations. The UK utilizes its extensive historical geological data to optimize traditional exploration techniques through machine learning. France bridges fundamental and applied research with its comprehensive geological research system and cross-disciplinary collaboration. Collectively, they offer diverse yet complementary approaches, enhancing the academic rigor and practical applicability of machine learning in mineral exploration and contributing significantly to the global geoscience community.
China leads in research on machine learning for mineral prediction with 73 publications (28.63% of the total). Canada and Australia follow with 36 (14.12%) and 29 (11.37%) publications, respectively. The US and Russia contribute 27 (10.59%) and 22 (8.63%) publications, respectively. Countries like South Africa, Brazil, Finland, Iran, and India have fewer publications (11, 8, 6, 6, and 6, respectively). This distribution shows research is concentrated in major mining nations and those with high mineral demand, indicating potential for international collaboration.
Evolution of the Global Research Landscape: Despite the current concentration of research efforts, it is anticipated that more countries will join this field as global mineral resource demands continue to grow and artificial intelligence technologies become more widespread. Future international cooperation and data sharing will become increasingly important to jointly address the challenges of global mineral resource exploration.
Future research trends may include:
Strengthening of international cooperation: As the complexity of global mineral resource exploration increases and machine learning technologies become more universally applicable, scientific collaboration among nations will become more frequent to jointly address cross-regional and cross-mineral mineral prediction challenges.
Emergence of new economies: Some new economies with abundant mineral resources but currently low research output may gradually increase their investment in this field as their technological capabilities improve and their demand for mineral resource development grows, becoming new growth points for research.
Data sharing and standardization: To promote research progress globally, it will become crucial to advance the sharing and standardization of geological data and machine learning models. This will help overcome data barriers and accelerate technological innovation.
In general, countries such as China, Canada, and Australia play a leading role in the application of machine learning to large-scale mineral prediction. But research efforts globally need further expansion and synergy. International cooperation is an important driving force for the development of mineral prediction technology. Qianlong Zhang et al. (2023) systematically summarized the global cooperation network of cross-disciplinary research between geochemistry and artificial intelligence, providing an important reference for future international cooperation [
78]. Through strengthened international cooperation and data sharing, it is expected to jointly promote technological advancements in this field and provide more effective solutions for the sustainable development of global mineral resources.
An analysis of the frequency of machine learning algorithms used in the 255 papers included in this review is presented in
Figure 7. This figure clearly shows which algorithms are favored by researchers in large-scale mineral prediction and their application characteristics and strengths. Random Forest and Decision Trees are the most widely used classic machine learning algorithms. While CNNs and RNNs dominate deep learning algorithms.
As shown in
Figure 7, Decision Trees (DT), Random Forests (RF), Support Vector Machines (SVM), Convolutional Neural Networks (CNN), and Recurrent Neural Networks (RNN) are the four most frequently used algorithms in this field. Quantifying the application frequency of these algorithms can reveal preferences for them in different periods and tasks.
Decision Trees (DT): As the most frequently used algorithm, DT was applied 87 times from 2015 to 2019 and 27 times from 2020 to 2021. With clear rule derivation and adaptability to multi-feature hierarchical classification of geological data, DT can effectively break down complex geological associations in tasks like mineralized anomaly identification and ore-controlling element extraction. It provides highly interpretable model conclusions for mineral potential evaluation and continues to support mineral prediction analysis.
Random Forests (RF): In mineral prediction algorithms, RF stands out with a high application frequency, being mentioned or used in over 100 literature works. This is attributed to its excellent generalization ability, high-dimensional data processing capability, and noise robustness. RF performs well in dealing with multi-source heterogeneous geological data, feature importance evaluation, and integrated prediction model building. It is particularly suitable for target area selection and mineral potential evaluation tasks.
Support Vector Machines (SVM): With an application frequency of nearly 60 times ranking among the top, SVM is highly popular in tasks such as mineralized anomaly identification and deposit type classification due to its excellent performance on small samples and high-dimensional data, as well as its ability to handle nonlinear relationships. Despite the rise of deep learning, SVM still has irreplaceable advantages in some specific scenarios.
Convolutional Neural Networks (CNN): The application frequency of CNN exceeds 40 times, showing its rapid rise in recent years. As a representative of deep learning, CNN has a natural advantage in processing image data such as remote sensing images and geophysical maps. It can automatically extract deep features, greatly enhancing the automation and intelligence of remote sensing alteration information extraction, geophysical anomaly identification, and structural information extraction. The fast growth in its application frequency reflects the huge potential of deep learning in geological big data processing.
Recurrent Neural Networks (RNN): With an application frequency of 34 times, RNN mainly focuses on processing sequential data. For example, in tasks such as drill hole data interpretation, geochemical profile analysis, and time-series prediction, RNN and its variants (such as LSTM) can effectively capture the spatiotemporal dependencies in data, revealing deep mineralization patterns and element migration patterns.
Overall, the application frequency of classical machine learning algorithms (decision trees, RF, SVM) and deep learning algorithms (CNN, RNN) shows a neck-and-neck trend. This indicates that in mineral prediction, researchers flexibly select and combine different machine learning algorithms according to specific tasks and data characteristics. In the future, it is expected that more hybrid models and ensemble learning methods will emerge to fully utilize the advantages of different algorithms, achieving more accurate and robust mineral prediction.
The application of machine learning algorithms in mineral prediction shows a trend of diversification and specialization. Random forests and SVM, as representatives of classical algorithms, continue to be favored for their robustness and good performance with limited data. CNN and RNN, as representatives of deep learning, have rapidly risen due to their powerful automatic feature extraction ability and advantages in handling complex unstructured data. In the future, with the further development of geological big data and computing power, more advanced machine learning and deep learning algorithms will be introduced into mineral prediction. Meanwhile, methods such as multi-model integration, hybrid models, and semi-supervised learning and reinforcement learning combined with geological expert knowledge will become important research directions. It is hoped that these approaches will build smarter, more efficient, and more interpretable mineral prediction systems.
A statistical analysis was conducted on the application fields of the 255 papers included in this review, with the results shown in
Figure 8.
Table 4 classifies and summarizes the 255 scientific publications according to application field, machine learning method, and research method. It is important to highlight that a single publication may fall into several categories, which is why the total values do not add up to 100%. The analysis encompasses five primary topical domains: mineralization anomaly identification, mineral potential assessment, exploration target preference, tectonic ore-controlling element extraction, and mineral resource estimation. Among these, the most studied area is mineral potential assessment (175 papers), indicating the greatest research attention. This is followed by exploration target preference (137 papers) and mineralization anomaly identification (82 papers). The other fields, such as mineral resource estimation (58 papers) and tectonic ore-controlling element extraction (58 papers), though less frequent, remain important components of the research.
From the perspective of research method classification, conceptual research forms the largest group, accounting for 63.33% of the studies (161 papers). This indicates the research community’s emphasis on developing new solutions, concepts, and models. Most of these studies are concentrated in mineral potential assessment (47 papers), exploration target area selection (46 papers), and mineralization anomaly identification (33 papers). Case studies are the second largest group (84 papers, 32.75%), mainly focusing on mineral potential assessment (26 papers) and exploration target area selection (24 papers). Literature analysis is relatively rare, appearing in only 10 papers, primarily in the field of mineral potential assessment (3 papers). No significant differences were found between machine learning methods and their application fields. The distribution of research methods across different fields shows no particular preference. Mineral potential assessment remains the dominant research area with 175 papers, where decision trees (45 papers) and random forests (42 papers) are the most commonly used methods. The prevalence of conceptual and case studies strongly confirms that this field is in a stage of vigorous development.
Figure 9 presents a heat map that clearly illustrates the relationship between methodological types and major application areas. The frequency of occurrence, indicated by color coding, allows for the quick identification of dominant links between methodologies and research topics in the field of machine learning and large-scale mineral potential prediction.
In conceptual research, the impact is significant, with a large number of studies spanning almost all fields. These are most frequently found in publications on mineralization anomaly identification (58 related studies), mineral potential assessment (175 studies), exploration target selection (137 studies), and tectonic ore-controlling factor extraction (58 studies). The theoretical structures and analytical frameworks established through conceptual research are crucial as the cornerstone for subsequent studies. The ideas and models developed at this stage also lay the foundation for future experiments and practical applications.
Case studies rank second in total article count and are present across many fields, though to a lesser extent. Within these, the highest numbers are seen in exploration target selection (24 studies), mineral potential assessment (26 studies), and mineralization anomaly identification (17 studies). This dissemination indicates that conceptual notions are steadily being converted into real-world implementations and evaluated in laboratory or on-site research environments. Conversely, literature surveys occur only intermittently in a small number of publications and are confined to particular domains. For instance, the application of autoencoders is relatively low across all fields, with just 1 study in mineralization anomaly identification. This implies that the present emphasis continues to be on devising and evaluating novel solutions instead of integrating pre-existing knowledge.
The provided data table proficiently showcases the framework of research progression within the domain. Conceptual exploration lays the theoretical groundwork, whereas experiments and case analyses signify the ensuing phases of verifying and applying the devised methodologies in practical scenarios. This comprehensive overview deepens our comprehension of how diverse methodological strategies in machine learning can attain particular objectives in large-scale mineral forecasting.
A bibliometric examination uncovers not just a numerical increase in published works but also qualitative alterations in research methodologies and technological inclinations. A clear transition is observed from classical machine learning methods like support vector machines (with a total of 60 applications across fields) and decision trees (totaling 114) to deep learning architectures such as convolutional neural networks (CNNs with 43 applications) and long short-term memory networks (LSTMs with 18). This change reflects an increasing demand for the automated extraction and handling of intricate, high-dimensional geological data. Such data processing is of great significance for practical mineral prediction systems.
The geographic concentration of publications, particularly those from China, Australia, and Canada, shows increasing industrial involvement and investment in the digitization and intelligentization of mineral resource exploration in these countries. This indicates that the future of large-scale mineral prediction will be closely linked to countries with advanced mineral digitalization and exploration technologies, and these countries are also more likely to generate and share large-scale geological datasets.
In summary, the observed trends go beyond mere statistics. They reveal a progressive evolution in the application of machine learning to large-scale mineral prediction, moving toward higher technological levels, practical implementations, and comprehensive integration with mineral exploration engineering systems.
6. Conclusions
From 2016 to 2025, machine learning applications in large-scale mineral prediction showed significant progress, particularly in mineralization anomaly identification, resource potential assessment, and exploration target selection. After reviewing 255 publications, it was found that research activity increased notably after 2020, with a 65.63% rise in studies published between 2020 and 2025.
The areas attracting the most research interest included mineralization anomaly identification (32.16% of publications), mineral potential assessment (68.6%), and exploration target selection (53.7%). Among the machine learning methods used, classical algorithms—especially decision trees and random forests (RF)—dominated, accounting for nearly 65.5% of all applications. Meanwhile, the popularity of deep learning technologies, particularly CNNs and RNNs, has grown significantly. These are most commonly applied in mineralization anomaly identification, 3D geological modeling, and mineralization time-series analysis.
Although less frequently used, methods such as autoencoders, LSTMs, and K-means clustering are gaining popularity. They typically serve exploratory or complementary roles to mainstream approaches.
From a methodological viewpoint, conceptual research holds a marked predominance, featuring in excess of 63.33% of publications. This indicates that the field remains heavily reliant on theoretical foundations and model development. Nonetheless, the proportion of case studies has shown a substantial rise (exceeding 32.75%), suggesting a gradual shift from theory to practical exploration, particularly in porphyry copper, sandstone uranium, and volcanic-associated gold mineral deposits, serving as a bridge between theory and practice. Literature reviews, however, account for only a small fraction of the analyzed work. This may stem from the field’s novelty and rapid evolution, with the focus still on generating novel solutions.
Geographically, China and Canada lead in research activity, with their publications constituting a significant share of the global total. China’s research output growth is especially noteworthy, with a substantial increase observed during 2020–2025 compared to the preceding period. Over the past few years, novel research facilities have come into existence in nations like Australia, the United States, Russia, and South Africa. This development showcases the increasing globalization of research within this particular domain.
To conclude, the review of existing literature vividly illustrates the growing significance of machine learning in the realm of mineral prediction. The current focus is primarily on identifying mineralization anomalies, resource assessment, and target area selection. While classic methods like SVM and decision trees remain fundamental to many solutions, interest in deep learning technologies is on the rise. While the majority of works possess a conceptual essence, an increasing quantity of contributions grounded in experimentation and case studies are emerging. This trend serves as an indication of the field’s progression towards maturity.
Over the next few years, future advancements ought to not only augment and broaden machine learning algorithms and their uses but also strive for standardizing geological data, establishing open benchmarks, and offering more real-world exploration datasets. Among the crucial future paths are delving into transfer learning, unsupervised learning, and fusing data from a variety of geological data sources. All these efforts can greatly enhance the efficiency and dependability of contemporary mineral prediction systems.
In the later phases of this study, the writers plan to expand the examination to cover publications listed in other well-regarded scientific data repositories, such as Scopus, CNKI, and Engineering Village. This will enrich the current review with additional knowledge sources and offer a more comprehensive perspective on global trends in the application of machine learning to large-scale mineral prediction. Incorporating a broader spectrum of databases will likewise enable comparisons of research methodologies among scientific circles and assist in pinpointing less investigated yet promising thematic domains. Additionally, the authors intend to carry out a comprehensive examination of specific deposit types, like porphyry copper deposits, lithium clay deposits, or volcanic massive sulfide (VMS) deposits, by utilizing bibliometric and network analysis methods.
This review focuses on the machine learning methods most widely used in the field of large-scale mineral exploration prediction over the past decade. However, we have also noted that emerging paradigms as Transformer models, graph neural networks, and reinforcement learning (RL) are attracting attention from the academic community. Through innovative modeling approaches, these technologies can effectively handle the structures of time-series data, spatial data, and unlabeled data, providing important support for the construction of mineral exploration prediction systems in the future. For example, transformers show potential in handling sequential data in mineral exploration, graph neural networks can effectively model complex geological relationships. and reinforcement learning methods—such as deep Q-networks (DQN) and policy gradient algorithms—are excellent in scenarios requiring dynamic decision optimization, which is a common challenge in mineral exploration. Specifically, in real-time exploration route planning or adaptive adjustment of geophysical survey parameters, reinforcement learning can continuously optimize decision strategies based on feedback from geological environments, significantly improving the efficiency of on-site exploration and the accuracy of target area positioning.
Yet, our review mainly focuses on relatively mature deep learning methods. This is because the application of large language models in mineral prospectivity prediction is still in its infancy with few relevant studies. At present, focusing on the research of mature deep learning methods may bring more valuable results to this field. In the future, we will pay close attention to the application of large language models in this field and plan to explore them more comprehensively in our subsequent work.