Next Article in Journal
Smart Homes Supporting the Wellness of One or Two-Person Households
Next Article in Special Issue
Machine Learning Models for Slope Stability Classification of Circular Mode Failure: An Updated Database and Automated Machine Learning (AutoML) Approach
Previous Article in Journal
MIMO Antennas: Design Approaches, Techniques and Applications
Previous Article in Special Issue
Capture and Prediction of Rainfall-Induced Landslide Warning Signals Using an Attention-Based Temporal Convolutional Neural Network and Entropy Weight Methods
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Scientometric Analysis of Artificial Intelligence (AI) for Geohazard Research

1
Badong National Observation and Research Station of Geohazards (BNORSG), China University of Geosciences, Wuhan 430074, China
2
Three Gorges Research Center for Geohazards of the Ministry of Education, China University of Geosciences, Wuhan 430074, China
3
School of Economics and Management, China University of Geosciences, Wuhan 430074, China
*
Author to whom correspondence should be addressed.
Sensors 2022, 22(20), 7814; https://doi.org/10.3390/s22207814
Submission received: 15 September 2022 / Revised: 6 October 2022 / Accepted: 12 October 2022 / Published: 14 October 2022

Abstract

:
Geohazard prevention and mitigation are highly complex and remain challenges for researchers and practitioners. Artificial intelligence (AI) has become an effective tool for addressing these challenges. Therefore, for decades, an increasing number of researchers have begun to conduct AI research in the field of geohazards leading to rapid growth in the number of related papers. This has made it difficult for researchers and practitioners to grasp information on cutting-edge developments in the field, thus necessitating a comprehensive review and analysis of the current state of development in the field. In this study, a comprehensive scientometric analysis appraising the state-of-the-art research for geohazard was performed based on 9226 scientometric records from the Web of Science core collection database. Multiple types of scientometric techniques, including coauthor analysis, co-citation analysis, and cluster analysis were employed to identify the most productive researchers, institutions, and hot research topics. The results show that research related to the application of AI in the field of geohazards experienced a period of rapid growth after 2000, with major developments in the field occurring in China, the United States, and Italy. The hot research topics in this field are ground motion, deep learning (DL), and landslides. The commonly used AI algorithms include DL, support vector machine (SVM), and decision tree (DT). The obtained visualization on research networks offers valuable insights and an in-depth understanding of the key researchers, institutions, fundamental articles, and salient topics through animated maps. We believe that this scientometric review offers useful reference points for early-stage researchers and provides valuable in-depth information to experienced researchers and practitioners in the field of geohazard research. This scientometric analysis and visualization are promising for reflecting the global picture of AI-based geohazard research comprehensively and possess potential for the visualization of the emerging trends in other research fields.

1. Introduction

According to the Occupational Safety and Health Administration (OSHA, https://www.ccohs.ca/oshanswers/hsprograms/hazard_risk.html, accessed on 3 October 2022), a hazard is any source of potential damage, harm, or adverse health effects on something or someone under certain conditions at work. Geohazards refer to events caused by geological conditions or processes that pose a threat to human life, property, or the natural environment [1]. According to the Emergency Events Database (EM-DAT, https://public.emdat.be/, accessed on 7 July 2022), a global database of technical and natural disasters, 1877 large-scale geohazards occurred worldwide between 1 January 1990 and 7 July 2022. These disasters killed 2.43 million people, left 25.74 million people homeless, and caused $862 million in damages. Japan and China are the countries with the highest losses due to geohazards, which caused approximately $392 million and $114 million in damages, respectively (Figure 1a). As shown, the number of geohazards increased from 1990 to 2000. Asia and the Americas, which account for 55.5% and 22.9% of the total number of geohazards worldwide, respectively, have suffered the most from geohazards (see Figure 1c).
Great efforts have been made in geohazard prevention and mitigation [2,3,4]. However, geohazards are characterized as complex and uncertain [5,6]; thus, challenges remain for researchers and practitioners [7]. Recently, artificial intelligence (AI) has become popular among researchers and practitioners and has led to considerable advances in geohazard research. Affected by multiple triggering factors [8,9], the monitoring data of the geohazard are usually characterized with complex and nonlinear relationships. For example, due to seasonal rainfall and periodic reservoir fluctuation, the landslide movements in the Three Gorges Reservoir area are characterized with step-like deformation, which makes the displacement predictions remain as challenges. AI is able to analyze these complex and nonlinear characteristics well by establishing a mapping between the input feature data and the output final results [10]. AI has proven its capability in dealing with high-dimensional and large-scale datasets by providing satisfactory predictions [11]. Moreover, AI, a data-driven approach, relies less on expertise and clear understanding of physical processes [12]. Based on previous review works [13,14], AI is widely used in the geohazard field [15,16] (see Figure 2). For example, Kalantar et al. and Xia et al. [17,18] modeled landslide susceptibility using the support vector machine (SVM) algorithm, logistic regression (LR) algorithm, and artificial neural network (ANN) algorithm. Ghorbanzadeh et al. [19] evaluated the application of deep learning (DL) in landslide identification. Zhang et al. [20] used ML algorithms such as decision tree (DT) and random forest (RF) to map landslide susceptibility. Mousavi et al. [21] proposed a DL model for simultaneous seismic detection and phase selection. Wu et al. [22] used AI algorithms such as SVM for tunnel collapse risk assessment. Choubin et al. [23] adopted AI algorithms such as multivariate discriminant analysis for the prediction of avalanche hazards. Valade et al. [24] implemented intelligent monitoring of global volcanic activity using AI techniques on multisensory satellite-based imagery from Sentinel-1. The rapid development of AI research in geohazards has led to a rapid increase in the number of publications on the subject. This makes it difficult for researchers and practitioners to keep abreast of cutting-edge research information and the overall status of research in this field, which can easily lead to meaningless and repetitive studies. To solve this problem, a scientometric analysis and review of the current state of recent research in this area is necessary.
Several researchers have previously conducted review studies in this field. For example, Dikshit et al. [13] provided a qualitative analysis of the application of AI in geohazards highlighting the direction of development in this field. Huang et al. and Merghadi et al. [25,26] analyzed the application of DL in the field of landslide susceptibility evaluation. Xie et al. [27] provided an overview of the applications and prospects of machine learning (ML) in the field of seismic research. Despite their important contributions to the development of the field, these review studies have some limitations. Most of these review studies are qualitative or are limited to the application of a particular AI to a certain type of geohazard; thus, there is a lack of quantitative and comprehensive review studies of the development of AI in geohazard research. In addition, the current review studies in the field do not include an analysis of the publication characteristics of existing papers, the main authors, institutions, and countries or the studies related to the identification of hot research in the field. Therefore, the current review studies do not provide a comprehensive and objective description of the current state of research of AI in the field of geohazards.
A scientometric analysis, which refers to the quantitative study of science and communication in science [28], is promising for addressing the abovementioned limitations as it can handle large volumes of publications; thus offering a visualization of research networks of key scholars, institutions, fundamental articles, and salient topics. Therefore, scientometric reviews have been applied to various research fields [29,30,31,32]. However, so far, no previous reviews have conducted the scientometric analysis of AI-based geohazard research by identification of the salient term and research trend and mapping interconnection.
To fill the gap in quantitative analysis research in geohazard reviews and promote development, quantitative analysis methods are used in this study to analyze and summarize the development of AI in geohazard research from 1990 to 2022. This study contributes to the development of the field of geohazards by objectively presenting the current research status and future directions of AI in this field. The main researchers, institutions, countries, and hot research topics are identified. The advantages and limitations of popular AI algorithms in the field of geohazards are analyzed, and future directions are discussed.

2. Materials and Methods

Scientometric analysis is a method of scientific analysis that shows the logic and connections between documents by mapping, mining, ranking, and analyzing them [33]. Various techniques, such as BibExcel, HistCite, and CiteSpace are available to achieve this goal. CiteSpace (version 5.4. R1 64 bit) [34,35] was chosen in this study because the clarity and interpretability of the resulting visualizations are better than those of other scientometric analysis tools. In the present study, a scientometric analysis of AI for geohazard research was performed based on the following three procedures (see Figure 3).
Data collection: Web of Science (http://apps.webofknowledge.com, accessed on 7 March 2022) is a comprehensive database with high-quality citation analysis [36] that is based on high-quality citation data, publication standards, and expert judgment. This database is of higher quality, contains more specialized data than other databases (such as Scopus and Google Scholar [37]) and can support a longer period of citation analysis. Thus, in this study, Web of Science was adopted for data collection. A search was performed for the topic (TS) query in Web of Science using the following formula “TS = ((fuzzy sets or naive Bayes or linear regression or random forests or gradient boosting or reinforcement learning or meta heuristics or AI or artificial intelligence or optimization algorithm or machine learning or deep learning or computational intelligence or decision tree or prediction model) AND (geohazard or landslide or slope or rockfall or collapse or earthquake or debris flow or hazard or tsunami))”. Based on the literature search publications in the English language were selected. The year of publication ranged from 1 January 1990 to 1 January 2022, and the subject categories were refined to GeoScience Multidisciplinary and Engineering Geological. A total of 9226 documents were retrieved for scientometric analysis.
Data filtering and refining: Subject terms were identified, subject searches were performed in the Web of Science database (“title, abstract, author keywords, and KeyWords Plus”), and Boolean operators (OR/AND) were used to expand the search and exclude irrelevant papers. After filtering and refining the search results to determine the time frame, the search results were downloaded and prepared for the next step of the analysis.
Data analysis and visualization: After filtering and refining the papers, the data were visually represented by using the visualization tool CiteSpace. Cluster analysis, an exploratory data mining technique, was adopted for the identification of the salient term and context, research trend, and interconnection. Log-likelihood ratio was used as the clustering index due to advantages of high-quality classification with high intra-class similarity and low inter-class similarity. A cluster overlap indicates that there are relationships between keywords of these different clusters. CiteSpace was used as a tool for performing cluster analysis. A visualization map generated by CiteSpace consists of color-coded nodes and links that describe co-citations or cooccurrences between these nodes. Each representative node, which is made up of a “tree ring” of different colors, denotes one specific item (e.g., country, institution, keyword, author, cited reference, or cited journal). The spectrum of colors denotes the temporal order: oldest in blue and newest in orange. The size of the ring represents the frequency of the corresponding item in a particular year. A red ring present in a particular year denotes a burst, that is, a surge of occurrences or citations in that year. Based on the data visualization, scientometrics and content analysis of the search results in the field are performed and discussed and the results are derived.

3. Results

3.1. Analysis of Publication Characteristics

3.1.1. Publication Distribution Characteristics

The characteristics of the publications on AI for geohazard research over time are shown in Figure 4. As shown, since 1990, the number of papers published in this field has continued to increase. After entering the 21st century, with the rapid development of AI technology, AI technology in the field of geohazards has developed quickly. As shown in Figure 1b, after 2000 there was an overall decreasing trend of geohazards. At the same time the research of AI in the field of geohazards began to grow rapidly, which to some extent reflects the help of AI in geohazard prevention and mitigation.
The spatial distribution characteristics of publications are shown in Figure 1a. The circles of different colors and sizes in the figure indicate the total number of papers published in the countries where the circles are located. As shown, China and the United States, with a total of 2320 publications and 1993 publications, respectively, are the two countries with the highest total number of publications in this field. In addition, the total number of national papers is positively related to geohazard losses in that country.

3.1.2. Publication Source Characteristics

To provide researchers with references to authoritative publication sources and to facilitate access to relevant and cutting-edge papers, the top 15 sources of AI for geohazard research are shown in Table 1. The sources of the top 15 papers are all journals and there are no conference papers; these data suggest that researchers prefer to publish their papers in journals rather than academic conferences. In addition, the papers from the top 15 sources account for 48.46% of the total number of papers. Among them, Natural Hazards and Remote Sensing, with 5.96% and 4.58% of the total number of papers, respectively, are among the top two source journals in this field. Among the top 15 sources, Geomorphology (17,656 citations), Journal of Hydrology (13,417 citations), and Catena (12,810 citations) are the most cited. Those results correspond well with the bibliometric review of Wu et al. [38] in the field of AI. Among the top 15 sources, Geomorphology (61.52), Earthquake Spectra (50.59), and Catena (47.62) have the highest average number of citations. Geomorphology and Catena not only have a high total number of citations but also a high average number of citations. Therefore, they are considered to be the most active journals in this field.

3.1.3. Publication Keyword Characteristics

Figure 5 shows the 10 keywords with the strongest citation burst of AI research citations in the field of geohazards, representing the main interests of researchers in the field. As shown, researchers have been interested in researching AI techniques in geohazards since 2012 when fuzzy logic was a popular research topic in the field. New research hotspots have gradually emerged. In 2016, the analytical hierarchy process appeared. In 2017, SVM became the third most cited keyword in this citation burst. Subsequently, LR, DL, and many other ML algorithms began to be widely applied in geohazards and became hot topics in the field. Furthermore, the different sizes of circles in Figure 5 indicate different occurrence frequencies; the larger the circle is, the higher the occurrence frequency. Among them, LR appeared 671 times and was the most popular keyword in the field. Moreover, an increasing number of AI algorithms have been applied to the field of geohazards by researchers.

3.2. Analysis of Authors, Institutions, and Countries

3.2.1. Most Productive Authors in AI Research in the Field of Geohazards

The affiliations of the top 15 most productive authors in terms of AI research in the field of geohazards and their H-index, total number of papers, and total number of citations are shown in Table 2. As shown in this table, each of these authors published 26 or more papers. These authors have published a cumulative total of 732 papers, which accounts for 7.93% of the papers published by researchers worldwide. Six of these researchers are affiliated with Asian institutions, with three researchers from China and three from Iran. Therefore, AI in the field of geohazards is considered to be developing rapidly in Asia, with China and Iran being the main participating countries in this field of research.
The most productive authors are Pradhan, Biswajeet (136); Dieu Tien Bui (70); and Pourghasemi, Hamid Reza (65); who are also the most cited authors. Six of the top 15 authors have a higher number of average citations (i.e., higher than 69.30) than the rest. They are Pourghasemi, Hamid Reza (97.42); Pradhan, Biswajeet (96.66); Shahabi, Himan (84.60); Dieu Tien Bui (80.06); Rahmati, Omid (76.93); and Hong, Haoyuan (69.94). Four authors have an H-index above 60: Pradhan, Biswajeet (94); Dieu Tien Bui (68); Pourghasemi, Hamid Reza (66); and Lee, Saro (64). Six authors have a relative citation impact greater than 1 relative to the other top 15 authors. They are Pourghasemi, Hamid Reza (1.41); Pradhan, Biswajeet (1.39); Shahabi, Himan (1.22); Dieu Tien Bui (1.16); Rahmati, Omid (1.11); and Hong, Haoyuan (1.01). According to these data, Pradhan, Biswajeet; Dieu Tien Bui; and Pourghasemi, Hamid Reza have better performances under all parameters. Therefore, they are considered to be strong influential researchers in the field.

3.2.2. Most Productive Institutions in Terms of AI Research in the Field of Geohazards

Among 9226 scientometric records of AI studies in the field of geohazards, 841 institutions are identified. Table 3 shows the data related to the top 15 most productive institutions. At each of these institutions, 83 or more papers have been published. These papers (1806 total) account for 19.58% of the cumulative number of papers in the field. The institutions with the most published papers are the Chinese Academy of Sciences (384), China University of Geosciences (158), and U.S. Geological Survey (156). The Chinese Academy of Sciences has not only published the most papers but also has the highest total citations and is one of the most active research institutions in AI research in the field of geohazards. Those results correspond well with the bibliometric review of Ho and Wang [39] in the field of AI.
The average citations per paper for all papers related to AI research in the field of geohazards from the top 15 institutions is 28.46%. Six institutions have higher average citations per paper than the others. These institutions include the U.S. Geological Survey (57.23); University of California, Berkeley (50.91); Sejong University (42.82); Duy Tan University (36.02); Islamic Azad University (35.74); and China Earthquake Administration (28.60). From 1990 to 2022, the relative citation impact of the top 15 most productive institutions relative to the total global research output of AI in geohazards was 1.00. Five institutions exceeded this relative citation impact, including the U.S. Geological Survey (2.00); University of California, Berkeley (1.78); Sejong University (1.5); Duy Tan University (1.26); and Islamic Azad University (1.25).
According to average citations per paper and relative citation impact, the U.S. Geological Survey; University of California, Berkeley; and Sejong University are considered to be the most active institutions in this field.

3.2.3. Top Countries in Terms of AI Research in the Field of Geohazards

Information on publications by country and region is closely related to publication characteristics but reflects different information (see Table 4). This table shows that China is the country with the most publications in this field during 1990–2022, with 2349 historical publications. The United States and Italy rank second and third with a total of 1993 and 894 publications, respectively. In addition, the United States has the highest total citations with 61,656 historical citations, followed by China (41,179) and Italy (27,388). Malaysia has the highest average citations per paper with an average of 75.71, followed by Norway and Vietnam with an average of 56.19 and 41.37 citations per paper, respectively. In the area of intercountry cooperation, the United States is the most influential country in this field and has the highest number of collaborations among 10 countries, including China, Italy, and the United Kingdom. In addition, the United States and China are the closest collaborators, with 279 collaborations.
Figure 6 shows the cooperation of major countries and regions in this field. The figure clearly shows that China, the United States, Italy, and Iran are the countries with the most AI studies in geohazard research. They are also the countries with the highest number of papers published. Among them, China has the highest number of publications and the second highest number of collaborations with other countries after the United States. The United States is the second most published country and has the most collaborations with other countries. In addition, China and the United States are the countries that cooperate most closely with each other. Italy has published more papers than Iran but has collaborated less with other countries than Iran. These results indicate that China and the United States are the two most representative countries in AI research in the field geohazards. Those results correspond well with the previous bibliometric review of Ho and Wang and Wu et al. [38,39]. Additionally, Italy and Iran also have a productive role in the AI-based geohazard research.

3.3. Identification of Salient Research Clusters

In this study, we selected papers ranked in the top 40% of references each year as the prominent research clusters for identifying the development of AI in geohazard research. By applying the log-likelihood ratio algorithm, 10 prominent research clusters were identified based on the keywords of the top-cited references (see Table 5). The identified clusters for #0 to #9 are ground motion, DL, GIS, landslide, impact, segment linkage, prediction, root reinforcement, debris flow, validation, respectively. Figure 7 and Table 5 show the prominent research clusters obtained based on the Web of Science search results. The silhouette value, a measure of the homogeneity of individual clusters, ranges from −1 to 1. The clustering results are considered convincing only when the silhouette value is greater than 0.5. As shown in Table 5, the silhouette values determined in this study ranged from 0.81 to 1, which indicates that the clustering results are convincing and that the members of each cluster have good consistency. For brevity, only the first five clusters (#0 to #4) were analyzed in this study and are discussed below.
Ground motion: The largest cluster (#0) is labeled ground motion, with a total of 70 members and a silhouette value of 0.81. A representative paper is that by Boore and Atkinson [40]. Ground motion usually refers to the surface movement of an area caused by an earthquake or explosion that results from waves generated by the sudden sliding of a fault or the sudden appearance of pressure from an explosion source and propagates along the surface of the Earth.
Deep learning: The second largest cluster (#1) is labeled DL, with 64 members and a silhouette value of 0.91. A representative paper is that by Bui et al. [41]. DL, a major branch of ML, is an algorithm for learning representations of information based on ANNs [45]. With the development of DL, its powerful nonlinear data processing capability has received increasing attention from geohazard researchers. DL is very powerful in geohazard processing and is effective in information extraction. It has since been introduced into geohazard analysis and prevention [14], including landslide and mudflow detection, seismic data interpolation and noise reduction.
GIS: The third largest cluster (#2) is labeled GIS, with 61 members and a silhouette value of 0.97. A representative paper is that by Guzzetti et al. [42]. GIS, or geographic information systems, is a comprehensive discipline of geography, cartography, and computer technology and is now widely used in the field of geohazards. GIS technology has contributed to quantitative studies of geohazard risk assessment and mapping. It has made important contributions in delineating geohazard susceptibility and sensitivity maps, land planning and utilization, and disaster loss reduction [46].
Landslide: The fourth largest cluster (#3) is labeled landslide, with a total of 54 members and a silhouette value of 0.91. Landslides are one of the most common geohazards, causing large economic losses and safety threats to people every year. Representative papers include those by Pradhan [43], Tien Bui et al. [47], and Pourghasemi et al. [48]. These papers consider specific applications of AI in landslide hazards. AI is commonly used in landslide hazards such as displacement prediction and susceptibility mapping.
Impact: The fifth largest cluster (#4) is labeled impact, with 47 members and a profile value of 0.92. The impact label includes both the impact factors and the impacts caused by geohazards. In representative papers by Kim et al. and Ma et al. [44,49], the impact factors of geohazards were studied. Claessens et al. [50] studied the impact of geohazards.

3.4. Top Algorithms and Future Trends in AI Research of Geohazards

In addition to the keyword characteristic analysis of publications, the 10 keywords with the strongest keyword citation burst are all related to AI technology. Seven of these keywords are AI algorithms. This also indicates that AI algorithms have become an important method in geohazard research. Therefore, it is necessary to summarize and analyze the AI algorithms commonly used in geohazard research. Table 6 shows a brief summary and some advantages and limitations of some common AI algorithms in the field of geohazards. Among them, NB, DT, and SVM are the classic ML algorithms. These common single ML algorithms had seen citation outbreaks one after another in 2017 and are widely used by researchers of geohazards. Recently, DL methods, including autoencoders and convolutional and recurrent neural networks, have been widely used by researchers because of their greater processing power of raw natural data [51] and higher accuracy of qualitative hazard prediction [16] than traditional ML methods. DL is the second largest cluster in the cluster analysis and a silhouette of 0.91 is a convincing result. Therefore, it can be considered that DL is one of the trends of AI in geohazards.

4. Future Directions

AI has been extensively applied to geohazard research, yielding tremendous success. Based on the scientometric analysis of the literature to date, we recommend the following aspects should be addressed for AI-based geohazard research.

4.1. Establishment of Benchmark Database

AI modeling is driven by data [103]; therefore, the quantity and quality of the data may directly affect the performance of AI [14]. However, some fundamental constraints remain for data acquisition and preparation. Firstly, the high cost of monitoring equipment limits the coverage of field monitoring and limits researchers’ access to high-quality field data. Another impediment is the lack of large and generalized geohazard datasets. Although tens of thousands of papers have been published for AI-based geohazard research, it is difficult to extract and utilize openly available, curated, and labeled training data. Generally, researchers from different institutions often use different datasets and research methods for their studies, with the terminology and data completeness in the papers varying tremendously. This has led to strong calls from researchers for the establishment of a benchmark database, data sharing, and standardization of data reporting [104] which will be an important boost for the development of AI-based geohazard researches. Some researchers have already started data sharing work. For example, Ji et al. [105] shared a large landslide dataset (http://study.rsgis.whu.edu.cn/pages/download/, accessed on 4 October 2022) containing landslide images, landslide boundary information, landslide area DEM data, etc. Mousavi et al. [100] contributed a large number of high-quality seismic analysis datasets (https://github.com/smousavi05/STEAD, accessed on 4 October 2022) which contain local seismic waveforms, seismic noise waveforms, and no seismic signals. These publicly available high-quality datasets can be used as benchmark datasets for the evaluation of the performance of different AI algorithms in this field and provide a reproducible evaluation environment. In addition, a standardized baseline database not only provides researchers with high-quality datasets but also eases the work of researchers in data management [106]. Therefore, a standardized geohazard benchmark database is desired by researchers.

4.2. Integration of AI with Physical Processes

AI techniques provide good performances in geohazards such as landslide susceptibility evaluation [20], earthquake identification and phase selection [21], and volcanic activity monitoring [24]. However, researchers and practitioners still face challenges in enhancing the reliability [107]. To improve the reliability of AI, some researchers have attempted to integrate AI and physical processes to embody the powerful data processing capabilities of AI techniques and the reliability of physical processes in an ensemble algorithm [108]. For example, Jiang et al. [108] proposed an algorithm to improve the geoscientific knowledge of AI. Depina et al. [109] used an algorithm for the study of unsaturated groundwater flow using a combination of AI and physical processes. The reliability has been enhanced by adopting data-driven components to improve the unrepresentable parts of physical processes and integration of the evolution of physical processes in AI algorithms.

4.3. Auto ML

A strong mastery of expert knowledge is required for AI-based geohazard research. A general workflow for AI modeling usually consists of data preprocessing, feature engineering, selection of a machine learning model, and optimization of the associated hyperparameters [110,111]. Reducing the requirement of expert knowledge and automating all the processing steps is a common expectation among researchers. Some researchers have offered auto ML platforms that have somewhat overcome the problems of algorithm selection and hyperparameter optimization, reducing the need for expertise in AI algorithms. For example, Auto-sklearn optimized hyperparameter selection using a Bayesian algorithm and automated policy selection using meta-learning and integration structures [112]. Auto-WEKA implements the automatic selection of algorithms and hyperparameter optimization based on Bayesian optimization techniques [113]. These auto ML platforms have proven their capacity in the fields of medicine [114], mechanics [115], and geoscience [116].

4.4. Uncertainty Quantification

AI analyzes geohazard data by building corresponding models to predict the occurrence of geohazards and provide evidence and suggestions for its prevention and mitigation. In this process, the uncertainties existing in the data and models may bias the analysis results. Data uncertainty is generated due to class overlap and noise in the training data and is non-approachable due to limitations in how the data are collected. Epistemic uncertainty results from errors caused by model inference or model performance [117]. With the widespread use of AI in geohazards, it is becoming more and more crucial to evaluate the validity and reliability of AI systems before using their analysis results.
Currently, accurate uncertainty quantification is the key to enhance the reliability and accuracy of AI analysis results and the future direction of AI in the field of geohazards. A few researchers have started research on uncertainty quantification. The most common approaches can be divided into Bayesian uncertainty quantification that focuses on specifying the training set to approximate the posterior probability distribution, such as Monte Carlo [118] and Markov Chain Monte Carlo [119], and ensemble uncertainty quantification that obtains improved accuracy by combining multiple models [120] such as deep ensemble [121] and Dirichlet Deep Networks [122].

4.5. Interpretable AI

Some AI algorithms cannot provide a reasonable interpretation for their results which makes researchers and practitioners distrust results obtained from AI. This has limited the development of AI-based geohazard research to a large extent and has brought increasing attention to interpretable AI [123]. Based on previous studies, research methods for interpreting AI techniques are maturing [124,125], terminology and metrics are being harmonized [126,127], and there is some development in the evaluation of interpretable AI and interpretation of AI. Some primary research methods are currently being used to study AI “black boxes”; for example, by decomposing model components into small parts that we can explain [128] and by visualizing the weights of different models to improve the interpretability of DL for seismic monitoring and phase selection [21]. Future works should include overcoming the obstacles to development caused by the uncertainty of quantitative AI interpretation methods, causal interpretation, feature dependence, and other problems [129].

5. Conclusions

AI has been extensively applied to geohazard research and yielding tremendous success. The present study performed a scientometric-assisted review for AI-based geohazard research by visualization of the research status quo and identification of the salient term and context, research trend, and mapping interconnection based on 9226 scientometric records. The analysis of the research publication trend indicates that AI has obtained continuous development in geohazard research over the past 30 years and entered a period of rapid growth beginning in 2000. An analysis of publication source characteristics has revealed that Natural Hazards and Remote Sensing are the top two source journals. Geomorphology and Catena are considered to be the most active journals in this field. The analysis of keyword features revealed that ML is a popular research method in this field. Pradhan, Biswajeet; Dieu Tien Bui, and Pourghasemi, Hamid Reza are among the three most productive researchers in this field. Three organizations including the U.S. Geological Survey; University of California, Berkeley; and Sejong University are considered to be the most productive institutions in this field. China, the United States, and Italy are the countries with the highest number of publications and the highest number of total citations among all countries. Identification of salient research clusters indicates that ground motion, DL, GIS, and landslides are current research hotspots.
Future studies on AI-based geohazard research themes may focus on the establishment of benchmark database, integration of AI with physical processes, Auto ML, uncertainty quantification and interpretable AI.
This scientometric review offers useful reference points for early-stage researchers and provides valuable in-depth information to experienced researchers and practitioners in the field of geohazard research. This scientometric analysis and visualization are promising for comprehensively reflecting the global picture of AI-based geohazard research and are potential for visualization the emerging trends in other research fields.

Author Contributions

Conceptualization, J.M. and S.J.; methodology, J.M. and S.J.; validation, J.M., Z.L. and H.G.; formal analysis, S.J.; data curation, J.M. and Z.L.; writing—original draft preparation, S.J.; writing—review and editing, S.J., J.M., Z.L. and H.G.; visualization, S.J. and Z.L.; supervision J.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Major Program of the National Natural Science Foundation of China (Grant No. 42090055), the National Natural Science Foundation of China (Grant Nos. 42177147 and 71874165), and the Fundamental Research Funds for the Central Universities, China University of Geosciences (Wuhan) (CUG2642022006).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

AI: artificial intelligence; ANN: artificial neural network; DL: deep learning; DT: decision tree; EM-DAT: Emergency Events Database; EM: ensemble method; ELM: extreme learning machine; KNN: k-nearest neighbor; LR: logistic regression; ML: machine learning; NB: naive Bayes; RF: random forest; SVM: support vector machine.

References

  1. Liu, Z.; Li, W.; Zhang, L.; Li, J. Fine Geological Modeling of Complex Fault Block Reservoir Based on Deep Learning. Wireless Commun. Mobile Comput. 2022, 2022, 9670311. [Google Scholar] [CrossRef]
  2. Zhang, K.; Wang, L.; Dai, Z.; Huang, B.; Zhang, Z. Evolution trend of the Huangyanwo rock mass under the action of reservoir water fluctuation. Nat. Hazard. 2022, 113, 1583–1600. [Google Scholar] [CrossRef]
  3. Shi, W.; Zhang, M.; Ke, H.; Fang, X.; Zhan, Z.; Chen, S. Landslide Recognition by Deep Convolutional Neural Network and Change Detection. IEEE Trans. Geosci. Remote Sens. 2021, 59, 4654–4672. [Google Scholar] [CrossRef]
  4. Miao, F.; Zhao, F.; Wu, Y.; Li, L.; Xue, Y.; Meng, J. A novel seepage device and ring-shear test on slip zone soils of landslide in the Three Gorges Reservoir area. Eng. Geol. 2022, 307, 106779. [Google Scholar] [CrossRef]
  5. Guo, W.; Zuo, X.; Yu, J.; Zhou, B. Method for Mid-Long-Term Prediction of Landslides Movements Based on Optimized Apriori Algorithm. Appl. Sci. 2019, 9, 3819. [Google Scholar] [CrossRef] [Green Version]
  6. Miao, F.; Wu, Y.; Török, Á.; Li, L.; Xue, Y. Centrifugal model test on a riverine landslide in the Three Gorges Reservoir induced by rainfall and water level fluctuation. Geosci. Front. 2022, 13, 101378. [Google Scholar] [CrossRef]
  7. Wang, L.; Zhang, Z.; Huang, B.; Hu, M.; Zhang, C. Triggering mechanism and possible evolution process of the ancient Qingshi landslide in the Three Gorges Reservoir. Geomat. Nat. Hazards Risk 2021, 12, 3160–3174. [Google Scholar] [CrossRef]
  8. Wu, S.; Hu, X.; Zheng, W.; He, C.; Zhang, G.; Zhang, H.; Wang, X. Effects of reservoir water level fluctuations and rainfall on a landslide by two-way ANOVA and K-means clustering. Bull. Eng. Geol. Environ. 2021, 80, 5405–5421. [Google Scholar] [CrossRef]
  9. Miao, F.; Wu, Y.; Xie, Y.; Li, Y. Prediction of landslide displacement with step-like behavior based on multialgorithm optimization and a support vector regression model. Landslides 2018, 15, 475–488. [Google Scholar] [CrossRef]
  10. Ma, J.; Niu, X.; Tang, H.; Wang, Y.; Wen, T.; Zhang, J. Displacement prediction of a complex landslide in the Three Gorges Reservoir Area (China) using a hybrid computational intelligence approach. Complexity 2020, 2020, 2624547. [Google Scholar] [CrossRef]
  11. L’Heureux, A.; Grolinger, K.; Elyamany, H.F.; Capretz, M.A.M. Machine Learning with Big Data: Challenges and Approaches. IEEE Access 2017, 5, 7776–7797. [Google Scholar] [CrossRef]
  12. Niu, X.; Ma, J.; Wang, Y.; Zhang, J.; Chen, H.; Tang, H. A Novel Decomposition-Ensemble Learning Model Based on Ensemble Empirical Mode Decomposition and Recurrent Neural Network for Landslide Displacement Prediction. Appl. Sci. 2021, 11, 4684. [Google Scholar] [CrossRef]
  13. Dikshit, A.; Pradhan, B.; Alamri, A.M. Pathways and challenges of the application of artificial intelligence to geohazards modelling. Gondwana Res. 2021, 100, 290–301. [Google Scholar] [CrossRef]
  14. Ma, Z.; Mei, G. Deep learning for geological hazards analysis: Data, models, applications, and opportunities. Earth Sci. Rev. 2021, 223, 103858. [Google Scholar] [CrossRef]
  15. Pradhan, B.; Lee, S. Landslide risk analysis using artificial neural network model focussing on different training sites. Int. J. Phys. Sci. 2009, 4, 1–15. [Google Scholar]
  16. Ma, J.W.; Xia, D.; Wang, Y.K.; Niu, X.X.; Jiang, S.; Liu, Z.Y.; Guo, H.X. A comprehensive comparison among metaheuristics (MHs) for geohazard modeling using machine learning: Insights from a case study of landslide displacement prediction. Eng. Appl. Artif. Intell. 2022, 114, 105150. [Google Scholar] [CrossRef]
  17. Kalantar, B.; Pradhan, B.; Naghibi, S.A.; Motevalli, A.; Mansor, S. Assessment of the effects of training data selection on the landslide susceptibility mapping: A comparison between support vector machine (SVM), logistic regression (LR) and artificial neural networks (ANN). Geomat. Nat. Hazards Risk 2017, 9, 49–69. [Google Scholar] [CrossRef]
  18. Xia, D.; Tang, H.M.; Sun, S.X.; Tang, C.Y.; Zhang, B.C. Landslide Susceptibility Mapping Based on the Germinal Center Optimization Algorithm and Support Vector Classification. Remote Sens. 2022, 14, 2707. [Google Scholar] [CrossRef]
  19. Ghorbanzadeh, O.; Blaschke, T.; Gholamnia, K.; Meena, S.; Tiede, D.; Aryal, J. Evaluation of Different Machine Learning Methods and Deep-Learning Convolutional Neural Networks for Landslide Detection. Remote Sens. 2019, 11, 196. [Google Scholar] [CrossRef] [Green Version]
  20. Zhang, Y.; Ge, T.; Tian, W.; Liou, Y.-A. Debris Flow Susceptibility Mapping Using Machine-Learning Techniques in Shigatse Area, China. Remote Sens. 2019, 11, 2801. [Google Scholar] [CrossRef] [Green Version]
  21. Mousavi, S.M.; Ellsworth, W.L.; Zhu, W.; Chuang, L.Y.; Beroza, G.C. Earthquake transformer—An attentive deep-learning model for simultaneous earthquake detection and phase picking. Nat. Commun. 2020, 11, 3952. [Google Scholar] [CrossRef]
  22. Wu, B.; Qiu, W.; Huang, W.; Meng, G.; Nong, Y.; Huang, J. A Multi-Source Information Fusion Evaluation Method for the Tunneling Collapse Disaster Based on the Artificial Intelligence Deformation Prediction. Arab. J. Sci. Eng. 2022, 47, 5053–5071. [Google Scholar] [CrossRef]
  23. Choubin, B.; Borji, M.; Mosavi, A.; Sajedi-Hosseini, F.; Singh, V.P.; Shamshirband, S. Snow avalanche hazard prediction using machine learning methods. J. Hydrol. 2019, 577, 123929. [Google Scholar] [CrossRef]
  24. Valade, S.; Ley, A.; Massimetti, F.; D’Hondt, O.; Laiolo, M.; Coppola, D.; Loibl, D.; Hellwich, O.; Walter, T.R. Towards Global Volcano Monitoring Using Multisensor Sentinel Missions and Artificial Intelligence: The MOUNTS Monitoring System. Remote Sens. 2019, 11, 1528. [Google Scholar] [CrossRef] [Green Version]
  25. Huang, Y.; Zhao, L. Review on landslide susceptibility mapping using support vector machines. CATENA 2018, 165, 520–529. [Google Scholar] [CrossRef]
  26. Merghadi, A.; Yunus, A.P.; Dou, J.; Whiteley, J.; ThaiPham, B.; Bui, D.T.; Avtar, R.; Abderrahmane, B. Machine learning methods for landslide susceptibility studies: A comparative overview of algorithm performance. Earth Sci. Rev. 2020, 207, 103225. [Google Scholar] [CrossRef]
  27. Xie, Y.; Ebad Sichani, M.; Padgett, J.E.; DesRoches, R. The promise of implementing machine learning in earthquake engineering: A state-of-the-art review. Earthq. Spectra 2020, 36, 1769–1801. [Google Scholar] [CrossRef]
  28. Yalcinkaya, M.; Singh, V. Patterns and trends in Building Information Modeling (BIM) research: A Latent Semantic Analysis. Autom. Constr. 2015, 59, 68–80. [Google Scholar] [CrossRef]
  29. Martinez, P.; Al-Hussein, M.; Ahmad, R. A scientometric analysis and critical review of computer vision applications for construction. Autom. Constr. 2019, 107, 102947. [Google Scholar] [CrossRef]
  30. Li, P.; Lu, Y.; Yan, D.; Xiao, J.; Wu, H. Scientometric mapping of smart building research: Towards a framework of human-cyber-physical system (HCPS). Autom. Constr. 2021, 129, 103776. [Google Scholar] [CrossRef]
  31. Olawumi, T.O.; Chan, D.W.M. A scientometric review of global research on sustainability and sustainable development. J. Cleaner Prod. 2018, 183, 231–250. [Google Scholar] [CrossRef]
  32. Chatterjee, J.; Dethlefs, N. Scientometric review of artificial intelligence for operations & maintenance of wind turbines: The past, present and future. Renew. Sustain. Energy Rev. 2021, 144, 111051. [Google Scholar]
  33. Borner, K.; Chen, C.M.; Boyack, K.W. Visualizing knowledge domains. Annu. Rev. Inf. Sci. Technol. 2003, 37, 179–255. [Google Scholar] [CrossRef]
  34. Chen, C.M. CiteSpace II: Detecting and visualizing emerging trends and transient patterns in scientific literature. J. Am. Soc. Inf. Sci. Technol. 2006, 57, 359–377. [Google Scholar] [CrossRef] [Green Version]
  35. Chen, C. CiteSpace: A Practical Guide for Mapping Scientific Literature; Nova Science Publishers: New York, NY, USA, 2016. [Google Scholar]
  36. Bar-Ilan, J.; Levene, M.; Lin, A. Some measures for comparing citation databases. J. Informetr. 2007, 1, 26–34. [Google Scholar] [CrossRef]
  37. Falagas, M.E.; Pitsouni, E.I.; Malietzis, G.A.; Pappas, G. Comparison of PubMed, Scopus, Web of Science, and Google Scholar: Strengths and weaknesses. FASEB J. 2008, 22, 338–342. [Google Scholar] [CrossRef]
  38. Wu, X.; Chen, X.; Zhan, F.B.; Hong, S. Global research trends in landslides during 1991–2014: A bibliometric analysis. Landslides 2015, 12, 1215–1226. [Google Scholar] [CrossRef]
  39. Ho, Y.-S.; Wang, M.-H. A bibliometric analysis of artificial intelligence publications from 1991 to 2018. Collnet J. Scientometr. Inf. Manag. 2020, 14, 369–392. [Google Scholar] [CrossRef]
  40. Boore, D.M.; Atkinson, G.M. Ground-Motion Prediction Equations for the Average Horizontal Component of PGA, PGV, and 5%-Damped PSA at Spectral Periods between 0.01 s and 10 s. Earthq. Spectra 2008, 24, 99–138. [Google Scholar] [CrossRef] [Green Version]
  41. Bui, D.T.; Tsangaratos, P.; Nguyen, V.-T.; Liem, N.V.; Trinh, P.T. Comparing the prediction performance of a Deep Learning Neural Network model with conventional machine learning models in landslide susceptibility assessment. CATENA 2020, 188, 104426. [Google Scholar] [CrossRef]
  42. Guzzetti, F.; Carrara, A.; Cardinali, M.; Reichenbach, P. Landslide hazard evaluation: A review of current techniques and their application in a multi-scale study, Central Italy. Geomorphology 1999, 31, 181–216. [Google Scholar] [CrossRef]
  43. Pradhan, B. A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS. Comput. Geosci. 2013, 51, 350–365. [Google Scholar] [CrossRef]
  44. Kim, H.; Lee, J.-H.; Park, H.-J.; Heo, J.-H. Assessment of temporal probability for rainfall-induced landslides based on nonstationary extreme value analysis. Eng. Geol. 2021, 294, 106372. [Google Scholar] [CrossRef]
  45. Deng, L. Deep Learning: Methods and Applications. Found. Trends Signal Processing 2014, 7, 197–387. [Google Scholar] [CrossRef] [Green Version]
  46. Yalcin, A. GIS-based landslide susceptibility mapping using analytical hierarchy process and bivariate statistics in Ardesen (Turkey): Comparisons of results and confirmations. CATENA 2008, 72, 1–12. [Google Scholar] [CrossRef]
  47. Tien Bui, D.; Tuan, T.A.; Klempe, H.; Pradhan, B.; Revhaug, I. Spatial prediction models for shallow landslide hazards: A comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 2016, 13, 361–378. [Google Scholar] [CrossRef]
  48. Pourghasemi, H.R.; Pradhan, B.; Gokceoglu, C. Application of fuzzy logic and analytical hierarchy process (AHP) to landslide susceptibility mapping at Haraz watershed, Iran. Nat. Hazard. 2012, 63, 965–996. [Google Scholar] [CrossRef]
  49. Ma, J.W.; Tang, H.M.; Hu, X.L.; Bobet, A.; Zhang, M.; Zhu, T.W.; Song, Y.J.; Eldin, M. Identification of causal factors for the Majiagou landslide using modern data mining methods. Landslides 2017, 14, 311–322. [Google Scholar] [CrossRef]
  50. Claessens, L.; Schoorl, J.M.; Veldkamp, A. Modelling the location of shallow landslides and their effects on landscape dynamics in large watersheds: An application for Northern New Zealand. Geomorphology 2007, 87, 16–27. [Google Scholar] [CrossRef]
  51. LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
  52. Zhang, H. The optimality of naive Bayes. Aa 2004, 1, 3. [Google Scholar]
  53. Jiang, L.; Li, C.; Wang, S.; Zhang, L. Deep feature weighting for naive Bayes and its application to text classification. Eng. Appl. Artif. Intell. 2016, 52, 26–39. [Google Scholar] [CrossRef]
  54. He, Q.; Shahabi, H.; Shirzadi, A.; Li, S.; Chen, W.; Wang, N.; Chai, H.; Bian, H.; Ma, J.; Chen, Y.; et al. Landslide spatial modelling using novel bivariate statistical based Naïve Bayes, RBF Classifier, and RBF Network machine learning algorithms. Sci. Total Environ. 2019, 663, 1–15. [Google Scholar] [CrossRef]
  55. Das, I.; Stein, A.; Kerle, N.; Dadhwal, V.K. Landslide susceptibility mapping along road corridors in the Indian Himalayas using Bayesian logistic regression models. Geomorphology 2012, 179, 116–125. [Google Scholar] [CrossRef]
  56. Jiang, L.; Zhang, L.; Li, C.; Wu, J. A Correlation-Based Feature Weighting Filter for Naive Bayes. IEEE Trans. Knowl. Data Eng. 2019, 31, 201–213. [Google Scholar] [CrossRef]
  57. Tien Bui, D.; Pradhan, B.; Lofman, O.; Revhaug, I. Landslide Susceptibility Assessment in Vietnam Using Support Vector Machines, Decision Tree, and Naïve Bayes Models. Math. Probl. Eng. 2012, 2012, 974638. [Google Scholar] [CrossRef] [Green Version]
  58. Kotsiantis, S.B. Decision trees: A recent overview. Artif. Intell. Rev. 2011, 39, 261–283. [Google Scholar] [CrossRef]
  59. Khadse, V.M.; Mahalle, P.N.; Shinde, G.R. Statistical Study of Machine Learning Algorithms Using Parametric and Non-Parametric Tests: A Comparative Analysis and Recommendations. Int. J. Ambient. Comput. Intell. 2020, 11, 80–105. [Google Scholar] [CrossRef]
  60. Moshkov, M.J. Time Complexity of Decision Trees; Transactions on Rough Sets III; Peters, J.F., Skowron, A., Eds.; Springer: Berlin/Heidelberg, Germany, 2005; pp. 244–459. [Google Scholar]
  61. Dietterich, T.G. An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting, and randomization. Mach. Learn. 2000, 40, 139–157. [Google Scholar] [CrossRef]
  62. Yildiz, O.T.; Alpaydin, E. Omnivariate decision trees. IEEE Trans. Neural Netw. 2001, 12, 1539–1546. [Google Scholar] [CrossRef]
  63. Tian, Y.; Qi, Z.; Ju, X.; Shi, Y.; Liu, X. Nonparallel Support Vector Machines for Pattern Classification. IEEE Trans. Cybern. 2014, 44, 1067–1079. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  64. Nalepa, J.; Kawulok, M. Selecting training sets for support vector machines: A review. Artif. Intell. Rev. 2018, 52, 857–900. [Google Scholar] [CrossRef] [Green Version]
  65. Li, Z.; Weida, Z.; Licheng, J. Wavelet support vector machine. IEEE Trans. Syst. Man Cybern. Part B 2004, 34, 34–39. [Google Scholar]
  66. Jiang, Q.; Wang, G.; Jin, S.; Li, Y.; Wang, Y. Predicting human microRNA-disease associations based on support vector machine. Int. J. Data Min. Bioinf. 2013, 8, 282. [Google Scholar] [CrossRef]
  67. Trafalis, T.B.; Ince, H. Support vector machine for regression and applications to financial forecasting. In Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium, Como, Italy, 27 July 2000. [Google Scholar]
  68. Zendehboudi, A.; Baseer, M.A.; Saidur, R. Application of support vector machine models for forecasting solar and wind energy resources: A review. J. Cleaner Prod. 2018, 199, 272–285. [Google Scholar] [CrossRef]
  69. Wu, X.; Kumar, V.; Ross Quinlan, J.; Ghosh, J.; Yang, Q.; Motoda, H.; McLachlan, G.J.; Ng, A.; Liu, B.; Yu, P.S.; et al. Top 10 algorithms in data mining. Knowl. Inf. Syst. 2007, 14, 1–37. [Google Scholar] [CrossRef] [Green Version]
  70. Chen, X.; Xu, S.; Li, S.; He, H.; Han, Y.; Qu, X. Identification of architectural elements based on SVM with PCA: A case study of sandy braided river reservoir in the Lamadian Oilfield, Songliao Basin, NE China. J. Pet. Sci. Eng. 2021, 198, 108247. [Google Scholar] [CrossRef]
  71. Suthaharan, S. Support vector machine. In Machine Learning Models and Algorithms for Big Data Classification; Springer: New York, NY, USA, 2016; pp. 207–235. [Google Scholar]
  72. Jain, A.K.; Jianchang, M.; Mohiuddin, K.M. Artificial neural networks: A tutorial. Computer 1996, 29, 31–44. [Google Scholar] [CrossRef] [Green Version]
  73. Cao, W.; Wang, X.; Ming, Z.; Gao, J. A review on neural networks with random weights. Neurocomputing 2018, 275, 278–287. [Google Scholar] [CrossRef]
  74. Tu, J.V. Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. J. Clin. Epidemiol. 1996, 49, 1225–1231. [Google Scholar] [CrossRef]
  75. Ding, S.; Li, H.; Su, C.; Yu, J.; Jin, F. Evolutionary artificial neural networks: A review. Artif. Intell. Rev. 2011, 39, 251–260. [Google Scholar] [CrossRef]
  76. Li, H.; Xu, Q.; He, Y.; Deng, J. Prediction of landslide displacement with an ensemble-based extreme learning machine and copula models. Landslides 2018, 15, 2047–2059. [Google Scholar] [CrossRef]
  77. Huang, G.-B.; Zhu, Q.-Y.; Siew, C.-K. Extreme learning machine: Theory and applications. Neurocomputing 2006, 70, 489–501. [Google Scholar] [CrossRef]
  78. Zhao, F.; Liu, M.; Wang, K.; Wang, T.; Jiang, X. A soft measurement approach of wastewater treatment process by lion swarm optimizer-based extreme learning machine. Measurement 2021, 179, 109322. [Google Scholar] [CrossRef]
  79. Gou, J.; Ma, H.; Ou, W.; Zeng, S.; Rao, Y.; Yang, H. A generalized mean distance-based k-nearest neighbor classifier. Expert Syst. Appl. 2019, 115, 356–372. [Google Scholar] [CrossRef]
  80. Cai, P.; Wang, Y.; Lu, G.; Chen, P.; Ding, C.; Sun, J. A spatiotemporal correlative k-nearest neighbor model for short-term traffic multistep forecasting. Transp. Res. Part C Emerg. Technol. 2016, 62, 21–34. [Google Scholar] [CrossRef]
  81. Zhang, S.C.; Li, X.L.; Zong, M.; Zhu, X.F.; Cheng, D.B. Learning k for kNN Classification. ACM Trans. Intell. Syst. Technol. 2017, 8, 19. [Google Scholar] [CrossRef] [Green Version]
  82. Tripepi, G.; Jager, K.J.; Dekker, F.W.; Zoccali, C. Linear and logistic regression analysis. Kidney Int. 2008, 73, 806–810. [Google Scholar] [CrossRef] [Green Version]
  83. Mehrolia, S.; Alagarsamy, S.; Solaikutty, V.M. Customers response to online food delivery services during COVID-19 outbreak using binary logistic regression. Int. J. Consum. Stud. 2020, 45, 396–408. [Google Scholar] [CrossRef]
  84. Muche, R. Logistic regression—A useful tool in rehabilitation research. Rehabilitation 2008, 47, 56–62. [Google Scholar] [CrossRef]
  85. Bai, S.-B.; Wang, J.; Lü, G.-N.; Zhou, P.-G.; Hou, S.-S.; Xu, S.-N. GIS-based logistic regression for landslide susceptibility mapping of the Zhongxian segment in the Three Gorges area, China. Geomorphology 2010, 115, 23–31. [Google Scholar] [CrossRef]
  86. Sperandei, S. Understanding logistic regression analysis. Biochem. Med. 2014, 24, 12–18. [Google Scholar] [CrossRef] [PubMed]
  87. Han, D.; Ma, L.; Yu, C. Financial Prediction: Application of Logistic Regression with Factor Analysis. In Proceedings of the 2008 4th International Conference on Wireless Communications, Networking and Mobile Computing, Dalian, China, 12–14 October 2008. [Google Scholar]
  88. Wang, Z.Y.; Srinivasan, R.S. A review of artificial intelligence based building energy use prediction: Contrasting the capabilities of single and ensemble prediction models. Renew. Sust. Energ. Rev. 2017, 75, 796–808. [Google Scholar] [CrossRef]
  89. Ma, J.W.; Xia, D.; Guo, H.X.; Wang, Y.K.; Niu, X.X.; Liu, Z.Y.; Jiang, S. Metaheuristic-based support vector regression for landslide displacement prediction: A comparative study. Landslides 2022, 23, 2489–2511. [Google Scholar] [CrossRef]
  90. Aburomman, A.A.; Ibne Reaz, M.B. A novel SVM-kNN-PSO ensemble method for intrusion detection system. Appl. Soft Comput. 2016, 38, 360–372. [Google Scholar] [CrossRef]
  91. Ma, J.W.; Liu, X.; Niu, X.X.; Wang, Y.N.; Wen, T.; Zhang, J.R.; Zou, Z.X. Forecasting of Landslide Displacement Using a Probability-Scheme Combination Ensemble Prediction Technique. Int. J. Environ. Res. Public Health 2020, 17, 4788. [Google Scholar] [CrossRef]
  92. Zhang, J.R.; Tang, H.M.; Wen, T.; Ma, J.W.; Tan, Q.W.; Xia, D.; Liu, X.; Zhang, Y.Q. A Hybrid Landslide Displacement Prediction Method Based on CEEMD and DTW-ACO-SVR-Cases Studied in the Three Gorges Reservoir Area. Sensors 2020, 20, 4287. [Google Scholar] [CrossRef]
  93. Sagi, O.; Rokach, L. Ensemble learning: A survey. WIREs Data Min. Knowl. Discov. 2018, 8, e1249. [Google Scholar] [CrossRef]
  94. Zhang, J.R.; Tang, H.M.; Tannant, D.D.; Lin, C.Y.; Xia, D.; Liu, X.; Zhang, Y.Q.; Ma, J.W. Combined forecasting model with CEEMD-LCSS reconstruction and the ABC-SVR method for landslide displacement prediction. J. Cleaner Prod. 2021, 293, 18. [Google Scholar] [CrossRef]
  95. Huang, D.; Wang, C.-D.; Lai, J.-H. Locally Weighted Ensemble Clustering. IEEE Trans. Cybern. 2018, 48, 1460–1473. [Google Scholar] [CrossRef] [Green Version]
  96. Janiesch, C.; Zschech, P.; Heinrich, K. Machine learning and deep learning. Electron. Mark. 2021, 31, 685–695. [Google Scholar] [CrossRef]
  97. Min, S.; Lee, B.; Yoon, S. Deep learning in bioinformatics. Brief. Bioinform. 2016, 18, 851–869. [Google Scholar] [CrossRef] [Green Version]
  98. Shen, D.; Wu, G.; Suk, H.-I. Deep Learning in Medical Image Analysis. Annu. Rev. Biomed. Eng. 2017, 19, 221–248. [Google Scholar] [CrossRef] [Green Version]
  99. Mater, A.C.; Coote, M.L. Deep Learning in Chemistry. J. Chem. Inf. Model. 2019, 59, 2545–2559. [Google Scholar] [CrossRef]
  100. Zhang, W.; Li, H.; Li, Y.; Liu, H.; Chen, Y.; Ding, X. Application of deep learning algorithms in geotechnical engineering: A short critical review. Artif. Intell. Rev. 2021, 54, 5633–5673. [Google Scholar] [CrossRef]
  101. Spring, R.; Shrivastava, A. Scalable and Sustainable Deep Learning via Randomized Hashing. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, 13–17 August 2017. [Google Scholar]
  102. Yuan, X.; Ou, C.; Wang, Y.; Yang, C.; Gui, W. A Layer-Wise Data Augmentation Strategy for Deep Learning Networks and Its Soft Sensor Application in an Industrial Hydrocracking Process. IEEE Trans. Neural Netw. Learn. Syst. 2021, 32, 3296–3305. [Google Scholar] [CrossRef]
  103. Ghahramani, Z. Probabilistic machine learning and artificial intelligence. Nature 2015, 521, 452–459. [Google Scholar] [CrossRef]
  104. Gewin, V. Data sharing: An open mind on open data. Nature 2016, 529, 117–119. [Google Scholar] [CrossRef] [Green Version]
  105. Ji, S.; Yu, D.; Shen, C.; Li, W.; Xu, Q. Landslide detection from an open satellite imagery and digital elevation model dataset using attention boosted convolutional neural networks. Landslides 2020, 17, 1337–1352. [Google Scholar] [CrossRef]
  106. Reichstein, M.; Camps-Valls, G.; Stevens, B.; Jung, M.; Denzler, J.; Carvalhais, N.; Prabhat. Deep learning and process understanding for data-driven Earth system science. Nature 2019, 566, 195–204. [Google Scholar] [CrossRef]
  107. Ebert-Uphoff, I.; Samarasinghe, S.; Barnes, E. Thoughtfully using artificial intelligence in Earth science. Eos 2019, 100, 10.1029. [Google Scholar] [CrossRef]
  108. Jiang, S.; Zheng, Y.; Solomatine, D. Improving AI System Awareness of Geoscience Knowledge: Symbiotic Integration of Physical Approaches and Deep Learning. Geophys. Res. Lett. 2020, 47, e2020GL088229. [Google Scholar] [CrossRef]
  109. Depina, I.; Jain, S.; Mar Valsson, S.; Gotovac, H. Application of physics-informed neural networks to inverse problems in unsaturated groundwater flow. Georisk Assess. Manage. Risk Eng. Syst. Geohazards 2021, 16, 21–36. [Google Scholar] [CrossRef]
  110. Hutter, F.; Kotthoff, L.; Vanschoren, J. Automated Machine Learning: Methods, Systems, Challenges; Springer Nature: New York, NY, USA, 2019. [Google Scholar]
  111. Sun, Z.; Sandoval, L.; Crystal-Ornelas, R.; Mousavi, S.M.; Wang, J.; Lin, C.; Cristea, N.; Tong, D.; Carande, W.H.; Ma, X.; et al. A review of Earth Artificial Intelligence. Comput. Geosci. 2022, 159, 105034. [Google Scholar] [CrossRef]
  112. Feurer, M.; Eggensperger, K.; Falkner, S.; Lindauer, M.; Hutter, F. Auto-sklearn 2.0: Hands-free automl via meta-learning. arXiv 2020, arXiv:2007.04074. [Google Scholar]
  113. Kotthoff, L.; Thornton, C.; Hoos, H.H.; Hutter, F.; Leyton-Brown, K. Auto-WEKA: Automatic model selection and hyperparameter optimization in WEKA. In Automated Machine Learning; Springer: Cham, Switzerland, 2019; pp. 81–95. [Google Scholar]
  114. Kim, I.K.; Lee, K.; Park, J.H.; Baek, J.; Lee, W.K. Classification of pachychoroid disease on ultrawide-field indocyanine green angiography using auto-machine learning platform. Br. J. Ophthalmol. 2021, 105, 856–861. [Google Scholar] [CrossRef]
  115. Sader, S.; Husti, I.; Daróczi, M. Enhancing failure mode and effects analysis using auto machine learning: A case study of the agricultural machinery industry. Processes 2020, 8, 224. [Google Scholar] [CrossRef] [Green Version]
  116. Hill, E.; Pearce, M.A.; Stromberg, J.M. Improving automated geological logging of drill holes by incorporating multiscale spatial methods. Math. Geosci. 2021, 53, 21–53. [Google Scholar] [CrossRef] [Green Version]
  117. Hüllermeier, E.; Waegeman, W. Aleatoric and epistemic uncertainty in machine learning: An introduction to concepts and methods. Mach. Learn. 2021, 110, 457–506. [Google Scholar] [CrossRef]
  118. Neal, R.M. Bayesian Learning for Neural Networks; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2012; Volume 118. [Google Scholar]
  119. Kupinski, M.A.; Hoppin, J.W.; Clarkson, E.; Barrett, H.H. Ideal-observer computation in medical imaging with use of Markov-chain Monte Carlo techniques. JOSA A 2003, 20, 430–438. [Google Scholar] [CrossRef] [Green Version]
  120. Abdar, M.; Pourpanah, F.; Hussain, S.; Rezazadegan, D.; Liu, L.; Ghavamzadeh, M.; Fieguth, P.; Cao, X.; Khosravi, A.; Acharya, U.R.; et al. A review of uncertainty quantification in deep learning: Techniques, applications and challenges. Inf. Fusion 2021, 76, 243–297. [Google Scholar] [CrossRef]
  121. Hu, R.; Huang, Q.; Chang, S.; Wang, H.; He, J. The MBPEP: A deep ensemble pruning algorithm providing high quality uncertainty prediction. Appl. Intell. 2019, 49, 2942–2955. [Google Scholar] [CrossRef] [Green Version]
  122. Tsiligkaridis, T. Information Aware max-norm Dirichlet networks for predictive uncertainty estimation. Neural Netw. 2021, 135, 105–114. [Google Scholar] [CrossRef]
  123. Adadi, A.; Berrada, M. Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI). IEEE Access 2018, 6, 52138–52160. [Google Scholar] [CrossRef]
  124. Carvalho, D.V.; Pereira, E.M.; Cardoso, J.S. Machine Learning Interpretability: A Survey on Methods and Metrics. Electronics 2019, 8, 832. [Google Scholar] [CrossRef]
  125. Guidotti, R.; Monreale, A.; Ruggieri, S.; Turini, F.; Giannotti, F.; Pedreschi, D. A Survey of Methods for Explaining Black Box Models. ACM Comput. Surv. 2019, 51, 1–42. [Google Scholar] [CrossRef] [Green Version]
  126. Hoffman, R.R.; Mueller, S.T.; Klein, G.; Litman, J. Metrics for explainable AI: Challenges and prospects. arXiv 2018, arXiv:1812.04608. [Google Scholar]
  127. Mohseni, S.; Zarei, N.; Ragan, E.D. A Multidisciplinary Survey and Framework for Design and Evaluation of Explainable AI Systems. ACM Trans. Interact. Intell. Syst. 2021, 11, 1–45. [Google Scholar] [CrossRef]
  128. Molnar, C.; Casalicchio, G.; Bischl, B. Interpretable Machine Learning—A Brief History, State-of-the-Art and Challenges; ECML PKDD 2020 Workshops; Koprinska, I., Kamp, M., Appice, A., Loglisci, C., Antonie, L., Zimmermann, A., Guidotti, R., Özgöbek, Ö., Ribeiro, R.P., Gavaldà, R., et al., Eds.; Springer International Publishing: Cham, Switzerland, 2020; pp. 417–431. [Google Scholar]
  129. Molnar, C.; König, G.; Herbinger, J.; Freiesleben, T.; Dandl, S.; Scholbeck, C.A.; Casalicchio, G.; Grosse-Wentrup, M.; Bischl, B. Pitfalls to avoid when interpreting machine learning models. arXiv 2020, arXiv:2007.04131v1. [Google Scholar]
Figure 1. (a) The distribution of geohazard loss (Source: https://public.emdat.be/data, accessed on 7 July 2022) and the number of papers. Different colors indicate different degrees of geohazard loss and the size and color of the circles indicate the number of papers published in that country. (b) The change in the number of geohazards and the number of publications over time. (c) The regional distribution of the number of geohazards.
Figure 1. (a) The distribution of geohazard loss (Source: https://public.emdat.be/data, accessed on 7 July 2022) and the number of papers. Different colors indicate different degrees of geohazard loss and the size and color of the circles indicate the number of papers published in that country. (b) The change in the number of geohazards and the number of publications over time. (c) The regional distribution of the number of geohazards.
Sensors 22 07814 g001
Figure 2. Main AI algorithms and applications in geohazard prevention mitigation.
Figure 2. Main AI algorithms and applications in geohazard prevention mitigation.
Sensors 22 07814 g002
Figure 3. Flow chart for scientometric analysis of AI for geohazard research.
Figure 3. Flow chart for scientometric analysis of AI for geohazard research.
Sensors 22 07814 g003
Figure 4. Characteristics of publications of AI for geohazard research by year.
Figure 4. Characteristics of publications of AI for geohazard research by year.
Sensors 22 07814 g004
Figure 5. Citation burst of AI in geohazard research during 1990–2022 (logarithmic scale).
Figure 5. Citation burst of AI in geohazard research during 1990–2022 (logarithmic scale).
Sensors 22 07814 g005
Figure 6. Cooperation between countries and regions.
Figure 6. Cooperation between countries and regions.
Sensors 22 07814 g006
Figure 7. Research cluster network for AI in geohazard research.
Figure 7. Research cluster network for AI in geohazard research.
Sensors 22 07814 g007
Table 1. Top 15 source journals according to the number of publications in AI for geohazard research (1990–2022).
Table 1. Top 15 source journals according to the number of publications in AI for geohazard research (1990–2022).
No.SourceTotal
Papers
Total
Citations
Average
Citations per Paper
Percentage
of Total
Papers
1Natural Hazards
(https://www.springer.com/journal/11069, accessed on 3 September 2022)
55012,28622.345.96%
2Remote Sensing
(https://www.mdpi.com/journal/remotesensing, accessed on 3 September 2022)
423637615.074.58%
3Journal of Hydrology
(https://journals.elsevier.com/journal-of-hydrology, accessed on 3 September 2022)
31413,41742.733.40%
4Bulletin of Earthquake Engineering
(https://www.springer.com/journal/10518/, accessed on 3 September 2022)
306606819.833.32%
5Soil Dynamics and Earthquake Engineering
(https://www.sciencedirect.com/journal/soil-dynamics-and-earthquake-engineering, accessed on 3 September 2022)
306406013.273.32%
6Environmental Earth Sciences
(https://www.springer.com/journal/12665, accessed on 3 September 2022)
298779826.173.23%
7Engineering Geology
(https://www.sciencedirect.com/journal/engineering-geology, accessed on 3 September 2022)
29011,41739.373.14%
8Geomorphology
(www.elsevie r.com/locate/geomorph, accessed on 3 September 2022)
28717,65661.523.11%
9Catena
(www.elsevier.com/locate/catena, accessed on 3 September 2022)
26912,81047.622.92%
10Natural Hazards and Earth System Sciences
(https://www.natural-hazards-and-earth-system-sciences.net/, accessed on 3 September 2022)
264661125.042.86%
11Landslides
(https://www.springer.com/journal/10346, accessed on 3 September 2022)
262897034.242.84%
12Arabian Journal of Geosciences
(https://www.springer.com/journal/12517, accessed on 3 September 2022)
243329813.572.63%
13Earthquake Engineering & Structural Dynamics
(https://onlinelibrary.wiley.com/journal/10969845, accessed on 3 September 2022)
233775633.292.53%
14Earthquake Spectra
(https://journals.sagepub.com/home/eqs, accessed on 3 September 2022)
21310,77650.592.31%
15Geophysical Research Letters
(https://agupubs.onlinelibrary.wiley.com/journal/19448007, accessed on 3 September 2022)
213526924.742.31%
Table 2. Top 15 most productive authors for AI research in the field of geohazards (1990–2022).
Table 2. Top 15 most productive authors for AI research in the field of geohazards (1990–2022).
No.NameInstitution,
Country
Total
Papers
Total
Citations
Average
Citations
per Paper
H-IndexRelated
Citations
Impact
1Pradhan, BiswajeetUniversity of Technology Sydney, Australia13613,14696.66941.39
2Dieu Tien BuiUniversity of South-Eastern Norway, Norway70560480.06681.16
3Pourghasemi, Hamid RezaShiraz University, Iran65633297.42661.41
4Chen, WeiXi’an University of Science and Technology, China63410465.14530.94
5Lee, SaroKorea Institute of Geoscience and Mineral Resources, KIGAM, Korea63388061.59640.89
6Hong, HaoyuanUniversität Vienna, Austria52363769.94451.01
7Binh Thai PhamUniversity of Transport Technology, Vietnam45285163.36260.91
8Arabameri, AlirezaTarbiat Modares University, Iran3268421.38290.31
9Bradley, Brendon A.University of Canterbury, New Zealand3258318.22360.26
10Xu, ChongInstitute of Geology, China Earthquake Administration, China31169354.61290.79
11Shahabi, HimanUniversity of Kurdistan, Iran30253884.60521.22
12Xu, QiangChengdu University of Technology, China3065921.97430.32
13Rahmati, OmidAgricultural Research, Education and Extension Organization (AREEO), Vietnam29223176.93361.11
14Prakash, IndraBhaskaracharya Institute for Space Applications and Geoinformatics, India28155155.39370.80
15Blaschke, ThomasUniversität Salzburg, Austria26123547.50500.69
Note: The H-index in the table header means that the author has H papers cited H times.
Table 3. Top 15 most productive institutions in terms of AI research in the field of geohazards (1990–2022).
Table 3. Top 15 most productive institutions in terms of AI research in the field of geohazards (1990–2022).
No.InstitutionTotal
Papers
Total
Citations
Average
Citations per
Paper
Relate
Citations
Impact
1Chinese Academy of Sciences
(https://english.cas.cn, accessed on 3 September 2022)
384908823.670.83
2China University of Geosciences
(https://en.cug.edu.cn, accessed on 3 September 2022)
158330320.910.73
3U.S. Geological Survey
(https://www.usgs.gov, accessed on 3 September 2022)
156892857.232.00
4University of Chinese Academy of Sciences
(https://english.ucas.ac.cn, accessed on 3 September 2022)
115126410.990.38
5Chengdu University of Technology
(http://www.cdut.edu.cn. accessed on 3 September 2022)
101154915.340.54
6Tongji University
(https://en.tongji.edu.cn, accessed on 3 September 2022)
100174917.490.61
7University of California, Berkeley
(https://www.berkeley.edu, accessed on 3 September 2022)
97493850.911.78
8University of Technology Sydney
(https://www.uts.edu.au, accessed on 3 September 2022)
93258327.770.97
9Duy Tan University
(https://duytan.edu.vn, accessed on 3 September 2022)
91327836.021.26
10University of Tehran
(https://ut.ac.ir/en. accessed on 3 September 2022)
88202723.030.81
11Islamic Azad University
(https://iau.ae, accessed on 3 September 2022)
86307435.741.25
12China Earthquake Administration
(https://www.cea.gov.cn, accessed on 3 September 2022)
85243128.601.00
13Sejong University
(https://en.sejong.ac.kr/eng/index.do, accessed on 3 September 2022)
85364042.821.50
14Tarbiat Modares University
(https://en.modares.ac.ir, accessed on 3 September 2022)
84230927.490.96
15Kyoto University
(https://www.kyoto-u.ac.jp/en, accessed on 3 September 2022)
83123814.920.52
Table 4. Top 20 countries or regions in terms of number of publications.
Table 4. Top 20 countries or regions in terms of number of publications.
CountryTotal
Papers
Total
Citations
Average
Citations
per Paper
Closest
Collaborating
Country
Number
of Total
Collaborators
China234941,17917.53United States279
United States199361,65630.94China279
Italy89427,38830.64United States72
Iran62919,30230.69Vietnam85
England57218,37132.12United States111
Japan50511,03621.85China83
India504943018.71Vietnam63
Australia44311,37625.68China100
Germany41011,08627.04United States51
Canada40914,58635.66United States78
France38610,32626.75United States51
South Korea32310,79333.41Australia65
Turkey289931132.22United States34
Spain277758227.37Italy47
Switzerland240762531.77United States42
Netherlands239949639.73United States35
Vietnam182752941.37Iran85
Greece174491328.24Italy26
Malaysia17112,94675.71Iran52
Norway162910356.19Vietnam45
Table 5. Research clusters of AI in geohazard research between 1990 and 2022.
Table 5. Research clusters of AI in geohazard research between 1990 and 2022.
Cluster IDSizeSilhouetteCluster LabelRepresentative Document
#0700.81Ground motionBoore & Atkinson [40]
#1640.91Deep learningBui et al. [41]
#2610.97GISGuzzetti et al. [42]
#3540.91LandslidePradhan [43]
#4470.92ImpactKim et al. [44]
Table 6. Summary of popular AI algorithms in the field of geohazards.
Table 6. Summary of popular AI algorithms in the field of geohazards.
1Naive Bayes
Summary  NB classifiers are simple probabilistic classifiers based on the Bayes theorem and the strong (naive) independence assumption between features [52].
Advantages
  • NB is simple, efficient, and reliable [53];
  • The NB classifier does not require a complex iterative parameter estimation scheme and is easy to construct [54];
  • The NB classifier handles correlated noise and irrelevant attributes and has very good robustness [55].
Limitations
  • NB determines the posterior probability by using the prior probability and data to determine the classification, so there is a certain error rate in the classification decision [56];
  • NB classifiers require strong independence between attributes [57].
2Decision Tree
Summary  The DT is a basic classification and regression method. The DT model has a tree-like structure and represents the process of classifying instances based on features in a classification problem [58].
Advantages
  • DT can solve complex problems [59];
  • It provides expressive representation for learning discrete functions;
  • The time complexity of the decision tree is small [60].
Limitations
  • A large amount of storage is required to store all classifier results [61];
  • It is difficult to understand the reasoning process when multiple classifiers are involved in the decision [58];
  • It is easy to overfit the data [62].
3Support Vector Machine
SummarySVM is a supervised learning machine proposed by Vapnik et al. [63,64]. It is a powerful tool for solving pattern classification problems and regression problems [65] and has been used in various fields [66,67,68].
Advantages
  • SVM can achieve better results with fewer samples [25];
  • SVM is insensitive to dimensionality and outliers [69];
  • SVM is very robust and accurate [70].
Limitations
  • The SVM algorithm has high time complexity and memory training complexity [64];
  • The SVM principle is complex and computationally expensive [71].
4Artificial Neural Networks
SummaryANNs are algorithmic models inspired by biological neural networks. They are massively parallel systems with a large number of interconnected simple processors [72].
Advantages
  • ANNs have significant advantages in data classification and regression [73];
  • ANNs are not constrained by predefined mathematical relationships between variables [74];
  • ANNs have a powerful ability to handle complex nonlinear problems.
Limitations
  • The model is considered a black box, making it difficult to understand the internal mechanism [74];
  • ANNs have a high demand for computing resources;
  • ANNs can easily fall into the local minima and sometimes it is difficult to adjust the structure [75].
5Extreme Learning Machine
SummaryThe ELM is a single-layer feedforward neural network that overcomes the difficulty of parameter initialization. It is one of the most widely used algorithms for predicting time series data [76].
Advantages
  • The theoretical basis of an ELM is relatively simple [76];
  • ELM can achieve global minimum optimization and has powerful generalization [77];
  • ELM computes much faster than other feedforward neural networks.
Limitations
  • An ELM with a fixed number of hidden layer nodes reduces model prediction accuracy [78];
  • When the training dataset is relatively small ELM has more erroneous results [76].
6K-Nearest Neighbor
SummaryKNN is a nonparametric method that is considered one of the top 10 data mining algorithms because of its simplicity, efficiency, and implementation power for classification [79].
Advantages
  • The structure of the KNN algorithm is relatively simple and has good portability [80];
  • KNN algorithms are powerful in classification with good effectiveness and implementation [79].
Limitations
  • Outliers in KNN algorithms can have a large adverse effect on the results [79];
  • The problems of similarity measurement of two data points and K-value selection in the KNN algorithm still need to be solved [81].
7Logistics Regression
SummaryLR analysis is a statistical technique for analyzing the relationship between an independent variable and two dependent variables (dichotomous variables) and is widely used in various fields [82,83].
Advantages
  • A regression relationship is formed between the dependent variable and one or more independent variables [84];
  • LR algorithms are independent of the data distribution [85];
  • Continuous explanatory variables can be used [86].
Limitations
  • LR is more sensitive to multiple linear data [87];
  • A large sample size is required for goodness-of-fit measurements [59].
8Ensemble Methods
SummaryEMs refer to the combination of individual AI models into one model that has higher accuracy and stronger generalization ability than the individual AI models [88,89].
Advantages
  • The results of the integrated approach are more accurate than those predicted by individual models [90,91,92];
  • EMs can avoid overfitting and local optima [93];
  • EMs have better generalization ability than a single AI algorithm [88,94].
Limitations
  • EMs tend to ignore local clustering diversity [95];
  • The accuracy of EMs is determined by the choice of the base model [88];
  • Much maintenance is required.
9Deep Learning
SummaryDL is a branch of machine learning based on ANN [96]. It has excellent performance in processing a large amount of high level data and has a wide range of applications in various fields [97,98,99].
Advantages
  • DL has powerful feature learning and expression capabilities [100];
  • DL is highly efficient in processing high-dimensional data [96];
  • DL has more frameworks to use and is more compatible.
Limitations
  • DL requires much data and computing power and the computational cost is high [101,102];
  • DL requires high hardware requirements due to its high computing power.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Jiang, S.; Ma, J.; Liu, Z.; Guo, H. Scientometric Analysis of Artificial Intelligence (AI) for Geohazard Research. Sensors 2022, 22, 7814. https://doi.org/10.3390/s22207814

AMA Style

Jiang S, Ma J, Liu Z, Guo H. Scientometric Analysis of Artificial Intelligence (AI) for Geohazard Research. Sensors. 2022; 22(20):7814. https://doi.org/10.3390/s22207814

Chicago/Turabian Style

Jiang, Sheng, Junwei Ma, Zhiyang Liu, and Haixiang Guo. 2022. "Scientometric Analysis of Artificial Intelligence (AI) for Geohazard Research" Sensors 22, no. 20: 7814. https://doi.org/10.3390/s22207814

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop