Susceptibility to Seismic Amplification and Earthquake Probability Estimation Using Recurrent Neural Network (RNN) Model in Odisha, India

: The eastern region of India, including the coastal state of Odisha, is a moderately seismic-prone area under seismic zones II and III. However, no major studies have been conducted on earthquake probability (EPA) and hazard assessment (EHA) in Odisha. This paper had two main objectives: (1) to assess the susceptibility of seismic wave ampliﬁcation (SSA) and (2) to estimate EPA in Odisha. In total, 12 indicators were employed to assess the SSA and EPA. Firstly, using the historical earthquake catalog, the peak ground acceleration (PGA) and intensity variation was observed for the Indian subcontinent. We identiﬁed high amplitude and frequency locations for estimated PGA and the periodograms were plotted. Secondly, several indicators such as slope, elevation, curvature, and ampliﬁcation values of rocks were used to generate SSA using predeﬁned weights of layers. Thirdly, 10 indicators were implemented in a developed recurrent neural network (RNN) model to create an earthquake probability map (EPM). According to the results, recent to quaternary unconsolidated sedimentary rocks and alluvial deposits have great potential to amplify earthquake intensity and consequently lead to acute ground motion. High intensity was observed in coastal and central parts of the state. Complicated morphometric structures along with high intensity variation could be other parameters that inﬂuence deposits in the Mahanadi River and its delta with high potential. The RNN model was employed to create a probability map (EPM) for the state. Results show that the Mahanadi basin has dominant structural control on earthquakes that could be found in the western parts of the state. Major faults were pointed towards a direction of WNW–ESE, NE–SW, and NNW–SSE, which may lead to isoseismic patterns. Results also show that the western part is highly probable for events while the eastern coastal part is highly susceptible to seismic ampliﬁcation. The RNN model achieved an accuracy of 0.94, precision (0.94), recall (0.97), F1 score (0.96), critical success index (CSI) (0.92), and a Fowlkes–Mallows index (FM) (0.95).


Introduction
The state of Odisha is famous for its cultural heritage, historical monuments, and hot springs, and is considered as the center for economic activity in central-eastern India. However, Odisha is seismically active due to ancient faults developed in the Mahanadi rift basins. Earthquakes of intra-cratonic events have occurred in western and northeastern Odisha [1]. Low-to moderate-events have been reported by the United States Geological Survey (USGS) and National Center for Seismology (NCS). The main for future events. The specific objectives of this study were as follows: (1) to locate the potential of sediment deposit; (2) to identify the site of high PGA and intensity with the spatial variation; 3) to develop an RNN model to identify the probable locations for future events. The addressed r research questions, such as: (1) is it possible to achieve good accuracy in seismic wave ifications (SSA) and EPA? If so, how efficient are the obtained results?; (2) what are the possible ns behind high probable earthquake areas?; and (3) is the proposed model powerful enough for e earthquake studies?

tudy Area
Odisha is a state in eastern Indian and shares 450 km of coastline with the Bay of Bengal (Figure he state is known for its tribal cultures, numerous hot springs, ancient history, and unique ral and geographic environment. Odisha is situated between the latitudes of 17.780N and 0N, and longitudes of 81.37E and 87.53E. The capital of Odisha is Bhubaneswar, also called ple city". The state covers 155,707 km 2 , which is approximately 4.87% of the total area of India. opulation, according to 2019 census data, is approximately 4.6 million. Odisha's GDP during eriod (2017-18) was ₹ 4.16 lakh crore. According to the pre-existing seismic zonation map of , Odisha comes under zones II and III. Although much of Odisha is in zone II, Mahanadi graben is in zone III and encompasses cities as Talchir, Angul, Bhubaneswar, Cuttack, Sambalpur, Dhenkanal, and the Balasore district. The igarh-Talchir area was struck by two moderate events (Mw 5 and 4.8) during May and June in respectively. Two more events also happened of 5.2 Mw in 1958 and 1962 within Rengali ince. In January 1986, events of Mw 4.4, 4.1, and 4.3 were reported due to the active north Odisha dary fault (NOBF), respectively. The Geological Survey of India established four stations for -earthquake measurement in the Bonaigarh-Talchir area. However, many hypocenters were ved in NOBF, indicating neo-tectonics. 4.16 lakh crore. According to the pre-existing seismic zonation map of India, Odisha comes under zones II and III.
Appl. Sci. 2020, 10,5355 3 of 19 sites for future events. The specific objectives of this study were as follows: (1) to locate the potential areas of sediment deposit; (2) to identify the site of high PGA and intensity with the spatial variation; and (3) to develop an RNN model to identify the probable locations for future events. The addressed major research questions, such as: (1) is it possible to achieve good accuracy in seismic wave amplifications (SSA) and EPA? If so, how efficient are the obtained results?; (2) what are the possible reasons behind high probable earthquake areas?; and (3) is the proposed model powerful enough for future earthquake studies?

Study Area
Odisha is a state in eastern Indian and shares 450 km of coastline with the Bay of Bengal ( Figure  1). The state is known for its tribal cultures, numerous hot springs, ancient history, and unique cultural and geographic environment. Odisha is situated between the latitudes of 17.780N and 22.730N, and longitudes of 81.37E and 87.53E. The capital of Odisha is Bhubaneswar, also called "temple city". The state covers 155,707 km 2 , which is approximately 4.87% of the total area of India. The population, according to 2019 census data, is approximately 4.6 million. Odisha's GDP during this period (2017-18) was ₹4.16 lakh crore. According to the pre-existing seismic zonation map of India, Odisha comes under zones II and III. Although much of Odisha is in zone II, Mahanadi graben is in zone III and encompasses cities such as Talchir, Angul, Bhubaneswar, Cuttack, Sambalpur, Dhenkanal, and the Balasore district. The Bonaigarh-Talchir area was struck by two moderate events (Mw 5 and 4.8) during May and June in 1995, respectively. Two more events also happened of 5.2 Mw in 1958 and 1962 within Rengali province. In January 1986, events of Mw 4.4, 4.1, and 4.3 were reported due to the active north Odisha boundary fault (NOBF), respectively. The Geological Survey of India established four stations for micro-earthquake measurement in the Bonaigarh-Talchir area. However, many hypocenters were observed in NOBF, indicating neo-tectonics. Although much of Odisha is in zone II, Mahanadi graben is in zone III and encompasses cities such as Talchir, Angul, Bhubaneswar, Cuttack, Sambalpur, Dhenkanal, and the Balasore district. The Bonaigarh-Talchir area was struck by two moderate events (Mw 5 and 4.8) during May and June in 1995, respectively. Two more events also happened of 5.2 Mw in 1958 and 1962 within Rengali province. In January 1986, events of Mw 4.4, 4.1, and 4.3 were reported due to the active north Odisha boundary fault (NOBF), respectively. The Geological Survey of India established four stations for micro-earthquake measurement in the Bonaigarh-Talchir area. However, many hypocenters were observed in NOBF, indicating neo-tectonics.

Geological Settings
In this section, we describe a tentative geological setting in Odisha. Dating of rocks revealed that Odisha carries a 3700 million year (M.Y.) geological history [22]. Precambrian rocks cover 75% of Odisha. Accordingly, about 25% consists of Post-cambrian deposits, including unconsolidated sedimentary rocks [23]. The availability of basement rocks includes granites and gneisses, khondalites, charnockites, and ultrabasic-basic intrusions of granitic essence. The Eastern Ghats Mobile Belt (EGMB) is divided from North Odisha Craton (NOC) by the east-west running MSZ. Some observations have been conducted along the litho-contact between the cratons. The fault zone originated between the cratons, giving rise to Gondwana graben, and characterized by coal deposits. As the fault slices intermingle, the stratigraphic relation is difficult to interpret because of sedimentation, granitic intrusion, and lateritic cover concealing the contact.
EGMB is characterized by granulite-facies such as quartzites, charnockites khondalites, garnet-biotite schists, and gneisses. EGMB can be distinguished by the Rengali (northern fringe) and Angul (southern fringe) assemblages, even though they have different lithological characteristics. The North Orissa craton consists of a banded iron formation (BIF) and low-grade supracrustals within several generations of granitic intrusion [24]. EGMB rocks are believed to form before BIF-bearing granites. EGMB 1.0-0.8 Ga (Grenville-age) rocks formed because of several late Proterozoic events, include migmatization, alkali plutonism, and charnockitization. The fault and the litho-contact zones are characterized by nepheline syenite and ultrabasic rocks bearing chromite, respectively. Amphibolite facies can be found in the south of the Singhbhum craton [24]. Meta-sedimentary and meta-igneous rocks can be found in EGMB composed of migmatite [25]. Mahanadi graben is located in E-W orientation, which is sandwiched between EGMB and NOC. However, the basin is considered as a half-graben and comprises many stepwise normal faults with Gondwana deposits. Thus, NOBF trends parallel to EGMB structures in the direction of WNW-ESE and is found in between the NOC and Mahanadi basin. Kerajang fault is within the Talchir basin and situated along the NOBF. Moreover, MSZ and NOBF reactivation directs basin and seismicity tectonic development [26].

Data
Data collection from multiple sources were used in this study. Kolathayar et al. [27] designed spatial attenuation relationships that converted magnitudes scaled into a standardized moment scale. The body-wave magnitude (Mb) scale was used to estimate the magnitude of events in moment magnitude (Mw). In this study, a complete catalog of historical events was collected for probability mapping. The public and private disaster management agencies made earthquake catalogs freely available. The accessible sources were the Advanced National Seismic System (ANSS), United States Geological Survey (USGS), National Center for Seismology (NCS), and the National Earthquake Information Center (NEIC). The digital elevation model (DEM) and earthquake catalog from USGS were used to prepare a geodatabase and generate several layers for SSA and EPA. Geological data including the faults, lithology, and amplification factors for different rock types were used to generate indicators. Training and prediction of the classifications were performed for probability mapping. The raw geological map was collected from the Geological Survey of India (GSI) and applied to produce thematic layers. The seismotectonic atlas published by Geological Survey of India [23] was employed as an authentic referral for locating potential sources.

Thematic Layer Preparation
We conducted an extensive literature review. Choropleth maps were not developed in this study. The chorochromatic and cartogram maps were developed using ArcGIS statistical software (Version 10.4, Environmental Systems Research Institute (Esri), Brisbane, Australia). The probabilistic study is quite challenging, especially with regard to data-dependent models.
In this research, a 30 m spatial resolution DEM, geological data, estimated PGA, and intensity values were used to estimate causal factors. To prepare thematic layers, we implemented a world geodetic system (WGS) 1984 reference system. The derived thematic layers were elevation, slope, curvature, geology, fault density, Euclidean distance from faults, proximity to the epicenters, epicenters density, magnitude density, and depth variation ( Figure 2). The obtained thematic layers of causal factors were presented in a stretched format. Table 1 provides detailed data used in the current study. By uncovering objective themes, emerging patterns, and insight logic, these layers can be understood. Preprocessing, processing, and post-processing were conducted to estimate probability [28]. Layers were converted to multiple values by points and each pixel value was estimated. Some negative and unexpected values were removed from the database. Prediction of classification was conducted and then post-processing was performed after the testing task [6]. The predicted exact values of each pixel were post-processed to remove irrelevant pixel values. Until the end, the point-to-raster conversion was performed in a GIS environment, which generated SSA and EPM. In this research, a 30 m spatial resolution DEM, geological data, estimated PGA, and intensity values were used to estimate causal factors. To prepare thematic layers, we implemented a world geodetic system (WGS) 1984 reference system. The derived thematic layers were elevation, slope, curvature, geology, fault density, Euclidean distance from faults, proximity to the epicenters, epicenters density, magnitude density, and depth variation ( Figure 2). The obtained thematic layers

Importance of Factors
Slope and curvature [6,29]: Landscape processes are affected by slope structure. Several geomorphological processes such as subsurface flow velocity, erosion potential, runoff rates, and overland flow are most affected by slope characteristics. Slopes are associated with crustal faults. Moreover, fault length and size depends on the slope inclination angles. Curvature is associated with slopes having either a positive, negative, or zero curvature. Basins have negative curvatures and their locations for sedimentary deposits are susceptible to seismic amplification.
Elevation [30]: High elevation indicates erosion areas with complicated structures. The probability of earthquakes increases when elevation increases. Therefore, high-elevated areas (i.e., the Himalayan region) are extremely susceptible to earthquakes.
Fault density and proximity to faults: Earthquake probability increases when fault density also increases. Earthquake occurrence is high if faults are active in a complex tectonic condition.
Geology with amplification factor values [31]: Amplification factors of various rock types vary depending on composition. This information is useful to identify locations of high ground shaking.
Epicenter density and proximity to epicenter [32]: Earthquake events are clustered at a location and are high density. However, areas with large events are considered as zones of high probability. Earthquake probability is high when zones are clustered. Earthquake probability decreases when distance from an earthquake epicenter increases.
Magnitude and depth density [32]: Magnitude density indicates the likelihood of occurrence of a particular event. Density provides information for fault capacity and depth that could generate a particular magnitude event. The complex active fault system is more probable in areas that can produce high magnitude events.

Spectral Analysis
Spectral analysis is popularly known as spectrum analysis, which plots the time-dependent spectrum of frequencies [33]. In seismology, the spectral analysis is generally conducted for seismic signal processing, through an algorithm that displays the power of different frequency elements for a time-domain signal. Therefore, this is also called frequency domain analysis in signal processing.
Signals are displayed in the form of a periodogram. A periodogram shows the spectral density with respect to frequency. Therefore, the periodogram for a sequence [x 1 , ...., x n ] can be mathematically presented as: The expression presented in Equation (1) can form a power spectrum of any seismic signal having values of [x 1 , ..., x n ], where t is time and n = 1,2,3 . . . etc. The window (ω) can be selected based on requirements. However, we implemented the Blackman window to improve continuous function. The power spectral density∅ p (ω)/F of a periodogram can be estimated using a nonuniform fast Fourier transform, where the frequency is 2π during no supply of the sampling frequency. However, the sampling frequency (fs) is applied when the frequency range is (0, 1). The main aim of the periodogram spectral estimator is to expose "hidden periodicities" in a time series analysis that defines the periodogram [34,35].

Short-Time Fourier Transform
In short-time Fourier transform (STFT) functions, if time is continuous, then a window function multiplied with the existing function cannot be zero [36]. The one-dimensional Fourier transform function can be converted into a two-dimensional signal when the function is filtered along the time axis. The expression can be mathematically presented as: where the window function is ω(τ) and the Blackman window is centered throughout. However, the signal x(t) needs to be transformed. Frequency is also represented by ω. The Fourier transform of Using the STFT function, the magnitude, signal frequency, and expected location can be plotted. The time index τ is not presented as high resolution and is instead considered as "slow" time. In Equation (2), e −iωt is used to solve a differential equation, where t is time and ω is frequency.

Estimation of Distance from Source-to-Site, PGA and Intensity
Joyner and Boore [37] developed the attenuation equation to calculate PGA. The required data for PGA estimation can be collected from the USGS earthquake catalog. Firstly, there is a requirement to calculate the source to site distance based on the epicentral distance according to the catalog, as presented in Equation (3). Similarly, the intensity values for each earthquake were estimated using Equation (5). Various researchers have proposed several attenuation equations, yet in this study we applied Joyner and Boore's equation [37]. Source to site distance can be estimated based on the following: where D = distance from the source to site and E = epicentral distance. Joyner and Boore [37] presented the PGA estimation equation as: where M = Earthquake magnitude. Many other attenuation laws can be implemented in a probabilistic seismic hazard assessment (PSHA) [38][39][40]. Boore and Joyner [37] developed the general equation of regression, which is popular for its regional and worldwide use. However, the modified Mercalli intensity (MMI) was calculated from PGA values using the proposed equation: where the PGA unit is G (Gal).
Appl. Sci. 2020, 10, 5355 8 of 18 PGA variation and MMI variation were obtained for the 1800 events. Similarly, PGA and intensity variation were estimated for Odisha. We considered 32 locations from south to north that had a specific PGA value and intensity. All locations were chosen with STFT analysis and 32 was the minimum point to conduct the analysis. If we consider all values as peaks of a signal, then the PGA and intensity variation at all locations can be understood through STFT analysis. However, PGA and intensity variation were not involved in probability assessment, as these are hazard factors. In the next stage, seismic amplification mapping susceptibility conducted to identify high ground shaking locations.

Recurrent Neural Network
RNN architecture is made up of input, hidden, and output layers that can process dynamic information. However, data must be in a serialized form and periodically connect to hidden nodes ( Figure 3). In contrast to other neural networks, RNN follows the order of information. All the input and output values are independent in the traditional neural networks. However, in the RNN model, within the hidden layer inter-association among units occurred with different time intervals. Thus, through this process, information can pass away from one to another layer within the network [41].
The information shared across the RNN network precisely incorporates the concept of "loop feedback". Similar to having a loopback in the Elman or Jordan network, this simple RNN model was implemented with a loopback. In this study, the RNN model consists of x t as input, h t as hidden, and y t as the output vector at time t. Therefore, the expressions can be presented as follows: where parameter matrices are W and U, σ(·) is represented as loss function of the training sample sequence, and the bias vector b.

Data Presentation for RNN
Detailed RNN model parameters are presented in Table 2. In this study, all the 10 thematic layers were converted into data points to explore the RNNs predictive potential. The experienced earthquakes were presented for RNNs. Firstly, each probability indicator was selected in a sequence. However, each layer can be regarded as a single-band image. Every layer was stacked in an Excel sheet. Then, using the information to gain the algorithm, factor importance was estimated. According to the importance, the sequence was as follows: magnitude density, epicenter density, fault density, depth variation, proximity to the epicenter, distance from the faults, elevation, curvature, geology, and slope. Dara were organized in a sequence in descending order of significance.

Data Presentation for RNN
Detailed RNN model parameters are presented in Table 2. In this study, all the 10 thematic layers were converted into data points to explore the RNNs predictive potential. The experienced earthquakes were presented for RNNs. Firstly, each probability indicator was selected in a sequence. However, each layer can be regarded as a single-band image. Every layer was stacked in an Excel sheet. Then, using the information to gain the algorithm, factor importance was estimated. According to the importance, the sequence was as follows: magnitude density, epicenter density, fault density, depth variation, proximity to the epicenter, distance from the faults, elevation, curvature, geology, and slope. Dara were organized in a sequence in descending order of significance. Each pixel value must be converted to a sequential sample. Accordingly, in the RNN model, the most valuable factors were sent first and the least valuable factors were sent last. Moreover, because of the recurrent structure, critical information was retained and proceeded to the hidden state, thus predicting earthquake (1) and non-earthquake (0) targets. Figure 4 presents this methodological flowchart.

Model Evaluation Methods
To assess the proposed model, evaluation techniques were used [42]. The diagnostic ability of the model was represented by the ROC. The ROC portrayed the specificity curve against sensitivity. The area under the curve (AUC) depicted the prediction values. The prediction was better when the area under the curve was larger [43]. The prediction ability of RNN model was determined based on ACC, recall, F1-score, precision, threat score (TS), CSI, and FM, and can be expressed as follows [44][45][46]:

Model Evaluation Methods
To assess the proposed model, evaluation techniques were used [42]. The diagnostic ability of the model was represented by the ROC. The ROC portrayed the specificity curve against sensitivity. The area under the curve (AUC) depicted the prediction values. The prediction was better when the area under the curve was larger [43]. The prediction ability of RNN model was determined based on ACC, recall, F1-score, precision, threat score (TS), CSI, and FM, and can be expressed as follows [44][45][46]: F1 score = 2 × TP 2 × TP + FP + FN (10) Hereby, true positive (TP) and true negative (TN) show the correct classification of earthquakes and non-earthquakes. Similarly, false-positive (FP) and false-negative (FN) indicate misclassified earthquakes and non-earthquakes, respectively. For accuracy, the values of every statistical performance must be between 0 and 1. Moreover, the higher the ACC performance values, recall, F1-score, and CSI, the better the model works (Table 3). Moreover, MCC value were estimated for perfect predictions (1), random predictions (0), and inconsistency (1).

Periodogram Plotting of PGA and Intensity
In seismology, periodogram represents seismic signal spectral density. However, in this study, the PGA power spectra (amplitude) and intensity were plotted against the frequency based on the Hamming window (green color, Figure 5). Therefore, having a radius of 1700 km kept Odisha ay the center and 1800 events were implemented to plot the periodogram. No recorded wave signal was used to perform the periodogram study. Amplitudes were observed against frequencies for all PGA and intensity values. By averaging the frequency, a smoothed periodogram was plotted in a black color instead of time. The spectral plot of the smoothed periodogram shows the highest amplitude of PGA at the frequency of 0.01, while the highest amplitude of intensity is at the frequency of 0.04. High amplitude with low frequency provides information regarding controlling earthquakes.

Short-Time Fourier Transform
PGA (0-0.53 g) and intensity (II to VI) variations were observed in Odisha. Periodogram of the spectral density of PGA and intensity variations were plotted for the study area. In Figure 6a, the amplitudes of PGA values are shown against frequency, where the amplitudes vary from 0-10 m, while frequencies vary from 0-0.5 Hz. Similarly, in Figure 6c, the amplitudes of intensities vary from 0-12 m, while frequencies vary from 0-0.5 Hz. However, in both cases, the amplitude level at the frequency of 0.01 is 8.76 Hz, while the amplitude level at the frequency of 0.05 is 7.129 Hz. This indicates that the power of the amplitude increased when frequency decreased. Therefore, the highest PGA and intensity can be expected in Cuttack, Bhubaneswar, Kendrapada, Jagatsngpur, Puri, Khordha, Anugul, Dhenkanal, and some parts of Bhadrakh and Baleswar. STFT simulation was conducted using the Blackman-Harris window, which had a window size of 512 mm (rectangular window = 512). STFT analysis for PGA and intensity in the simulated graph have a stretched magnitude. In Figure 6b, the frequencies of PGAs are plotted against 32 expected locations. The locations were chosen randomly. PGA variation in Odisha was highest between the location of 10-18, where frequency varied from 0-0.1 Hz. STFT simulation for intensity variation was projected for 32 locations in Odisha, as shown in Figure 6d. The highest intensity could be expected at the locations from 7-14 and 17-25, having a frequency range of 0-0.15 m. The highest PGA and intensity were observed in Cuttack, Bhubaneswar, Anugul, Dhenkanal, and some parts of Bhadrakh and Baleswar. This analysis did not count external factors of sub-surface lithology and complicated tectonic structures. Therefore, SSA was conducted to identify the expected locations for high ground shaking.

Periodogram Plotting of PGA and Intensity
In seismology, periodogram represents seismic signal spectral density. However, in this study, the PGA power spectra (amplitude) and intensity were plotted against the frequency based on the Hamming window (green color, Figure 5). Therefore, having a radius of 1700 km kept Odisha ay the center and 1800 events were implemented to plot the periodogram. No recorded wave signal was used to perform the periodogram study. Amplitudes were observed against frequencies for all PGA and intensity values. By averaging the frequency, a smoothed periodogram was plotted in a black color instead of time. The spectral plot of the smoothed periodogram shows the highest amplitude of PGA at the frequency of 0.01, while the highest amplitude of intensity is at the frequency of 0.04. High amplitude with low frequency provides information regarding controlling earthquakes.   PGA (0-0.53 g) and intensity (II to VI) variations were observed in Odisha. Periodogram of the spectral density of PGA and intensity variations were plotted for the study area. In Figure 6a, the amplitudes of PGA values are shown against frequency, where the amplitudes vary from 0-10 m, while frequencies vary from 0-0.5 Hz. Similarly, in Figure 6c, the amplitudes of intensities vary from 0-12 m, while frequencies vary from 0-0.5 Hz. However, in both cases, the amplitude level at the frequency of 0.01 is 8.76 Hz, while the amplitude level at the frequency of 0.05 is 7.129 Hz. This indicates that the power of the amplitude increased when frequency decreased. Therefore, the highest PGA and intensity can be expected in Cuttack, Bhubaneswar, Kendrapada, Jagatsngpur, Puri,

Susceptibility to Seismic Amplification (SSA)
The study area falls within seismic zones II and III. Therefore, SSA results obtained from this study are not surprising, as they follow seismic zones (Figure 7). However, PGA observed in this study ranges between 0-0.53 g, which was implemented after SSA and could reach up to 0.092 g. According to the obtained results, the highest intensity could reach up to VI with the highest PGA value of 0.092-0.18g, which was determined by considering several factors, such as amplification factors of rock types, slope, location elevation, and recent sediments in the basin. The highest velocity of the surface waves could reach up to 8.1-16 (cm/s). This gives an indication of strong ground shaking. The shaking was strong despite the potential damage ranging from light to moderate. The highest ground shaking was observed in coastal Odisha and Mahanadi graben. Moderate shaking was observed in the IB river basin, Talchir basin, and Jajpur-Chandbali basin situated in the northwestern part, while Athgarh and Salepur basins are on the southern part of the Mahanadi rift valley. The seismic ground velocity divides the state into three major parts: 0-6, 6-10 and 10-15 cm/s. The final map was classified into four classes of very high, high, moderate, and low ground shaking. The east coast of the Mahanadi river basin and coastal parts of the Bay of Bengal basin, including Chilika Lake, have very high ground shaking. The cities with very high-to-high susceptible areas are Ganjam, Puri, Jagatsinghpur, Kendrapada, Bhadrakh, and Baleswar, as well as some parts of Khordha, Cuttack, and Jaipur. In the moderate ground shaking category are Dhenkanal, Anugul, Sambalpur, and Jharsuguda. In addition, all other districts fall in the low category.

Susceptibility to Seismic Amplification (SSA)
The study area falls within seismic zones II and III. Therefore, SSA results obtained from this study are not surprising, as they follow seismic zones (Figure 7). However, PGA observed in this study ranges between 0-0.53 g, which was implemented after SSA and could reach up to 0.092 g. According to the obtained results, the highest intensity could reach up to VI with the highest PGA value of 0.092-0.18g, which was determined by considering several factors, such as amplification factors of rock types, slope, location elevation, and recent sediments in the basin. The highest velocity of the surface waves could reach up to 8.1-16 (cm/s). This gives an indication of strong ground shaking. The shaking was strong despite the potential damage ranging from light to moderate. The highest ground shaking was observed in coastal Odisha and Mahanadi graben. Moderate shaking was observed in the IB river basin, Talchir basin, and Jajpur-Chandbali basin situated in the northwestern part, while Athgarh and Salepur basins are on the southern part of the Mahanadi rift valley. The seismic ground velocity divides the state into three major parts: 0-6, 6-10 and 10-15 cm/s. The final map was classified into four classes of very high, high, moderate, and low ground shaking. The east coast of the Mahanadi river basin and coastal parts of the Bay of Bengal basin, including Chilika Lake, have very high ground shaking. The cities with very high-to-high susceptible areas are Ganjam, Puri, Jagatsinghpur, Kendrapada, Bhadrakh, and Baleswar, as well as some parts of Khordha, Cuttack, and Jaipur. In the moderate ground shaking category are Dhenkanal, Anugul, Sambalpur, and Jharsuguda. In addition, all other districts fall in the low category.

Earthquake Probability Assessment
A probability assessment was conducted using the RNN model to predicts earthquake (1) and non-earthquake (0) classifications. The model was unable to predict 50 earthquake data points out of 1800 events. Due to heterogeneity and irrelevant values obtained for some pixels in the thematic layers, the model predicted 1750 events out of 1800 events. In total, 1850 points were predicted as earthquakes, while 850 were non-earthquakes. The RNN model predicted 48900 km 2 as the highest earthquake probable location in Odisha. Training (70%) and testing (30%) were conducted for 2700

Earthquake Probability Assessment
A probability assessment was conducted using the RNN model to predicts earthquake (1) and non-earthquake (0) classifications. The model was unable to predict 50 earthquake data points out of 1800 events. Due to heterogeneity and irrelevant values obtained for some pixels in the thematic layers, the model predicted 1750 events out of 1800 events. In total, 1850 points were predicted as earthquakes, while 850 were non-earthquakes. The RNN model predicted 48900 km 2 as the highest earthquake probable location in Odisha. Training (70%) and testing (30%) were conducted for 2700 points, where 1800 were earthquakes and 900 were non-earthquakes. The RNN model achieved 94% accuracy. The accuracy was estimated as 0.94, whereas as precision was 0.94, recall was 0.97, F1 score was 0.96, TS/CSI was 0.921053, and FM was 0.95. The probability map is presented in Figure 8. Accuracy is plotted through ROC (Figure 9a) and every true negative event is plotted in Figure 9b. All unpredicted earthquakes showed no discernible pattern.
The probability of future earthquake occurrence is higher in the northwestern parts of the state, as shown in Figure 8. However, the northwestern part of Odisha has already experienced moderate magnitude events of 5.3 Mw due to the NOBF active fault system. Four sections of the very high probable areas were shown within the map, showing that NOBF has the capability to strike high magnitude events in the future. NOBF extends from the east to the west, having a strike length of 250 km, whereas width ranges from 2-5 km. The NOBF is not uniform and is rather characterized by interdigitation of rocks, which makes northern Odisha irregular with complex geo-structures. Therefore, high probability in the northwestern and southern Odisha can be associated with NOBF. Medium probability was characterized as having high probability, whereas low probability could be expected in east and central Odisha. The districts under very high (0.85-1) to high (0.65-0.85) probability includes Sundargarh, Jharsuguda, Sambalpur, Bargarh, Subarnapur, Balangir, Nuapada, Kalahandi, and Nabarangpur. All other districts fall in the moderate to very low categories. Approximately 33.46% and 52,014 km 2 of Odisha fall in the very high to high category (Table 4).
Appl. Sci. 2020, 10, 5355 14 of 19 points, where 1800 were earthquakes and 900 were non-earthquakes. The RNN model achieved 94% accuracy. The accuracy was estimated as 0.94, whereas as precision was 0.94, recall was 0.97, F1 score was 0.96, TS/CSI was 0.921053, and FM was 0.95. The probability map is presented in Figure 8.
Accuracy is plotted through ROC ( Figure 9a) and every true negative event is plotted in Figure 9b.
All unpredicted earthquakes showed no discernible pattern. The probability of future earthquake occurrence is higher in the northwestern parts of the state, as shown in Figure 8. However, the northwestern part of Odisha has already experienced moderate magnitude events of 5.3 Mw due to the NOBF active fault system. Four sections of the very high probable areas were shown within the map, showing that NOBF has the capability to strike high magnitude events in the future. NOBF extends from the east to the west, having a strike length of 250 km, whereas width ranges from 2-5 km. The NOBF is not uniform and is rather characterized by interdigitation of rocks, which makes northern Odisha irregular with complex geo-structures. Therefore, high probability in the northwestern and southern Odisha can be associated with NOBF. Medium probability was characterized as having high probability, whereas low probability could be expected in east and central Odisha. The districts under very high (0.85-1) to high (0.65-0.85) probability includes Sundargarh, Jharsuguda, Sambalpur, Bargarh, Subarnapur, Balangir, Nuapada, Kalahandi, and Nabarangpur. All other districts fall in the moderate to very low categories. Approximately 33.46% and 52,014 km 2 of Odisha fall in the very high to high category (Table 4). Table 4. Area under earthquake probability in Odisha.

Discussion
SSA indicates that eastern coastal districts in Odisha are highly susceptible to amplification. This is because of the widespread deposits of unconsolidated sediments with nonuniform thickness in association with low groundwater levels. The SSA map obtained from this study considered slope (35%), curvature (32%), elevation (23%), and lithology with amplification factors (10%) as the conditioning factors with predefined weights [47]. According to the periodogram and STFT simulation, the PGA ranged between 0-0.9, whereas the PGV ranged between 0-15 (cm/s). The districts with a velocity of 0-3 (cm/s) are Rayagada, Kandhamal, Ganjam, Gajapati, and Keonjhar. This demonstrates that lower PGA values could be attributed to less complicated structural features, with strong basement and rock types. The areas with PGVs between 6 to 10 (cm/s) are the major basins of Odisha, including the Mahanadi basin, where there are thick unconsolidated layers of deposits. However, the highest PGV was observed in the eastern coastal districts, ranging from 10-15 (cm/s). This is due to the Mahanadi river delta and Brahmani river delta, as well as the Bay of Bengal basin. The northern and central parts of Odisha accommodates the Talchir basin, Rengali Province, and IB basin at the north of EGMB, all of which fall under moderate PGA characterized by larger structural faults. Several studies on seismic microzonation [47][48][49] have explained ground shaking. Yet few researchers have employed litho-dynamic, stratigraphic, and topographic influencing factors when assessing ground-shaking amplification [50]. Moreover, Aucelli et al. [51] used geological and geomorphological factors to estimate the susceptibility index for seismic amplification in Italy. Therefore, Odisha is seismically active because of periodical NOBF reactivation. Therefore, the next step probability assessment was conducted to show the highest probability in northwestern Odisha.
The probability map was obtained from the study denotes a very high to high probability in the northwestern parts of the state and decreases towards the central and coastal areas. There is a transition zone of moderate to low probability in between that can be perceived. However, all earthquakes in Odisha are intraplate shallow-focus earthquakes. Mahanadi graben is situated as an interior active region of Odisha that is relatively stable compared to the Himalayan region and Java Sumatra subduction zone. Clustering of earthquakes gives a pattern with foreshocks, mainshocks, and aftershocks, which were not studied in this work, as not many events like this experienced in Odisha. Furthermore, earthquakes do not always follow patterns, which makes it less common to estimate probability. Mostly, a significant number of earthquakes originated at shallow depths of 70 km within the state, while deep-focus earthquakes were experienced within the Bay of Bengal. Therefore, the leading cause of probability in Odisha could be the active NOBF system. As we observed, the seismic amplification was higher in the Mahanadi graben and coastal parts of Odisha,

Discussion
SSA indicates that eastern coastal districts in Odisha are highly susceptible to amplification. This is because of the widespread deposits of unconsolidated sediments with nonuniform thickness in association with low groundwater levels. The SSA map obtained from this study considered slope (35%), curvature (32%), elevation (23%), and lithology with amplification factors (10%) as the conditioning factors with predefined weights [47]. According to the periodogram and STFT simulation, the PGA ranged between 0-0.9, whereas the PGV ranged between 0-15 (cm/s). The districts with a velocity of 0-3 (cm/s) are Rayagada, Kandhamal, Ganjam, Gajapati, and Keonjhar. This demonstrates that lower PGA values could be attributed to less complicated structural features, with strong basement and rock types. The areas with PGVs between 6 to 10 (cm/s) are the major basins of Odisha, including the Mahanadi basin, where there are thick unconsolidated layers of deposits. However, the highest PGV was observed in the eastern coastal districts, ranging from 10-15 (cm/s). This is due to the Mahanadi river delta and Brahmani river delta, as well as the Bay of Bengal basin. The northern and central parts of Odisha accommodates the Talchir basin, Rengali Province, and IB basin at the north of EGMB, all of which fall under moderate PGA characterized by larger structural faults. Several studies on seismic microzonation [47][48][49] have explained ground shaking. Yet few researchers have employed litho-dynamic, stratigraphic, and topographic influencing factors when assessing ground-shaking amplification [50]. Moreover, Aucelli et al. [51] used geological and geomorphological factors to estimate the susceptibility index for seismic amplification in Italy. Therefore, Odisha is seismically active because of periodical NOBF reactivation. Therefore, the next step probability assessment was conducted to show the highest probability in northwestern Odisha.
The probability map was obtained from the study denotes a very high to high probability in the northwestern parts of the state and decreases towards the central and coastal areas. There is a transition zone of moderate to low probability in between that can be perceived. However, all earthquakes in Odisha are intraplate shallow-focus earthquakes. Mahanadi graben is situated as an interior active region of Odisha that is relatively stable compared to the Himalayan region and Java Sumatra subduction zone. Clustering of earthquakes gives a pattern with foreshocks, mainshocks, and aftershocks, which were not studied in this work, as not many events like this experienced in Odisha. Furthermore, earthquakes do not always follow patterns, which makes it less common to estimate probability. Mostly, a significant number of earthquakes originated at shallow depths of 70 km within the state, while deep-focus earthquakes were experienced within the Bay of Bengal. Therefore, the leading cause of probability in Odisha could be the active NOBF system. As we observed, the seismic amplification was higher in the Mahanadi graben and coastal parts of Odisha, experiencing a few small-magnitude events. Therefore, the coastal parts are not highly probable for future events because of less complicated structural features, low height, and more sediments than stressed rocks. Moreover, some specific areas are prone to swarms, such as geothermal areas, including hot spring locations. However, in Odisha, hot springs are associated with the Mahanadi Shear Zone, which is not presently active. Sometimes, a particular location has several small earthquakes before a larger event. Therefore, we believe that the events in western Odisha could be the foreshock. However, these events do not look like foreshocks and cannot be considered as such until overtaken by mainshocks. In addition, earthquake size is dependent on fault type, length, depth, and fault slip. Recently, scientists have tried to predict earthquakes based on clusters, as they could be a warning sign for a larger earthquake. Rout and Nanda [52] conducted a PSHA at Bhubaneswar and estimated spectral acceleration as 0.05 g if a high magnitude event hit the city. Mandal et al. [53] conducted a PSHA for central India and showed that the western Odisha is seismically active. According to the United Nations Development Program [54], faulted Mahanadi basin could be a potential source for future events. Odisha has not had a solid probability assessment. However, hybrid intelligent machine learning-based techniques have been developed for the potential zone and risk assessment [55,56]. Jena and Pradhan [57] described active tectonics and fault characteristics before the probability and risk assessment.

Conclusions
The present study proposed an RNN model for earthquake probability mapping in Odisha, India. The use of artificial intelligence and GIS in earthquake studies adds a new, beneficial dimension. Therefore, this work developed a SSA and EPM map based on the RNN classification technique, using direct attributes of pertinent criteria. This study demonstrated that the spatial variation of seismic amplification was very high in coastal parts of Odisha and high in the Mahanadi basin.
Similarly, the probability map shows that very high-to-high probable locations for future events could be found in the western part of the state. The probability results denoted that structural attributes and complex geological/geomorphological features in western NOBF could be the main reason for high probability. However, a noticeable intersection of faults in the shear zone could have control on isoseismic elongation in the west. The study also shows the Mahanadi graben spreading and creating a huge basin for recent deposits. Hence, the active graben can damage the cities when originate from NOBF and the Bay of Bengal faults.
The RNN model was applied to transfer areas with similar or more complex tectonics and had several limitations. The proposed model and data analysis can cover a large study area but the process is time-consuming. This study can be considered as a reconnaissance study prior to performing a detailed study for mitigation and micro-zonation purposes. The RNN model deals with technical complexities and data uncertainties during the priority analyses. These processes can be modified and improved using integrated and ensemble techniques. The RNN model strictly depends on sequence, quality, and input data preparation.
Future works can be done in association with geophysical, seismological, geo-structural, geological, and earthquake engineering data, including novel integrated, hybrid, and ensemble techniques. Currently, remote sensing techniques and artificial intelligence techniques are popular and thus there is potential for a future project that uses high-resolution LiDAR techniques and images. In conclusion, the proposed model enables planners, managers, agencies, and policymakers to make decisions for land use planning, as well as scheming appropriate proposals for seismically active areas.