A Semi-Parametric KDE-GPD Model for Earthquake Magnitude Analysis

Zhang, Yanfang; Zhao, Yibin; Wang, Fuchang

doi:10.3390/math13122003

Open AccessArticle

A Semi-Parametric KDE-GPD Model for Earthquake Magnitude Analysis

by

Yanfang Zhang

^*,

Yibin Zhao

and

Fuchang Wang

College of Science, Institute of Disaster Prevention, Langfang 065201, China

^*

Author to whom correspondence should be addressed.

Mathematics 2025, 13(12), 2003; https://doi.org/10.3390/math13122003

Submission received: 3 May 2025 / Revised: 15 June 2025 / Accepted: 16 June 2025 / Published: 17 June 2025

Download

Browse Figures

Versions Notes

Abstract

A semi-parametric mixture model, combining kernel density estimation (KDE) and the generalized Pareto distribution (GPD), is applied to analyze the statistical characteristics of earthquake magnitudes. Data below a threshold are fitted using KDE, while data above the threshold are modeled using the GPD. Both the kernel bandwidth and the threshold are directly estimable as parameters. An estimation method based on the empirical distribution function (EDF) and maximum likelihood estimation (MLE) is used to estimate the parameters of the mixture model. The application of this model to earthquake magnitude analysis offers insights for seismic hazard assessment.

Keywords:

KDE; mixture model; semi-parametric model; GPD; earthquake magnitude; statistical characteristics

MSC:

62G32

1. Introduction

The statistical patterns of earthquake magnitudes constitute a critical aspect in understanding seismic activity trends. Such research helps elucidate crustal movement dynamics while enhancing the accuracy and reliability of earthquake forecasting. The study of magnitude statistics has a long developmental history.

Gutenberg and Richter [1] first systematically analyzed observed earthquake magnitudes, noting that magnitude-frequency relationships generally follow specific patterns within regions exceeding a certain size. Given the destructive potential of major earthquakes, early research primarily focused on large seismic events, employing the GPD from extreme value theory to characterize the tail distributions of magnitude data [2,3,4]. However, the GPD only utilizes data exceeding a threshold while neglecting sub-threshold observations, whose distributional characteristics influence threshold selection and parameter estimation.

Subsequent improvements introduced parametric mixture models [5,6,7]. Zhang et al. [8] applied parametric mixture models to magnitude data analysis using EDF-based estimation, obtaining complete statistical distributions of seismic magnitudes. Parametric mixture models are widely adopted due to their interpretability, computational efficiency, and well-established theoretical properties. Nevertheless, like all parametric statistical inferences, they rely on strong model assumptions that may not hold in practice, potentially leading to erroneous conclusions.

Seminal studies pioneered the integration of parametric and nonparametric distributions to develop semi-parametric mixture models. Heckman and Singer [9] conducted comparative analyses of MLE across parametric mixture models and their semi-parametric counterparts. Wang and Chee [10] present a general framework for univariate non-parametric density estimation, based on mixture models with improved accuracy. Fienberg et al. [11] established a semi-parametric mixture logit regression framework capable of capturing “risk perception” through dichotomous responses, while Follmann and Lambert [12] subsequently employed semi-parametric mixture logistic regression to address overdispersion relative to binomial models. Davies [13] demonstrated the application in economic sectors, catalyzing expanded research interest in nonparametric and semi-parametric mixture approaches.

Substantial methodological advancements emerged in subsequent decades. Hall and Zhou [14] and Hall et al. [15] extended the framework to multivariate settings, while Bordes et al. [16] and Hunter et al. [17] developed novel parameter estimation techniques. Song et al. [18] implemented semi-parametric mixture models for sequential clustering, and Bordes et al. [19] investigated their regression formulations. These models have gained widespread adoption across economics, finance, biology, and medicine. Parallel progress occurred in estimation methodologies [20,21,22,23]. Young and Hunter [24] and Huang and Yao [25] examined proportionally varying semi-parametric mixture regression models, with the latter establishing convergence properties of the EM algorithm through smoothed likelihood theory. MacDonald et al. [26] constructed flexible mixture extreme-value models incorporating Bayesian estimation, while Pommeret and Vandekerkhove [27] systematically articulated the theoretical advantages of semi-parametric approaches. Contemporary applications include high-dimensional clustering [28] and penalized semi-parametric density estimation [29]. Martins-Ferreira et al. [30] introduced a hybrid framework integrating transformer neural networks with GPD to model compound flood exceedances. Chen and Zhang [31] proposed an adaptive Bayesian framework for threshold selection in extreme value mixture models. Vinayan et al. [32] employed Generalized Extreme Value (GEV) and GPD within an extreme value analysis framework to quantify variability in design wave heights. Liu and Zhou [33] developed an accelerated algorithm imposing constraints via penalized MLE.

Building upon this foundation, this study applies a semi-parametric composite model integrating KDE and GPD to analyze earthquake magnitudes. The model features direct estimation of kernel bandwidth and threshold as parameters, with estimation performed using MLE and an EDF-based approach. The application of magnitude analysis provides insights for seismic hazard assessment. The paper is structured as follows: Section 2 establishes the semi-parametric mixture model, Section 3 details parameter estimation, and Section 4 presents the simulation studies. Section 5 applies the model to evaluate seismic hazards in the eastern Bayan Har block, and Section 6 discusses conclusions and future directions.

2. Semi-Parametric Mixture Model

A general mixture model can be expressed as:

f (x) = π f_{1} (x) + (1 - π) f_{2} (x)

(1)

Since extreme events are typically characterized by analyzing the tail behavior of data exceeding a high threshold, we model the upper tail using the GPD [34,35]. The sub-threshold regime, whose distributional form is not parametrically specified, is modeled through a non-parametric probability density function (PDF). Common methods for non-parametric PDF estimation include KDE, the maximum entropy method [36], Edgeworth series expansion [37], and orthogonal polynomial expansion. Comparative studies [38] demonstrate KDE’s superior versatility and reduced estimation errors [39,40]. Therefore, we adopt KDE as our fitting approach. In Equation (1), we set

π = 1 - ϕ_{u}

:

f_{1} (x) = \frac{h (x | λ, X)}{H (u | λ, X)} I_{(- \infty, u)} (x), f_{2} (x) = g (x | u, σ_{u}, ξ) I_{(u, \infty)} (x) .

Let

ϕ_{u} = P (X > u)

, where

ϕ_{u}

represents the proportion of data exceeding the threshold. This parameter quantifies the relative weights between KDE and GPD components in the mixture model. We can get

P (X > x) = ϕ_{u} [1 - P (X < x | X > u)]

If

X_{1}, X_{2}, \dots, X_{n}

is a sequence of independent and identically distributed (i.i.d.) random variables, the cumulative distribution function (CDF) of the semi-parametric mixture models [41] can be expressed as:

F (x | λ, u, σ_{u}, ξ, X) = \{\begin{cases} (1 - ϕ_{u}) \frac{H (x | λ, X)}{H (u | λ, X)} & x \leq u \\ (1 - ϕ_{u}) + ϕ_{u} G (x | u, σ_{u}, ξ) & x > u \end{cases}

(2)

where

F (x | λ, u, σ_{u}, ξ, X)

satisfies the continuity constraint

\lim_{x \to u^{-}} (1 - ϕ_{u}) \frac{H (x | λ, X)}{H (u | λ, X)} = \lim_{x \to u^{+}} (1 - ϕ_{u}) + ϕ_{u} G (x | u, σ_{u}, ξ) = 1 - ϕ_{u}

The corresponding PDF is given by:

f (x | λ, u, σ_{u}, ξ, X) = \{\begin{cases} (1 - ϕ_{u}) \frac{h (x | λ, X)}{H (u | λ, X)} & x \leq u \\ ϕ_{u} g (x | u, σ_{u}, ξ) & x > u \end{cases}

(3)

where

G (x | u, σ_{u}, ξ) = P (X < x | X > u) = \{\begin{cases} 1 - {[1 + ξ (\frac{x - u}{σ_{u}})]}^{- \frac{1}{ξ}} & ξ \neq 0 \\ 1 - \exp [- (\frac{x - u}{σ_{u}})] & ξ = 0 \end{cases}

(4)

g (x | u, σ_{u}, ξ) = \{\begin{cases} \frac{1}{σ_{u}} {[1 + ξ (\frac{x - u}{σ_{u}})]}^{- \frac{1}{ξ} - 1} & ξ \neq 0 \\ \frac{1}{σ_{u}} \exp [- (\frac{x - u}{σ_{u}})] & ξ = 0 \end{cases}

(5)

represent the PDF and CDF of the GPD, respectively.

In the semi-parametric mixture PDF (3), the univariate KDE for

h (x | λ, X)

is defined as follows:

\hat{h} (x | λ, X) = \frac{1}{n λ} \sum_{i = 1}^{n} K (\frac{x - x_{i}}{λ})

where

K (x)

is the kernel function, which typically satisfies the following conditions:

K (x) \geq 0, \int_{- \infty}^{\infty} K (x) d x = 1 and \{\begin{cases} \int x K (x) d x = 0 \\ \int x^{2} K (x) d x = c > 0 \end{cases}

where

c

is a constant. Many functions satisfy these conditions, including Gaussian and polynomial kernels. Research demonstrates, however, that fitting errors vary only slightly across different kernel selections, indicating little influence of kernel choice on non-parametric KDE accuracy. We therefore adopt the Gaussian kernel for probability density estimation in subsequent analysis, i.e.,

K (x) = \frac{1}{\sqrt{2 π}} \exp (- \frac{x^{2}}{2})

. Then,

h (x | λ, X)

can be expressed as:

\hat{h} (x | λ, X) = \frac{1}{\sqrt{2 π} n λ} \sum_{i = 1}^{n} \exp {- \frac{1}{2} {(\frac{x - x_{i}}{λ})}^{2}}

(6)

3. Parameter Estimation of the Semi-Parametric Mixture Model

3.1. MLE of Parameters

Due to the complexity of the likelihood function, analytical solutions are infeasible. Therefore, we compute MLE numerically. The semi-parametric mixture model’s likelihood function is:

L (θ | X) = L_{K DE} (λ, u | X) L_{G P D} (u, μ, σ, ξ | X)

Habbema et al., 1974 and Duin (1976) [42,43] pioneered likelihood-based bandwidth estimation for KDE. The KDE likelihood function is given by:

L_{K D E} (λ, X) = \prod_{x_{i} \leq u} \frac{(1 - ϕ_{u})}{H (u | λ, X)} \frac{1}{n λ} \sum_{j = 1}^{n} K (\frac{x_{i} - x_{j}}{λ})

However, when

x_{i} = x_{j}

, (so

x_{i} - x_{j} = 0

), and

λ \to 0

, this result in a degenerate likelihood function [43]. To address this issue, the likelihood function can be replaced with a cross-validation likelihood function:

L_{K DE} (λ, u | X) = \prod_{x_{i} \leq u} \frac{(1 - ϕ_{u})}{H (u | λ, X)} \frac{1}{(n - 1) λ} \sum_{\begin{array}{l} j = 1 \\ j \neq i \end{array}}^{n} K (\frac{x_{i} - x_{j}}{λ})

(7)

Equation (7) can be interpreted as minimizing the Kullback–Leibler divergence (K-L distance) [44,45]. Let

A = {x_{i} \leq u}

and

B = {x_{i} > u}

, where the samples belong in the PDF. The kernel density likelihood function can be expressed as a function of the bandwidth.

L_{K D E} (λ, u | X) = {\{\frac{(1 - ϕ_{u})}{\frac{1}{n} \sum_{i = 1}^{n} Φ (\frac{u - x_{i}}{λ})}\}}^{| A |} \prod_{A} \frac{1}{(n - 1) λ} \sum_{\begin{array}{l} j = 1 \\ j \neq i \end{array}}^{n} K (\frac{x_{i} - x_{j}}{λ})

(8)

The tail portion of the sample follows a GPD, and the corresponding likelihood function is given by:

L_{G P D} (u, σ, ξ | X) = \{\begin{cases} \prod_{{i : x_{i} > u}} \frac{1}{σ} {(1 + ξ \frac{x_{i} - u}{σ})}^{- \frac{1}{ξ} - 1}, ξ \neq 0 \\ \prod_{{i : x_{i} > u}} \frac{1}{σ} \exp (\frac{x_{i} - μ}{σ}), ξ = 0 \end{cases}

(9)

The likelihood function for the semi-parametric model is given by:

L (θ | X) = L_{K DE} (λ, u | X) L_{G P D} (u, μ, σ, ξ | X) = \{\begin{cases} {(\frac{(1 - ϕ_{u})}{\frac{1}{n} \sum_{i = 1}^{n} ϕ (\frac{u - x_{i}}{λ})})}^{| A |} \prod_{A} \frac{1}{(n - 1)} \sum_{\begin{array}{l} j = 1 \\ j \neq i \end{array}}^{n} K_{λ} (x_{i} - x_{j}) \prod_{B} \frac{1}{σ} {(1 + ξ \frac{x_{i} - μ}{σ})}^{- \frac{1}{ξ} - 1}, ξ \neq 0 \\ {(\frac{(1 - ϕ_{u})}{\frac{1}{n} \sum_{i = 1}^{n} ϕ (\frac{u - x_{i}}{λ})})}^{| A |} \prod_{A} \frac{1}{(n - 1)} \sum_{\begin{array}{l} j = 1 \\ j \neq i \end{array}}^{n} K_{λ} (x_{i} - x_{j}) \prod_{B} \frac{1}{σ} \exp (\frac{x_{i} - μ}{σ}), ξ = 0 \end{cases}

(10)

The corresponding log-likelihood function is expressed as:

\ln L (θ | X) = \ln L_{K DE} (λ, u | X) + \ln L_{G P D} (u, μ, σ, ξ | X) = \{\begin{cases} | A | \ln (\frac{1 - ϕ (u)}{\frac{1}{n} \sum_{i = 1}^{n} Φ (\frac{u - x_{i}}{λ})}) + | A | \ln [\frac{1}{(n - 1) λ}] + \sum_{A} [\ln \sum_{\begin{array}{l} j = 1 \\ j \neq i \end{array}}^{n} φ (\frac{x - x_{j}}{λ})] + \sum_{B} [- \ln σ + (\frac{- 1}{ξ} - 1) \ln (1 + ξ \frac{x_{i} - μ}{σ})], ξ \neq 0 \\ A | \ln (\frac{1 - ϕ (u)}{\frac{1}{n} \sum_{i = 1}^{n} Φ (\frac{u - x_{i}}{λ})}) + | A | \ln [\frac{1}{(n - 1) λ}] + \sum_{A} [\ln \sum_{\begin{array}{l} j = 1 \\ j \neq i \end{array}}^{n} φ (\frac{x - x_{j}}{λ})] + \sum_{B} (- \ln σ + \frac{x_{i} - μ}{σ}), ξ \neq 0 \end{cases}

(11)

3.2. Parameter Estimation Method Based on the EDF

Assume

F_{n} (x)

denotes the sample EDF. We define a loss function

L o s s = | F (x) - F_{n} (x) |,

whose minimizer yields EDF-based parameter estimates. Since analytical minimization is infeasible, we use the Newton–Raphson method for numerical optimization.

4. Simulation of Parameter Estimation for Semi-Parametric Mixture Models

To evaluate the semi-parametric model’s performance, we generated random samples from three parametric mixture distributions:

Normal (shape = 1, scale = 4) + GPD.
Weibull (shape = 1.5, scale = 2) + GPD.
Gamma (shape = 1, scale = 2) + GPD.

All cases shared common GPD parameters (shape parameter: −0.2, scale parameter: 1.5). The semi-parametric mixture model was fitted to each sample, with parameter estimates compared against true values.

Table 1 presents the parameter estimates, standard deviations, and 95% confidence intervals for the normal-GPD case.

Figure 1 compares the EDF-estimated CDF with the EDF of the simulated data. Figure 2 displays the simulated data histogram alongside the semi-parametric mixture model’s fitted PDF using EDF-estimated parameters.

Figure 3 compares the MLE-estimated CDF with the EDF of the simulated data.

Figure 4 displays the simulated data histogram alongside the semi-parametric mixture model’s fitted PDF using MLE-estimated parameters. As shown in Table 1 and Figure 1, Figure 2, Figure 3 and Figure 4, both parameter estimation methods yield acceptable results for the normal-GPD mixture data.

Table 2 presents the parameter estimates, standard deviations, and 95% confidence intervals for the Weibull-GPD case.

Figure 5 compares the EDF-estimated CDF with the EDF of the simulated data. Figure 6 displays the simulated data histogram alongside the semi-parametric mixture model’s fitted PDF using EDF-estimated parameters.

Figure 7 compares the MLE-estimated CDF with the EDF of the simulated data. Figure 8 shows the simulated data histogram alongside the semi-parametric mixture model’s fitted PDF with MLE-estimated parameters.

Table 2 and Figure 5, Figure 6, Figure 7 and Figure 8 demonstrate that both parameter estimation methods yield acceptable results for the Weibull-GPD composite data.

Table 3 presents the parameter estimates, standard deviations, and 95% confidence intervals for the Gamma-GPD case.

Figure 9 compares the EDF-estimated CDF with the EDF of the simulated data. Figure 10 displays the simulated data histogram alongside the semi-parametric mixture model’s fitted PDF with EDF-estimated parameters.

Figure 11 compares the MLE-estimated CDF estimated with the EDF of the simulated data. Figure 12 presents the simulated data histogram alongside the semi-parametric mixture model’s fitted PDF with the MLE-estimated parameters.

Table 3 and Figure 9, Figure 10, Figure 11 and Figure 12 show that both parameter estimation methods yield acceptable results for the Gamma-GPD mixture data.

Simulation results across all scenarios confirm the semi-parametric mixture model’s strong performance. We next apply it to real seismic data analysis.

5. Statistical Characteristic Analysis of Seismic Magnitude Data in the Eastern Bayan Har Block

Earthquake magnitude data were obtained from the National Seismic Science Data Sharing Center (https://data.earthquake.cn/ (accessed on 29 April 2019)). After removing aftershocks using the C-S method [46], 19,221 records before December 2019 were retained as research samples. Figure 13 displays the annual magnitude histogram.

5.1. Data Statistics

Table 4 summarizes key statistics: a kurtosis of 6.7771 indicates leptokurtic distribution, while a positive skewness (1.5523) confirms right-skewed data with a heavy upper tail. This occurs when infrequent large values extend the distribution’s right tail.

5.2. Nonparametric KDE of the Data

Using a normal kernel function for KDE, we computed the optimal bandwidth via Equation (6). Figure 14 superimposes the KDE curve on the data histogram, demonstrating close CDF alignment. Note that while KDE provides an excellent visual correspondence, its lack of explicit PDF expression may constrain advanced applications.

5.3. Data Fitting Using the Semi-Parametric Mixture Model

This section applies the semi-parametric mixture model established in Section 3 to seismic magnitude data. We obtained the MLE of parameters by maximizing the log-likelihood function using MATLAB’s (R2018b) optimization algorithms. This method simultaneously estimates the bandwidth as a parameter, eliminating errors from manual selection. Table 5 presents parameter estimates for both the non-parametric KDE and the semi-parametric (KDE-GPD) mixture model.

Figure 15 shows the semi-parametric mixture model’s PDF. The blue segment depicts the KDE component for sub-threshold data, while the red segment represents the GPD tail model for supra-threshold values.

Figure 16 compares the fitted EDF with the actual data EDF, showing consistent trends.

We performed a Kolmogorov–Smirnov (K-S) goodness-of-fit test for the tail data’s conformance to GPD, obtaining a p-value of 0.4665. This fails to reject the null hypothesis, supporting the GPD assumption’s statistical plausibility. Additionally, Figure 17 further validates model fit through a Q-Q plot showing alignment between empirical quantiles and theoretical GPD quantiles along the diagonal.

Using the formula

{\hat{x}}^{*} = \hat{u} - \frac{\hat{σ}}{\hat{ξ}}

, we calculate a maximum magnitude of Ms 8.73. This estimate aligns with historical earthquakes in the region: the 2001 Kunlun Mountain Pass (Ms 8.0), 2008 Wenchuan (Ms 8.0), 2010 Yushu (Ms 7.1), 2013 Lushan (Ms 7.0), 2021 Qinghai Maduo (Ms 7.4), and 2022 Luding (Ms 6.8).

For tail data characterization, quantile estimation provides critical seismic hazard indicators. We compute return levels for specified return periods using

T = 1 / (1 - p)

, which is also a key indicator of seismic hazard. The quantile is calculated by the formula

x_{p} = u + \frac{σ}{ξ} (p^{- ξ} - 1)

. The estimated return level for return period

T = 1 / (1 - p)

is:

{\hat{x}}_{1 - \frac{1}{365 T}} = \hat{u} + \frac{\hat{σ}}{\hat{ξ}} ({(\frac{n}{N_{u}} (1 - p))}^{- \hat{ξ}} - 1) = \hat{u} + \frac{\hat{σ}}{\hat{ξ}} ({(\frac{365 T \cdot N_{u}}{n})}^{\hat{ξ}} - 1)

(12)

Table 6 presents the return levels for different return periods, computed from Equation (12).

The return level plot (Figure 18) demonstrates strong model–data alignment, with empirical return levels within 95% confidence intervals. However, extrapolation to very high return levels involves substantial uncertainty.

6. Conclusions

Semi-parametric mixture models balance flexibility, efficiency, and interpretability by integrating parametric and non-parametric approaches. Our application to magnitude data pioneers new uses of these models: the sub-threshold portion uses non-parametric fitting, while supra-threshold data follows GPD, enabling complete statistical characterization of seismic magnitudes. Both threshold and bandwidth are directly estimated as parameters, eliminating subjectivity in conventional threshold selection methods. While LOOL-enhanced KDE improves accuracy, its computational demands highlight the need for efficiency-optimized algorithms that preserve precision, a key advancement target. Future research should address smooth transitions between nonparametric and parametric components, incorporate spatial dependencies, and account for temporal non-stationarity in complete magnitude catalogs.

Author Contributions

Methodology, Y.Z. (Yanfang Zhang); Software, Y.Z. (Yanfang Zhang); Investigation, F.W.; Data curation, Y.Z. (Yibin Zhao); Writing—original draft, Y.Z. (Yanfang Zhang). All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Self-Financing Project of Scientific Research and Development Plan of the Lang Fang Science and Technology Bureau, grant number 2024011020.

Data Availability Statement

All data, models, or codes supporting this study’s findings are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Gutenberg, B.; Richter, C.F. Magnitude and energy of earthquakes. Ann. Geophys. 2010, 53, 7–12. [Google Scholar] [CrossRef]
Dutfoy, A. Estimation of tail distribution of the annual maximum earthquake magnitude using extreme value theory. Pure Appl. Geophys. 2019, 176, 527–540. [Google Scholar] [CrossRef]
Dutfoy, A. Earthquake recurrence model based on the generalized Pareto distribution for unequal observation periods and imprecise magnitudes. Pure Appl. Geophys. 2021, 178, 1549–1561. [Google Scholar] [CrossRef]
Zhang, Y.F.; Zhao, Y.B.; Ren, Q.Q. Seismic risk in the east of the Bayan Har block based on the POT model. Geomat. Nat. Hazards Risk 2022, 13, 2697–2711. [Google Scholar] [CrossRef]
Frigessi, A.; Haug, O.; Rue, H. A dynamic mixture model for unsupervised tail estimation without threshold selection. Extremes 2002, 5, 219–235. [Google Scholar] [CrossRef]
Mendes, B.V.d.M.; Lopes, H.F. Data driven estimates for mixtures. Comput. Stat. Data Anal. 2004, 47, 583–598. [Google Scholar] [CrossRef]
Behrens, C.N.; Lopes, H.F.; Gamerman, D. Bayesian analysis of extreme events with threshold estimation. Stat. Model. 2003, 4, 227–244. [Google Scholar] [CrossRef]
Zhang, Y.; Wang, F.; Zhao, Y. Statistical characteristics of earthquake magnitude based on the composite model. AIMS Math. 2024, 9, 607–624. [Google Scholar] [CrossRef]
Heckman, J.; Singer, B. A method for minimizing the impact of distributional assumptions in econometric models for duration data. Econometrica 1984, 52, 271–320. [Google Scholar] [CrossRef]
Wang, Y.; Chee, C.-S. Density estimation using non-parametric and semi-parametric mixtures. Stat. Model. 2012, 12, 67–92. [Google Scholar] [CrossRef]
Fienberg, S.E.; Bromet, E.J.; Follmann, D.; Lambert, D.; May, S.M. Longitudinal analysis of categorical epidemiological data: A study of Three Mile Island. Environ. Health Perspect. 1985, 63, 241–248. [Google Scholar] [CrossRef] [PubMed]
Follmann, D.A.; Lambert, D. Generalizing logistic regression by nonparametric mixing. J. Amer. Statist. Assoc. 1989, 84, 295–300. [Google Scholar] [CrossRef]
Davies, R.B. Nonparametric control for residual heterogeneity in modelling recurrent behaviour. Comput. Stat. Data Anal. 1993, 16, 143–160. [Google Scholar] [CrossRef]
Hall, P.; Zhou, X.-H. Nonparametric estimation of component distributions in a multivariate mixture. Ann. Stat. 2003, 31, 201–224. [Google Scholar] [CrossRef]
Hall, P.; Neeman, A.; Pakyari, R.; Elmore, R. Nonparametric inference in multivariate mixtures. Biometrika 2005, 92, 667–678. [Google Scholar] [CrossRef]
Bordes, L.; Delmas, C.; Vandekerkhove, P. Semiparametric estimation of a two-component mixture model where one component is known. Scand. J. Stat. 2006, 33, 733–752. [Google Scholar] [CrossRef]
Hunter, D.R.; Wang, S.; Hettmansperger, T.P. Inference for mixtures of symmetric distributions. Ann. Stat. 2007, 35, 224–251. [Google Scholar] [CrossRef]
Song, S.; Nicolae, D.L.; Song, J. Estimating the mixing proportion in a semiparametric mixture model. Comput. Stat. Data Anal. 2010, 54, 2276–2283. [Google Scholar] [CrossRef]
Bordes, L.; Kojadinovic, I.; Vandekerkhove, P. Semiparametric estimation of a mixture of two linear regressions where one component is known. Electron. J. Stat. 2013, 7, 2603–2644. [Google Scholar] [CrossRef]
Bordes, L.; Vandekerkhove, P. Semiparametric two-component mixture model with a known component: An asymptotically normal estimator. Math. Methods Stat. 2010, 19, 22–41. [Google Scholar] [CrossRef]
Xiang, S.; Yao, W.; Wu, J. Minimum profile Hellinger distance estimation for a semiparametric mixture model. Can. J. Stat. 2014, 42, 246–267. [Google Scholar] [CrossRef]
Xiang, S.; Yao, W.; Yang, G. An overview of semiparametric extensions of finite mixture models. Stat. Sci. 2019, 34, 391–404. [Google Scholar] [CrossRef]
Huang, M.; Wang, S.; Wang, H.; Jin, T. Maximum smoothed likelihood estimation for a class of semiparametric pareto mixture densities. Stat. Interface 2018, 11, 31–40. [Google Scholar] [CrossRef]
Young, D.; Hunter, D. Mixtures of regressions with predictor-dependent mixing proportions. Comput. Stat. Data Anal. 2010, 54, 2253–2266. [Google Scholar] [CrossRef]
Huang, M.; Yao, W. Mixture of regression models with varying mixing proportions: A semiparametric approach. J. Am. Stat. Assoc. 2012, 107, 711–724. [Google Scholar] [CrossRef]
Macdonald, A.; Scarrott, C.; Lee, D.; Darlow, B.; Reale, M.; Russell, G. A flexible extreme value mixture model. Comput. Stat. Data Anal. 2011, 55, 2137–2157. [Google Scholar] [CrossRef]
Pommeret, D.; Vandekerkhove, P. Semiparametric density testing in the contamination model. Electron. J. Stat. 2019, 13, 4743–4793. [Google Scholar] [CrossRef]
Yin, A.; Yuan, A. Multi-dimensional classification with semiparametric mixture model. Stat. Interface 2020, 13, 347–359. [Google Scholar] [CrossRef]
Tan, X.; Yan, M. Semi-parametric density estimation method based on regular penalty. J. Chongqing Technol. Bus. Univ. Chin. 2025, 42, 1–9. [Google Scholar]
Martins-Ferreira, T.; Sampaio, A.F.; Figueiredo, R.; Lopes, A.R.; Reis, M.T.; Fortes, C.J.E.M.; Silva, R. Hybrid transformer-exceedance models for compound flooding. Nat. Hazards Earth Syst. Sci. 2024, 24, 801–817. [Google Scholar]
Chen, Y.; Zhang, R. Adaptive Bayesian threshold selection for extreme value mixtures. Technometrics 2023, 65, 511–525. [Google Scholar]
Vinayan, S.; Kumar, V.S.; Sajeev, R. Variabilities in the estimate of 100-year return period wave height in the Indian shelf seas. J. Oceanogr. 2024, 80, 377–391. [Google Scholar] [CrossRef]
Liu, Q.; Zhou, H. Massively parallel continuity constraints for semi-parametric extremes. Comput. Stat. Data Anal. 2025, 189, 107831. [Google Scholar]
Kinnison, R.R. Applied Extreme Value Statistics, 1st ed.; Battelle Press: Columbus, OH, USA, 1985; pp. 132–166. [Google Scholar]
Epanechnikov, V.A. Non-parametric estimation of a multivariate probability density. Theory Probab. Appl. 1969, 14, 153–158. [Google Scholar] [CrossRef]
Gramacki, A. Kernel density estimation outperforms orthogonal series and maximum entropy methods. In Nonparametric Kernel Density Estimation and Its Computational Aspects, 1st ed.; Springer International Publishing: Cham, Switzerland, 2018; pp. 23–47. [Google Scholar]
Sheather, S.J.; Jones, M.C. A reliable data-based bandwidth selection method for kernel density estimation. J. R. Stat. Soc. Ser. B 1991, 53, 683–690. [Google Scholar] [CrossRef]
Kamer, Y.; Hiemer, S. Data-driven spatial b value estimation with applications to California seismicity. J. Geophys. Res. Solid Earth 2015, 120, 2601–2618. [Google Scholar] [CrossRef]
Silverman, B.W. Density Estimation for Statistics and Data Analysis; Chapman and Hall: London, UK, 1986; pp. 68–96. [Google Scholar]
Wand, M.P.; Jones, M.C. Multivariate plug-in bandwidth selection. Comput. Stat. Data Anal. 1994, 17, 97–116. [Google Scholar]
Tancredi, A.; Anderson, C.; O’Hagan, A. Accounting for threshold uncertainty in extreme value estimation. Extremes 2006, 9, 87–106. [Google Scholar] [CrossRef]
Habbema, J.; Hermans, J.; van den Broek, K. A stepwise discriminant analysis program using density estimation. In Compstat; Bruckmann, G., Ed.; Physica-Verlag: Vienna, Austria, 1974; pp. 101–110. [Google Scholar]
Duin, R.P.W. On the choice of smoothing parameters for Parzen estimators of probability density functions. IEEE Trans. Comput. C 1976, 25, 1175–1179. [Google Scholar] [CrossRef]
Bowman, A.W. A note on consistency of the kernel method for the analysis of categorical data. Biometrika 1980, 67, 682–684. [Google Scholar] [CrossRef]
Bowman, A.W. An alternative method of cross-validation for the smoothing of density estimates. Biometrika 1984, 71, 353–360. [Google Scholar] [CrossRef]
Chen, L.; Liu, J.; Chen, Y.; Chen, L.S. Aftershock deletion in seismicity analysis. Acta Geophys. Sin. 1998, 41 (Suppl. 1), 244–252. (In Chinese) [Google Scholar]

Figure 1. EDF and the estimated CDF (from the EDF method) of the mixture model (Normal + GPD).

Figure 2. Histogram and fitted PDF (from the EDF method) of the mixture model (Normal + GPD).

Figure 3. EDF and the estimated CDF (from MLE) of the mixture model (Normal + GPD).

Figure 4. Histogram and fitted PDF (from MLE) of the mixture model (Normal + GPD).

Figure 5. EDF and the estimated CDF (from the EDF method) of the mixture model (Weibull + GPD).

Figure 6. Histogram and fitted PDF (from the EDF method) of the mixture model (Weibull + GPD).

Figure 7. EDF and the estimated CDF (from MLE) of the mixture model (Weibull + GPD).

Figure 8. Histogram and fitted PDF (from MLE) of the mixture model (Weibull + GPD).

Figure 9. EDF and the estimated CDF (from the EDF method) of the mixture model (Gamma + GPD).

Figure 10. Histogram and fitted PDF (from the EDF method) of the mixture model (Gamma + GPD).

Figure 11. EDF and the estimated CDF (from MLE) of the mixture model (Gamma + GPD).

Figure 12. Histogram and fitted PDF (from MLE) of the mixture model (Gamma + GPD).

Figure 13. Histogram of earthquake magnitude.

Figure 14. KDE fitting for seismic magnitude data.

Figure 15. Estimated PDF of the mixture model.

Figure 16. The fitted EDF and the observed EDF.

Figure 17. Q-Q plot of the tail data.

Figure 18. Return level plot.

Table 1. Parameter estimates for semi-parametric model (Normal + GPD).

$N (1, 2^{2}) + GPD (ξ = - 0.2, σ = 1.5)$	$\hat{λ}$	$\hat{u}$	$\hat{ξ}$	$\hat{σ}$
Parameter estimations by EDF	0.6005	2.0153	−0.2358	1.4802
Standard deviation	0.0052	0.0211	0.0498	0.0001
Confidence interval (confidence level $α = 0.05$ )	(0.6000, 0.6001)	(1.752, 2.0664)	(−0.3750, −0.0589)	(1.3800, 1.5816)
Parameters by MLE	0.3817	2.0053	−0.2388	1.4901
Standard deviation	0.1213	0.0142	0.0388	0.0008
Confidence interval (confidence level $α = 0.05$ )	(0.1035, 0.5811)	(1.300, 2.1438)	(−0.3419, 0.1207)	(1.331, 1.6734)

Table 2. Parameter estimates for sesemi-parametric model (Weibull + GPD).

$Weibull (1.5, 2) + GPD (ξ = - 0 . 2, σ = 1 . 5)$	$\hat{λ}$	$\hat{u}$	$\hat{ξ}$	$\hat{σ}$
Parameter estimations by EDF	0.6235	2.0003	−0.1958	1.9802
Standard deviation	0.1241	0.0002	0.0498	0.1950
Confidence interval (confidence level $α = 0.05$ )	(0.1003, 0.8005)	(1.8725, 2.0346)	(0.2975, 0.1018)	(1.2943, 2.2473)
Parameters by MLE	0.7482	2.003	−0.2080	1.7744
Standard deviation	0.1774	0.0142	0.0247	0.0877
Confidence interval (confidence level $α = 0.05$ )	(0.1203, 0.8011)	(2.0064, 2.2438)	(−0.3419, −0.1007)	(1.3351, 1.8281)

Table 3. Parameter estimates for semi-parametric model (Gamma + GPD).

$Gamma (1.5, 2) + GPD (ξ = - 0 . 2, σ = 1 . 5)$	$\hat{λ}$	$\hat{u}$	$\hat{ξ}$	$\hat{σ}$
Parameter estimations by EDF	0.1001	2.0051	−0.2463	1.4901
Standard deviation	0.0005	0.0124	0.0235	0.0430
Confidence interval (confidence level $α = 0.05$ )	(0.1001, 0.80121)	(1.5764, 2.5995)	(−0.2803, 0.1484)	(1.3802, 1.5682)
Parameters by MLE	0.1072	2.0235	−0.2176	1.4880
Standard deviation	0.0141	0.0507	0.0683	0.0295
Confidence interval (confidence level $α = 0.05$ )	(0.1103, 0.6012)	(1.7003, 2.3644)	(−0.1504, −0.0146)	(1.3213, 1.6180)

Table 4. Statistical summary of seismic magnitude data.

Minimum	Maximum	Mean	First Quartile (Q1)	Third Quartile (Q3)	Variance	Skewness	Kurtosis
0.0500	8.1000	0.9617	0.3900	1.4100	0.6575	1.5523	6.7771

Table 5. MLEs of parameters for nonparametric and semi-parametric models.

Models	$\hat{λ}$	$\hat{u}$	$\hat{ξ}$	$\hat{σ}$
KDE	0.0591
Semi-parametric mixture model (KDE-GPD)	1.0001	4.9801	−0.2021	0.7514

Table 6. Return levels for different return periods.

Return Period (Years)	Return Level
10	6.3635
20	6.6687
30	6.8283
50	7.0117
60	7.0727
80	7.1645
100	7.2322

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, Y.; Zhao, Y.; Wang, F. A Semi-Parametric KDE-GPD Model for Earthquake Magnitude Analysis. Mathematics 2025, 13, 2003. https://doi.org/10.3390/math13122003

AMA Style

Zhang Y, Zhao Y, Wang F. A Semi-Parametric KDE-GPD Model for Earthquake Magnitude Analysis. Mathematics. 2025; 13(12):2003. https://doi.org/10.3390/math13122003

Chicago/Turabian Style

Zhang, Yanfang, Yibin Zhao, and Fuchang Wang. 2025. "A Semi-Parametric KDE-GPD Model for Earthquake Magnitude Analysis" Mathematics 13, no. 12: 2003. https://doi.org/10.3390/math13122003

APA Style

Zhang, Y., Zhao, Y., & Wang, F. (2025). A Semi-Parametric KDE-GPD Model for Earthquake Magnitude Analysis. Mathematics, 13(12), 2003. https://doi.org/10.3390/math13122003

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Semi-Parametric KDE-GPD Model for Earthquake Magnitude Analysis

Abstract

1. Introduction

2. Semi-Parametric Mixture Model

3. Parameter Estimation of the Semi-Parametric Mixture Model

3.1. MLE of Parameters

3.2. Parameter Estimation Method Based on the EDF

4. Simulation of Parameter Estimation for Semi-Parametric Mixture Models

5. Statistical Characteristic Analysis of Seismic Magnitude Data in the Eastern Bayan Har Block

5.1. Data Statistics

5.2. Nonparametric KDE of the Data

5.3. Data Fitting Using the Semi-Parametric Mixture Model

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI