Next Article in Journal
Wavelet and Deep Learning Framework for Predicting Commodity Prices Under Economic and Financial Uncertainty
Previous Article in Journal
Adaptive Terminal Sliding Mode Control for a Quadrotor System with Barrier Function Switching Law
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Parameter Estimation of Geographically and Temporally Weighted Elastic Net Ordinal Logistic Regression

1
Department of Statistics, Faculty of Science and Data Analytics, Institut Teknologi Sepuluh Nopember, Surabaya 60111, Indonesia
2
Department of Statistics, School of Computer Science, Bina Nusantara University, Jakarta Barat 11480, Indonesia
*
Author to whom correspondence should be addressed.
Mathematics 2025, 13(8), 1345; https://doi.org/10.3390/math13081345
Submission received: 6 February 2025 / Revised: 24 February 2025 / Accepted: 5 March 2025 / Published: 19 April 2025
(This article belongs to the Special Issue Spatial Statistics Methods and Modeling)

Abstract

:
Geographically and Temporally Weighted Elastic Net Ordinal Logistic Regression is a parsimonious ordinal logistic regression with consideration of the existence of spatial and temporal effects. This model has been developed with the following three considerations: the spatial effect, the temporal effect, and predictor selection. The last point prompted the use of Elastic Net regularization in choosing predictors while handling multicollinearity, which often arises when there are many predictors involved. The Elastic Net penalty combines ridge and LASSO penalties, leading to the determination of the appropriate λ E N and α E N . Therefore, the objective of this study is to determine the parameter estimator using Maximum Likelihood Estimation. The estimation process comprises defining the likelihood function, determining the natural logarithm of the likelihood function, and maximizing the function using Berndt–Hall–Hall–Hausman. These steps continue until the estimator converges on the values that maximize the likelihood function. This study contributes by developing an estimation framework that integrates spatial and temporal effects with Elastic Net regularization, allowing for improved model interpretation and stability. The findings provide an advanced methodological approach for ordinal logistic regression models that incorporate spatial and temporal dependencies. This framework is particularly useful for applications in fields such as economic forecasting, epidemiology, and environmental studies, where ordinal responses exhibit spatial and temporal patterns.
MSC:
62F10; 62H11

1. Introduction

One of the important steps in regression analysis is deciding which predictors are to be included in the model. Often, the result of other research can motivate the use of certain predictors. Exploring more research results increases the number of predictors to use in the model. If the number of predictors increases, the estimated regression model becomes more complicated. Many predictors in the estimate model result in a large amount of data that we must collect to obtain the estimated value. When there are many predictors then there would be overfitting [1] and multicollinearity [2].
In logistic regression, overfitting yields a large value of the odds ratio, and the confidence interval of the parameter might be extremely wide [3]. Multicollinearity also causes the logistic estimators to have a high standard error and, in certain cases, the estimator may have the incorrect sign [4]. Generally, the presence of overfitting and multicollinearity negatively affects the estimated regression model, which leads to the wrong conclusion. As the main point of applying a regression model is to obtain a good estimated model that predicts a new response with minimal error, both violations above must be handled. One approach that can be taken is using the regularization techniques.
Regularization techniques, such as Ridge Regression (RR) [5], Least Absolute Shrinkage and Selection Operator (LASSO) [6], and Elastic Net (EN) [7], can improve prediction accuracy. EN combines the regularization techniques of RR and LASSO. EN has been used in linear regression [8,9,10], and in logistic regression [11,12,13]. In these studies, EN has been successful in choosing predictors with no multicollinearity.
This paper introduces a model, namely the Geographically and Temporally Weighted Elastic Net Ordinal Logistic Regression (GTWENOLR) model. There are three features in the GTWENOLR model: the effect of location, the effect of time, and predictor selection. As a part of regression, this model has an estimated model and hypothesis tests. This study will describe the parameter estimator of GTWENOLR.
Many researchers have studied the effect of location in regression modeling, including Purhadi et al. [14] who proposed the Geographically Weighted Ordinal Logistic Regression (GWOLR) model. This model is an extension of Ordinal Logistic Regression (OLR), incorporating spatial effects. The presence of spatial influence leads to the creation of localized OLR regression model estimates for each distinct area. This means that instead of assuming a homogeneous relationship between predictor variables and the response variable across the entire study region, the model estimates a unique set of regression coefficients for each location. As a result, GWOLR requires the estimation of multiple sets of coefficients, reflecting the local variations that exist in the data. Although this approach increases model complexity, it enables a more precise understanding of how localized influences affect the outcome being studied, ultimately providing a more accurate representation of spatial heterogeneity in the data.
As one of the spatial regressions, there is a spatial weight that causes each observation to have a different estimated value, and each value could have different significant independent variables. To determine the parameter estimator model, Purhadi et al. employed the Maximum Likelihood Estimation (MLE) and the Newton–Raphson iteration algorithm to determine the optimal solution [14]. Other methods, such as Fisher scoring [15], can also be used as iteration techniques. The result of these studies showed that the location factor also affected the regression modeling, even when the response has an ordinal scale.
However, parameter estimation in the GWOLR model is complex due to its non-closed form structure, requiring numerical iteration. Unlike models with analytical solutions, methods like MLE and local likelihood approaches depend on iterative procedures to reach stable estimates. These iterations capture local variations in the data while balancing computational efficiency with accuracy. Choosing the appropriate iteration method is critical. Considerations include convergence speed, stability, and computational complexity. The Newton–Raphson method is commonly used for its quadratic convergence but requires computationally expensive second derivatives (the Hessian matrix). In cases where computing the Hessian is difficult or the parameter space is expansive, quasi-Newton methods like Broyden–Fletcher–Goldfarb–Shanno (BFGS) offer an efficient alternative by approximating the Hessian, as well as balancing speed and accuracy [16]. The Expectation-Maximization (EM) algorithm may be helpful when dealing with missing or latent data, as it breaks the estimation into more manageable steps [17]. Ultimately, the choice of iteration method should depend on the model structure, the quality of initial parameter estimates, and the available computational resources, ensuring reliable convergence and efficient parameter estimation.
In 2024, Ohyver et al. [18] developed a Geographically and Temporally Weighted Ordinal Logistic Regression (GTWOLR) model, an extension of the GWOLR model that incorporates the influence of time. The estimator was obtained using MLE and Berndt–Hall–Hall–Hausman (BHHH) as the iteration technique. This study’s findings are consistent with other temporal regression analyses. For example, Ma et al. conclude that Geographically Weighted Regression (GWR) is not suitable to model the transit ridership as there is significant hourly average ridership at certain times [19]. Similarly, Ling et al. highlighted the need for models that capture variation in effects across both time and space when analyzing right-turn lane crashes in Indiana. Their study shows that the Geographically and Temporally Weighted Negative Binomial (GTWNBR) is the best model to capture the space and time instability in crashes [20]. These cases highlight the importance of choosing a model that takes location and time into account. The GTWOLR is one of the regression models that capture those two aspects. The existence of spatial and temporal weights causes every location in different time periods to have a different model.
The final feature in GTWENOLR is its ability to construct a parsimonious predictive regression model. In other words, the predictors in GTWENOLR are selected predictors using EN. EN is a combination of LASSO and RR. The development of EN was conceived as a method that would produce better predictions if the predictors were highly correlated [10]. The successful application of EN in predictor selection has been demonstrated in various studies focusing on spatial logistic regression [21,22]. These studies have shown that incorporating EN leads to models that are both simpler and more robust, with fewer predictors, that still capture the essential information required for accurate predictions. The compelling performance of these models provides a strong rationale for employing the EN method in other regression frameworks, including the GTWENOLR model proposed here. By leveraging EN, the model not only achieves a more streamlined predictor set but also improves overall predictive performance, making it a valuable tool for analyses involving complex spatial and temporal data.
Another important perspective is the challenge of multicollinearity, which is prevalent in spatial–temporal datasets where predictors often exhibit high intercorrelations. Multicollinearity can lead to unstable parameter estimates and inflated variances, thereby undermining the reliability of the model [23]. Traditional methods, such as LASSO, may arbitrarily select one predictor from a group of highly correlated variables, potentially discarding valuable information, while RR only shrinks coefficients without eliminating any variables [24]. EN effectively overcomes these issues by combining the strengths of both approaches; it not only performs predictor selection but also encourages the grouping of correlated predictors. This dual action stabilizes the estimation process, reduces variance, and enhances model robustness—making EN an ideal choice for the GTWENOLR model in handling complex predictor interrelationships.
The highlighted features of the GTWENOLR model can be counted as the contributions of this model. Based on these contributions, we need to construct the estimation model. We will estimate the model parameters using the MLE and BHHH iteration techniques. Therefore, obtaining the parameter estimators for GTWENOLR is the goal of this study. This study has the advantage of supporting researchers in modeling cases that have location and time effects.

2. Elastic Net Ordinal Logistic Regression

Elastic Net Ordinal Logistic Regression (ENOLR) is an OLR model that selects predictors. These predictors have been selected using EN, so the accuracy prediction can be improved [21]. EN is one of the regularization methods that combine LASSO and RR to address the weaknesses of each method.
In 1970, Hoerl and Kennard introduce RR as a regression model that can handle multicollinearity. The RR adds a penalty to the loss function in ordinal logistic regression. This penalty, called an L 2 -penalty, reduces large coefficient values and helps avoid overfitting. Adding this penalty to the loss function of OLR, then the parameter estimator of ENOLR model obtained by solving the formula in (1).
i = 1 n g = 1 G y i g log exp α g R + x i β R 1 + exp α g R + x i β R exp α g 1 R + x i β R 1 + exp α g 1 R + x i β R λ R β R 2 2
The β R = β 1 R , β 2 R ,   ,   β K R in (1) is the vector of regression coefficients of ENOLR, and α g R , g = 1 ,   2 ,   ,   G 1 are the intercepts. The λ R penalty in (1) is a positive number that defines the relative trade-off between the model variance and bias. When there is no penalty, the error is inflated. As the penalty increases, the bias becomes too large, and the model starts to underfit. While RR shrinks the parameter estimates towards 0, the model does not set the values to absolute 0 for any value of the penalty. Even though some parameter estimates become negligibly small, this model does not conduct predictor selection [7].
Predictor selection refers to the process of choosing the most relevant predictor to include in a regression model. This selection is related to determining which predictors should be included in the model and in what form they should be included [25]. RR is a good regression model when there is a multicollinearity, but it does not produce a parsimonious model [26]. Therefore, there is LASSO regression, which was developed by Tibshirani in 1996. There are two advantages in using LASSO regression, the accuracy of prediction and the convenience in interpretation [6].
LASSO is a penalized least squares method imposing an L 1 -penalty on the regression coefficients. With the addition of this penalty, ENOLR estimators were obtained by minimizing the following equation.
i = 1 n g = 1 G y i g log exp α g L + x i β L 1 + exp α g L + x i β L exp α g 1 L + x i β L 1 + exp α g 1 L + x i β L λ L β L 2 2 .
While the regression coefficients still shrunk towards 0, a consequence of penalizing the absolute values is that some parameters are actually set to 0 for some value of λ L . Thus, the lasso yields models simultaneously use regularization to improve the model and to conduct predictor selection.
While considering the . , there are three conditions for the solution of (2).
Condition   1 : β L > 0 , then   β L = β L ,
Condition   2 : β L   <   0 , then   β L = β L , and
Condition   3 : β L = 0 , then   β L = 0 .
Even though the LASSO regression has two advantages, it also has one drawback, especially compared to RR. Since the main idea of LASSO is to select the predictor, then in the result there is a certain variable that sometimes it is not the important one.
EN is one of the techniques that combines the capabilities of RR and LASSO. It means EN regression will choose predictors while overcoming the effects of multicollinearity. Ref. [7] stated that the advantage of this model is that it enables effective regularization via the ridge-type penalty with predictor selection quality of the LASSO penalty.
The EN regression model combines two penalties, which [26] stated that the EN estimator is a mixture of ridge and LASSO estimators. The estimators will be achieved by minimizing the formula in (6).
i = 1 n g = 1 G y i g log exp α g E N + x i β E N 1 + exp α g E N + x i β E N exp α g 1 E N + x i β E N 1 + exp α g 1 E N + x i β E N
λ E N 1 ϑ E N β E N 1 + ϑ E N β E N 2 2 .
The regularization parameter λ E N in (6) is the sum of two nonnegative penalties, λ E N = λ L + λ R . Let ϑ E N = λ R λ L + λ R and 1 ϑ E N = λ L λ L + λ R , where 0 < α E N < 1 [27]. Then,
β ^ E N = β ^ R ,   ϑ E N = 0 β ^ E N , 0 < ϑ E N < 1 β ^ L , ϑ E N = 1 .
The values of λ L and λ R can be determined using Cross Validation (CV) and Generalized Cross Validation (GCV).

3. Geographically and Temporally Weighted Ordinal Logistic Regression

A logistic regression model called the Geographically and Temporally Weighted Ordinal Logistic Regression (GTWOLR) is used to find the relationship between an ordinal dependent variable and independent variables [18]. The observations used are locations, where the locations are observed over time. The logit model gives in (7).
logit P y i T g | x i t = α g u i , v i , t i + x i t β T u i , v i , t i ;   g = 1,2 , , G 1 ; t = 1,2 , T .
where:
P y i T g | x i t is a cumulative probability for g category given x i t , β T u i , v i , t i = β 1 T u i , v i , t i , β 2 T u i , v i , t i , β K T u i , v i , t i is regression coefficient vector at location i and time t , α g u i , v i , t i is intercept with α 1 T u i , v i , t i α 2 T u i , v i , t i α G 1 , T u i , v i , t i , and u i , v i , t i is coordinate point at location i and time t .
Equation (7) shows β T u i , v i , t i as the model parameter. In order to determine the estimators, suppose there are n vectors of random variables samples y 1 T , y 2 T , , y n T where y i T = [ y i 1 T y i 1 T y i , G 1 , T ] have multinomial distribution with π g T x i T as the probability of g category. Equation (8) gives the set of parameters.
Ω T = α 1 T u i , v i , t i , α 2 T u i , v i , t i , , α G 1 , T u i , v i , t i ,
β T u i , v i , t i ;   i = 1,2 , , n
The natural logarithm of the likelihood function for every location for GTWOLR is given in (9).
log L Ω i T
= j = 1 n g = 1 G h = 1 T w i j h y j g h log exp α g T u i , v i , t i + x j T β T u i , v i , t i 1 + exp α g T u i , v i , t i + x i T β T u i , v i , t i
exp α g 1 , T u i , v i , t i + x j T β T u i , v i , t i 1 + exp α g 1 , T u i , v i , t i + x j T β T u i , v i , t i .
In Equation (9), there is w i j h , the diagonal element of spatial–temporal matrix W u i , v i , t i .
Suppose the dependent variable has three categories, then (9) changes to (10).
log L Ω i T
= j = 1 n h = 1 T w i j h y j 1 h exp α 1 T u i , v i , t i + x j T β T u i , v i , t i
        y j 1 h + y j 2 h log 1 + exp α 1 T u i , v i , t i
        + y j 2 h log exp α 2 T u i , v i , t i + x j T β T u i , v i , t i
        exp α 1 T u i , v i , t i + x j T β T u i , v i , t i
      + y j 1 h 1 log 1 + e x p α 2 T u i , v i , t i + x j T β T u i , v i , t i .
The next step is taking the first derivative with respect to each of parameter models for t = T using (10). The results are given in (11)–(13).
log L Ω i T     α 1 T u i , v i , t i
= j = 1 n h = 1 T w i j h y j 1 h y j 1 h + y j 2 h exp α 1 T u i , v i , t i + x j T β T u i , v i , t i 1 + exp α 1 T u i , v i , t i + x j T β T u i , v i , t i
y j 2 h exp α 1 T u i , v i , t i + x j T β T u i , v i , t i Δ 1 .
log L Ω i T     α 2 T u i , v i , t i
= j = 1 n h = 1 T w i j h y j 2 h exp α 2 T u i , v i , t i + x j T β T u i , v i , t i Δ 1
+ y j 1 h 1 exp α 2 T u i , v i , t i + x j T β T u i , v i , t i 1 + exp α 2 T u i , v i , t i + x j T β T u i , v i , t i .
log L Ω i T     β T u i , v i , t i
= j = 1 n h = 1 T w i j h y j 1 h + y j 2 h x j T 1 1 + exp α 1 T u i , v i , t i + x j T β T u i , v i , t i
+ y j 1 h 1 x j T exp α 2 T u i , v i , t i + x j T β T u i , v i , t i 1 + exp α 2 T u i , v i , t i + x j T β T u i , v i , t i .
where:
Δ 1 = exp α 2 T u i , v i , t i + x j T β T u i , v i , t i exp α 1 T u i , v i , t i + x j T β T u i , v i , t i .
To obtain the estimators, the BHHH iteration is applied and gives Ω ^ T as the set of parameter estimators which are displayed in (14).
Ω ^ i T = α ^ 1 T u i , v i , t i , α ^ 2 T u i , v i , t i ,
β ^ 1 T u i , v i , t i , β ^ 2 T u i , v i , t i , , β ^ K T u i , v i , t i ;   i = 1,2 , , n .

4. Geographically and Temporally Weighted Elastic Net Ordinal Logistic Regression

The GTWENOLR model integrates spatial and temporal weighting into an ordinal logistic regression framework, enhanced by the EN penalization technique. This model assigns distinct weights to each observation based on both spatial and temporal proximity, allowing it to capture heterogeneity across various locations and time periods. The application of EN—which combines the strengths of LASSO and RR—enables effective variable selection and addresses issues of multicollinearity, particularly when predictors are highly correlated. This results in more stable and interpretable parameter estimates. For parameter estimation in the GTWENOLR model, MLE is employed, with optimization performed using the BHHH iterative algorithm.
GTWENOLR is developed by GTWOLR which focuses on selecting the predictor. The regularization term for this model is defined in (15).
λ E N ϑ E N β E N T u i , v i , t i 1 + 1 2 1 ϑ E N β E N T u i , v i , t i 2 2 .
The set of parameters for GTWENOLR, Ω E N T , that must be estimated, are displayed in (16).
Ω E N T = α E N 1 T u i , v i , t i , α E N 2 T u i , v i , t i , , α E N G 1 , T u i , v i , t i ,
β E N T u i , v i , t i ;   i = 1,2 , , n .
Suppose the response variable has three categories, then:
Ω E N T = α E N 1 T u i , v i , t i , α E N 2 T u i , v i , t i ,   β E N T u i , v i , t i ;   i = 1,2 , , n .
Using Equation (10), the natural logarithm of the likelihood function for every location for GTWENOLR is given in (18).
log L Ω i E N T
= j = 1 n h = 1 T w i j h y j 1 h exp α E N 1 T u i , v i , t i + x j T β E N T u i , v i , t i
y j 1 h + y j 2 h log 1 + exp α E N 1 T u i , v i , t i
+ y j 2 h log exp α E N 1 T u i , v i , t i + x j T β E N T u i , v i , t i
exp α E N 1 T u i , v i , t i + x j T β E N T u i , v i , t i
+ y j 1 h 1 log 1 + e x p α E N 2 T u i , v i , t i + x j T β E N T u i , v i , t i
+ λ E N ϑ E N β E N T u i , v i , t i 1 + 1 2 1 ϑ E N β E N T u i , v i , t i 2 2 .
The estimator of α E N 1 T u i , v i , t i and α E N 2 T u i , v i , t i is obtained by solving the formula in (19) and (20). Further, folllowing (3)–(5), the estimator for β E N T will determined using three conditions, and defined in (21)–(23).
log L Ω i E N T     α E N 1 T u i , v i , t i
= j = 1 n h = 1 T w i j h y j 1 h y j 1 h + y j 2 h exp α E N 1 T u i , v i , t i + x j T β E N T u i , v i , t i 1 + exp α E N 1 T u i , v i , t i + x j T β E N T u i , v i , t i
y j 2 h exp α E N 1 T u i , v i , t i + x j T β E N T u i , v i , t i Δ 2 .
log L Ω i E N T     α E N 2 T u i , v i , t i
= j = 1 n h = 1 T w i j h y j 2 h exp α E N 2 T u i , v i , t i + x j T β E N T u i , v i , t i Δ 2
      + y j 1 h 1 exp α E N 2 T u i , v i , t i + x j T β E N T u i , v i , t i 1 + exp α E N 2 T u i , v i , t i + x j T β E N T u i , v i , t i .
where:
Δ 2 = exp α E N 2 T u i , v i , t i + x j T β E N T u i , v i , t i exp α E N 1 T u i , v i , t i + x j T β E N T u i , v i , t i .
Condition   1 : β E N T u i , v i , t i > 0 ,   then   β E N T u i , v i , t i = β E N T u i , v i , t i , and
log L Ω i E N T     β E N T u i , v i , t i
= j = 1 n h = 1 T w i j h y j 1 h + y j 2 h x j T 1 1 + exp α 1 T u i , v i , t i + x j T β T u i , v i , t i
+ y j 1 h 1 x j T exp α 2 T u i , v i , t i + x j T β T u i , v i , t i 1 + exp α 2 T u i , v i , t i + x j T β T u i , v i , t i
λ E N ϑ E N + 1 ϑ E N β E N T u i , v i , t i .
Condition   2 : β E N T u i , v i , t i > 0 ,   then   β E N T u i , v i , t i = β E N T u i , v i , t i , and
log L Ω i E N T     β E N T u i , v i , t i
= j = 1 n h = 1 T w i j h y j 1 h + y j 2 h x j T 1 1 + exp α 1 T u i , v i , t i + x j T β T u i , v i , t i
+ y j 1 h 1 x j T exp α 2 T u i , v i , t i + x j T β T u i , v i , t i 1 + exp α 2 T u i , v i , t i + x j T β T u i , v i , t i
λ E N ϑ E N + 1 ϑ E N β E N T u i , v i , t i .
Condition   3 : β E N T u i , v i , t i = 0 ,   then   β E N T u i , v i , t i = 0 ,
log L Ω i E N T     β E N T u i , v i , t i
= j = 1 n h = 1 T w i j h y j 1 h + y j 2 h x j T 1 1 + exp α 1 T u i , v i , t i + x j T β T u i , v i , t i
+ y j 1 h 1 x j T exp α 2 T u i , v i , t i + x j T β T u i , v i , t i 1 + exp α 2 T u i , v i , t i + x j T β T u i , v i , t i
1 ϑ E N β E N T u i , v i , t i .
The addition of a regularization term in determining the estimator causes complicated computation since there are several things to do. When using GTWOLR, one focuses on determining spatial and temporal bandwidth, although this task also takes time to acquire the optimal bandwidth. For GTWENOLR, the task increases by determining λ E N and ϑ E N . The bandwidths, λ E N and ϑ E N , need to be derived first before we determine the estimator [24]. This is also supported by Tay et al. [28], who proposed an algorithm to estimate the parameters of extending EN to all Generalized Linear Model (GLM) families.
Since Equations (19)–(23) are nonanalytical equations, then we applied the BHHH algorithm to obtain the solutions. Here are the algorithms to obtain the estimators of the GTWENOLR model.
  • Select a value of ϑ E N dan λ E N , where ϑ E N 0,1 dan λ E N 1 > λ E N 2 > > λ E N m .
  • Determine the threshold, ε . The ε is a small positive number to determine when to stop iterating.
  • Determine the initial values, α E N 1 T ( 0 ) u i , v i , t i , α E N 2 T ( 0 ) u i , v i , t i , β E N T ( 0 ) u i , v i , t i , H Ω ^ i E N T 0 , and g Ω ^ i E N T 0 . The estimated values of OLR can be used as the initial values. H · and g · are the Hessian matrix and the gradient vector.
  • For k = 1 , 2 , , m , start the iteration using:
    Ω ^ i E N T + 1 = Ω ^ i E N T H 1 Ω ^ i E N T g Ω ^ i E N T ,  
    where:
    g j Ω ^ i E N T = log l Ω i E N T α E N 1 T u i , v i , t i log l Ω i E N T α E N 2 T u i , v i , t i log l Ω i E N T β E N T u i , v i , t i T ,
    and
    H Ω ^ i E N T = j = 1 n g j Ω ^ i E N T g j T Ω ^ i E N T .
  • Stop the iteration when Ω ^ E N T + 1 Ω ^ E N T ε .
The choosing of ϑ E N dan λ E N is an important task in GTWENOLR. There are several techniques that we can use to choose them. For example, we can use CV [29] and GCV [30].

5. Discussion

The dataset for GTWENOLR consists of the n location which is observed T times. In obtaining the estimator, all these observations are used. In Equation (8), there is a spatial–temporal matrix, W u i , v i , t i , that causes the estimated parameter to vary over location and time. In computing W u i , v i , t i , we need to determine the spatial and temporal bandwidths. We can use the Fotheringham et al. algorithm [31], or the Huang et al. algorithm [32]. Incorporating the W u i , v i , t i produces multiple regression models. However, these estimated models have not considered the possibility of the existence of multicollinearity. This multicollinearity sometimes arises when there are many predictors involved. This is also related to which or how many predictors we want to use in our model. Adopting the idea of [32], we use EN to select the independent variables and to handle the multicollinearity on GTWOLR.
Using the concept of EN led GTWOLR to transform to GTWENOLR, which has an optimal number of predictors with no multicollinearity. The ϑ E N is known as the EN penalty, which bridges the gap between RR and LASSO. When the ϑ E N approaches zero, the GTWENOLR tends to have RR estimated coefficients. The values of the coefficients shrink towards zero, yet it does not eliminate any coefficients entirely. Because all the predictors remain in the model, the RR does not produce an optimal model. When there are many predictors and we keep them in our model, the overfitting may arise [5]. LASSO yields a simple regression model, where the predictors can be fewer in count. The existence of the LASSO penalty makes some coefficients shrink to zero, that is, make the model simpler. When ϑ E N = 1 , GTWENOLR has LASSO coefficients. The purpose of the use of EN in GTWOLR is to obtain a simple regression model with no multicollinearity. One of the obstacles when using EN is how to choose the appropriate ϑ E N and λ E N . Another obstacle is how to choose the optimal spatial and temporal bandwidths.
In [31], CV is used to determine the optimal bandwidth. To compute the CV, one must know the fitted value for the i response, with point i excluded from the calibration process. In our proposed model, since the response has an ordinal category, we can use the Leave-One-Out Approach Error Rate (LOOAER).
A relevant case for applying GTWENOLR is the Human Development Index (HDI). The HDI measures how the population can access the outcomes of development in terms of income, health, education, and other aspects. It serves as an important indicator for measuring the success of improvement of human life quality. However, the HDI varies significantly due to socioeconomic disparities, infrastructure quality, and geographical differences. In urban areas, income levels and healthcare access often drive development, while in rural regions, education accessibility and basic infrastructure play a more pivotal role. Additionally, geographic factors such as climate and natural resource availability further shape development patterns, underscoring the need for region-specific policy interventions to address localized challenges effectively. Over time, government policies, economic growth, demographic shifts, and external shocks (such as financial crises or pandemics) contribute to HDI fluctuations. These temporal dynamics underscore the need for region-specific and time-sensitive policy interventions to ensure sustainable and equitable human development. Given these complexities, GTWENOLR provides a robust framework for analyzing the HDI by incorporating both spatial and temporal effects. The model captures spatial dependencies, where the HDI in one region may be influenced by neighboring regions due to economic spillovers or shared infrastructure. Simultaneously, temporal dependencies are considered to reflect long-term policy impacts and socioeconomic trends. Furthermore, GTWENOLR mitigates multicollinearity through EN regularization, ensuring more stable and interpretable estimates.

6. Conclusions

The GTWENOLR model was developed to address the limitations of the GWOLR model, which, despite its ability to account for spatial variation, does not incorporate temporal effects. While there has been research discussing the influence of time in regression modeling, GWOLR does not fully capture the dynamic nature of temporal changes, which can limit its applicability in time-sensitive analyses. Additionally, the GTWENOLR model itself faces challenges, particularly in handling issues such as multicollinearity and overfitting. These problems arise when spatial and temporal variables interact in complex ways, leading to unstable parameter estimates. To get around these problems and make the GTWENOLR model more reliable, more advanced techniques like regularization need to be added.
Using MLE and the BHHH algorithm to estimate the parameters of the GTWENOLR model provides several key advantages. When used together, MLE and the BHHH algorithm make computations faster, especially when working with complicated models that have both spatial and temporal parts. The coefficients obtained from this method enable the development of regression models that vary by location and time, which better capture the unique characteristics of each specific geographic area and temporal period. Additionally, this approach allows for the selection of independent variables that do not exhibit high multicollinearity, improving the stability and interpretability of the model. As a result, the model becomes more adaptive and accurately reflects spatial and temporal variations in the data.
The GTWENOLR model offers several opportunities for future research and development. One potential extension is Bayesian GTWENOLR, which incorporates prior distributions for greater robustness in handling uncertainty and small sample sizes. Additionally, integrating machine learning techniques, such as deep learning, could enhance the model’s ability to capture nonlinear spatial–temporal dependencies. Further improvements may include adaptive bandwidth selection, allowing data-driven adjustments to spatial and temporal weighting. Expanding GTWENOLR to multivariate ordinal outcomes could also enhance its applicability in fields like economic forecasting, healthcare analytics, and climate modeling. Moreover, a formal investigation of the asymptotic properties of the estimators, such as consistency and asymptotic normality, remains an open question. Establishing these properties would strengthen the theoretical foundation of the model and ensure its reliability for large-sample applications. These advancements would further refine GTWENOLR as a versatile and powerful tool for spatial–temporal ordinal classification problems.

Author Contributions

Conceptualization, P., A.C. and M.O.; methodology, P., A.C. and M.O.; writing—original draft preparation, M.O.; writing—review and editing, P., A.C. and M.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Acknowledgments

All authors thank the editor and reviewers for the improvement of this paper through criticism and suggestions provided.

Conflicts of Interest

The authors declare no conflicts of interest.

Nomenclature

BHHHBerndt–Hall–Hall–Hausman.
ENElastic Net.
ENOLRElastic Net Ordinal Logistic Regression.
GTWENOLRGeographically and Temporally Weighted Elastic Net Ordinal Logistic Regression.
GTWNBRGeographically and Temporally Weighted Negative Binomial Regression.
GTWOLRGeographically and Temporally Weighted Ordinal Logistic Regression.
GWOLRGeographically Weighted Ordinary Logistic Regression.
GWRGeographically Weighted Regression.
LASSOLeast Absolute Shrinkage and Selection Operator.
LOOAERLeave-One-Out Approach Error Rate.
MLEMaximum Likelihood Estimation.
OLROrdinal Logistic Regression.
RRRidge Regression.

References

  1. Wu, D.; Xu, J. On the optimal weighted l2 regularization in overparameterized linear regression. Adv. Neural Inf. Process. Syst. 2020, 33, 10112–10123. [Google Scholar]
  2. Dertli, H.I.; Hayes, D.B.; Zorn, T.G. Effects of multicollinearity and data granularity on regression models of stream temperature. J. Hydrol. 2024, 639, 131572. [Google Scholar] [CrossRef]
  3. Zabor, E.C.; Reddy, C.A.; Tendulkar, R.D.; Patil, S. Logistic Regression in Clinical Studies. Int. J. Radiat. Oncol. Biol. Phys. 2022, 112, 271–277. [Google Scholar] [CrossRef] [PubMed]
  4. Lukman, A.F.; Kibria, B.M.G.; Nziku, C.K.; Amin, M.; Adewuyi, E.T.; Farghali, R. K-L Estimator: Dealing with Multicollinearity in the Logistic Regression Model. Mathematics 2023, 11, 340. [Google Scholar] [CrossRef]
  5. Tsigler, A.; Bartlett, P.L. Benign overfitting in ridge regression. J. Mach. Learn. Res. 2020, 24, 1–76. [Google Scholar]
  6. Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning Data Mining, Inference, and Prediction. Math. Intell. 2009, 27, 83–85. [Google Scholar]
  7. Kuhn, M.; Johnson, K. Applied Predictive Modeling; Springer: New York, NY, USA, 2013; ISBN 9781461468493. [Google Scholar]
  8. Hans, C. Elastic net regression modeling with the orthant normal prior. J. Am. Stat. Assoc. 2011, 106, 1383–1393. [Google Scholar] [CrossRef]
  9. Giglio, C.; Brown, S.D. Using elastic net regression to perform spectrally relevant variable selection. J. Chemom. 2018, 32, e3034. [Google Scholar] [CrossRef]
  10. Sloboda, B.W.; Pearson, D.; Etherton, M. An application of the LASSO and elastic net regression to assess poverty and economic freedom on ECOWAS countries. Math. Biosci. Eng. 2023, 20, 12154–12168. [Google Scholar] [CrossRef]
  11. Das, U.; Das, K. Selection of influential variables in ordinal data with preponderance of zeros. Stat. Neerl. 2021, 75, 66–87. [Google Scholar] [CrossRef]
  12. Wang, C.; Li, N.; Diao, H.; Lu, L. Variable selection through adaptive elastic net for proportional odds model. Jpn. J. Stat. Data Sci. 2024, 7, 203–221. [Google Scholar] [CrossRef]
  13. Wang, Q.; Hall, G.J.; Zhang, Q.; Comella, S. Predicting implementation of response to intervention in math using elastic net logistic regression. Front. Psychol. 2024, 15, 1410396. [Google Scholar] [CrossRef] [PubMed]
  14. Purhadi; Rifada, M.; Wulandari, S.P. Geographically Weighted Ordinal Logistic Regression Model. Int. J. Math. Comput. 2012, 16, 116–126. [Google Scholar]
  15. Zuhdi, S.; Widyaningsih, P.; Sari Saputro, D.R.; Putri, A.N. Fisher Scoring Method for Parameter Estimation of Geographically Weighted Ordinal Logistic Regression (GWOLR) Model. J. Phys. Conf. Ser. 2017, 855, 012060. [Google Scholar] [CrossRef]
  16. Saputro, D.R.S.; Widyaningsih, P. Limited memory Broyden-Fletcher-Goldfarb-Shanno (L-BFGS) method for the parameter estimation on geographically weighted ordinal logistic regression model (GWOLR). In AIP Conference Proceedings; AIP Publishing: Melville, NY, USA, 2017; Volume 1868. [Google Scholar]
  17. Zhang, J.; Wu, X.; Edwin Chow, T. Space-time cluster’s detection and geographical weighted regression analysis of COVID-19 mortality on texas counties. Int. J. Environ. Res. Public Health 2021, 18, 5541. [Google Scholar] [CrossRef]
  18. Ohyver, M.; Purhadi; Choiruddin, A. Parameter Estimation of Geographically and Temporally Weighted Ordinal Logistic Regression Model. In Proceedings of the Conference Proceedings for the 13th ISNPINSA 2023, Semarang, Indonesia, 8–9 November 2023.
  19. Ma, X.; Zhang, J.; Ding, C.; Wang, Y. A geographically and temporally weighted regression model to explore the spatiotemporal influence of built environment on transit ridership. Comput. Environ. Urban Syst. 2018, 70, 113–124. [Google Scholar] [CrossRef]
  20. Ling, L.; Zhang, W.; Bao, J.; Ukkusuri, S.V. Influencing factors for right turn lane crash frequency based on geographically and temporally weighted regression models. J. Saf. Res. 2023, 86, 191–208. [Google Scholar] [CrossRef]
  21. Comber, A.; Harris, P. Geographically weighted elastic net logistic regression. J. Geogr. Syst. 2018, 20, 317–341. [Google Scholar] [CrossRef]
  22. Cannarile, F.; Compare, M.; Baraldi, P.; Diodati, G.; Quaranta, V.; Zio, E. Elastic net multinomial logistic regression for fault diagnostics of on-board aeronautical systems. Aerosp. Sci. Technol. 2019, 94, 105392. [Google Scholar] [CrossRef]
  23. Kyriazos, T.; Poga, M. Dealing with Multicollinearity in Factor Analysis: The Problem, Detections, and Solutions. Open J. Stat. 2023, 13, 404–424. [Google Scholar] [CrossRef]
  24. Li, K.; Lam, N.S.N. Geographically Weighted Elastic Net: A Variable-Selection and Modeling Method under the Spatially Nonstationary Condition. Ann. Am. Assoc. Geogr. 2018, 108, 1582–1600. [Google Scholar] [CrossRef]
  25. Chatterjee, S.; Hadi, A.S. Regression Analysis by Example; Wiley: Hoboken, NJ, USA, 2012. [Google Scholar]
  26. Zou, H.; Hastie, T. Regularization and variable selection via the elastic net. J. R. Stat. Society. Ser. B Stat. Methodol. 2005, 67, 301–320. [Google Scholar] [CrossRef]
  27. Altelbany, S. Evaluation of Ridge, Elastic Net and Lasso Regression Methods in Precedence of Multicollinearity Problem: A Simulation Study. J. Appl. Econ. Bus. Stud. 2021, 5, 131–142. [Google Scholar] [CrossRef]
  28. Tay, J.K.; Narasimhan, B.; Hastie, T. Elastic Net Regularization Paths for All Generalized Linear Models. J. Stat. Softw. 2021, 106, 1–31. [Google Scholar] [CrossRef]
  29. Ahmad, G.N.; Fatima, H.; Shafiullah; Saidi, S.; Imdadullah, A. Efficient Medical Diagnosis of Human Heart Diseases Using Machine Learning Techniques with and Without GridSearchCV. IEEE Access 2022, 10, 80151–80173. [Google Scholar] [CrossRef]
  30. Chaves, L.M.; de Carvalho, L.D.; dos Reis, C.J.; de Souza, D.J. Explaining the Generalized Cross-Validation on Linear Models. J. Math. Stat. 2019, 15, 298–307. [Google Scholar] [CrossRef]
  31. Fotheringham, A.S.; Crespo, R.; Yao, J. Geographical and Temporal Weighted Regression (GTWR). Geogr. Anal. 2015, 47, 431–452. [Google Scholar] [CrossRef]
  32. Huang, B.; Wu, B.; Barry, M. Geographically and temporally weighted regression for modeling spatio-temporal variation in house prices. Int. J. Geogr. Inf. Sci. 2010, 24, 383–401. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ohyver, M.; Purhadi; Choiruddin, A. Parameter Estimation of Geographically and Temporally Weighted Elastic Net Ordinal Logistic Regression. Mathematics 2025, 13, 1345. https://doi.org/10.3390/math13081345

AMA Style

Ohyver M, Purhadi, Choiruddin A. Parameter Estimation of Geographically and Temporally Weighted Elastic Net Ordinal Logistic Regression. Mathematics. 2025; 13(8):1345. https://doi.org/10.3390/math13081345

Chicago/Turabian Style

Ohyver, Margaretha, Purhadi, and Achmad Choiruddin. 2025. "Parameter Estimation of Geographically and Temporally Weighted Elastic Net Ordinal Logistic Regression" Mathematics 13, no. 8: 1345. https://doi.org/10.3390/math13081345

APA Style

Ohyver, M., Purhadi, & Choiruddin, A. (2025). Parameter Estimation of Geographically and Temporally Weighted Elastic Net Ordinal Logistic Regression. Mathematics, 13(8), 1345. https://doi.org/10.3390/math13081345

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop