Journal Description
Stats
Stats
is an international, peer-reviewed, open access journal on statistical science published quarterly online by MDPI. The journal focuses on methodological and theoretical papers in statistics, probability, stochastic processes and innovative applications of statistics in all scientific disciplines including biological and biomedical sciences, medicine, business, economics and social sciences, physics, data science and engineering.
- Open Access— free for readers, with article processing charges (APC) paid by authors or their institutions.
- High Visibility: indexed within ESCI (Web of Science), Scopus, RePEc, and other databases.
- Rapid Publication: manuscripts are peer-reviewed and a first decision is provided to authors approximately 13.7 days after submission; acceptance to publication is undertaken in 2.8 days (median values for papers published in this journal in the first half of 2023).
- Recognition of Reviewers: reviewers who provide timely, thorough peer-review reports receive vouchers entitling them to a discount on the APC of their next publication in any MDPI journal, in appreciation of the work done.
Impact Factor:
1.3 (2022);
5-Year Impact Factor:
1.2 (2022)
Latest Articles
Ensemble Algorithms to Improve COVID-19 Growth Curve Estimates
Stats 2023, 6(4), 990-1007; https://doi.org/10.3390/stats6040062 - 29 Sep 2023
Abstract
In January 2020, the world was taken by surprise as a novel disease, COVID-19, emerged, attributed to the new SARS-CoV-2 virus. Initial cases were reported in China, and the virus rapidly disseminated globally, leading the World Health Organization (WHO) to declare it a
[...] Read more.
In January 2020, the world was taken by surprise as a novel disease, COVID-19, emerged, attributed to the new SARS-CoV-2 virus. Initial cases were reported in China, and the virus rapidly disseminated globally, leading the World Health Organization (WHO) to declare it a pandemic on 11 March 2020. Given the novelty of this pathogen, limited information was available regarding its infection rate and symptoms. Consequently, the necessity of employing mathematical models to enable researchers to describe the progression of the epidemic and make accurate forecasts became evident. This study focuses on the analysis of several dynamic growth models, including the logistics, Gompertz, and Richards growth models, which are commonly employed to depict the spread of infectious diseases. These models are integrated to harness their predictive capabilities, utilizing an ensemble modeling approach. The resulting ensemble algorithm was trained using COVID-19 data from the Brazilian state of Paraíba. The proposed ensemble model approach effectively reduced forecasting errors, showcasing itself as a promising methodology for estimating COVID-19 growth curves, improving data forecasting accuracy, and providing rapid responses in the early stages of the pandemic.
Full article
Open AccessArticle
Confounder Adjustment in Shape-on-Scalar Regression Model: Corpus Callosum Shape Alterations in Alzheimer’s Disease
Stats 2023, 6(4), 980-989; https://doi.org/10.3390/stats6040061 - 28 Sep 2023
Abstract
►▼
Show Figures
Large-scale imaging studies often face challenges stemming from heterogeneity arising from differences in geographic location, instrumental setups, image acquisition protocols, study design, and latent variables that remain undisclosed. While numerous regression models have been developed to elucidate the interplay between imaging responses and
[...] Read more.
Large-scale imaging studies often face challenges stemming from heterogeneity arising from differences in geographic location, instrumental setups, image acquisition protocols, study design, and latent variables that remain undisclosed. While numerous regression models have been developed to elucidate the interplay between imaging responses and relevant covariates, limited attention has been devoted to cases where the imaging responses pertain to the domain of shape. This adds complexity to the problem of imaging heterogeneity, primarily due to the unique properties inherent to shape representations, including nonlinearity, high-dimensionality, and the intricacies of quotient space geometry. To tackle this intricate issue, we propose a novel approach: a shape-on-scalar regression model that incorporates confounder adjustment. In particular, we leverage the square root velocity function to extract elastic shape representations which are embedded within the linear Hilbert space of square integrable functions. Subsequently, we introduce a shape regression model aimed at characterizing the intricate relationship between elastic shapes and covariates of interest, all while effectively managing the challenges posed by imaging heterogeneity. We develop comprehensive procedures for estimating and making inferences about the unknown model parameters. Through real-data analysis, our method demonstrates its superiority in terms of estimation accuracy when compared to existing approaches.
Full article

Figure 1
Open AccessArticle
Terroir in View of Bibliometrics
by
, , , , , , , , and
Stats 2023, 6(4), 956-979; https://doi.org/10.3390/stats6040060 - 27 Sep 2023
Abstract
►▼
Show Figures
This study aimed to perform a bibliometric analysis of terroir and explore its conceptual horizons. Advancements in terroir research until 2022 were investigated using the Scopus database, R, and VOSviewer. Out of the 907 results, the most prevalent document types were articles (771)
[...] Read more.
This study aimed to perform a bibliometric analysis of terroir and explore its conceptual horizons. Advancements in terroir research until 2022 were investigated using the Scopus database, R, and VOSviewer. Out of the 907 results, the most prevalent document types were articles (771) and reviews (70). The annual growth rate of published manuscripts in this field was 7.8%. The research on terroir encompassed a wide range of disciplines, with significant contributions from Agricultural and Biological Sciences, Social Sciences, Environmental Science, Biochemistry, Genetics, and Molecular Biology. Through keyword analysis, the study identified the most frequently occurring terms in titles, abstracts, and keywords fields, including ‘terroir’, ‘wine’, ‘soil’, ‘wines’, ‘grape’, ‘analysis’, ‘vineyard’, ‘composition’, and ‘climate’. A trend topic analysis revealed that research in terroir primarily focused on the geo-ecology and physiology of grapes. Furthermore, considerable attention was given to methods and techniques related to the physicochemical, sensory, and microbial characterization of terroir and various aspects of the wine industry. Initially, the research in this domain was focused on terroir, authenticity, grapevine, soils, soil moisture, and wine quality. However, over time, the research agenda expanded to include topics such as food analysis, viticulture, wine, taste, sustainability, and climate change. New research areas emerged, including phenolic compounds, anthocyanin, phenols, sensory analysis, and precision agriculture—all of which became integral components of the scientific studies on terroir. Overall, this study provided valuable insights into the historical trends and current developments in terroir research, contributing to our understanding of the frontiers in this field.
Full article

Figure 1
Open AccessArticle
A Family of Finite Mixture Distributions for Modelling Dispersion in Count Data
Stats 2023, 6(3), 942-955; https://doi.org/10.3390/stats6030059 - 18 Sep 2023
Abstract
This paper considers the construction of a family of discrete distributions with the flexibility to cater for under-, equi- and over-dispersion in count data using a finite mixture model based on standard distributions. We are motivated to introduce this family because its simple
[...] Read more.
This paper considers the construction of a family of discrete distributions with the flexibility to cater for under-, equi- and over-dispersion in count data using a finite mixture model based on standard distributions. We are motivated to introduce this family because its simple finite mixture structure adds flexibility and facilitates application and use in analysis. The family of distributions is exemplified using a mixture of negative binomial and shifted negative binomial distributions. Some basic and probabilistic properties are derived. We perform hypothesis testing for equi-dispersion and simulation studies of their power and consider parameter estimation via maximum likelihood and probability-generating-function-based methods. The utility of the distributions is illustrated via their application to real biological data sets exhibiting under-, equi- and over-dispersion. It is shown that the distribution fits better than the well-known generalized Poisson and COM–Poisson distributions for handling under-, equi- and over-dispersion in count data.
Full article
(This article belongs to the Special Issue Advances in Probability Theory and Statistics)
►▼
Show Figures

Figure 1
Open AccessArticle
A Detecting System for Abrupt Changes in Temporal Incidence Rate of COVID-19 and Other Pandemics
Stats 2023, 6(3), 931-941; https://doi.org/10.3390/stats6030058 - 18 Sep 2023
Abstract
►▼
Show Figures
COVID-19 spread dramatically across the world in the beginning of 2020. This paper presents a novel alert system that will detect abrupt changes in the COVID-19 or other pandemic incidence rate through the estimated time-varying reproduction number (Rt). We applied the system to
[...] Read more.
COVID-19 spread dramatically across the world in the beginning of 2020. This paper presents a novel alert system that will detect abrupt changes in the COVID-19 or other pandemic incidence rate through the estimated time-varying reproduction number (Rt). We applied the system to detect abrupt changes in the COVID-19 pandemic incidence rates in thirteen world regions with eight in the US and five across the world. Subsequently, we also evaluated the system with the 2009 H1N1 pandemic in Hong Kong. Our system performs well in detecting both the abrupt increases and decreases. Users of the system can obtain accurate information on the changing trend of the pandemic to avoid being misled by low incidence numbers. The world may face other threatening pandemics in the future; therefore, it is crucial to have a reliable alert system to detect impending abrupt changes in the daily incidence rates. An added benefit of the system is its ability to detect the emergence of viral mutations, as different virus strains are likely to have different infection rates.
Full article

Figure 1
Open AccessArticle
Orthonormal F Contrasts for Factors with Ordered Levels in Two-Factor Fixed-Effects ANOVAs
Stats 2023, 6(3), 920-930; https://doi.org/10.3390/stats6030057 - 01 Sep 2023
Abstract
In multifactor fixed-effects ANOVAs, we show how to construct orthonormal F contrasts for main effects. Our primary focus is the case when the levels of the factor of interest are ordered. Likewise, in multifactor equally replicated fixed-effects ANOVAs, we show how to construct
[...] Read more.
In multifactor fixed-effects ANOVAs, we show how to construct orthonormal F contrasts for main effects. Our primary focus is the case when the levels of the factor of interest are ordered. Likewise, in multifactor equally replicated fixed-effects ANOVAs, we show how to construct orthonormal F contrasts for interactions. The primary focus here is on interactions when both factors are ordered, although the approach also applies if just one factor is ordered. Interactions with both factors ordered may be interpreted in terms of generalised correlations.
Full article
(This article belongs to the Section Data Science)
►▼
Show Figures

Figure 1
Open AccessArticle
Investigating Self-Rationalizing Models for Commonsense Reasoning
Stats 2023, 6(3), 907-919; https://doi.org/10.3390/stats6030056 - 29 Aug 2023
Abstract
The rise of explainable natural language processing spurred a bulk of work on datasets augmented with human explanations, as well as technical approaches to leverage them. Notably, generative large language models offer new possibilities, as they can output a prediction as well as
[...] Read more.
The rise of explainable natural language processing spurred a bulk of work on datasets augmented with human explanations, as well as technical approaches to leverage them. Notably, generative large language models offer new possibilities, as they can output a prediction as well as an explanation in natural language. This work investigates the capabilities of fine-tuned text-to-text transfer Transformer (T5) models for commonsense reasoning and explanation generation. Our experiments suggest that while self-rationalizing models achieve interesting results, a significant gap remains: classifiers consistently outperformed self-rationalizing models, and a substantial fraction of model-generated explanations are not valid. Furthermore, training with expressive free-text explanations substantially altered the inner representation of the model, suggesting that they supplied additional information and may bridge the knowledge gap. Our code is publicly available, and the experiments were run on open-access datasets, hence allowing full reproducibility.
Full article
(This article belongs to the Special Issue Machine Learning and Natural Language Processing (ML & NLP))
►▼
Show Figures

Figure 1
Open AccessArticle
Statistical Modeling of Implicit Functional Relations
Stats 2023, 6(3), 889-906; https://doi.org/10.3390/stats6030055 - 25 Aug 2023
Abstract
►▼
Show Figures
This study considers the statistical estimation of relations presented by implicit functions. Such structures define mutual interconnections of variables rather than outcome variable dependence by predictor variables considered in regular regression analysis. For a simple case of two variables, pairwise regression modeling produces
[...] Read more.
This study considers the statistical estimation of relations presented by implicit functions. Such structures define mutual interconnections of variables rather than outcome variable dependence by predictor variables considered in regular regression analysis. For a simple case of two variables, pairwise regression modeling produces two different lines of each variable dependence using another variable, but building an implicit relation yields one invertible model composed of two simple regressions. Modeling an implicit linear relation for multiple variables can be expressed as a generalized eigenproblem of the covariance matrix of the variables in the metric of the covariance matrix of their errors. For unknown errors, this work describes their estimation by the residual errors of each variable in its regression by the other predictors. Then, the generalized eigenproblem can be reduced to the diagonalization of a special matrix built from the variables’ covariance matrix and its inversion. Numerical examples demonstrate the eigenvector solution’s good properties for building a unique equation of the relations between all variables. The proposed approach can be useful in practical regression modeling with all variables containing unobserved errors, which is a common situation for the applied problems.
Full article

Figure 1
Open AccessArticle
Statistical Predictors of Project Management Maturity
by
, and
Stats 2023, 6(3), 868-888; https://doi.org/10.3390/stats6030054 - 15 Aug 2023
Abstract
►▼
Show Figures
Global scenarios of organizations show investments wasted in projects with poor performances in more than 11 percent of cases, according to the Project Management Institute. This research aims to guide organizations in assertively investing in the right pertinent factors to improve project success
[...] Read more.
Global scenarios of organizations show investments wasted in projects with poor performances in more than 11 percent of cases, according to the Project Management Institute. This research aims to guide organizations in assertively investing in the right pertinent factors to improve project success rates and speed up project management maturity at a higher accuracy level using statistical predictions. Challenging existing drivers for project management maturity models and expanding their current practical view will be the result of a quantitative methodology based on a survey supported by data collection targeting the project management community in Brazil. The originality and value of this research are in contributing to the development of new project maturity models statistically supported by the increasing rate of maturity accuracy, which can be continually improved by confident data input into the model. The results show a high correlation between the performance measurement system and the project success rate associated with project management maturity. In addition, this research contemplates the relationship between organizational culture, business type, and project management office and project management maturity.
Full article

Figure 1
Open AccessArticle
Multi-Step-Ahead Prediction Intervals for Nonparametric Autoregressions via Bootstrap: Consistency, Debiasing, and Pertinence
by
and
Stats 2023, 6(3), 839-867; https://doi.org/10.3390/stats6030053 - 11 Aug 2023
Abstract
To address the difficult problem of the multi-step-ahead prediction of nonparametric autoregressions, we consider a forward bootstrap approach. Employing a local constant estimator, we can analyze a general type of nonparametric time-series model and show that the proposed point predictions are consistent with
[...] Read more.
To address the difficult problem of the multi-step-ahead prediction of nonparametric autoregressions, we consider a forward bootstrap approach. Employing a local constant estimator, we can analyze a general type of nonparametric time-series model and show that the proposed point predictions are consistent with the true optimal predictor. We construct a quantile prediction interval that is asymptotically valid. Moreover, using a debiasing technique, we can asymptotically approximate the distribution of multi-step-ahead nonparametric estimation by the bootstrap. As a result, we can build bootstrap prediction intervals that are pertinent, i.e., can capture the model estimation variability, thus improving the standard quantile prediction intervals. Simulation studies are presented to illustrate the performance of our point predictions and pertinent prediction intervals for finite samples.
Full article
(This article belongs to the Section Time Series Analysis)
Open AccessArticle
Analysis of Ordinal Populations from Judgment Post-Stratification
by
and
Stats 2023, 6(3), 812-838; https://doi.org/10.3390/stats6030052 - 09 Aug 2023
Abstract
In surveys requiring cost efficiency, such as medical research, measuring the variable of interest (e.g., disease status) is expensive and/or time-consuming; however, we often have access to easily obtainable characteristics about sampling units. These characteristics are not typically employed in the data collection
[...] Read more.
In surveys requiring cost efficiency, such as medical research, measuring the variable of interest (e.g., disease status) is expensive and/or time-consuming; however, we often have access to easily obtainable characteristics about sampling units. These characteristics are not typically employed in the data collection process. Judgment post-stratification (JPS) sampling enables us to supplement the random samples from the population of interest with these characteristics as ranking information. This paper develops methods based on the JPS samples for estimating categorical ordinal populations. We develop various estimators from the JPS data even for situations where the JPS suffers from empty strata. We also propose the JPS estimators using multiple ranking resources. Through extensive numerical studies, we evaluate the performance of the methods in estimating the population. Finally, the developed estimation methods are applied to bone mineral data to estimate the bone disorder status of women aged 50 and older.
Full article
(This article belongs to the Section Statistical Methods)
►▼
Show Figures

Figure 1
Open AccessCommunication
On the Extreme Value
by
, , and
Stats 2023, 6(3), 802-811; https://doi.org/10.3390/stats6030051 - 04 Aug 2023
Abstract
►▼
Show Figures
In the present paper, a new special function, the so-called extreme value -function, is introduced. This new function, which is a generalization of the H-function with a particular set of parameters, appears while dealing with products and quotients of a wide class
[...] Read more.
In the present paper, a new special function, the so-called extreme value -function, is introduced. This new function, which is a generalization of the H-function with a particular set of parameters, appears while dealing with products and quotients of a wide class of extreme value random variables. Some properties, special cases and a series representation are provided. Some statistical applications are also briefly discussed.
Full article

Figure 1
Open AccessArticle
The New Exponentiated Half Logistic-Harris-G Family of Distributions with Actuarial Measures and Applications
Stats 2023, 6(3), 773-801; https://doi.org/10.3390/stats6030050 - 31 Jul 2023
Abstract
►▼
Show Figures
In this study, we introduce a new generalized family of distributions called the Exponentiated Half Logistic-Harris-G (EHL-Harris-G) distribution, which extends the Harris-G distribution. The motivation for introducing this generalized family of distributions lies in its ability to overcome the limitations of previous families,
[...] Read more.
In this study, we introduce a new generalized family of distributions called the Exponentiated Half Logistic-Harris-G (EHL-Harris-G) distribution, which extends the Harris-G distribution. The motivation for introducing this generalized family of distributions lies in its ability to overcome the limitations of previous families, enhance flexibility, improve tail behavior, provide better statistical properties and find applications in several fields. Several statistical properties, including hazard rate function, quantile function, moments, moments of residual life, distribution of the order statistics and Rényi entropy are discussed. Risk measures, such as value at risk, tail value at risk, tail variance and tail variance premium, are also derived and studied. To estimate the parameters of the EHL-Harris-G family of distributions, the following six different estimation approaches are used: maximum likelihood (MLE), least-squares (LS), weighted least-squares (WLS), maximum product spacing (MPS), Cramér–von Mises (CVM), and Anderson–Darling (AD). The Monte Carlo simulation results for EHL-Harris-Weibull (EHL-Harris-W) show that the MLE method allows us to obtain better estimates, followed by WLS and then AD. Finally, we show that the EHL-Harris-W distribution is superior to some other equi-parameter non-nested models in the literature, by fitting it to two real-life data sets from different disciplines.
Full article

Figure 1
Open AccessCommunication
Khinchin’s Fourth Axiom of Entropy Revisited
Stats 2023, 6(3), 763-772; https://doi.org/10.3390/stats6030049 - 27 Jul 2023
Abstract
The Boltzmann–Gibbs–Shannon (BGS) entropy is the only entropy form satisfying four conditions known as Khinchin’s axioms. The uniqueness theorem of the BGS entropy, plus the fact that Shannon’s mutual information completely characterizes independence between the two underlying random elements, puts the BGS entropy
[...] Read more.
The Boltzmann–Gibbs–Shannon (BGS) entropy is the only entropy form satisfying four conditions known as Khinchin’s axioms. The uniqueness theorem of the BGS entropy, plus the fact that Shannon’s mutual information completely characterizes independence between the two underlying random elements, puts the BGS entropy in a special place in many fields of study. In this article, the fourth axiom is replaced by a slightly weakened condition: an entropy whose associated mutual information is zero if and only if the two underlying random elements are independent. Under the weaker fourth axiom, other forms of entropy are sought by way of escort transformations. Two main results are reported in this article. First, there are many entropies other than the BGS entropy satisfying the weaker condition, yet retaining all the desirable utilities of the BGS entropy. Second, by way of escort transformations, the newly identified entropies are the only ones satisfying the weaker axioms.
Full article
(This article belongs to the Section Data Science)
Open AccessArticle
Exploring Heterogeneity with Category and Cluster Analyses for Mixed Data
Stats 2023, 6(3), 747-762; https://doi.org/10.3390/stats6030048 - 05 Jul 2023
Abstract
►▼
Show Figures
Precision medicine aims to overcome the traditional one-model-fits-the-whole-population approach that is unable to detect heterogeneous disease patterns and make accurate personalized predictions. Heterogeneity is particularly relevant for patients with complications of type 2 diabetes, including diabetic kidney disease (DKD). We focus on a
[...] Read more.
Precision medicine aims to overcome the traditional one-model-fits-the-whole-population approach that is unable to detect heterogeneous disease patterns and make accurate personalized predictions. Heterogeneity is particularly relevant for patients with complications of type 2 diabetes, including diabetic kidney disease (DKD). We focus on a DKD longitudinal dataset, aiming to find specific subgroups of patients with characteristics that have a close response to the therapeutic treatment. We develop an approach based on some particular concepts of category theory and cluster analysis to explore individualized modelings and achieving insights onto disease evolution. This paper exploits the visualization tools provided by category theory, and bridges category-based abstract works and real datasets. We build subgroups deriving clusters of patients at different time points, considering a set of variables characterizing the state of patients. We analyze how specific variables affect the disease progress, and which drug combinations are more effective for each cluster of patients. The retrieved information can foster individualized strategies for DKD treatment.
Full article

Figure 1
Open AccessCommunication
Some More Results on Characterization of the Exponential and Related Distributions
Stats 2023, 6(3), 740-746; https://doi.org/10.3390/stats6030047 - 29 Jun 2023
Abstract
There are given characterizations of the exponential distribution based on the properties of independence of linear forms with random coefficients. Results based on the constancy of regression of one statistic in a linear form are obtained. Related characterizations based on the property of
[...] Read more.
There are given characterizations of the exponential distribution based on the properties of independence of linear forms with random coefficients. Results based on the constancy of regression of one statistic in a linear form are obtained. Related characterizations based on the property of the identical distribution of statistics are also provided.
Full article
(This article belongs to the Special Issue Advances in Probability Theory and Statistics)
Open AccessCommunication
Guess for Success? Application of a Mixture Model to Test-Wiseness on Multiple-Choice Exams
Stats 2023, 6(3), 734-739; https://doi.org/10.3390/stats6030046 - 26 Jun 2023
Abstract
The use of large lecture halls in business and economic education often dictates the use of multiple-choice exams to measure student learning. This study asserts that student performance on these types of exams can be viewed as the result of the process of
[...] Read more.
The use of large lecture halls in business and economic education often dictates the use of multiple-choice exams to measure student learning. This study asserts that student performance on these types of exams can be viewed as the result of the process of elimination of incorrect answers, rather than the selection of the correct answer. More specifically, how students respond on a multiple-choice test can be broken down into the fractions of questions where no wrong answers can be eliminated (i.e., random guessing), one wrong answer can be eliminated, two wrong answers can be eliminated, and all wrong answers can be eliminated. The results from an empirical model, representing a mixture of binomials in which the probability of a correct choice depends on the number of incorrect choices eliminated, we find, using student performance data from a final exam in principles of microeconomics consisting of 100 multiple choice questions, that the responses to all of the questions on the exam can be characterized by some form of guessing, with more than 26 percent of questions being completed using purely random guessing.
Full article
Open AccessArticle
The Gamma-Topp-Leone-Type II-Exponentiated Half Logistic-G Family of Distributions with Applications
Stats 2023, 6(2), 706-733; https://doi.org/10.3390/stats6020045 - 19 Jun 2023
Abstract
►▼
Show Figures
The new Ristić and Balakhrisnan or Gamma-Topp-Leone-Type II-Exponentiated Half Logistic-G (RB-TL-TII-EHL-G) family of distributions is introduced and investigated in this paper. This work derives and studies some of the main statistical characteristics of this new family of distributions. The maximum likelihood estimation technique
[...] Read more.
The new Ristić and Balakhrisnan or Gamma-Topp-Leone-Type II-Exponentiated Half Logistic-G (RB-TL-TII-EHL-G) family of distributions is introduced and investigated in this paper. This work derives and studies some of the main statistical characteristics of this new family of distributions. The maximum likelihood estimation technique is used to estimate the model parameters, and a simulation study is used to assess the consistency of the estimators. Applications to three real-life datasets from various fields show the value and adaptability of the new RB-TL-TII-EHL-G family of distributions. From our results, it is evident that the new proposed distribution is flexible enough to characterize datasets from different fields compared to several other existing distributions in the literature.
Full article

Figure 1
Open AccessArticle
Modeling Model Misspecification in Structural Equation Models
Stats 2023, 6(2), 689-705; https://doi.org/10.3390/stats6020044 - 14 Jun 2023
Cited by 1
Abstract
Structural equation models constrain mean vectors and covariance matrices and are frequently applied in the social sciences. Frequently, the structural equation model is misspecified to some extent. In many cases, researchers nevertheless intend to work with a misspecified target model of interest. In
[...] Read more.
Structural equation models constrain mean vectors and covariance matrices and are frequently applied in the social sciences. Frequently, the structural equation model is misspecified to some extent. In many cases, researchers nevertheless intend to work with a misspecified target model of interest. In this article, a simultaneous statistical inference for sampling errors and model misspecification errors is discussed. A modified formula for the variance matrix of the parameter estimate is obtained by imposing a stochastic model for model errors and applying M-estimation theory. The presence of model errors is quantified in increased standard errors in parameter estimates. The proposed inference is illustrated with several analytical examples and an empirical application.
Full article
(This article belongs to the Special Issue Advances in Probability Theory and Statistics)
Open AccessArticle
Area under the Curve as an Alternative to Latent Growth Curve Modeling When Assessing the Effects of Predictor Variables on Repeated Measures of a Continuous Dependent Variable
Stats 2023, 6(2), 674-688; https://doi.org/10.3390/stats6020043 - 25 May 2023
Abstract
Researchers conducting longitudinal data analysis in psychology and the behavioral sciences have several statistical methods to choose from, most of which either require specialized software to conduct or advanced knowledge of statistical methods to inform the selection of the correct model options (e.g.,
[...] Read more.
Researchers conducting longitudinal data analysis in psychology and the behavioral sciences have several statistical methods to choose from, most of which either require specialized software to conduct or advanced knowledge of statistical methods to inform the selection of the correct model options (e.g., correlation structure). One simple alternative to conventional longitudinal data analysis methods is to calculate the area under the curve (AUC) from repeated measures and then use this new variable in one’s model. The present study assessed the relative efficacy of two AUC measures: the AUC with respect to the ground (AUC-g) and the AUC with respect to the increase (AUC-i) in comparison to latent growth curve modeling (LGCM), a popular repeated measures data analysis method. Using data from the ongoing Panel Study of Income Dynamics (PSID), we assessed the effects of four predictor variables on repeated measures of social anxiety, using both the AUC and LGCM. We used the full information maximum likelihood (FIML) method to account for missing data in LGCM and multiple imputation to account for missing data in the calculation of both AUC measures. Extracting parameter estimates from these models, we next conducted Monte Carlo simulations to assess the parameter bias and power (two estimates of performance) of both methods in the same models, with sample sizes ranging from 741 to 50. The results using both AUC measures in the initial models paralleled those of LGCM, particularly with respect to the LGCM baseline. With respect to the simulations, both AUC measures preformed as well or even better than LGCM in all sample sizes assessed. These results suggest that the AUC may be a viable alternative to LGCM, especially for researchers with less access to the specialized software necessary to conduct LGCM.
Full article
(This article belongs to the Section Statistical Methods)
►▼
Show Figures

Figure 1
Highly Accessed Articles
Latest Books
E-Mail Alert
News
Topics
Topic in
Entropy, Mathematics, Modelling, Stats
Interfacing Statistics, Machine Learning and Data Science from a Probabilistic Modelling Viewpoint
Topic Editors: Jürgen Pilz, Noelle I. Samia, Dirk HusmeierDeadline: 31 December 2024

Conferences
Special Issues
Special Issue in
Stats
Statistics, Analytics, and Inferences for Discrete Data
Guest Editor: Dungang LiuDeadline: 30 November 2023
Special Issue in
Stats
Advances in State-Space Modeling of Time Series
Guest Editors: Marco André da Silva Costa, Magda Sofia Valério MonteiroDeadline: 31 December 2023
Special Issue in
Stats
Statistical Learning for High-Dimensional Data
Guest Editor: Paulo Canas RodriguesDeadline: 31 January 2024
Special Issue in
Stats
Statistical Developments and Applications in Rare Disease Diagnostic Testing and Classification
Guest Editor: Kabir JalalDeadline: 29 February 2024