Previous Article in Journal
Unraveling Similarities and Differences Between Non-Negative Garrote and Adaptive Lasso: A Simulation Study in Low- and High-Dimensional Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Individual Homogeneity Learning in Density Data Response Additive Models

1
Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA 98109, USA
2
School of Statistics and Data Science, Shanghai University of Finance and Economics, Shanghai 200433 , China
3
Department of Mathematics and Statistics, McMaster University, Hamilton, ON L8S 4L8, Canada
*
Author to whom correspondence should be addressed.
Stats 2025, 8(3), 71; https://doi.org/10.3390/stats8030071 (registering DOI)
Submission received: 14 July 2025 / Revised: 6 August 2025 / Accepted: 7 August 2025 / Published: 9 August 2025

Abstract

In many complex applications, both data heterogeneity and homogeneity are present simultaneously. Overlooking either aspect can lead to misleading statistical inferences. Moreover, the increasing prevalence of complex, non-Euclidean data calls for more sophisticated modeling techniques. To address these challenges, we propose a density data response additive model, where the response variable is represented by a distributional density function. In this framework, individual effect curves are assumed to be homogeneous within groups but heterogeneous across groups, while covariates that explain variation share common additive bivariate functions. We begin by applying a transformation to map density functions into a linear space. To estimate the unknown subject-specific functions and the additive bivariate components, we adopt a B-spline series approximation method. Latent group structures are uncovered using a hierarchical agglomerative clustering algorithm, which allows our method to recover the true underlying groupings with high probability. To further improve estimation efficiency, we develop refined spline-backfitted local linear estimators for both the grouped structures and the additive bivariate functions in the post-grouping model. We also establish the asymptotic properties of the proposed estimators, including their convergence rates, asymptotic distributions, and post-grouping oracle efficiency. The effectiveness of our method is demonstrated through extensive simulation studies and real-world data analysis, both of which show promising and robust performance.
Keywords: heterogeneity; latent group structures; hierarchical agglomerative clustering; post-grouping oracle heterogeneity; latent group structures; hierarchical agglomerative clustering; post-grouping oracle

Share and Cite

MDPI and ACS Style

Han, Z.; Li, T.; You, J.; Balakrishnan, N. Individual Homogeneity Learning in Density Data Response Additive Models. Stats 2025, 8, 71. https://doi.org/10.3390/stats8030071

AMA Style

Han Z, Li T, You J, Balakrishnan N. Individual Homogeneity Learning in Density Data Response Additive Models. Stats. 2025; 8(3):71. https://doi.org/10.3390/stats8030071

Chicago/Turabian Style

Han, Zixuan, Tao Li, Jinhong You, and Narayanaswamy Balakrishnan. 2025. "Individual Homogeneity Learning in Density Data Response Additive Models" Stats 8, no. 3: 71. https://doi.org/10.3390/stats8030071

APA Style

Han, Z., Li, T., You, J., & Balakrishnan, N. (2025). Individual Homogeneity Learning in Density Data Response Additive Models. Stats, 8(3), 71. https://doi.org/10.3390/stats8030071

Article Metrics

Back to TopTop