1. Introduction
Today’s elementary school children are growing up in an environment in which digital technologies permeate virtually every aspect of their daily lives. Early adolescence, typically spanning Grades 3 through 6, represents a developmental transition period during which children experience significant increases in digital media exposure and access to personal devices. Survey research indicates that elementary school children are increasingly consuming online video- and game-based content and are gaining independent access to smartphones and tablets (
Rideout & Robb, 2019;
Rideout et al., 2022).
As peer interactions move further into digital environments (
Subrahmanyam & Šmahel, 2011;
Valkenburg & Peter, 2011), children may become particularly vulnerable to problematic or excessive media use. Although digital media can support learning and social connectedness (
Canadian Paediatric Society, 2019), its excessive use has been associated with reduced self-regulation, sleep disruption, impaired daily functioning, and increased interpersonal conflict (
OECD, 2024). International monitoring further underscores these concerns, with a WHO HBSC report showing an increase in problematic social media use from 7% in 2018 to 11% in 2022 (
WHO Regional Office for Europe, 2024).
From a neurodevelopmental perspective, early adolescence is considered a critical period. The prefrontal systems underlying self-regulation are still maturing while environmental demands are increasing, making behavioral patterns particularly susceptible to consolidation (
Crone & Konijn, 2018). However, much of the existing research relies on cross-sectional designs or single time-point predictors. As a result, there is limited longitudinal evidence regarding which children persistently follow high-risk trajectories of digital media overdependence and how these patterns evolve over time. Addressing this gap requires analytical approaches capable of capturing latent state changes and transitions, alongside theoretical frameworks that account for multidimensional childhood risks and protective factors.
Developmental cascade and risk accumulation models (
Blair & Raver, 2015;
Masten & Cicchetti, 2010) provide a strong conceptual foundation for anticipating stable, high-risk pathways. These models posit that early vulnerabilities, such as executive function difficulties, sleep problems, parental stress, and reduced parent–child interaction, interact and accumulate across development, shaping long-term self-regulation trajectories. From this perspective, persistent media overdependence may emerge when early risk processes compound and reinforce one another over time. Thus, this framework offers a compelling rationale for modeling longitudinal transitions in risk states rather than focusing solely on single time-point associations.
The predictors examined in this study were selected for their theoretical relevance and empirical support within the developmental, self-regulation, and media use frameworks (
Baumeister & Heatherton, 1996;
Brand et al., 2014;
Y. Chen et al., 2024;
Diamond, 2013;
Domoff et al., 2020;
Fitzpatrick et al., 2025;
Linebarger et al., 2014;
McDaniel & Radesky, 2018). These factors have consistently emerged as core mechanisms underlying problematic media use and represent modifiable targets for early intervention (
Linebarger & Vaala, 2010;
Madigan et al., 2020;
Radesky et al., 2014). Because the developmental pathways identified through the Hidden Markov Models (HMM) reflect latent trajectories of self-regulatory development rather than static clusters, it is theoretically meaningful to hypothesize that early regulatory capacities and environmental conditions shape whether children remain in or transition out of high-risk pathways.
Executive function (EF) has been established as a central mechanism of self-regulation and predictor of impulsivity, attentional dysregulation, and problematic media use (
Diamond, 2013;
Fitzpatrick et al., 2025). Sleep provides a biobehavioral foundation for emotional and behavioral regulation, and early sleep problems predict later internalizing and externalizing difficulties (
Blacher et al., 2024;
Quach et al., 2018). Furthermore, screen time has been consistently linked to reduced sleep duration and later bedtime (
Bauducco et al., 2024;
Brosnan et al., 2024;
Y. Chen et al., 2024). Family factors, including parenting stress, parent–child interaction, and early media exposure, further shape children’s self-regulation and subsequent media-use patterns (
Madigan et al., 2020;
McDaniel & Radesky, 2018). Given that these domains interact and accumulate over time, identifying how early childhood factors shape trajectories of digital media overdependence is an essential developmental question.
From a methodological standpoint, this study applies the HMM to estimate latent states and transition probabilities (
Rabiner, 1989). The HMM is well-suited for examining media-use trajectories because it does not require predefined latent profiles (
Bartolucci et al., 2012) and can identify data-driven state patterns when the underlying structure is unknown. By contrast, latent transition analysis requires predefined latent profiles, making HMM a more appropriate choice when the latent-state structure has not yet been empirically established. Furthermore, machine learning approaches, including SHAP-based interpretation, enable the examination of nonlinear relationships among multidimensional early predictors, which is an improvement over prior work that has often relied on limited predictor sets or single-time-point measures. Together, these analytical approaches strengthen explanatory precision and support the earlier identification of children at elevated risk.
Finally, this study draws on the interactional theory of problematic childhood media use (IT-CPMU;
Domoff et al., 2020). This theory posits that self-regulatory vulnerabilities interact with family and contextual factors, thereby reinforcing problematic media use patterns over time. The high-risk maintenance pathways identified in this study reflected these interactional processes and underscored the need for developmentally timed interventions.
Building on this foundation, this study investigated how multidimensional early childhood and early elementary school factors shape the developmental trajectories of digital media overdependence in early adolescence. Using long-term longitudinal data, we identified HMM-based developmental pathways and detected early risk signals using machine learning to inform prevention and intervention strategies across family, school, and community settings. Accordingly, we addressed the following two research questions:
When distinguishing the developmental trajectories of digital media overdependence among early adolescents using HMM, can we statistically identify a subgroup with a persistently high-risk status that is distinct from other trajectories?
Which early and middle childhood factors predict a high-risk maintenance trajectory of digital media overdependence in early adolescence?
2. Methods
2.1. Participants and Data Source
We analyzed secondary data from the Panel Study on Korean Children (PSKC), a nationally representative panel survey of Korean children and their families that initially recruited families of children born in 2008, and followed them annually thereafter. In the present study, we used four consecutive assessments from wave 10 (age 9) to wave 13 (age 12). Children who participated in all four waves were included (n = 1354), comprising of 686 boys (50.66%) and 668 girls (49.34%). Sex was coded as 1 for boys and 2 for girls, with boys used as the reference category in the subsequent analyses. Using an HMM classification of media overdependence trajectories across waves, 188 children (13.9%) were identified as the persistent high-risk group, and 1166 (86.1%) as the non-high-risk group. The Institutional Review Board of Dong-A University exempted the study (exemption no. 2-1040709-AB-N-01-202507-HR-048-01; determination date: 17 July 2025).
2.2. Measures
2.2.1. Outcome: Media Overdependence (Scoring and Risk Classification)
At each wave (w10–w13), media overdependence was assessed using a standardized instrument comprising of three subscales: self-regulation difficulty (F1), daily life interference (F3), and control failure (F4). The internal consistency was high across the waves (Cronbach’s α = 0.840, 0.854, 0.864, and 0.855, respectively). The instrument and all risk thresholds were based on the nationally validated Korean Smartphone and Media Overdependence Scale developed by the National Information Society Agency is widely used for population monitoring and clinical screening (
Yang & Kim, 2021). No new cut-off values were introduced in the present study.
Risk categories were assigned using validated concurrent thresholds. Children were classified as general users when the total score was ≤27 and subscale scores satisfied F1 ≤ 12, F3 ≤ 10, and F4 ≤ 9. They were classified as potential-risk users when the total score was 28–29 or subscale scores met F1 ≥ 13, F3 ≥ 11, and F4 ≥ 10, provided that high-risk criteria were not met. High-risk users were defined as those with a total score ≥30 or subscale scores satisfying F1 ≥ 14, F3 ≥ 12, and F4 ≥ 11. Thresholds were applied concurrently; category assignment required meeting both the total score range and the corresponding subscale cutoffs. In cases of overlap, the classification defaulted to the high-risk category.
These wave-specific classifications were used as the observed emissions for the HMM, which estimated the latent states (normal, at-risk, and high-risk) and transitions across the waves. For machine learning analyses, we constructed a binary outcome variable coded 1 for children who were classified as high risk in all four waves (persistent high risk) and 0 otherwise.
2.2.2. Predictor Variables
Predictor variables were drawn from waves 7 to 10 and spanned the demographic, child, and parent–family domains. The demographic variables included the child’s sex and age, maternal and paternal age, proportion of parental university education, and monthly household income. Child-related factors included externalizing problems (cbt_expT_c7), EF difficulties (exf_c8), school adjustment (pScl_c8; single item), sleep duration (sleepTime_c7), overall media use time (midUseTime_c8), proportions of gaming- and learning-oriented use (midUse_game_c9 and midUse_learn_c9), and longitudinal change indices.
Parent/family factors included maternal parenting stress (prs_m7), maternal–family interaction (fli_m7), maternal controlling parenting (crs_ct_m7), paternal controlling parenting (crs_ct_f7), and the corresponding residualized change scores. Multi-item scales demonstrated acceptable reliability (Cronbach’s α ≥ 0.70): exf_c8 = 0.944, prs_m7 = 0.881, fli_m7 = 0.912, crs_ct_m7 = 0.737, and crs_ct_f7 = 0.786. All continuous predictors were standardized using z-scores.
The residualized change variables (prefixed with “change_”) were computed as the residuals from regressing later-wave scores on earlier-wave scores. These values represented the net change from the baseline. Positive scores indicate increases greater than expected given the baseline, whereas negative scores indicate below-expected increases or relative declines. When the baseline and residualized change variables were included simultaneously in the models, the variance inflation factors were below 2.5, suggesting that multicollinearity was not a substantive concern.
Information on variable coding is provided in
Supplementary Table S1. The maximum item-level missingness for any predictor was 16%, with all missingness rates being below 20%. Pairwise correlations among the predictors were below 0.50, and the full correlation matrix is shown in
Supplementary Figure S1.
2.3. Missing Data Handling
Missingness was summarized at both the wave and predictor levels. Wave-level missingness across waves 10–13 ranged from 0.05% to 6.1%, and the maximum item-level missingness among predictors was 16% (all < 20%); detailed percentages for each variable are provided in
Supplementary Table S2. Missing predictor values were imputed using multivariate imputation with chained equations and predictive mean matching, generating 20 datasets, which were pooled according to Rubin’s rules. The imputation model incorporated all predictors alongside conceptually related auxiliary variables (e.g., concurrent indicators of sleep, parenting stress, and family interaction) to strengthen the plausibility of the missing-at-random assumption. The convergence of the MICE algorithm was evaluated using trace plots to examine the stability of the chain means across 50 iterations, which confirmed stable mixing with no systematic trends (see
Supplementary Figure S2). In addition, the density plots show comparable distribution shapes between the observed and imputed values, supporting the adequacy of the imputation process (see
Supplementary Figure S3).
To avoid circularity and artificial stabilization of the state process, the HMM-derived latent states and trajectory labels were excluded from the imputation model. As a result, the HMM estimation was conducted separately using the original observed risk categories. Media-overdependence outcomes were not imputed because the risk labels were determined using HMM.
2.4. Analytical Strategy
The analysis is conducted in three stages. First, we generated descriptive statistics and conducted bivariate comparisons (t tests and x2 tests) between the persistent high-risk and non-high-risk groups, with internal consistency assessed using Cronbach’s α. Second, we estimated the HMMs to identify the latent states of media overdependence and their transition probabilities across waves. Model fit indices-guided state selection, and Viterbi decoding were used to assign the most likely latent state sequence for each child. Third, we constructed supervised machine learning models to predict persistent high-risk membership based on baseline predictors.
These steps involved data wrangling and visualization using the
tidyverse suite (
Wickham, 2016;
Wickham et al., 2019), including
dplyr,
tidyr, and
ggplot2. The machine learning component was implemented within a predictive rather than a causal inference framework to support early screening and risk identification, rather than to estimate causal effects.
2.5. Hidden Markov Modeling
Wave-to-wave changes in the observed risk categories (general, potential, and high-risk) were modeled using a time-homogeneous first-order HMM. Candidate models with two to four latent states were fitted and compared based on the log-likelihood, Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), entropy, and interpretability to determine the optimal specification. Model parameters, including initial state probabilities, transition matrices, and state-dependent emission probabilities, were estimated using the EM (Baum–Welch) algorithm with multiple random starting values to avoid local optima, and individual state sequences and posterior probabilities were derived using Viterbi decoding. When the transition probabilities could not be extracted directly from the fitted model, they were estimated from the iterbi-decoded transitions as a pre-specified fallback. Classification uncertainty was summarized using posterior probabilities. For downstream machine learning analyses, a persistent binary high-risk label was created, defined as being decoded as high risk for all four waves. All HMM analyses were performed in R using the depmixS4 package (
Visser & Speekenbrink, 2010). Although the wave-specific classifications were based on validated cutoff criteria, these thresholds can be sensitive to short-term fluctuations and may over discretize the risk. HMMs refine these classifications by incorporating temporal continuity and probabilistic state decoding, thereby reducing noise from single-wave misclassification and providing more developmentally meaningful estimates of persistent or changing media use risk states over time. Thus, HMM-derived latent states serve as conceptually sound and methodologically robust outcomes for predicting developmental trajectories of media overdependence risk.
2.6. Machine Learning Modeling
An extreme gradient boosting (XGBoost) classifier was implemented in R (version 4.3.2) using the XGBoost package (
T. Chen & Guestrin, 2016) within the care framework (
Kuhn, 2008). Predictors were drawn from waves 7 to 9, spanning early childhood through Grade 2. The binary outcome, “persistent high risk,” was derived from HMM-decoded states across waves 10–13. This temporal ordering ensured that all the predictors preceded the outcome and prevented information leakage.
The dataset was divided into training (70%), validation (15%), and test (15%) sets using stratified sampling at the child ID level. Because the PSKC included only one target child per family, no sibling clustering was present. To address class imbalance, the training set was upsampled prior to model selection. Oversampling was applied only to the training set, whereas the original class distributions of the validation and test sets were retained to avoid leakage. Cross-validation employed a grouped 3-fold procedure (groupKFold), using the area under the ROC AUC as the optimization criterion. This approach separates model tuning in the oversampled folds from the threshold selection in the validation set, with a final evaluation conducted on the untouched test set.
The hyperparameters were tuned using a grid search, and the tuning ranges are summarized in
Supplementary Table S7. The preprocessing included median imputation, zero-variance filtering, centering, and scaling. A decision threshold was selected for the validation set using Youden’s index and the final predictive performance was evaluated using the independent test set. Discrimination was summarized using ROC AUC (DeLong confidence intervals) and Precision–Recall Area Under the Curve (PR AUC) (bootstrap confidence intervals), computed via
pROC (
Robin et al., 2011) and
PRROC (
Grau et al., 2015), and comparative performance across classifiers (ROC AUC, F1, PR AUC) is reported in
Supplementary Table S6.
Calibration was assessed using the Brier score, Hosmer–Lemeshow test, and reliability curves. The Hosmer–Lemeshow test was performed using the ResourceSelection package (
Lele et al., 2019), and calibration binning for reliability diagrams was computed using the data table package (
Dowle & Srinivasan, 2022). The expected calibration error (ECE) and maximum calibration error (MCE) were estimated using bootstrap confidence intervals. Threshold-dependent metrics, including sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), accuracy, and F1-score, were computed for the test set.
Predictor contributions were examined using model-based feature importance (via Caret’s varImp), and SHAP values were computed using the SHAPforxgboost package (
Liu, 2020). These SHAP values provide a predictive rather than a causal interpretation of the model behavior.
3. Results
3.1. Longitudinal Trends in Measured Media Overdependence
As shown in
Table 1, across four annual waves (w10–w13), the total media-overdependence score increased from
M = 23.59 (
SD = 5.94) at w10 to
M = 27.24 (
SD = 6.52) at w13, and this change was statistically significant (
F = 1925.98,
p < 0.001).
3.2. HMM Analysis of Measured States
To characterize the latent transitions, we fitted the HMMs to two, three, and four states (
Table 2). All models converged. The three-state model provided an optimal balance between fit and parsimony (AIC = 7740.34; BIC = 7832.09). The classification entropy was 0.562, indicating moderate certainty. Because the HMM entropy reflects posterior probability dispersion rather than static accuracy, this value suggests a partial overlap across adjacent states for some children. The emission probabilities of each state are listed in
Supplementary Table S4. Additionally, bootstrap-based 95% confidence intervals for the latent state transition probabilities are reported in
Supplementary Table S5 to quantify the uncertainty in the state dynamics.
The transition matrix (
Table 3) revealed strong stability in the high-risk state (0.897). The at-risk state transitioned toward both the high-risk (0.321) and normal (0.118) states. The normal state remained stable (0.750) but demonstrated movement toward the at-risk state (0.250). These patterns suggest that high-risk status tends to persist, at-risk status exhibits bidirectional movement, and normal status generally remains unchanged. Bootstrap-based 95% confidence intervals for key transitions (e.g., Normal, At-risk, High-risk, High-risk) showed reasonably narrow ranges, supporting the stability of these estimates. The detailed classification criteria, emission probabilities, and transition probability estimates with confidence intervals are reported in
Supplementary Tables S3–S5.
It is important to distinguish between “states” and “trajectories.” The HMM identified three time-specific latent states, whereas the five trajectories reflected multiyear patterns in the decoded sequences. Thus, the trajectories represent longitudinal profiles rather than additional latent states.
3.3. HMM-Derived Developmental Trajectories
Posterior decoding was used to identify five trajectory groups (
Figure 1). The Stable Normal group accounted for 39.1% of the analytical sample, the Stable High-Risk group for 13.9%, the Gradually Increasing Risk group for 17.9%, the Risk Reduction (from high risk) group for 0.7%, and the fluctuating risk group for 28.4%. These proportions indicate that, within the present analytical sample, the largest share followed a consistently normal pattern, a smaller share maintained a consistently high-risk pattern, and the remaining groups showed increasing, decreasing, or fluctuating patterns over time (
Supplementary Table S3).
The risk reduction group should be interpreted descriptively rather than inferentially because its size was extremely small (0.7%). This rare pattern likely reflects the small number of children whose observed transitions follow a decreasing risk sequence; however, classification uncertainty and potential misclassification cannot be ruled out. Accordingly, we refrained from drawing strong theoretical or policy conclusions from this subgroup. Nonetheless, the presence of a small decreasing risk pattern underscores that the developmental trajectories of media overdependence are heterogeneous rather than strictly uniform.
A moderate entropy indicates that minor-state misassignments may have occurred. However, as the persistent high-risk group required a high-risk status for every wave, it was less sensitive to uncertainty in any single wave.
3.4. Descriptive and Bivariate Results for Machine Learning Predictors
For the machine learning analysis, the persistent high-risk outcome was defined as being classified as high risk for all four waves, corresponding to a Stable High-risk trajectory. The analytical sample included 188 participants in the high-risk ML group and 1166 participants in the non-high-risk group (
Table 4).
In the sociodemographic comparisons, the ML high-risk group contained a higher proportion of boys (62.8 percent vs. 48.7 percent, p < 0.001) and a lower proportion of mothers with college or higher education (64.4 percent vs. 74.6 percent, p = 0.004).
Persistently high-risk children showed higher levels of preschool-externalizing behaviors (p < 0.001), shorter sleep duration (p < 0.001), and poorer executive functioning in Grade 1 (p < 0.001), with an additional decline through Grade 3 (p < 0.001). School adjustment was lower (p < 0.001) and worsened over time (p = 0.037). Media engagement was higher in Grade 1 (p < 0.001) and increased over time (p = 0.002). The share of gaming in total media use was larger (p < 0.001), whereas the share of learning-oriented media use was smaller (p < 0.001).
Regarding parental and family factors, mothers reported higher parenting stress (p < 0.001), with an increasing trend over time (p < 0.001). Maternal–family interaction was lower (p < 0.001) and declined more steeply (p < 0.001). Both maternal and paternal parenting controls increased significantly in the high-risk ML group (p < 0.05).
These comparisons indicate that within the data analyzed in this study, the ML high-risk group differed from the non-high-risk group across the sociodemographic, child, and parental domains, as summarized in
Table 4.
3.5. Machine Learning Model Performance
On the independent test set, the XGBoost classifier demonstrated strong discrimination for persistent high risk (ROC AUC = 0.84, 95% CI 0.78–0.91; DeLong), which is generally considered good performance (ROC AUC ≥ 0.80) (
Table 5).
Figure 2 shows the ROC curve for the test dataset, illustrating the overall model discrimination. The details of the model configuration are listed in
Supplementary Table S7 (XGBoost hyperparameters). At the prespecified decision threshold of 0.2954, the confusion matrix comprised 23 true positives, 51 false positives, 123 true negatives, and 5 false negatives (see
Supplementary Table S8: Confusion matrix and basic classification summary). Threshold-dependent metrics at this operating point were sensitivity 0.82 (95% CI, 0.63–0.94), specificity 0.71 (95% CI, 0.63–0.77), PPV 0.31 (95% CI, 0.21–0.43), NPV 0.96 (95% CI, 0.91–0.99), F1-score 0.45 (95% CI, 0.33–0.56), and accuracy 0.72 (95% CI, 0.66–0.78) (
Table 5). Given the observed prevalence of 0.14, a PPV of 0.31 represents enrichment over the baseline at this threshold, and a NPV of 0.96 indicates a low false-negative rate among non-high-risk predictions. Calibration was acceptable by summary indices: Brier score 0.13, ECE 0.15 (95% CI, 0.11–0.19), and MCE 0.57 (95% CI, 0.44–0.82) (
Table 5). The close agreement between the cross-validated performance on the training data and the performance on the independent test set suggests that substantial overfitting is unlikely. Confidence intervals were obtained using a 2000-sample bootstrap, except for the ROC AUC, which used DeLong’s method. Under class imbalance (13.9% positives), the precision–recall (PR) curve yielded an PR AUC of 0.48 (95% CI, 0.29–0.65), illustrating stable precision–recall trade-offs across thresholds; the full PR curve is presented in
Supplementary Figure S4 (class imbalance performance).
3.6. Key Predictors Identified by the Machine Learning Model
The feature importance and SHAP analyses converged on a consistent set of predictors for the classification of persistently high risks (
Figure 3 and
Figure 4). In the XGBoost feature importance ranking, EF difficulties in Grade 1 (exf_Zc8) and their residualized change between Grades 2 and 3 (change_exf_Zc8on9) were the two most influential variables, indicating that both lower baseline EF levels and subsequent declines, adjusted for prior levels, were associated with a higher model-predicted probability of persistent high risk. Additional contributors with substantial importance included maternal parenting stress in preschool (prs_Zm7), preschool sleep duration (sleepTime_Zc7), overall media use time in Grade 1 (midUseTime_Zc8), and the proportion of gaming within the total media use in Grade 2 (midUse_game_Zc9). Other notable predictors included residualized changes in sleep duration (change_sleepTime_Zc9on78), maternal and paternal parenting controls (change_crs_ct_Zm7on8; change_crs_ct_Zf7on8), and maternal parenting stress (change_prs_Zm7on8). Lower-than-expected sleep duration or a greater-than-expected increase in controlling or stressful parenting (relative to prior levels) was linked to a higher predicted risk. Preschool family interaction (fli_Zm7) and its residualized change over time (change_fli_Zm7on9) also meaningfully contributed, whereas gender and changes in school adjustment had minimal importance in this model.
The SHAP summary (
Figure 4) mirrors these patterns. Variables related to EF exhibited the largest absolute SHAP values (≈0.47–0.48), followed by sleep, media use, gaming proportion, and maternal-stress indicators. Positive SHAP values indicated feature effects that increased the predicted probability of persistent high risk (e.g., lower EF, shorter sleep, heavier media use, and higher gaming share), whereas higher learning-oriented media use (midUse_learn_Zc9) and longer sleep duration were associated with a lower predicted risk. Taken together, these findings suggest that early EF capacity, developmental residual change, deviations in media use, and parenting stress jointly account for a substantial proportion of the model’s predictions of persistent high-risk media overdependence.
4. Discussion
This study longitudinally examined how multidimensional early childhood predictors shape the developmental trajectories of digital media overdependence across Grades 3–6. Using HMM, we identified five distinct pathways, including one subgroup that persistently remained at high risk, demonstrating that the developmental patterns of media overdependence are heterogeneous rather than uniform.
A very small risk-reduction subgroup (0.7%) was also observed. Because of its small size and susceptibility to classification uncertainty, this subgroup was interpreted descriptively rather than inferentially. Nonetheless, its presence underscores the fact that not all trajectories escalate or are stable, and a small minority of children show meaningful improvement.
Difficulties in EF at Grade 1 along with further deterioration through Grade 2 were associated with remaining in the high-risk pathway. This pattern reflects a declining self-regulatory capacity and aligns with established evidence linking EF to impulsivity, attention dysregulation, and problematic media use (
Diamond, 2013;
Fitzpatrick et al., 2025). Early EF vulnerabilities may accumulate and increase sensitivity to immediately rewarding, socially reinforced digital stimuli. These cascading processes may include sleep disruptions that weaken next-day attention and inhibitory control, thereby reinforcing persistent risk.
Sleep has also been identified as a protective factor. Longer sleep in early childhood and increased sleep duration from preschool to Grade 1 were associated with a reduced risk. This aligns with longitudinal evidence showing that early childhood sleep problems predict later emotional and behavioral difficulties (
Sivertsen et al., 2015;
Quach et al., 2018). Prebedtime screen use can disrupt sleep physiology, thereby weakening next-day regulatory capacity and reinforcing media-seeking behaviors. These findings highlight sleep as a modifiable biobehavioral target with substantial implications for prevention.
Maternal parenting stress during early childhood predicts persistent high-risk pathways. Elevated stress may increase technoference, reduce behavioral consistency, and increase children’s externalizing behaviors in bidirectional cycles (
McDaniel & Radesky, 2018). Conversely, consistent parental rules and strong family interactions served as protective factors. These results align with the evidence that parental monitoring and emotionally supportive relationships reduce problematic media use (
Gentile et al., 2014;
Lin et al., 2023). Therefore, interventions that reduce parenting stress and strengthen parent–child engagement may provide meaningful leverage points.
The proportion of game-centered media use functioned as a distinct risk factor, even after accounting for total exposure time, indicating that the structural characteristics of gaming, such as rapid reward cycles, variable reinforcement schedules, competitive ranking, and persistent progression, independently sustained overdependence risk. These dynamics may be amplified in the Korean context where gaming is deeply embedded in peer norms and social routines.
The HMM entropy indicated moderate uncertainty in state assignments, consistent with the probabilistic nature of latent-state modeling. Although machine learning analyses relied on iterbi-decoded labels, the focus on multiyear persistent patterns reduced sensitivity to wave-level misclassification. Nevertheless, developmental processes are complex and observational data cannot fully establish causality; thus, the findings should be interpreted with caution.
Methodologically, this study demonstrates the value of combining the HMM with machine learning. The HMM enables the data-driven identification of latent states without prespecifying profiles, whereas machine learning models leverage nonlinear interactions across multidomain predictors. As shown in
Supplementary Table S6, XGBoost outperformed the logistic regression and random forest models in terms of the ROC AUC, F1, and PR AUC metrics, supporting its selection as the primary predictive model. The SHAP analyses further clarified the individual and joint contributions of the predictors. As demonstrated in
Supplementary Table S9 and Figures S5 and S6, the SHAP interaction patterns reflected predictive— not-causal relationships and illustrated how feature pairs jointly influenced the model outputs.
Theoretically, these findings align with and extend the Interactional Theory of Childhood Problematic Media Use (IT-CPMU). The persistent high-risk trajectory and its early predictors reflect the theory’s emphasis on the interactions between self-regulatory vulnerabilities and environmental stressors. The identification of modifiable protective factors, such as sleep trajectories and consistent parental regulation, further reinforces the hypothesis that developmental pathways are malleable rather than fixed. Thus, the present findings empirically support the theoretical hypothesis that accumulated early self-regulatory and environmental risks contribute to the development of persistent high-risk media use pathways.
This study has some limitations. First, the measures were not designed to test the hypotheses. Second, some variables relied on parental reports, which may have been susceptible to recall or perception biases. Third, moderate state uncertainty may have contributed to the borderline classifications. Fourth, machine learning models inherit this uncertainty from the decoded labels. Finally, predictive models support associations rather than causal inferences. Future research should incorporate multiple informant reports, objective assessments, and causal inference designs to strengthen precision and validity.
5. Conclusions
This study identified modifiable early life factors associated with persistent digital media overdependence. Despite the moderate state uncertainty in the HMM, one high-risk trajectory remained stable across all four waves. EF difficulties in Grade 1 and their deterioration through Grade 2 increased the likelihood of remaining in this persistent-risk group. In contrast, sufficient early childhood sleep, an upward sleep trajectory, consistent parental rules, and strong family interactions served as protective factors. Higher maternal parenting stress, longer total media time, and a greater proportion of game-centered media use were important risk factors. Combining HMMs with machine learning enhances both the interpretability of developmental pathways and the practical value of predictive modeling.
In practice, key priorities include supporting EF and self-regulation during the early elementary years, promoting healthy sleep routines, identifying and reducing parenting stress, strengthening consistent household rules and positive family interactions, and improving parent–child literacy regarding structural risks common in game-based media. Future research should refine early indicators that differentiate acceleration from stable risk, incorporate objective usage and contextual data, and evaluate fairness and generalizability across diverse cultural settings so that predictive insights can support evidence-based practices in homes and schools.