1. Introduction
Mental health and personality development issues continue to receive attention in contemporary psychology. The distribution of psychological symptoms across different life stages, age-related changes in personality dimensions, and inter-individual differences in emotional externalisation provide theoretical support for constructing mental health risk prediction models [
1].
This study constructs an analytical framework based on three core theoretical perspectives. First, the Five-Factor Model (FFM) provides a stable operational foundation for measuring personality dimensions. Systematically developed by Costa and McCrae [
2,
3], this theory categorises personality traits into five dimensions: neuroticism, extraversion, openness, agreeableness, and conscientiousness. These dimensions demonstrate cross-cultural consistency and have been shown to predict important outcomes, including mental health status, social adaptability, and life satisfaction. For the present cross-age study, the Big Five Inventory-2 (BFI-2) was selected as the operationalization of the FFM. The BFI-2 is a 60-item revised version of the original BFI, developed by Soto and John [
4], featuring improved item coverage, higher structural validity, and stronger discriminant validity across facets than its predecessor. Critically, the BFI-2 has demonstrated a stable factor structure and acceptable reliability across diverse age groups, from adolescents to older adults [
5], making it well suited for lifespan comparative designs such as the present study.
Second, the theoretical foundation for emotional expression derives from Ekman’s Basic Emotions Theory [
6,
7,
8]. Through cross-cultural research, Ekman established the existence of universal basic emotions (happiness, sadness, anger, fear, surprise, and disgust) that are expressed through specific facial expressions and are consistently recognised across cultures. This theory provides a theoretical basis for objectively measuring emotional externalisation patterns using facial expression videos, enabling the study to transcend traditional self-report methods and capture authentic individual differences in emotional expression. Accordingly, the present study targets facial emotional expression behaviour as its measurement construct, treating habitual expression patterns as behavioural phenotypes rather than as direct indicators of subjective felt emotion. This distinction is maintained throughout the manuscript.
The Lifespan Development Perspective serves as the foundation of this study [
9,
10,
11]. Proposed by Baltes and colleagues, this theory emphasises that development is a lifelong process in which personality traits, emotion regulation abilities, and mental health exhibit dynamic trajectories across different ages. The perspective posits that development is multidimensional, multidirectional, and plastic; individuals face distinct developmental tasks at various life stages, and their personality characteristics and psychological functions continuously evolve through interactions among environmental, physiological, and social factors. This theoretical framework provides a rationale for this study’s design, spanning four age groups from adolescence to old age. It establishes a foundation for understanding age-related differences in the relationships among personality, emotion, and mental health.
From a lifespan development perspective, psychological symptom expression varies across age groups. Research has demonstrated that age of onset moderates subsequent symptom manifestations in major depressive disorder [
12], while digital media use duration in adolescents correlates with mental symptom severity [
13]. Normative studies using the SCL-90 scale have revealed temporal evolution in mental health status [
14], with different age groups exhibiting distinct psychological problem profiles [
15]. The SCL-90 has established Chinese normative data covering populations from early adolescence through late adulthood [
16,
17], and confirmatory factor analyses have supported the stability of its nine-factor structure in both clinical and community adolescent samples [
18], as well as in large college-age samples [
19]. Among older adults, the scale has demonstrated acceptable reliability. However, prior research suggests that the expression of psychological distress may differ somewhat from younger groups [
20], a limitation acknowledged in the present study. Given the absence of a single cross-age measurement-invariance study spanning all four age groups, the SCL-90 was used as a uniform screening tool to enable cross-group comparisons of symptom severity rather than as a clinical diagnostic instrument, and group-level findings should be interpreted accordingly. Life stressors are considered important sources of psychological distress and somatization reactions [
21], while individuals employing avoidance coping strategies often experience adverse health outcomes [
22].
Personality dimensions change dynamically throughout the lifespan rather than remaining static [
23]. The Big Five personality model, the most widely applied theoretical framework in personality psychology [
24], provides an operational basis for exploring personality–health relationships—individuals with higher conscientiousness tend to exhibit more positive health behaviours. In contrast, those with elevated neuroticism often experience negative emotions and maladaptive behavioural tendencies. Recently, neural network technology has been applied to predict personality traits [
25]. In older adults, personality dimensions mediate the relationship between childhood trauma and depressive symptoms [
26] and are associated with cardiovascular disease risk [
27].
Emotional externalisation characteristics constitute an important observational dimension in psychological symptom and personality research. Multiple studies have employed cluster analysis to explore heterogeneous structures within psychological symptoms, effectively identifying distinct depression subtypes [
28]. Machine learning algorithms have demonstrated potential for distinguishing depression and anxiety symptom expression patterns [
29], while similar computational techniques have quantified and analysed the effects of individual difference variables on treatment response [
30].
Mental health risk assessment has emerged as a research focus in recent years. The existing literature indicates that social support levels correlate negatively with psychological problem severity [
31], while psychological resilience serves a protective function in stressful situations [
32]. Risk identification research has expanded from general populations to high-risk clinical groups [
33], and machine learning technology has shown preliminary promise for psychosomatic disease classification and diagnostic support [
34].
However, several gaps remain in the existing literature. First, most studies examine psychological symptoms, personality traits, or emotional expression in isolation, without simultaneously modelling their interrelationships across age groups [
35]. Second, while cluster analysis has been applied to identify depression subtypes [
28], its application to emotional expression patterns as a basis for mental health risk stratification remains underexplored. Third, existing risk prediction models based on personality traits have largely been developed in single age-group samples—predominantly young adults—limiting their generalizability across the lifespan [
36]. Fourth, the SCL-90 has been used as a clinical diagnostic criterion in prior studies, whereas its application as a community screening tool across age-diverse samples warrants systematic examination. These gaps collectively point to the need for a multidimensional, cross-age analytical framework that integrates personality, emotional expression, and psychological symptom data for mental health risk screening.
Addressing these research gaps, this study simultaneously incorporates age, psychological symptoms, personality dimensions, and emotional externalisation characteristics into an analytical framework to construct a multidimensional mental health risk prediction model. Specifically, this study examines the dynamic evolution of psychological symptoms and personality dimensions across age stages, identifies potential risk subgroups based on emotional externalisation patterns, and establishes a risk prediction model to improve classification accuracy, thereby providing empirical evidence for early detection and intervention for mental health problems.
Building on the theoretical framework outlined above and the identified gaps in the literature, three directional hypotheses were formulated: (1) young adults would exhibit the highest levels of psychological symptoms, particularly depression, while conscientiousness would increase and neuroticism would decrease with age; (2) the subgroup characterised by negative emotional expressions (anger, disgust, and fear) would show significantly higher neuroticism and psychological symptom severity than subgroups characterised by positive or neutral expressions; (3) neuroticism would be the strongest positive predictor of mental health risk, while extraversion and conscientiousness would serve as significant negative predictors.
4. Discussion
The relationship between personality dimensions, emotional externalisation, and mental health constitutes the core focus of this study, with particular attention to the moderating role of age. Neuroticism scores exhibited a statistically significant decreasing trend across age groups (η
2 = 0.11), with higher scores in adolescent and young adult groups and significantly lower scores in middle-aged and elderly groups. This trajectory suggests that emotional stability and psychological resilience may increase with age [
44]. The age effect on conscientiousness was the most prominent (η
2 = 0.24). Adolescent participants scored lowest on this dimension, with scores gradually increasing across age groups and reaching peak levels in the elderly group. This pattern aligns with the continuous reinforcement of self-discipline, organisation, and goal-directed behaviour throughout adulthood [
45].
Notably, young adults aged 19–35 scored highest across all groups on depression, anxiety, and interpersonal sensitivity dimensions. Existing literature identifies this developmental period as a high-incidence stage for psychological distress [
46]. Socioemotional selectivity theory provides a possible explanatory framework for improved emotional well-being in older adults, suggesting that individuals increasingly optimise their emotional regulation and expression as they age, actively focusing on positive emotions while avoiding negative stimuli [
47].
The sample spanned ages 10–77, with symptom scores across age groups showing characteristics consistent with developmental stages. The young adult group (19–35 years) exhibited the highest depression (M = 1.73) and anxiety (M = 1.61) scores, consistent with stress associated with developmental tasks, including academic achievement, career establishment, and intimate relationship formation. The elderly group’s somatisation score (M = 1.37) paralleled that of the young adult group, potentially reflecting ageing-related chronic physical symptoms. The adolescent group showed relatively elevated interpersonal sensitivity (M = 1.62) and hostility (M = 1.42), consistent with peer relationship sensitivity characteristic of this developmental period [
48]. Total mean scores across groups exhibited similar ranges (1.32–1.57), indicating stable SCL-90 measurement performance across this age-diverse sample. It should be noted that the young adult group comprised only 17 participants; all findings specific to this group, including the elevated depression and anxiety scores, should be treated as directional rather than definitive and require replication in larger young adult samples before broader conclusions can be drawn. Future research should further examine age as a moderating variable or conduct controlled verification using age-specific assessment tools for adolescents aged 10–18. Extraversion and openness showed marginal but non-significant effects (H(3) = 7.65,
p = 0.054, η
2 = 0.05). Given that η
2 = 0.05 corresponds to a small-to-medium effect, these results deserve brief substantive comment. The lower extraversion score in the young adult group (M = 3.08) relative to the remaining three groups may reflect role-related pressures specific to early adulthood, with scores recovering thereafter. The gradual decline in openness from adolescence to older adulthood is directionally consistent with normative personality development research [
1]. Both non-significant results are likely attributable to the small young adult subgroup (
n = 17) rather than a true absence of age-related change, and replication with larger samples is needed.
Cluster analysis revealed three distinct emotional externalisation patterns, each with statistically significant associations with the Big Five personality dimensions. It should be emphasised that the CNN-derived probability scores represent observable facial expression behaviour rather than direct measurements of internal emotional experience. The cluster labels describe habitual expression patterns as behavioural phenotypes and should not be interpreted as indicators of subjectively felt emotion. Accordingly, the associations reported below are between facial expression patterns and personality or symptom profiles, not between felt emotion and psychopathology. Importantly, no significant differences emerged among clusters regarding gender (p = 0.525) or age (p = 0.193), suggesting that the identified emotional externalisation patterns are not attributable to demographic confounds, though the cross-sectional design does not permit inference about the temporal stability or trait-like nature of these patterns.
Cluster 1, characterised predominantly by positive emotions, showed higher scores for extraversion, agreeableness, and conscientiousness, along with relatively lower neuroticism. Cluster 2, dominated by negative emotions, showed significantly elevated neuroticism, reduced extraversion and agreeableness, and heightened anxiety, depression, and mood instability in the present sample. Cluster 3, characterised primarily by neutral, sad, and surprised expressions, occupied an intermediate position but outperformed Cluster 2 on energy and motivation indices. It should be noted that the extraversion subscale showed lower internal consistency in the present sample ( = 0.600); the extraversion differences observed across clusters may therefore partly reflect measurement imprecision and should be interpreted with caution.
The present data indicated an association between emotional externalisation patterns and mental health status. Cluster 2 participants scored higher than other groups across multiple SCL-90 dimensions, including interpersonal sensitivity, depression, anxiety, hostility, paranoid ideation, and psychoticism, suggesting that habitual negative emotional expression was associated with elevated risk for multiple psychological symptoms [
49]. It is worth emphasising that Cluster 2 is characterised not by sadness but by a co-occurring pattern of anger, disgust, and fear. The association between this specific combination and elevated depression and anxiety scores is not self-evidently circular, as anger and disgust are not definitional components of depression in standard diagnostic criteria. The contribution of the present clustering analysis lies in identifying this multivariate behavioural phenotype and documenting its co-occurrence with elevated neuroticism, reduced extraversion and agreeableness, and higher scores across multiple SCL-90 dimensions within this sample. Conversely, Cluster 1 participants exhibited lower overall psychological symptom levels, consistent with existing research demonstrating an association between positive emotions and lower mental health symptom levels [
50]. These cluster-based association, however, were derived from a single convenience sample from one geographic region and have not been validated in independent samples. The generalizability of this three-cluster typology to other populations or cultural contexts cannot be assumed from the present data alone.
The mental health risk screening system constructed in this study stratified participants into three risk levels: low-risk (55.6%), medium-risk (17.9%), and high-risk (26.5%). In distinguishing low-risk from medium- and high-risk groups, personality dimensions showed notable discriminative power within this sample. Neuroticism exhibited the most prominent effect (d = 0.81, large effect size), while extraversion (d = 0.56) and agreeableness (d = 0.54) both showed moderate protective effects, with conscientiousness demonstrating a moderate protective impact (d = 0.44). The logistic regression model achieved acceptable-to-good classification performance (AUC = 0.742, 95% CI: 0.663–0.821), correctly ranking randomly selected pairs of high- and low-risk individuals in 74% of cases. Regarding predictor weights, neuroticism was the strongest predictor of risk in this sample (OR = 4.58), while extraversion (OR = 0.41) and conscientiousness (OR = 0.57) were associated with lower risk. These findings provide a preliminary empirical basis for early identification; however, the AUC of 0.742 is an internal estimate, no external validation dataset was available, and the 10-fold cross-validation mean AUC (0.708) should be regarded as the more conservative performance indicator. The model’s discriminative ability in independent samples from other regions, as well as the predictive weights of individual personality dimensions across different socioeconomic and cultural contexts, remains to be established.
These clustering results hold potential application value across multiple mental health service domains. However, the following implications are preliminary and contingent on independent validation of the CNN-derived emotion features in future studies. First, regarding screening and stratification, the three emotional externalisation patterns can serve as preliminary screening indicators of mental health risk within routine psychological assessment protocols. Specifically, individuals classified into Cluster 2 (dominated by anger, disgust, and fear) showed elevated symptom burden in the present sample, suggesting this pattern may warrant closer attention in screening contexts. Whether this association is sufficiently stable and generalisable to support routine clinical flagging, however, requires prospective validation in independent samples before any applied use. Compared with traditional self-report scales, facial expression video-based emotion recognition offers advantages, including non-invasiveness and reduced social desirability bias, proving particularly suitable for populations with limited self-report capabilities (e.g., adolescents or cognitively impaired elderly individuals).
Second, regarding differentiated intervention design, cluster results inform personalised intervention planning. For Cluster 2 individuals, their personality profile—high neuroticism, low extraversion, and low agreeableness—suggests that interventions centred on emotion regulation skills training may be worth prioritising. However, this hypothesis requires prospective testing, including cognitive reappraisal strategies and mindfulness-based stress reduction [
51]. For Cluster 3 individuals (dominated by neutral and sad emotions), whose symptom levels fall between Clusters 1 and 2, preventive interventions emphasising social skills training and positive emotion activation may be associated with lower risk progression, pending longitudinal validation. For Cluster 1 individuals (predominantly positive emotions) who exhibit favourable mental health, the focus should be on maintaining and enhancing psychological resilience by strengthening existing protective factors.
Third, concerning longitudinal monitoring, emotion clustering can serve as a baseline marker for dynamic tracking. Migration of an individual’s emotional externalisation pattern from Cluster 1 or 3 to Cluster 2 across time points may co-occur with deteriorating mental health; whether such migration temporally precedes symptom worsening remains to be established in longitudinal studies. This dynamic monitoring approach, based on changes in emotional externalisation patterns, aligns with contemporary mental health services’ increasing emphasis on prevention and personalisation.
Finally, this study’s integrated analysis of emotion clustering with personality dimensions and SCL-90 symptom data demonstrates that emotional externalisation patterns represent not isolated behavioural manifestations but rather systematic correlates of personality structure and psychological symptom profiles. This finding provides a preliminary empirical foundation for constructing a three-dimensional “emotional expression–personality traits–psychological symptoms” joint assessment framework. Future research should explore the feasibility of embedding emotion-clustering labels into multimodal mental health risk prediction models to enhance classification accuracy.
The most consequential limitation concerns external validity. First, all participants were recruited through convenience sampling within the Hefei metropolitan area; findings should not be generalised to broader Chinese or international populations without independent replication. Second, the young adult subgroup (
n = 17) was substantially smaller than the other three groups; all results specific to this age group should be regarded as preliminary. Third, the three-cluster emotional expression typology was both derived and evaluated within the same sample and has not been externally validated; its stability across different populations remains unknown. Fourth, the logistic regression risk model lacks external validation, and the AUC of 0.742 reflects internal fit only, which is likely to overestimate true discriminative performance in new samples. Fifth, the CNN-based emotion inference pipeline was not validated on a held-out subset of the present sample; therefore, the absolute accuracy of the derived emotion scores in this community population remains unknown. Per-participant averaging across 52,500–75,000 frames and frame-level ICC values of 0.287–0.589 provide partial assurance that the aggregated scores reflect stable individual differences, and the cluster-based conclusions rest on relative distributional patterns rather than the accuracy of any individual classification. Independent external validation remains a priority for future work. This study exhibits several design and implementation limitations. The cross-sectional design precludes causal inference about the relationships among personality dimensions, emotional externalisation, and psychological symptoms. In particular, whether negative expression patterns precede symptom onset or are consequences of existing distress cannot be determined from the present data, and this constitutes a priority question for future longitudinal research. Elucidating the temporal dynamics and interaction mechanisms of these variables requires longitudinal designs [
51]. Additionally, all measures were collected at a single time point, whereas both psychological symptoms and emotional expression are known to vary over time and across situational contexts. The extent to which the identified patterns reflect stable individual characteristics rather than state-dependent responses warrants careful consideration. Frame-level ICC analysis provided partial empirical support: ICC values ranged from 0.287 (fear) to 0.589 (anger), indicating that a moderate proportion of frame-level variance is attributable to stable between-person differences [
42]. Averaging probability scores across all frames of a 35–50 min interview further attenuates momentary noise, aligning with the operationalisation of affective traits as temporally aggregated expression tendencies [
52]. Nevertheless, the relatively lower ICC for fear (0.287) suggests that fear-related cluster differences should be interpreted with particular caution, and longitudinal assessment across multiple occasions would be needed to establish the test–retest stability of the identified expression patterns directly.
The sample size of 151 participants and the severely unequal age group distribution (young adult subgroup
n = 17) constitute a significant limitation. No a priori power analysis was conducted before data collection. Post hoc sensitivity analysis indicated that with
N = 151, four groups, α = 0.05, and power = 0.80, the study was adequately powered to detect effects of η
2 ≥ 0.069 for age-group comparisons; both primary findings (neuroticism η
2 = 0.11; conscientiousness η
2 = 0.24) exceeded this threshold. For the logistic regression model, the minimum detectable effect was f
2 = 0.088, well below the observed f
2 = 0.582 (Nagelkerke R
2 = 0.368). However, statistical power was insufficient to detect small effects reliably, and findings for dimensions with smaller effect sizes should be interpreted with caution. Future studies should conduct a priori power analyses and recruit 30–40 participants per age group. Although the down-sampling sensitivity analysis confirmed the robustness of the two primary findings, the small subgroup size reduces power for dimensions with smaller effect sizes. The lower internal consistency of the extraversion and openness subscales (=0.600 and 0.619, respectively) may have attenuated effect size estimates for these two dimensions. Findings involving extraversion and openness should therefore be interpreted with caution, and future studies should consider alternative scoring approaches or report facet-level reliability to better characterise the psychometric properties of these dimensions in Chinese samples. Socioeconomic and educational data were not systematically collected in the present study; occupational categories served as a partial proxy for SES and are reported in
Table 1. The absence of formal education-level data is acknowledged as a limitation, as educational attainment may independently influence both personality trait expression and psychological symptom severity. Future studies should incorporate standardised measures of education and SES to enable covariate adjustment and more rigorous assessment of socioeconomic confounding. Incorporating multicenter data would further enhance generalizability [
53]. Regarding the cluster analysis specifically, while the three-cluster solution demonstrated perfect reproducibility across 100 random initialisations (ARI = 1.000), the generalizability of the identified emotional expression patterns to broader populations warrants caution, and replication in larger multicenter samples is recommended.
The study employed only the SCL-90 self-report scale for symptom screening, lacking validation from gold standard diagnostic tools such as structured clinical interviews (e.g., SCID), precluding distinction between clinically diagnosed cases and subclinical symptom presentations. Self-report scales remain susceptible to social desirability bias and self-awareness limitations, potentially overestimating or underestimating true mental health status. Additionally, SCL-90 emphasises symptom severity assessment without adequately examining functional impairment—a critical clinical diagnostic criterion. More specifically, the following subsequent steps are suggested. First, a pre-registered longitudinal study following the same participants for at least two time points could directly assess whether emotional expression cluster membership predicts subsequent changes in psychological symptoms. Second, future studies should use structured clinical interviews such as the Structured Clinical Interview for DSM Disorders, SCID, in conjunction with SCL-90, to establish the convergent validity between the risk classification system and the gold standard diagnostic criteria. Third, multimodal physiological measures, including heart rate variability, skin conductance, and EEG, should be integrated to provide objective, non-self-reported indicators of emotional state and complement facial expression data [
54]. The application of adult-normed instruments (SCL-90 and BFI-2) to participants aged 10–12 years (
n = 10) represents an additional methodological concern. Constructs such as psychoticism and aesthetic sensitivity have limited developmental validity at this age, and reliance on parental supervision for item comprehension may have introduced response bias. Although the sensitivity analyses described in
Section 2.2 confirmed that excluding these participants did not materially alter the primary findings (Big Five age-group comparisons: neuroticism H = 19.582,
p < 0.001; conscientiousness H = 37.615,
p < 0.001; logistic regression AUC = 0.724, 95% CI: 0.639–0.808), future studies should employ age-appropriate instruments for participants below age 13.
The emotional externalisation assessment relied primarily on facial expression analysis. Furthermore, although standardised recording conditions were maintained, the structured interview setting may not fully capture spontaneous emotional expressions in naturalistic contexts, potentially limiting the ecological validity of the emotion recognition results. Three factors limit the external validity of the present findings. First, the sample was recruited from a single province in eastern China, and cultural norms surrounding emotional expression and help-seeking behaviour may differ substantially across regions and countries, warranting caution in generalising findings to non-Chinese populations. Second, the age distribution was unequal across groups, with the young adult subgroup substantially underrepresented (n = 17); findings for this age group should therefore be interpreted with particular caution. Third, the CNN emotion recognition model was pre-trained on AffectNet and fine-tuned on Chinese facial images; its recognition accuracy for other ethnic groups has not been validated, and cross-ethnic application of the emotion clustering results requires independent replication. A conceptual limitation of the present study is that facial expression recognition captures expressed emotion rather than felt emotion. Facial expressions can be voluntarily masked or faked, and the two need not correspond; a point well illustrated in clinical contexts where individuals may maintain positive facial affect despite underlying distress. The present study, therefore, characterises habitual facial expression patterns as behavioural phenotypes in their own right rather than as proxies for internal emotional states. Future research incorporating self-reported momentary affect alongside facial expression data would allow direct examination of the expression–experience correspondence within this population. A further limitation concerns the scope and nature of emotions assessed. Basic emotions represent prototypical high-intensity affective states and may not adequately capture the full range of everyday emotional experience. More specifically, the seven basic emotions captured by the CNN model do not include self-conscious emotions such as guilt, shame, and helplessness, which are theoretically more proximal to the aetiology of depression and cannot currently be captured by automated facial recognition systems. The associations observed between Cluster 2 and psychological symptoms should therefore be interpreted as concurrent correlates rather than etiological indicators. Future studies should consider complementing categorical emotion recognition with dimensional approaches (e.g., valence–arousal circumplex models) to provide a more comprehensive characterisation of affective expression in naturalistic settings. The use of frame-level mean probability scores as clustering inputs represents a further methodological limitation. Although ICC analysis indicated moderate frame-level stability for the primary clustering emotions (anger ICC = 0.589, disgust ICC = 0.458, neutral ICC = 0.448, sad ICC = 0.431), the arithmetic mean cannot distinguish brief, intense emotional episodes from sustained, mild ones. Future studies should extract the standard deviation and the proportion of time spent in the dominant emotion as complementary clustering features to better capture emotional variability and state duration.
Evaluating the effectiveness of mental health interventions constitutes another critical future research direction. The mechanisms by which interventions such as emotion regulation training and cognitive-behavioural therapy influence emotional externalisation patterns and mental health warrant further investigation [
55]. This research would provide an empirical foundation for transitioning mental health services from reactive to preventive models and advancing the development of personalised interventions.