1. Introduction
Camellia reticulata belongs to the Camellia section of the genus
Camellia within the family Theaceae. It boasts a wide variety of cultivars, characterized by large and brightly colored flowers, lush foliage, and an elegant tree form, granting it exceptionally high ornamental value [
1,
2,
3,
4]. Simultaneously, it serves as an important woody oil species in China, offering significant ecological benefits and economic value, making it an outstanding tree species with comprehensive advantages [
5]. With a cultivation history of 1500 years,
C. reticulata is primarily distributed in Yunnan, southwestern Sichuan, and western Guizhou. Notably, regions such as Chuxiong, Tengchong, and Dali in Yunnan host extensive primitive wild communities. Influenced by factors such as natural hybridization, artificial selection, and environmental conditions, these populations have developed into germplasm resources rich in genetic variation [
6,
7]. As of November 2025, according to statistics from the International Camellia Register (
https://camellia.iflora.cn/), the number of registered cultivars of
C. reticulata has reached 859.
In germplasm resource evaluation, phenotypic trait analysis serves as the most direct and straightforward method for detecting genetic variation, and plays a key role in resource identification and conservation [
8,
9]. Among these, research on the genetic diversity of fruit characteristics is particularly important, as it forms the foundation for the direct application and innovation of germplasm resources. This method has been widely applied in various economic plants such as
Diospyros kaki [
10], peras (
Pyrus L.) [
11], and cherry (
Cerasus pseudocerasus) [
12], providing a theoretical basis for their genetic improvement. Furthermore, studies indicate that significant differentiation exists in fruit phenotypic traits between cultivated varieties and wild populations: cultivated varieties typically exhibit traits that align with human utilization needs, such as increased fruit size and thinner pericarp, while wild populations tend to retain natural adaptive characteristics, including smaller fruit size and thicker pericarp [
13,
14,
15]. Most species in the genus
Camellia are cross-pollinated and commonly exhibit reproductive barriers such as low self-compatibility and “more flowers but fewer fruits,” which not only limits yield but also significantly affects their genetic structure and phenotypic variation [
16]. In oil-tea camellia breeding practice, traits such as large fruit size, thin pericarp, and high seed yield have consistently been the main target traits for cultivar selection [
17]. Currently, fruit phenotypic traits of
C. chekiangoleosa [
18,
19],
C. meiocarpa [
20], and
C. oleifera [
14] have been extensively studied. However, research on
C. reticulata has primarily focused on genetic diversity [
21], population structure [
22], floral morphology [
23], leaf characteristics [
24], cultivar selection [
25], and seed germination traits [
26,
27,
28]. There remains a relative lack of research on the patterns of fruit morphological variation and their evolutionary trends in this species, and the characteristics of its phenotypic variation are still not well understood. Therefore, this study collected fruits from nine natural populations and thirty cultivated varieties of
C. reticulata. Using methods including significance testing, correlation analysis, and principal component analysis, we systematically evaluated the patterns of phenotypic trait variation and elucidated the distinguishing characteristics between the two types of populations. The aim is to provide theoretical support for the breeding of superior varieties and the production of high-quality tea oil from this species.
2. Results
2.1. Analysis of Differences in Quantitative Phenotypic Traits of Fruits
The mean values and standard deviations of the quantitative phenotypic traits of fruits from nine populations and thirty cultivars of
C. reticulata were compared for differences (
Table S1). The results indicated significant differences (
p < 0.05) in most fruit phenotypic traits among the studied materials. Fruit weight exhibited the most pronounced variation, with mean values ranging from 28.499 to 149.068 g. The extreme values were both from different populations. For instance, the mean FW of the Fengqing I population reached 149.068 g, which was significantly higher than that of other samples, while the Yongping I population had the lowest mean FW at only 28.499 g. Significant differences were also observed in FL and FH, with mean values ranging from 41.032 to 72.929 mm and 26.654 to 60.534 mm, respectively. Among all samples, the cultivar ‘Kunming Chun’ had the largest mean FL, and the Fengqing I population had the largest mean FH. In contrast, the mean values for both FL and FH of the Yongping I population were significantly lower than those of the other samples.
Traits including PT, SW, SL, SH, and SN also showed significant variation. Their mean value ranges were 4.723–16.115 mm, 3.157–28.204 g, 12.653–21.192 mm, 15.052–24.630 mm, and 1.450–11.800 seeds, respectively. Regarding ON, the difference was relatively small, with mean values ranging from 1.350 to 4.600. The cultivar ‘Donglin’ had the highest mean ON at 4.600, which was significantly greater than other samples, whereas the cultivar ‘Duxin Shizitou’ had a significantly lower number compared to others. The results demonstrate clear differences in the nine studied phenotypic traits of fruits among the populations and cultivars of C. reticulata. FW exhibited the greatest variability, while ON showed the smallest range of variation, indicating that FW is the most variable trait, whereas the number of locules is relatively stable.
2.2. Analysis of Phenotypic Differences in Fruit Color
For the nine populations of
C. reticulata, the fruit
L* values ranged from 45.735 to 66.594,
a* values from −1.531 to 11.358, and
b* values from 27.200 to 38.465. The
h° values range from −0.611 to 1.340, and
C* values from 28.531 to 38.925. There were minor differences in pericarp color parameters among the different populations. Significant differences (
p < 0.05) in
L*,
a*,
b*,
h°, and
C* values were found only between the samples collected from Heiniu Mountain I and the other populations (
Table S2). Furthermore, the fruit color across the nine populations showed a continuous variation from yellowish-green (negative
a* value) to orange-red (positive
a* value). As the red tone (
a*) intensified, the yellow tone (
b*) also deepened correspondingly, resulting in a gradual shift in fruit color from yellowish-green to orange-red.
The fruit
L* value among the 30 cultivars of
C. reticulata ranged from 38.784 to 51.825, the
a* value from 5.763 to 13.490, the
b* value from 23.271 to 38.126, the
h° value from 1.165 to 1.384, and the
C* values from 24.454 to 39.480. Significant differences (
p < 0.05) were observed in pericarp color parameters among the cultivars (
Table S2). The lightness (
L*) of ‘Yulan Cha’ and ‘Lifang’ was significantly higher than that of other cultivars. The
a* value of ‘Xuejiao’ and ‘Duxin Shizitou’ were significantly higher, while the differences in
a* values among most cultivars were not significant. The
b* value of ‘Lifang’ was significantly elevated, whereas that of ‘Jingan Cha’ was significantly lower. The
h° value showed relatively minor variation across cultivars. Furthermore, ‘Lifang’ exhibited the highest
C* value, which was significantly greater than the others, while ‘Jingan Cha’ had the significantly lowest
C* value. The fruit color of different cultivars predominantly showed a continuous transitional characteristic from orange-red (with low
a* and
b* values) to orange-yellow (with high
a* and
b* values), typically driven by a concurrent increase in both the
a* value (indicating enhanced red tone) and the
b* value (indicating deepened yellow tone).
Based on the cluster analysis of
L*,
a*, and
b* values from the pericarps of nine populations and 30 cultivars, the fruit color phenotypes can be divided into two distinctly different groups at a Euclidean distance of 15 (
Figure 1). Group I belongs to the orange-yellow (red) color series, encompassing the fruits of 8 populations and all 30 cultivars. Group II is the yellow-green series, containing only the Heiniu Mountain I population. Comparative analysis using box plots of the
L*,
a*, and
b* values revealed significant differences in color parameters between the two-color series: as the fruit color shifts from yellow-green to orange-yellow, the lightness (
L*) shows a gradual decreasing trend. In terms of
a* value distribution, the yellow-green series has the smallest variation range. Although it partially overlaps with the distribution of the orange-yellow (red) series, the
a* value can serve as a key indicator for distinguishing the two series. Conversely, the distribution of
b* values indicates that the range for the yellow-green series is completely encompassed within the distribution range of the orange-yellow (red) series (
Figure 2).
2.3. Analysis of Diversity in Fruit Qualitative Traits Among Different Populations and Cultivated Varieties
According to
Table 1 and
Table 2, the fruit shape of
C. reticulata is primarily spherical. Among the different populations, the pericarp color is mainly of the orange-yellow (red) series (89.900%), with a minority being yellow-green color (11.100%). The seed shape is predominantly hemispherical (65.556%) and conical (30.556%), while spherical (2.778%) and reniform-like (1.111%) shapes are less common. The seed coat color is also mainly brown (96.667%), with a very small proportion being tan (3.333%) (
Figure 3). For the different cultivars, the pericarp color is primarily of the orange-yellow (red) series. The seed shape is predominantly hemispherical (71.500%), with some being conical (15.667%) and spherical (12.333%); reniform-like shapes are rare (0.500%). The seed coat color is primarily brown (98.167%), with an extremely small proportion being tan (1.833%).
2.4. Analysis of Diversity in Fruit Quantitative Traits Among Different Populations and Cultivated Varieties
The F-test was used to analyze the differences among different populations and cultivars of
C. reticulata. The results indicated that all nine phenotypic traits of the fruits exhibited highly significant differences among the nine populations and 30 cultivars (
Table 3 and
Table 4). The coefficient of variation (
CV) for the nine phenotypic traits in different populations ranged from 17.209% to 60.803%, with an average of 31.655%. The traits ranked in descending order of
CV were: SW (60.803%) > FW (49.536%) > SN (43.883%) > PT (32.642%) > FH (22.685%) > ON (21.835%) > SH (18.377%) > SL (17.924%) > FL (17.209%).
In contrast, the CV for the nine phenotypic traits in different cultivars ranged from 13.951% to 72.911%, with an average of 35.290%. The ranking was: SN (72.911%) > SW (67.243%) > FW (50.737%) > ON (35.271%) > PT (25.417%) > FH (18.614%) > FL (18.581%) > SL (14.883%) > SH (13.951%). The results demonstrate that significant variation in fruit phenotypic traits exists both among different populations and cultivars of C. reticulata. Among the populations, FL showed relative stability, whereas SW exhibited the most pronounced variation. In comparison, SH was relatively stable among cultivars, while SN displayed the greatest degree of variation.
Furthermore, the diversity indices for the nine phenotypic traits of C. reticulata fruits from different populations ranged from 1.273 to 1.735. Among these, FW and FL exhibited the highest diversity, both with an index of 1.735. FH and PT followed, each with a diversity index of 1.677. In contrast, SW showed the lowest diversity (1.273), while the ON and SN also had relatively low diversity indices of 1.279 and 1.311, respectively. For the cultivars, the diversity indices for the nine phenotypic traits ranged from 1.439 to 1.868. Specifically, FH displayed the highest diversity, with an index of 1.868. PT and FL followed closely, with diversity indices of 1.754 and 1.710, respectively. In comparison, SN showed the lowest diversity, with an index of only 1.439, while SW and SL also had relatively low diversity indices of 1.500 and 1.495, respectively.
The results of the weight calculation using the coefficient of variation method are shown in
Figure 4. Regarding the coefficient of variation weights for the nine fruit phenotypic traits, those for the nine populations of
C. reticulata different and ranked in descending order as follows: SW (21.342%) > FW (17.388%) > SN (15.403%) > PT (11.458%) > FH (7.963%) > ON (7.664%) > SH (6.450%) > SL (6.291%) > FL (6.040%). The maximum indicator weight was SW (21.342%), while the minimum was FL (6.040%). For the 30 cultivars, the coefficient of variation weights ranked in descending order as: SN (22.956%) > SW (21.172%) > FW (15.975%) > ON (11.105%) > PT (8.003%) > FH (5.861%) > FL (5.850%) > SL (4.686%) > SH (4.392%). Here, the maximum indicator weight was SN (22.956%), and the minimum was SH (4.392%). Among the different populations, SW, FW, and SN were also the traits that had a greater influence on fruit phenotypic traits. The remaining traits had a relatively smaller impact on
C. reticulata, though they still exerted a certain degree of influence on the fruit phenotypes. Among the 30 cultivars, the variability in SN, SW, and FW had a more significant influence among the nine fruit phenotypic traits.
2.5. Correlation Analysis of Fruit Phenotypic Traits Among Different Populations and Cultivated Varieties
The results of the correlation analysis (
Figure 5) indicate that there are highly significant correlations (
p < 0.05) among all fruit phenotypic traits. For the nine populations, the fruit
L* value exhibited a highly significant negative correlation with the
a* value, a highly significant positive correlation with the
b* value (r = 0.871), and the
a* value showed a highly significant negative correlation with the
b* value. Furthermore, the
L*,
a*, and
b* values either had highly significant negative correlations or showed no significant correlation with the nine quantitative phenotypic traits of the fruits. For the thirty cultivars, the fruit
L* value showed highly significant positive correlations with both the
a* and
b* values, and the
a* value also exhibited a highly significant positive correlation with the
b* value. However, the
L*,
a*, and
b* values displayed highly significant negative correlations or even no significant correlation with the nine quantitative phenotypic traits of the fruit.
Among the nine populations, FW maintained highly significant positive correlations with multiple traits, including FL (r = 0.955), FH (r = 0.879), PT (r = 0.818), SW, SL, and SH. However, no significant associations were found between FW, FL, and FH, and either SN or ON. PT, SL, and SH showed highly significant negative correlations with SN, while SW showed a highly significant positive correlation with SN (r = 0.870). Regarding ON, apart from highly significant positive correlations with SW and SN, no significant correlations were observed with other traits. Among the 30 cultivars, all nine quantitative traits of the fruits showed highly significant positive correlations with each other. Notably, FW showed particularly strong correlations with FL (r = 0.945), FH (r = 0.870), and SW (r = 0.853). In contrast, SH demonstrated a highly significant negative correlation with SN but showed no significant correlation with ON. These findings are of certain importance for understanding the fruit characteristics and their genetic basis in C. reticulata.
2.6. Principal Component Analysis of Fruit Phenotypic Traits Among Different Populations and Cultivated Varieties
The results of the principal component analysis (
Table S3) are as follows: Based on the criterion of eigenvalues greater than one, two principal components were extracted. The cumulative contribution rate reached 76.064% for the populations and 73.570% for the cultivars, indicating that these two principal components captured most of the information from the nine quantitative phenotypic traits for both the populations and cultivars. Among the different populations, the first principal component had the largest eigenvalue of 4.948. In its corresponding eigenvector, FW had the largest absolute value (0.944), contributing 54.979% to the variance. The second principal component had an eigenvalue of 1.898, with SN showing the largest absolute value (0.912) in its eigenvector, bringing the cumulative contribution rate to 76.064%. Similarly, for the thirty cultivars, the first principal component had the largest eigenvalue of 5.047, and FW had the highest absolute value (0.962) in its eigenvector, contributing 56.077%. The second principal component had an eigenvalue of 1.574, with SN exhibiting the largest absolute value (0.815) in its eigenvector, resulting in a cumulative contribution rate of 73.570%. The comparative results indicate that these two principal components collectively captured and explained most of the variation in the phenotypic traits.
Furthermore, the PCA biplot for different populations and cultivars (
Figure 6) reveals that populations are more widely dispersed, exhibiting richer phenotypic variation, while cultivars are more clustered, showing greater trait uniformity. The two groups demonstrate significant differentiation along the PC1 and PC2 axes. The loading plot indicates that the positive direction of PC1 is primarily driven by fruit size-related traits (FW, FL, FH, etc.), while the positive direction of PC2 is associated with seed number and locule number (SN, ON), and the negative direction is linked to seed size (SL, SH).
2.7. Comprehensive Evaluation of Fruit Phenotypic Traits in Different Populations and Cultivated Varieties
Through principal component analysis, the score coefficients for the nine quantitative phenotypic traits of fruits from different populations and cultivated varieties were obtained (
Table 5). Furthermore, a linear combination model based on these two principal components as parameters was constructed. Using this model, the comprehensive coefficients for the populations and cultivars across the two principal components were calculated separately (
Table 6 and
Table 7). To further evaluate the germplasm resources of
C. reticulata, this study established comprehensive evaluation models: for populations, F = (54.979F
1 + 21.086F
2)/76.065; for cultivars, F = (56.077F
1 + 17.493F
2)/73.570. These models integrate the contribution rates of each principal component for the systematic assessment of the overall performance of different populations and cultivars. The calculations from this model objectively reflect the fruit trait characteristics of different populations and cultivars, providing a scientific basis for germplasm resource evaluation and the selection of superior cultivars. In the comprehensive evaluation of the nine populations, the top three were Fengqing I (1.624), Hemu Village I (1.567), and Zixi Mountain II (1.459). In the comprehensive evaluation of the thirty cultivars, the top five were ‘Lichan’ (2.410), ‘Shizitou’ (2.148), ‘Kunming Chun’ (2.070), ‘Fengshan Cha’ (1.544), and ‘Lifang’ (1.404).
2.8. Cluster Analysis of Fruit Phenotypic Traits
Cluster analysis was performed on the nine quantitative phenotypic traits of fruits from nine populations and thirty cultivars of
C. reticulata. At a Euclidean distance of 15.0, the resources of
C. reticulata from both the nine populations and the thirty cultivars were divided into three major clusters (
Figure 7 and
Figure 8). The average coefficients of variation for these three clusters were 34.115%, 26.812%, and 18.621%, respectively, indicating rich genetic variation in the fruit phenotypes among these major groups (
Table S4).
Cluster I comprises four populations: Hemu Village II, Zixi Mountain I, Heiniu Mountain I, and Heiniu Mountain II, along with 21 cultivars including ‘Seben’, ‘Dahongpao’, ‘Manwu’, ‘Lianrui’, ‘Xuejiao’, and ‘Jinpaohong’ (
Figure 8). Compared to Clusters II and III (
Table S4), this cluster exhibits intermediate mean values for FW, FL, FH, PT, SW, SL, and SH. In contrast, the mean values for SN and ON are relatively low. This cluster has the largest average coefficient of variation at 34.115%, with SN showing the highest coefficient of variation (74.917%) and SH the lowest (14.216%). These results indicate that the fruit morphological characteristics of
C. reticulata in this cluster generally demonstrate an intermediate level of expression.
Cluster II comprises three populations: Hemu Village I, Zixi Mountain II (
Figure 8), and Fengqing I, along with nine cultivars including ‘Fengshan Cha’, ‘Shizitou’, ‘Kunming Chun’, ‘Dali Cha’, ‘Weixi Hong’, and ‘Maye Yinhong’. Compared to Clusters I and III (
Table S4), this cluster exhibits the highest mean values for FW, FL, FH, PT, SW, SL, SH, and ON, while the mean SN is at an intermediate level. The average coefficient of variation for this cluster is 26.812%, which is at an intermediate level among the clusters. Within this group, SN shows the highest coefficient of variation (57.879%), and SH the lowest (10.456%). These results indicate that the fruits and seeds of
C. reticulata in this cluster are relatively large.
Cluster III comprises only two populations of
C. reticulata: Yangbi I (
Figure 8) and Yongping I. Compared to Clusters I and II (
Table S4), this cluster exhibits relatively smaller mean values for FW, FL, FH, PT, SW, SL, and SH. In contrast, the mean SN is at its maximum, while the mean ON is at an intermediate level. The average coefficient of variation for this cluster is the smallest at 18.621%, with fruit weight showing the highest coefficient of variation (32.385%) and SH the lowest (9.158%). These results indicate that the fruits and seeds of
C. reticulata in this cluster are relatively small, yet the traits demonstrate higher stability.
2.9. Boxplot Analysis of Fruit Phenotypic Traits in Different Populations and Cultivated Varieties
The nine quantitative phenotypic traits of
C. reticulata fruits were visualized using box plots, reflecting the number, mass, and dimensions of the fruits and seeds. A comparative analysis was conducted between different populations and cultivars. The results indicate that, except for FL, SL, SH, and ON, the distribution of fruit traits in cultivars generally shifted downward compared to that in different populations. The specific distributions of all phenotypic traits are presented as follows (
Figure 9).
In terms of quantity, the distribution of SN in different populations of C. reticulata is broader than that in cultivars. Conversely, the distribution of ON in cultivars spans a wider range, completely encompassing the distribution observed in different populations. Additionally, the proportion of outliers for SN among cultivars is 4.000%. Regarding mass and dimensions, the distributions of FH, PT, SL, and SH are broader in different populations compared to cultivars. In contrast, the distributions of FL, SW, and FW are more extensive in cultivars than in different populations of C. reticulata. Additionally, the distributions of SL and SH are more concentrated (denser) in cultivars. For the traits of FW, FH, and PT, the proportion of outliers in different populations was 1.111%, respectively. In cultivars, the proportions of outliers for these three traits were lower, at 0.833%, 0.500%, and 0.167%, respectively. Among different populations, only SW exhibited outliers, with a proportion of 4.444%. For cultivars, the proportions of outliers for SL, SH, and SW were 1.000%, 0.500%, and 2.000%, respectively.
2.10. Frequency Distribution Function Analysis of Fruit Phenotypic Trait Variation in Different Populations and Cultivated Varieties
To further analyze the variation in fruit phenotypic traits among different populations and cultivars of
C. reticulata, frequency distribution functions were fitted for the nine quantitative phenotypic traits of the fruits (
Figure 10). All fruit organ phenotypic traits followed a normal distribution (R
2 = 0.673–0.999) (
Table S5). A comparison of the frequency distribution functions between the two categories, different populations, and cultivars revealed that, except for the power distribution functions of FW, FH, and ON, which were relatively concentrated, the power function distributions of the remaining six traits were more dispersed. This further indicates substantial variability in the fruit phenotypic traits between populations and cultivars.
Regarding mass traits, SW exhibited both vertical and horizontal shifts in its distribution, indicating greater variability in this trait. In contrast, the power distribution function of FW showed no significant shift, suggesting relatively lower variability. For size traits, FL, PT, and SH displayed vertical and horizontal shifts in their distributions, indicating greater variability in these traits. Conversely, the fruit shape index (FL to FH ratio) showed no significant horizontal shift, and the power distribution function of FH also lacked significant shifts, suggesting lower variability in these two traits. In terms of number traits, the normal distribution function of SN in populations was right-skewed compared to that in cultivars, indicating greater variability in populations. On the other hand, the power distribution function of ON showed no significant shift, suggesting lower variability in this trait.
3. Discussion
Phenotypic diversity, as a direct manifestation of genetic diversity, serves as a crucial entry point for analyzing the genetic structure, adaptation mechanisms, and evolutionary potential of species [
29]. Methods such as correlation, clustering, and principal component analysis can systematically reveal the genetic characteristics and variation patterns of germplasm resources [
30]. In cross-pollinated and often self-incompatible species like oil-tea camellia, the genetic origin of different traits significantly influences their variation patterns. Maternal tissue traits, such as pericarp characteristics, fully reflect the maternal genetic background, whereas seed traits integrate the nuclear genetic contributions from both parents, theoretically possessing greater potential for genetic variation [
31,
32]. Therefore, an in-depth exploration of the phenotypic characteristics of
C. reticulata fruits will facilitate the efficient screening of germplasm resources with superior traits.
This study conducted an integrated analysis of 13 fruit phenotypic traits from nine wild populations and 30 cultivated varieties of
C. reticulata. Both qualitative traits (fruit shape, pericarp color, seed shape, seed coat color) and quantitative traits (fruit size, seed size, etc.) were incorporated into a unified evaluation framework to comprehensively present the genetic variation pattern of fruit phenotypes. Regarding qualitative traits, the fruit shape was predominantly spherical. The pericarp color primarily fell within an orange-yellow (red) series (found in eight populations and all 30 cultivars), with the sole exception of the Heiniu Mountain I population, which exhibits a yellow-green series. This yellow-green series displayed distinct distribution characteristics in terms of its
a* value, making it a potential key indicator for cultivar identification. Considering the influence of the environment on phenotypes, this color differentiation may stem from geographical adaptation or genetic divergence among different populations [
33]. It also reflects the genetic conservation and differentiation trend of the pericarp, as a maternal trait, between wild and cultivated groups. Seed shape was mainly hemispherical (65.6%) and conical (30.6%) within the wild populations, while hemispherical seeds were even more predominant in cultivated varieties (71.5%). Seed coat color was overwhelmingly brown in both groups (96.7% in populations, 98.2% in varieties). Diversity analysis of quantitative traits revealed significant variation in the phenotypic traits of
C. reticulata. The Shannon diversity index (
H′) ranged from 1.270 to 1.870. Compared to existing studies, this range is lower than the average diversity index (
H′ = 1.8850) reported for seed and fruit traits of
C. chekiangoleosa [
18], and significantly lower than the diversity levels reported for
C. vietnamensis (
H′ = 2.7499) [
34] and for
C. meiocarpa (
H′ = 2.8160) [
35]. This discrepancy may reflect inherent differences in genetic diversity among species and simultaneously suggests considerable potential for enhancing the conservation and utilization of genetic resources for
C. reticulata.
To analyze the effects of natural evolution and artificial selection on phenotypic traits, populations and cultivated varieties were processed separately to systematically evaluate the diversity composition and breeding potential of
C. reticulata. The coefficient of variation, serving as a key metric for measuring trait dispersion, can intuitively reflect the fluctuation characteristics of phenotypic traits [
36]. Analysis of the coefficient of variation revealed extensive variation across the nine quantitative traits of
C. reticulata (13.950% ~ 72.910%). The average coefficient of variation for cultivated varieties (35.290%) was slightly higher than that for populations (31.655%), yet both were significantly lower than that reported for wild
C. oleifera (88.630%) [
37]. Among populations, SW exhibited the most pronounced variation (60.800%), consistent with findings reported by He Yimin, Jin Gaozhong et al. [
38,
39]. Among cultivated varieties, SN showed the highest coefficient of variation (72.910%), aligning with results from studies on
C. meiocarpa in Guizhou [
40]. Furthermore, a negative correlation was observed between diversity indices and coefficients of variation across different populations and varieties of
C. reticulata. This pattern is consistent with findings in plants such as
Nymphaea [
41] and
Machilus gamblei [
42], suggesting that this phenomenon may represent a common characteristic of phenotypic diversity in plants.
Correlation analysis of fruit phenotypic traits provides an important theoretical foundation for cultivar improvement and biotechnological research [
43]. The correlation analysis revealed highly significant positive correlations among most traits in
C. reticulata. FW showed highly significant positive correlations with key morphological indicators. In populations, FW was highly correlated with FL (r = 0.955), FH (r = 0.879), and PT (r = 0.818). In cultivated varieties, besides strong correlations with FL (r = 0.945) and FH (r = 0.870), it also exhibited a strong correlation with SW (r = 0.853). Notably, in both populations and varieties, all metrics reflecting fruit size and seed size demonstrated highly significant positive correlations. This finding aligns with results from studies on
Olea europaea [
44], providing new evidence for understanding the co-evolutionary mechanisms of fruit and seed traits in woody oil plants.
Principal component analysis (PCA) aims to reduce the dimensionality of morphological trait information while minimizing the loss of original data, thereby simplifying trait classification and focusing on core elements [
45]. The PCA revealed that PC1 primarily represents variation in traits related to fruit size, while PC2 mainly reflects variation in reproductive allocation traits associated with seed number. Together, they account for the majority of phenotypic differences among the groups. This finding further supports prioritizing fruit weight and seed number as key trait dimensions in subsequent genetic analysis and breeding efforts. Cluster analysis is capable of revealing the genetic structure and variation patterns of plant germplasm resources, providing crucial theoretical support for plant breeding [
46]. The cluster analysis further classified the materials into three groups: Group I exhibits superior comprehensive traits and is suitable as high-quality parental material; Group II demonstrates prominent ornamental value and adaptability, making it suitable for specialized breeding; Group III shows stable phenotypes and is applicable for specific research or applications.
Comprehensive evaluation of plant germplasm resources is a core component of breeding research and holds significant guiding value for cultivar selection [
47]. The trait-based comprehensive evaluation indicated that the cultivated variety ‘Lichan’ achieved the highest score (F = 2.410), significantly outperforming the wild population Fengqing I (F = 1.624), thus positioning it as a core parental line for genetic improvement. Boxplot analysis identified outlier individuals for specific traits, whose variation may stem from gene mutations, specific ecological environmental factors, genetic recombination, or other genetic mechanisms. These findings provide a reference for selecting new germplasm. Frequency distribution analysis demonstrated that all traits across different populations and varieties conformed to a normal distribution (R
2 = 0.673~0.999), further confirming the continuity and analyzability of phenotypic variation. Through a systematic analysis of phenotypic diversity in wild and cultivated populations of
C. reticulata, this study clarified their variation characteristics, key traits, and inter-trait correlations. The findings provide a theoretical foundation for the subsequent utilization of germplasm resources, genetic improvement, and application within the tea oil industry.