Skip to Content
PlantsPlants
  • Article
  • Open Access

21 January 2026

Phenotypic and Agromorphological Diversity Reveals Detailed Information About Accessions with Productive Potential for Cotton (Gossypium barbadense L.) in Northeastern Peru

,
and
1
Dirección de Recursos Genéticos y Biotecnología, Estación Experimental Agraria El Porvenir, Instituto Nacional de Innovación Agraria (INIA), Carretera Marginal Sur Fernando Belaúnde Terry KM 13.5, Tarapoto 22400, Peru
2
Center of Biotechnology and Genetics, Department of Biological Sciences, Universidade Estadual de Santa Cruz, Rodovia Jorge Amado km 16, Ilheus 45662-900, Brazil
*
Authors to whom correspondence should be addressed.

Abstract

Cotton (Gossypium barbadense L.) is a crop of great economic importance, as the superior quality of its fiber is highly valued worldwide. The objective of this research was to evaluate the agromorphological diversity of cotton germplasm using both qualitative and quantitative descriptors. A combination of univariate and multivariate statistical analyses was applied to identify promising accessions. The coefficients of variation (GCV and PCV) and high heritability estimates observed for descriptors such as the number of sympodial branches, fiber weight, and number of seeds per capsule confirm genetic control, ensuring the effectiveness of selection in future breeding programs. Correlation analysis revealed a strong positive relationship between lint yield and number of bolls per plant (0.893). Furthermore, principal component analysis indicated that accessions PER1010536, PER1010538, PER1010543, and PER1010547 were associated with high yield and early-maturity traits. Furthermore, multiple correspondence analysis and mixed data factor analysis demonstrated that the observed variability also depends on qualitative traits such as petal spot and bract color, supporting the concept of a complex genetic architecture. These findings provide a solid basis for the development of new cotton cultivars with improved productivity.

1. Introduction

Cotton is a major commercial crop of great economic importance, providing the textile industry with the largest source of natural, versatile, and renewable fiber. In addition, the seed is a significant source of vegetable oil and protein, providing essential nutrients for both human and animal diets [1,2,3]. This crop belongs to the genus Gossypium of the Malvaceae family and includes both diploid and tetraploid species. Among the latter, two species account for more than 97% of global fiber production: G. hirsutum L. (2n = 4x = 52) and G. barbadense L. (2n = 4x = 52) [4,5].
G. barbadense is a tropical perennial plant native to the Peruvian coast [6], with evidence of its use in textiles dating back thousands of years in pre-Incan cultures. Currently, its fiber is valued for its excellent quality—extra-long, strong, soft, lustrous, and fine—giving producers of specialty cotton a competitive advantage [7,8,9]. Likewise, native cotton, which has been preserved in the Peruvian Amazon through the sociocultural practices of local communities, has enabled the conservation of naturally colored cotton [10,11]. This represents an ecological and low-cost production alternative for the textile industry [12]. Native cotton is cultivated in northern Peru, particularly in the regions of Lambayeque, Piura, and San Martín [13].
The remarkable characteristics of G. barbadense are offset by low yield and the challenges posed by climate change. Furthermore, although the study by [3] demonstrated phenotypic variability in Gossypium spp., it also highlights the limited number of morphological characterization studies of native cotton, G. barbadense. In this context, it is essential to conduct an agromorphological characterization of the accessions in the cotton germplasm collection maintained by the Germplasm Bank of the National Institute of Agrarian Innovation (INIA), to determine their phenotypic and genotypic variability. Thus, 62 descriptors were evaluated to identify the categorical descriptors and their usefulness in distinguishing accessions with higher yield [14], which could be used in future genetic improvement programs.

2. Results

2.1. Genetic Variability

Analysis of variance (ANOVA) (Table 1) revealed highly significant differences (p ≤ 0.01 and p ≤ 0.001) among the 34 accessions for all traits except those related to seeds, such as SL and SWi. These differences were also detected throughout the years of evaluation, reflecting the influence of environmental conditions on plant performance. Highly significant differences were observed between accessions in several key traits, such as plant height (PH), number of sympodial and monopodial branches (NSP, NMP), days to fifty percent squaring (DFS), days to fifty percent flowering (DFF), and days to fifty percent pod opening (DBO), capsule length and width (BL, BWid, BLWR), number of seeds per capsule (NSB), seed shape index (SSI), seed weight (SW, SP), and traits related to cotton yield (SCY, SCWP, BW) and fiber weight (FW, GTP, GR). Similarly, the interaction between access and year showed the same pattern of significant differences, while seed dimensional traits (SL and SWi), which showed no significant differences, remained stable throughout the years of evaluation.
Table 1. Analysis of variance (ANOVA) for the 34 cotton accessions evaluated with the 32 quantitative characteristics.
The variability of quantitative descriptors was evaluated in the 34 accessions (Table 2). According to this criterion proposed by [15], the variables that showed high PCV values were NSP, Plen, SWi, SL, and SSI. NSP showed high values for both GCV and PCV. Moderate GCV and PCV values were found for LA, DFS, NSB, BW, SL, FW, and GR. In contrast, DTTL, NMP, PH, LL, LW, LI, DFF, CL, DBO, BL, Bwid, BLWR, NLP, DH, and SP corresponded to low GCV and PCV values. In terms of heritability, the categorization indicated by [16] showed that most traits had very high heritability (DFS, DFF, PLen, BL, Bwid, BLWR, NSB, and GR) and moderately high heritability (NSP, NMP, PH, PL, LL, LA, CL, DBO, BW, SW, SP, FW, and GTP). In addition, genetic progress (GA) and genetic progress as a percentage of the mean (GAM) were estimated to complement the heritability estimates. High GAM values were observed for NMP (83.03%), PLen (70.28%), NSB (65.41%), FW (59.21%), DFS (48.84%), BW (45.55%), GR (39.21%), LA (38.28%), and DFF (35.57%). Moderate GAM values were recorded for PH (23.86%), PL (27.74%), BWid (24.43%), BLWR (29.37%), SW (26.06%), SP (21.27%), NSP (20.17%), and GTP (32.50%). In contrast, low GAM values (<10%) were observed for DTTL, LW, LI, NLP, DH, SL, SWi, and SSI.
Table 2. Descriptive parameters and genetic variation of 32 quantitative traits evaluated in 34 cotton accessions.

2.2. Association of Agromorphological Descriptors

The analysis of phenotypic and genotypic correlation coefficients for agromorphological descriptors revealed highly significant positive and negative correlations (Figure 1). At both the phenotypic and genotypic levels, a strong and highly significant positive correlation was observed between traits related to cotton yield, such as SCY with NBP (0.893), between SCWP and NBP (0.893), and between BW and FW (0.881) and NSB (0.623), as well as between FW and GTP (0.701) and NSB (0.530). Highly significant positive relationships were also observed between leaf descriptors, such as LL with PL (0.624) and LA (0.995), and between LA and PL (0.620); descriptors related to earliness, such as DFF with DFS (0.844) and DBO (0.653); seed descriptors, such as SL with SSI (0.999) and SP with GR (0.807); and between PH and NSP (0.762) and LW (0.574).
Figure 1. Pearson correlation matrix for 32 agromorphological traits evaluated in 34 cotton accessions from the INIA cotton germplasm collection. Phenotypic correlation coefficient (lower diagonal), genotypic correlation coefficient (upper diagonal). Significant at 5% (*). Significant at 1% (**). Significant at 0.1% (***).
On the other hand, highly significant negative correlations were found between GTP and GR (0.968) and SP (0.820), FW and SP (0.638) and GR (0.669), as well as between DFS and NMP (0.537) and between BLWR and LI (0.621).

2.3. Multivariate Analysis Using Quantitative and Qualitative Data

Principal component analysis (PCA) of the quantitative traits of the 34 accessions evaluated revealed 31 principal components (PCs), of which five represented a cumulative variability of 71.03%. Dim1 showed a variability of 25.94%, with the largest contributions from BWid (8.75%), BW (8.47%), and DH (7.59%). Dim2 accounted for 17.25% of the variability, mainly represented by DBO (10.26%), LL (9.03%), LA (8.92%), and Plen (8.54%). Dim3 showed a variability of 12.19%, with predominant contributions from SW (13.04%), SP (11.75%), GTP (10.81%), and GR (10.71%). Dim4 explained 9.98% of the variability, represented by NBP (19.12%), CL (15.52%), SCWP (10.65%), and SCY (10.65%). Finally, Dim5 accounted for 6.57% of the variability, with high contributions from LI (17.00%) and BL (14.91%). On the other hand, the hierarchical dendrogram grouped the 34 accessions into four main clusters (Figure 2A), reflecting a considerable degree of divergence among the materials evaluated. These four groups were used as a reference in the PCA biplot, allowing the distribution of the accessions to be visualized in the two dimensions. Consequently, the PCA biplot (Figure 2B) shows four groups (red, green, blue, and purple), illustrating how the 34 accessions relate to the 32 quantitative descriptors. Group 1, represented in red in the PCA biplot (and purple in the dendrogram; Figure 2A) included accessions PER1010526, PER1010538, PER1010530, PER1010534, PER1010537, PER1010539, PER1010531, PER1010523, PER1010540, PER1010541, PER1010542, PER1010543, PER1010544, PER1010525, and PER1010524, which showed high values for the descriptors LL, LA, FW, BW, BWid, SCWP, SCY, and NSB. Group 2 (green) comprised accessions PER1010533, PER1010532, PER1010528, PER1010535, PER1010527, PER1010529, PER1010551, and PER1010536, which showed close relationships between SP, NSP, GR, and NBP. Group 3 (blue) included accessions PER1010553, PER1010549, PER1010548, PER1010547, PER1010552, PER1010545, PER1010555, and PER1010544, closely associated with BLWR, NSP, and DH. Group 4, represented in red in the dendrogram (Figure 2A) and in purple in the PCA biplot (Figure 2B), contained accessions PER1010550, PER1010554, PER1010546, and PER1010556, characterized by higher values of DH, DFF, DFS, BLWR, NMP, and DBO.
Figure 2. (A) Hierarchical dendrogram based on Gower’s distance matrix of the 34 cotton accessions, defining four groups (groups 1–4). In the dendrogram, group 1 is represented in purple, group 2 in green, group 3 in blue, and group 4 in red. (B) Principal component analysis (PCA) biplot of Dim 1 versus Dim 2 showing the divergence of quantitative descriptors and the 34 cotton accessions. The same four groups defined in the dendrogram are shown, with Group 1 represented in red, Group 2 in green, Group 3 in blue, and Group 4 in purple. (C) Multiple correspondence analysis (MCA) biplot of the 34 cotton accessions based on qualitative descriptors.
In turn, multiple correspondence analysis (MCA) performed on the 30 qualitative descriptors revealed that 32.01% of the variability was explained by the first two dimensions, Dim1 (21.84%) and Dim2 (10.17%). Associations were observed (Figure 2C) between several descriptors: SCl, SSp, and PN, and, to a lesser extent, PS. Accessions PER1010550, PER1010554, PER1010546, and PER1010556 were associated with these descriptors, showing a unique trait profile. Another group of descriptors, SCL and CP, was associated with accessions PER1010533, PER1010551, PER101053, PER101032, PER101035, PER101027, PER101028, and PER101029.
Finally, mixed data factor analysis (FAMD) summarizes the relationship between qualitative and quantitative variables. It was observed (Figure 3) that 19.39% of the total variance is explained by dimension 1 and 12.50% by dimension 2. Among the key descriptors for differentiating accessions are quantitative traits (S.1A): FW, NSB, DH, GR, and DBO, and qualitative traits (S.1B): SC, LC, BC, PSI, APP, FC, and SFC.
Figure 3. Biplot of mixed data factor analysis (FAMD) for qualitative and quantitative descriptors of the 34 cotton accessions.

3. Discussion

The germplasm of G. barbadense from northeastern Peru showed distinctive variations in its agromorphological traits. Analysis of variance revealed wide variability among accessions, with significant differences in most quantitative traits, confirming the potential of this germplasm for genetic improvement programs. Only seed size traits (SL, SWi, SSI) remained stable, suggesting more conserved genetic control over these characteristics, while the significant interaction between accession and year reflects the influence of the environment on trait expression. In this regard, phenotypic diversity and phenotypic plasticity have been reported to be key factors in selecting outstanding genotypes and improving this crop [17].
Similarly, both the phenotypic coefficient of variation (PCV) and the genotypic coefficient of variation (GCV) allowed us to quantify the variability of traits among germplasm accessions. These estimates suggest that descriptors with PCV values slightly higher than GCV values are less influenced by the environment and, therefore, their variability is mainly genetic [18]. This was observed in days to 50% square (DFS), ginning ratio (GR), number of seeds per capsule (NSB), capsule weight (BW), fiber weight (FW), pedicel length (Plen), and number of sympodial branches (NSP), all of which showed moderate to high PCV and GCV values. Similar results have been reported for BW and NSP [18] and NSB [19]. Similarly, the heritability of the seven descriptors mentioned above ranged from moderately high to very high. Since they are less influenced by the environment, their selection could be relatively straightforward due to the high additive effect on their expression [18,20].
Therefore, they can be improved using simple selection methods, such as direct selection in the first generations [21] or hybridization [22]. Similar patterns have been described for traits related to flowering, pod weight (BW), number of fruiting branches (NSP), number of seeds per pod (NSB) [19], and ginning index [23]. Favorable values for heritability and genetic variability indicate the effectiveness of selection for the genetic improvement of these traits.
Given that several traits showed genetic variability, it is important to understand their interactions with variables related to yield (capsule weight, fiber weight, and raw cotton yield), phenological traits (days to flowering and harvest), and architectural traits, such as the number of branches and leaf area. In this regard, the analysis of genotypic and phenotypic correlation coefficients helped identify significant associations between variables that directly affect cotton yield. Positive correlations were observed between the number of seeds per boll (NSB) and two key components of yield: fiber weight (FW) and boll weight (BW). This indicates that a higher number of seeds leads to an increase in fiber production and total boll weight, suggesting that a higher number of seeds per boll may be related to a reduction in the number of unfertilized ovules per boll [24], i.e., reproductive efficiency that positively impacts cotton yield. Similar results have been obtained between NSB, BW, and fiber yield [25]. Regarding the positive correlation between plant height (PH) and the number of sympodial branches (NSP), as well as leaf width (LW), similar findings have been reported between PH and NSP [23], in addition to their positive effect on cotton yield. In addition, taller plants with larger leaves can achieve greater photosynthetic efficiency, which would have an impact on final yield [26]. Strong correlations were also found, both at the genotypic and phenotypic levels, between seed cotton yield (SCY) and seed cotton weight per plant (SCWP), and with the number of bolls per plant (NBP), as well as between SCWP and NBP. This indicates that a greater number of reproductive structures and a greater mass of lint per plant contribute significantly to increased yield. These findings are consistent with those reported by [26], who observed a very strong association between fiber per boll and cotton weight per boll, indicating that heavier bolls produce more individual fiber, which has a positive impact on total yield. Regarding negative correlations, those observed between the boll-to-leaf ratio (BLWR) and the leaf-to-boll ratio (LI) indicate possible competition in resource allocation. These opposing phenotypic relationships have been described in studies on cotton, such as [27], in relation to source–sink dynamics and resource allocation between vegetative and reproductive structures [28].
In addition to phenotypic and genotypic correlations, multivariate analyses (PCA, MCA, and FAMD) were used to support the interpretation of patterns of variability among accessions by integrating multiple quantitative and qualitative traits. Rather than reiterating individual results, these approaches allowed for the identification of trait syndromes with potential agronomic relevance. PCA indicated that the principal axes were associated with traits related to yield, plant architecture, and phenological development, highlighting the multidimensional nature of cotton variability. Accessions were grouped according to combinations of vegetative vigor, reproductive allocation, and timing of development, reflecting distinct biological strategies. In particular, one group was characterized by greater leaf area and heavier bolls, traits commonly associated with biomass accumulation and yield potential. Comparable associations between capsule traits and yield components have been described for BW, FW, and BWid [29], as well as for SCY and BW [28,30,31,32,33]. The positive relationship between leaf area and yield, previously described by [34,35], suggests that increased photosynthetic capacity may indirectly contribute to productivity. These accessions represent promising material for breeding programs that aim to improve yield through indirect selection using leaf traits [36]. Another group was defined mainly by reproductive efficiency, integrating variables related to sympodial branches, number of capsules, seed percentage, and threshing ratio. Similar relationships between NSP and NBP have been described [37], supporting the role of fruiting structures in determining reproductive yield. These traits are particularly relevant for selection strategies focused on seed production efficiency and pod retention. Variability in phenological behavior was also evident, with accessions differing in maturity patterns and branch architecture. The association of days to harvest with branch type and capsule-related traits supports previous findings indicating that cotton maturity is influenced by plant architecture and fruiting habit [38]. The coexistence of early and late materials within this group may be advantageous for breeding programs targeting production cycles adapted to specific environmental or management conditions [39,40]. Finally, a distinct group was predominantly associated with phenological descriptors related to flowering, capsule formation, capsule opening, and harvest timing. These traits are closely related to crop maturity and the timing of reproductive development. Ref. [41] has described similar contributions of DBO, DFF, and DFS to maturity differentiation. The predominance of late-maturing accessions within this group highlights their potential use in breeding programs aimed at extending the production cycle or adapting cultivars to environments with longer growing seasons.
Complementarily, Multiple Correspondence Analysis (MCA) applied to qualitative variables explained 32.01% of the variability in its first two dimensions. In Dim1 (21.84%), a strong association was observed between descriptors such as seed cleft (SCl), seed shape (SSp), petiole nectaries (PN), and petal spot (PS). In this regard, the study by [15] revealed the high contribution of PS in determining the categorical traits of cotton. The accessions associated with these descriptors were PER1010550, PER1010554, PER1010546, and PER1010556, which showed a morphological profile that could be useful for identifying and characterizing materials with distinctive traits [15]. Another important association was detected between seed coat luster (SCL) and pulvinus color (CP), involving accessions PER1010533, PER1010551, PER1010535, PER1010532, PER1010536, PER1010527, and PER1010528, which overlap with the accessions highlighted in the PCA. This indicates that certain accessions simultaneously concentrate agronomic and qualitative traits of interest.
Finally, mixed data factor analysis (FAMD) integrated quantitative and qualitative variables, explaining 19.39% and 12.50% of the variability in the first two dimensions, respectively. Among the most influential descriptors were, among the quantitative traits, fiber weight, days to 50% capsule opening (DBO), days to harvest (DH), ginning ratio (GR), and number of seeds per capsule (NSB), and among the qualitative traits, stem color (SC), leaf color (LC), bract color (BC), petal spot (PS), petal spot intensity (PSI), presence of anthocyanin in the petiole (APP), fiber color (FC), and lint color (SFC). This integration shows that the variability observed does not depend exclusively on one type of descriptor, but rather on the interaction between productive and morphological traits, as observed in the study by [42], where genetic groups affecting both fiber quality and yield were identified in the same genomic locus. In this sense, accessions such as PER1010536, PER1010538, PER1010543, and PER1010547 emerge as strategic materials for breeding programs focused on yield and precocity, while PER1010550, PER1010554, and PER1010556 stand out for their distinctive morphological attributes.
The Germplasm of G. barbadense from northeastern Peru showed distinctive agromorphological variability in all traits evaluated. Analysis of variance revealed wide variation among accessions, with significant differences in most quantitative traits, confirming the potential of this germplasm for cotton breeding programs. In particular, traits showing high genetic variability and heritability, such as boll weight (BW) and number of seeds per boll (NSB), suggest that these characteristics can be effectively improved through direct selection in early generations, increasing fiber productivity and overall yield [23,43]. This study also provides a deeper understanding of how genotype × environment interactions influence the phenotypic expression of key traits such as number of sympodial branches (NSP) and ginning ratio (GR), creating opportunities for the development of varieties better adapted to specific environmental conditions. These morphological traits, which are closely related to increased fiber yield, have important implications for the textile industry, as they provide a genetic basis for producing higher quality and stronger fibers, which are necessary for the manufacture of high-performance textiles [44,45].

4. Materials and Methods

4.1. Plant Material

The plant material was collected in 2019 in the departments of San Martín and Loreto (Figure 4). A total of 34 accessions of G. barbadense were collected, mostly from domestic gardens preserved in the surrounding areas. The accessions were sown directly in the field in March, in final experimental plots of 15 m2 per accession. Each accession was established in two rows of 11 plants each, giving a total of 22 plants per accession. All accessions were maintained under uniform management conditions, including standardized irrigation, fertilization, and phytosanitary control during the three growing seasons.
Figure 4. Location of collection sites for the 34 cotton accessions in the departments of San Martín and Loreto.

4.2. Location of the Research

The research was conducted between 2022 and 2024 at the El Porvenir Agricultural Experimental Station of the National Institute of Agricultural Innovation (INIA). It is located in the district of Juan Guerra, province and department of San Martín, Peru, at a latitude of −6°35′26.5″ and a longitude of −76°19′25.47″, at an altitude of 213 m above sea level. Although all accessions received the same agronomic management, environmental conditions varied throughout the three growing seasons, including specific climatic factors for each year.

4.3. Experimental Design

The 34 accessions of G. barbadense were established in the same experimental plot, with each accession occupying 15 m2 distributed in two rows of 11 plants each (22 plants per accession). The distance between rows was 1.5 m and that between plants was 1.0 m.
From each accession, 10 plants were randomly selected and considered as independent experimental units for morpho-agronomic evaluation in each growing season. The experiment was conducted using a completely randomized design (CRD) in each year (2022, 2023, and 2024), considering each accession as a treatment. Since the evaluations were carried out for three consecutive years on the same plot, but under environmental conditions that varied between seasons, the Year factor was included as a fixed effect in the ANOVA. In addition, the Accession × Year interaction was evaluated to assess the phenotypic stability of the traits across seasons.

4.4. Variables Evaluated

A total of 32 quantitative traits and 30 qualitative traits were evaluated for the 34 accessions, following the recommendations and descriptors proposed by [46], and the different degrees of color were determined using the Royal Horticultural Society color chart [47]. Of the 62 descriptors evaluated, 4 corresponded to vegetative traits, 14 to the leaf, 14 to the flower, 15 to the capsule, 11 to the seed, and 4 to the fiber (Table 3). All traits were measured or observed at the appropriate stages of development: vegetative and leaf traits were recorded during the vegetative growth phase, floral traits during flowering, capsule and seed traits at fruit maturity, and fiber traits after ginning. Quantitative traits were measured using standard tools (e.g., ruler, caliper, or balance). Qualitative traits were scored visually following the descriptor guidelines, and color traits were determined using the Royal Horticultural Society (RHS) color chart.
Table 3. Qualitative and quantitative descriptors used in the characterization of tropical cotton accessions.

4.5. Statistical Analysis

Statistical analyses were performed to characterize the accessions. For quantitative descriptors, a two-way analysis of variance (ANOVA) was applied, considering the effects of accessions, years of evaluation, and their interaction, verifying the normality of the residuals using the Shapiro–Wilk test. From the quadratic means of the ANOVA, genetic parameters such as the genotypic coefficient of variation (GCV) and the phenotypic coefficient of variation (PCV) were estimated according to [48], as well as broad-sense heritability (H2) [49]. The categorization proposed by [15] was considered for GCV and PCV values, with low (0–10%), moderate (10–20%), and high (more than 20%) values, while for heritability, the categorization indicated by [16] was considered, with low values (below 40%), medium values (40–59%), moderately high values (60–79%), and very high values (≥80%). Similarly, Pearson’s phenotypic correlations and approximate genotypic correlations derived from variance components were calculated and represented using heat maps with significance levels. To explore the multivariate structure of the data according to the nature of the variables, complementary ordering methods were applied. Principal component analysis (PCA) was performed on the quantitative variables to identify the main gradients of variation and evaluate the contribution and correlation structure of the continuous descriptors.
Multiple correspondence analysis (MCA) was applied to the qualitative variables to explore associations between categories and visualize their relationships in a reduced-dimension space, and a mixed hierarchical cluster analysis based on Gower’s distance [50] and the Ward D2 method was represented with phylogenetic dendrograms. In addition, a mixed-data factor analysis (FAMD) was applied as an integrative approach to jointly analyze quantitative and qualitative variables, providing an overview of the multivariate relationships between accessions. Although FAMD incorporates all the variables analyzed separately using PCA and MCA, the latter methods were retained to obtain specific interpretations of the type of variables that are not fully captured by the mixed analysis alone. All analyses were performed in the R v.4.5.1 programming environment [51], using the ExpDes.pt, FactoMineR, factoextra, ggplot2, cluster, corrplot, tidyverse, and openxlsx packages within the RStudio Version 4.5.2 interface.

5. Conclusions

Cotton germplasm from northeastern Peru showed wide variability with highly heritable traits under genetic control, especially capsule weight, fiber weight, number of seeds, and days to flowering, making them reliable descriptors for direct selection. Multivariate analyses identified promising accessions such as PER1010536, PER1010538, PER1010543, and PER1010547 (notable for their yield and earliness), and PER1010550, PER1010554, and PER1010556 (with vegetative vigor and fiber attributes). Similarly, materials such as PER1010550, PER1010554, and PER1010556 stood out for their unique morphological traits, relevant for diversity and conservation. These results confirm the value of these accessions as a basis for cotton breeding programs.

Supplementary Materials

The following supplementary information can be downloaded at: https://www.mdpi.com/article/10.3390/plants15020314/s1. Figure S1: Factor Analysis of Mixed Data (FAMD) for quantitative (A) and qualitative (B) descriptors, highlighting the contribution of each descriptor to the principal dimensions.

Author Contributions

Conceptualization, D.C.-S. and S.L.-H.; methodology, D.C.-S., S.L.-H. and E.I.M.-C.; validation, D.C.-S. and E.I.M.-C.; formal analysis, D.C.-S. and S.L.-H.; research, D.C.-S. and S.L.-H.; data curation, D.C.-S. and E.I.M.-C.; writing: preparation of the original draft, D.C.-S. and S.L.-H.; writing: review and editing, D.C.-S. and S.L.-H.; supervision, D.C.-S. and E.I.M.-C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the project “Mejoramiento de los servicios de investigación en la caracterización de los recursos genéticos de la agrobiodiversidad en 17 departamentos del Perú—ProAgrobio” of the Ministry of Agricultural Development and Irrigation (MIDAGRI) of the Peruvian Government, with grant number CUI 2480490, executed by the Subdirectorate of Genetic of the Directorate of Genetic Resources and Biotechnology at the National Institute of Agricultural Innovation (INIA).

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Materials. Further inquiries can be directed to the corresponding author.

Acknowledgments

We would like to thank all the staff at the Genetic Resources Subdivision of the El Porvenir Agricultural Experiment Station (EEA-El Porvenir) for providing the field resources.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

References

  1. Lee, J.A.; Fang, D.D. Cotton as a World Crop: Origin, History, and Current Status. Cotton 2015, 57, 1–23. [Google Scholar] [CrossRef]
  2. Khan, M.A.; Wahid, A.; Ahmad, M.; Tahir, M.T.; Ahmed, M.; Ahmad, S.; Hasanuzzaman, M. World Cotton Production and Consumption: An Overview. In Cotton Production and Uses: Agronomy, Crop Protection, and Postharvest Technologies; Ahmad, S., Hasanuzzaman, M., Eds.; Springer: Singapore, 2020; pp. 1–7. [Google Scholar] [CrossRef]
  3. Morales-Aranibar, L.; Rivera, M.Y.N.; Gonzales, H.H.S.; Aranibar, C.G.M.; Gutiérrez, N.L.; Gomez, F.G.; Zuffo, A.M.; Aguilera, J.G.; Steiner, F. Comparative Analysis of Key Fiber Characteristics in White Pima Cotton (Gossypium barbadense L.): Native Accessions from the Peruvian Amazon. Agrosyst. Geosci. Environ. 2024, 7, e20517. [Google Scholar] [CrossRef]
  4. Yuan, D.; Grover, C.E.; Hu, G.; Pan, M.; Miller, E.R.; Conover, J.L.; Hunt, S.P.; Udall, J.A.; Wendel, J.F. Parallel and intertwining threads of domestication in allopolyploid cotton. Adv. Sci. 2021, 8, 2003634. [Google Scholar] [CrossRef]
  5. Wendel, J.F.; Brubaker, C.L.; Seelanan, T. The Origin and Evolution of Gossypium. In Physiology of Cotton; Stewart, J.M., Oosterhuis, D.M., Heitholt, J.J., Mauney, J.R., Eds.; Springer: Dordrecht, The Netherlands, 2010; pp. 1–18. [Google Scholar] [CrossRef]
  6. Westengen, O.T.; Huamán, Z.; Heun, M. Genetic Diversity and Geographic Pattern in Early South American Cotton Domestication. Theor. Appl. Genet. 2005, 110, 392–402. [Google Scholar] [CrossRef]
  7. Avci, U.; Pattathil, S.; Singh, B.; Brown, V.L.; Hahn, M.G.; Haigler, C.H. Cotton Fiber Cell Walls of Gossypium hirsutum and Gossypium barbadense Have Differences Related to Loosely-Bound Xyloglucan. PLoS ONE 2013, 8, e56315. [Google Scholar] [CrossRef] [PubMed]
  8. Liu, X.; Zhao, B.; Zheng, H.J.; Hu, Y.; Lu, G.; Yang, C.Q.; Chen, J.; Chen, J.J.; Chen, D.Y.; Zhang, L.; et al. Gossypium barbadense Genome Sequence Provides Insight into the Evolution of Extra-Long Staple Fiber and Specialized Metabolites. Sci. Rep. 2015, 5, 14139. [Google Scholar] [CrossRef] [PubMed]
  9. Hu, Y.; Chen, J.; Fang, L.; Zhang, Z.; Ma, W.; Niu, Y.; Ju, L.; Deng, J.; Zhao, T.; Lian, J.; et al. Gossypium barbadense and Gossypium hirsutum Genomes Provide Insights into the Origin and Evolution of Allotetraploid Cotton. Nat. Genet. 2019, 51, 739–748. [Google Scholar] [CrossRef] [PubMed]
  10. Ministerio del Ambiente (MINAM). Documentación de las Colecciones Herborizadas de Gossypium (Malvaceae), Conservadas en el Herbario de San Marcos; Informe Final; Ministerio del Ambiente: Lima, Perú, 2012. Available online: https://bioseguridad.minam.gob.pe/wp-content/uploads/2018/07/ldb_algodon_germoplasmaunmsm_12.pdf (accessed on 17 September 2025).
  11. Cardoso, K.C.M.; Barroso, G.H.; Freitas, F.O.; de Menezes, I.P.P.; Silva, C.F.; Arriel, N.H.C.; Sofiatti, V.; Hoffmann, L.V. Traditional Fabric and Medicinal Use Are the Leading Factors of In Situ Conservation of Gossypium barbadense in Central Brazil. Sustainability 2023, 15, 4552. [Google Scholar] [CrossRef]
  12. Blas-Sevillano, R.H.; Veramendi, T.; La Torre, B.; Velezmoro-Sánchez, C.E.; Oliva, A.I.; Mena-Martínez, M.E.; Herrera-Kao, W.A.; Uribe-Calderon, J.; Cervantes-Uc, J.M. Physicochemical Characterization of Several Types of Naturally Colored Cotton Fibers from Peru. Carbohydr. Polym. 2018, 197, 246–252. [Google Scholar] [CrossRef]
  13. Cortijo-de-Arbildo, D.; Cancio-Iparraguirre, R.S. Innovación Tecnológica para Recuperar el Algodón Nativo de Color. Ing. Ind. 2012, 30, 225–245. [Google Scholar] [CrossRef]
  14. Restrepo-Montoya, D.; Hulse-Kemp, A.M.; Scheffler, J.A.; Haigler, C.H.; Hinze, L.L.; Love, J.; Percy, R.G.; Jones, D.C.; Frelichowski, J. Leveraging National Germplasm Collections to Determine Significantly Associated Categorical Traits in Crops: Upland and Pima Cotton as a Case Study. Front. Plant Sci. 2022, 13, 837038. [Google Scholar] [CrossRef]
  15. Deshmukh, S.N.; Basu, M.S.; Reddy, P.S. Genetic Variability, Character Association and Path Coefficients of Quantitative Traits in Virginia Bunch Varieties of Groundnut. Indian J. Agric. Sci. 1986, 56, 816–821. [Google Scholar]
  16. Singh, B.D. Plant Breeding: Principles & Methods; Kalyani Publishers: New Delhi, India, 2016; ISBN 978-93-272-5232-3. Available online: http://macl-ustm.digitallibrary.co.in/handle/123456789/2009 (accessed on 17 September 2025).
  17. Huang, Y.; Qi, Z.; Li, J.; You, J.; Zhang, X.; Wang, M. Genetic Interrogation of Phenotypic Plasticity Informs Genome-Enabled Breeding in Cotton. J. Genet. Genom. 2023, 50, 971–982. [Google Scholar] [CrossRef]
  18. Harini, P.S.; Premalatha, N.; Subramanian, A.; Boopathi, N.M.; Guruswamy, K. Unveiling Genetic Variation in Egyptian Cotton (Gossypium barbadense L.) Germplasm: A Combined Approach of Morphological Characterization and Multivariate Analysis. Plant Sci. Today 2025, 12, 1–10. [Google Scholar] [CrossRef]
  19. Iqbal, Z.; Ullah, M.; Chohan, S.; Iqbal, J.; Farooq, M.; Sarwar, M.; Khan, R.; Latif, A.; Nazar, M.; Saleem, M.; et al. Evaluation of Genetic Variability for Different Fiber and Yield Related Components in Cotton (Gossypium hirsutum L.). Biol. Clin. Sci. Res. J. 2023, 4, 177. [Google Scholar] [CrossRef]
  20. Jarwar, A.H.; Wang, X.; Wang, L.; Jarwar, Z.H.; Ma, Q.; Fan, S. Genetic Advancement, Variability and Heritability in Upland Cotton (Gossypium hirsutum L.). J. Environ. Agric. Sci. 2018, 16, 24–31. [Google Scholar]
  21. Rehman, A.; Mustafa, N.; Du, X.; Azhar, M.T. Heritability and Correlation Analysis of Morphological and Yield Traits in Genetically Modified Cotton. J. Cotton Res. 2020, 3, 23. [Google Scholar] [CrossRef]
  22. Gibely, R.H. Variability, Genetic Components and Selection Response in Segregating Generations among Some Cotton Crosses. J. Sustain. Agric. Sci. 2021, 47, 1–12. [Google Scholar] [CrossRef]
  23. Sahar, A.; Zafar, M.M.; Razzaq, A.; Manan, A.; Haroon, M.; Sajid, S.; Rehman, A.; Mo, H.; Ashraf, M.; Ren, M.; et al. Genetic Variability for Yield and Fiber Related Traits in Genetically Modified Cotton. J. Cotton Res. 2021, 4, 19. [Google Scholar] [CrossRef]
  24. Davidonis, G.H.; Richard, O.A.; Ingber, B.F.; Meredith, W.R.; Heitholt, J.J. The Influence of Cotton Seed Weight on Fibers Per Seed and Fiber Property Uniformity. J. New Seeds 2005, 7, 1–13. [Google Scholar] [CrossRef]
  25. Khalid, M.A.; Malik, T.A.; Fatima, N.; Shakeel, A.; Karim, I.; Arfan, M.; Merrium, S.; Khanum, P. Correlation for Economic Traits in Upland Cotton. Acta Sci. Agric. 2018, 2, 59–62. Available online: https://www.actascientific.com/ASAG-2-10.php (accessed on 17 September 2025).
  26. Virk, G.; Snider, J.L.; Chee, P.; Virk, S.; Jones, D. Impact of Intra-Boll Yield Components on Lint Production per Boll and Fiber Quality in Advanced Cotton Breeding Lines. Crop Sci. 2024, 64, 126–140. [Google Scholar] [CrossRef]
  27. Li, D.; Letort, V.; Guo, Y.; De Reffye, P.; Zhan, Z. Modeling Branching Effects on Source–Sink Relationships of the Cotton Plant (Gossypium hirsutum L.). arXiv 2010, arXiv:1012.3276. [Google Scholar] [CrossRef]
  28. Qin, A.; Aluko, O.O.; Liu, Z.; Yang, J.; Hu, M.; Guan, L.; Sun, X. Improved Cotton Yield: Can We Achieve This Goal by Regulating the Coordination of Source and Sink? Front. Plant Sci. 2023, 14, 1136636. [Google Scholar] [CrossRef]
  29. Mangi, N.; Iqbal, M.S.; Shuli, F.; Iqbal, M.T.; Alharthi, B.; Jatoi, G.H.; Sarfraz, Z.; Ma, Q.; Sun, X. Empowering Cotton Breeding Programs through the Strategic Exploration and Exploitation of Phenotypic Diversity of Genetic Resources under Climate Change Conditions. Plant Stress 2024, 13, 100548. [Google Scholar] [CrossRef]
  30. Kaleri, R.A.; Malik, Y.; Majidano, T.; Abbasi, M.Z.H.; Ejaz, S.; Bahalkani, G.H.; Maryam, N.; Aleem, M.U.; Saleem, Z.; Kaleri, A.; et al. Genetic Diversity for Yield and Fiber Traits in Upland Cotton (Gossypium hirsutum L.) Genotypes. Pure Appl. Biol. 2025, 14, 524–530. [Google Scholar] [CrossRef]
  31. Vishnuvardhan, K.M.; Reddy, B.V.R.P.; Lakshmikalyani, D.; Sivaramakrishna, M.; Sudheepthi, K.; Amarnath, K.; Venkateswarlu, N.C. Multivariate Analysis in Upland Cotton (Gossypium hirsutum L.) Using Principal Component and Cluster Analysis. Ecol. Environ. Conserv. 2023, 29, S224–S230. [Google Scholar] [CrossRef]
  32. Riaz, B.; Saeed, A.; Fiaz, S.; Riaz, A. Genetic Diversity among Bt Cotton (Gossypium hirsutum L.) Germplasm Assessed through Morphological and within-Boll Yield Attributes. J. Anim. Plant Sci. 2019, 29, 226–231. Available online: https://www.thejaps.org.pk/docs/V-29-01/27.pdf (accessed on 18 September 2025).
  33. Rizwan, M.; Farooq, J.; Farooq, A.; Farooq, M.; Sarwar, G.; Nadeem, M.; Riaz, M.; Rafique Shahid, M.; Abbas, H.G.; Sarwar, M.K.S. Yield and Related Components of Cotton (Gossypium hirsutum L.) Effected by Chlorophyll Contents. Pak. J. Agric. Res. 2022, 35, 29–35. [Google Scholar] [CrossRef]
  34. Li, H.; Liu, Z.; Chen, Y.; Zhang, X.; Chen, D. A Positive Correlation between Seed Cotton Yield and High-Efficiency Leaf Area Index in Directly Seeded Short-Season Cotton after Wheat. Field Crops Res. 2022, 285, 108594. [Google Scholar] [CrossRef]
  35. Hussain, M.; Azhar, F.M.; Khan, A.A. Genetic Basis of Variation in Leaf Area, Petiole Length and Seed Cotton Yield in Some Cotton (Gossypium hirsutum) Genotypes. Int. J. Agric. Biol. 2008, 10, 705–708. Available online: https://www.researchgate.net/profile/Asif-Khan-53/publication/237540318_Genetic_Basis_of_Variation_in_Leaf_Area_Petiole_Length_and_Seed_Cotton_Yield_in_Some_Cotton_Gossypium_hirsutum_Genotypes/links/00b49528faf0374337000000/Genetic-Basis-of-Variation-in-Leaf-Area-Petiole-Length-and-Seed-Cotton-Yield-in-Some-Cotton-Gossypium-hirsutum-Genotypes.pdf (accessed on 5 October 2025).
  36. Fan, X.; Gao, P.; Zhang, M.; Cang, H.; Zhang, L.; Zhang, Z.; Wang, J.; Lv, X.; Zhang, Q.; Ma, L. The Fusion of Vegetation Indices Increases the Accuracy of Cotton Leaf Area Prediction. Front. Plant Sci. 2024, 15, 1357193. [Google Scholar] [CrossRef]
  37. Isong, A.; Balu, P.A.; Ramakrishnan, P. Association and Principal Component Analysis of Yield and Its Components in Cultivated Cotton. Electron. J. Plant Breed. 2017, 8, 857–864. [Google Scholar] [CrossRef]
  38. Chao, L.; Pan, Z.; Wang, J.; Wu, Y.; Shui, G.; Aini, N.; Tang, B.; Guo, C.; Han, P.; Shao, P.; et al. Genetic Mapping and Analysis of a Compact Plant Architecture and Precocious Mutant in Upland Cotton. Plants 2022, 11, 1483. [Google Scholar] [CrossRef]
  39. Sêkloka, E.; Lançon, J.; Gozé, E.; Hau, B.; Lewicki-Dhainaut, S.; Thomas, G. Breeding New Cotton Varieties to Fit the Diversity of Cropping Conditions in Africa: Effect of Plant Architecture, Earliness and Effective Flowering Time on Late-Planted Cotton Productivity. Exp. Agric. 2008, 44, 197–207. [Google Scholar] [CrossRef]
  40. Raj, S.; Patil, R.S.; Patil, B.R.; Nayak, S.N.; Pawar, K.N. Characterization of Early Maturing Elite Genotypes Based on MTSI and MGIDI Indexes: An Illustration in Upland Cotton (Gossypium hirsutum L.). J. Cotton Res. 2024, 7, 25. [Google Scholar] [CrossRef]
  41. Sarwar, G.; Nazir, A.; Rizwan, M.; Shahzadi, E.; Mahmood, A. Genetic Diversity among Cotton Genotypes for Earliness, Yield, and Fiber Quality Traits Using Correlation, Principal Component, and Cluster Analyses. Sarhad J. Agric. 2021, 37, 307–314. [Google Scholar] [CrossRef]
  42. Shi, Y.; Liu, A.; Li, J.; Zhang, J.; Zhang, B.; Ge, Q.; Jamshed, M.; Lu, Q.; Li, S.; Xiang, X.; et al. Dissecting the Genetic Basis of Fiber Quality and Yield Traits in Interspecific Backcross Populations of Gossypium hirsutum × Gossypium barbadense. Mol. Genet. Genom. 2019, 294, 1385–1402. [Google Scholar] [CrossRef] [PubMed]
  43. Delgado-Paredes, G.; Rojas-Idrogo, C.; Vásquez-Díaz, C.; Esquerre-Ibañez, B.; Bazán-Sernaqué, P.; Ayasta, P.C.; Silva, F.Z.-D. Collection, fiber color characterization and germplasm conservation of native cotton (Gossypium barbadense L.) in Peru. Chilean J. Agric. Anim. Sci. 2024, 40, 1–12. [Google Scholar] [CrossRef]
  44. Kaggwa-Asiimwe, R.; Andrade-Sanchez, P.; Wang, G. Plant architecture influences growth and yield response of upland cotton to population density. Field Crops Res. 2013, 145, 52–59. [Google Scholar] [CrossRef]
  45. Zhang, T.; Wang, S.; Chen, J.; Zhu, S.; Zhu, Q.; Zhao, T. Advances in cotton genomics, genetics and breeding. Plants 2024, 13, 2579. [Google Scholar] [CrossRef] [PubMed]
  46. Manco-Céspedes, E.I.; Chanamé, J.; Arévalo, G.; Mamani, W.; Hinostroza, L.; Garay, N.; Lindo, D.; Vásquez, J.; García-Serquén, A. Manual de Descriptores para Algodón Peruano (Gossypium barbadense L.); Instituto Nacional de Innovación Agraria (INIA): Lima, Perú, 2022; Available online: https://hdl.handle.net/20.500.12955/2026 (accessed on 19 September 2025).
  47. Royal Horticultural Society. The Royal Horticultural Society Colour Chart; The Royal Horticultural Society: London, UK, 2001. [Google Scholar]
  48. Comstock, R.E.; Robinson, H.F. Genetic Parameters, Their Estimation and Significance. In Proceedings of the 6th International Grassland Congress; The Pennsylvania State College: University Park, PA, USA, 1952; Volume 1, pp. 248–291. [Google Scholar]
  49. Falconer, D.S.; Mackay, T.C. Introduction to Quantitative Genetics, 4th ed.; John Wiley & Sons: New York, NY, USA, 1996; pp. 313–320. [Google Scholar]
  50. Gower, J.C. A General Coefficient of Similarity and Some of Its Properties. Biometrics 1971, 27, 857–871. [Google Scholar] [CrossRef]
  51. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2025. Available online: https://www.R-project.org/ (accessed on 19 September 2025).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.