Next Article in Journal
Soil Phosphorus Fraction Characteristics in Different Alpine Grassland Types of the Qinghai–Tibet Plateau
Previous Article in Journal
Spectroscopic Evidence of Soil Carbon and DOM Transformation Across an 8–63-Year Paddy Chronosequence in Western Jilin, China
Previous Article in Special Issue
Transcriptome Analysis of Potential Regulatory Genes under Chemical Doubling in Maize Haploids
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Merging Phenotypic Stability Analysis and Genomic Prediction for Multi-Environment Breeding in Capsicum spp.

by
Sebastian Parra-Londono
1,*,
Felipe López-Hernández
2,*,
Guillermo Montoya
3,
Juan Camilo Henao-Rojas
2,4,
Gustavo A. Ossa-Ossa
1 and
Andrés J. Cortés
5
1
Unidad Técnica para el Desarrollo Profesional (Utede), Red de Transformación Productiva, Programas Agrícolas, Carrera 12 N° 26 C-64, Guadalajara de Buga 763041, Colombia
2
Corporación Colombiana de Investigación Agropecuaria (AGROSAVIA)—CI La Selva, Km 7 vía Rionegro—Las Palmas, Rionegro 054048, Colombia
3
Departamento de Ciencias Farmacéuticas, Biomédicas y Veterinarias, Facultad Barberi de Ingeniería, Diseño y Ciencias Aplicadas, Universidad Icesi, Cali 760031, Colombia
4
Grupo de Investigación en Sustancias Bioactivas, Facultad de Ciencias Farmacéuticas y Alimentarias, Universidad de Antioquia UdeA, Calle 70 No. 52-21, Medellín 050010, Colombia
5
Facultad de Ciencias Agrarias—Departamento de Ciencias Forestales, Universidad Nacional de Colombia—Sede Medellín, Medellín 050034, Colombia
*
Authors to whom correspondence should be addressed.
Agronomy 2025, 15(12), 2690; https://doi.org/10.3390/agronomy15122690
Submission received: 22 October 2025 / Revised: 14 November 2025 / Accepted: 17 November 2025 / Published: 22 November 2025
(This article belongs to the Special Issue Omics Approaches for Crop Improvement—Volume II)

Abstract

Capsicum spp. support diverse fresh and processing value chains, yet integrated assessments of phenotypic stability and genome-enabled prediction remain limited. In this study, 32 representative accessions, selected from a panel of 235 genotyped entries from the Colombian Capsicum germplasm collection, were evaluated across three contrasting environments to characterize physicochemical traits (texture, pH, soluble solids, color) and biochemical attributes (total carotenoids, capsaicin, dihydrocapsaicin, phenolics, antioxidant capacity). Variance partitioning and AMMI models quantified the contributions of genotype (G), environment (E), and G × E interactions (GEIs). Significant effects were detected for most traits. The AMMI analysis identified stable genotypes across locations for pH, moisture, firmness, and cohesiveness. In contrast, color attributes, total carotenoids, and phenolic compounds showed greater environmental responsiveness. Texture-related and solid content traits showed broad adaptability and high phenotypic stability, making them reliable targets for selection under variable production conditions. For the genomic component, we analyzed 235 accessions genotyped with 68,481 high-quality SNPs obtained through GBS. These data were used to estimate genomic heritability and prediction accuracy with Bayesian and semi-parametric models. Among them, BayesC showed the best performance. Prediction accuracy reached r = 0.94 within the training environment and ranged from r = 0.64 to 0.73 when tested across contrasting environments. Genomic heritability was highest for pH (h2 = 0.48) and pungency-related traits, including capsaicin (h2 = 0.39) and dihydrocapsaicin (h2 = 0.48), indicating strong additive genetic control. Finally, by integrating AMMI-based stability analysis and BayesC genomic prediction, we identified genotypes exhibiting both high performance and environmental robustness. This combined selection approach provides a comprehensive framework for genomic-assisted breeding to enhance fruit quality, carotenoid content, and pungency stability in Capsicum spp. under heterogeneous environments.

1. Introduction

Capsicum spp. are a major vegetable and spice crop worldwide and the most economically important genus within the Solanaceae family. The genus originated in Central and South America, where it was first domesticated and diversified [1,2]. Annually, approximately 37 million tons of chili peppers are produced globally, with China being the largest producer, accounting for nearly 46% of total production [3]. In Colombia, national production reached about 100,000 tons in 2023, with an average yield of 10 tons per hectare [4]. Approximately 30 Capsicum species have been described, of which five—Capsicum annuum, C. chinense, C. frutescens, C. baccatum, and C. pubescens—are domesticated and widely cultivated [5]. The selection of Capsicum fruits is based on traits such as fruit size, shape, color, pungency, and chemical composition, which are key for their use in the food, pharmaceutical, and cosmetic industries. Agronomic traits including fruit number, weight, and plant architecture also determine yield potential, highlighting the importance of understanding both genetic and environmental factors influencing these characteristics, particularly under changing climatic conditions [6].
The global genetic diversity of Capsicum has been extensively characterized using morphological and molecular approaches. A recent large-scale analysis of 10,000 accessions using genotyping-by-sequencing (GBS) identified nearly 500,000 single-nucleotide polymorphisms (SNPs), providing unprecedented resolution of the evolutionary history and global dispersal of the genus [7]. Regional studies have also revealed significant intra- and inter-specific variability, such as the clustering of Spanish germplasm into seven genetic groups [8], and the observation of higher within-group diversity in Indian collections [5]. In Colombia, Capsicum germplasm has shown high genetic heterogeneity, with six distinct clusters identified among 95 accessions, and the Andean group exhibiting the highest diversity [9]. More recently, the first comprehensive pangenomic characterization of the entire national Capsicum germplasm collection was conducted using genotyping-by-sequencing (GBS) and reference-guided pangenomic assembly [10]. This study integrated genomic and structural variation data from the Colombian collection, revealing five ancestral populations as inferred from ancestry and unsupervised clustering analyses, and demonstrating extensive admixture among domesticated species. These results confirmed the broad genetic structure and high intra-specific diversity within the Colombian Capsicum panel, positioning this germplasm as one of the most genetically diverse regional resources reported to date. Consequently, this dataset constitutes a foundational genomic reference for future association studies, genomic prediction, and pre-breeding initiatives in Capsicum spp. at the national scale.
Large information about Capsicum genomic variability is available, however the elucidation of the genetic background of phenotypic and biochemical quantitative traits remains unclear. Understanding the genetic basis of such traits may accelerate improvements in the crop. The chemical composition of Capsicum fruits is complex, including vitamins, carotenoids, phenolic compounds, proteins, and minerals, which together determine organoleptic quality and industrial value [11]. The concentration of these substances exerts a significant influence on their organoleptic properties, a factor that is of leading importance for its industrial utilization. Organoleptic properties are influenced by a multitude of social and cultural factors, which in turn shape consumer preferences and market needs. The chemical composition of the fruits is subjected to environmental variations and management factors present within the cropping system. It is therefore essential to comprehend the interaction between the individual and their environment if improvements are to be made to Capsicum yield and quality. Multi-environment studies are useful in understanding Genotype-Environment-Management (GxExM) interactions in crops and in uncovering their performance under a range of growing conditions [12]. In addition, given that the majority of yield- and quality-related traits are inherited quantitatively, leveraging the genomic and phenotypic diversity from local collections is key for crop improvement [13].
In Colombia, the national Capsicum germplasm collection represents a valuable regional genetic resource encompassing wide diversity within the five domesticated species. Characterizing this collection across different environments would enable the identification of superior accessions for targeted breeding objectives and industry-specific applications. The use of multi-environment trials and stability models such as AMMI (Additive Main Effect and Multiplicative Interaction) has proven effective in evaluating phenotypic stability and adaptation patterns. Through these analyses, accessions showing either broad stability or environment-specific performance can be identified, supporting selection strategies aligned with agro-industrial needs.
In parallel, the rise in genomic selection (GS) has transformed modern breeding by enabling the prediction of genetic merit through genome-wide marker information [14,15,16]. Genomic prediction models capture the additive effects of numerous small-effect loci underlying quantitative traits, overcoming the limitations of GWAS in detecting rare variants or low-effect alleles [17,18]. Bayesian and semi-parametric regression methods—such as BayesA [14], BayesB (Meuwissen et al., 2001) [14], BayesC [19], Bayesian LASSO (Park and Casella 2012) [20], Bayesian Ridge Regression, and Reproducing Kernel Hilbert Spaces (RKHS) [15,21,22]—have been successfully applied in several crops as beans [16], maize [23], or wheat [24] to increase selection accuracy and accelerate genetic gain per cycle.
However, despite the extensive application of genomic prediction in cereals and legumes, its integration with stability and adaptability models such as AMMI has not yet been widely implemented for genetic improvement. The combined use of genomic prediction and AMMI represents an innovative framework. This allows the simultaneous estimation of genomic-estimated breeding values (GEBVs) and the partitioning of G × E interactions (GEIs) within a unified predictive architecture. This integrative approach enhances the capacity to identify accessions that exhibit both high genomic merit and environmental stability, improving prediction ability for genomic-assisted selection under multi-environment conditions. To our knowledge, this study constitutes the first attempt to integrate genomic prediction with AMMI analysis in Capsicum spp., providing a novel methodological contribution to the pre-breeding strategies of this genus.
Accordingly, the present study aims to (i) characterize the physicochemical and chemical variability of Capsicum fruits under contrasting Colombian environments; (ii) assess the influence of genotype-by-environment interaction (GEI) using the AMMI model; (iii) estimate the narrow sense genomic heritability and genomic prediction accuracy of key agro-industrial traits; and (iv) explore the integration of phenotypic stability analysis with genomic prediction for the selection of superior and broadly adapted accessions, based on their genomic breeding values (GBVs) as a novel framework for the genetic improvement of Capsicum spp.

2. Materials and Methods

2.1. Plant Material and Experimental Design

The field trials were conducted with 32 Capsicum genotypes belonging to four domesticated species C. annuum, C. chinense, C. pubescens, and C. baccatum. They are part of the Colombian germplasm collection which comprises approximately 300 accessions and represents a wide genetic diversity of the five domesticated Capsicum species. Plants were cultivated in three different locations with contrasting environmental conditions. Palmira (Valle del Cauca), located at 1000 m asl, had a yearly mean temperature of 25 °C; while Timbío (Cauca), located at 1700 m asl had a mean temperature of 24 °C. La Selva (Antioquia), located at 2100 m asl had a mean temperature of 17 °C.
Capsicum seedlings were grown in mulched plots previously prepared with organic matter and covered with agricultural plastic. Plants were watered using a localized drip irrigation system maintaining optimal soil moisture (10–30 kPa). A complete randomized block design was applied for plants cultivation at each location. Nutrition was applied by supplying essential macro and micronutrients to the plants. Seven plants per plot, located in the middle, were used to collect fruits which were harvested at full ripening stage. Border plants located at the edge of the plots served as microenvironmental controls to ensure representative sampling. Harvest was performed at the maximum production stage of the plants, approximately after 90 days of sowing. Three replications per trait measurement were conducted evaluating three different fruits.

2.2. Physicochemical Characterization of Fruits

Capsicum fruits were processed to produce homogenous paste which was used to analyze physicochemical traits. Three replications of each accession were analyzed to assess the parameters. Fruits were collected in the three cultivated plots from plants located in the middle. The pH was measured using a pH meter and soluble solids were quantified using a digital refractometer on juice obtained from the paste.
Color parameters were evaluated measuring the CIELab attributes red-green coordinate (a*), blue-yellow coordinate (b*), lightness (L*), chroma (C) and hue (h) angle using a high-quality spectrophotometer NS810 (3nh Global, Guangzhou, China). Color index was calculated using the attributes a*, L* and h implemented in the formula suggested by [11].
C o l o r   i n d e x = 1000 a * ( L * + h )
On the other hand, textural properties were evaluated using a TA-XT Plus Texture Analyzer (Stable Micro Systems Ltd., Godalming, UK) fitted with a 35 mm diameter back-extrusion ring assembly. The instrument was employed to quantify four key textural parameters: firmness (Firm), consistency (Con), cohesiveness (Coh), and viscosity index (Wcoh). The analytical protocol was configured with operational parameters comprising a trigger force threshold of 5 g, differential probe velocities of 1 mm/s during the pre-test phase, 5 mm/s throughout the testing sequence, and 10 mm/s for the post-test withdrawal stage. The penetration depth was standardized at 20 mm, adhering to the experimental methodology established by Sert et al. (2017) [25].
Finally, total solids and moisture content were determined gravimetrically through thermogravimetric analysis following standardized drying procedures. Triplicate samples of 2.0 g were accurately weighed into pre-dried aluminum trays and subjected to thermal dehydration in a forced-air convection oven (Venticell, MMM Medcenter Einrichtungen GmbH, Planegg/München, Germany) at 105 °C for 2 h [26].

2.3. Chemical Characterization of Fruits

2.3.1. Capsaicinoid Analysis by High-Performance Liquid Chromatography (HPLC)

Sample Preparation and Extraction
Capsaicinoid extraction was performed following a modified solid-phase extraction protocol. Precisely weighed samples (200 mg) of dehydrated chili paste powder were subjected to methanolic extraction using reagent-grade methanol (Merck, Darmstadt, Germany) at a sample-to-solvent ratio of 1:10 (w/v). The extraction procedure involved mechanical agitation using a vortex mixer for 3 min, followed by ultrasonic-assisted extraction at 45 °C for 20 min to enhance analyte recovery. After sonication, samples were allowed to equilibrate for 3 min to facilitate phase separation.
The resulting supernatant was subjected to solid-phase extraction (SPE) cleanup using pre-conditioned Oasis HLB cartridges (1 mL, Waters Corporation, Milford, MA, USA). Cartridge activation was performed with methanol prior to sample loading, and capsaicinoids were eluted using methanol as the desorption solvent. Purified extracts were stored under controlled conditions at 4 °C until chromatographic analysis.

2.3.2. Chromatographic Analysis

Capsaicinoid quantification was performed using a Hitachi LaChrome Ultra high-performance liquid chromatography system (Hitachi High-Technologies Corporation, Tohio, Japan) equipped with a quaternary pump, inline vacuum degasser, thermostated autosampler, and column oven. Chromatographic separation was achieved on a Luna® C18 reversed-phase column (150 mm × 4.6 mm i.d., 5 μm particle size; Phenomenex Inc., Torrance, CA, USA) maintained at 40 °C. The injection volume was set at 10 μL to ensure optimal peak resolution and sensitivity.
The mobile phase consisted of a binary gradient system comprising (A) ultrapure water (Type I) acidified with 0.1% (v/v) formic acid (Sigma Aldrich, St. Lous, MO, USA) and (B) HPLC-grade methanol (Merck, Darmstadt, Germany). The gradient elution profile was programmed as follows: initial conditions of 23% A (0 min), decreasing to 18.5% A at 7 min, further reduction to 10% A at 8.5 min, followed by 7% A at 12 min, and finally reaching 2% A at 14 min. The flow rate was maintained at 0.8 mL/min throughout the analysis.

2.3.3. Detection and Quantification

Capsaicinoid detection was performed using UV-visible spectrophotometry at 280 nm (Hunter Associates Laboratory, Resto, VA, USA), corresponding to the maximum absorption wavelength of capsaicinoid compounds. Quantitative analysis was conducted using external standard calibration curves constructed with authentic standards of capsaicin and dihydrocapsaicin. Peak identification was based on retention time comparison with reference standards, while quantification was performed by peak area integration.
The analytical method was adapted and optimized from the protocol described by [27]. Data acquisition and processing were performed using EZ-Chrome Elite chromatography software (Hitachi High-Technologies Corporation, Tokyo, Japan).

2.3.4. Total Carotenoids Expressed as β-Carotene

Total carotenoids were determined using an optimized liquid–liquid extraction protocol adapted to the sample matrix. Briefly, 100 mg of powdered paste were extracted with 5.0 mL of a binary solvent system of ethyl acetate/dichloromethane (Sigma Aldrich, St. Lous, MO, USA) (8:2, v/v) supplemented with 0.1% (w/v) butylated hydroxytoluene (BHT) (Anmol chemicals, New York, NY, USA) to prevent oxidative degradation during extraction. The mixture was homogenized by vortex mixing for 3 min, followed by ultrasound-assisted extraction at 40 °C for 30 min to maximize analyte recovery. The resulting extract was filtered to remove particulate matter, evaporated to dryness under controlled conditions, and reconstituted in 5.0 mL HPLC-grade methanol. Quantification was performed by microplate spectrophotometry, transferring 200 µL aliquots to individual wells and recording absorbance at 465 nm. Concentrations were calculated by external calibration using an analytical-grade reference standard of β-carotene and expressed as milligrams of β-carotene equivalents per gram of dry weight (mg g−1 DW) [11].

2.3.5. Quantification of Total Phenolics Compounds (TPC)

Total phenolics were quantified by the Folin–Ciocalteu colorimetric assay adapted to pepper fruit extracts. Briefly, fresh pericarp was homogenized, and 0.50 g of tissue was extracted with 5.0 mL of 80% (v/v) methanol acidified with 0.1% (v/v) HCl. The slurry was vortexed for 1 min, sonicated at room temperature (20–22 °C) for 15 min, and centrifuged at 10,000× g for 10 min; the supernatant was collected and, when necessary, diluted with extraction solvent to fall within the calibration range. For the colorimetric reaction (performed in microtubes), 50 µL of appropriately diluted extract, 425 µL of distilled water, and 125 µL of Folin–Ciocalteu reagent were combined, vortexed, and allowed to stand for 6 min. Then, 400 µL of 7.1% (w/v) Na2CO3 solution was added. After 60 min of incubation in the dark at room temperature, an aliquot of the reaction mixture (200 µL) was transferred to a 96-well microplate, and absorbance was read at 760 nm (pathlength-corrected). A reagent blank (all components except sample) was included and subtracted from sample readings. External calibration was constructed with gallic acid (analytical grade) prepared in the same extraction solvent (e.g., 0–250 mg L−1; R2 ≥ 0.99). All measurements were performed in triplicate (independent extracts). Results were expressed as mg gallic acid equivalents per gram of fresh weight (mg GAE g−1 FW); when required, values were converted to dry-weight basis using the corresponding moisture data.

2.4. Determination of Antioxidant Capacity

2.4.1. DPPH Radical Scavenging (TEAC)

The assay was adapted from [28] for pepper fruit extracts. Extracts were prepared from fresh pericarp as described above (80% methanol, 0.1% HCl), and diluted with the same solvent to fall within the calibration range. A 0.10 mM DPPH solution was prepared in methanol and equilibrated in the dark for 30 min. For the reaction, 20 µL of appropriately diluted extract (or Trolox standard) were mixed with 180 µL of DPPH solution in 96-well microplates. Solvent blanks received 20 µL extraction solvent instead of sample. Plates were incubated 30 min at room temperature in the dark, and absorbance was recorded at 517 nm (pathlength-corrected). Percent inhibition was calculated relative to the solvent control, and antioxidant capacity was obtained by external calibration with Trolox (e.g., 0–400 µM; R2 ≥ 0.99). Results are reported as TEAC: µmol Trolox equivalents per gram of sample (µmol TE g−1), on a fresh-weight basis (convertible to dry weight using moisture data). All measurements were performed in triplicate (independent extracts), including reagent blanks.

2.4.2. Ferric Reducing Antioxidant Power (FRAP; AEAC)

Ferric reducing antioxidant power (FRAP) was determined following [28] protocol with minor modifications. The FRAP working reagent was freshly prepared by mixing 300 mM acetate buffer (pH 3.6), 10 mM TPTZ in 40 mM HCl, and 20 mM FeCl3·6H2O in a 10:1:1 (v/v/v) ratio and equilibrating at 37 °C for 10 min. In microplates, 20 µL of diluted pepper extract (or ascorbic acid standard) were combined with 180 µL of FRAP reagent. After 30 min at 37 °C in the dark, absorbance was read at 593 nm (pathlength-corrected). A calibration curve was prepared with ascorbic acid in extraction solvent (e.g., 0–500 mg L−1; R2 ≥ 0.99). Activities were expressed as AEAC: mg ascorbic acid equivalents per gram of sample (mg AAE g−1 FW; convertible to DW). Analyses were conducted in triplicate with reagent blanks.

2.5. Phenotypic Stability and AMMI Analysis

Physicochemical and chemical data from fruits were analyzed using the open software R 4.4.1 (Core Team, 2023) [29]. Descriptive statistics were computed to estimate means and standard deviations among genotypes and environments, and analysis of variance (ANOVA) were conducted to evaluate the effect of plant growing locations. Classification of the genotypes, based on fruits properties, was conducted applying principal component analysis (PCA) and K-means cluster analysis. Before PCA and cluster analysis, trait means for each accession and environment were standardized using z-scores.
To estimate the proportion of the phenotypic variance accounted by genetic variation among genotypes, the broad-sense heritability (H2) of each trait was calculated according to [30], using the data obtained in Palmira, Cauca, and La Selva trials.
H 2 = σ G 2 σ G 2 + σ G × E 2 1 n + σ 2 1 n × r
where σ2G is the genotypic variance, σ2G×E is the genotype environment interaction (GEI) variance, σ2 is the error variance, n is the number of environments and r is the number of replications.
To identify the more promising genotypes for breeding purposes in the Capsicum spp. collection, we conducted phenotypic stability analysis, using the additive main effect multiplicative interaction (AMMI) model, in traits selected by their relevance in the agroindustry sector. Biplots were used to illustrate the results obtained in the AMMI test. AMMI procedures for calculating traits stability were carried out with the R library Metan [31].

2.6. Genomic Dataset

Genomic data for the 283 Capsicum accessions, including 30 inter-environmental lines from La Selva locality, were obtained through a genotyping-by-sequencing (GBS) approach as described by Vega-Muñoz et al. (2025) [10]. Specifically, DNA extraction followed AGROSAVIA’s in-house protocol (Mosquera, Colombia) using 20 mg of young leaf tissue per accession, and DNA quality and concentration were verified by NanoDrop and Qubit fluorometry. Libraries were prepared using the ApeK1 restriction enzyme and the NEBNext® Ultra™ II kit (Illumina®, San Diego, CA, USA, NEB, Ipswich, MA, USA), quantified with Qubit, and size-checked using the TapeStation 4200. Sequencing was performed on the Illumina HiSeq X Ten platform (Macrogen, Seoul, Republic of Korea), generating paired-end reads. Raw sequences were quality-checked with FastQC [32] and trimmed using Trimmomatic [33], ensuring Phred scores > Q30. Reads were aligned to the Capsicum annuum reference genome GCA_002878395.3 (UCD10Xv1.1) [34], and SNP calling was conducted using the GATK4 HaplotypeCaller pipeline implemented in the automated workflow (GitHub repository: https://github.com/FelipeLopez2019/Mr_Capsicum_SNP_Calling, accessed on 19 June 2024). Alignment statistics were computed with SAMtools v1.9 [35], and the final SNP matrix was filtered in TASSEL v5.2.94 [36] using standard thresholds for genomic prediction (read depth ≥ 3×, missing data ≤ 30%, and MAF ≥ 0.05). Following these quality-control filters, 68,481 polymorphic SNP loci were retained, representing the final high-quality dataset used for genomic heritability estimation and prediction modeling.

2.7. Pre-Breeding Strategy and Genomic Prediction Framework

The pre-breeding strategy envisioned by this study was designed to integrate genomic prediction into the Capsicum improvement pipeline by leveraging the expanded genetic panel [10,11] from La Selva locality. This approach aimed to estimate trait-specific genomic heritability and to develop predictive models trained in a control characterization environment (La Selva research station), subsequently evaluated across contrasting environments to assess their transferability and predictive accuracy.
Initially, the genomic heritability ( h g 2 ) of each quantitative variable was estimated following the additive infinitesimal model Meuwissen et al. 2001 [14], partitioning phenotypic variance into additive ( σ a 2 ) and residual ( σ e 2 ) components using the relationship:
h g 2   =   σ a 2 σ a 2   +   σ e 2
These variance components were computed for each trait using the genomic dataset.
Once the genomic heritability was estimated, a genomic prediction model was constructed using the phenotypic and genotypic data from La Selva environment, representing the baseline characterization reference. This model was subsequently validated in two additional contrasting environments, where its predictive performance was examined using multiple accuracy metrics. The primary indicator of predictive ability (rγ) was computed as the Pearson and Spearman correlation coefficient between observed phenotypic values (y) and predicted genomic-estimated breeding values (GEBVs). In addition, complementary metrics were calculated to provide a more robust assessment of prediction quality, including the mean absolute error (MAE), and the root mean square error (RMSE). These metrics jointly allowed quantifying not only the linear association between observed and predicted values but also their rank consistency and deviation magnitude, thus offering a comprehensive evaluation of the model’s generalization capacity across environments.
The genomic prediction model followed a standard linear mixed framework expressed as:
y = μ 1 n + X β + ε
where y is the vector of phenotypic records, μ the population mean, X the genotype matrix of n individuals and m SNPs, β the vector of SNP effects, and ε the random residual error (ε~N (0, σ2eI)). Model calibration was implemented using six Bayesian and semi-parametric regression methods—BayesA, BayesB, BayesC, Bayesian Ridge Regression (BRR), Bayesian LASSO (BL), and Reproducing Kernel Hilbert Spaces (RKHS)—as implemented in the BGLR R-package [18] each using 10,000 MCMC iterations and 1000 burn-in steps.
All models were executed following the standardized pipeline available at https://github.com/FelipeLopez2019/Genomic_prediction_Lopez-Hernandez-et-al-2023 (accessed on 19 June 2025). Hyperparameters followed the default prior densities described in Pérez and de los Campos 2014 [18].
To avoid overfitting, a five-fold cross-validation (CV) scheme was used for model training and testing, with 80% of the data randomly assigned to the training set and 20% to the validation set in each iteration. For each trait and SNP dataset, the mean predictive ability (rγ), genomic heritability (h2g), and mean squared error (MSE) were retained across folds.
Finally, based on the most robust predictive model (BayesC), we identified genotypes with superior genomic breeding values (GBVs). The best-performing subset derived from La Selva training environment offers a putative genomic selection panel, suggested for initial deployment within La Selva breeding population. This panel will serve as a basis for early-stage genomic selection and for optimizing the selection of superior lines with predicted high stability and performance across environments.

3. Results

3.1. Genotype × Environment Interaction (GEI) for Chemical and Physicochemical Traits

A significant effect of location was observed for all evaluated chemical and physicochemical traits (Figure 1). Additionally, significant GEIs were observed for all the traits (Table 1). The pH and cohesiveness values exhibited lower variation within locations, whereas color property values were less homogeneous. The color luminosity (L*) of the fruits was higher in plants grown at La Selva and Palmira than in those grown in Cauca. Similarly, significant differences in β-carotene concentration and total phenol content were observed among locations. Broad-sense heritability (H2) values were high for most traits; the largest values were observed for humidity, total solids, brix and firmness while the lowest values were found for β-carotene, total phenol content, and antioxidant capacity (Table 1).
The contributions of genotypes, environments, and GEI to the evaluated traits were calculated using the linear model applied in ANOVA. For most traits, residual values were low, ranging from 2% to 16%, indicating a good overall fit of the model. The highest value, 35%, was observed for the capsaicin concentration (Table 1). For most traits, the total amount of variance attributed to Capsicum accessions was predominant (from 30% to 72%). However, for pH, color attributes, total phenol content and cohesiveness, this value was lower than the proportion attributed to the environment. The traits most influenced by GEI were β-carotene concentration and antioxidant activity, while pH, total solids content and humidity percentage were the least influenced (Table 1).

3.2. Capsicum Classification by Cluster Analysis and PCA Revealed Different Fruit Properties at Three Locations

Principal component analysis (PCA) followed by k-means clustering was applied to classify Capsicum genotypes using fruit chemical and physicochemical descriptors measured across three field environments. All variables were z-standardized to equalize scale and ensure that principal components (PCs) reflected covariance structure rather than measurement units. The number of clusters (k = 3) was selected using the silhouette criterion (Figure S1). Across environments, the first two PCs accounted for >52% of total phenotypic variance (Figure 2). PC1 was dominated by textural traits—cohesiveness, consistency, and firmness exhibited the strongest loadings, whereas PC2 was driven by colorimetric parameters (L*, b*, h°) with a notable contribution from the color index, underscoring the chromatic basis of this axis.
Clustering patterns were environment specific. In Cauca, groups separated mainly along PC1, consistent with contrasts in biochemical composition (e.g., β-carotenoid and antioxidant capacity-DPPH and FRAP). In Palmira and La Selva, dispersion occurred mostly along PC2, indicating marked differences in lightness (L*) and yellowness (b*). Examination by site revealed no persistent geographic clustering suggesting that environmental effects did not overwhelm the underlying genetic structure. The K-means consistently resolved three phenotypic fruit types: (i) high-lightness/yellow fruits (elevated L*, b*, h°), (ii) high-red/intense fruits (greater a*, chroma, and color index), and (iii) bioactive-enriched fruits (higher β-carotenoids and antioxidant capacity, with favorable texture). In Cauca, the commercial accessions Cayena, Habanero, and Rocoto co-clustered (blue cluster), exhibiting desirable combinations of cohesiveness, moisture, antioxidant capacity, and color index. Genotypes 36 and 198 showed phenotypic stability, clustering together across all environments with consistently high consistency and cohesiveness (Table S1).
These results show that an integrated PCA–k-means framework provides a basis for identifying genotype sets with commercially desirable trait combinations, thereby informing pre-breeding and selection strategies aimed at developing Capsicum varieties aligned with agro-industrial specifications and market needs.

3.3. Broad and Narrow Sense Performance of Capsicum Genotypes for Industry Relevant Traits

The elucidation of GEI in the multi-environment trials was conducted by applying an additive main effect multiplicative interaction (AMMI) analysis. This test integrates additive effects for genotypes and environments with multiplicative effects for GEI, estimated by PCA [37]. A selection of four out of 17 traits was made for the purpose of conducting an AMMI analysis, with the selection being based on their importance in the food industry. The parameters in question were pH, humidity, firmness, and cohesion. The influence of genotypes, environments, and GEI was significant in the evaluated traits (Table 2).
The analysis of variance (ANOVA) for pH, humidity, firmness, and cohesiveness revealed that PC1 accounts for at least 58.5% and PC2 for 10.1% of the GEI variance (Figure 3). The three environments contributed significantly to GEI. For pH, genotypes positioned close to the biplot origin (Habanero, cayenna and 209) exhibited minimal contribution to GEI, indicating a stable pH across environments (Figure 3a). For humidity, this was observed for genotypes 61, 139 and 209 (Figure 3b). For firmness and cohesiveness, the most stable genotypes across locations were identified as 113 and 139, respectively (Figure 3c,d). Commercial accessions, such as rocoto and habanero, exhibited superior performance in terms of firmness and cohesiveness when cultivated in Palmira. Conversely, rocoto demonstrated a narrow adaptation, showing an affinity for higher pH levels in Cauca.
Additive main effect multiplicative interaction (AMMI) analysis revealed two types of adaptation: broad adaptation, which is applicable to accessions stable across environments, and narrow adaptation, which refers to genotypes that demonstrated superior performance in a single locality (Table 3). This approach generally permits the selection of accessions, particularly those regarded as stable across various environments, as well as those that exhibited adaptation to particular sites. Genotypes with the best stability across localities were identified in the evaluated panel. The selection criterion for stability was based on the distance from the origin to the first component in the AMMI analysis for the point representing each genotype and phenotypic stability index (PSI), which incorporates both mean of the trait and stability in a single criterion. Low values of PSI show desirable accessions with high means and stability (Table 3). On the other hand, predicted values of the selected traits were calculated across the analyzed locations (Table S2).

3.4. Genomic Heritability Reveals High Additive Control of pH and Pungency Traits

Narrow sense genomic heritability (h2) estimates across traits indicated that pH and pungency-related compounds exhibited the strongest additive genetic control within the Capsicum panel. Among all analyzed traits, pH showed the highest genomic heritability, with values ranging between 0.470 and 0.484 across Bayesian models and a standard deviation below 0.003. Similarly, capsaicinoid-related traits showed moderate-to-high heritabilities: h2 = 0.344 ± 0.006 for total capsaicinoids, 0.394 ± 0.004 for capsaicin, and 0.476 ± 0.003 for dihydrocapsaicin, suggesting strong additive genetic contributions to pungency (Table 4).
These results confirm that both pH and alkaloid biosynthetic traits were predominantly controlled by additive loci, with limited environmental noise, which supports their use as primary targets for genomic selection and prediction. Such high and stable heritability values reinforce the potential of incorporating genomic prediction for improving fruit quality and pungency-related attributes in Capsicum spp.

3.5. BayesC Achieved the Highest Predictive Accuracy and Model Stability

Among the evaluated models (BayesA, BayesB, BayesC, Bayesian Ridge Regression, Bayesian LASSO, and RKHS), BayesC consistently outperformed all others in predictive ability and stability across traits. It yielded the lowest mean squared errors (MSE) in both training and testing datasets, and the highest validation correlations R_testing, confirming its superior capacity to capture relevant additive effects while minimizing overfitting (Table S3).
For example, BayesC achieved R_testing = 0.94 for pH, 0.94 for Brix, 0.97 for dry matter, and above 0.93 for most texture and color parameters. In the case of pungency traits, BayesC maintained high predictive consistency (r = 0.94 for capsaicinoids, 0.98 for capsaicin, and 0.94 for dihydrocapsaicin) with the lowest residual variance among all models. The robust performance of BayesC arises from its hierarchical prior structure, which incorporates both shrinkage and sparsity, allowing non-informative markers to be effectively down-weighted. This property makes BayesC particularly suited for moderate-size genomic datasets, such as the Colombian Capsicum germplasm, where the number of SNPs far exceeds the number of observations.

3.6. Genomic Prediction Models Trained in La Selva Showed Broad Environmental Transferability

The genomic prediction model trained in La Selva environment displayed high predictive accuracy and strong transferability across distinct agroecological zones (Figure 4). Validation metrics, including Pearson and Spearman correlation coefficients, mean absolute error (MAE), and root mean square error (RMSE), indicated excellent model performance in La Selva, with the expected reduction in predictive accuracy under contrasting environmental conditions (Table 5). For pH, the predictive ability in La Selva reached a Pearson correlation of 0.976 and a Spearman correlation of 0.960, with MAE and RMSE values of 0.0768 and 0.0981, respectively. When applied to the external environments of Palmira and Cauca, the predictive correlations decreased to 0.708 and 0.644, respectively, indicating a moderate but consistent level of transferability.
Similarly, for pungency-related traits, the model maintained high predictive consistency across environments. In La Selva, correlation values were 0.949 for total capsaicinoids, 0.914 for capsaicin, and 0.963 for dihydrocapsaicin, whereas in Palmira, the correlations declined to 0.617, 0.543, and 0.729, respectively. These results demonstrate that the BayesC model retains substantial predictive capacity when transferred between environments, especially for traits exhibiting high genomic heritability such as pH and pungency components. Consequently, La Selva environment can serve as a robust calibration reference for the development and deployment of genomic recommendation models applicable across multiple production regions within the Colombian Capsicum improvement program.

3.7. Genomic Breeding Value (GBV) Ranking in La Selva Environment

The genomic breeding values (GBVs) estimated under La Selva environment provided a robust quantitative framework for ranking Capsicum accessions according to their predicted performance for key quality and pungency-related traits. Rankings were derived from the BayesC model calibrated with genomic and phenotypic data from La Selva, which exhibited the highest predictive ability across environments (Table S4).
For pH, accessions 113, 70, and 277 exhibited the highest positive GBVs (0.396, 0.337, and 0.274, respectively), suggesting stable buffering capacity and favorable fruit acidity levels under the temperate highland conditions of La Selva. Regarding total capsaicinoids, accessions 160, 161, and 231 showed the strongest predicted accumulation (5.18, 3.20, and 2.68 mg g−1, respectively). Similarly, for capsaicin, accessions 160, 161, and 178 reached the highest predicted concentrations (4.57, 2.77, and 1.99 mg g−1, respectively). In the case of dihydrocapsaicin, accessions 231, 222, and 280 exhibited the top predicted values (0.43, 0.39, and 0.38 mg g−1, respectively), with accession 160 also ranking within the upper range across all pungency-related traits.
This cross-trait consistency indicates a shared genetic basis for capsaicinoid biosynthesis and supports the use of these genotypes as elite donors in breeding programs aimed at enhancing flavor intensity and biochemical uniformity in Capsicum spp. Interestingly, several of the top-performing genotypes in La Selva—particularly 160, 161, 231, 168, 216, and 260—exhibited consistently high GBVs for pungency-related traits (Table 6), confirming their strong additive potential for total capsaicinoid production and metabolite stability. Accessions 160 and 161 showed superior performance across all pungency metabolites, while 231 and 168 stood out for dihydrocapsaicin accumulation. Additionally, 216 and 260 maintained intermediate-to-high rankings across traits, suggesting broad adaptability and potential for use as multi-environment donors.
In parallel, accession 113 maintained one of the highest GBVs for fruit pH, reflecting a stable buffering capacity that is desirable for processing quality and product stability. The co-occurrence of these high-performing accessions among the 30 inter-environmental lines evaluated in Palmira and Cauca further supports the transferability of genomic predictions across distinct agroecological conditions.
Moreover, the consistent ranking of accessions such as 113 (high pH and moderate pungency) and 160 or 231 (extremely high capsaicinoid content) suggested the potential to design genomic selection panels combining genotypes with complementary industrial attributes. These lines could be strategically integrated into Capsicum pre-breeding pipeline to exploit additive GEI and enhance both flavor stability and heat intensity in future breeding populations.
Overall, La Selva-based GBV ranking demonstrated the feasibility of implementing a genomic-assisted selection strategy capable of identifying elite accessions within a single, broad-characterization environment while ensuring predictive transferability across other agroecological contexts such as Palmira and Cauca, as validated by phenotypic stability analysis.

4. Discussion

Capsicum breeding programs are mainly focused on plant yield attributes [2], often disregarding other traits such as chemical and physicochemical properties that are crucial for several industrial sectors. Evaluating the performance and phenotypic stability of diverse accessions across environments is therefore essential to identify genotypes best suited for specific processing or industrial purposes. This information is particularly valuable to accelerate progress in conventional breeding programs that must respond to the demands of growers, processors, and the chili pepper industry [6]. In the present study, we characterized the Capsicum collection through multivariate analysis and the estimation of heritability for key physicochemical and phytochemical traits. These results provide a relevant basis to address local breeding challenges using the Colombian germplasm collection. Building on this general characterization, we first examined the extent and structure of phenotypic diversity across environments to identify the major axes of variation underlying fruit quality traits.

4.1. Phenotypic Diversity Across Environments

The physicochemical and biochemical profiling of Capsicum fruits revealed substantial phenotypic dispersion both among genotypes and across sites, consistent with a pronounced G × E component. Quality-related traits—particularly texture (firmness, consistency, cohesiveness), color attributes (L*, a*, b*, h°, color index), and total solids—exhibited wide interspecific ranges, corroborating the breadth of diversity present in the Colombian Capsicum collection. These observations align with prior reports documenting marked morphological and metabolic variability within domesticated Capsicum spp. [7,8,9].
The differential environmental responsiveness observed across trait categories in this study aligns with emerging evidence from multi-environment crop studies, revealing fundamental differences in the genetic architecture and environmental sensitivity of mechanical versus optical fruit properties. The structure of phenotypic variation demonstrated clear trait-dependent patterns, with texture-related attributes (cohesiveness, consistency, firmness) primarily explaining the first principal component, while chromatic parameters (L*, b*, hue angle) dominating the secondary axis of variation. This orthogonal segmentation suggests partially independent physiological control mechanisms governing mechanical versus pigment-associated properties, consistent with the distinct genetic pathways underlying cell wall modification and carotenoid biosynthesis [38]
The greater environmental responsiveness of optical traits (L*, b*) compared to compositional attributes (pH, moisture) observed in our study is supported by recent findings in Capsicum multi-environment trials. Tripodi et al. (2018) [39] demonstrated that environmental factors explained the variation for carotenoids, ascorbic acid, and tocopherols in hot pepper varieties, whereas structural compounds like capsaicinoids remained remarkably stable across environments. This differential sensitivity reflects the inherent biological constraints of these trait categories: colorimetric attributes are directly influenced by light quality, temperature fluctuations, and water stress through their effects on carotenoid biosynthesis and degradation pathways [40], while basic compositional traits operate within narrower physiological ranges dictated by cellular homeostasis requirements [41].
The trait-specific GEI documented in our analysis are consistent with broader patterns reported across crop species. Multi-environment association mapping in Capsicum has revealed quantitative trait loci (QTLs) showing significant interactions with environmental gradients, with many QTLs affecting phenotypic plasticity rather than trait means alone [42]. Notably, genome-wide association studies have identified distinct sets of candidate genes for texture-related traits (including cell-wall modifying enzymes) versus color-determining loci (cytochrome P450s, MYB transcription factors), reinforcing the independent genetic control of these trait categories [43].
The observed environmental stability of texture and total solids traits have important implications for breeding strategy optimization. Singh et al. demonstrated that structural traits including pericarp thickness and fruit dimensions exhibit moderate to high heritability with consistent genetic effects across environments, making them reliable targets for broad-adaptation breeding programs. Conversely, the site-specific variation in chromatic parameters suggests that color-related selection should incorporate environment-specific evaluation protocols, particularly for market classes where specific color attributes are critical quality determinants [44].
These findings collectively support a trait-stratified approach to multi-environment breeding, where mechanically related properties provide transferable differentiation across production environments, while optical attributes warrant location-specific optimization. Such strategies can maximize genetic gains by exploiting stable QTLs for broad adaptation while leveraging environment-specific genetic effects for targeted market development [45]. Furthermore, the orthogonal nature of texture and color variation axes suggests opportunities for simultaneous improvement of both trait categories without significant trade-offs and negative correlations, facilitating the development of varieties combining superior processing characteristics with desired visual appeal.
From a breeding perspective, the observed phenotypic heterogeneity represents a strategic asset rather than a limitation, expanding the genetic search space for complementary trait combinations while facilitating the pyramiding of attributes critical to agro-industrial applications. This heterogeneity can be leveraged through advanced selection strategies such as genomic-assisted recurrent backcrossing (GABC), capable of exploiting GEI. Some recent advances [46,47] in phenomic selection demonstrate similar behavior to that found in this work, but using phenotyping techniques based on near-infrared spectral data captured across genotype × environment combinations. Crossa et al. (2022) [48] found that they can model non-additive G × E components with prediction accuracies comparable to genomic approaches, enabling sparse testing while maintaining selection efficiency. Such approaches are particularly valuable for identifying complementary trait combinations, such as high color index paired with favorable texture profiles, by capturing the full covariance structure of trait expression across environments. While phenotypic diversity reflects the combined effects of genotype and environment, quantifying heritability provides a complementary perspective on the genetic determinism of key traits. Therefore, we next focused on identifying physicochemical traits with high heritable components and direct industrial relevance.

4.2. Heritable Physicochemical Traits of Industrial Value

Fruit pH emerged as a highly heritable and stable trait across environments, showing low intra-location variability and a broad-sense heritability of 0.83. Moreover, its genomic heritability reached 0.49, indicating that a large portion of variance is attributable to additive genetic effects [49]. This trait is critical for fruit preservation, flavor, and microbial stability [50], and thus of major relevance for the food and fermentation industries. Similarly, moisture content exhibited minimal environmental influence (4%) and high heritability (0.93), suggesting strong genetic control. Cohesiveness and firmness followed similar trends, both being highly heritable and fundamental to processing quality, texture, rehydration capacity, and structural stability of chili-derived products [50]. These findings reinforce the potential of these traits as early-selection targets for food industry-oriented breeding programs. Beyond physicochemical properties, biochemical traits such as phenolics and carotenoids contribute critically to industrial and nutritional quality. To understand how these secondary metabolites respond to environmental variation, we evaluated their compositional plasticity across contrasting environments.

4.3. Environmental Effects on Biochemical Composition

Phenolic compounds and carotene content in peppers are of particular interest to the pharmaceutical and nutraceutical industries because of their antioxidant properties and potential health benefits [51]. The substantial environmental variation observed across locations in our study, with Palmira and Cauca exhibiting higher bioactive compound levels compared to the cooler La Selva environment, is consistent with recent multi-environment studies [39,52] demonstrating that carotenoid concentrations are strongly influenced by location and year effects, while phenolic acid variation is primarily driven by annual climatic conditions and GEI [53]. The differential response patterns observed—where total phenolic content variation was largely explained by environment while carotenoid variation showed stronger GEI effects—align with controlled temperature studies showing that elevated temperatures (30 °C vs. 20 °C) significantly enhance carotenoid biosynthesis through upregulation of key pathway genes, including a 5.5-fold increase in phytoene synthase expression, alongside concurrent increases in total phenolics and ascorbic acid content [54].
The lower broad sense heritability estimates observed for these bioactive compounds compared to physicochemical traits reflect their greater environmental sensitivity, biosynthetic phenomena that extend beyond temperature or altitude effects to encompass complex interactions, e.g., light × temperature interaction. Postharvest studies have shown that specific light spectra can induce dramatic biochemical changes in carotenoid expression in peppers—red light (660 nm) increases total carotenoids by more than 3.5 times in 24 h, while exposure to far-red light (730 nm) for 72 h can increase capsaicinoid content by more than 8 times, thereby driving phytochrome-mediated regulation of phenylpropanoid biosynthesis (Pashkovskiy et al., 2023) [55]. This example of context-dependent biochemical plasticity represents a fundamental feature of secondary metabolite production in Capsicum species, where the interaction between genetic potential and environmental triggers the final expression of bioactive compound profiles crucial for both nutritional quality and commercial value. Given the contrasting environmental sensitivity of physicochemical versus biochemical traits, we next assessed the heritability patterns underlying fruit quality parameters to elucidate their potential for genetic improvement.

4.4. Heritability Estimates for Fruit Quality Traits

Understanding trait heritability is fundamental for achieving higher genetic gains in breeding programs and ensuring that new varieties meet consumer and market demands. The high broad heritability values observed here suggest that fruit texture, moisture content, and pH represent promising starting points for the development of food-industry-oriented breeding schemes. Recent genomic studies have identified candidate loci underlying these traits in Capsicum [10,11], including SNPs associated with firmness and cohesiveness distributed across multiple chromosomes. Functional annotation of these loci revealed genes involved in cell wall remodeling and cuticular structure, indicating a polygenic genomic architecture governing fruit texture [10]. To translate these trait-level insights into practical breeding implications, we used multivariate analyses to integrate the phenotypic data and classify genotypes according to their industrial suitability.

4.5. Multivariate Classification and Industrial Grouping

Multivariate approaches such as PCA and hierarchical clustering have proven effective in capturing phenotypic diversity and grouping accessions with similar industrial potential [11,13,56]. In the present study, PCA and cluster analysis distinguished three main genotype groups characterized by distinct combinations of fruit physicochemical properties: (i) soluble solids, firmness, consistency, and total solids, (ii) color and pH parameters, and (iii) texture-related traits such as cohesiveness and work of cohesion. One cluster—comprising accessions 12, 29, 36, 129, and 198—displayed superior performance for group (i) traits, while accessions 54, 60, 88, and 113 were distinguished by higher color and acidity values. These patterns indicate that multivariate classification can guide the identification of contrasting genotypes for targeted industrial purposes within the Colombian germplasm.
A previous classification of the same genotypes based on physicochemical and phytochemical descriptors identified three industrial categories—pharmaceutical, food, and cosmetic [11]. The clusters identified in the present study correspond closely to these functional groupings, reinforcing the utility of multivariate analysis as a tool to align breeding strategies with specific industrial value chains. Identification of broadly adapted genotypes is especially important given that Capsicum spp. are highly sensitive to climate variability [56], and agronomic and phytochemical traits can be strongly affected by GEI [13]. Because industrial performance ultimately depends on trait stability under variable conditions, we further explored genotype × environment interactions to identify stable and broadly adapted accessions

4.6. Genotype × Environment Interactions (GEIs) and Phenotypic Stability

The analysis of variance (ANOVA) and the AMMI model further revealed significant genotype × environment interactions for most traits, confirming that the performance of genotypes was strongly influenced by environmental conditions. Nonetheless, some genotypes exhibited low interaction scores and high mean values, indicating phenotypic stability across contrasting environments. These accessions are valuable as stable parents for selection in breeding programs, especially those targeting fruit quality traits with industrial relevance. The combination of high trait expression and low GEI magnitude is an important criterion for identifying broadly adapted genotypes under variable agroecological conditions.
The estimation of broad-sense heritability (H2) indicated that traits such as pH, firmness, and total soluble solids have a considerable genetic component, suggesting that additive and dominant effects explain a large portion of their phenotypic variance. In contrast, color parameters exhibited lower heritability, implying a stronger environmental component. These differences reflect the complex nature of fruit traits, where biochemical pathways and environmental responses interact to shape final phenotypes. The high heritability values obtained for physicochemical attributes suggest that these traits can be improved effectively through recurrent selection or, more efficiently, via genomic-assisted approaches. To complement the phenotypic analyses, we incorporated genomic data to dissect the additive genetic basis of key traits and evaluate the predictive performance of different genomic models.

4.7. Genomic Heritability and Model Performance

Beyond the phenotypic assessment, the integration of genomic prediction provided a deeper understanding of the genetic control underlying key industrial traits in Capsicum spp. The high genomic heritability (h2) values estimated for pH and pungency-related compounds confirmed that these traits are strongly governed by additive genetic effects, supporting their suitability for marker-assisted and genomic selection. Specifically, the BayesC model revealed h2 values of 0.484 for pH, 0.344 for total capsaicinoids, 0.394 for capsaicin, and 0.476 for dihydrocapsaicin. Although estimates of genomic-based narrow-sense heritability for these traits are scarce in Capsicum, the magnitudes observed here are consistent with the highly additive genetic architecture reported for related metabolic and quality traits in other Solanaceae species [57,58], where key biosynthetic pathways—such as phenylpropanoid and capsaicinoid synthesis—are under strong additive control.
Among all models tested, BayesC achieved the highest predictive accuracy and lowest error variance across traits and environments. This performance arises from its selective-shrinkage Bayesian structure, which effectively captures major-effect loci while minimizing noise from small-effect SNPs—a critical advantage for moderately sized training populations. Similar model superiority has been reported in other crops including common beans, maize, sorghum, and rice [16,18,37], highlighting the robustness of BayesC under high-dimensional genomic data. Therefore, BayesC represents a reliable and computationally efficient method for genomic prediction in Capsicum, particularly when integrating multi-environment datasets.
This study applied single-trait genomic prediction models; therefore, genotype-by-trait (genetic) correlations were not estimated. Given the limited number of genotypes with multi-environment phenotyping and the heterogeneous heritability profiles, multivariate genomic models were not feasible. Nonetheless, closely related traits (e.g., capsaicin and dihydrocapsaicin) showed consistent GEBV rankings, suggesting shared additive control. Future multivariate genomic analyses could formally quantify genetic correlations and co-selection potential among traits.
A crucial question for practical breeding is whether genomic models trained in one environment can be effectively applied to others. Thus, we assessed the cross-environment transferability of prediction accuracy.

4.8. Cross-Environment Transferability of Genomic Models

The cross-environment validation demonstrated that genomic models trained under La Selva environment maintain strong predictive power when applied to contrasting locations such as Palmira and Cauca. Predictive correlations reached up to r = 0.976 for pH and 0.963 for dihydrocapsaicin in La Selva, with moderate but consistent transferability (0.54–0.73) across other environments. These results suggest that a single calibration environment, characterized by moderate stress and wide variability, can serve as a robust reference for model training, reducing the need for multi-environment calibration phases. This finding parallels evidence from cereals and legumes, where genomic models trained under representative conditions retain significant predictive ability across target environments [59,60]. Consequently, La Selva emerges as a key environment for developing and calibrating genomic selection pipelines within the Colombian Capsicum improvement network. Having validated the robustness of the genomic models across environments, we next ranked the genotypes by their genomic breeding values to identify elite candidates for selection.

4.9. Genomic Breeding Value Ranking and Candidate Selection

The ranking of accessions by their genomic breeding values (GBVs) in La Selva further enabled the identification of superior accessions with stable additive potential. Genotypes such as 113, 70, and 277 exhibited high predicted pH stability, while lines 160, 161, and 231 displayed outstanding potential for total capsaicinoid and capsaicin accumulation. Importantly, several of these top-ranked genotypes—113, 129, 139, 149, 193, 216, 257, 260, and 261—were among the 30 inter-environmental accessions shared with Palmira and Cauca, maintaining consistent performance across locations. The GBVs allow the prediction of “superior” individuals at different environmental conditions. Such accessions might be selected as parental lines for hybridization in a breeding program [6]. This concordance confirms the robustness of genomic predictions and the potential of these lines as multi-environment donors for pre-breeding and hybridization. The convergence of high GBVs in acidity and pungency traits also suggests opportunities for complementary parental combinations, enabling the development of cultivars with optimized flavor, quality, and industrial suitability. Together, these integrative analyses demonstrate that combining phenotypic and genomic data across environments can substantially accelerate the improvement of Capsicum germplasm for diverse industrial applications.

4.10. Integrative Framework for Genomic-Assisted Breeding

The combination of phenotypic stability (AMMI-based) and genomic prediction results provides a comprehensive framework for identifying elite Capsicum genotypes that combine high performance, phenotypic stability, and genomic prediction accuracy across environments. Traits such as pH and capsaicinoid content, characterized by high heritability and robust genomic predictability, can now be targeted for early selection in genomic-assisted breeding cycles. Integrating these genomic tools into breeding pipelines will accelerate genetic gain, enhance selection precision, and reduce phenotyping costs in multi-environment trials.
Overall, this study represents the first implementation of a genome-enabled prediction framework for Capsicum germplasm improvement in Colombia. By coupling high-throughput GBS data, Bayesian genomic models, and multi-environment validation, it establishes the foundation for a national genomic selection platform. Such integration will strengthen the capacity to develop new Capsicum varieties with enhanced quality, stability, and adaptability—addressing both local production needs and global market demands.

5. Conclusions

Multi-environment evaluation of Capsicum genotypes revealed broad, trait-dependent phenotypic variation, with significant genotype, environment, and G × E effects on fruit quality attributes. Integrating stability analysis (AMMI) with genomic prediction models provided a coherent framework to prioritize traits and accessions exhibiting high performance and environmental robustness. Collectively, heritability estimates and prediction accuracies support the feasibility of marker-assisted and genome-enabled selection for agro-industrial targets. This approach enables the identification of promising candidates for pre-breeding and the definition of complementary parental combinations. Operationally, the findings demonstrate how characterizing phenotypic diversity, quantifying G × E, and linking phenotypic stability with genomic prediction can guide breeding decisions under heterogeneous production conditions. Building upon this phenotypic foundation, we integrated the physicochemical and biochemical fruit characterization with last-generation sequencing data through the genomic prediction framework order to estimate trait-specific heritability and predictive accuracy using Bayesian models trained on the full 283-accession panel. This combined framework—linking stability analysis with genome-enabled prediction—provides a novel methodological contribution for identifying genotypes that combine high performance, environmental robustness, and additive genetic potential.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/agronomy15122690/s1, Figure S1: Number of optimal k-means determined by the silhouette method in the three locations; Table S1: Results of the PCA and cluster analysis showing the genotypes that comprised the clusters at La Selva, Palmira and Cauca; Table S2: Predicted values for the evaluated traits with the AMMI method at La Selva, Palmira and Cauca; Table S3: Results and coefficients obtained after model applications for genomic prediction; Table S4: Genomic predicted values for the selected traits in the evaluated genotypes.

Author Contributions

S.P.-L. formal analysis, investigation, data curation, writing—original draft preparation, review and editing, visualization. F.L.-H. Conceptualization, methodology, data curation, formal analysis, writing—original draft preparation, review and editing, visualization. G.A.O.-O. Methodology, formal analysis, data curation, writing—original draft preparation. G.M. Conceptualization, review and editing, supervision, funding acquisition. J.C.H.-R. Conceptualization, methodology, data curation, review and editing, supervision, funding acquisition. A.J.C. Conceptualization, data interpretation, writing—original draft preparation, review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Ministry of Science, Technology, and Innovation of the Colombian Government throughout grant 80740-451-2021, specifically funding from the subproject “Selection of Capsicum spp. materials with differential attributes and competitive agronomic and agroindustrial advantages” (code number 86924).

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Materials. Further inquiries can be directed to the corresponding authors.

Acknowledgments

We express our sincere gratitude to the Technical Unit for Professional Development (UTEDÉ), the Colombian Agricultural Research Corporation (AGROSAVIA), the Ministry of Agriculture and Rural Development (MADR), the Ministry of Science (MinCiencias), Hugo Restrepo & CIA, and ICESI University for their institutional support and the allocation of research time for this work. Thank you to the Ministry of Science, Technology, and Innovation of the Colombian Government throughout grant 80740-451-2021, specifically funding from the subproject “Selection of Capsicum spp. materials with differential attributes and competitive agronomic and agroindustrial advantages” (code number 86924).

Conflicts of Interest

Authors Sebastian Parra-Londono and Gustavo Ossa-Ossa were employed by the company Unidad Técnica para el Desarrollo Profesional (Utede); Authors Felipe López-Hernández and Juan Camilo Henao-Rojas were employed by the company Corporación Colombiana de Investigación Agropecuaria (AGROSAVIA). The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Antonio, A.S.; Wiedemann, L.S.M.; Veiga Junior, V.F. The Genus Capsicum: A Phytochemical Review of Bioactive Secondary Metabolites. RSC Adv. 2018, 8, 25767–25784. [Google Scholar] [CrossRef]
  2. Herath, H.M.S.N.; Rafii, M.Y.; Ismail, S.I.; Nakasha, J.J.; Ramlee, S.I. Improvement of Important Economic Traits in Chilli through Heterosis Breeding: A Review. J. Hort. Sci. Biot. 2021, 96, 14–23. [Google Scholar] [CrossRef]
  3. FAOSTAT. Available online: https://www.fao.org/faostat/en/#data/QCL (accessed on 16 October 2025).
  4. AGRONET Estadísticas Agropecuarias. Available online: https://www.agronet.gov.co/estadistica/Paginas/home.aspx (accessed on 16 October 2025).
  5. Verma, V.K.; Pandey, A.; Thirugnanavel, A.; Rymbai, H.; Dutta, N.; Kumar, A.; Bhutia, T.L.; Jha, A.K.; Mishra, V.K. Ecology, Genetic Diversity, and Population Structure among Commercial Varieties and Local Landraces of Capsicum spp. Grown in Northeastern States of India. Front. Plant Sci. 2024, 15, 1379637. [Google Scholar] [CrossRef] [PubMed]
  6. Lozada, D.N.; Bosland, P.W.; Barchenger, D.W.; Haghshenas-Jaryani, M.; Sanogo, S.; Walker, S. Chile Pepper (Capsicum) Breeding and Improvement in the “Multi-Omics” Era. Front. Plant Sci. 2022, 13, 879182. [Google Scholar] [CrossRef] [PubMed]
  7. Tripodi, P.; Rabanus-Wallace, M.T.; Barchi, L.; Kale, S.; Esposito, S.; Acquadro, A.; Schafleitner, R.; Van Zonneveld, M.; Prohens, J.; Diez, M.J.; et al. Global Range Expansion History of Pepper (Capsicum spp.) Revealed by over 10,000 Genebank Accessions. Proc. Natl. Acad. Sci. USA 2021, 118, e2104315118. [Google Scholar] [CrossRef] [PubMed]
  8. Pereira-Dias, L.; Vilanova, S.; Fita, A.; Prohens, J.; Rodríguez-Burruezo, A. Genetic Diversity, Population Structure, and Relationships in a Collection of Pepper (Capsicum spp.) Landraces from the Spanish Centre of Diversity Revealed by Genotyping-by-Sequencing (GBS). Hortic. Res. 2019, 6, 54. [Google Scholar] [CrossRef]
  9. Viáfara-Vega, R.A.; Cárdenas-Henao, H. Identification of Capsicum Species from Colombia by DNA Barcoding and High Resolution Melting (HRM) Analysis. Genet. Resour. Crop Evol. 2025, 72, 2277–2286. [Google Scholar] [CrossRef]
  10. Vega-Muñoz, M.A.; López-Hernández, F.; Cortés, A.J.; Roda, F.; Castaño, E.; Montoya, G.; Henao-Rojas, J.C. Pangenomic and Phenotypic Characterization of Colombian Capsicum Germplasm Reveals the Genetic Basis of Fruit Quality Traits. Int. J. Mol. Sci. 2025, 26, 8205. [Google Scholar] [CrossRef]
  11. Castaño, E.; Vega-Muñoz, M.A.; Grisales-Vásquez, N.Y.; Loaiza-Loaiza, O.A.; Henao-Rojas, J.C.; Montoya, G. Capsicum Germplasm Targeted Valorization Using Physicochemical and Phytochemical Descriptors. Front. Sustain. Food Syst. 2025, 9, 1571012. [Google Scholar] [CrossRef]
  12. Wang, T.-C.; Rose, T.; Zetzsche, H.; Ballvora, A.; Friedt, W.; Kage, H.; Léon, J.; Lichthardt, C.; Ordon, F.; Snowdon, R.J.; et al. Multi-Environment Field Trials for Wheat Yield, Stability and Breeding Progress in Germany. Sci. Data 2025, 12, 64. [Google Scholar] [CrossRef]
  13. McLeod, L.; Barchi, L.; Tumino, G.; Tripodi, P.; Salinier, J.; Gros, C.; Boyaci, H.F.; Ozalp, R.; Borovsky, Y.; Schafleitner, R.; et al. Multi-environment Association Study Highlights Candidate Genes for Robust Agronomic Quantitative Trait Loci in a Novel Worldwide Capsicum Core Collection. Plant J. 2023, 116, 1508–1528. [Google Scholar] [CrossRef]
  14. Meuwissen, T.H.E.; Hayes, B.J.; Goddard, M.E. Prediction of Total Genetic Value Using Genome-Wide Dense Marker Maps. Genetics 2001, 157, 1819–1829. [Google Scholar] [CrossRef]
  15. Gianola, D.; Fernando, R.L.; Stella, A. Genomic-Assisted Prediction of Genetic Value With Semiparametric Procedures. Genetics 2006, 173, 1761–1776. [Google Scholar] [CrossRef] [PubMed]
  16. López-Hernández, F.; Villanueva-Mejía, D.F.; Tofiño-Rivera, A.P.; Cortés, A.J. Genomic Prediction of Adaptation in Common Bean (Phaseolus vulgaris L.) × Tepary Bean (P. acutifolius A. Gray) Hybrids. Int. J. Mol. Sci. 2025, 26, 7370. [Google Scholar] [CrossRef] [PubMed]
  17. De Los Campos, G.; Hickey, J.M.; Pong-Wong, R.; Daetwyler, H.D.; Calus, M.P.L. Whole-Genome Regression and Prediction Methods Applied to Plant and Animal Breeding. Genetics 2013, 193, 327–345. [Google Scholar] [CrossRef] [PubMed]
  18. Pérez, P.; De Los Campos, G. Genome-Wide Regression and Prediction with the BGLR Statistical Package. Genetics 2014, 198, 483–495. [Google Scholar] [CrossRef]
  19. Habier, D.; Fernando, R.L.; Kizilkaya, K.; Garrick, D.J. Extension of the Bayesian Alphabet for Genomic Selection. BMC Bioinform. 2011, 12, 186. [Google Scholar] [CrossRef]
  20. Park, T.; Casella, G. The Bayesian Lasso. J. Am. Stat. Assoc. 2008, 103, 681–686. [Google Scholar] [CrossRef]
  21. De Los Campos, G.; Gianola, D.; Rosa, G.J.M. Reproducing Kernel Hilbert Spaces Regression: A General Framework for Genetic Evaluation1. J. Anim. Sci. 2009, 87, 1883–1887. [Google Scholar] [CrossRef]
  22. De Los Campos, G.; Naya, H.; Gianola, D.; Crossa, J.; Legarra, A.; Manfredi, E.; Weigel, K.; Cotes, J.M. Predicting Quantitative Traits With Regression Models for Dense Molecular Markers and Pedigree. Genetics 2009, 182, 375–385. [Google Scholar] [CrossRef]
  23. Millet, E.J.; Kruijer, W.; Coupel-Ledru, A.; Alvarez Prado, S.; Cabrera-Bosquet, L.; Lacube, S.; Charcosset, A.; Welcker, C.; Van Eeuwijk, F.; Tardieu, F. Genomic Prediction of Maize Yield across European Environmental Conditions. Nat. Genet. 2019, 51, 952–956. [Google Scholar] [CrossRef]
  24. Juliana, P.; Poland, J.; Huerta-Espino, J.; Shrestha, S.; Crossa, J.; Crespo-Herrera, L.; Toledo, F.H.; Govindan, V.; Mondal, S.; Kumar, U.; et al. Improving Grain Yield, Stress Resilience and Quality of Bread Wheat Using Large-Scale Genomics. Nat. Genet. 2019, 51, 1530–1539. [Google Scholar] [CrossRef]
  25. Sert, D.; Mercan, E.; Dertli, E. Characterization of Lactic Acid Bacteria from Yogurt-like Product Fermented with Pine Cone and Determination of Their Role on Physicochemical, Textural and Microbiological Properties of Product. LWT 2017, 78, 70–76. [Google Scholar] [CrossRef]
  26. AOAC Official Method 934.01 Loss on Drying (Moisture) at 95–100 °C for Feeds Dry Matter on Oven Drying at 95–100 °C for Feeds First. J. AOAC 1998, 577, 6155.
  27. Penagos-Calvete, D.; Guauque-Medina, J.; Villegas-Torres, M.F.; Montoya, G. Analysis of Triacylglycerides, Carotenoids and Capsaicinoids as Disposable Molecules from Capsicum Agroindustry. Hortic. Environ. Biotechnol. 2019, 60, 227–238. [Google Scholar] [CrossRef]
  28. Zapata-Vahos, I.C.; Henao-Rojas, J.C.; Yepes-Betancur, D.P.; Marín-Henao, D.; Giraldo Sánchez, C.E.; Calvo-Cardona, S.J.; David, D.; Quijano-Abril, M. Physicochemical Parameters, Antioxidant Capacity, and Antimicrobial Activity of Honeys from Tropical Forests of Colombia: Apis Mellifera and Melipona Eburnea. Foods 2023, 12, 1001. [Google Scholar] [CrossRef] [PubMed]
  29. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2023. [Google Scholar]
  30. Hill, J.; Becker, H.C.; Tigerstedt, P.M.A. Quantitative and Ecological Aspects of Plant Breeding; Springer: Dordrecht, The Netherlands, 1998; ISBN 978-94-010-6463-7. [Google Scholar]
  31. Olivoto, T.; Lúcio, A.D.C.; da Silva, J.A.G.; Sari, B.G.; Diel, M.I. Mean Performance and Stability in Multi-Environment Trials II: Selection Based on Multiple Traits. Agron. J. 2019, 111, 2961–2969. [Google Scholar] [CrossRef]
  32. Babraham Bioinformatics—FastQC. A Quality Control Tool for High Throughput Sequence Data. Available online: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed on 20 October 2025).
  33. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A Flexible Trimmer for Illumina Sequence Data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef]
  34. Hulse-Kemp, A.M.; Maheshwari, S.; Stoffel, K.; Hill, T.A.; Jaffe, D.; Williams, S.R.; Weisenfeld, N.; Ramakrishnan, S.; Kumar, V.; Shah, P.; et al. Reference Quality Assembly of the 3.5-Gb Genome of Capsicum annuum from a Single Linked-Read Library. Hortic. Res. 2018, 5, 4. [Google Scholar] [CrossRef] [PubMed]
  35. Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R. 1000 Genome Project Data Processing Subgroup The Sequence Alignment/Map Format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef]
  36. Bradbury, P.J.; Zhang, Z.; Kroon, D.E.; Casstevens, T.M.; Ramdoss, Y.; Buckler, E.S. TASSEL: Software for Association Mapping of Complex Traits in Diverse Samples. Bioinformatics 2007, 23, 2633–2635. [Google Scholar] [CrossRef]
  37. Crossa, J.; Pérez-Rodríguez, P.; Cuevas, J.; Montesinos-López, O.; Jarquín, D.; De Los Campos, G.; Burgueño, J.; González-Camacho, J.M.; Pérez-Elizalde, S.; Beyene, Y.; et al. Genomic Selection in Plant Breeding: Methods, Models, and Perspectives. Trends Plant Sci. 2017, 22, 961–975. [Google Scholar] [CrossRef]
  38. Sahmat, S.S.; Rafii, M.Y.; Oladosu, Y.; Jusoh, M.; Hakiman, M.; Mohidin, H. Unravelling the Dynamics of Genotype and Environment Interactions on Chilli (Capsicum annuum L.) Yield-Related Attributes in Soilless Planting Systems. Sci. Rep. 2024, 14, 1698. [Google Scholar] [CrossRef] [PubMed]
  39. Tripodi, P.; Cardi, T.; Bianchi, G.; Migliori, C.A.; Schiavi, M.; Rotino, G.L.; Lo Scalzo, R. Genetic and Environmental Factors Underlying Variation in Yield Performance and Bioactive Compound Content of Hot Pepper Varieties (Capsicum annuum) Cultivated in Two Contrasting Italian Locations. Eur. Food Res. Technol. 2018, 244, 1555–1567. [Google Scholar] [CrossRef]
  40. Wahyuni, Y.; Ballester, A.-R.; Sudarmonowati, E.; Bino, R.J.; Bovy, A.G. Secondary Metabolites of Capsicum Species and Their Importance in the Human Diet. J. Nat. Prod. 2013, 76, 783–793. [Google Scholar] [CrossRef]
  41. Bosland, P.W.; Votava, E.J. Peppers: Vegetable and Spice Capsicums, 2nd ed.; CABI: Wallingford, UK, 2012; ISBN 978-1-84593-784-3. [Google Scholar]
  42. Subhavyuktha, S.; Usha Nandhini Devi, H.; Kumar, K.K.; Vethamoni, P.I.; Premalatha, N.; Srividhya, S. Employing Empirical Models to Analyze Stability of Yield and Quality Traits in Chili Peppers (Capsicum spp.). Crop Sci. 2024, 64, 2977–2997. [Google Scholar] [CrossRef]
  43. Kim, S.; Park, M.; Yeom, S.-I.; Kim, Y.-M.; Lee, J.M.; Lee, H.-A.; Seo, E.; Choi, J.; Cheong, K.; Kim, K.-T.; et al. Genome Sequence of the Hot Pepper Provides Insights into the Evolution of Pungency in Capsicum Species. Nat. Genet. 2014, 46, 270–278. [Google Scholar] [CrossRef]
  44. Juharni; Syukur, M.; Suwarno, W.B.; Maharijaya, A. Analisis Stabilitas Parametrik Hasil Cabai Rawit (Capsicum fructescens L.) Pada Empat Lokasi Dataran Rendah. J. Agron. Indones. 2020, 48, 258–267. [Google Scholar] [CrossRef]
  45. Kusmana, N.; Kirana, R.; Rahayu, A. Uji Adaptasi Dan Stabilitas Hasil Enam Genotipe Cabai Hibrida Di Dataran Tinggi Jawa Barat (Adaptation and Yield Stability of Six Hybrid Chili Genotypes in Highland Area of West Java). J. Hort. 2019, 29, 17. [Google Scholar] [CrossRef]
  46. Robert, P.; Brault, C.; Rincent, R.; Segura, V. Phenomic Selection: A New and Efficient Alternative to Genomic Selection. In Genomic Prediction of Complex Traits; Ahmadi, N., Bartholomé, J., Eds.; Methods in Molecular Biology; Springer: New York, NY, USA, 2022; Volume 2467, pp. 397–420. ISBN 978-1-0716-2204-9. [Google Scholar]
  47. Robert, P.; Goudemand, E.; Auzanneau, J.; Oury, F.-X.; Rolland, B.; Heumez, E.; Bouchet, S.; Caillebotte, A.; Mary-Huard, T.; Le Gouis, J.; et al. Phenomic Selection in Wheat Breeding: Prediction of the Genotype-by-Environment Interaction in Multi-Environment Breeding Trials. Theor. Appl. Genet. 2022, 135, 3337–3356. [Google Scholar] [CrossRef]
  48. Crossa, J.; Montesinos-López, O.A.; Pérez-Rodríguez, P.; Costa-Neto, G.; Fritsche-Neto, R.; Ortiz, R.; Martini, J.W.R.; Lillemo, M.; Montesinos-López, A.; Jarquin, D.; et al. Genome and Environment Based Prediction Models and Methods of Complex Traits Incorporating Genotype × Environment Interaction. In Genomic Prediction of Complex Traits; Ahmadi, N., Bartholomé, J., Eds.; Methods in Molecular Biology; Springer: New York, NY, USA, 2022; Volume 2467, pp. 245–283. ISBN 978-1-0716-2204-9. [Google Scholar]
  49. Flegr, J. Heritability. In Encyclopedia of Sexual Psychology and Behavior; Shackelford, T.K., Ed.; Springer International Publishing: Cham, Switzerland, 2024; pp. 1–16. ISBN 978-3-031-08956-5. [Google Scholar]
  50. Faliarizao, N.T.; Siddiq, M.; Dolan, K.D. Total Phenolics, Antioxidant, and Physical Properties of Red Chili Peppers (Capsicum annum L.) as Affected by Drying Methods. Int. J. Food Prop. 2025, 28, 2492823. [Google Scholar] [CrossRef]
  51. The Biological Activity of Phytochemicals; Gang, D.R., Ed.; Springer: New York, NY, USA, 2011; ISBN 978-1-4419-6961-3. [Google Scholar]
  52. Kim, E.-H.; Lee, K.M.; Lee, S.-Y.; Kil, M.; Kwon, O.-H.; Lee, S.-G.; Lee, S.-K.; Ryu, T.-H.; Oh, S.-W.; Park, S.-Y. Influence of Genetic and Environmental Factors on the Contents of Carotenoids and Phenolic Acids in Red Pepper Fruits (Capsicum annuum L.). Appl. Biol. Chem. 2021, 64, 85. [Google Scholar] [CrossRef]
  53. Herrera-Pool, E.; Ramos-Díaz, A.L.; Padilla De La Rosa, J.D.; García-Cruz, U.; Lizardi-Jiménez, M.A.; Ayora-Talavera, T.; Cuevas-Bernardino, J.C.; Pacheco, N. UPLC-PDA-ESI-MS Based Chemometric Analysis for Solvent Polarity Effect Evaluation on Phytochemical Compounds and Antioxidant Activity in Habanero Pepper (Capsicum chinense Jacq) Fruit Extract. J. Food Sci. 2025, 90, e17630. [Google Scholar] [CrossRef]
  54. Sun, Y.; Ma, Q.; Mao, L.; Zhou, Y.; Shen, Y.; Wu, W.; Dai, Y.; Liu, Z. Integrated Transcriptome and Metabolome Analysis Reveals the Mechanism of Carotenoid Regulation in the Yellowing-Leaf Mutant of Pepper (Capsicum annuum L.) in Response to Different Temperatures. Sci. Hortic. 2024, 323, 112530. [Google Scholar] [CrossRef]
  55. Pashkovskiy, P.; Sleptsov, N.; Vereschagin, M.; Kreslavski, V.; Rudometova, N.; Sorokoumov, P.; Ashikhmin, A.; Bolshakov, M.; Kuznetsov, V. Post-Harvest Red- and Far-Red-Light Irradiation and Low Temperature Induce the Accumulation of Carotenoids, Capsaicinoids, and Ascorbic Acid in Capsicum annuum L. Green Pepper Fruit. Foods 2023, 12, 1715. [Google Scholar] [CrossRef] [PubMed]
  56. Lahbib, K.; Dabbou, S.; Bnejdi, F.; Pandino, G.; Lombardo, S.; El Gazzah, M.; El Bok, S. Agro-Morphological, Biochemical and Antioxidant Characterization of a Tunisian Chili Pepper Germplasm Collection. Agriculture 2021, 11, 1236. [Google Scholar] [CrossRef]
  57. Kaushik, P. Genetic Analysis for Fruit Phenolics Content, Flesh Color, and Browning Related Traits in Eggplant (Solanum melongena). Int. J. Mol. Sci. 2019, 20, 2990. [Google Scholar] [CrossRef]
  58. Sauvage, C.; Segura, V.; Bauchet, G.; Stevens, R.; Do, P.T.; Nikoloski, Z.; Fernie, A.R.; Causse, M. Genome-Wide Association in Tomato Reveals 44 Candidate Loci for Fruit Metabolic Traits. Plant Physiol. 2014, 165, 1120–1132. [Google Scholar] [CrossRef] [PubMed]
  59. Roorkiwal, M.; Jarquin, D.; Singh, M.K.; Gaur, P.M.; Bharadwaj, C.; Rathore, A.; Howard, R.; Srinivasan, S.; Jain, A.; Garg, V.; et al. Genomic-Enabled Prediction Models Using Multi-Environment Trials to Estimate the Effect of Genotype × Environment Interaction on Prediction Accuracy in Chickpea. Sci. Rep. 2018, 8, 11701. [Google Scholar] [CrossRef] [PubMed]
  60. Bassi, F.M.; Bentley, A.R.; Charmet, G.; Ortiz, R.; Crossa, J. Breeding Schemes for the Implementation of Genomic Selection in Wheat (Triticum spp.). Plant Sci. 2016, 242, 23–36. [Google Scholar] [CrossRef]
Figure 1. Boxplot of the chemical and physicochemical traits evaluated in Capsicum fruits at La Selva, Palmira and Cauca. Crossed circles indicate the mean of the trait per location. Different letters above the boxplot indicate significant differences after the Tukey test.
Figure 1. Boxplot of the chemical and physicochemical traits evaluated in Capsicum fruits at La Selva, Palmira and Cauca. Crossed circles indicate the mean of the trait per location. Different letters above the boxplot indicate significant differences after the Tukey test.
Agronomy 15 02690 g001
Figure 2. Biplots generated from PCA with the percentage of variance explained by the first two components (Dim 1 and Dim 2) for fruits traits assessed at La Selva (A,B), Palmira (C,D) and Cauca (E,F). Directional vectors represent the evaluated traits and points represent the genotypes. Color of the dots display the fruit types defined by cluster analysis at the three locations: red cluster is composed by high-lightness yellow fruits, blue cluster by high-red intense fruits and green cluster by bioactive-enriched fruits.
Figure 2. Biplots generated from PCA with the percentage of variance explained by the first two components (Dim 1 and Dim 2) for fruits traits assessed at La Selva (A,B), Palmira (C,D) and Cauca (E,F). Directional vectors represent the evaluated traits and points represent the genotypes. Color of the dots display the fruit types defined by cluster analysis at the three locations: red cluster is composed by high-lightness yellow fruits, blue cluster by high-red intense fruits and green cluster by bioactive-enriched fruits.
Agronomy 15 02690 g002
Figure 3. AMMI analysis for Capsicum genotypes evaluated across three locations: La Selva, Palmira and Cauca. Principal components biplots for pH (a), humidity (b), firmness (c) and cohesiveness (d).
Figure 3. AMMI analysis for Capsicum genotypes evaluated across three locations: La Selva, Palmira and Cauca. Principal components biplots for pH (a), humidity (b), firmness (c) and cohesiveness (d).
Agronomy 15 02690 g003
Figure 4. Predictive accuracy of BayesC genomic models across environments for Capsicum spp. Observed versus predicted genomic-estimated breeding values (GEBVs) for pH (A) and pungency-related traits (total capsaicinoids (B), capsaicin (C), dihydrocapsaicin (D)) across La Selva, Palmira, and Cauca. The BayesC model achieved the highest accuracy in La Selva (R = 0.96–0.98) with moderate transferability in external environments (R = 0.54–0.73), confirming its robustness for cross-environment genomic prediction in the Colombian Capsicum panel.
Figure 4. Predictive accuracy of BayesC genomic models across environments for Capsicum spp. Observed versus predicted genomic-estimated breeding values (GEBVs) for pH (A) and pungency-related traits (total capsaicinoids (B), capsaicin (C), dihydrocapsaicin (D)) across La Selva, Palmira, and Cauca. The BayesC model achieved the highest accuracy in La Selva (R = 0.96–0.98) with moderate transferability in external environments (R = 0.54–0.73), confirming its robustness for cross-environment genomic prediction in the Colombian Capsicum panel.
Agronomy 15 02690 g004
Table 1. Proportion of the variance (PVE) explained by locations (Env—light blue), genotypes (Geno—navy blue), their interaction (GEI—orange), and residuals (Res—green) on evaluated traits. Broad sense heritability (H2) calculated according to Hill et al. (1998) [30].
Table 1. Proportion of the variance (PVE) explained by locations (Env—light blue), genotypes (Geno—navy blue), their interaction (GEI—orange), and residuals (Res—green) on evaluated traits. Broad sense heritability (H2) calculated according to Hill et al. (1998) [30].
PVETraitAbvEnvGenoGEIResH2
Agronomy 15 02690 i001pHpH0.490.340.090.080.83
Agronomy 15 02690 i002Soluble solids [°]Brix0.090.670.210.030.91
Agronomy 15 02690 i003Total solids [%]Solt0.030.720.120.130.93
Agronomy 15 02690 i004Humidity [%]Hum0.030.720.120.130.93
Agronomy 15 02690 i005LuminosityL*0.450.300.220.030.82
Agronomy 15 02690 i006Red–green coordinatesa*0.390.370.230.020.80
Agronomy 15 02690 i007Yellow–blue coordinatesb*0.550.270.160.020.83
Agronomy 15 02690 i008ChromaC0.650.220.110.020.84
Agronomy 15 02690 i009Hueh0.650.220.110.020.87
Agronomy 15 02690 i010Firmness [g]Firm0.020.420.490.060.91
Agronomy 15 02690 i011Consistency [g × s−1]Cons0.020.470.480.040.90
Agronomy 15 02690 i012Cohesiveness [g]Cohe0.690.130.160.020.86
Agronomy 15 02690 i013Cohesiveness work [g × sec−1]Cohe.w0.680.090.160.070.81
Agronomy 15 02690 i014Total phenol content [mg × g−1]Tpc0.570.170.250.020.67
Agronomy 15 02690 i015β-carotent content [mg × g−1]Bcaro0.130.300.470.090.65
Agronomy 15 02690 i016Antioxidant capacityDpph0.280.180.370.160.56
Agronomy 15 02690 i017Antioxidant capacityFrap0.120.470.390.030.78
Agronomy 15 02690 i018Color index C.index0.080.600.290.030.85
Agronomy 15 02690 i019Capsaicin [mg × kg−1]Capsa0.040.410.210.350.79
Agronomy 15 02690 i020Dihidrocapsaicin [mg × kg−1]Dicapsa0.000.680.210.110.89
Table 2. Analysis of variance (ANOVA) for the AMMI models.
Table 2. Analysis of variance (ANOVA) for the AMMI models.
Sourcep-Value
pHHumFirmCohe
ENV2.23 × 10−132.19 × 10−54.39 × 10−54.17 × 10−10
GEN5.76 × 10−1551.42 × 10−773.21 × 10−635.97 × 10−69
GEI2.06 × 10−813.20 × 10−145.86 × 10−621.07 × 10−65
Table 3. Stability index and other AMMI statistics for pH, humidity, cohesiveness and firmness in Capsicum spp. genotypes evaluated across three locations in Colombia. Genotype (GEN), phenotypic stability index (PSI), and ranking (r) of the genotype based on PSI.
Table 3. Stability index and other AMMI statistics for pH, humidity, cohesiveness and firmness in Capsicum spp. genotypes evaluated across three locations in Colombia. Genotype (GEN), phenotypic stability index (PSI), and ranking (r) of the genotype based on PSI.
GenotypeCohesivenessFirmnesspHHumidity
PSIrMeanPSIrMeanPSIrMeanPSIrMean
11341.523−31.5833.515249.173935.51673478.07
1242.524−34.7924.56485.361845.49512982.48
12950.532−116.9620.521021.292965.36552782.93
13047.529−86.2227.59330.1021115.21271286.42
13936.518−16.1344.52644.7459305.02312084.67
14935.517−10.8045.52723.1646344.91532483.66
16825.5745.7039.52193.3446125.19663181.59
18229.5116.3243.52553.864185.3030489.91
19339.521−30.9942.52457.0649195.15532683.22
19524.5653.0937.519122.2441225.10532882.49
19819.51148.4719.511323.633195.27723672.61
209A21.53124.2452.534−58.0616145.176291.76
209N40.522−31.5834.516249.1730245.0911589.77
209R22.54124.2453.535−58.0620155.176191.76
21628.5109.0232.514270.6842295.02452184.42
2452.534−123.6651.533−25.4545255.0840886.98
25731.513−4.3628.510323.6743364.88281884.99
26043.525−46.5621.53898.3742215.12633279.93
26134.516−10.3648.5309.3456315.01231485.70
26738.520−24.6638.52098.3161354.91442583.50
2946.528−71.0629.511314.4252205.12513379.10
3630.5120.3223.55498.4451235.09653577.26
4048.530−95.3322.54654.7132175.16373082.28
4433.515−8.6226.58371.3834105.27291785.08
5253.535−124.1950.532−16.9332165.17291585.64
5437.519−24.0241.52357.352815.59422284.02
5744.526−56.6431.513296.4636285.04271186.53
6026.5828.5040.52267.353725.54361385.71
6132.514−7.8130.512313.431875.34321984.97
6345.527−60.9525.57400.3342334.95241685.62
8723.5553.9135.517151.2649185.15311086.67
8827.5911.6736.518148.4443265.06402383.88
Cayena49.531−115.0446.52818.6235325.0110986.78
Habanero54.536−128.3347.52913.29655.4410787.20
Rocoto20.52139.2054.536−83.5245275.0537390.16
Table 4. Genomic heritability (h2), standard deviation (sd h2), and predictive performance metrics (MSE, COR) estimated with the BayesC model for physicochemical and biochemical traits of Capsicum spp. evaluated in La Selva environment under five-fold cross-validation.
Table 4. Genomic heritability (h2), standard deviation (sd h2), and predictive performance metrics (MSE, COR) estimated with the BayesC model for physicochemical and biochemical traits of Capsicum spp. evaluated in La Selva environment under five-fold cross-validation.
Variableh2sd h2MSE TRNMSE TSTCOR TRN COR TST
pH0.48365780.00288380.00658680.00653090.94450050.9383034
Brix [°]0.08628850.00933820.93421690.95838660.93887870.9395094
Dry matter [%]0.04845070.00505741.3677061.4004640.97170240.9727942
Luminosity0.00845890.00105469.2996269.1090060.9756130.9701257
Chroma0.005870.0003630.93421690.95838660.93887870.9395094
Matiz0.00651250.00044110.93421690.95838660.93887870.9395094
Firmness [g]0.00020814.35 × 10−50.93421690.95838660.93887870.9395094
Consistency [g × sec−1]0.00024864.74 × 10−5109903.9108739.50.95471330.9387682
Cohesiveness [g]0.00045526.13 × 10−5587.2524611.63430.96146550.9642437
Cohesive work [g × sec−1]0.00039544.30 × 10−5746.8806762.35290.9373090.9339638
Total carotenoids [mg × g−1]0.01422710.00069156.5542666.5708740.94407850.9295193
Capsaicinoids [mg × kg−1]0.34436090.00647910.05302780.049630.9815780.9735871
Capsaicin [mg × kg−1]0.39398890.00458750.02863280.02546710.98499310.9763634
Dihidrocapsaicin [mg × kg−1]0.47635350.00276040.01003650.00974640.9457050.9385038
Table 5. Predictive performance of BayesC genomic prediction models for Capsicum fruit traits (pH, capsaicin, total capsaicinoids, and dihydrocapsaicin) across contrasting environments (La Selva, Palmira, and Cauca). N indicates the number of accessions with genomic and phenotypic data available for the analysis. Abbreviations are as follows: Pearson and Spearman correlations (r), mean absolute error (MAE), and root mean square error (RMSE) between observed and predicted genomic estimated breeding values (GEBVs).
Table 5. Predictive performance of BayesC genomic prediction models for Capsicum fruit traits (pH, capsaicin, total capsaicinoids, and dihydrocapsaicin) across contrasting environments (La Selva, Palmira, and Cauca). N indicates the number of accessions with genomic and phenotypic data available for the analysis. Abbreviations are as follows: Pearson and Spearman correlations (r), mean absolute error (MAE), and root mean square error (RMSE) between observed and predicted genomic estimated breeding values (GEBVs).
Genomic Prediction Model (pH)
EnvironmentNr pearsonr spearmanMAERMSE
La Selva 270.9760.960.07680.0981
Palmira250.7080.7230.2140.253
Cauca250.6440.6350.5370.572
Genomic Prediction Model (Capsaicin)
EnvironmentNr pearsonr spearmanMAERMSE
La Selva 190.9140.8630.1030.136
Palmira230.5430.6030.3360.528
Genomic Prediction Model (Capsaicinoides)
EnvironmentNr pearsonr spearmanMAERMSE
La Selva 190.9490.9090.1720.235
Palmira230.6170.6970.4540.779
Genomic Model Dihidrocapsaicin
EnvironmentNr pearsonr spearmanMAERMSE
La Selva 220.9630.9110.09880.146
Palmira230.7290.7120.1350.295
Table 6. Genomic Breeding Values (GBVs) for Key Quality and Pungency Traits in Capsicum spp. under La Selva Environment.
Table 6. Genomic Breeding Values (GBVs) for Key Quality and Pungency Traits in Capsicum spp. under La Selva Environment.
TraitTop AccessionsGBV Interpretation
pH113, 70, 2770.396, 0.337, 0.274Stable buffering capacity and desirable fruit acidity under highland conditions
Total Capsaicinoids (mg × kg−1)160, 161, 2315.18, 3.20, 2.68Strong additive potential for overall pungency intensity
Capsaicin (mg × kg−1)160, 161, 1784.57, 2.77, 1.99High predicted accumulation of the main pungent metabolite
Dihydrocapsaicin (mg × kg−1)231, 222, 2800.43, 0.39, 0.38Elevated GBVs indicating stability of secondary capsaicinoid synthesis
Multi-trait High Performers160, 161, 231, 168, 216, 260Consistent additive effects across pungency traits; suitable for genomic selection and multi-environment testing
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Parra-Londono, S.; López-Hernández, F.; Montoya, G.; Henao-Rojas, J.C.; Ossa-Ossa, G.A.; Cortés, A.J. Merging Phenotypic Stability Analysis and Genomic Prediction for Multi-Environment Breeding in Capsicum spp. Agronomy 2025, 15, 2690. https://doi.org/10.3390/agronomy15122690

AMA Style

Parra-Londono S, López-Hernández F, Montoya G, Henao-Rojas JC, Ossa-Ossa GA, Cortés AJ. Merging Phenotypic Stability Analysis and Genomic Prediction for Multi-Environment Breeding in Capsicum spp. Agronomy. 2025; 15(12):2690. https://doi.org/10.3390/agronomy15122690

Chicago/Turabian Style

Parra-Londono, Sebastian, Felipe López-Hernández, Guillermo Montoya, Juan Camilo Henao-Rojas, Gustavo A. Ossa-Ossa, and Andrés J. Cortés. 2025. "Merging Phenotypic Stability Analysis and Genomic Prediction for Multi-Environment Breeding in Capsicum spp." Agronomy 15, no. 12: 2690. https://doi.org/10.3390/agronomy15122690

APA Style

Parra-Londono, S., López-Hernández, F., Montoya, G., Henao-Rojas, J. C., Ossa-Ossa, G. A., & Cortés, A. J. (2025). Merging Phenotypic Stability Analysis and Genomic Prediction for Multi-Environment Breeding in Capsicum spp. Agronomy, 15(12), 2690. https://doi.org/10.3390/agronomy15122690

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop