1. Introduction
Chickpea is the second most grown food legume worldwide after the common bean [
1]. It was domesticated in the Fertile Crescent, particularly in south-eastern Turkey [
2,
3]. Since the Bronze Age, chickpea cultivation has spread throughout the Mediterranean Basin, central Asia, and Africa [
4]. There are two main cultivated types of chickpea: the ‘desi’ type, which is characterized by purple flowers and small dark seeds, and is mostly grown in the Indian sub-continent and Ethiopia; and the ‘kabuli’ type, which has white flowers and large light seeds, and is mainly consumed in the Mediterranean Basin.
Chickpea is an example of a food legume crop that is suited to the European environment. This applies in particular to Mediterranean regions, where a large number of landraces are still maintained by smallholder farmers [
5]. In the Mediterranean area, chickpea is usually grown in the spring–summer season, relying on the residual moisture of the soil. However, many studies conducted in Mediterranean countries have shown that the mild winter conditions of this area offer the possibility to sow chickpea in autumn, with increased yield and yield stability [
6,
7]. However, breeding efforts to develop novel chickpea varieties that are better adapted to European agro-ecosystems are still lacking [
8]. Indeed, breeding activities for legumes almost stopped in the 1940s, although national activities and recent projects are now growing due to the stimulus of consumer demand and market trends [
9].
In this regard, plant genetic resources and their characterization represent a crucial prerequisite toward the development of elite varieties with enhanced quality traits that might be suitable for sustainable European agro-ecosystems. Pre-breeding resources are now available for chickpea, which include large plant collections, as mainly the gene banks of the International Crops Research Institute for the Semi-Arid Tropics (ICRISAT) and the International Centre for Agricultural Research in Dry Areas (ICARDA), that hold 20.456 and 15.749 accessions, respectively (
https://www.genesys-pgr.org/), reference genomes [
10,
11,
12], and resequenced genotypes [
4].
Large germplasm collections can make breeding evaluations difficult due to the large numbers of individuals. Core collections represent a useful pre-breeding tool that reduces large germplasm collections to manageable samples that maintain rich genetic diversity [
13]; often, such collections or part of them are used as association panels [
14,
15,
16], holding historical recombination events for greater genetic resolution needed for genome-wide association (GWA) studies. Chickpea core collections have been developed and characterized both phenotypically and genotypically [
4,
17,
18,
19,
20,
21,
22] and used for association mapping [
23,
24]. However, these collections are mainly characterized by accessions of Asian origin, and thus they are not the ideal samples to targeting European agro-ecosystems.
In this study, we aimed to develop the first chickpea association panel, composed of genetic resources collected mainly in Europe and the Mediterranean Basin, which will represent an important pre-breeding tool for European food legume research.
2. Materials and Methods
2.1. Plant Material and Field Evaluation
A total of 1931 chickpea accessions from 47 different countries were used, as the
UNIVPM_Ca_ALL collection. The seeds of 1812 and 119 accessions were provided by the Western Regional Plant Introduction Station (USDA-ARS, Washington State University, USA) and the Leibniz Institute of Plant Genetics and Crop Plant Research-IPK (Gatersleben, Germany), respectively. The selection criteria to order the accessions were: (i) to include mostly landraces; (ii) to cover the geographic distribution of the species, with a focus on European and Mediterranean regions. A complete list of the accessions studied along with passport information are available in
Supplementary File S1.
The whole collection was sown the 18 February 2018, at the Research Centre for Cereal and Industrial Crops (CREA-CI) experimental station of Osimo (Ancona, Italy; latitude, 43°45′04″; longitude, 13°49′98″; 41 m above sea level), in single-row plots of 10 plants (distance between plants, 10 cm) without replicates. For each accession, a representative single plant was identified at maturity stage, and the seeds were harvested separately (i.e., single seed descent [SSD]).
Phenotypic characterization was carried out for the whole collection, considering the following traits: flower color (purple, white), days to first flower (number of days after sowing [DAS] when one open flower was present within a plot), days to full flowering (number of DAS when 50% of all plants had at least one open flower), days to first pod (number of DAS when one pod was present within a plot), yield (production in grams [g] of the entire row plot), and 1000-seed weight.
2.2. Association Panel Development
Starting from the
UNIVPM_Ca_ALL collection, a nested set of materials (
UNIVPM_Ca_core1) was developed based on three specific selection criteria: (i) adaptation to the specific environment, taking into account yield and 1000-seed weight evaluated in the
UNIVPM_Ca_ALL collection field trial in 2018; (ii) equal representation of the desi and kabuli types; and (iii) maximized geographic representation (i.e., country of origin) using the passport information (
Figure 1).
The
UNIVPM_Ca_core1 included: 234 accessions with yield and/or 1000-seed weight greater than 150 g and 450 g, respectively; 92 accessions as Italian landraces, erect habitus accessions and/or early flowering accessions; 74 accessions with purple flower color, to balance the desi and kabuli types; and 26 accessions from countries not represented in the first selection in order to cover as much as possible the geographic distribution of the species. In this way, a total of 426 accessions were selected from the
UNIVPM_Ca_ALL collection. For each of these accessions, the seeds produced by the selected single plants were collected (first SSD cycle). Finally, further 54 chickpea accessions (30 breeding lines from ICARDA, Rabat, Morocco; 24 Italian cultivars and landraces) were added to the
UNIVPM_Ca_core1 set for a total of 480 entries (
UNIVPM_Ca_core2). The 24 Italian further accessions include the main commercial varieties grown in Italy as controls and landraces, collected on farm, thus adapted to local environments. Based on passport information (i.e., accession codes/synonyms), no duplicates were included in
UNIVPM_Ca_Core2 (
Supplementary File S1).
2.3. UNIVPM_Ca_core2 Phenotypic Evaluation
The 480
UNIVPM_Ca_core2 lines were phenotypically evaluated in a field trial carried out in 2019 at the CREA-CI experimental station (Osimo, Ancona, Italy). The environmental data for the whole of 2019 were recorded by the weather station at CREA-CI in Osimo (Ancona, Italy). The weather data for the growing season (April–July 2019) and the soil data are reported in
Supplementary File S2. The thermo-pluviometric data for the 2019 growing season (April–July) are illustrated in
Supplementary Figure S2. The total precipitation was 316 mm and the mean temperature was 22.6 °C. High temperatures (>25 °C) were recorded from the last week of June to the end of the growing season. The experimental station was characterized by a loamy alkaline soil of pH 8 and 1.4% organic matter. The sowing was carried out on 10 April 2019, in single rows of 10 plants (distance between plants, 10 cm), using a randomized complete block design with two replicates. The first selfing cycle seeds of each
UNIVPM_Ca_core1 genotype were used to continue the cycles of selfing for the development of the SSD lines, whereby seeds of a single plant for each plot of the first replicate were harvested separately to obtain the second selfing cycle seeds.
Several flowering, architectural, and production traits were evaluated. In addition to the flowering traits used for
UNIVPM_Ca_ALL, the duration of flowering (i.e., difference between days to first pod and days to first flower) was also calculated. Days to first flower, days to full flowering, and days to first pod were also measured as the growing degree days (GDDs), according to Equation (1) [
25]:
where Tm is the daily average temperature (°C), and Tb is the chickpea base temperature (here taken as 0 °C).
At complete mature stage, the following architectural traits were evaluated: the growth habit (1, erect; 2, semi-erect; 3, semi-spreading; 4, spreading; 5, prostrate) [
26]; and plant height and first pod height (distance [cm] from ground to highest part of plant, and to first pod, respectively; average values from 3 plants, taken from the middle of the plot). Finally, yield and 1000-seed weight were also recorded.
2.4. Data Analysis
The passport information and phenotypic data collected in 2018 for the UNIVPM_Ca_ALL collection were used to select accessions to be included in UNIVPM_Ca_core1. The same data were used to compare the overall collection (UNIVPM_Ca_ALL) and the developed UNIVPM_Ca_core1. Geographic origin, market type (desi, kabuli), production traits (yield, 1000-seed weight) and flowering traits were compared across the two collections.
Phenotypic data collected during the 2019 field trial were used to analyze the phenotypic variation of the
UNIVPM_Ca_core2 genotypes. Frequency distributions of all of the phenotypic traits were obtained. The narrow sense heritability (
h2) was estimated according to Equation (2):
where Y is the observed phenotype, μ is the overall mean, G is the effect for the
ith genotype,
bk is the effect for the
kth block, and
ε is the plot error effect. Here, G
i was taken as random, so that the heritability was estimated from genotypic variance
σ2g estimation.
Two types of multivariate analyses were carried out: (i) hierarchical cluster analysis (cluster distances calculated using Ward’s linkage method [
27]), using means of flowering traits (days to first flower, days to full flowering, days to first pod expressed as GDD, duration of flowering); (ii) principal component analysis, with all of the phenotypic traits included.
Differences among clusters, growth habit classes, and desi/kabuli types were tested for production and architectural traits using one-way analysis of variance (ANOVA). Mean discrimination was performed, applying Student’s t-tests for independent samples, and statistical significance was set at p < 0.05. Correlations between phenotypic traits were investigated according to the Pearson r coefficient.
A scatter plot for average yield and 1000-seed weight was constructed and Sultano, Pascià, Maragià, Reale, and Ituchi cultivars were highlighted as controls.
Correlated responses were calculated for the 426 genotypes belonging to
UNIVPM_Ca_core1 using the data for the production traits (yield, 1000-seed weight) recorded for the 2018 and 2019 growing seasons. Best-fit linear relationships were computed for both of these traits. The regression coefficient
r was used as an estimate of the narrow sense heritability for each trait [
28].
The statistical package JMP, version 8 (SAS Institute Inc., Cary, NC, USA) was used for the data analysis.
4. Discussion
The conservation and valorization of plant genetic resources (PGR) is crucial for the future of agriculture [
29]. Indeed, the genetic diversity of PGR and particularly of food legumes represents a strategic tool for agriculture today, to guarantee food security and food quality, to cope with climate change and to protect the environment [
9,
30]. To conserve such precious resources is not enough; it is crucial to also characterize them and to be able to fully exploit the genetic diversity they hold for plant breeding. In the last decade, GWA mapping has become a very popular tool for gene discovery and PGR enhancement. The prerequisite to conduct a successful GWA program is the availability of well-characterized association panels built as a collection of a high number of pure lines (for autogamous species). These association panels are efficient pre-breeding tools in the hands of breeders. Often, association panels are derived from large core collections by sampling a set of genotypes balancing the population structure and the size of the panels (at least 200 genotypes) [
31]. Moreover, association panels need to present a wide variation and the proper resolution for the trait of interest. Here, from a collection of about 2000 chickpea accessions, characterized in a field trial, we developed a panel of 480 pure line-based genotypes for GWA studies, primarily targeting agronomic traits for European and Mediterranean environments.
Several studies are currently available based on phenotypic and genotypic characterization of chickpea core collections [
4,
17,
18,
19,
20,
21,
22]. However, the majority of the accessions that constitute the existing and deeply analyzed chickpea core collections were lacking in materials from the Mediterranean Basin, and especially from Europe. Moreover, phenotypic evaluations of wide collections of chickpea genetic resources in such environments are also lacking.
The association panel developed in this study, UNIVPM_Ca_core2, included 180 European genotypes together with 105 genotypes from north Africa, Turkey, Lebanon, Cyprus and Syria and these represent 59.4% of this entire set of materials. The remaining genotypes are from diverse geographic areas and mean that UNIVPM_Ca_core2 includes worldwide materials. The main strength of this panel is that it was developed by selection of the materials (i.e., landraces, cultivars) that did not show any difficulties to grow, flower, and mature in a Mediterranean environment (i.e., central Italy) over both the 2018 and 2019 growing seasons. This is also the reason why the Italian local germplasm is a little more represented. This means that the set of materials was developed to constitute an association panel that has the potential to be used for breeding programs aimed to develop varieties adapted to European and Mediterranean environments.
The analysis of the phenotypic data recorded over 2 years of field trials highlighted that the collection encompasses large phenotypic variability in terms of the flowering, architectural, and production traits, which confirms the high potential of such a set of materials to be used as a tool for association mapping studies and as a training population for genomic selection.
Considering the different traits, wide variation was observed in this
UNIVPM_Ca_core2 panel for the architectural traits, in terms of plant height (10–70 cm) and first pod insertion (5–45 cm). A similar range of variability for plant height was reported by Upadhyaya and Ortiz [
17] for both their core subset of 1956 accessions and their mini-core set of 211 accessions that were grown in the 1999/2000 post rainy season at the ICRISAT Centre at Patancheru (India). Considering first pod height, a smaller range (6–36 cm) was shown by Canci and Toker [
32] in their evaluation of 377 chickpea accessions over 2 years (2005, 2006) in the Antalya region (Turkey), although this was carried out under drought and heat stress conditions. Moderate heritability for plant height (
h2 = 52.0%) and first pod height (
h2 = 50.5%) were estimated in the present study. Similar estimates for plant height were also obtained by Mallu et al. [
33] in their evaluation of 60 genotypes over four different field trials carried out in 2013 at two locations (Kabete and Juja, Kenya) and through two rainy seasons.
Variability was also observed in the present study for flowering time. The geographic distribution of the three different clusters identified on the basis of the flowering traits showed that the genotypes from geographic areas that are characterized by lower latitudes, such as Ethiopia and Central America, included more early flowering, while for the Mediterranean Basin there were higher proportions of intermediate and late-flowering genotypes. In western and eastern Europe at higher latitudes, the late-flowering genotypes were more prevalent. This observation reflects what has already been shown for the chickpea, where temperature and day length were identified as the key factors that affect the timing of floral initiation [
34,
35]. Thus, in warmer environments, such as southern and central India, early flowering genotypes have been selected to avoid strong terminal drought stress, while in cooler habitats that are characterized by longer growing seasons (e.g., northern India, Mediterranean), the selection has been for later-flowering genotypes [
36,
37]. Of course, the availability of the different genotypes characterized by variability in flowering time was crucial for the development of varieties adapted to different latitudes and target locations, as well as to different sowing seasons (i.e., autumn, spring).
In the developed panel, the growth habit also showed high variability. The multivariate analysis showed opposite correlations between flowering time and growth habit, with early flowering genotypes from Mexico and Ethiopia having a spreading growth habit, and late-flowering genotypes characterized by erect and semi-erect growth habits. The same correlation was reported for the phenotypic diversity analysis of the core collection developed by ICRISAT [
19].
For the production traits, the present study highlighted large variability for 1000-seed weight, from 100 to >500 g; moreover, this trait showed strong heritability, as also seen in the literature [
32,
33]. This suggests that the control of the 1000-seed weight is characterized by a narrow genetic basis [
38], which was also confirmed by the strong correlation between the seed sizes across the 2 years of cultivation.
Variability was also seen for yield, even if low heritability and little correlation was seen between the 2018 and 2019 years of cultivation. This was expected considering that yield is a quantitatively inherited multigenic trait that is strongly affected by the environment [
39].
The
UNIVPM_Ca_core2 panel was also balanced between the desi and kabuli types. The literature based on genotypic characterization of different sets of chickpea germplasm suggests that desi and kabuli are not distinct genetic groups [
21,
40,
41]. Varma Penmetsa et al. [
40] analyzed a sample of 322 chickpea accessions that included 224 cultivated and 98 wild genotypes, and defined 538 single nucleotide polymorphisms. Their population structure data indicated that none of the genetic groups identified corresponded to the desi and kabuli types. They also suggested a diverse hypothesis for the origin of the kabuli type. While the general belief is that the desi form represents the early domesticate, with kabuli being a subsequently derived type [
3], Varma Penmetsa et al. [
40] showed that the kabuli form is polyphyletic as a result of mutations that have occurred in genes which mainly control pigmentation of flowers and seeds. These mutations have arisen multiple times during the phase of phenotypic diversification after the initial domestication of chickpea. Interestingly, extensive admixture was detected within the cultivated germplasm among groups with distinct geographic origins, which appears to be a consequence of breeding [
40]. Di Giovanni et al. [
41] used 22 single sequence repeats to characterize a sample of 103 chickpea accessions. Here, they identified three different genetic groups, which mainly represented not only the two market types, but also the geographic origin. In particular, a third group included both the desi and kabuli types mostly from Ethiopia and European countries. At the same time, they observed that the European desi accessions appeared to be closer to the Ethiopian gene pool than to the Asian gene pool.
The comparison of the desi and kabuli types of the
UNIVPM_Ca_core2 panel based on the phenotypic data from the 2019 field experiment indicated that there were significant differences between these two market types for all of the traits considered. These differences between the desi and kabuli genotypes were mainly related to the geographic origins of the accessions. The desi forms showed significantly earlier flowering than the kabuli forms, with the opposite for the yield; the desi form mainly originated from countries at low latitudes, such as India and Ethiopia, while the kabuli form was from the Mediterranean basin. These findings suggest that although the desi and kabuli forms are not clearly differentiated from a genetic point of view [
40], their adaptation to different environments has led to the selection of different genetic backgrounds (i.e., allele combinations) that form the basis of the phenotypic differentiation seen here. In this scenario, the balanced accessions of the desi and kabuli types and their origins from different geographic areas (which include the main areas of chickpea cultivation), makes this
UNIVPM_Ca_core2 collection a very useful set of materials to also investigate post-domestication diversification traits.
Finally, the
UNIVPM_Ca_core2 panel was developed with the aim to obtain SSD purified lines. This is a crucial step when the objective is to associate phenotypes to reliable genotypic information. In this regard, the
UNIVPM_Ca_core2 set of materials has strong potential to be used as an association panel in pre-breeding programs aimed at the identification of the genetic control of important agronomic, adaptive, and quality traits. Moreover, the linking of the diverse genetic backgrounds to specific phenotypes can be used to predict the phenotype of genetic resources conserved in genebanks not included in core collections, by only looking at the genetic data [
42].