Abstract
We produced homogeneous lines of 227 pea accessions from the Nordic Genetic Resource Center via single seed descent. The genetic diversity among these, mostly Scandinavian accessions, was investigated using three microsatellite markers, A9, AC58 and AA5. The microsatellites were highly informative and separated 153 of 194 accessions on a Neighbor Joining topology. The high polymorphism information content (PIC) values between 0.87 and 0.91 indicated that the gene bank material contains a large number of pea accessions with different breeding histories. The peas were grown in the field for two years and seed protein content showed variation between 9.3% and 34.1% over the years and accessions, respectively. The mean thousand seed weight was 152.05 g. More than 10 accessions had a protein content above 28%, showing that the collection has potential as breeding nursery for high-protein pea cultivars.
1. Introduction
Peas (Pisum sativum L.) have been grown in Denmark since 500–1500 AD and sweet and soft pea types for human consumption since 1600 [1]. Pisum sativum, originating in the Mediterranean region, has been adapted to the temperate climate and long days of Denmark. The world production of peas is 10.5 mill hectares [2]. Pisum sativum belongs to the legume family, Fabaceae, and is a self-pollinated, diploid species with 2n = 14 and a very large genome of 3.92 Gb [3]. Peas were the model plant used by Gregor Mendel to discover general rules for genetic heredity [4]. Peas are currently an important crop for food, feed and sidechain products, with a great potential for future farming. The content of dietary proteins in different pea cultivars ranges from 24.3 to 32.6%. The starch content is between 33.4 and 47.5% and peas contain fibers as well as mineral nutrients [5,6]. Others found that the protein content of pea cultivars varies with environmental and agronomic factors [7,8,9] with a typical span in protein content from 14.5% to 28% [10,11]. The symbiosis with nitrogen-fixing soil bacteria favors the growing of peas in low-input farming systems because it reduces the need for nitrogen fertilizers, and when used in crop rotation, peas add nitrogen to the soil [3,12].
The genetic diversity in peas has been studied using microsatellites also named simple sequence repeats (SSR). A worldwide collection of 372 pea cultivars and landraces has been analyzed using 29 different microsatellite markers [13]; a collection of 175 pea cultivars from the Czech Republic has been analyzed using seven microsatellite markers [14], and 130 pea landraces from Turkey were analyzed using 14 markers [15]. The genetic diversity measured as polymorphism information content (PIC) ranges from 0.15 to 0.96 for different types and numbers of microsatellites [15]. The microsatellite technique is also efficient when analyzing smaller dedicated pea collections such as garden peas, forage peas, landraces and historical collections from various countries, e.g., Tunisia, Anatolia, Australia, India, and from gene banks [16,17,18,19,20,21,22,23,24,25]. These studies included between 8 and 34 microsatellites showing PIC values ranging from 0.19 to 0.84. Specific microsatellite markers are developed for a collection of field pea cultivars from Ethiopia, which also identify private alleles [26]. A recent study of a forage pea collection of landraces from Anatolia successfully differentiated the accessions into subgroups of locally adapted genotypes [17]. With the SSR markers already available in the literature cited above, the initial cost of employing SSR markers to the present study is low.
The polymorphic information content (PIC) describes the polymorphism and thus the informational value of the microsatellite. Generally, PIC values above 0.5 are highly informative, between 0.25 and 0.5 reasonably informative, and below 0.25 slightly informative [27]. In a study based on the pea genome, 309 microsatellites were constructed and 235 were polymorphic in either the first or second set of tests [28]. The number of PCR fragments (alleles) per microsatellite is between two and seven. The PIC value of the microsatellites varies from 0.04 to 1 with a mean of 0.63, thus showing a wide range of information from slightly to highly informative [27,28]. A comparison of the genetic variation and morphological features of 65 pea cultivars and 21 wild pea accessions [29] found 51% polymorphic information content and the dendrogram shows that diversity among the varieties from Europe is narrower than in the rest of the accessions. Furthermore, the study supports the assumption that the topology of the dendrogram reflects the pattern of refinement of the cultivars [29]. In an analysis of 19 pea cultivars primarily from Australia and Russia, five of eight microsatellites successfully produced 34 PCR fragments in total with 3 to 13 fragments per microsatellite [30]. The PIC values ranged from 0.18 to 0.79 with a mean of 0.62 and classified the microsatellites as highly informative. The genetic diversity among 35 pea accession was assessed with 15 microsatellites, which produced 41 different fragments, and all genotypes were identified on a UPGMA tree [31]. The PIC values ranged from 0.05 to 0.66 with a mean of 0.46. Jain et al. [32] investigated the genetic diversity among 96 cultivars from all over the world using 31 microsatellites along with 42 expressed sequence tag markers and 11 modern markers. The microsatellites identified 83 PCR fragments with an average of 2–6 per microsatellite and the PIC values varied from 0.01 to 0.56 with a mean of 0.29. Nasiri et al. [33] investigated 77 pea accessions from 17 countries using 10 microsatellites. The results included 59 PCR fragments, which varied from 2 to 8 per microsatellite. The PIC values ranged between 0.56 and 0.84, with a mean of 0.72. Kumari et al. [34] investigated 28 pea genotypes with 32 microsatellites, which resulted in 44 fragments with 2 to 4 fragments per microsatellites. The PIC values were between 0.31 and 0.66 with a mean of 0.49.
The current study surveyed a subset of the approximately 2500 pea accessions of Peas at the Nordic Genetic Resource Center, NordGen, https://www.nordgen.org/ (accessed on 16 March 2023), many of which were from the Weibullsholm Pisum collection. The accessions were primarily chosen based on prior knowledge of potential high protein content. The investigations were a first step in a pre-breeding process towards selecting and developing pea cultivars with a high protein content to meet the demand for plant-based protein for human consumption and as an alternative source of protein for husbandry. We address the following three questions: (1) What is the genetic diversity among the accessions and is the diversity structured? (2) What is the polymorphic information content? (3) What is the level of protein content and the thousand seed weight of accessions?
2. Materials and Methods
2.1. Plant Material
The plant material included in this study originated from the Nordic Genetic Resource Center, https://www.nordgen.org/ (accessed on 16 March 2023; Table 1. Most accessions originated from the Nordic countries and information on the specific location of origin, pedigree relationships, and proper cultivar name is in general sparse but some information may be retrieved in the seed database at https://www.nordic-baltic-genebanks.org/gringlobal/search.aspx (accessed on 16 March 2023). A total of 227 pea accessions were grown from a single seed in pots in the greenhouse. Plants were grown to maturity and the seeds were collected for further studies in the field.
Table 1.
List of Pisum sativum accessions included in this study. SKR is the working number given to the accession in this study. Accession is the number from NordGen, www.NordGen.org (accessed on 16 March 2023). Name included cultivar name, if existing, WBH designate a pre-breeding line from Weibuhlholm. AA5, AC58 and AA9 were the microsatellites used and x indicate that the accession produced a result. TSW were the thousand seed weight and then seed protein content in % for 2017, 2018 and 2019.
Of the 227 accessions grown in the greenhouse, 199 were chosen for total nitrogen analyses based on previous information of high protein content. These were grown at the experimental farm of University of Copenhagen in Taastrup (55°40′ N; 12°18′ E), Denmark in the years 2017, 2018 and 2019. One individual per accession per year was analyzed. The seeds were hand-sown at approximately 1 seed per 10 cm in a two meter single row, and nets supported the growing plants. Both greenhouse plants and plants grown in the field were organized in the order of the Nordic Genbank number. Automatic irrigation was used when needed. At maturity, whole plants were harvested keeping accessions separate and then air-dried. Pods were collected from the dry plants and threshed by hand.
2.2. Microsatellites
Total genomic DNA was extracted from green leaf material of 194 pea accessions via the CTAB method [35]. The DNA was dissolved in 50 μL 1× TE buffer and the quality of the total genomic DNA was assessed via agarose gel electrophoresis (Electrophoresis Consort EV265, Cohasset, MA, USA) and spectrophotometry on an Uvidoc (Buch & Holm, Herlev, Denmark).
The microsatellite markers AA5, AC58 and A9 were chosen after initial screening of a larger set of microsatellite markers and based on previous success reported by Loridon et al. [28]. When tested AA5, AC58 and A9 produced consistently good quality data across the large number of accessions. All forward primers had an M13-tail (CACGACGTTGTAAAACGAC) with a dye attached. The forward primer of AA5 had a FAM, A9 a NED and AC58 a VIC dye (Table 2).
Table 2.
The three microsatellite markers AA5, A9 and AC58 [28], the forward and reverse primer sequences of the markers, and the color attached to the M13-tail of the forward primer. The sequence of the M13-tail was: CACGACGTTGTAAAACGAC.
Each PCR sample consisted of 1 μL total genomic DNA, 7.25 μL double-distilled water, 1 μL 10× Dreamtaq buffer (20 mM Mg Cl2), 0.4 μL dNTP (2.5 μM), 0.125 μL forward primer with M13-tail and color (10 μM), 0.125 μL reverse primer (10 μM), and 0.1 μL Dreamtaq Polymerase. The samples were mixed and run on a T100 Thermal Cycler (Bio-Rad Laboratories, Copenhagen, Denmark) using the following protocol: 94 °C for 4 min, 19 cycles of 94 °C for 1 min, 64 °C for 30 s (annealing), and 1 min of 72 °C. The annealing temperature was decreased half a degree for every cycle ending at 55 °C. Then, an additional 19 cycles were conducted with the same steps using a constant annealing temperature of 55 °C, followed by 10 min at 72 °C, and a final cooling to 4 °C.
The PCR products were diluted 1:5, and 3 μL of the diluted PCR product was mixed with 10 μL formamide and 0.5 μL GeneScan™ 120 LIZ™ (standard). The samples were denatured at 95 °C for 5 min and put on ice for 2 min. The samples were applied to an ABI 3130xl Genetic Analyzer (Hitachi, Tokyo, Japan) using the program Genetic Mapper Instrument protocol Fragment36_POP7_G5, and the resulting fragments were visualized on GeneScan® 3.7 Analysis Software (Applied Biosystems, Roskilde Denmark).
The resulting microsatellite band patterns were treated as binary characters where the presence of a given PCR fragment was marked as 1, absence as 0 and lacking results were marked as “?”. All data from the three microsatellite markers were combined into one Microsoft Excel file and later converted to a text file (.txt) to generate the full matrix.
2.3. Thousand Seed Weight and Protein Content
Thousand seed weight (TSW) was determined by seed counting on a Contador Seed counter (Pfeuffer, Kitzingen, Germany) and then weighed.
Total N was determined via the Dumas method on a Vario Macro Cube (Elementar) [36] and crude protein content (%, w/w) was calculated using the conversion factor 5.44 [37,38].
2.4. Data Analysis
2.4.1. Polymorphism Information Content
Polymorphism information content (PIC) was calculated for the microsatellite data. The values were calculated using the following equation, where is the frequence of the i allele [27] (see also [28]):
2.4.2. Neighbor Joining Tree
A Neighbor Joining tree was constructed based on the binary matrix in PAUP 4.0a169 [39]. The parameters were set to BioNJ method with mean character difference as distance measure, and all characters were given equal weight. A Neighbor Joining bootstrap analysis was run with 2000 repetitions. Bootstrap values of 50% or above were manually added to the Neighbor Joining tree.
2.4.3. Principal Coordinate Analysis of Microsatellite Data
A Principal Coordinate analysis (PCoA) was performed in GenAlEx [40,41] on the microsatellite data. Binary data were converted into columns of codominant genotypic microsatellite data with loci scored as fragment size in accordance with the GenAlEx-formats. A genetic distance tri-matrix was then made in GenAlEx, from which a PCoA was performed.
2.4.4. ANOVA of Crude Protein across Years
A one-way ANOVA was performed in Microsoft Excel, comparing the difference between protein content across the three years. A Tukey–Kramer post hoc test was performed in Excel to compare the mean between each pairwise combination of years.
3. Results
Single seed descent was used to obtain a homogeneous seed stock of 227 genebank accessions for further field evaluations and for DNA extractions, molecular identification and phylogenetic analysis.
3.1. Plant Material
All seeds germinated and leaf material from all plants were collected for DNA extraction.
3.2. Microsatellites
DNAs were successfully extracted from all 194 samples selected for microsatellite analyses. PCRs from the marker AA5 resulted in 185 successful amplifications (95.4%), marker AC58 and A9 both resulted in 186 (95.8%) successful amplifications (Table 1). All together, a total of 25 accessions lacked results for one of the three primer pairs: SKR 8, 27, 42, 78, 97, 99, 100, 103, 106, 110, 117–119, 123, 125, 129, 131, 134, 135, 138, 182, 200, 202, 216 and 218 (Table 1).
The final matrix included 77 different PCR fragments from the three primer pairs. Marker AA5 provided 21 PCR fragments, marker AC58 27 PCR fragments, and marker A9 had 29 PCR fragments. The length of the PCR fragments varied from 110 to 407 base pairs (bp), where the smallest fragments were from AA5 and the largest from A9.
For the marker AA5, 5 of 21 PCR fragments were uninformative. The most common PCR fragment for AA5 was present in 64 samples and the length was 127 bp. For the marker AC58, 9 of 27 PCR fragments were uninformative. The most common in AC58 was present in 37 samples. For marker A9, 9 of 29 PCR fragments were uninformative. In A9, the most frequent PCR fragment was present in 26 accessions and had the length of 387 bp.
3.3. Thousand Seed Weight and Protein Content
Thousand seed weight (TSW) was determined for 102 field grown accessions in 2017 as a measure of seed size and ranged from 64 to 254.2 g (Table 1; Figure 1) with a mean of 152.05 g. The distribution (Figure 1) indicated that the majority of the accessions had a TSW between 120 and 180 g.
Figure 1.
Thousand seed weight of pea accessions in grams (n = 102). The peas were grown with irrigation and standard agricultural management in 2017 and harvested at maturity.
The protein content was successfully determined for field grown material, with 199 accession in 2017, 95 accessions in 2018 and 30 accessions in 2019 (Table 1). The protein content varied from 9.3% (SKR 182, NGB102130) to 34.1% (SKR 64, NGB105862) for all years (Table 1) and most samples had a protein content between 16% and 28% protein content (Figure 2). All years showed a Gaussian distribution of protein content of the accessions. The average was 20.7% in 2017, 26.2% in 2018 and 22.7% in 2019, and 13 accessions had a protein content above 28% (five above 30%) in 2017, 19% (five above 30%) in 2018 and 0 in 2019. Of the 10 accessions with the highest protein content in 2017, only 1 was in the top ten in 2018, although the 95 pea accessions selected for 2018 generally had a higher protein content and were chosen among the 199 accessions with the highest protein content in 2017. One-way ANOVA revealed a highly significant difference in protein content across the three years (F2.21 = 66.05, p = 9.44 × 10−25). Tukey–Kramer post hoc test found that the protein content was significantly different among years.
Figure 2.
Protein content (%, w/w) of pea accessions. Blue represents accession were from 2017 (n = 199), Green from 2018 (n = 95) and Black 2019 (n = 30).
3.3.1. Polymorphism Information Content
The PIC values for marker AA5, AC58 and AA9 were 0.87, 0.90 and 0.91, respectively. The average PIC value was 0.89.
3.3.2. Neighbor Joining Tree
The Neighbor Joining analyses resulted in a tree topology with 107 groups of 194 accessions (Figure 3). The bootstrap analysis resulted in branch support for 11 groups ranging between 50% and 89%. Most groups did not have bootstrap support (Figure 3). We had cultivar names for 77 of the 194 accessions included in the NJ analysis. Fifty-three cultivars were represented by only one accession. Twelve accessions were from the cultivar Pisum sativum “chlorotica”, four from “variomaculata”, two from “compactum”, two from “costata”, two from “New Wales”, and two from “Lilaska” (Figure 3; Table 1). The “chlorotica” cultivars were resolved in several different groups on the NJ tree, but most of them clustered around the same few branches (Figure 3). The other cultivars resolved far apart on the NJ tree.

Figure 3.
Neighbor Joining tree of 194 pea accessions with bootstrap values on supported branches. Highlighting indicates cultivars represented by more than one accession. Green: “costata”; orange: “chlorotica”; red: “compactum”; purple: “New Wales”; yellow: “Lilaska”; blue: “variomaculata”.
3.3.3. Principal Coordinates of Microsatellite Data
The first two principal components of the PCoA accounted for 16.62% of the variance between the accessions of the three microsatellites combined (Figure 4). The pattern of variation mirrored the placement of samples in the NJ tree.
Figure 4.
Principal coordinates analysis of the microsatellite results. X- and y-axis represent PC1 and PC2, respectively.
3.3.4. Nitrogen Content and Thousand Seed Weight in Relation to Genetic Diversity
The results from the total N analyses showed no apparent relationship between the protein content and the genetic diversity among the pea accessions. The accessions with high protein content did not cluster either in the NJ tree or in the PCoA.
4. Discussion
The primary goal of this study was to examine the genetic diversity in pea accessions, which have primarily been grown and collected in Scandinavia. The genetic variation was investigated using three microsatellite markers, which all functioned well and resulted in high genetic variation. Furthermore, the protein content was determined for different subsets of the accessions over three years and thousand seed weight was measured for one year.
4.1. Microsatellites
The microsatellite AC58 produced 186 PCR fragments that varied between 210 and 263 bp. The number of fragments were higher than other studies where 1–11 fragments are reported, but the lengths were within what other studies have found: 200–263 bp [15,19,28,31,34]. The microsatellite AA5 produced 185 different PCR fragments of lengths from 110 to 264 bp. The number of fragments were higher than other studies where 1–3 fragments are reported, and the lengths spanned a wider range than the 225 to 250 bp reported in other studies find [15,28,34]. The microsatellite A9 produced 186 different PCR fragments with lengths that varied between 325 and 407 bp. The number of fragments were higher than in other studies, where one to six fragments were reported, and the lengths spanned a somewhat wider range than the 330–390 bp observed in previous studies [14,17,18,21,28,31].
To sum up, most previous studies produced 2–4 DNA fragments for each microsatellite, except for the study of Hagenblad et al. [19], where 14–33 DNA fragments were found. Hagenblad et al. [19], like in our study, used Scandinavian pea accessions from NordGen. This indicates that the collection of Scandinavian pea accessions at NordGen are genetically diverse. The results could have been even more discriminating if more than three microsatellites had been analyzed.
4.2. Polymorphism Information Content
The PIC values in this study (0.87–0.91) were high when compared to other studies. The reported PIC values for pea accessions fall within a wide range from 0.3 to 1.0 for individual microsatellite markers. The variation in PIC values is higher in studies developing microsatellites, e.g., PICs varied between 0.04 and 1.0 with an average of 0.63 in Loridon et al. [28]. Many studies focus on the genetic diversity within a specific group of accessions and use already tested microsatellites. The PIC values of these studies do not span the same variation as Loridon et al. [28] did. A few studies did find PIC values as high as in this study. Burstin et al. [13] use a large number of accessions and many microsatellites and obtained PIC values between 0.46 and 0.97 with a mean of 0.8 for 372 wild and cultivated pea accessions using 29 microsatellites. Nasiri et al. [33] found an average PIC value of 0.72 for 77 pea accessions from 17 different countries with 10 microsatellites. However, most studies found lower PIC values than our study, between 0.3 and 0.65, most using a larger number of microsatellites [14,16,17,18,20,21,22,23,24,28,30,31,32]. Singh et al. [20] use 20 microsatellites to assess the genetic diversity of 47 pea accessions from India and obtained an average PIC value of 0.55, ranging from 0.04 to 0.85. Another study [22] assessed 40 pea accessions with five microsatellites and obtained PIC values between 0.14 to 0.82. Of all studies assessing genetic variation in peas [13,14,15,16,17,18,19,20,21,22,23,24,25,26,29,30,31,32,33,34], only three studies [13,15,33] reported results where all the microsatellites had a PIC value of 0.46 or higher.
The PIC values are mainly affected by the variation among the included individuals; however, the number of microsatellites may affect the outcome of the average PIC value because the number affects the statistical strength by potentially adding more variation [27,31]. The genome of Pisum sativum is large, approximately 4.45 giga base pairs [3], and therefore the results of this study could be affected by the low number of microsatellites included three.
The PIC values in this project varied between 0.87 and 0.91 with an average of 0.89. For the same specific microsatellites, Loridon et al. [28] finds PIC values of 0.69 (A9), 0.84 (AC58) and 0.78 (AA5). Other studies using the A9, AC58 and AA5 microsatellites find PIC values between 0.61 and 0.78 [14,15,17,18,19,31,34] except for the A9 microsatellite of Haliloglu et al. [17], which has a low PIC value of 0.03.
The PIC values of this study were therefore among the highest reported for genetic variation among pea accessions [13,14,15,16,17,18,19,20,21,22,23,24,25,26,28,29,30,31,32,33,34]. The average PIC value is not only affected by the variation among individuals for each microsatellite included but also by the number of accessions, because the chance of identifying different alleles rises with the number of accessions included [27,31]. Thus, the higher PIC values in the present study could partially be explained by the larger number of accessions (194) compared to most studies [16,17,18,19,20,21,22,23,26,28,29,31,32,33,34]. Some studies included more than 100 accessions [13,14,15], however, and few more than 500 [24,25,30].
This study, along with most of the studies on genetic diversity of peas, placed the microsatellites used in the category, “highly informative”, because the PIC values are above 0.5 [27]. The high PIC values of this and other studies could be explained by the selection of the most polymorphic markers from the study of Loridon et al. [28] or similar studies where a large number of microsatellites were included. The high average PIC value of this study could be interpreted as a reflection of a high genetic variation in general among the 194 accessions included. This high genetic variation indicates that the accessions had not undergone a targeted breeding for specific agronomical traits and homozygosity [32]. The high PIC values might thus be explained by the inclusion of a large number of pea accessions with different breeding histories.
4.3. Neighbor Joining Tree
The Neighbor Joining tree uniquely identified 153 of the 194 accessions (78%) and 108 groups were resolved as monophyletic using three microsatellites (Figure 3). Jain et al. [32] uniquely identify all 96 individuals included in the study in the NJ topology, but they included 42 polymorphic markers, distributed throughout the genome. Many studies using more microsatellites have produced a full resolution of the NJ tree [26,31,32]. One study using three microsatellites obtained 95% resolution of the topology, but this study was partly at the species level [29], whereas the present study was below the species level. In the study by Bouhadida et al. [30], 18 of 19 accessions (94.7%) are uniquely identified on an unweighted pair group method (UPGMA) topology through the use of five microsatellite markers. The UPGMA and NJ methods use the same underlying model, but UPGMA assumes a similar rate of evolution rate along the branches of the tree (molecular clock) [42]. The bootstrap support values of our study were low (Figure 3) [43]. The low bootstrap support and the lack of resolution in the NJ topology is mainly explained by a lack of variation due to insufficient DNA fragments of each microsatellite. A total of 77 variable DNA fragments is not enough to fully resolve 194 accessions [43]. Another possible explanation for the low bootstrap support is the taxonomic level. For studies below the species level, bootstrap values cannot always be expected to be high [43]. A third factor affecting the outcome of the NJ topology in this study is missing results for some individuals. Twenty-five accessions lacked data for one of the three microsatellites, which contributed to greater uncertainty in the branching pattern and collapse of branches [44].
The genetic diversity of named cultivars was expected to be structured such that accessions with the same cultivar name would be included in the same monophyletic group. However, such a structure did not appear. The 12 cultivars for which we had more than one accession were resolved in different positions on the NJ tree (Figure 3, colored cultivars). One group included accessions from “chlorotica”, “reductus”, “superaeromaculata” and “compactum” (Figure 3, colored cultivars). This lack of structure could partly be explained by the lack of sufficient data [3,24,34]. However, the resolution of individuals from the same cultivar at different places in the NJ tree may also be the result of genetic variation among individuals. This variation may be due to separation in time creating genetic diversity because of crosspollination resulting in polymorphism [29]. Peas from Ethiopia are genetically similar and resolve as a monophyletic group, whereas accessions from other countries, e.g., USA and Norway, do not show such similarity and do not resolve as monophyletic [33]. This lack of similarity can be explained by a gene flow between countries, e.g., use of genetic material from more than one country. The lack of a clear pattern among cultivars included in this study might be due to a high exchange of genetic material among breeding companies in Scandinavia, because the accessions of the same cultivar were not necessarily from the same source.
4.4. Thousand Seed Weight and Protein Content
Seed size is an important agronomic trait for yield and productivity that is inherited and characteristic of specific cultivars. Seed size is a quantitative trait controlled by many genes. Thousand seed weight is a simple method for measuring seed size. We found large variation from 64.4 g (SKR 46, NGB103568) to 251.2 g (SKR 218, NGB103607).
Protein content varied from 9.3% (SKR 182, NGB102130) to 34.1% (SKR 64 NGB105862) over all years and accessions. One goal of this study was to identify high-protein pea accessions, and 13 were found to have a protein content above 28% in 2017 and 19 in 2018, but none in 2019. This emphasizes the variation in protein content across the years, as the accessions with determined protein content for the years following 2017 was selected for high protein content. The selection of peas with a high protein in 2017 for analyses in 2018 moved the distribution to the right (Figure 2). We expected the protein content of the 2019 samples to show the same pattern and be placed even further to the right, but this was not the case. Year to year variation in protein content is well documented and is assumed to be due to differences in weather [45]. In comparison to other studies, the protein content was 23.9 ± 2.5% and TSW 195.9 ± 16.9 g for a collection of 1222 accession of cultivars and landraces cultivated during 1979–1982 with irrigation and standard management in Sweden [46]. A recent study of 50 accessions assessed for their potential in the Arctic region showed an average protein content of 26% in Denmark, 24% in Sweden and 20% in Finland [47].
An analysis of 198 peas (Pisum sativum L. cv. Trapper) from Saskatchewan farmers showed a protein content of 14.5% to 28.5% with a Gaussian distribution [11]. The protein content was determined from the Kjel-Foss method with a conversion factor of 6.25. Wang and Daun [10] determined the protein content via the Dumas method (N × 6.25) for six samples of four different pea varieties with a crude protein content ranging from 20.2% to 26.7%. Thus, the studies show that there is natural variation among peas. The variation in our study was more wide-ranging, and while some accessions had a lower protein content than that reported in previous studies, there were accessions that were higher than in most other reports. Only Slinkard [48] reported a protein content above 30%.
4.5. Future Perspectives
The next step could be to investigate the history of and characterize the cultivars and accessions included in this study. These accessions were primarily from Scandinavia and all from Nordic Gene bank and they were chosen primarily because there were some indications but not necessarily confirmations of high protein content.
A high genetic diversity among pea accessions from a gene bank is valuable because researchers want to preserve as much diversity as possible for future breeding. By targeted crossing of genetically diverse individuals, a selection of plants with desirable traits such as high protein, high resistance or tolerance to drought, followed by breeding towards homozygosity, it is possible to actively use the preserved genetic diversity to develop pea cultivars for the future [3,12,32].
Microsatellites are only one of several methods to study genetic diversity. Microsatellites are popular due to their co-dominance and high polymorphism. Methods in which differences across the genome are targeted give more data and are more likely to resolve genetic diversity among accessions. An example of such a method is Diversity Array Technology (DArT), [49].
Our results increase our understanding of genetic diversity across pea varieties, and could contribute to the sustainable cultivation of peas and adaptation to climate change.
5. Conclusions
Homogeneous lines of 227 pea accessions from seed stocks at NordGen were produced via single seed descent. The peas were grown in a field for 2 years and the seed protein content showed a variation between 9.3% and 34.1% over years and accessions and thousand seed weight were on average 152.05 g. More than 10 accessions had a protein content above 28%, showing that the collection has potential as breeding nursery for high-protein pea cultivars. Three microsatellites were highly informative. The microsatellites separated 153 of 194 accessions and had PIC values between 0.87 and 0.91, indicating that the gene bank material contains a large number of pea accessions with different breeding.
Author Contributions
Conceptualization, S.K.R., G.P. and C.B.A.L.; methodology, S.K.R. and C.B.A.L.; software, L.W. and C.B.A.L.; validation, C.B.A.L. and L.W.; formal analysis, C.B.A.L., L.W. and S.K.R.; investigation, C.B.A.L., S.K.R. and L.W.; data curation, L.W. and C.B.A.L.; writing—original draft preparation, L.W.; writing—review and editing, L.W., S.K.R. and C.B.A.L.; visualization, L.W. and C.B.A.L.; supervision, C.B.A.L. and S.K.R.; project administration, S.K.R.; funding acquisition, S.K.R. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by The Danish Agricultural Agency, Genetic Resources, grant numbers 16-3262-000061 and 18-26051-000011; and organic RDD Peas & Love project 34009-21-1895.
Data Availability Statement
Data are available from the Dryad Digital Repository: at https://doi.org/10.5061/dryad.d7wm37q6b, accessed on 29 August 2023.
Acknowledgments
We thank NordGen for providing seeds of pea accessions, Didde Hedegaard Sørensen and Vinnie Deichmann for support in the lab, Weiyao Fan for help with the field trails, the anonymous reviewers for helpful comments on the manuscript, and Jacob Weiner for editorial support.
Conflicts of Interest
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.
References
- Davies, D.R. Peas, Pisum sativum. In Evolution of Crop Plants, 2nd ed.; Smartt, J., Simmonds, N.W., Eds.; Longman Scientific & Technical: London, UK, 1995; pp. 294–296. ISBN 0-582-08643-4. [Google Scholar]
- FAOstat. 2022. Available online: https://www.fao.org/faostat/en/#data (accessed on 15 November 2022).
- Kreplak, J.; Madoui, M.-A.; Cápal, P.; Novak, P.; Labadie, K.; Aubert, G.; Bayer, P.E.; Gali, K.; Syme, R.A.; Main, D.; et al. A reference genome for pea provides insight into legume genome evolution. Nat. Genet. 2019, 51, 1411–1422. [Google Scholar] [CrossRef]
- Mendel, G. Versuche über Plflanzenhybriden. Verhandlungen des Naturforschenden Vereines in Brünn, Bd. IV für das Jahr 1865, Abhandlungen. 1866, pp. 3–47. Available online: http://www.esp.org/foundations/genetics/classical/gm-65.pdf (accessed on 17 July 2023).
- Bastianelli, D.; Grosjean, F.; Peyronnet, C.; Duparque, M.; Regnier, J.M. Feeding value of pea (Pisum sativum, L.). Chemical composition of different categories of pea. Anim. Sci. 1998, 67, 609–619. [Google Scholar] [CrossRef]
- Nikolopoulou, D.; Grigorakis, K.; Stasini, M.; Alexis, M.; Iliadis, K. Differences in chemical composition of field pea (Pisum sativum) cultivars: Effects of cultivation area and year. Food Chem. 2007, 103, 847–852. [Google Scholar] [CrossRef]
- Holl, F.B.; Vose, J.R. Carbohydrate and protein accumulation in the developing field pea seed. Can. J. Plant Sci. 1980, 60, 1109–1114. [Google Scholar] [CrossRef]
- McLean, L.A.; Sosulski, F.W.; Youngs, C.G. Effects of nitrogen and moisture on yield and protein in field peas. Can. J. Plant Sci. 1974, 54, 301–305. [Google Scholar] [CrossRef]
- Robertson, R.N.; Highkin, H.R.; Smydzuk, J.; Went, F.W. The Effect of Environmental Conditions on the Development of Pea Seeds. Aust. J. Biol. Sci. 1962, 15, 1–15. [Google Scholar] [CrossRef]
- Wang, N.; Daun, J.K. Effect of variety and crude protein content on nutrients and certain antinutrients in field peas (Pisum sativum). J. Sci. Food Agric. 2004, 84, 1021–1029. [Google Scholar] [CrossRef]
- Reichert, R.D.; MacKenzie, S.L. Composition of peas (Pisum sativum) varying widely in protein content. J. Agric. Food Chem. 1982, 30, 312–317. [Google Scholar] [CrossRef]
- Smykal, P.; Aubert, G.; Burstin, J.; Coyne, C.J. Pea (Pisum sativum L.) in the Genomic Era. Agronomy 2012, 2, 74–115. [Google Scholar] [CrossRef]
- Burstin, J.; Salloignon, P.; Chabert-Martinello, M.; Magnin-Robert, J.-B.; Siol, M.; Jacquin, F.; Chauveau, A.; Pont, C.; Aubert, G.; Delaitre, C.; et al. Genetic diversity and trait genomic prediction in a pea diversity panel. BMC Genom. 2015, 16, 105. [Google Scholar] [CrossRef]
- Cieslarova, J.; Hybl, M.; Griga, M.; Smykal, P. Molecular analysis of temporal genetic structuring in pea (Pisum sativum L.) cultivars bred in the Czech Republic and in former Czechoslovakia since the mid-20th century. Czech J. Genet. Plant Breed. 2012, 48, 61–73. [Google Scholar] [CrossRef]
- Hanci, F. Genetic variability in peas (Pisum sativum L.) from Turkey asssessed with molecular and morphological markers. Folia Hortic. 2019, 31, 101–116. [Google Scholar] [CrossRef]
- Sharma, A.; Sharma, S.; Kumar, N.; Rana, R.S.; Sharma, P.; Kumar, P.; Rani, M. Morpho-molecular genetic diversity and population structure analysis in garden pea genotypes using simple sequence repeat markers. PLoS ONE 2022, 17, e0273499. [Google Scholar] [CrossRef]
- Haliloglu, K.; Turkoglu, A.; Tan, M.; Poczai, P. SSR-Based Molecular Identification and Population Structure Analysis for Forage Pea (Pisum sativum var arvense L.) Landraces. Genes 2022, 13, 1086. [Google Scholar] [CrossRef]
- Mohamed, A.; García-Martínez, S.; Loumerem, M.; Carbonell, P.; Ruiz, J.J.; Boubaker, M. Assessment of genetic diversity among local pea (Pisum sativum L.) accessions cultivated in the arid regions of Southern Tunisia using agro-morphological and SSR molecular markers. Genet. Resour. Crop Evol. 2019, 66, 1189–1203. [Google Scholar] [CrossRef]
- Hagenblad, J.; Boström, E.; Nygårds, L.; Leino, M.W. Genetic diversity in local cultivars of garden pea (Pisum sativum L.) conserved ‘on farm’ and in historical collections. Genet. Resour. Crop Evol. 2014, 61, 413–422. [Google Scholar] [CrossRef][Green Version]
- Singh, J.; Dhall, R.K.; Vikal, Y. Genetic diversity studies in Indian germplasm of pea (Pisum sativum L.) using morphological and microsatellite markers. Genetika 2021, 53, 473–491. [Google Scholar] [CrossRef]
- Dribnokhodova, O.P.; Gostimsky, S.A. Allele polymorphism of microsatellite loci in pea Pisum sativum L. lines, varieties, and mutants. Russ. J. Genet. 2009, 45, 788–793. [Google Scholar] [CrossRef]
- Gainullina, K.P.; Kuluev, B.R.; Davletov, F.A. Genetic diversity assessment in pea cultivars and lines using the SSR analysis. Proc. Appl. Bot. Genet. Breed. 2020, 181, 70–80. [Google Scholar] [CrossRef]
- Ram, H.; Hedau, N.K.; Chaudhari, G.V.; Choudhary, M.; Kant, L. Genetic Diversity Assessment in Pea (Pisum sativum L.) using Microsatellite Markers. Int. J. Bio Resour. Stress Manag. 2021, 12, 402–408. [Google Scholar] [CrossRef]
- Wu, X.; Li, N.; Hao, J.; Hu, J.; Zhang, X.; Blair, M.W. Genetic Diversity of Chinese and Global Pea (Pisum sativum L.) Collections. Crop Sci. 2017, 57, 1574–1584. [Google Scholar] [CrossRef]
- Zong, X.-X.; Guan, J.-P.; Wang, S.-M.; Liu, Q.-C.; Redden, R.R.; Ford, R. Genetic Diversity and Core Collection of Alien Pisum sativum L. Germplasm. Acta Agron. Sin. 2008, 34, 1518–1528. [Google Scholar] [CrossRef]
- Teshome, A.; Bryngelsson, T.; Dagne, K.; Geleta, M. Assessment of genetic diversity in Ethiopian field pea (Pisum sativum L.) accessions with newly developed EST-SSR markers. BMC Genet. 2015, 16, 102. [Google Scholar] [CrossRef]
- Botstein, D.; White, R.L.; Skolnick, M.; Davis, R.W. Construction of a genetic linkage map in man using restriction length polymorphisms. Am. J. Hum. Genet. 1980, 32, 314–331. [Google Scholar] [PubMed]
- Loridon, K.; McPhee, K.; Morin, J.; Dubreuil, P.; Pilet-Nayel, M.L.; Aubert, G.; Rameau, C.; Baranger, A.; Coyne, C.; Lejeune-Hénaut, I.; et al. Microsatellite marker polymorphism and mapping in pea (Pisum sativum L.). Theor. Appl. Genet. 2005, 111, 1022–1031. [Google Scholar] [CrossRef] [PubMed]
- Tar’an, B.; Zhang, C.; Warkentin, T.; Tullu, A.; Vandenberg, A. Genetic diversity among varieties and wild species accessions of pea (Pisum sativum L.) based on molecular markers, and morphological and physiological characters. Genome 2005, 48, 257–272. [Google Scholar] [CrossRef] [PubMed]
- Bouhadida, M.; Srarfi, F.; Saadi, I.; Kharrat, M.M. Molecular characterization of pea (Pisum sativum L.) using microsatellite markers. IOSR J. Appl. Chem. 2013, 5, 57–61. [Google Scholar] [CrossRef]
- Ahmad, S.; Singh, M.; Lamb-Palmer, N.D.; Lefsrud, M.; Singh, J. Assessment of genetic diversity in 35 Pisum sativum accessions using microsatellite markers. Can. J. Plant Sci. 2012, 92, 1075–1081. [Google Scholar] [CrossRef]
- Jain, S.; Kumar, A.; Mamidi, S.; McPhee, K. Genetic diversity and population structure among pea (Pisum sativum L.) cultivars as revealed by simple sequence repeat and novel genic markers. Mol. Biotechnol. 2014, 56, 925–938. [Google Scholar] [CrossRef]
- Nasiri, J.; Haghnazari, A.; Saba, J. Genetic diversity among varieties and wild species accessions of pea (Pisum sativum L.) based on SSR markers. Afr. J. Biotechnol. 2009, 8, 3405–3417. [Google Scholar] [CrossRef]
- Kumari, P.; Basal, N.; Singh, A.K.; Rai, V.P.; Srivastava, C.P.; Singh, P.K. Genetic diversity studies in pea (Pisum sativum L.) using simple sequence repeat markers. Genet. Mol. Res. 2013, 12, 3540–3550. [Google Scholar] [CrossRef] [PubMed]
- Doyle, J.J.; Doyle, J.L. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem. Bull. 1987, 19, 11–15. [Google Scholar]
- Saint-Denis, T.; Goupy, J. Optimization of a nitrogen analyses based on the Dumas method. Anal. Chim. Acta 2004, 515, 191–198. [Google Scholar] [CrossRef]
- Mosse, J. Nitrogen to Protein Conversion Factor for Ten Cereals and Six Legumes or Oilseeds. A Reappraisal of Its Definition and Determination. Variation According to Species and to Seed Protein Content. J. Agric. Food Chem. 1990, 38, 18–24. [Google Scholar] [CrossRef]
- Coyne, C.; Grusak, M.; Razai, L.; Baik, B. Variation for pea seed protein concentration in the USDA core collection. Pisum Genet. 2005, 37, 5–9. [Google Scholar]
- Swofford, D.L. PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods); Version 4; Sinauer Associates: Sunderland, MA, USA, 2003. [Google Scholar]
- Peakall, R.; Smouse, P.E. GenAlEx 6.5: Genetic analysis in Excel. Population genetic software for teaching and research—An update. Bionformatics 2012, 28, 2537–2539. [Google Scholar] [CrossRef]
- Peakall, R.; Smouse, P.E. GENALEX6: Genetic analysis in Excel. Population genetic software for teaching and research. Mol. Ecol. Notes 2006, 6, 288–295. [Google Scholar] [CrossRef]
- Kalinowski, S. How well do evolutionary trees describe genetic relationships among populations? Heredity 2009, 102, 506–513. [Google Scholar] [CrossRef] [PubMed]
- Soltis, P.S.; Soltis, D.E. Applying the Bootstrap in Phylogeny Reconstruction. Stat. Sci. 2003, 18, 256–267. [Google Scholar] [CrossRef]
- Wiens, J.J. Missing data and the design of phylogenetic analyses. J. Biomed. Inform. 2006, 39, 34–42. [Google Scholar] [CrossRef]
- Klein, A.; Houtin, H.; Rond-Coissieux, C.; Naudet-Huart, M.; Touratier, M.; Marget, P.; Burstin, J. Meta-analysis of QTL reveals the genetic control of yield-related traits and seed protein content in pea. Sci. Rep. 2020, 10, 15925. [Google Scholar] [CrossRef] [PubMed]
- Solberg, S.Ø.; Yndgaard, F.; Poulsen, G.; von Bothmer, R. Seed yield and protein content in the Weibullsholm Pisum collection. Genet. Resour. Crop Evol. 2017, 64, 2035–2047. [Google Scholar] [CrossRef]
- Carlson-Nilsson, U.; Aloisi, K.; Vågen, I.M.; Rajala, A.; Mølmann, J.B.; Rasmussen, S.K.; Niemi, M.; Wojciechowska, E.; Pärssinen, P.; Poulsen, G.; et al. Trait Expression and Environmental Responses of Pea (Pisum sativum L.) Genetic Resources Targeting Cultivation in the Arctic. Front. Plant Sci. 2021, 12, 688067. [Google Scholar] [CrossRef] [PubMed]
- Slinkard, A.E. Percent Protein in the USDA World Collection of Peas Grown at Saskatoon in 1971; Crop Science Department, University of Saskatchewan: Saskatoon, SK, Canada, 1972. [Google Scholar]
- Wittenberg, A.H.J.; van der Lee, T.A.J.; Visser, R.G.F.; Schouten, H.J.; Kilian, A. Genome-Wide Profiling Using Diversity Arrays Technology (DArT); Theoretical Considerations and Practical Protocols. In Genetic Mapping Using the Diversity Arrays Technology (DArT)—Application and Validation Using the Whole-Genome Sequences of Arabidopsis Thaliana and the Fungal Wheat Pathogen Mycosphaerella Graminicola; Wittenberg, A.H.J., Ed.; Wageningen University: Wageningen, The Netherlands, 2007; Chapter 2; pp. 25–59. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).




