Genetic Diversity and Structure of Local Pear Cultivars from Mountainous Areas from Aragon (Northeastern Spain)

: The genetic diversity of pear local varieties prospected in mountainous areas from Northeastern Spain (Pyrenees and Iberian Cordillera) is not well known so far. In this study, an overall set of 252 accessions (178 prospected in mountainous areas from Aragon and a diverse set of 74 reference cultivars) was analyzed using 14 SSRs in order to estimate its genetic diversity and to identify the genetic structure and relationships among the pear germplasm studied. A total of 251 distinct alleles were successfully ampliﬁed with an average of 17.9 alleles per locus and with a wide genetic diversity (mean expected heterozygosity of 0.82). In total, 228 unique genotypes were identiﬁed and 210 genotypes were represented by a single accession indicating a situation of extreme vulnerability of these pear genetic resources held in the CITA collection. An amount of 32.9% of accessions were considered triploids displaying three alleles at least into two loci. Genetic analyses performed by a model-based Bayesian procedure, principal coordinate analysis and analysis of molecular variance supported the presence of a genetic stratiﬁcation with the existence of four sub-groups among the accessions, with a highly signiﬁcant differentiation (F ST = 0.132; p < 0.001). These results shed light on the characterization and genetic relatedness between these local accessions and currently cultivated pear cultivars and highlight the importance to safeguarding this diversity that might be essential for new breeding programs.


Introduction
Pears (Pyrus spp.) are one of the oldest and most widespread fruit crops in the world in temperate regions. Pear (Pyrus spp.) belongs to the genus Pyrus, subfamily Spiraeoideae, tribe Pyreae within of the Rosaceae plant family (a modified classification of Potter et al. [1]). The majority of pear cultivars are diploid (2n = 2x = 34) and the Pyrus genus could be the result of a hybridization between two primitive forms of Rosaceae: Prunoideae (x = 8) and Spiraeodeae (x = 9) [2][3][4]. The genus Pyrus has principally been divided into European and Asian pears [3,5]. Four important Pyrus species are commercially grown for edible fruit: Japanese pear (P. pyrifolia Nakai), European pear (P. communis L.) and Chinese pears (P. bretschneideri Rehd. and P. ussuriensis Maxim.) [5]. European pears comprise more than 20 species and are mainly cultivated by P. communis L., all indigenous to Europe, North and South American and Africa [3][4][5]. It is believed that cultivated European pears derived from the hybridization between Pyrus pyraster and Pyrus caucasica (which are interfertile with domesticated forms) during the tertiary period in the western mountains of China [6]. Despite the great genetic variability existing in pear (more than '3000' cultivars documented), only five cultivars are the base of the currently Spanish production: 'Conferencia', 'Blanquilla', 'Decana', 'Dr Jules Guyot' and 'Williams' [7]. Therefore, although In this study, the main goal is to evaluate the genetic diversity and relationships of local pear accessions obtained from different mountainous regions of Aragon (Northeastern Spain) by 14 SSRs markers in order to: (1) determine the genetic identity of the pear accessions, (2) compare between local genotypes and a diverse set of reference cultivars conserved at the CITA germplasm collection, and (3) assess the genetic structure of the overall set of pear accession studied. This work provides further insights into the diversity and genetic structure of pear local accessions that contribute for a better management and conservation of this local germplasm.

Plant Material and DNA Extraction
Pear cultivars analyzed in this study consisted on a set of 178 pear accessions collected from mountainous areas of Aragon (Pyrenees and Iberian Cordillera from Northeastern Spain since 2001) and 74 reference cultivars. Reference set includes traditionally cultivated varieties in Spain, cultivars bred in the 18th and 19th centuries (mainly European) and international cultivars recently introduced into the Spanish market [20] and belong to the National Pear Collection maintained in the orchards of CITA of Aragon, Zaragoza, Spain. All prospected accessions were obtained from abandoned old trees or small farms and were collected from 65 different localities from the three provinces of Aragon (Huesca, Zaragoza and Teruel) (Figure 1). When the local denomination could not be known, the accessions were named after the village where they had been collected from (Table S1). These trees were propagated vegetatively and were established in collections located in Bescos de la Garcipollera (Huesca, Aragon) at an altitude of 900 m and in the orchards from CITA of Aragon. Newly expanded leaves of each accession were collected in spring (March to April), immediately frozen in liquid nitrogen and stored at −80 °C until use. Genomic DNA was collected from young fresh leaves or vegetative buds following the procedure used by Hormaza [38]. Quantification of each DNA sample was performed using a Nanodrop Newly expanded leaves of each accession were collected in spring (March to April), immediately frozen in liquid nitrogen and stored at −80 • C until use. Genomic DNA was collected from young fresh leaves or vegetative buds following the procedure used by Hormaza [38]. Quantification of each DNA sample was performed using a Nanodrop 1000 (Thermo Fisher Scientific, Wilmington, DE, USA) and all samples were diluted to a final concentration of 10 ng·µL −1 .

PCR Reactions and SSR Analysis
Genomic DNA was analyzed using a set of 14 SSRs, including 13 SSRs recommended by the European Cooperative Program for Plant Genetic Resources (ECPGR) [27,28,39] and the remaining one (CH02c11) successfully used in other pear or apple diversity stud- ies [13,28,40] (Table 1). The genome coverage and high polymorphism had proven their adequation for this type of studies. Three multiplex PCRs were conducted, named MMA, MMB and MMC, in which forward primers were labelled with 6-FAM, VIC, NED or PET fluorescent at 5 end. All PCR reactions were carried out on an iCycler Thermal Cycler (Bio-Rad Laboratories, Hercules, CA, USA) with a final volume of 10 µL, using 10 ng of DNA template, 0.2, 0.4, or 0.6 µM of each primer (Table 1) and 1X QIAGEN Multiplex PCR Master Mix (Qiagen, Holden, Germany). The temperature profile for all three PCR reactions encompassed an initial 15 min denaturation step at 95 • C, followed by 10 touchdown cycles at 95 • C for 30 s, 65-1 • C/cycle for 1 min and 72 • C for 1 min, followed by 30 cycles at 95 • C for 30 s, 55 • C for 1 min, 72 • C for 1 min and a final step of 30 min at 72 • C as described by Pina [36]. DNA amplification products were checked using agarose gel electrophoresis. Fluorescently labelled DNA fragments were separated on an ABI Prism 3730 (Applied Biosystems, Carlsbad, CA, USA) by capillary electrophoresis and analyzed and sized with Peak Scanner Software 2.0 (Applied Biosystems, Foster City, CA, USA). Reference cultivars were introduced in all PCR reactions to ensure internal control and run-to-run variation and fingerprinting analysis were performed by duplicate.

Genetic Diversity Evaluation
Genetic diversity between prospected accessions and reference cultivars was evaluated with SPAGeDI software [45] analyzing the number of polymorphic markers, total number of alleles, average number of alleles per marker, the observed heterozygosity (Ho), the expected heterozygosity (He), Wright's F statistics, allelic frequencies and unique alleles.
The effective number of alleles (A e = (∑ p 2 i ) −1 ) [46] was quantified per locus, where p i represents the frequency of the ith allele. SSR markers were also evaluated using discrimination power (P D = 1 − ∑ p 2 i ) [47], where p i represents the frequency of the ith genotype, for each of them. In order to determine genetic relationships between the accessions studied, an UPGMA cluster analysis of the similarity matrix was performed using R software [48]. Genetic Structure of the population was assessed using a model-based clustering method implemented in STRUCTURE software v.2.3.4 [49]. Bayesian analysis of unique genotypes was performed for a genetic group range between 1 and 10 (K). The analysis was run under admixture model with correlated allele frequencies. As the dataset included diploid and triploid genotypes, the software was run using the recessive alleles approach, encoding the genotypes as detailed by Urrestarazu et al. [50]. Simulations were performed with 10 runs per each proposed K value and each of them was implemented with '200,000' burn-in interactions followed by another '500,000' iterations. When the results suggested that the K groups could be further structured in sub-groups, a second-level (nested) structure analysis was performed individually for each K group [51,52]. The obtained results were analyzed with Structure Harvester [53] in order to determine the most probable group number (K) using the statistic method described by Evanno et al. [54]. Genotypes were sorted out into the group whose membership probability was higher (qI ≥ 0.8) [14,36,50,55]. The analysis of molecular variance (AMOVA) that estimates the fraction of the genetic variation among and within population was calculated using the software GenoDive version 3.05 [56], since this software supports analyses of datasets containing individuals with different ploidy levels. Principal coordinate analysis (PCoA) was performed based on SSRs data using DARwin software v 6.0.21 [57]. Finally, in order to contribute to the automation of analysis, SSR polymorphism, genetic diversity and clonality level were also evaluated using a home-made Python script that also allowed to prepare input files for the rest of the software used in this study (SPAGeDI, STRUCTURE, DARwin and GenoDive). The developed code can be found in https://github.com/FJBiocode/ShortSequenceRastreator (accessed on 31 July 2021).

SSR Polymorphism and Genetic Diversity
In this study, 252 pear accessions held at the CITA of Aragon germplasm collection were analyzed using 14 SSR markers. The total of accessions comprised a set of 178 local accessions and 74 reference cultivars. All SSR markers were polymorphic. The primer pair CH02c11 amplified two loci but the secondary locus was monomorphic as reported by other authors [28,58], and only the amplification of the main locus was considered. The analysis of synonymy revealed 24 duplicated genotypes and reduced the dataset to 228 unique genotypes (156 local accessions and 72 reference cultivars) ( Table S1). The range of amplified alleles among the 228 unique genotypes identified varied from 11 (CH04e03) to 25 (CH01d09 and CH03g07) and 251 total amplified alleles were identified ( Table 2). The local accessions set and reference material set showed slight differences in the number of alleles amplified, 182 in the reference set and 239 in the local material set. Among the 251 total amplified alleles, 163 rare alleles (frequency lower than 0.05) were found. The local material set contained 151 of these alleles, whereas the reference material set revealed only 105. Moreover, 41 unique alleles were found among the studied population revealing an average of 2.92 unique alleles per locus and CH03g07 exhibited the Agronomy 2021, 11, 1778 6 of 17 majority of them. Allele richness showed an average value of 6.83 for the overall set which contrasts with the difference between the reference set (5.54) and local material set (6.98). SSR observed heterozygosity (Ho) in the overall set of cultivars ranged from 0.44 (CH04e03) to 0.91 (CH01d09) with an average value of 0.82, whereas expected heterozygosity (He) ranged from 0.45 (Ch04e03) to 0.92 (CH01d09) with an average value of 0.82. The power of discrimination (PD) varied from 0.65 (Ch04e03) to 0.98 (CH01d09), with a mean of 0.93. The observed Ho showed the same pattern as He. These results along with the high polymorphism and allele variety found by all markers, pointed out the adequacy of selected markers and ensured analysis accuracy. The Wright's fixation index (F is ) was also determined for the set of local accessions and for the reference material one. The inbreeding coefficient ranged from 0.017 (CH01f07) to −0.082 (EMPc11) in the overall set of cultivars, with an average value of −0.022 for all loci, and from 0.147 (CH05c06) to −0.097 (CH04e03) with a mean value of −0.008 for the reference material ( Table 2). As expected, the overall F is value in the overall set was slightly negative (outbreeding), consistent for the majority of the loci (14), except for CH01f07 (0.017, p < 0.05), for CH02b10 (0.007, p < 0.001), and Ch04e03 (0.001, p < 0.001), homozygous compared with the expected values. An SSR analysis identified 75 of the 228 unique genotypes with three distinguishable alleles at several loci (at least in two or more loci), suggesting triploid individuals. The percentage of triploids varied from 32.9% to 23.2% if we consider triploid-only accessions that displayed three alleles at two, three or more loci, respectively. The highest number of genotypes with three alleles was found at loci EMPc11 and CH-Vf1, with 49 and 41 individuals, respectively. The lower level of triploids was detected with CH04e03 with five accessions.

Elucidation of Genetic Relationships between Local and Reference Material
Genotypes were considered duplicates when all alleles matched across the 14 SSRs. In total, 228 unique genotypes were found among the 252 total accessions, and the synonymous accessions were as many as 42 organized into 18 identity groups (average of 9.5% clonality in the overall dataset, 12.3% in the local set) ( Table 3). 'Roma' was the most represented genotype in these 18 groups of synonyms with four more varieties within the group. However, 210 unique genotypes were represented by a single accession among the 228 unique genotypes, which constitutes a high level of vulnerability of these conserved genotypes. Most groups of duplicates comprised accessions from different geographic origin (localities) and it was not possible to distinguish the varieties 'Tendral de Valencia' and 'Tendral de Aragon', and 'Malacara' and 'Magallon' with the 14 SSRs. However, different SSR profiles were obtained between the reference cultivar 'Williams' and 'Max Red Barlett' for three loci CH02b10, EMPc117 and GD147, despite the last one being a bud mutation derived from 'Williams'. The genetic relationship between the studied genotypes is represented by UPGMA clustering. The neighbor-joining (NJ) tree was constructed, including both diploid and triploid unique genotypes, and the cluster analysis revealed four main different groups ( Likewise, group C included a total of 46 reference varieties but only encompasses 12 local accessions. Finally, an old Spanish traditional variety, 'Donguindo' could be found in group D along with 13 prospected accessions and 'Bella Early'. In summary, over 66.7% of local material was found in different subgroups than foreign cultivars, which could reflect a remarkable singularity and potential interest of an important part of the prospected material from mountainous areas of Aragon. Furthermore, local accessions grouped with foreign cultivars were collected across different mountainous valleys of Aragon, highlighting traditional exchanges and the transference of plant material through grafting.

Population Genetic Structure Analysis
A Bayesian analysis of the 228 unique genotypes was carried out with STRUCTURE 2.3.4 using K values ranging from 1 to 10. The results of the analysis were processed with Structure Harvester and showed a most probable hierarchy division into three groups (K = 3, ∆K = 20) ( Figure S1). However, differentiation in four subgroups could be possible too, as K = 4 also had a high ∆K value (∆K ≈ 7). Groups found with K = 3 showed a remarkable heterogeneity. Groups one and two encompassed 87% of local genotypes, whereas group three included 65% of reference varieties and only 13% of local accessions. Group one (G3.1) contained 73 local accessions and 13 reference cultivars with the majority belonging to cluster A in the UPGMA dendrogram. Reference cultivars in this group are mostly old local Spanish cultivars such as 'Ceremeño', 'Castell', 'Bergamotta de Estivo', 'Santiaguera Dura', 'Agua temprana', 'Tendral de Aragon', 'Abugo', 'Amarillo de Invierno', 'Azucar Verde', 'Magallon', 'Verde de Verano'. 'Bella di Giugno' and 'Delbard Premiere' were in admixed in this group. Group two (G3.2) was composed by 63 local accessions and 12 reference cultivars, including Spanish cultivars 'Donguindo', 'Blanquilla' and cultivars selected in the 18th and 19th centuries originating from Southern Europe such as 'Cure', 'Roma' and 'Decana de Invierno'. Mostly these accessions grouped in cluster B in the UPGMA dendrogram. Group three (G3.3) contained only 20 local accessions and 47 reference cultivars, in which is currently based the world pear production such as 'Williams', 'Doyenne du Comice', 'Abbe Fetel', 'Conference' 'Passe Crassanne' and more recently released and widely cultivated varieties such as 'Max Red Barlett' and 'Packham's Triumph', with the majority belonging to cluster C in the UPGMA analysis. All three groups were analyzed a second time with structure to determine which one of them would unfold into two, given K = 4. G3.2 showed the highest ∆K value (K = 2, ∆K ≈ 10). This nested structure analysis allowed for a better characterization of each sub-population. All 75 accessions had a qI ≥ 0.8 and divided into a first group (G4.2) containing 52 local cultivars and 12 reference varieties and a second group (G4.3) encompassing 11 local cultivars and 2 reference varieties. The G4.3 group contained 'Bella early', 'Donguindo' and a local cultivar that matched with those found in Group D in the UPGMA clustering, whereas the G4.2 group was composed by reference varieties such as 'Blanquilla', 'Limon de verano' or 'De agua de invierno' resembling group C of the UPGMA clustering. For K = 3, 166 genotypes were characterized by qI values higher than 0.8 for the three groups (qI > 0.8) and the remaining 62 genotypes were considered 'admixed'. The classification of accessions with qI > 0.80 was 72% for G3.1, 71% for G3.2 and 76% for G3.3 (Table 4). For K = 4, the classification of accessions with qI > 0.80 was 83% for sub-group G4.1, 65% for G4.2, 100% for G4.3 and 75% for G4.4 ( Table 4). Most of the accessions in groups G4.1 and G4.4 for K = 4 correspond to accessions in group G3.3 and G3.1 for K = 3, respectively. All runs at K = 3 and 4 produced identical clustering solutions with very similar values of assignation probability for all the individuals between runs. Overall, the accession assignation to each group was robust and 73% of accessions had a qI ≥ 0.8 for K = 3 and 81% for K = 4 ( Table 4). The remaining 27% and 19%, which had an admixture level lower than 0.8, could suggest the presence of genetic relationships between the accessions studied and pear cultivars not included in the analysis. Finally, based on an AMOVA analysis, significant variance differences were observed between the three and four groups identified by the model-based clustering method. The distinction between the three and four populations was further confirmed by the analysis of the fixation index (F st ), a summary statistic quantifying the variation in allelic frequencies between groups. The overall F st value of 0.062 and 0.101 suggested a moderate but highly significant (p < 0.001) differentiation between groups for K = 3 and K = 4, respectively. Excluding the admixed individuals for each group, the differentiation between groups was higher 0.106 for K = 3 and 0.132 for K = 4. Genetic diversity indexes were calculated by group (Table 4). Nei's gene diversity varied from 0.74 (G3.3) to 0.81 (G3.1) for K = 3 and from 0.62 (G4.3) to 0.81 (G4.4), revealing a high proportion of heterozygous individuals in the three and four populations for K = 3 and K = 4. Interestingly, groups one and two showed a slightly higher genetic diversity than group three for K = 3, which assures a prospection missions' role as a genetic diversity source. Furthermore, a multivariate principal coordinate analysis (PCoA) performed in DARwin [57] confirmed genetic discrimination between the three and four groups. The three groups revealed by Structure were clearly distinguished with the PCoA plot of the two first components drawn in DARwin software, where the two-principal axis (one and two) explained 8.55% and 7.30% of the variation (Figure 3). Group G3.1 (old and local Spanish cultivars) and G3.2 (mostly Southern European cultivars) were displayed in the positive and negative part of the first axis, respectively. G3.3 (Williams' group) was found in the negative part of axis two.

Genetic Diversity of Pear Accessions Prospected from Mountain Areas of Aragon
The benefit and need of the genetic identification of pear germplasm banks, improving the management of collections by enabling the identification of duplicates, synonyms, and homonyms, as well as to understand the origins of local varieties, and to ascertain the importance of introgression, polyploidy, and hybridization in their evolution have been widely reported [15,28,30,31,33]. In the last decades, several regional germplasm collections have been promoted in order to face the erosion caused by the introduction of improved varieties in specialized orchards, maintain and preserve the autochthonous diversity. In this study, a total of 252 pear accessions (178 local accessions prospected in mountainous areas from Northeastern Spain and 74 reference cultivars) held at the CITA of Aragon germplasm collection were genotyped with 14 SSR markers (13 suggested by ECPGR). High levels of heterozygosity and a high effective number of alleles per locus were found in the present study, suggesting a singularity and potential

Genetic Diversity of Pear Accessions Prospected from Mountain Areas of Aragon
The benefit and need of the genetic identification of pear germplasm banks, improving the management of collections by enabling the identification of duplicates, synonyms, and homonyms, as well as to understand the origins of local varieties, and to ascertain the importance of introgression, polyploidy, and hybridization in their evolution have been widely reported [15,28,30,31,33]. In the last decades, several regional germplasm collections have been promoted in order to face the erosion caused by the introduction of improved varieties in specialized orchards, maintain and preserve the autochthonous diversity. In this study, a total of 252 pear accessions (178 local accessions prospected in mountainous areas from Northeastern Spain and 74 reference cultivars) held at the CITA of Aragon germplasm collection were genotyped with 14 SSR markers (13 suggested by ECPGR). High levels of heterozygosity and a high effective number of alleles per locus were found in the present study, suggesting a singularity and potential interest of this local pear accessions prospected from mountainous areas of Aragon. CITA pear germplasm collection showed a slightly higher average number of alleles per locus (17.93) than similar studies for Spanish cultivars (16.06, Ferreira dos Santos et al. [13]; 12.13, Miranda et al. [14]; 16.00, Urrestarazu et al. [28]). Although genetic variability parameters are difficult to compare between studies because of differences in the number of accessions, loci analyzed and the detection method used, we can conclude that our expected and observed heterozygosity values are similar to those shown in other Spanish germplasm collections. CITA pear collection showed an expected heterozygosity (He) of 0.82, similar to other studies such as in local pear cultivars from Northwestern (0.80, Ferreira dos Santos et al. [13]) and Northern Spain (0.83, Miranda et al. [14]); from Italy (0.82, Baccichet et al. [60]) and other Pyrus species (0.79, Bassil et al. [25]). Expected heterozygosity was slightly higher in the set of local material (0.82) than in the reference material (0.79), with a significant difference in their allele count (total and rare alleles) which supported the previously mentioned singularity of local material. Although the local material set had a bigger sample size, it also displayed more unique alleles (those only present in one accession) than the reference set, 37 vs. 25, respectively, as the allele richness difference between them. In addition, the overall mean value of PD (discrimination power) of 0.93 (ranging from 0.97 to 0.65) indicated that the loci are polymorphic enough in discriminating individuals. Negative F is values (outbreeding) were found to the set of local material (−0.027) and reference (−0.008) which confirmed a heterozygotes excess, consisting of a large number of loci. The majority of cultivated pears are diploid (2n = 2X = 34) but a few cultivars of P. communis L. are polyploids and it was also found in the CITA collection. An amount of 32.9% of accessions were triploids in the overall set of pear accession, considering triploids only accessions that displayed three alleles at least in two loci. This high value could indicate the selection by farmers because of the interest in extra-large fruit and leaves, although, generally, they are not useful for breeding because of meiotic disturbance and little good pollen [7]. The accessions' origin and ploidy identification methods make the ploidy level vary greatly among collections [13,31,60,61]. Using flow cytometry, Puskas et al. [61] identified 24.2% triploids among 124 German and Romanian accessions, 7.0% triploids were reported in a Swedish collection [30] and 27.1% in a collection of 118 unique pear accessions from Italy [60], whereas 36% were identified in a pear collection from Northwestern Spain with three alleles for at least one locus [13].

Singularities of Genetic Relations among Local and Reference Material at CITA Germplasm Bank
Forty-two accessions were duplicated since the SSR profiles were identical throughout the 14 loci. Between the synonyms identified in the reference set, the pear cultivars 'Tendral de Valencia' and 'Tendral de Aragon', and 'Malacara' and 'Magallon' shared the same SSR profiles, although these cultivars expressed some slightly different phenotypic traits (data not published). Therefore, although the identification of synonymous accessions is important in order to avoid redundancy in the collections, reducing their management costs and being able to distribute true-to-type cultivars to the nurseries, it must be taken into account the morphological characterization of these accessions. In fact, the occurrence of punctual mutations produced through grafting in fruit trees, genomic structural variations or even epigenetic modifications can generate phenotypic differences not distinguishable through SSR markers [35,62,63]. Finally, slight differences (just one allele at the CH02b10, EMPc117 and GD147 loci) were found between 'Williams' and 'Max Red Barlett', which is expected since 'Max Red Barlett' is a chimeric bud mutation of green 'Williams' [8].
Based on the UPGMA analysis, four clusters were observed, but no correspondence between the geographical origin of the accessions and their cluster replacement was found, which agrees with the traditional exchanges of plant material through grafting between the three provinces (Huesca, Zaragoza and Teruel) and the Pyrenees region with France [35]. Mostly, the different clusters could be interpreted by the economic importance of the accessions studied. Cluster A and B contained old cultivars that remain important in Spain at a regional level such as 'Castell', 'Magallon', 'Flor de Invierno' and 'Roma' with mostly local accessions (79%). Cluster C contained cultivars that account for >80% of pear production in Spain such as 'Abbe Fetel', 'Conference', 'Williams', 'Doyenne du Comice' and most of the currently cultivated varieties derived from them in recent breeding programs [14,15]. Local cultivars located in cluster C might indicate introgressants of foreign cultivars in this area. Cluster D contains also local cultivars related to the local cultivar 'Donguindo', which was described as a widely spread cultivar in Spain by Herrero and Iturrioz [20] and 'Bella Early'.

Genetic Structure and Differentiation
Bayesian inference has become a powerful tool to assess the genetic structure in tree species such as pear [13,28,29] and apple [35,50,63]. In this study, the use of SSRs in a structure analysis allowed the definition of three-populations (K = 3), defining most ancient local cultivars on one side with all the reference cultivars cultivated in Spain (G3.1), mostly Southern pear cultivars in G3.2 and a third group of currently cultivated pears in the world and derived from them in the recent pear breeding programs (G3.3). Three groups were also identified in a collection of 141 ancient local Spanish pear cultivars [14]. A second level of partitioning was suggested for K = 4. A similar genetic structure was found in a collection of local and cultivated pears from Italy [28] and local pear cultivars from Northwestern Spain, including Asian cultivars [26] and local varieties and wild related species collected from Mount Etna [36]. F st values indicated a high differentiation among subpopulations (0.106 for K = 3 and 0.132 for K = 4), higher than those reported in the literature [14,28,29,32], indicating that the material from mountain regions from Aragon is a genetic pool worthy of safeguarding and conservation. Our values were more similar to those found between two subpopulations (0.096) by Bennici et al. [37].
Another significant result of our study was the development of different Phyton codes that can save several hours of work and contribute to the automation of an analysis workflow during this type of studies. The automation of science is the increasingly demanding creation of software that connects all steps of an investigation and reduces both workload and spent time. Moreover, open-source code and platforms such as GitHub promote international and interdisciplinary collaboration between groups that can contribute to software development in a public science-friendly space. In addition, the use of a harmonized set of microsatellite DNA markers will allow future comparisons with other germplasm collections at a national and international level, promoting coordinated actions for the efficient conservation of pear genetic resources.
All novel pear cultivars must respond to the demands of growers and consumers. In general, the main objectives of pear breeding programs are to improve cultivar traits (fruit quality, etc.) and to drive sustainable eco-production systems reducing the need for chemical treatments [8]. These factors have stimulated the exploration of the old germplasm of pear collected throughout the world which could contribute to the development of disease and pest-resistant pear cultivars [29,32,37,61,64]. In this sense, more effort should be invested by researchers to find resistance genes to the main biotic adversities of pear: the fire blight bacterium (Erwinia amylovora), the European pear psylla (Cacopsylla pyri), which is the vector of the phytoplasma causing pear decline, the scab-causing fungi Venturia pyrina and the black spot fungus Stemphylium vesicarium.

Conclusions
The local pear germplasm of mountainous areas (Pyrenees and Iberian Cordillera) is valued since it is adapted to diverse ecosystems, and could represent a wide genetic diversity that can help mitigate the current genetic erosion within agricultural diversity. Likewise, these germplasms might provide resistance and a low sensitivity to the main pests and a good organoleptic quality. Genotypes analyzed in this study showed high levels of genetic diversity, a significant number of rare alleles and a low clonality rate which can be considered as evidence of a singularity and richness existing in local material which can still be found in abandoned plots. Moreover, 66.7% of local material was grouped in specific subgroups containing nearly no foreign cultivars, suggesting a singularity and potential interest of this type of material prospected in mountain areas from Aragon. The high levels of genetic diversity found in this research could constitute a good opportunity to select those cultivars with interesting agronomic traits in order to study them in depth and assess their potential to include them in pear breeding programs.

Supplementary Materials:
The following are available online at https://www.mdpi.com/article/10 .3390/agronomy11091778/s1, Table S1: information of the pear accessions used in this study: accession name, origin, location, region, genotype (unique or duplicated), coordinate location (latitude (Lat.), longitude (Long.) and altitude (Alt.)); group placement by structure analysis (when K = 3 and K = 4 were considered). Accessions with qI > 0.8 are indicated in bold letters for K = 3 and K = 4, Figure S1: exploration of K value for structure analysis of the 228 unique pear genotypes. Estimates of the rate of change of the slope of the log likelihood curve (∆K) calculated according to Evanno et al. [54] plotted against K, displaying a robust ∆K maximum at K = 3 and a less pronounced peak found at K = 4.