Genetic Structure and Pod Morphology of Inga edulis Cultivated vs. Wild Populations from the Peruvian Amazon

: Research Highlights : This study assesses the genetic diversity and structure of the ice-cream-bean ( Inga edulis Mart.; Fabaceae) in wild and cultivated populations from the Peruvian Amazon. This research also highlights the importance of protecting the biodiversity of the forest in the Peruvian Amazon, to preserve the genetic resources of species and allow further genetic improvement. Background and Objectives : Ice-cream-bean is one of the most commonly used species in the Amazon region for its fruits and for shading protection of other species (e.g., cocoa and coffee plantations). Comprehensive studies about the impact of domestication on this species’ genetic diversity are needed, to find the best conservation and improvement strategies. Materials and Methods : In the current study, the genetic structure and diversity were assessed by genotyping 259 trees, sampled in five wild and 22 cultivated I. edulis populations in the Peruvian Amazon, with microsatellite markers. Pod length was measured in wild and cultivated trees. Results : The average pod length in cultivated trees was significantly higher than that in wild trees. The expected genetic diversity and the average number of alleles was higher in the wild compared to the cultivated populations; thus, a loss of genetic diversity was confirmed in the cultivated populations. The cultivated trees in the Loreto region had the highest pod length and lowest allelic richness; nevertheless, the wild populations’ genetic structure was not clearly differentiated (significantly different) from that of the cultivated populations. Conclusions : A loss of genetic diversity was confirmed in the cultivated populations. The species could have been simultaneously domesticated in multiple locations, usually from local origin. The original I. edulis Amazonian germplasm should be maintained. Cultivated populations’ new germplasm influx from wild populations should be undertaken to increase genetic diversity.


Introduction
The Peruvian rainforest, due to its large and relatively continuous area of primary forest, is a worldwide biodiversity hotspot, which is suffering intense disturbance and deforestation from human exploitation and global change [1]. Amazonian inhabitants have used these natural resources through millennia and modified the natural environment, but how human management practices resulted in Amazonian forests' domestication is not known, in particular the germplasm source [2]. Moreover, the species' gene pool could have been reduced due to farmers' selection, thus strategies for genetic resource conservation and management are needed [3]. The Peruvian rainforest that remains a large and relatively continuous area of primary forest has major conservation value and is considered a priority in nearly all global biodiversity inventories, due to its biodiversity and the disturbance and deforestation rates [4,5]. Despite the international recognition of its major conservation value resulting from its uniqueness and global importance, the impacts of human activities throughout the region remain poorly understood [5]. Indeed, information about the species' genetic structure will assist in tree breeding programmes and conservation strategies, in particular in tropical trees, and, also, to study the implications of human impact on genetic resources [6,7].
The genus Inga (Fabaceae) comprises ca. 300 species of neotropical rainforest trees [8], but earlier studies suggest that the diversification of Inga species in Amazonia is recent, during the past 2-10 million years [9,10]. Ice-cream-bean (Inga edulis Mart.; Fabaceae) is a lowland rainforest light-demanding species, distributed in Colombia and tropical South America to the east of the Andes, extending from south to north-western Argentina ( Figure 1). The species' natural altitudinal range is mostly below 750 m a.s.l., though it has been occasionally identified at 1200 m in Roraima, Brazil [8]. The flowers are hermaphrodite and the pollination is provided by hawkmoths, bats and hummingbirds that may carry the pollen grains across large areas [11]. The species is diploid, 2n = 26 [12], and it is believed to be self-incompatible [13]. Fruiting occurs, in three year-old trees, as a long pod containing recalcitrant seeds, covered by a white, fleshy and slightly sweet edible sarcotesta [14]. Seed dispersal is performed by mammals and birds after eating the sarcotesta [11,15]. The species is widely cultivated for its edible fruit throughout South and Central America [16]. It is one of the most widely distributed and economically useful species in the whole Amazon region [14,17]. In Amazonian Peru, the fruits from cultivated trees may exceed 2 m in length and 5-6 cm in diameter. The wild trees have smaller pods than cultivated trees, rarely exceeding 50 cm in length [8]. This fast growing, symbiotic nitrogen-fixing tree, with umbrella-like canopy, is commonly used as a shade tree for cocoa, coffee, coca and tea plantations, in agroforestry systems and in "home grown" multi-purpose cultivation uses [8]. Amazonia was a major centre of crop domestication, with at least 83 native species containing populations domesticated to some degree, which expanded rapidly in the Mid-Holocene [18], including the Inga genus. The historical records for Inga edulis show that this species has been cultivated in Peru for its edible fruit since the pre-Colombian time and has become a commonly used tree species in the Amazon region [19]. The origin of the cultivated populations of I. edulis is uncertain [8]; however, León [17] and Clement [20] claimed West Amazonia as a probable origin. The species' genetic structure was not studied in detail, yet a reduction of allelic richness in cultivated relative to natural populations was found in I. edulis from the Peruvian Amazon [13,21]. Inga edulis has become a model species to evaluate the maintenance of genetic resources in agroforestry systems, and the putative genetic diversity reduction associated with domestication [22]. More recently, Cruz-Neto et al. [23], using microsatellite markers, observed high levels of genetic diversity within I. vera populations from the Atlantic forest of north-eastern Brazil. They concluded that cultivated populations compared to natural populations displayed reduced genetic diversity. Nevertheless, maintaining high levels of genetic variation within agroforestry trees is important for two main reasons: genetic variation in agricultural landscapes helps farmers to manage their inputs in more efficient ways and because they provide the ability for tree species to adjust to new environments, such as the shifting climate and weather conditions, allowing local adaptation and the migration of better-suited provenances along ecological gradients [3]. In addition, a stronger emphasis on the genetic quality of the trees planted by smallholders is needed, which means paying attention both to domestication and to the systems by which improved germplasm is delivered to farmers for the management of tree genetic resources and the livelihoods of rural communities in the tropics [24].
In the present study, the objectives were to (i) explore differences in pod length between wild trees and cultivated I. edulis trees from different geographical regions in the Peruvian Amazon; (ii) compare the wild and cultivated I. edulis populations' genetic structure using microsatellite markers; and (iii) determine if the cultivated populations' genetic structure reflects the different uses and cultivation practices throughout the species' use history, to help design practical measures to preserve I. edulis genetic resources.

Plant Material Sampling
The leaves and mature fruits from 259 individuals and 27 populations of I. edulis were sampled in the Peruvian Amazon, between 2009 and 2012. Each tree was identified according to the morphological aspects detailed by Rollo et al. [25] and, additionally, with the help of locals. Each sampled tree's young leaves were preserved in micro test-tubes with silica gel for further DNA extraction. One to ten mature pods were sampled per tree, from opposite sides and different heights of the crown, according to the availability of mature fruits on the tree. The pod length was measured from the base to the top of the pod apex. The mature fruits had the following phenological characteristics: seeds from creamy white to purple black up to vivipary; and the sarcotesta membranous creamy to generally flashy white, watery, soft and slightly sweet [8]. A total of 448 mature pods were measured in the 259 trees from the 27 I. edulis populations. We chose to study the pod length, as we can draw a null hypothesis based on the domestication process, since the trees were selected for their pod length (H0: Is the pod length of the domesticated trees higher than that of the wild trees?). Indeed, this trait has economic importance in the species domestication and agroforestry value.
Each sampled tree's geographical coordinates were recorded, and the minimum distance between any two trees was 200 m. Voucher specimens were kept in the Regional Herbarium of Ucayali IVITA-Pucallpa, Peru, with the code AR1-384. Each population was numbered from 1 to 27 and coded, e.g., 1 SRc, 23 RPw (the two capital letters are taken from the initial letters of the geographic origin of the population, e.g., San Ramón or River Pacaya, and the third letter meaning either c = cultivated-managed by humans or w = wild-growing spontaneously) ( Table 1). Cultivated trees were sampled in 22 geographically different populations, in home gardens and agricultural landscapes surrounding the urban areas in the Selva Central, Ucayali and Loreto regions. Wild trees were sampled in five geographically different populations, in lowland forests; four populations (23 RPw, 24 RSw, 26 MAw, 27 SDw) in protected natural areas in original forest vegetation and one (25 RUw) in secondary forest as described in Rollo et al. [25]. Details on the sampled populations are displayed in Table S1. Finally, no I. edulis wild trees were found in the Selva Central region's original vegetation, since the species is a lowland rainforest species and has been only occasionally recorded above 750 m [8].

DNA Extraction and Amplification
Total DNA was extracted from dried young leaves, using the Invitek, Invisorb ® Spin Plant Mini Kit following the manufacturer's instructions. Four microsatellite markers were used to genotype all the individuals, Pel5 [26] and Inga03, Inga08 and Inga33 [21]. For microsatellite detection, each forward primer was fluorescently labelled at the 5' end (6-FAM, NED or VIC). Amplification conditions were performed according to the conditions described by Rollo et al. [25]. The amplified products were separated on an ABI PRISM 310 Genetic Analyzer (Applied Biosystems, Foster City, CA, USA) and ran according to the manufacturer's protocol. Fragment sizes were determined using the ROX500 internal size standard and the global southern algorithm implemented by ABI PRISM GeneMapper ® software version 4.0 (Applied Biosystems).

Morphological Data
The pod length was measured in the 448 pods sampled from the 259 trees of the I. edulis populations from the Peruvian Amazon, and all the individual values were used in the following analysis (not averaged per tree). The pod lengths' normality (in both cultivated populations originated in Selva Central, Ucayali and Loreto regions and in wild populations) was tested using the Kolmogorov-Smirnov test, and all the groups displayed normal distribution except for the wild trees group. A non-parametric Kruskal-Wallis test (k independent samples) was performed to check for significant differences in the groups' average followed by the non-parametric Mann-Whitney U post-hoc test [27]. The statistical analyses were performed using the IBM SPSS v.22 statistics software.

Molecular Data
The estimated genetic diversity parameters included the average number of alleles per locus (A), the effective number of alleles (Ne), the number of private alleles (Pa), the mean allelic richness (RS) that uses a rarefaction index to consider differences in sample size [28], the observed (HO) and expected (HE) heterozygosities [29] and the fixation index FIS. Comparisons of the genetic diversity parameters between groups (i.e., cultivated and wild populations) were performed with 10,000 permutations. The estimates were made using FSTAT 2.9.3.2 [30] and GenAlEx v. 6.501 [31]. Genepop 4.3 software [32] was used to test the heterozygote deficiency for each population and to compute the average frequency of null alleles.
Genetic variation at the level of populations and groups (i.e., cultivated and wild populations) was investigated with a hierarchical analysis of molecular variance (AMOVA), which partitions the total variance into covariance components due to inter-group differences, inter-populations within groups differences and inter-population differences, in the Arlequin software [33]. Levels of significance were determined by computing 1000 random permutation replicates.
A Bayesian clustering method was performed in the STRUCTURE v.2.3.4. software [34] to infer population genetic structure. The number of genetic clusters (K) was estimated and the individuals sampled from cultivated and wild populations were fractionally assigned to the inferred groups. Afterwards, the allele frequencies were estimated in each of the K groups as well as the proportion of the genome derived from each group for each tree. We applied the model allowing population admixture and correlated allele frequency [34]. However, due to the weak population structure found in the I. edulis populations, we used a model that incorporated a priori sampling location information [35], i.e., a "locprior" model. This improved model has the advantage of allowing cryptic structures to be detected at a lower level of divergence and does not bias towards detecting structures spuriously when none is present, helpful in situations when the standard structure models do not provide a clear signal of structure [35]. Two groups of populations were used as priors, i.e., cultivated and wild populations (see Table 1). The alternative ancestry prior 1/K was used due to unbalanced population sampling [36]. The number of clusters (K) was set from one through twenty-seven and the simulation was run ten times at each K value to confirm the repeatability of the results. Each run comprised a burn-in period of 25,000, followed by 100,000 Markov chain Monte Carlo (MCMC) steps. We used the ΔK distribution statistic of Evanno et al. [37] to determine the most appropriate number of genetic clusters. Hence, the STRUCTURE output data were parsed using the STRUCTURE HARVESTER [38] to determine the optimal K value following the method referred to above. Alignment of cluster assignments across replicate analyses was then conducted in CLUMPP 1.1.2 [39] and subsequently visualized using STRUCTURE PLOT [40]. The results of Bayesian clustering were further mapped in the ArcGIS ® Desktop version 10.2 software [41].

Pod Length
From a total of 259 individual trees, 448 fruits were collected and measured: 329 and 119 fruits from 197 cultivated trees and 62 wild trees, respectively. The longest pod (148 cm) was found in 18 EDc, a cultivated population in the Loreto region around El Dorado village (Table S2). The Kruskal-Wallis non-parametric test, using the four groups of populations (regions), showed significant group differences for pod length ( Figure 2). The Mann-Whitney U post-hoc test produced three homogeneous groups, which indicated that the pod length average (78 cm) in the Selva Central region was not significantly different (p < 0.05) from the Ucayali region's 80 cm long average, and both values were significantly different from the Loreto value (90 cm) ( Figure 2).

Figure 2.
Pod length comparison among the cultivated and wild populations. From Selva Central, 94 mature pods were collected from 45 trees from 5 populations; from Ucayali, 120 mature pods were collected from 72 trees from 7 populations; and from Loreto, 115 mature pods were collected from 80 trees from 10 populations. In the wild populations, 119 mature pods were collected from 62 trees from 5 populations. Significantly different means are followed by different letters (p < 0.05).
The Selva Central and Ucayali regions could be one group, considering the cultivated trees' average pod length. The Loreto region's cultivated trees produced the highest pod length average. The average pod length of 83 ± 1.17 cm (mean ± standard error) in the cultivated trees was significantly higher than the 39 ± 0.95 cm pod length average in the wild trees.

Genetic Diversity
We identified a total of 71 alleles using the four microsatellite markers after genotyping all the individuals from the 27 populations. The average A was 5.7, the RS was 4.4, the HO was 0.59, and HE was 0.69. The overall inbreeding coefficient FIS was 0.11 (Table 2). Table 2. Summary of the genetic diversity of the 27 I. edulis populations. Sample size (N), average number of alleles per locus (A), allelic richness (Rs), effective number of alleles (Ne), expected heterozygosity (HE), observed heterozygosity (HO) and fixation index (FIS) averaged over loci. Sig. refers to the significance resulting from the heterozygote deficiency test (a conservative α value for the test of at least p < 0.01 was used, due to the low number of individuals per population: NS, not significant, ** p < 0.01 and *** p < 0.001, significant). F-null refers to the average estimate of the null frequency over the loci. Standard errors in brackets.

I. edulis
Population  (27). The cultivated populations 8 CTc and 13 BRc also had a high number of sampled individuals, which was also reflected in the A and Ne parameters. The average expected genetic diversity is slightly higher in the wild compared to the cultivated populations, 0.72 and 0.69, respectively. The average number of alleles is much higher in the wild (7.4) than in the group of cultivated populations (5.3), but when we consider the allelic richness and effective number of alleles, these differences are reduced ( Table 2).
Six cultivated populations out of 22 had significant heterozygote deficiency, but this parameter was not significant in the wild populations ( Table 2). The Selva Central group of cultivated populations lacked populations with significant inbreeding coefficient, and the Loreto and the Ucayali regions had only two populations and more than half of the populations with heterozygote deficiency, respectively. Positive and significant FIS values mirror differences between observed and expected heterozygosity, due to putative heterozygosity loss because of non-random mating of the parents. We should also emphasize that the cultivated populations are an assembly of individuals and we should not expect them to be in Hardy-Weinberg Equilibrium. Additionally, the presence of null alleles is an unlikely explanation since the estimated frequency is very low across populations (Table 2). Nevertheless, when we compared the overall inbreeding coefficient from the wild with the cultivated populations, no significant differences were found between them. Conversely, the allelic richness and the observed heterozygosity were significantly lower in the cultivated populations (Table S3).
Seven private alleles (Pa) were identified in three wild populations, the highest Pa per population was found in the 26 MAw population (3) and two in both 23 RPw and 25 RUw. Only one Pa was identified in four different I. edulis cultivated populations (2 VRc, 9 CVc, 14 JHc and 22 MZc). The locus Inga08 had the highest Pa (7 across all populations) and Inga33 and Pel5 only one (data not shown).
The cultivated populations possessed 13 exclusive alleles compared to the wild ones, and only two had a frequency lower than 5%. The regions with the highest number of cultivated populations with exclusive alleles was Selva Central, 80%, followed by Ucayali, 60%, and the Loreto region had the lowest number of populations with those alleles (40%) (data not shown).

Population Structure
The population genetic structure was investigated by a hierarchical analysis of molecular variance (AMOVA), which revealed that most of the genetic diversity existed within populations (92%). The differentiation between the cultivated and wild group populations (ΦCT = 0.010) was low (~1%), and not significant (p < 0.0958), and the variation among populations within groups was appreciable, ca. 7% (ΦSC = 0.073), and significant (p < 0.0001) ( Table 3). Table 3. Hierarchical AMOVA between the cultivated and wild population groups, among populations within the cultivated and wild population groups and within I. edulis. populations. df = degrees of freedom; SS = sum of squared deviation; Φ statistics = fixation indexes; P = level of probability of obtaining a more extreme component estimate by chance alone. The significance of the variance components were tested by a permutation test. The I. edulis genetic structure was further estimated using a Bayesian approach. Using the method of Evanno et al. [37], the most appropriate number of genetic clusters (K) is 2, referred to as red and green (Figure 3; Figure 4 and Figure S1). The red cluster was predominant in the wild populations and in the cultivated populations in the northernmost region (Loreto). Conversely, the green cluster was predominant in the southernmost region (Selva Central). The Ucayali region displayed a mixture of both types of cultivated populations, probably a mixture from the southern and the northern regions ( Figure 3; Figure 4).  For K = 2, the highest proportion of red cluster was observed in cultivated populations along the navigable river watersheds in the Loreto and Ucayali regions (e.g., 6 ATc, 11 YAc, 14 JHc, 15 LAc, 16 NAc, 19 MAc, 20 SCc, 21 INc and 22 MZc). Moreover, the green cluster was found to be prevalent in populations cultivated on the Andean foothills and "terra firme" in the Selva Central and Ucayali regions (e.g., 2 VRc, 3 PIc, 4 SAc, 7 VHc and 10 ARc) (Figure 4).

Influence of Domestication on Fruit Length
Although the history of cultivation of I. edulis is not well documented, a crop domestication study suggested that humans have domesticated this species over a considerable period of time [20]. Indeed, Amazonia is a major world centre of plant domestication, where selection began in the Late Pleistocene to Early Holocene in peripheral parts of the basin [18]. The origin of cultivated I. edulis trees is uncertain, though probably Amazonian [8]; nevertheless, some authors have suggested it was started by European settlers in west Amazonia [17,20]. Since this tree was cultivated mainly for fruit production, domestication is expected to increase pod length [8,16,20]. To our knowledge, no study has been made comparing both types of populations, cultivated vs. wild, considering this morphologic characteristic (pod length). Certainly, the higher values found in cultivated trees compared to the wild trees clearly support the domestication of I. edulis for food supply. Plant domestication is a long-term process in which natural selection interacts with human selection, driving changes that improve usefulness to humans and adaptations to domesticated landscapes [18].
In the current study, maximum pod length in the wild and the cultivated populations was 73 and 148 cm, respectively, in agreement with Pennington [8]. This author reported that wild trees' pods rarely exceed 50 cm and cultivated trees could, exceptionally, produce pods exceeding 2 m. The average pod length was higher in the Loreto region's cultivated trees, compared to Ucayali and Selva Central regions; the smallest fruits were observed in Selva Central. The species' different cultivation and uses, and differences in ecological conditions, could explain these results. Indeed, in Selva Central, the species was mainly used to shade coffee or cocoa rather than to produce large fruits [42,43]. Farmers were focused mainly on the cash crop yield, rather than the fruit yield provided by shade trees. Additionally, large fruits could be more attractive to uninvited guests, which could then cause damage to the cash crop due to Inga fruit collection. Another supporting argument is the wild I. edulis local name among the Selva Central region inhabitants. The local name for the cultivated I. edulis in Selva Central is "pacay soga", whereas in the Ucayali and Loreto regions, the name "Guaba" is used for the cultivated type and "guabilla" or "guabilla del monte" for the wild tree (A. Rollo, pers. communication). The difference in local names in these regions might be related to the species abundance, both in the wild and cultivated form. Locals in Selva Central informed us that I. edulis was hard to find in the surrounding wild vegetation; indeed, the species is rarely seen above 750 m [8]. We were also unable to find and sample wild trees in the Selva Central region.

Genetic Diversity of Wild and Cultivated Populations of I. edulis in the Peruvian Amazon
The overall HE (0.69) was slightly higher than the HO (0.59), inducing an overall inbreeding coefficient index of 11%. In a meta-study for microsatellites and outcrossing species, the author showed a similar value of HE (0.65), but slightly higher HO (0.63) [44].
The results from our study further indicate that all the genetic diversity estimates were lower in the case of the cultivated populations compared to the wild ones, as well as the average inbreeding coefficient. These results confirm a loss of genetic diversity in the cultivated populations, in agreement with the studies by Hollingsworth et al. [21] and Dawson et al. [13] on the same species. These authors concluded that cultivated stands possessed lower total allelic richness than neighbouring wild populations, but the expected genetic diversity remained unchanged, indicating that the process of domestication reduced the number of alleles. Both authors stated that the wild plant material they studied was collected from nearby cultivated populations, in old-growth, primary forest, but due to (i) the long history of the species' use, (ii) the habits of slash-and-burn in primary forest and (iii) gene flow among nearby stands, the wildness of the trees could be questioned [2,18]. Nevertheless, Dawson et al. [13] observed marked differences in the haplotype composition between natural and cultivated stands. In our case, the wild material was sampled in natural vegetation in protected areas and secondary forest, and unless extensive long-distance gene flow existed, no ambiguities in distinguishing both types existed. In addition, the results regarding pod length clearly distinguish the wild from cultivated material. We found an important effect of the domestication on the natural resources of a species, which is an expected phenomenon when a species is used by humans [23,45]. In some cases, the expected heterozygosity might be higher or similar in the cultivated population than that displayed in the wild population, due to a putative "melting pot" phenomenon in the former populations (introduced alleles from different origins). Nevertheless, the allelic richness and observed heterozygosity found, in our study, in the cultivated populations was lower than in the wild ones, indicating the loss of rare alleles during selection as observed by other authors [23,46]. Some cultivated populations from the current study had a significant heterozygote deficit, particularly in the Ucayali region. The consequences of the inbreeding effect in fruit trees, such as I. edulis, might impact fruit production due to inbreeding depression, which would directly impact farmers' yield [15,23]. The fact that the species is self-incompatible [13] excludes the possibility of heterozygote deficiency due to self-pollination; probably, related trees were introduced in these populations and the value reflect biparental inbreeding.

Population Structure
The genetic variance partition in our study (92% of the variance was observed within populations and a low genetic structure, 7%, was detected among populations) is usual in outcrossing tropical forest tree species with high levels of gene flow [6]. The hierarchical AMOVA showed that the Φct between wild and cultivated populations was 1%, yet not significant.
Dawson et al. [13] found low genetic structure, similarly to our results, in I. edulis natural and cultivated stands, with nuclear but not chloroplast microsatellite data. Nevertheless, the authors used only two chloroplast loci, which might have biased the results, since the smaller effective population size of the chloroplast genome makes it more susceptible to genetic drift and species differentiation [47]. Conversely, a high genetic structure was found between natural and cultivated stands of I. vera, and the authors reasoned that the cultivated populations were derived from seeds coming from different mother trees, but a different geographic origin was also possible [23].
In the current study, for K = 2 ( Figure 3; Figure 4), the wild populations displayed identical composition, with the predominant red cluster. The uniform composition of the studied wild material could be due to the relatively recent speciation of the genus [9] and, also, to regional wild populations sampling [8]. The red cluster prevailed in the northern cultivated populations: the Loreto region and along the Ucayali river in the Ucayali region, which could express large population centres occupying the margins of main rivers with extensive trade networks [43]. A tiny green genetic cluster is present in the wild populations and in the cultivated populations of the Loreto region. Conversely, the green cluster is relevant in Selva Central and Ucayali cultivated populations. The green cluster increases in the sub-Andean Selva Central region and in the higher elevated sites from the Ucayali region, where the I. edulis trees were traditionally used on coca fields before the Conquest, and for shading protection of cocoa, coffee and tea plantations after the Spanish settlement [43]. In the Loreto region, the cultivated population 13 BRc possessed a higher proportion of green cluster than others from this region. This population is near Bretaña village, which was named after the Europeans, who arrived from the Andes and the coastal regions of Peru, during the rubber boom at the end of the 19th century [48].
Iquitos, in the Loreto region, is referred to as a crop domestication centre in Amazonia, created as populations expanded, and providing strong evidence that pre-conquest human populations had intensively transformed their plant resources [18,20]. Indeed, the I. edulis domestication was probably achieved by selecting from the local wild population and possibly started in the Loreto region, since the genetic structure of the cultivated populations from this region do not differ much from the wild ones. Moreover, they have bigger pods and lower allelic richness than the other cultivated populations, which could indicate that the selection intensity was higher here. Indeed, some authors claim that the possible origin of I. edulis domestication was in this region, which was also the location for the domestication of other species [18,22]. Additionally, the crop is probably recently domesticated, since when a crop is in an initial process of domestication no clear genetic structuring occurs, as in Brazil nut [22]. The genetic differentiation between the wild and cultivated populations is low and with admixture; the cultivated populations seem to originate from the wild ones. Conversely, the results of Dawson et al. [13] on chloroplast haplotype composition displayed a completely different pattern between natural and cultivated populations. The authors explained these results by a non-local origin of the I. edulis cultivated material. Our results do not support this theory. Instead, we inferred that the cultivated populations had a local germplasm origin, yet without representative sampling, which is expected, since a few trees were probably selected in nearby wild populations. Indeed, a possible genetic drift effect (change in the frequency of the allele in a population due to random sampling of organisms) in the cultivated populations is expected.

Practical Measures to Maintain I. edulis Genetic Resources
The I. edulis germplasm management should focus on both the wild and the cultivated stands. In the case of wild material, the protection of the original Amazonian vegetation remnants is key to maintaining the species' genetic resources in the region. In modern-day Amazonia, increasing deforestation for the establishment of pastures has become a global concern due to its impacts on biodiversity [43]. Considering the cultivated stands, the villages and indigenous settlements are the units of interest because they are the domesticated plant population keepers. Consequently, the fate of the village will determine the maintenance of the crop genetic resources. For example, the post-Colombian population collapse that resulted in a loss of village units and corresponded to the loss in human numbers (ca. 90-95% population decline), was quickly reflected in the loss of crop diversity [20,43]. The cultivated populations with low genetic diversity and/or high inbreeding estimates (e.g., 7 VHc, 10 ARc, 12 SSc, 14 JHc, 17 EPc, 18 EDc, 19 MAc and 21 INc) should be supported with new germplasm sources (from wild populations) to eliminate the risk of biparental inbreeding and diversity loss, which might be reflected in the future crop value (inbreeding depression, flower abortion, and crop yield failure).

Conclusions
The results of the current study on I. edulis show a significantly higher value for average pod length in cultivated trees than in wild trees. The wide-scale infusion from wild stands into farms could negatively affect fruit size and weaken domestication efforts over time. Additionally, the Loreto region displayed the highest average pod length, as well as having populations with a lower allelic richness when compared to the cultivated populations of other regions.
The cultivated stands in the Selva Central and Ucayali region could, additionally, be a germplasm material source, and thus could provide a long-term safeguard for on-farm conservation since the Loreto region possesses the populations with the lowest values of allelic richness. Hybridization programmes using such a germplasm source and local wild material with backward selection could help increase the crop yield and genetic diversity in the cultivated populations. Additionally, new selection should consider the needs of modern agriculture and forest management practices, as well as global warming.
Supplementary Materials: The following are available online at www.mdpi.com/1999-4907/11/6/655/s1, Figure  S1. Representation of the ΔK distribution statistic. The method of Evanno et al. [37] was used to determine the most appropriate number of genetic clusters (K); Table S1. Inga edulis 27 populations (cultivated and wild), region (Geographic region), sampling region (Site), population code (Pop.) and sampling details (Sampling description); Table S2. Average fruit length per population. The minimal and maximal pod length values (in brackets) and the number of pods (Nl) per population, in a total of 448 pods. Table S3. Diversity parameters comparison between cultivated and wild populations. Allelic richness (RS), observed heterozygosity (HO) and inbreeding coefficient (FIS). P = probability values for differences between groups for two-sided t-test after 1000 permutations. * = significant test (p < 0.05).