Molecular Characterization and Genetic Diversity of the Macaw Palm Ex Situ Germplasm Collection Revealed by Microsatellite Markers

Macaw palm (Acrocomia aculeata) is native to tropical forests in South America and highly abundant in Brazil. It is cited as a highly productive oleaginous palm tree presenting high potential for biodiesel production. The aim of this work was to characterize and study the genetic diversity of A. aculeata ex situ collections from different geographical states in Brazil using microsatellite (Simple Sequence Repeats, SSR) markers. A total of 192 accessions from 10 provenances were analyzed with 10 SSR, and variations were detected in allelic diversity, polymorphism, and heterozygosity in the collections. Three major groups of accessions were formed using PCoA—principal coordinate analysis, UPGMA—unweighted pair-group method with arithmetic mean, and Tocher. The Mantel test revealed a weak correlation (r = 0.07) between genetic and geographic distances among the provenances reaffirming the result of the grouping. Reduced average heterozygosity (Ho < 50%) per locus (or provenance) confirmed the predominance of endogamy (or inbreeding) in the germplasm collections as evidenced by positive inbreeding coefficient (F > 0) per locus (or per provenance). AMOVA—Analysis of Molecular Variance revealed higher (48.2%) genetic variation within population than among populations (36.5%). SSR are useful molecular markers in characterizing A. aculeata germplasm and could facilitate the process of identifying, grouping, and selecting genotypes. Present results could be used to formulate appropriate conservation strategies in the genebank.


Introduction
Macaw palm (Acrocomia aculeata (Jacq.)(Lodd.ex Mart.))-Arecaceae (2n = 2x = 30) is commonly known as macaúba in Brazil [1].This arborescent, spiny and single-stemmed palm is monoecious and self-compatible, and entomophily and anemophily forms of pollinations are reported [2].It bears a mixed reproductive system, with a predominance of outcrossing [3,4].The combination of the two pollination strategies with flexible reproductive systems suggests that A. aculeata can be highly successful in the colonization of new areas, as evidenced by the ample distribution of the species in the Neotropics.It is a very resilient palm and has abundant distribution in Brazil mainly in the regional States of Ceará, Minas Gerais, Mato Grosso, Mato Grosso do Sul, and São Paulo [2].
A. aculeata is little known globally, however, in recent years, it has raised interest due to its potential for social and economic use as an oil producer, considering that it is cited as one of the most important new sources of oil for biofuel [5,6].It produces fruits yielding up to 25 t/ha corresponding to about 4000 kg of oil.The solid waste is converted to charcoal and nutritious cakes that can be used to generate energy and feed livestock as well [7,8].The biochemical properties of the oil are proved to be suitable for the cosmetic industry and for biodiesel production [9][10][11].Moreover, this palm has environmental benefits as it can be fostered in impoverished soils and drought prevailing areas, which is a desirable trait for plants in order to rehabilitate degraded pastures or for agroforestry practices [12].Hence, A. aculeata can be a suitable option for production of biodiesel among the common food-based oleaginous plants such as soya bean, sunflower, and oil palms [13].
It is not commercially cultivated like the domesticated Arecaceae palms such as Elaeis guineensis, Cocus nucifera, and Enterpe oleraceae, which are important elements in the Brazilian savannah, and present great genetic diversity in natural populations [5,14,15].The palm's genetic diversity suffers by predatory extractivism, unsustainable land use, and climate change [16,17].Hence, genetic resource conservation and its sustainable use have paramount importance for future genetic improvement.A central point in its sustainable conservation is the knowledge of the genetic diversity present in genebank collection and potential exploitation of the genetic materials by breeding programs.The germplasm characterization and species genetic diversity could be effectively integrated by molecular analyses.
Therefore, we characterized and studied the genetic diversity of the macaw palm germplasm collections in a genebank using microsatellites (Simple Sequence Repeats-SSRs) [18,19].SSRs are well known molecular markers for their potentially high information content and versatility as molecular tools in germplasm characterization [20,21].They are often co-dominant, highly reproducible, frequent in most eukaryotes and are quite useful in various aspects of molecular genetic studies [22,23].Another aim was to study the distribution of the genetic diversity and in particular if a correlation exists between the genetic and the geographic distances and if distinct genetic groupings are formed among populations.These results will be useful in future conservation activities.

Plant Material and DNA Isolation
Leaf samples from 192 A. aculeata germplasm accessions were obtained from the ex situ plant collection, macaúba Active Genebank, situated in Araponga (S2040 01, W423115), State of Minas Gerais, Brazil.The accessions were originated from seeds collected in six regional states of the country (Figure 1) and germinated using a pre-germination protocol as described in patent INPI 014070005335 [24].The accessions represent 10 provenances having a total of 41 populations coded as BGP and 3-5 individuals were considered per population (Table 1).important new sources of oil for biofuel [5,6].It produces fruits yielding up to 25 t/ha corresponding to about 4000 kg of oil.The solid waste is converted to charcoal and nutritious cakes that can be used to generate energy and feed livestock as well [7,8].The biochemical properties of the oil are proved to be suitable for the cosmetic industry and for biodiesel production [9−11].Moreover, this palm has environmental benefits as it can be fostered in impoverished soils and drought prevailing areas, which is a desirable trait for plants in order to rehabilitate degraded pastures or for agroforestry practices [12].Hence, A. aculeata can be a suitable option for production of biodiesel among the common food-based oleaginous plants such as soya bean, sunflower, and oil palms [13].
It is not commercially cultivated like the domesticated Arecaceae palms such as Elaeis guineensis, Cocus nucifera, and Enterpe oleraceae, which are important elements in the Brazilian savannah, and present great genetic diversity in natural populations [5,14,15].The palm's genetic diversity suffers by predatory extractivism, unsustainable land use, and climate change [16,17].Hence, genetic resource conservation and its sustainable use have paramount importance for future genetic improvement.A central point in its sustainable conservation is the knowledge of the genetic diversity present in genebank collection and potential exploitation of the genetic materials by breeding programs.The germplasm characterization and species genetic diversity could be effectively integrated by molecular analyses.
Therefore, we characterized and studied the genetic diversity of the macaw palm germplasm collections in a genebank using microsatellites (Simple Sequence Repeats-SSRs) [18,19].SSRs are well known molecular markers for their potentially high information content and versatility as molecular tools in germplasm characterization [20,21].They are often co-dominant, highly reproducible, frequent in most eukaryotes and are quite useful in various aspects of molecular genetic studies [22,23].Another aim was to study the distribution of the genetic diversity and in particular if a correlation exists between the genetic and the geographic distances and if distinct genetic groupings are formed among populations.These results will be useful in future conservation activities.

Plant Material and DNA Isolation
Leaf samples from 192 A. aculeata germplasm accessions were obtained from the ex situ plant collection, macaúba Active Genebank, situated in Araponga (S2040 01, W423115), State of Minas Gerais, Brazil.The accessions were originated from seeds collected in six regional states of the country (Figure 1) and germinated using a pre-germination protocol as described in patent INPI 014070005335 [24].The accessions represent 10 provenances having a total of 41 populations coded as BGP and 3-5 individuals were considered per population (Table 1).Genomic DNA was isolated from leaflets following the CTAB (Cetyl Tri-methyl Ammonium Bromide) method [25] with modifications as described in [26].DNA samples were quantified with a Multiscan TM GO Microplate Spectrophotometer using absorbance at 260/280 nm.The integrity of the DNA samples was confirmed with 2% agarose gel electrophoresis and the working concentration was adjusted to 30 ng•µL −1 .

Condition of Polymerase Chain Reaction (PCR) and Electrophoresis
PCR was performed according to Nucci et al. [5], except for a lower primers concentration (0.15 µM each) and higher MgCl 2 concentration (4 mM for primers Aac04 and Aac12).The amplification cycles were also programmed according to Nucci et al. [5] in a thermal cycler (Applied Biosystem ® Verti ® cycler).Five of the primers (Aacu07, Aacu10, Aacu12, Aacu26, and Aacu30) were obtained from Nucci et al. [5] and three (Aacu38, Aacu45, and Aacu74) identified from Nucci [27].The other two markers (Aac04 and Aac12) were obtained from sets of SSR makers originally developed for Astrocaryum aculeatum and selected for A. aculeata [28] (Table 2).PCR products were denatured in a bromophenol blue dye solution at 95 • C for 5 min in the thermal cycler just before running in 6% polyacrylamide gel electrophoresis in 1xTBE (Tris-Borate-EDTA, Sigma-Aldrich Corporation, St. Louis, MO, USA) buffer solution at 60 W for 1 h and 40 min.

Polyacrylamide Gel Staining
After electrophoresis, the PCR products were visualized in polyacralamide gels stained with silver nitrate (AgNO 3 ) according to Brito et al. [29].The gels were immersed and agitated in several coloring steps in different solutions at different concentrations and durations until all allelic bands were totally visible for evaluation.Finally, the stained gels were allowed to dry out in the air and scanned for documentation and DNA fragments were scored as co-dominant alleles for data analyses.

Data Analyses
Allelic diversity, heterozygosity and polymorphism level of the SSR markers and for each provenance, were estimated from the co-dominant data.Allelic diversity was estimated by quantifying number of alleles per locus (A) and total number of alleles per provenance (N t ) [30].Average observed (H o ) and expected (H e ) heterozygosity per provenance was calculated by Equations ( 1) and (2), respectively.H o and H e (Equation (3)) are observed and expected heterozygosity per locus, respectively, in which a is number of loci and pi is frequency of the i th allele at j th locus [31].Inbreeding coefficient (F) per locus (or provenance) was estimated from H o and H e to determine the level of inbreeding (Equation ( 4)) [32].Polymorphic information content (PIC) per locus was calculated by Equation ( 5) [33], where pi and pj are frequencies of the i th allele at j th locus.Percentage of polymorphic loci (P) per provenance was estimated based on a criterion [34], at allelic frequency of less than 0.95 per locus.The criterion was designated herein as P95.
Principal coordinate analysis (PCoA) was performed for graphical dispersion of the accessions on bi-dimensional axes.PCoA was done from the genetic distance between pairs of accessions [35].Nei's genetic distance between pairs of provenances was computed with Equation ( 6), where I is Nei's genetic identity estimated by Equation ( 7), in which p ijx and p ijy are frequencies of the i th allele at j th locus of provenance x and y respectively and L stands for the number of loci [36].A dendogram was constructed from the genetic distance matrix between pairs of provenances using UPGMA-unweighted pair-group method with arithmetic mean.Tocher was also used to form homogenous groups of provenances from Nei's genetic distance matrix.
A Mantel test was applied using the Pearson correlation to test the hypothesis of relationships between genetic and geographic distances among A. aculeata accessions obtained from different regional states in Brazil.Analysis of Molecular Variance (AMOVA) was done to estimate the amount of genetic variation among and within the populations/or provenances.Φ-Statistics (Equations ( 8)-( 10)) were computed to test the null hypothesis ( σ2 a = σ2 b = σ2 c = 0), where σ2 a , σ2 b and σ2 c are genetic variations among provenances, among populations, and among individuals, respectively [37,38].The computed Φ-values were compared against values obtained under 1000 permutations for significance tests.Data analyses were performed using GENES [39] and GenAlex [35] statistical software programs. [ [ [Nei_D = −ln (I)] [ [ [

SSR Allelic Polymorphism, Heterozygosity and Informativeness
A total of 72 alleles were detected in the analysis of 192 A. aculeata accessions using ten SSR markers.A range of 4 (Aac12)-11 (Aacu12) alleles per locus were obtained with average of 7.2 alleles per locus (Table 2).In other study, using five of the SSRs (Aacu07, Aacu10, Aacu12, Aacu26, and Aacu30), a total of 30 alleles with average of 6 alleles per locus were reported from 43 accessions of A. aculeata from São Paulo and Minas Gerais populations [5].In the present study, those five markers detected 40 alleles with average of 8 alleles per locus (Table 2).Hence, the average number of alleles obtained per locus was higher here than in Nucci et al. [5].The wider coverage of the genetic materials analyzed from the six geographical states in this study led to yield more alleles per locus and consequently resulted in higher allelic diversity.
H o ranged from 0.06 (Aac12) to 0.61(Aac04) with average of 0.37 per locus; H e from 0.31(Aac12) to 0.72 (Aac04) with average of 0.54 per locus; while PIC varied from 0.27 (Aac12) to 0.68 (Aac04) with average of 0.50 per locus (Table 2).According to the criteria set by Bostein et al. [33], the SSR markers used in this study were informative and polymorphic to characterize the germplasm accessions (or populations) in A. aculeata.SSR markers are classified as informative when PIC > 0.50, reasonably informative (0.25 < PIC < 0.50) or less informative (PIC < 0.25).The number of alleles, H o , H e , and PIC obtained in this study was nearly similar to that of Nucci et al. [5] (average H o = 0.27; H e = 0.57; and PIC = 0.54), who first characterized SSR markers for A. aculeata.This could explain that the allelic frequencies of the loci have not changed significantly over generations.Although we could not trace precisely the germplasm analyzed by Nucci et al. [5], we may speculate that some similar accessions might be analyzed in the present study, probably from Minas Gerais and São Paulo.Besides, since macaw palm is still in the wild, under certain modes of random matting systems governing the rule of Hardy Weinberg Equilibrium in the absence of selection, mutation, and migration [32], allelic frequency in a given population could remain unchanged over generations.However, the low proportion of the average H o (<50%) could be caused by involvement of inbreeding or crossing between genetically related individuals.This was explained by a positive average inbreeding coefficient (F > 0) obtained per locus and per provenance confirming the presence of heterozygote deficiency in all the provenances studied (Table 2, Table 3).Although A. aculeata has a mixed mating system [2,4,15], its monoecious inflorescences could favor selfing or crossing between genetically related individuals that could reduce the proportion of heterozygotes in its progenies.According to Hartl and Clark [32], a positive inbreeding coefficient indicates predominance of inbreeding and values close to zero are expected under random mating, and negative values indicate an excess of heterozygote due to negative assortative (disassortative) mating or selection.

Genetic Diversity
Based on the ten SSR markers, N t detected per provenance varied from 28 (PB) to 57 (MS), hence, N a ranged from 2.8 to 5.7 respectively (Table 3).Consequently, the highest proportion of alleles (78%) was obtained in MS and the least (38%) in PB.The variation in the proportion of alleles (or allelic diversity) among the provenances attested the presence of genetic diversity of A. aculeata in the genebank.Besides, high average P (92%) was obtained per provenance with a range of 80%-100% based on the criterion proposed by Cole [34] (Table 3).The polymorphism level reported here was much higher than that obtained by Oliveira et al. [14] using Random Amplified Polymorphic DNA (RAPD) markers (P = 79%), analyzing Acrocomia aculeata from natural populations.This could be explained by the higher polymorphic level of SSRs compared with RAPD markers.Hence, the results attested the SSRs are highly polymorphic molecular markers to study genetic variability in A. aculeata.Moreover, SSRs are co-dominant while RAPD are dominant.
Genetically different groups of accessions were formed with different methods of grouping.Three distinct groups were formed using PCoA on the first two coordinates, explaining 31.5% and 20% of the total variability, respectively (Figure 2).Nt = total number of alleles per provenance; Pa = proportion of alleles per provenance; Na = average number of alleles per provenance; = average observed heterozygosity per provenance; = average expected heterozygosity per provenance; F = inbreeding coefficient; P (%) = percentage of polymorphic loci with a criterion (P95) mentioned in materials and methods.Provenances included: PA = Pará, PE = Pernambuco, PB = Paraiba, SP = São Paulo, NMG = North Minas Gerais, SMG = South Minas Gerais, CMG = Central Minas Gerais, EMG = East Minas Gerais, WMG = West Minas Gerais, and MS = Mato Grosso do Sul.

Genetic Diversity
Based on the ten SSR markers, Nt detected per provenance varied from 28 (PB) to 57 (MS), hence, Na ranged from 2.8 to 5.7 respectively (Table 3).Consequently, the highest proportion of alleles (78%) was obtained in MS and the least (38%) in PB.The variation in the proportion of alleles (or allelic diversity) among the provenances attested the presence of genetic diversity of A. aculeata in the genebank.Besides, high average P (92%) was obtained per provenance with a range of 80%-100% based on the criterion proposed by Cole [34] (Table 3).The polymorphism level reported here was much higher than that obtained by Oliveira et al. [14] using Random Amplified Polymorphic DNA (RAPD) markers (P = 79%), analyzing Acrocomia aculeata from natural populations.This could be explained by the higher polymorphic level of SSRs compared with RAPD markers.Hence, the results attested the SSRs are highly polymorphic molecular markers to study genetic variability in A. aculeata.Moreover, SSRs are co-dominant while RAPD are dominant.
Genetically different groups of accessions were formed with different methods of grouping.Three distinct groups were formed using PCoA on the first two coordinates, explaining 31.5% and 20% of the total variability, respectively (Figure 2).Collections from MG, represented the largest group in the study composed of five provenances (NMG, SMG, CMG, EMG, and WMG) (Table 1), clearly separated from the rest of the groups.Likewise, accessions from MS and SP formed the second group, while the third group composed of Collections from MG, represented the largest group in the study composed of five provenances (NMG, SMG, CMG, EMG, and WMG) (Table 1), clearly separated from the rest of the groups.Likewise, accessions from MS and SP formed the second group, while the third group composed of accessions from the regional States of PB and PE (Figure 2).The PCoA depicted the complete distinctness of MG collections, the genetic similarity between accessions from PA, SP, and MS and the genetic relatedness between PB and PE collections.Hence, formation of the different groups reiterated the presence of diverse genetic variability among the germplasm collections in the genebank.
Establishment of the three major groups using PCoA was consistent with the other two methods used in our analyses (UPGMA, Figure 3 and Tocher, Table 4).Using UPGMA, at 70% of dissimilarity, three hierarchical groups of A. aculeata provenances were established.Similar to the PCoA method, with UPGMA, MG provenances formed the first group and PA, PE, and PB established the second group, representing collections from the northern part of the country, while SP and MS formed the third distinct group, reaffirming the genetic relatedness between collections from the two neighboring geographical states (Figure 3).However, with the optimization method (Tocher), one additional group was formed due to the of PA provenance from the third group and formed the fourth independent group (sub-group) (Table 4).This is more likely because, unlike UPGMA, Tocher considers more similar groups (using least genetic distances) in each stage of group formation to establish new homogenous groups based on their genetic similarities [30].Hence, there is a possibility to establish an additional group (sub-group) with the method of Tocher at the last stage of grouping.This could be confirmed from the dendogram (Figure 3) that, at a low percentage of dissimilarity (50%-55%), using local criterion, PA remained as an independent and separate group, reaffirming its relative genetic distance from the two neighboring provenances (PB and PE).However, the least genetic distance (Table 4, D 3,4 = 0.50), between Groups 3 (PB and PE) and 4 (PA), elucidated the genetic relatedness among collections from the three geographical States (PA, PE, and PB), as evidenced by the PCoA (Figure 2) and UPGMA (Figure 3), which represented collections from the northern part of Brazil.
methods used in our analyses (UPGMA, Figure 3 and Tocher, Table 4).Using UPGMA, at 70% of dissimilarity, three hierarchical groups of A. aculeata provenances were established.Similar to the PCoA method, with UPGMA, MG provenances formed the first group and PA, PE, and PB established the second group, representing collections from the northern part of the country, while SP and MS formed the third distinct group, reaffirming the genetic relatedness between collections from the two neighboring geographical states (Figure 3).However, with the optimization method (Tocher), one additional group was formed due to the separation of PA provenance from the third group and formed the fourth independent group (sub-group) (Table 4).This is more likely because, unlike UPGMA, Tocher considers more similar groups (using least genetic distances) in each stage of group formation to establish new homogenous groups based on their genetic similarities [30].Hence, there is a possibility to establish an additional group (sub-group) with the method of Tocher at the last stage of grouping.This could be confirmed from the dendogram (Figure 3) that, at a low percentage of dissimilarity (50%-55%), using local criterion, PA remained as an independent and separate group, reaffirming its relative genetic distance from the two neighboring provenances (PB and PE).However, the least genetic distance (Table 4, D3,4 = 0.50), between Groups 3 (PB and PE) and 4 (PA), elucidated the genetic relatedness among collections from the three geographical States (PA, PE, and PB), as evidenced by the PCoA (Figure 2) and UPGMA (Figure 3), which represented collections from the northern part of Brazil.
Although the distinctness of the three major groups was confirmed by different methods, our results in relation to the genetic relatedness among the groups showed inconsistency with the hypothesis that collections from closer geographical regions are genetically more similar than distant ones and vice versa.This inconsistency primarily came from the results of the complete dissimilarity between the first (MG) and the second group (SP and MS), composed of provenances from neighboring geographical states (Figure 1).Secondly, at a higher percentage of dissimilarity (above 70%), using UPGMA, collections from MG State were genetically closer to that of the distant States of PB, PE, and PA than its neighbor States (SP and MS) (Figure 3).This scenario was also explained by the mean inter-group genetic distances using Tocher (Table 4).High average genetic distance (D1,2 = 1.05) was obtained between the first (MG) and the second group (MS and SP) than between the first and the third group (PE and PB, D1,3 = 0.64) and the first and the fourth group (PA, D1,4 = 0.77).Moreover, this result was confirmed with a Mantel test showing a weak correlation (r = 0.07) between genetic and geographic distances among the provenances studied (Figure 4).Similar results were also reported in other species, such as Italian red clover [40], globe artichoke [41], and Italian emmer wheat [42].Although the distinctness of the three major groups was confirmed by different methods, our results in relation to the genetic relatedness among the groups showed inconsistency with the hypothesis that collections from closer geographical regions are genetically more similar than distant ones and vice versa.This inconsistency primarily came from the results of the complete dissimilarity between the first (MG) and the second group (SP and MS), composed of provenances from neighboring geographical states (Figure 1).Secondly, at a higher percentage of dissimilarity (above 70%), using UPGMA, collections from MG State were genetically closer to that of the distant States of PB, PE, and PA than its neighbor States (SP and MS) (Figure 3).This scenario was also explained by the mean inter-group genetic distances using Tocher (Table 4).High average genetic distance (D 1,2 = 1.05) was obtained between the first (MG) and the second group (MS and SP) than between the first and the third group (PE and PB, D 1,3 = 0.64) and the first and the fourth group (PA, D 1,4 = 0.77).Moreover, this result was confirmed with a Mantel test showing a weak correlation (r = 0.07) between genetic and geographic distances among the provenances studied (Figure 4).Similar results were also reported in other species, such as Italian red clover [40], globe artichoke [41], and Italian emmer wheat [42].
Table 4. Grouping of ten Acrocomia aculeata provenances using the method of Tocher.With Tocher, homogeneous groups are formed, as it uses least genetic distance at each stage of group formation.Hence, mean intra-group distance is always less than mean inter-group distance.Minas Gerais; SMG = South Minas Gerais; CMG = Central Minas Gerais; EMG = East Minas Gerais; WMG = West Minas Gerais; MS = Mato Grosso do Sul.The first row numbers from 0-100 are percentages of dissimilarity; and the second row numbers from 0-0.94 are levels of fusion (average genetic distance) corresponding to percentage of dissimilarity.

Group
Table 4. Grouping of ten Acrocomia aculeata provenances using the method of Tocher.With Tocher, homogeneous groups are formed, as it uses least genetic distance at each stage of group formation.Hence, mean intra-group distance is always less than mean inter-group distance.

Analysis of Molecular Variance (AMOVA)
From AMOVA, more genetic variation within populations (48.2%) than among populations (36.5%) was obtained (Table 5).These variations were statistically significant (p < 0.01) in that the Φ −Statistics values obtained from estimated variances were higher than those values obtained under 1000 permutations.Previous studies in A. aculeata reported similar results, showing higher genetic variability within population than among populations [14,15].The higher genetic diversity within population is a result of the mixed mating system in A. aculeata and the involvement of metapopulation structure in natural populations [15].Metapopulation structure of species is caused by fragmentation of lands and creates spatially separated populations which interact at some level.However, this could favor genetic drift and restricted gene flow, which cause decrease in genetic diversity within a population especially in cross-pollinated species [43].However, in our case, the mixed mating nature of A. aculeata kept the genetic variability higher within population than among population.Similar works also reported higher genetic variation within populations in some palms and different tree species, such as Canarian endemic palm tree Phoenix canariensis [43], Populus tremuloides Michx [44], Digitalis minor [45], Piper hispidinervum [46], and Trichilia pallida [47].

Analysis of Molecular Variance (AMOVA)
From AMOVA, more genetic variation within populations (48.2%) than among populations (36.5%) was obtained (Table 5).These variations were statistically significant (p < 0.01) in that the Φ-Statistics values obtained from estimated variances were higher than those values obtained under 1000 permutations.Previous studies in A. aculeata reported similar results, showing higher genetic variability within population than among populations [14,15].The higher genetic diversity within population is a result of the mixed mating system in A. aculeata and the involvement of metapopulation structure in natural populations [15].Metapopulation structure of species is caused by fragmentation of lands and creates spatially separated populations which interact at some level.However, this could favor genetic drift and restricted gene flow, which cause decrease in genetic diversity within a population especially in cross-pollinated species [43].However, in our case, the mixed mating nature of A. aculeata kept the genetic variability higher within population than among population.Similar works also reported higher genetic variation within populations in some palms and different tree species, such as Canarian endemic palm tree Phoenix canariensis [43], Populus tremuloides Michx [44], Digitalis minor [45], Piper hispidinervum [46], and Trichilia pallida [47].

Conclusions
SSRs markers resulted in being very useful and efficient in characterizing A. aculeata germplasm.The SSR markers used are polymorphic among the A. aculeata accessions analyzed in this study and established different groups based on their genetic distances.This would facilitate the process of identifying, grouping and selecting genotypes during pre-breeding.Since A. aculeata is perennial and has a long cycle of growth, the use of SSR markers will accelerate the process of selecting genotypes at early stages, which will save time and resources.Moreover, the high genetic variations within population underlines the importance of having many genotypes in the genebank.The result will also help to minimize problems of replicates of genetic materials in the genebank and maintain genetic variability for sustainable use for future breeding programs.Further studies are necessary to investigate why genetic distance among populations did not couple with geographic distance, which will help in finding out the nature of gene flow and population structure of A. aculeata in Brazil.

Figure 1 .
Figure 1.Map shows the six geographical states in Brazil, where the original plant materials were obtained.MG = Minas Gerais; SP = São Paulo; MS = Mato Grosso do Sul; PA = Pará; PE = Pernambuco; PB = Paraiba.Araponga is a city in MG State, where the genebank is situated in which the experimental plant materials were collected.

Figure 1 .
Figure 1.Map shows the six geographical states in Brazil, where the original plant materials were obtained.MG = Minas Gerais; SP = São Paulo; MS = Mato Grosso do Sul; PA = Pará; PE = Pernambuco; PB = Paraiba.Araponga is a city in MG State, where the genebank is situated in which the experimental plant materials were collected.

Figure 4 .
Figure 4. Mantel test revealing weak correlation between genetic and geographical distances in Acrocomia aculeata accessions collected from different geographical states in Brazil.

Figure 4 .
Figure 4. Mantel test revealing weak correlation between genetic and geographical distances in Acrocomia aculeata accessions collected from different geographical states in Brazil.

Table 1 .
List of 41 Acrocomia aculeata populations, number of individuals, states (or provenances), and GPS coordinates.
= West Minas Gerais, and MS = Mato Grosso do Sul.** Coordinates are in degrees, minutes, and seconds for both the latitude (S = South) and longitude (W = West).

Table 2 .
Primer pairs of 10 SSR (Simple Sequence Repeats) markers used in the study along with average values obtained for different parameters per locus.

Table 3 .
Average estimates of genetic diversity parameters for ten Acrocomia aculeata provenances based on ten polymorphic SSR markers.