Phenotypic Variability Assessment of Sugarcane Germplasm ( Saccharum officinarum L . ) and Extraction of an Applied Mini-Core Collection

The sugarcane germplasm collection located in Khuzestan, Iran, is one of the most important genetic resources with valuable accessions from different continents. However, this collection has not been properly used by breeders due to the extremely large population. The aim of this study was to phenotypically characterize the sugarcane germplasm and form a mini-core collection. Hence, 13 morphological traits were evaluated on 253 accessions. The primary germplasm was grouped into 10 clusters based on partial repeated bisection (RB) data, where the smallest cluster contained three accessions from two breeding centres (USA and Cuba). Using principal component analysis (PCA), the first two PCs (principal component) explained 59.5% of the total variation. A mini-core of 21 accessions was created by using the maximization strategy, with a low mean difference percentage (MD = 2.31%) and large coincidence rate of range (CR = 93.96%). The sugarcane mini-core represented the major diversity of the primary collection. The means and medians between the mini-core and the primary collection did not differ significantly. Accessions with high sugar and cane yield, originating from the USA, Cuba, Argentina, and South Africa, were in the mini-core collection. In this paper, we established, for the first time, an applied mini-core collection in sugarcane germplasm. The mini-core collection, as a breeding collection, is a highly suitable, manageable, and efficient subset for the enhanced use of sugarcane germplasm in breeding programs.


Introduction
Sugarcane belongs to the genus Saccharum L., of the tribe Andropogoneae in the family of Poaceae (Gramineae), which is grown widely in tropical and subtropical regions across the world [1].
In Iran, there is a Cane Development and Sidelong Industrial Research and Education Institute located in Khuzestan province that was officially founded in 1981.The sugarcane germplasm collection located at this institute includes approximately 340 accessions from Saccharum officinarum L. and interspecific hybrids.This set has been collected from different continents, such as the USA, Cuba, Brazil, Argentina, South Africa, India, and Australia, and has been maintained over the years in Iran.It seems that present accessions are well adapted to the climate of Iran, and the collection probably contains genotypes carrying genes for cold tolerance.The sugarcane collection in Khuzestan (primary collection), which has been selected manually from whole accessions across the world, is actually a core collection (CC).Due to the large size of the collection (340 accessions), there are problems concerning the management and use of this genetic resource.Hence, a diverse mini-core collection or breeding collection could be a beneficial resource for sugarcane breeders [2].Formation of a reduced subset of sugarcane genetic resources in Khuzestan is very important for assessing the population structure and diversity and for identifying sources of variation in breeding programs.So far, there has been no research to assess the morphological diversity of the entire sugarcane germplasm in Iran, indicating the necessity of a comprehensive evaluation of the accessions in the mentioned collection.
Despite the diversity of resources and genotypes, one of the major problems faced during plant breeding programs is insufficient use of the whole germplasm, because it is difficult to manage and utilize a large number of genotypes in germplasm collections.It is accepted that recognition and use of diversity in germplasm is very important prior to starting a breeding program [3].There is a growing body of literature that recognizes the importance of constructing a core or mini-core collection for conservation of novel variations in genetic resources [2,4,5].A core collection is defined as a limited set of accessions representing the minimum of repetition and the maximum of the genetic diversity of a crop species and its wild relatives [6].Similar to the definition of a core collection [6], a mini-core collection is defined as a reduced subset of a core collection with minimum repetitiveness and maximum genetic diversity of the source germplasm [7].Previous studies have reported that a core collection would be better if it contains 10% of the entire collection and a mini-core collection contains 10% of the core collection or 1% of the entire collection [8].A mini-core collection is important in order to identify sources of resistance to biotic and/or abiotic stresses and agronomic and nutritional traits.In sugarcane, it is arguable that the accessions with high sucrose content can be selected for sugar production.More importantly, such a subset of germplasm is also useful for recognition of superior energy cane cultivars, i.e., with high biomass and lignocellulosic compounds and fiber, which are appropriate for lignocellulosic ethanol production [9].Additionally, a core or mini-core collection plays an important role in assessing allelic richness and association genetics, especially for identifying important QTLs (quantitative trait locus) controlling desirable traits, such as yield and fiber content.
Over the years, phenotypic traits have played an important role in genetic diversity analysis and in discriminating genotypes in order to form a core and/or mini-core collection [10][11][12].Similarly, previous studies have reported establishing a core collection of germplasm in sugarcane based on morphological traits [2,[13][14][15].Phenotypic characterization of a germplasm collection is an increasingly important step in plant breeding programs since breeders can evaluate variations and select high performance accessions more efficiently.Although several studies have been carried out on core collection construction in sugarcane [2], no single study has established the breeding collection (BC), actually named mini-core collection, of sugarcane germplasm.
The goals of our work were (1) comprehensive evaluation of sugarcane genetic resources in Iran and measuring important phenotypic traits on four dates over two growing seasons and (2) to create a mini-core collection or breeding collection that captures most of the phenotypic diversity of the primary collection.The present research attempts, for the first time, to establish a mini-core collection in sugarcane germplasm.This study provides an exciting opportunity to advance our knowledge of germplasm characterization for discovering new sources of variations in order to enhance the use of germplasm for cultivar improvement.

Plant Materials
The materials comprised 253 sugarcane accessions collected from different continents.All accessions in the collection belong to the Saccharum officinarum L. and all of them are hybrids.
The accessions' names have been derived from their origin or breeding centre.For example, C85-102 is a variety of Cuban sugarcane, and the breeding centre of the FGO2-250 variety is France.These accessions were evaluated in a randomized complete block design with four replications at the experimental station of the Cane Development and Sidelong Industrial Research and Education Institute of Khuzestan, Iran in two successive seasons (2012 and 2013).The plot size was 100 m 2 including five rows, 20 m long with a between-row spacing of 1 m.

Measurements of Phenotypic Traits
Sugarcane planting was done in June 2010, and morphological traits and sugar yield factors were measured during two separate seasons.The first season was January 2012, and coincided with 18-month-old plants.The second season was comprised of three successive time points, namely November 2012, January 2013, and February 2013, and coincided with ratoon plants.The plan was to record data in the abovementioned time points in order to gain a detailed understanding of growth periods, and to assess the difference between them and, consequently, to recognize critical stages for gaining the optimum yield.Thirteen traits (Table 1) were measured for 253 accessions grown in the field, across two separate seasons.First, the whole collection (comprising 340 accessions) was chosen, but because of the expected difficulty of obtaining phenotypic traits for all accessions, and also because of missing data in sugar yield measurements such as Brix% or sugar%, measurements were restricted to 253 accessions.Stalk height (cm), internode length (cm), and stalk diameter (mm) were measured on five stalks at harvest according to Dillewiijn [16] and were averaged subsequently.The canes were cut and the average weight of 10 canes was recorded.After peeling the selected canes, they were crushed in a mill to calculate the percentage and weight of juice.The Brix value (total soluble solids) was determined with a refractometer, and Pol (apparent sucrose) in sugarcane juice was determined with saccharimeter (a saccharimeter is simply a polarimeter specially designed for measuring the polarization of sugars).Purity is the percentage of pure sucrose in dry matter that was calculated as [Pol/Brix] × 100.In order to calculate the recoverable sucrose (%) (RS), the following formula was used: Cane yield was estimated as the number of stalk (m 2 ) × stalk weight.For calculation of sugar yield (kg sugar ha −1 ), RS was multiplied with cane yield per hectare [17].

Descriptive Statistics
Data management and analysis were carried out using SAS [18].Descriptive statistics including mean, median, range, and variance were generated for all variables for both the primary germplasm and the mini-core collection.A normality test was performed with the Shapiro-Wilk W method.In this study, the analysis of variance (ANOVA) was conducted and the experimental design was considered as a randomized complete block design with four replications.Each time point was mentioned as a replication.

Correlation and Regression Analysis
Pairwise correlations were obtained by using the Pearson method in Proc Corr of SAS [18].
Stepwise multiple linear regression analysis was carried out using SAS program using Proc Reg [18].In this analysis, recoverable sucrose (RS) was considered as dependent variable and the remaining traits as independent variables.Prior to analyzing the regression, in order to identify important characteristics affecting the amount of white sugar (RS), and moreover to avoid the loss of fit due to multicollinearity phenomenon (correlation between predictors that one can be linearly predicted from the others with a substantial degree of accuracy), the level of multicollinearity was estimated with the most widely used criterion of the Variance Inflation Factor (VIF) as suggested by Hair et al. [19].High VIF values (above 10) indicate a high collinearity [19].

Clustering
To reduce data dimensions for better visualization of accessions and traits, principal component analysis (PCA) was conducted for 13 traits and 253 accessions using statistical software SPSS (version 19, IBM Corp, Armonk, NY, USA).Clustering analysis of accessions in the primary collection based on morphological traits was performed using gCLUTO software (version 1.0, University of Minnesota, Twin Cities, MI, USA) based on the RB (repeated bisection) method.The advantage of Graphical Clustering Toolkit (gCLUTO) [20] is the better visualization of the clusters.

Establishment of Mini-Core Collection
PowerCore software v 1.1 [21] was used to construct the mini-core collection by analysing phenotypic data using maximization strategy (M strategy).The M strategy was used to select entries of subset collection with highest diversity through a modified heuristic algorithm [22].In order to compare means or medians of traits between primary and mini-core collection, t-test and Wilcoxon signed-rank method was used in Proc t-test and in Proc NPAR1WAY, respectively [18].For traits with normal distribution, the t-test was performed, and for traits with non-normal distribution, the non-parametric Wilcoxon signed-rank method was used.The Ansari-Bradley test was performed using Proc NPAR1WAY to compare the variance of traits with non-normal distributions between the mini-core and primary collection.Moreover, comparison of the variances of traits with normal distributions, between two collections, was carried out using F test.The Shannon-Weaver diversity index [23] was estimated using PowerCore software based on all the traits to measure the diversity of accessions in the primary and mini-core collection, and finally this index was compared between entire collection and mini-core collection with pairwise t-test in SAS Proc means.

Morphological Traits
The average stalk height in the second season was 168.2 cm, which was significantly (p ≤ 0.05) higher than the stalk height in the first year (125.6 cm).There were no significant differences in stalk height among the three time points of the second season (Figure 1a).The same trend was found for internode length, as the first season mean (8.8 cm) was lower than that of the second season (10.1 cm) (Figure 1b).Contrary to these findings, the average stalk diameter in the first season (23.4 mm) was higher than that of the second season (22.6 mm), and this difference was found to be significant (p ≤ 0.05) (Figure 1c).
2012 was 25% higher than January 2012, but no significant differences were found between the three time points in the second season.Figure 1h provides an overview of mean sugar yield of the primary collection sugarcane accessions measured in four harvest time points.In the third time point in the second season (February 2013), the mean sugar yield was significantly higher (p ≤ 0.05) than other time points.
The normality test was carried out according to the Shapiro-Wilk test.The results revealed that the distributions of all traits in the first season were non-normal, but in the second season, distributions for stalk height, stalk diameter, stalk internode number, cane weight, RS percentage, and sugar yield were normal.The distribution of the means of two seasons for stalk height, stalk internode length, stalk diameter, POL%, and RS% was normal, while for other traits it was nonnormal.

Correlations Analysis of Traits Measured in Different Time Points
The pair-wise Pearson product-moment correlation analysis was used to determine the relationship between different harvest time points in two seasons in the primary collection (Table 2).The analysis showed medium R-value for most traits among different time points.Nevertheless, in several traits there were significant positive correlations among different harvest times.For example, a significant positive correlation (p ≤ 0.01) was found between January 2013 and February 2013 for

Sugar-Related Traits and Cane Yield
There was a clear trend of an increase in in juice weight during two seasons (Figure 1d).In February 2013, juice weight was significantly higher than those in other time points.The results obtained from the measurement of mean percentage of Brix are presented in Figure 1e.The mean percentage of Brix (18.74%) was at a maximum in January 2012.In November 2012, the value obtained was minimum (17.70%), but it increased in subsequent time points.It was 18.10% and 18.53% at harvest times in January 2013 and February 2013, respectively.There were significant differences (p ≤ 0.05) between the three harvest times in the second season for this trait (Figure 1f).The trend for recoverable sucrose (RS %) was the same as that of Brix, i.e., the RS percentage was significantly higher in the first season than that of the second season.As shown in Figure 1g, the average cane yield in the second season was much higher than that in the first season.Cane yield in November 2012 was 25% higher than January 2012, but no significant differences were found between the three time points in the second season.Figure 1h provides an overview of mean sugar yield of the primary collection sugarcane accessions measured in four harvest time points.In the third time point in the second season (February 2013), the mean sugar yield was significantly higher (p ≤ 0.05) than other time points.
The normality test was carried out according to the Shapiro-Wilk test.The results revealed that the distributions of all traits in the first season were non-normal, but in the second season, distributions for stalk height, stalk diameter, stalk internode number, cane weight, RS percentage, and sugar yield were normal.The distribution of the means of two seasons for stalk height, stalk internode length, stalk diameter, POL%, and RS% was normal, while for other traits it was non-normal.

Correlations Analysis of Traits Measured in Different Time Points
The pair-wise Pearson product-moment correlation analysis was used to determine the relationship between different harvest time points in two seasons in the primary collection (Table 2).The analysis showed medium R-value for most traits among different time points.Nevertheless, in several traits there were significant positive correlations among different harvest times.For example, a significant positive correlation (p ≤ 0.01) was found between January 2013 and February 2013 for stalk height, internode length, stalk diameter and cane yield with R-values of 0.98, 0.99, 0.98 and 0.99, respectively.In contrast, the lowest correlations between time points were between January 2012 and November 2012 for Brix (R = 0.21) and RS (R = 0.15).Generally, as Table 2 shows, there are modest correlations among different time points for different traits.Further correlation analysis among traits (Table 3) revealed the strong positive correlation between POL% and RS% (R = 0.96, p ≤ 0.01) and also between Brix% and POL% (R = 0.95, p ≤ 0.01).It is apparent from this table that there is a significant positive correlation between cane yield and sugar yield (R = 0.92).Also, RS% was correlated significantly with Brix%; likewise, juice weight with cane weight, both of them with R = 0.90 (p ≤ 0.01).A significant negative correlation was detected between stalk diameter and cane yield (Table 3).In this investigation, the correlation between sugar yield with internode number and POL% was not significant (Table 3).Among the traits, RS%, Pol%, Brix%, PTY%, stalk height, and internode length had significant correlation with sugar yield, suggesting that these traits perhaps can be manipulated for sugar yield improvement in sugarcane.* and **, significant at the 5% and 1% levels probability, respectively.SH = Stalk height, InN = internode number, InL = internode length, SD = stalk diameter, SW = 10-stalk weight, JW = juice weight, J% = juice percent, Br% = Brix percent, Po% = POL percent (sucrose content in the cane), PT% = PTY (the purity of juice), RS% = recoverable sugar percent, CY = cane yield, SY = sugar yield.

Multiple Stepwise Regression Analysis
During genetic improvement, breeders are interested in those traits that have causative effects on economical traits.In this research, to do multiple stepwise regressions, the percentage of recoverable sucrose (RS%) was considered as the dependent variable and other remaining characters as independent variables.The result of the stepwise regression analysis is summarized in Table 4. Accordingly, 96% of total variation for the RS% could be explained by two characters; Brix percent (82%) and PTY percent (14%).The most positive effective traits on RS% were Brix and PTY.Juice weight had a significant negative effect on RS%.

Principal Component Analysis
Figure 2 exhibits the distribution of accessions and traits in groups according to PCA.The first two principal components (PCs) provided a reasonable summary of the data and explained 59.51% of the total variation.PC1 explained 32.36% of the total variation.The most effective traits on this component were sugar yield, cane yield, stalk height and internode length.The second PC, which represented 27.15% of the variation, mainly represented RS% and Brix%.According to PCA result, CL61-620 and CL73-239 (from the USA) were two accessions with the highest content of RS and Brix percent.Likewise, CP73-1547, CP44-101 (from the USA) and TUC68-19 (from Argentina), were accessions with the highest sugar yield and cane yield.From this data, it can be seen that F134, Q138 and CL54-336 accessions had the lowest content of Brix and RS percent.Also, some accessions like CP75-1353, CP81-1254 and CP73-21 (from the USA) had high values for both Brix and sugar yield.accessions with the highest sugar yield and cane yield.From this data, it can be seen that F134, Q138 and CL54-336 accessions had the lowest content of Brix and RS percent.Also, some accessions like CP75-1353, CP81-1254 and CP73-21 (from the USA) had high values for both Brix and sugar yield.

Graphical Clustering Analysis
The results obtained from the clustering analysis are summarized in Table 5.As can be seen, all the accessions were grouped in 10 segregated clusters based on the RB method.The highest and lowest numbers of accessions were observed in Clusters 8 and 1, respectively.This data showed that Clusters 7 and 9 resulted in the highest value of average internal similarities (Isim) (0.999), and Clusters 2 and 6 with Isim of 0.997 had the lowest value of internal similarities.Data from this table can be compared and completed with Figure 3, which shows the mountain visualization of relationships between the 10 clusters.Considering the distances between peaks, cluster 1 is the farthest group from other clusters and thus has the lowest value of average external similarities (Esim) (Table 5).Moreover, the lowest peak height was seen in Clusters 2 and 6, which represents the lowest value of internal similarity in these clusters.As Figure 3 shows, accessions from Cuba, the USA, South Africa, India, and Iran were grouped in the same cluster (Cluster 3).The data in Figure 3 and Table 5 makes it apparent that the smallest group is Cluster 1, which contains only three accessions from two breeding centres in the USA and Cuba, while other clusters, which had accessions from several breeding centres, were grouped in the same cluster.

Graphical Clustering Analysis
The results obtained from the clustering analysis are summarized in Table 5.As can be seen, all the accessions were grouped in 10 segregated clusters based on the RB method.The highest and lowest numbers of accessions were observed in Clusters 8 and 1, respectively.This data showed that Clusters 7 and 9 resulted in the highest value of average internal similarities (Isim) (0.999), and Clusters 2 and 6 with Isim of 0.997 had the lowest value of internal similarities.Data from this table can be compared and completed with Figure 3, which shows the mountain visualization of relationships between the 10 clusters.Considering the distances between peaks, cluster 1 is the farthest group from other clusters and thus has the lowest value of average external similarities (Esim) (Table 5).Moreover, the lowest peak height was seen in Clusters 2 and 6, which represents the lowest value of internal similarity in these clusters.As Figure 3 shows, accessions from Cuba, the USA, South Africa, India, and Iran were grouped in the same cluster (Cluster 3).The data in Figure 3 and Table 5 makes it apparent that the smallest group is Cluster 1, which contains only three accessions from two breeding centres in the USA and Cuba, while other clusters, which had accessions from several breeding centres, were grouped in the same cluster.

Constructing a Mini-Core Collection
Based on the PowerCore program output, conducted on the entire collection (253 accessions), a mini-core collection was generated that included 21 accessions using M strategy and heuristic search.Descriptions of the 21 selected accessions with important morphological and sugar-related traits have been shown in Table 6.
To compare the primary collection with the mini-core collection, the variance difference (VD = 53.5%),mean difference (MD = 2.31%), coincidence rate (CR = 93.96%),and variable rate (VR = 147.28%)were calculated.In traits with normal distribution, there were no significant differences between the means of two collections (Table 7).Similarly, in non-normal traits, there were no significant differences between the medians of two collections (Table 7).The F test showed significant differences between variances of two collections for normally distributed traits, while, for nonnormally distributed traits, except for juice weight, no significant differences were found between two collections (Table 7).Interestingly, for those traits with significant difference in variation, the mini-core collection was more diverse than the whole collection.
To compare the variability of two collections, mini-core collection and primary collection, the Shannon-Weaver diversity index was also used for each trait in each collection (Table 7).The mean of the Shannon-Weaver diversity index scores for primary and mini-core collections were 17.50% and 20.11%, respectively.Pairwise t-test for comparing this index between two collections revealed significant difference, i.e., two collections have different variability.Altogether, these results confirmed that the mini-core collection was a good representation of the primary collection and captured much of the diversity present in the primary collection.

Constructing a Mini-Core Collection
Based on the PowerCore program output, conducted on the entire collection (253 accessions), a mini-core collection was generated that included 21 accessions using M strategy and heuristic search.Descriptions of the 21 selected accessions with important morphological and sugar-related traits have been shown in Table 6.
To compare the primary collection with the mini-core collection, the variance difference (VD = 53.5%),mean difference (MD = 2.31%), coincidence rate (CR = 93.96%),and variable rate (VR = 147.28%)were calculated.In traits with normal distribution, there were no significant differences between the means of two collections (Table 7).Similarly, in non-normal traits, there were no significant differences between the medians of two collections (Table 7).The F test showed significant differences between variances of two collections for normally distributed traits, while, for non-normally distributed traits, except for juice weight, no significant differences were found between two collections (Table 7).Interestingly, for those traits with significant difference in variation, the mini-core collection was more diverse than the whole collection.To compare the variability of two collections, mini-core collection and primary collection, the Shannon-Weaver diversity index was also used for each trait in each collection (Table 7).The mean of the Shannon-Weaver diversity index scores for primary and mini-core collections were 17.50% and 20.11%, respectively.Pairwise t-test for comparing this index between two collections revealed significant difference, i.e., two collections have different variability.Altogether, these results confirmed that the mini-core collection was a good representation of the primary collection and captured much of the diversity present in the primary collection.

Discussion
The first question of this study sought to characterize the sugarcane germplasm with the assessment of phenotypic traits.The mean analysis showed that no differences were found among three harvest time points of the second season for stalk height, internode length, stalk diameter, and cane yield (Figure 1a-c,g).A possible explanation for this result might be that there is cold weather and short daylight in November, January, and February in the second season in Iran, which may have led to a lack of adequate photosynthesis and no differences between the time points were found.Another reason could probably be due to the short time intervals between three harvests; hence, plants did not have growth opportunities.However, for other traits there were significant differences between the three time points in the second season (Figure 1d-f,h).In fact, for morphological traits, the harvest time differences were not significant; while for sugar-related traits significant differences between the three time points of the second season were observed.A significant increase in height from one season to the next was observed (Figure 1a).These results are consistent with those reported by Todd et al. [2], who showed a significant increase in height in the Saccharum officinarum species at different measurement times.
The present experiment was conducted during four harvest times (in two seasons) to determine the effect of growth periods to select the best harvest time in order to achieve the highest cane and sugar yield.It is interesting to note that in all sugar-related traits, significant differences between two seasons and/or different time points were observed (Figure 1d-f,h).This finding showed that the best time point with the maximum sugar yield is February in the second season.The same trend was observed for cane yield in which second season values were significantly (p ≤ 0.05) higher than first season records.Due to the significant increase in cane yield in the second season, these findings suggest the best time for sugarcane harvest to be the second season.Also, it could conceivably be hypothesized that energy-cane cultivars/accessions are preferably harvested in the second year (ratoon stage) due to the importance of cane yield in the production of high biomass and lingo-cellulosic compounds (Figure 1g).
As mentioned earlier, the greatest magnitudes of correlations were observed between January 2013 and February 2013 for stalk height, internode length, stalk diameter and cane yield (Table 2).The observed correlation may be due to the short time interval between January and February harvest times and subsequently, a lack of plant growth.These results are in agreement with those reported by Todd et al., [2] in which there were high correlations for stalk height, stalk diameter, and internode length between two time points.In correlation analysis among traits, positive association was seen between cane yield with stalk height and internode length, at α = 0.01.Cane yield is one of the major parameters for assessment of sugarcane performance [24].Hence, stalk height and internode length are two effective traits to improve and to recognize the high cane yield genotype.These results are in agreement with recent studies indicating that cane yield was associated positively with stalk height, internode length and cane thickness [25].Another important significant positive correlation was found between sugar-related factors such as RS%, Pol%, Brix%, PTY%, with sugar yield.The present results are significant in at least two major respects.On the one hand, for sugar yield improvement in sugarcane accessions, it is suggested to focus and to manipulate the factors such as RS%, Pol%, Brix%, PTY%.On the other hand, for increase in cane yield, it is recommended to focus on stalk height and internode length.
In current research, Brix% and PTY% were identified as effective traits on RS%, according to multiple stepwise regression analysis results.Positive regression coefficients of these variables indicated that these variables can be mentioned as logical and reasonable indices for selection in order to improve recoverable sucrose in sugarcane.This result confirms the strong association between RS% with two variables of Brix% and PTY%, (Table 3).The question that arises here is that in the study of correlation analysis (Table 3), positive correlation between RS% and POL% was found while it is missing in the model.In fact, due to high multicollinearity among independent variables scaled by VIF, POL% was omitted from the analysis.
The PCA scatterplot of two principal components axes provided the distribution of primary collection accessions in a biplot.The PCA finding has important implications for introducing accessions with desirable traits, such as accessions with high content of RS% and Brix% or accessions with high/low sugar yield or cane yield (Figure 2).Arguably, depending on the approach of sugarcane breeding, these traits can be utilized by breeders.Some of the clear findings in the biplot are that CL61-620 and CL73-239 are accessions with the highest content of RS% and Brix% percent, and that CP45-3, Q68 and EH16-9 are accessions with the highest cane yield (Figure 2).Interestingly these accessions are present in the constructed mini-core collection in this research (Table 6).
The clustering analysis revealed the morphologic similarity of different accessions from different breeding centres.For example, grouping of the USA and Cuba accessions in the same clusters indicated that there are morphologic similarities between these accessions (Figure 3).Likewise, the grouping of the Iran-Cuba hybrid (Iran) with the USA, India, Cuba, and South Africa varieties in Cluster 3 demonstrate the morphologic similarities between them (Figure 3).
The range values of all traits in both collections are shown in Table 7.As can be seen from the table (above), in the core collection (primary collection), the range value of stalk height and stalk diameter obtained 129.17 (cm) and 15.46 (mm) respectively.Todd et al. showed that range value of stalk height and stalk diameter 304.13 (cm) and 37.79 (mm), respectively, in World Collection of Sugarcane (Saccharum spp.) and Related Grasses of Sugarcane in Miami, Florida.This is different from the findings presented here about the range value of traits, but the mean value of the mentioned traits in our collection are consistent with data obtained with Todd et al [2].
With respect to the second research goal, PowerCore software was used to construct a mini-core collection.PowerCore software had been used previously to create core/mini collections for rice (Oryza sativa L.) [26,27], sesame (Sesamum indicum L.) [28], and chickpea (Cicer arietinum L.) [29].Using this software, 21 accessions were selected to build a mini-core collection out of 253 accessions of sugarcane.These results confirm the appropriate number of subset collections (10% of the primary collection), as was mentioned in the literature [8].A core/mini-core collection with a coincidence rate (CR%) greater than 80% has been recommended as a proper collection for breeding purposes [30].In our analysis, the CR was found to be 93.96%,indicating that the mini-core collection selected from sugarcane germplasm is a good representative subset of the phenotypic diversity in the primary collection.Also, tand Wilcoxon test results showed no significant difference in means and medians between the mini-core and primary sugarcane collections (Table 7).Furthermore, high values for VR% and VD% [31], which are the evaluating factors of core collections, reconfirm that the mini-core collection in this study incorporates the main portion of diversity of the primary sugarcane collection.Additionally, the F and Ansari-Bradley test results also revealed no significant loss of diversity in mini-core collection compared with the primary collection.Interestingly, for normally distributed traits, the mini-core collection was more diverse than the whole collection.However, for non-normally distributed traits, Ansari-Bradley test results showed non-significant difference between two collections, except for juice weight.
For comparing the Shannon-Weaver diversity indices between the two collections pairwise t-test was carried out.Accordingly, significant differences were found between the two collections, so that Shannon-Weaver index scores in the mini-core were significantly higher than in the primary collection.Higher variance in the core/mini collection relative to the entire collection has been reported frequently in previous researches [2,31,32].

Figure 1 .
Figure 1.(a) Mean stalk height (cm); (b) Mean internode length (cm); (c) Mean diameter (mm); (d) Mean juice weight (kg); (e) Mean percentage Brix of juice extracted from stalks; (f) Mean recoverable sucrose; (g) Mean cane yield (ton/ha); and (h) Mean sugar yield (ton/ha) of primary collection of sugarcane accessions measured in two seasons (in four time points).In all Figures (a-h), means followed with the same letters are not significantly different at p ≤ 0.05.

Figure 1 .
Figure 1.(a) Mean stalk height (cm); (b) Mean internode length (cm); (c) Mean diameter (mm); (d) Mean juice weight (kg); (e) Mean percentage Brix of juice extracted from stalks; (f) Mean recoverable sucrose; (g) Mean cane yield (ton/ha); and (h) Mean sugar yield (ton/ha) of primary collection of sugarcane accessions measured in two seasons (in four time points).In all Figures (a-h), means followed with the same letters are not significantly different at p ≤ 0.05.

Figure 2 .
Figure 2. Two-dimensional scatter plot of principal component analysis for the first two principal components created from phenotypic traits from the sugarcane collection (Saccharum spp.) in Khuzestan province, Iran.

Figure 2 .
Figure 2. Two-dimensional scatter plot of principal component analysis for the first two principal components created from phenotypic traits from the sugarcane collection (Saccharum spp.) in Khuzestan province, Iran.

Figure 3 .
Figure 3.The mountain (left) and matrix (right) visualization of relationships among 10 clusters constructed in collection of sugarcane (Saccharum spp.) in Khuzestan.The clustering analysis and visualization were performed using gCLUTO.

Figure 3 .
Figure 3.The mountain (left) and matrix (right) visualization of relationships among 10 clusters constructed in collection of sugarcane (Saccharum spp.) in Khuzestan.The clustering analysis and visualization were performed using gCLUTO.

Table 2 .
Correlation coefficients among months for the different traits measured in two seasons of the collection of sugarcane.

Table 3 .
Correlation coefficients among thirteen traits measured in 253 sugarcane accessions for mean measurement of two seasons.

Table 4 .
Stepwise regression model for RS% as dependent variable and the remaining characters as independent variables.

Table 5 .
Clustering analysis of 253 sugarcane accessions with morphological traits based on repeated bisection (RB) method.

Table 5 .
Clustering analysis of 253 sugarcane accessions with morphological traits based on repeated bisection (RB) method.

Table 6 .
Description of 21 sugarcane accessions created by M-strategy and heuristic algorithm.

Table 7 .
Comparison of descriptive statistics of the accessions growing in the primary germplasm of the sugarcane collection (means of two season), and the accessions from this collection selected for the mini-core collection (breeding collection), table (b) is continue of table (a).Inter N = internode number, Inter L = internode length, 10 SW = 10-stalk weight, JW = juice weight, Pol = sucrose content in the cane, PTY = the purity of juice, RS% = recoverable sugar percent.* and **, significant at p < 0.05 and 0.01, respectively.CC: core collection: Primary collection.MCC: mini core collection.