Molecular Phylogeny of Selected Kenyan Eucalyptus Species Inferred from MatK , rbcL and TrnL-F Genes and Their Suitability for Power Transmission Poles

: Genus Eucalyptus belongs to the family Myrtaceae and consists of more than 900 species, various hybrids and varieties. The major species that are grown in Kenya are Eucalyptus grandis, E. globulus, E. saligna and E. camaldulensis . Most Eucalyptus species are highly dependent on rainfall and this is challenged by climatic changes owing to global warming making it difﬁcult to effectively match the availability of mature trees and the market demand especially for use as power transmission poles. With the widespread availability of other naturally occurring Eucalyptus species such as E. camaldulensis and E. globulus, it becomes important to determine the genetic diversity and to analyze the phenotypic traits of these species for suitability as power transmission poles in order to counter the overdependence on E. grandis . Phenotypic traits investigated included measuring total tree height and diameter at breast height (DBH), while molecular data were obtained from sequencing MatK , rbcL and TrnL-F genes from selected species and evolutionary analyses such as nucleotide substitution rates, base composition disparity indices, evolutionary divergence, nucleotide diversity indices and phylogeny construction were conducted in MEGA 11. Signiﬁcant differences in DBH and height among Eucalyptus species were observed when the phenotypic data were subjected to ANOVA. In this study, E. robusta , E. paniculata , E. maculata, E. dunnii , E. camaldulensis and E. citriodora are ﬁt to be used as power transmission poles but they are limited by their short height. However, E. tereticornis and E. glaucina have the desired DBH and height and hence can be used as substitutes for E.grandis. Generally, the molecular phylogeny study has shown that the studied Eucalyptus species are closely related and form various monophyletic clades which can be attributed to the short genetic distances, low substitution rates, low nucleotide bias disparity indices and low diversity scores. Further phylogenetic and gene expression studies involving more Eucalyptus species are needed to better understand Eucalyptus phylogeny, and diversity and identify species with similar genetic make-up to that of E. grandis which has been used extensively for the provision of electricity transmission poles.


Introduction
The genus Eucalyptus belongs to the family Myrtaceae and comprises more than 900 species with various hybrids and varieties globally [1]. They grow in different ecological zones; hardy species grow in semi-arid areas while others grow on marshy sites and under a variety of soils including infertile sands, fertile loam soils and heavy clays [2]. Eucalyptus are plantation trees that are widely cultivated in Kenya comprising mostly species of the subgenus Symphyomyrtus [3]. Plantations composed of Eucalyptus species have been of concrete poles stands at less than 10% of the current annual demand for power transmission poles [21]. In Kenya, there has been a deficit of 200,000 transmission poles that have necessitated importation from various countries since 2006 [10,22]. There is a need, therefore, to venture into low-cost production of E. grandis and other alternative Eucalyptus species to increase the local supply of poles and reduce their importation.
Chloroplast DNA genes such as Maturase K (MatK), Ribulose biphosphate carboxylase large subunit (rbcL), and TrnL-F intergenic spacer (TrnL-F) are important in studies of population genetics and phylogeography in Eucalyptus. This is because most chloroplast genes evolve at a slower rate compared to nucleus genes and are considered the best genes for systematic studies [23]. Barthet et al. demonstrate that the MatK gene exists in most plant species, and the expression of this gene is influenced by light and development stage hence the importance of this gene in plant systematics, plant molecular biology, and gene evolution studies [24]. RbcL gene has also been exploited to understand plant systematics beyond the family level, and the TrnL-F gene is expressed in the leaves of plants and function in the first step of photosynthesis and photorespiration in plants and has also been used widely in plant phylogenetic studies [25].
Studying the current status of morphological and genetic diversity of Eucalyptus species and obtaining genetic data will provide a guide for their management, assessment and identification of economically and environmentally beneficial species [26]. Molecular data have been widely applied to understand the role of historical processes in shaping standing genetic diversity within species [27], but little is known about the phylogenetic relationships of most of the Kenyan Eucalyptus species. Eucalyptus is naturally protandrous and hence promotes outcrossing [28]. Consequently, for both natural and manipulated crosses, the offspring are morphological intermediates between the two parent taxa and show a high level of variability in the second generation and identification of the intermediate offspring is a challenge that could be addressed by molecular phylogenetic studies [28].

Sample Collection and Determination of Phenotypic Traits
Leaves of ten species of Eucalyptus aged between 8 and 12 years were collected from various Kenya Forest Service plantations and farms in Kakuzi in Nairobi and Londiani in Rift Valley in February 2021. They were positively identified by Magrate Kaigongi, a botanist at Kenya Forestry Research Institute (KEFRI) and selected for phenotypic study and as source samples for molecular analysis. Samples were sealed in plastic bags containing silica and transported in a cooler box to maintain viability of the cells and stored in the laboratory at −20 • C until DNA extraction was performed. Voucher specimens were deposited in KEFRI herbarium. The following are voucher specimen numbers of the specimens collected: E. grandis (MK2021/01), E. paniculata (MK2021/02), E. maculata (MK2021/03), E. robusta (MK2021/04), E. camaldulensis (MK2021/05), E. citriodora (MK2021/06), E. urophylla (MK2021/07), E.dunnii (MK2021/08), E.glaucina (MK2021/09) and E.tereticornis (MK2021/10). From the sampled species, two phenotypic traits were measured: total tree height (m), and diameter at breast height (DBH) (cm). Stems were assessed for straightness, and then stem diameter was determined using diameter tape and the total height of the tree measured by Suunto hypsometer [29]. Five observations were randomly obtained for each species under the two phenotypic traits investigated.

DNA Extraction, PCR, and Sequencing
DNA was extracted from fresh leaves of the collected Eucalyptus species using the cetyltrimethylammonium bromide (CTAB) DNA extraction method. Three chloroplast genes which are universal molecular markers suitable for phylogenetic studies in plants were targeted. These genes were MatK gene, rbcL and TrnL-F. Amplification of DNA was then carried out using primers of the three genes selected from previous phylogenetic studies (Table 1). Sequencing was performed at Inqaba Biotec East Africa (IBEA), South Africa. The obtained sequences of the three genes from the ten Eucalyptus species were then Diversity 2022, 14, 563 4 of 13 exported to MEGA 11 software (Molecular Evolutionary Genetics Analysis Version 11) for phylogenetic analysis [30].

Phylogenetic Analysis
Obtained DNA sequences were subjected to basic evolutionary analyses such as evolutionary distances and phylogenetic reconstructions recommended by Tamura et al. [30] and Kumar et al. [34]. All the MatK, rbcL and TrnL-F sequences from Eucalyptus species were exported and assembled in MEGA 11 and their ends edited manually to remove wrong bases at the ends, which could compromise the quality of the sequences, then aligned using Muscle-codon alignment method [35]. Multiple sequence alignment is important in identification of gaps, matches and mismatched nucleotides in the three genes which is a prerequisite for the construction of phylogenetic trees. Evolutionary analyses included measures of nucleotide substitution rates, base composition bias disparity index, evolutionary divergence and nucleotide diversity indices. Codon positions included in the analyses were 1st + 2nd + 3rd + Noncoding. During phylogenetic analysis of the three genes, Kimura 3 parameter and Jukes-Cantor evolutionary models were followed [36,37]. The models assumed the rates and patterns of substitution were uniform among the four nucleotide sites. The method of tree construction was maximum likelihood (ML) where for the ML tree, the heuristic approach was the nearest neighbor interchange and the test for phylogeny was bootstrap resampling where the number of bootstrap replications were set at 5000 for each ML tree constructed. The trees with the highest log likelihood were selected as the best trees to depict the phylogenetic relationships of Eucalyptus species under study.

Phenotypic Traits of Eucalyptus Species
DBH and height means of Eucalyptus species under study aged between 8 and 12 years are shown in Figure 1. The mean DBH of E. camaldulensis (90.33 ± 2.31 cm) is higher than the mean DBH of other species which is followed by the mean DBH of E. grandis (71.0 ± 3.24 cm), while the others have a mean DBH of around 25 cm. The high mean DBH of E. camaldulensis could be because the height of E.camaldulensis is smaller compared to the other species, and it can be assumed that it increases in girth more than height. E. dunnii has the smallest mean DBH (18.0+1.2 cm). The preferred range of DBH according to Kenya Power and Lightning Company (KPLC), for wooden power transmission poles is 18.3-28.3 cm [22]. Hence E. glaucina (27.0 ± 1.81 cm), E. robusta (22.0 ± 1.73 cm), E. paniculata (30.67 ± 2.41 cm), E. tereticornis (26.0 ± 1.59 cm), E.urophylla (22.0 ± 2.81 cm), E. maculata (24.67 ± 1.57 cm), E. dunnii (18.0 + 1.2 cm) and E. citriodora (27.0 ± 1.22 cm) have prefered average DBH for power transmission poles. However, DBH mainly will depend on the age of the tree, and the climatic condition of the region where the trees are grown, and species such as E. grandis can attain high DBH. In wet areas, Eucalyptus will attain the desired size between 8 and 10 years, while in drier areas, the same species could reach the preferred size between 12 and15 years [10]. Data analysis was performed in SPSS version 23. Initially, data were assessed for normality using Shapiro-Wilk test, which is an appropriate method for small sample sizes. Shapiro-Wilk statistic (W) showed that data were normally distributed since all p values for the W statistic in all the 10 Eucalyptus samples were above 0.05. Similarly, Q-Q normal plots showed that the data points did not deviate significantly from the diagonal line suggesting that the samples were normally distributed. Levene's test results for DBH and height were (F = 3.44, p = 0.061) and (F = 6.921, p = 0.072) respectively which indicated homogeneity of variance among the 10 Eucalyptus samples. A one-way ANOVA revealed that there was statistically significant difference in DBH (F (9, 40) = 33.356, p = 0.004) and height (F (9, 40) = 7.364, p = 0.007) among the 10 Eucalyptus species. species such as E. grandis can attain high DBH. In wet areas, Eucalyptus will attain the desired size between 8 and 10 years, while in drier areas, the same species could reach the preferred size between 12 and15 years [10]. Data analysis was performed in SPSS version 23. Initially, data were assessed for normality using Shapiro-Wilk test, which is an appropriate method for small sample sizes. Shapiro-Wilk statistic (W) showed that data were normally distributed since all p values for the W statistic in all the 10 Eucalyptus samples were above 0.05. Similarly, Q-Q normal plots showed that the data points did not deviate significantly from the diagonal line suggesting that the samples were normally distributed. Levene's test results for DBH and height were (F = 3.44, p = 0.061) and (F = 6.921, p = 0.072) respectively which indicated homogeneity of variance among the 10 Eucalyptus samples. A one-way ANOVA revealed that there was statistically significant difference in DBH (F (9, 40) = 33.356, p = 0.004) and height (F (9, 40) = 7.364, p = 0.007) among the 10 Eucalyptus species. According to Figure 1, the mean height of E. grandis (34.0 ± 3.11 m) is slightly higher than the mean height of other species, followed by E. glaucina (  According to Figure 1, the mean height of E. grandis (34.0 ± 3.11 m) is slightly higher than the mean height of other species, followed by E. glaucina (

Molecular Phylogeny of Eucalyptus Species
Twenty-four sequences (10 rbcL, 7 MatK and 7 TrnL-F sequences) of the three genes from the ten Eucalyptus species were successfully amplified and sequenced then deposited in NCBI Genbank and their accession numbers are tabulated below (Table 2).
Evolutionary statistics were performed in MEGA 11 to find the best model for evolutionary distances and phylogeny based on the three genes under study. Models with the lowest Bayesian Information Criterion (BIC) scores are considered the best in describing nucleotide substitution pattern in the course of evolution according to Tamura et al. [30]. From the analyses, the best nucleotide evolutionary model for analysis of MatK sequences was Tamura 3 parameter [38] with the lowest BIC value of 3062.8906. For analysis of rbcL sequences, the best model with the lowest BIC score of 2703.96 was the Jukes-Cantor model [37], while for the TrnL-F, the model for phylogeny analysis and construction of trees with the lowest BIC score of 1749. The base pattern disparity index, which measures the evenness of substitution terns between sequences, was also determined. Figure 3, shows the values of the b composition bias disparity per site between sequence pairs in the species studied. Va greater than zero indicate larger differences in base composition biases than expec when evolutionary distances are measured between the sequence pairs [39]. Hence lo The base pattern disparity index, which measures the evenness of substitution patterns between sequences, was also determined. Figure 3, shows the values of the base composition bias disparity per site between sequence pairs in the species studied. Values greater than zero indicate larger differences in base composition biases than expected when evolutionary distances are measured between the sequence pairs [39]. Hence lower nucleotide disparity index is positively correlated to close relatedness. Based on the rbcL sequences, the highest value was observed between E. paniculata and E. grandis which suggested larger differences between them. Generally, E. paniculata, when compared to the other species, showed higher disparity scores. The base composition disparity index for the MatK sequences was higher between E. citriodora and E. glaucina, E. robusta, and E. urophylla, suggesting larger genetic differences between E. citriodora and the other three species. Based on the TrnL-F sequences, high disparity indices were observed between E. grandis and E. tereticornis (0.14) and also between E. grandis and E. dunnii (0.88).  Based on the rbcL sequences, transition/transversion rate ratios were; k1 = 1.806 for purines and k2 =1.36 for pyrimidines and the overall transition/transversion bias R was 0.777. Results from the selection test of the rbcL sequences based on Tajima's neutrality test reported that the number of segregation sites was 78, with low nucleotide diversity of 0.029. With regard to MatK sequences, transition/transversion rate ratios were: k1 = 0.861 for purines and k2 = 1.51 for pyrimidines, and the overall transition/transversion bias R was 0.562, and the results of the selection test based on Tajima's neutrality test reported that the number of segregation sites was 46 with low nucleotide diversity of 0.022. For the TrnL-F genes, the transition/transversion rate ratios were: k1 = 2.732 for purines and k2 = 1.334 for pyrimidines, and the overall transition/transversion bias R was 0.96 while Tajima's neutrality test reported that the number of segregation sites was 84 with low nucleotide diversity of 0.0256.
Determination of evolutionary divergence is essential in molecular evolution studies and phylogenetic reconstructions, where it helps in the estimation of coalescence and divergence times [40]. The best way to measure the distance between a pair of sequences is to determine the number of nucleotide substitutions occurring between them. Figure 4 shows an evolutionary divergence between Eucalyptus species based on the three genes. Just like in base composition disparity indices, higher divergence suggests larger differences between sequences. Determination of evolutionary distances was conducted using the Maximum Likelihood method [30], and the number of base substitutions per site between sequence pairs showed reliable variations between the Eucalyptus species under study, with more substitutions per site implying a larger evolutionary distance. Based on the rbcL sequences, transition/transversion rate ratios were; k 1 = 1.806 for purines and k 2 =1.36 for pyrimidines and the overall transition/transversion bias R was 0.777. Results from the selection test of the rbcL sequences based on Tajima's neutrality test reported that the number of segregation sites was 78, with low nucleotide diversity of 0.029. With regard to MatK sequences, transition/transversion rate ratios were: k 1 = 0.861 for purines and k 2 = 1.51 for pyrimidines, and the overall transition/transversion bias R was 0.562, and the results of the selection test based on Tajima's neutrality test reported that the number of segregation sites was 46 with low nucleotide diversity of 0.022. For the TrnL-F genes, the transition/transversion rate ratios were: k 1 = 2.732 for purines and k 2 = 1.334 for pyrimidines, and the overall transition/transversion bias R was 0.96 while Tajima's neutrality test reported that the number of segregation sites was 84 with low nucleotide diversity of 0.0256.
Determination of evolutionary divergence is essential in molecular evolution studies and phylogenetic reconstructions, where it helps in the estimation of coalescence and divergence times [40]. The best way to measure the distance between a pair of sequences is to determine the number of nucleotide substitutions occurring between them. Figure 4 shows an evolutionary divergence between Eucalyptus species based on the three genes. Just like in base composition disparity indices, higher divergence suggests larger differences between sequences. Determination of evolutionary distances was conducted using the Maximum Likelihood method [30], and the number of base substitutions per site between sequence pairs showed reliable variations between the Eucalyptus species under study, with more substitutions per site implying a larger evolutionary distance.  In the rbcL sequences, the longest genetic distance was between E. paniculata and other Eucalyptus species ranging from 0.12 to 0.13. This suggested that E. paniculata was distantly related to the other species, which had shorter divergence between any two of them, implying close genetic relations. On the other hand, a comparison between most species depicted lower divergence based on the MatK sequences ranging from 0.00 to 0.03, suggesting a close relationship. For the TrnL-F sequences, the largest genetic distance of 0.26 was seen in the comparison of E. dunnii and E. grandis implying that these two species may not be closely related. Phylogenetic tree constructed based on concatenated alignments from the three genes is shown in Figure 5, Partitioning of the concatenated dataset was done in PartitionFinder2 [41] to select the best-fit partitioning strategy and model for the concatenated sequences. The tree was rooted by including Callistemon viminalis to show the hypothetical last common ancestor of the Eucalyptus species under study. Calilistemon is a genus within the Myrtaceae family just like Eucalyptus genus.  In the rbcL sequences, the longest genetic distance was between E. paniculata and other Eucalyptus species ranging from 0.12 to 0.13. This suggested that E. paniculata was distantly related to the other species, which had shorter divergence between any two of them, implying close genetic relations. On the other hand, a comparison between most species depicted lower divergence based on the MatK sequences ranging from 0.00 to 0.03, suggesting a close relationship. For the TrnL-F sequences, the largest genetic distance of 0.26 was seen in the comparison of E. dunnii and E. grandis implying that these two species may not be closely related. Phylogenetic tree constructed based on concatenated alignments from the three genes is shown in Figure 5, Partitioning of the concatenated dataset was done in PartitionFinder2 [41] to select the best-fit partitioning strategy and model for the concatenated sequences. The tree was rooted by including Callistemon viminalis to show the hypothetical last common ancestor of the Eucalyptus species under study. Calilistemon is a genus within the Myrtaceae family just like Eucalyptus genus.  In the rbcL sequences, the longest genetic distance was between E. paniculata and other Eucalyptus species ranging from 0.12 to 0.13. This suggested that E. paniculata was distantly related to the other species, which had shorter divergence between any two of them, implying close genetic relations. On the other hand, a comparison between most species depicted lower divergence based on the MatK sequences ranging from 0.00 to 0.03, suggesting a close relationship. For the TrnL-F sequences, the largest genetic distance of 0.26 was seen in the comparison of E. dunnii and E. grandis implying that these two species may not be closely related. Phylogenetic tree constructed based on concatenated alignments from the three genes is shown in Figure 5, Partitioning of the concatenated dataset was done in PartitionFinder2 [41] to select the best-fit partitioning strategy and model for the concatenated sequences. The tree was rooted by including Callistemon viminalis to show the hypothetical last common ancestor of the Eucalyptus species under study. Calilistemon is a genus within the Myrtaceae family just like Eucalyptus genus.   Initial trees for the ML heuristic search were attained through the application of Neighbor-Join and BioNeighbor-Join algorithms to pairwise distances matrix generated from all the sequences from the three genes based on the two selected models of evolution. The final tree selected was the tree topology with a superior log likelihood value. Bootstrap resampling was selected as a test of phylogeny, and it measured the reliability of the tree generated [17]. Species close to each other based on the tree suggest a close genetic relationship, and those emanating from the same node show that they form a monophyletic clade. Bootstrap values at the node indicate how well the tree topology/monophyletic clade is supported and reliable. Bootstrap values of over 70 imply strong support, while values of around 50 indicate that the tree topology/monophyletic clade is moderately supported, and those below 50 indicate that the tree topology/monophyletic clade is poorly supported [17].

Phenotypic Traits of Eucalyptus Species
Two phenotypic traits were analyzed among ten different species of Eucalyptus for suitability for use as power transmission poles. The traits analyzed were diameter at breast height and total tree height. The assessment was performed on mature Eucalyptus trees aged between 8 and 12 years that could easily be measured, as suggested by Marco de Lima et al. [12]. The requirements for a tree species to be used as a power transmission pole are reasonable strength, DBH ranging from 18-28 cm and a total height ranging 30-55 m, straightness, ability to withstand splitting and twisting and should be amenable to treatment by use of preservatives [42]. From the phenotypic results, E. robusta, E. paniculata, E. maculata, E. dunnii, E. camaldulensis and E. citriodora might be used as power transmission poles because they have the preferred DBH but they are limited in height. Among the species listed above, only E. tereticornis and E. galucina had both the desired DBH and height and hence can be used as substitutes for E. grandis for transmission poles and reduce importation of transmission poles. To combat pole shortage, one should compromise the ecological effects on the environment since species that mature fast need more water [7]. In Europe and USA, the use of wooden poles for power transmission is common, and for a long time, they have given satisfactory services, as steel poles are used only to carry lines of high tension in urban areas [42]. Other Eucalyptus species, such as E. saligna, are weak but have been used extensively for all kinds of poles because of their straightness and ability to absorb preservatives readily [43]. E. resinifera and E. globulus can also be used to some extent, but they are not readily available. E. paniculata, on the other hand, is one of the strongest pole timbers, but it cannot be used as a pole since it does not absorb preservatives easily; hence, its poles may not be durable [7].

Phylogeny of Eucalyptus Species
Genetic variation is important for the survival of the plant population since it leads to adaptation and evolution [44,45]. Genetic drift, migration, and a combination of mutations lead to genetic variation among the population [46]. Studying genetic diversity among plant species is important for conservation and breeding programs [47]. In this study, a total of ten species were analyzed in which three genes: MatK, rbcL and TrnL-F were amplified and sequenced and aligned. A species tree was constructed based on concatenated data from the three genes as shown in Figure 5. From the species tree, E. grandis is closely related to E. camaldulensis and the relationship between the two is supported by a moderate bootstrap support value of 66. In addition, E. robusta, E. dunnii, E. camaldulensis and E. grandis form a monophyletic clade which supported by a moderate bootstrap value of 50. The close relationship between E. robusta, and E. dunnii seen is also supported by De Araújo et al [48]. E. glaucina has also been shown to be closely related to E. paniculata and E. tereticornis and the monophyletic clade is supported by a moderate bootstrap support value of 63. Based on the concatenated tree, nine species which include E. maculata, E.glaucina, E. tereticornis, E. paniculata, E. urophylla, E. robusta, E. dunnii, E. grandis and E. camaldulensis have been shown to be closely related and form a monophyletic clade which is supported by a moderate boot strap support value of 66. Only E. citriodora is distantly related from the other species. The close relationship between some of the species such as E. citriodora and E. maculata seen also agrees with the results reported by De Araújo et al. [48] who grouped the two species together. Most of the branches are supported by moderate bootstrap values, this is because in a concatenated tree, the genes used could have conflicting phylogenetic signals causing poor consistency hence moderate to poor bootstrap support values. In most cases, species tree may not show exactly how species split apart due to incomplete lineage sorting, introgression between species, gene duplication, subsequent gene losses, and mis-assignment of paralogs as orthologs which may lead to phylogenetic incongruence; where phylogenies from individual genes differ from species phylogenies [49]. In Figure 5, the monophyletic clade which includes nine of the ten studied Eucalyptus species has a moderate bootstrap support value of 66 indicating a close phylogenetic relationship among the study Eucalyptus species. Indeed, by rooting the three trees through inclusion of a distantly related species such as C. viminalis, it was possible to show the hypothetical last common ancestor of all the Eucalyptus species under study. The close relationship observed can be attributed to low nucleotide substitution rates, low base disparity indices and shorter genetic distances among the species.
The findings from the study have been supported by findings from earlier morphological, molecular, and biochemical studies, which have reported that Eucalyptus species are closely related [48,50,51]. This close relationship of Eucalyptus species supports the use of E. tereticornis and E.glaucina as a substitute for E. grandis in the provision of transmission poles since they have the desired DBH and height. Eucalyptus species occurring in different geographical areas have different mutation rates [52], and this could explain why there are some differences in morphological and genetic diversity among the ten Eucalyptus populations. Genetic diversity within a population can also be caused by drought, diseases, and rain patterns [53]. However, according to Dawson et al., genetic diversity is impacted directly by the difference in flowering times within and among the populations [54]. Extensive gene flow causes low genetic differentiation among the population [55]. This could explain why there is low genetic variation among the ten Eucalyptus species studied due to alleles that are shared among the Eucalyptus species [56]. Hence comparison between the species showed low substitution rates, low base disparity index, low genetic distances, and low diversity scores, and the resulting phylogeny suggested that most species were closely related implying close relatedness of the species under study. This explains why most of the Eucalyptus species grown in Kenya are considered useful for poles, although only a few are highly used, particularly due to fast growth, strength and treatability.

Conclusions and Recommendations
Based on the phenotypic traits, E. robusta, E. paniculata, E. maculata, E. dunnii, E. camaldulensis and E. citriodora can be used as power transmission poles but they are limited due to their short height. However, E. tereticornis and E. glaucina have the desired DBH and height and hence can be used as substitutes for E.grandis. Generally, the molecular phylogeny study, has shown that the studied Eucalyptus species are closely related and form various monophyletic clades which can be attributed to the short genetic distances, low nucleotide substitution rates, low nucleotide bias disparity indices and low diversity scores. Since the current study worked on only ten Eucalyptus species, further research involving more Eucalyptus species is needed to better understand Eucalyptus phylogenetic relationships and diversity. Roughly, over a hundred Eucalypts have been identified in East Africa, comprising indigenous species, exotic ones, clones and hybrids [57]. Some of these species have specific growth requirements and will only grow in specific ecological zones. Investigations on molecular phylogeny and related studies of all the species can help researchers in dealing with Eucalyptus systematics pitfalls and identify species with similar genetic make-up to that of E. grandis, which has been used extensively for the provision of electricity transmission poles. Such studies will not only involve the use of MatK, Rbcl, and TrnL-F genes but can also involve the use of other molecular markers such as ndh-F, atpB, trnT-trnL, and atpB-Rbcl and employing methods that have been used to successfully study phylogenetics and gene expression in angiosperms. A comparison of phylogenetic trees based on two or three gene sequences can prove valuable in molecular phylogeny and diversity of Eucalyptus species.