Diversification of Chemical Structures of Methoxylated Flavonoids and Genes Encoding Flavonoid-O-Methyltransferases

The O-methylation of specialized metabolites in plants is a unique decoration that provides structural and functional diversity of the metabolites with changes in chemical properties and intracellular localizations. The O-methylation of flavonoids, which is a class of plant specialized metabolites, promotes their antimicrobial activities and liposolubility. Flavonoid O-methyltransferases (FOMTs), which are responsible for the O-methylation process of the flavonoid aglycone, generally accept a broad range of substrates across flavones, flavonols and lignin precursors, with different substrate preferences. Therefore, the characterization of FOMTs with the physiology roles of methoxylated flavonoids is useful for crop improvement and metabolic engineering. In this review, we summarized the chemodiversity and physiology roles of methoxylated flavonoids, which were already reported, and we performed a cross-species comparison to illustrate an overview of diversification and conserved catalytic sites of the flavonoid O-methyltransferases.


Introduction
Flavonoids are a large group of phenolic compounds; this group encompasses flavonol, flavanone, flavone, isoflavone, flavanol and anthocyanin subgroups, which play important roles in the physiological response of plants to various biotic and abiotic stresses. The core structure of flavonoids, their "aglycone", is composed of A, B and C rings, whilst a series of modification reactions, such as hydroxylation, glycosylation, prenylation and methylation, can enhance multiple physiological functions corresponding to both their structural diversity and tissue specificities. The hydroxylation reaction binds to the backbone in different numbers and positions, being the basis of subsequent O-methylation reactions.
The O-methylation of flavonoid aglycones results in reduced molecular activity of a hydroxyl moiety and increased lipophilicity, and it modifies their intracellular compartmentation. Furthermore, O-methylation provides a branchpoint in the biosynthesis of diverse metabolic pathways, such as lignin and flavonoid phytoalexin biosyntheses, which promote antimicrobial activity [1,2]. In the case of anthocyanins, O-methylation stabilizes the B ring, increasing its water solubility and enhancing its redshift effect [3]. In terms of chemical properties, methoxylated flavonoids have been reported to be produced in secretion systems, such as trichomes and root hairs in plants, for example, pentamethylmyricetin and xanthohumol in tomato and hop glandular trichomes, respectively [4,5], and 3 -methoxypuerarin, 5-methoxygenistein and 2 -methoxychalcone in Pueraria lobata, yellow lupin and alfalfa roots, respectively [6][7][8]. However, methoxylated flavonoids also receive attention due to their pharmacological benefits. Flavonoids consumed by humans undergo intracellular metabolism, usually by O-methylation or glucuronidation, which reduces their ability to donate hydrogen atoms [9]. In the example of puerarin and its 3 -methoxy derivative, higher liposolubility and antioxidant activity [6], as well as advantages in the protection of cerebral ischemia-reperfusion injury, were reported [10]. Furthermore, the methoxylated isoflavones neovestitol and vestitol from Brazilian red propolis display anti-inflammatory and antimicrobial bioactivity, thereby being regarded as potentially suitable pharmacological ingredients [11]. The bioactivity of the major methoxylated flavonoids has been the focus of a couple of recent reviews [12,13].
S-adenosyl-methionine (SAM)-dependent methyltransferases are enzymes utilizing SAM as a universal methyl donor, which operates in both primary and secondary metabolism [14]. Based on its substrate specificity, SAM-dependent methyltransferases can be sub-classified as O-, Nor Cmethyltransferases (OMT, NMT and CMT, respectively). Plant OMTs act as hydroxyl group acceptors of small molecules, such as flavonoids, alkaloids, phytoalexins, lignin precursors, simple phenols and phytohormones [15]. Flavonoid Omethyltransferases (FOMTs) of plants belong to class I and class II subgroups according to their molecular weight and cation dependency. The class I caffeoyl-CoA OMT (CCoAOMT) type group is of a lower subunit molecular weight (26)(27)(28)(29)(30) and displays cationdependent activity. The enzymes of this subclass mostly participate in lignin, phytohormone and scent metabolism. Such class I CCoAOMT genes are, therefore, thought to have evolved under the pressure of terrestrial colonization [16]. The enzymes of the class II caffeic acid OMT (COMT) type group range in size between 37 and 43 KDa molecular weight and display cation-independent activity; they are mainly involved in phenylpropanoid and alkaloid biosyntheses [15]. For this reason, class II COMT genes are thought to have evolved later and in response to more diverse selective pressures [16]. In hybrid Populus (P. deltoides × P. nigra), phylogenetic analysis combined with comparative genomics of OMTs indicated that OMTs have evolved by gene duplication; COMT genes, COMT-like genes and several FOMTs are thought to have evolved different functions or tissue specificity via tandem gene duplication followed by subsequent diversification [17]. In Brachypodium distachyon, three duplicated genes encoding COMT are proposed as species-specific functional genes, since these COMTs do not locate in collinear genomic regions of rice and sorghum. Functional characterization verifies that they have lost the original catalytic activity toward caffeic acid [18]. In Vanilla planifolia, two COMT-like genes that evolved from a COMT gene have a novel function in catalyzing methyl gallate and flavone luteolin production [19]. Although, in most cases, FOMTs are cation-independent COMT-type enzymes, functional characterization of an anthocyanin OMT and an (iso) flavonol-6-OMT in grape, ice plant, sweet basil and soybean [20][21][22][23] revealed that CCoAOMT-like genes have diversified from CCoAOMT genes since they display high similarity in their catalytic mechanisms, as well as their activities all being cation dependent.
With the exception of a minority of OMTs, which exhibit strict enzymatic specificity for a single substrate, plant OMTs generally accept a broad range of substrates across flavonoids, lignin precursors and alkaloids, albeit with different substrate preferences [14,24]. Such functional diversification of FOMTs provides both the challenge and the curiosity of characterizing their multiple in planta physiological functions. For example, recent studies have provided evidence that a variation in the modification enzymes flavonoid phenylacyltransferase and glycosyltransferase affect UV tolerance in Arabidopsis and rice, respectively [25,26], providing strong evidence that downstream modification of flavonoids affects environmental adaption. Since many FOMTs have been reported to be stress-induced and/or tissue-specific enzymes [23,27,28], elucidation of the roles and application values for the stress defense of methoxylated flavonoids is worthwhile. In this review, we summarize the present works regarding chemodiversity, the role of physiology, characterized function and structure basis of FOMTs, providing aspects for characterization of OMT enzymatic genes. We additionally discuss the current difficulties in studying the function of the individual metabolites of these classes.

Chemical Diversity of Methoxylated Flavonoids in Land Plants
The structural diversity of known methoxylated flavonoids is determined by the combination of the position and number of methyl groups, which are largely limited by the distribution of hydroxyl groups. In principle, O-methylation can occur at any position; however, 7-and 4 -methoxylation are the most highly represented since their corresponding hydroxyl moieties are attached during the biosynthesis of the flavonoid aglycone ( Figure 1). By contrast, according to the slight differences among flavonoid skeletons, some distinct methoxylated forms occur specifically within the aglycones. For example, before the step of cyclization by chalcone isomerase (CHS), naringenin chalcone has a specific hydroxyl moiety at the 2 position differing from the other flavonoids; thus, it has a unique methoxylated form, namely, 4,4'-dihydroxy-2',6'-dimethoxychalcone [29]. The backbone of flavonol has a hydroxyl moiety at the 3 position of the C ring; therefore, these compounds also have a distinct 3-methoxylated product compared to flavanone, flavone and isoflavone. Except for the common examples, 6-/8-/5-/2 -methoxylated flavonoids are relatively rarely discovered. Berim and Gang [30] and Koirala et al. [13] both compiled summaries of methoxylated flavonoids, including the commonly occurring methoxylated forms of flavonols, such as quercetin, kaempferol and luteolin, alongside their distribution. Here, we summarize the structural information to provide an overview of the distribution of methoxylated flavonols, flavones, isoflavones and anthocyanidins based on database data mining. This review also focuses on the unique methoxylated forms and plant species containing highly methoxylated flavonoids, which are potential substrates for novel functional FOMTs.

Chemical Diversity of Methoxylated Flavonoids in Land Plants
The structural diversity of known methoxylated flavonoids is determined by the combination of the position and number of methyl groups, which are largely limited by the distribution of hydroxyl groups. In principle, O-methylation can occur at any position; however, 7-and 4′-methoxylation are the most highly represented since their corresponding hydroxyl moieties are attached during the biosynthesis of the flavonoid aglycone (Figure 1). By contrast, according to the slight differences among flavonoid skeletons, some distinct methoxylated forms occur specifically within the aglycones. For example, before the step of cyclization by chalcone isomerase (CHS), naringenin chalcone has a specific hydroxyl moiety at the 2′ position differing from the other flavonoids; thus, it has a unique methoxylated form, namely, 4,4'-dihydroxy-2',6'-dimethoxychalcone [29]. The backbone of flavonol has a hydroxyl moiety at the 3 position of the C ring; therefore, these compounds also have a distinct 3-methoxylated product compared to flavanone, flavone and isoflavone. Except for the common examples, 6-/8-/5-/2′-methoxylated flavonoids are relatively rarely discovered. Berim and Gang [30] and Koirala et al. [13] both compiled summaries of methoxylated flavonoids, including the commonly occurring methoxylated forms of flavonols, such as quercetin, kaempferol and luteolin, alongside their distribution. Here, we summarize the structural information to provide an overview of the distribution of methoxylated flavonols, flavones, isoflavones and anthocyanidins based on database data mining. This review also focuses on the unique methoxylated forms and plant species containing highly methoxylated flavonoids, which are potential substrates for novel functional FOMTs.  In our survey and data mining, Fabaceae, Asteraceae, Lamiaceae and Rutaceae were the top four families in terms of the diversity of methoxylated flavonoids. As for the distribution of each group of methoxylated flavonoids, Asteraceae (25%), Fabaceae (15%), Lamiaceae (4%), Rutaceae (3%), Brassicaceae (2%) and Ericaceae (2%) occupied almost 50% of the variety of methoxylated flavonols ( Figure 3). In the same manner, Asteraceae (29%), Lamiaceae (14%), Fabaceae (9%), Rutaceae (5%) and Plantaginaceae (2%) held over 58% of In our survey and data mining, Fabaceae, Asteraceae, Lamiaceae and Rutaceae were the top four families in terms of the diversity of methoxylated flavonoids. As for the distribution of each group of methoxylated flavonoids, Asteraceae (25%), Fabaceae (15%), Lamiaceae (4%), Rutaceae (3%), Brassicaceae (2%) and Ericaceae (2%) occupied almost 50% of the variety of methoxylated flavonols ( Figure 3). In the same manner, Asteraceae (29%), Lamiaceae (14%), Fabaceae (9%), Rutaceae (5%) and Plantaginaceae (2%) held over 58% of the variety of methoxylated flavones. For methoxylated anthocyanidin and isoflavones, the Fabaceae family possessed the most diverse compounds ( Figure 3). Notably, the Fabaceae family was found to be rich in methoxylated isoflavones, with 78% among investigated families.
The degree of isoflavone methylation and its distribution were evaluated for each plant species. A total of 272 out of the 347 Fabaceae species investigated in KNApSAcK were found in the NCBI taxonomy database. The ratio of mono-, di-and tri-, as well as polymethoxylated isoflavones (more than four methoxyl groups), in 272 Fabaceae species were found in KNApSAcK. By combining the taxonomy relationship and metabolite in- the major isoflavones in legume plants, being highly accumulated in soybean mature seeds, the M. sativa plant and Pueraria thunbergiana. Although polymethoxylated flavonoids have been discovered, their biosynthesis mechanism, especially for those with more than three methoxylated sites, still remain largely unknown. The functional diversification of FOMTs in the sequential reaction and bi-function is a key point for the elucidation of the biosynthesis of polymethoxylated flavonoids. By searching the keywords "quercetin", "kaempferol", "isorhamnetin", "myricetin", "flavonol", "luteolin", "tricetin", "apigenin", "flavone", "daidzein", "glycitein", "genistein", "isoflavone", "petunidin", "malvidin", "peonidin" and "anthocyanidin" in the KNApSAcK database (http://www.knapsackfamily.com/knap-sack_core/top.php, accessed date: 14 May 2021), 322 out of 1046 flavonols, 788 out of 1298 flavones, 349 out of 573 isoflavones and 120 out of 506 anthocyanidins were found to be methoxylated flavonoids.
The degree of isoflavone methylation and its distribution were evaluated for each plant species. A total of 272 out of the 347 Fabaceae species investigated in KNApSAcK were found in the NCBI taxonomy database. The ratio of mono-, di-and tri-, as well as polymethoxylated isoflavones (more than four methoxyl groups), in 272 Fabaceae species were found in KNApSAcK. By combining the taxonomy relationship and metabolite information, the isoflavones with a mono-methoxyl group were found to be the most widely distributed, especially in some common studied species, such as Medicago, Trifolium, the Cicer genus and Glycine max ( Figure 4). The 6-methoxylated isoflavone glycitein is one of the major isoflavones in legume plants, being highly accumulated in soybean mature seeds, the M. sativa plant and Pueraria thunbergiana. Although polymethoxylated flavonoids have been discovered, their biosynthesis mechanism, especially for those with more than three methoxylated sites, still remain largely unknown. The functional diversification of FOMTs in the sequential reaction and bi-function is a key point for the elucidation of the biosynthesis of polymethoxylated flavonoids.

Physiological Roles of Methoxylated Flavonoids in Planta
The flavonoid pathway is thought to be ancient and to have evolved during adaption from an aquatic to terrestrial habitat, thus being supposed to be involved in various physiology processes and complex stress defenses ranging from coloration to anti-pathogen activity and symbiosis [32]. Flower and fruit color has been focused on and investigated regarding pollination, UV defense and the ornamental industry. Variation in the pigment anthocyanin, which contributes greatly to coloration, can alter pollinator preference and provide ornamental values. The O-methylation of anthocyanin increases its water solubility, strengthens its color properties and shifts the color to being more red based on the methylation level [3]. Moreover, O-methylation produces the major pigment subgroups peonidin, petunidin and malvidin, which are responsible for a purple appearance. Malvidin-and petunidin-type anthocyanins have been found to accumulate in colored grape berries, cyclamen flowers, petunia flowers, purple tomato seedlings, Nemophila menziesii blue flowers and torenia blue petals [33][34][35][36][37][38], whilst peonidin-type anthocyanins have been found in peach flowers and peony flowers [39,40]. Given this, the corresponding anthocyanin 3 /3 5 OMT genes are useful tools for artificial germplasm innovation, such as that being utilized in transgenic purple rose creation [41].
Based on the biological functions of flavonoids, the effect of O-methylation on the enhancement of antimicrobial activity has consistently been reported [12]. To promote physical defense against pathogens, the cell walls of grasses additionally contain tricin (3',5'-dimethoxytricetin); this is incorporated into lignin polymers, which were discovered following the characterization of the bifunctional rice enzyme OsAldOMT1 [42]. The methylation of flavonols is also considered to be essential for phytoalexin biosynthesis. In legume plants, methoxylated isoflavonols formononetin (4 -methoxydaidzein) and biochain A (4 -methoxygenistein) are the key precursors of phytoalexins, such as vestitol, medicarpin, pisatin, maackiain from Lotus corniculatus, Medicago sativa, Pisum sativum and Trifolium pratense [43][44][45][46]. The accumulation of formononetin and medicarpin by the elicitation of an MsIOMT overexpression line increased the resistance of alfalfa to the leaf pathogen Phoma medicaginis [47]. Glycitin (6-methoxydaidzin) and its derivatives are similarly greatly induced by the pathogens Aspergillus oryzae and Rhizopus oligosporus in soybean seedlings as opposed to daidzin, which accumulated in the negative control [23]. Furthermore, sakuranetin, a flavonoid phytoalexin, produced by the 7-O-methxylation of naringenin, rapidly responded both to UV irradiation and phytopathogen infection [28,48].
Methoxylated flavonoids have additionally been proposed to affect the interaction with symbiotic bacteria and plants. From the summary of 'infection flavonoids', which induce the Nod factor in the nodulation process in legume plants, 7-or 4 -methoxygenistins, glycitin, apigenin in soybean root and methoxychalcone in M. sativa and Vicia sativa are indicated as members of potential inducer factors [49]. Further evidence provided by single-cell sequencing in M. truncatula supports the important role of the nodule infection zone expression gene chalcone 2 -OMT and its corresponding product 4,4' -dihydroxy-2'methoxychalcone in the symbiosis process [50]. Moreover, plant endogenous flavonoids are reported to participate in modulating phytohormone oxidation and transportation during nodulation [51,52]. According to their chemical structure, flavonoids can have a completely different regulatory effect. For example, the 7,4 -dihydroxyflavone (DHF) induced by rhizobia inhibits IAA breakdown resulting in IAA accumulation for 14-48 h post-inoculation, whilst formononetin (4 -methoxydaidzein) accelerates IAA breakdown by stimulating or relieving inhibition of IAA oxidase activity [53]. Current studies focusing on plant-plant interactions suggest that methoxylated flavonoids possess allelopathic effects. In the case of aggressive ruderal plants, root secretions from Dittrichia viscosa contain apigenin, 6methoxykaempferol, rhamnetin (7-methoxyquercetin), isorhamnetin (3 -methoxyquercetin) and dihydroxyquercetin; 7-methoxykaempferol and 6-methoxykaempferol reduce the root length and root biomass of lettuce seedlings, respectively [54]. The stability of these root-secreted methoxylated flavonoids means that they remain in the soil for a considerable period of time and can thus inhibit the growth of other species [55]. These reports suggest that methoxylated flavonoids often accumulate in secretion organs in order to subsequently fulfill their anti-pathogen and interaction signal functions. Although the function of flavonoids has been the subject of continuous attention, the impact of flavonoid O-methylation on symbiosis and pathogen interaction essentially comes from indirect evidence and needs to be subjected to systematic research. In addition, the effects of environmental conditions, including temperature, nutrition and water sufficiency, on flavonoid O-methylation remains to be explored.

Flavonoid 3-O-Methyltransferase and Flavonoid 5-O-Methyltransferase
Given the requirement for 3-hydroxyl group existence, substrates of F3OMT are limited to flavonol aglycones. The StF3OMT isolated from Serratula tinctorial displayed a preference for quercetin, followed by kaempferol and myricetin, with no requirement for Mg 2+ [61]. In wild and cultivated tomato glandular trichomes, SlMOMT3 and ShMOMT3 were characterized to catalyze the methylation of the aglycone, as well as the 7/3 /4 methoxylated form of quercetin, kaempferol and myricetin, which participate in a series of methylation reactions leading to the highly methoxylated myricetin present in trichomes [4]. However, the CrOMT1 isolated from Catharanthus roseus was found to display a border preference for phenylpropanoids, such as 5-hydroxyferulate, in addition to the 3-O-position methylation of flavonols [62]. With regard to F5OMT, few cases have been reported, including one genistein 5-OMT isolated from Lupinus luteus roots [7] and one multiple functional enzyme CdFOMT5, which catalyzes 3,3 ,5,7 methylation of flavones in citrus fruit peels [63].

Flavonoid 6-O-Methyltransferase and Flavonoid 8-O-Methyltransferase
F6OMT and F8OMT are rarely isolated because of the rare existence of 6/8 methoxylated flavonoids requiring additional hydroxylation on these positions. The ice plant PFOMT was the first isolated F6OMT, which could methylate diverse flavonols and caffeoyl-CoA derivatives but only those with a vicinal dihydroxyl moiety. Phylogenetic analysis of protein sequences suggested that the PFOMT gene diverged from CCoAOMT genes. The subunit molecular weight, which ranges from 26 to 30 KDa (estimated as 26.6 KDa), and its Mg 2+ -dependent reaction indicated PFOMT to be a class I OMT, providing the clue that F6OMT might differ from flavonol/flavone OMTs decorating other positions, which usually belong to class II OMTs [21]. Several subsequent studies provided additional evidence to support this hypothesis. In Plagiochasma appendiculatum and Glycine max, the PFOMT-like proteins PaF6OMT and GmIOMT1 were identified as F6OMT catalyzing the production of scutellarein, baicalein and 6-OH daidzein, respectively. The characteristics of the small subunit size being estimated as 27.4 KDa and 26.75 KDa and their activity being cation dependent are consistent with belonging to PFOMT [23,64]. According to these known examples, F6OMTs are considered to phylogenetically cluster as a separate branch from CCoAOMTs, reacting with a vicinal hydroxyl group and belonging to class I OMTs, whose activity is cation dependent. However, in O. basilicum, two ObF6OMTs actually shared high similarity with F4 OMTs; their protein mass and non-cation reaction indicated them to be class II OMTs [22]. As for F8OMT, one flavone, namely, MpOMT2, was isolated to catalyze the 8-hydroxy-7-methoxyflavone in M. × piperita [58]. Subsequent research on O. basilicum also isolated the homologous gene of MpOMT2 designated as ObF8OMT-1, both of which were cation-independent OMTs. Interestingly, the other class I OMT gene was designated as ObPFOMT, which displayed cation-dependent activity and required a vicinal hydroxyl group. It was additionally demonstrated to display an 8 methylation activity to flavone in O. basilicum [65].
In the case of anthocyanin, F3 OMTs and F3 ,5 OMTs have been much characterized in studies of flower color engineering. By contrast to the other FOMTs, which prefer aglycone substrates, anthocyanidin OMTs prefer glycosylated substrates. In grape, F3 ,5 OMT VvAOMT could only catalyze glycosylated anthocyanins and flavonols and not aglycones. The types of glycosides attached to the substrate also affected the relative specific activity of VvAOMT [20]. In peony, the F3 OMT PsAOMT mediated color spot formation by methylating cyanidin-3-O-glucoside to the darker peonidin-3-O-glucoside [40]. In purple tomato tissues, SlAnOMT could produce petunidin glucoside utilizing delphinidin 3-O-glucoside. Silencing SlAnOMT in fruits and hypocotyls resulted in a reduction in the content of petunidin and malvidin [36]. In research focused on soybean seed coat pigmentation, GmOMT5 was characterized as a pigment isogene methylating cyanidin glucoside to peonidin glucoside in black seed coated compared to brown seed coated soybean seeds [85]. Besides the 3 catalytic activity, some AOMTs, such as NmAMT3, NmAMT6 and MT2 lotus from Nemophila menziesii and Petunia x hybrida, also display slight 3 ,5 catalytic activity as well as 3 -OMT function [37].

Chalcone 2 -O-Methyltransferase
Given that chalcones are missing a C ring in their chemical structure, chalcone OMTs are a unique class methylated at the 2 -O-position. The isoliquiritigenin 2 -OMT from alfalfa has been postulated to play a role in the nodulation process due to it producing the Nod gene inducer 4,4'-dihydroxy-2'-methoxychalcone in the rhizobia infection area of root hairs [51,86]. Furthermore, in Humulus lupulus, which is used as an ingredient for beer brewing, a chalcone 6 -OMT (equal to 2 position) OMT1 was characterized to produce the flavor compound xanthohumol from desmethylxanthohumol in trichomes [5].
A set of FOMTs with different regioselectivity and substrate preferences have been characterized in same species, such as isoflavone OMTs MtIOMT1-7 from M. truncatula; flavonol OMTs ShMOMT1-4 from S. habrochaites; flavone OMTs MpOMT1A, 1B to MpOMT4 from M. × piperita; and flavone OMTs ObFOMT1-6, ObF8OMT-1 from O. basilicum [4,22,58,59,79], providing proof that FOMTs show different functions with shifted substrate selectivity in spite of sharing high sequence similarity. By phylogenetic analysis of characterized FOMTs, the regioselectivity of candidate FOMTs can be predicted, but only for genes classified in the clade of functionally characterized OMTs. In addition, FOMTs may have different substrate selectivity in vivo compared to in vitro because of the complex internal environment, such as substrate spatiotemporal existence or the rapid glycosylation of compounds. Given this, the determination of enzyme activity by in vitro experiments is required. In M. truncatula elicited leaves, over-expressing MsI7OMT did not produce isoformononetin (7-methoxydaidzein) but rather accumulated formononetin (4 -methoxydaidzein) [47]. In camptotheca, an alkaloid biosynthetic enzyme, 10-hydroxycamptothecin O-methyltransferase, could also methylate the 7-O-position of kaempferol and quercetin aglycone in vitro; however, no 7-methoxylated flavonoids were detected in vivo, indicating that the only in vivo substrate of this enzyme was 10-hydroxycamptothecin [24].

The Structural Basis of Flavonoid O-Methyltransferase Function
The FOMTs belonging to class I and class II OMTs harbor conserved catalytic sites and substrate binding regions but are separately discussed according to their differences in length, conserved amino acid sites and cation interaction ( Figure 5). In 1998, Joshi and Chiang [91] conducted a study to improve the mismatch of the conserved region in silico by utilizing more than 10 subgroups of OMT sequences from 56  date, there is a lack of experimental support for these findings. In a later study focusing on the class I caffeoyl-CoA OMT of alfalfa, alignment using available crystal structures showed a SAM-binding site, a caffeoyl-CoA-binding site and a dimerization region [92]. Further research on anthocyanin OMTs compared the sequence to caffeoyl-CoA OMTs, reaching the conclusion that the substrate binding sites 21Lys ( [35]. Additionally, a catalytic triad, namely, 166Lys(K)-190Asn(N)-238Asp(D), was newly characterized, and it was verified by mutagenesis experiments to be essential for efficient catalytic capacity of class I OMTs [93] (Figure 6A), as well as being conserved in isoflavone 6-OMT GmI6OMT from soybean [23].
Of the class II OMTs, the chalcone OMT and isoflavone OMT from alfalfa were the first two reported crystal structures of plant OMTs, revealing their substrate specificity ( Figure 6). The structure basis and sequence alignment of known class II OMTs indicate 194Asp(D)-196Gly(G)-219Asp/Glu(D/E)-220Arg/Leu/Gln(R/L/Q)-239Asp(D)-240Met(M)-253Lys(K)-259Trp(W) residues to be SAM binding sites, and 257His(H)-288Asp/Glu(D/E)-318Glu/Val(E/V) residues to be involved in catalysis. Further mutation of 257His causes a failure to generate a corresponding product. Unlike the relatively conserved catalytic sites, substrate binding residues at 117, 307, 310 and 314 positions present diversity in accordance with substrate discrimination, except 168Met(M) and 311Met(M), which are thought to help constrain the A ring [94]. Based on the findings of Zubieta, studies on sweet basil and wheat support the conservation of catalytic residues in ObaCVOMT1, ObaEOMT1 and TaCOMT-3D [95,96]. not conserved [35]. Additionally, a catalytic triad, namely, 166Lys(K)-190Asn(N)-238Asp(D), was newly characterized, and it was verified by mutagenesis experiments to be essential for efficient catalytic capacity of class I OMTs [93] ( Figure 6A), as well as being conserved in isoflavone 6-OMT GmI6OMT from soybean [23].
Of the class II OMTs, the chalcone OMT and isoflavone OMT from alfalfa were the first two reported crystal structures of plant OMTs, revealing their substrate specificity ( Figure 6). The structure basis and sequence alignment of known class II OMTs indicate 194Asp(D)-196Gly(G)-219Asp/Glu(D/E)-220Arg/Leu/Gln(R/L/Q)-239Asp(D)-240Met(M)-253Lys(K)-259Trp(W) residues to be SAM binding sites, and 257His(H)-288Asp/Glu(D/E)-318Glu/Val(E/V) residues to be involved in catalysis. Further mutation of 257His causes a failure to generate a corresponding product. Unlike the relatively conserved catalytic sites, substrate binding residues at 117, 307, 310 and 314 positions present diversity in accordance with substrate discrimination, except 168Met(M) and 311Met(M), which are thought to help constrain the A ring [94]. Based on the findings of Zubieta, studies on sweet basil and wheat support the conservation of catalytic residues in ObaCVOMT1, ObaEOMT1 and TaCOMT-3D [95,96].

MtIOMT1
MsD7OMT  Figure 5. Phylogenetic tree of representative OMT genes. Class I OMT genes, including caffeoyl CoA OMTs and anthocyanin OMTs; class II OMTs, including caffeic acid OMTs, flavonoid OMTs and isoflavone OMTs. The tree was built by MEGAX [97]. The neighbor-joining method was used for clustering. The percentages of replicate trees in the bootstrap test (1000 replicates) are shown next to the branches. The evolutionary distances were computed using the p-distance method and are displayed in the units of the number of amino acid differences per site. Accession numbers: CCoAOMT1, At4g34050; CCoAOMT7, At4g26220; GmSOMT9, Glyma.17g171100; GmSOMT10, Glyma.07G214700; GmIOMT1, Glyma.05g14700; GmOMT5, Glyma.05G223400; VvAOMT a part of the substrate binding pocket close to the catalytic site. The mutation of Gly46 to Tyr led to a reverse in the para-to meta-O-methylation of flavanones and dihydroflavonols in vitro [72] ( Figure 6A). The substitution of 328-His with Arg in the ROMT9 gene product changed the hydrophobic pocket, resulting in a regioselectivity shift from 3′,5′ to 3′ hydroxyl groups [98]. Moreover, a single amino acid mutation, Asp257Gly, in the flavonol 7-O-methyltransferase POMT7 protein allowed the methylation of both 3, 7-hydroxyl groups of quercetin instead of the 3-hydroxyl group alone [99]. The Val309 of TaOMT2 in wheat, which is next to the catalytic site His262 in homology modeling, decides the substrate preference for tricetin ( Figure 6B). The Val309Ile TaOMT2 mutant in wheat and Ile316Val MtCOMT mutant in M. trunculata alter the substrate preference from a higher tricetin and 5HFA (5-hydroxyferulic acid) affinity to a higher 5HFA and tricetin affinity, respectively [100]. Moreover, studies on sweet basil and M. truncatula characterized a set of FOMTs sharing high similarity; however, having a different substrate preference and regioselectivity can also provide clues for essential residue identification [22,59]. Besides the variance in substrate binding sites, the cations have also been proved to dramatically modulate the substrate specificity of class I OMTs [101]. Along with the increasing number of elucidated crystal structures of FOMTs, homology modeling optimized by known FOMTs with docked substrates provided a prediction of the possible residues that affect activity and selectivity. The Gly46 in AtCCoAOMT7, which is equivalent to Tyr51 in known PFOMTs, was indicated via docking studies to be a part of the substrate binding pocket close to the catalytic site. The mutation of Gly46 to Tyr led to a reverse in the parato meta-O-methylation of flavanones and dihydroflavonols in vitro [72] ( Figure 6A). The substitution of 328-His with Arg in the ROMT9 gene product changed the hydrophobic pocket, resulting in a regioselectivity shift from 3 ,5 to 3 hydroxyl groups [98]. Moreover, a single amino acid mutation, Asp257Gly, in the flavonol 7-Omethyltransferase POMT7 protein allowed the methylation of both 3, 7-hydroxyl groups of quercetin instead of the 3-hydroxyl group alone [99]. The Val309 of TaOMT2 in wheat, which is next to the catalytic site His262 in homology modeling, decides the substrate preference for tricetin ( Figure 6B). The Val309Ile TaOMT2 mutant in wheat and Ile316Val MtCOMT mutant in M. trunculata alter the substrate preference from a higher tricetin and 5HFA (5-hydroxyferulic acid) affinity to a higher 5HFA and tricetin affinity, respectively [100]. Moreover, studies on sweet basil and M. truncatula characterized a set of FOMTs sharing high similarity; however, having a different substrate preference and regioselectivity can also provide clues for essential residue identification [22,59]. Besides the variance in substrate binding sites, the cations have also been proved to dramatically modulate the substrate specificity of class I OMTs [101].
The protein crystal structures of PaMTH1, MsChOMT, MsIOMT and HI4 OMT suggest that the homodimer formation generated by the N-terminal swapping of FOMT protein forms the functional homodimers in solution or the cell [94,102,103]. A report on flavone O-methyltransferase in wheat compared the homodimer and dissociated monomers with the dissociated monomer and concluded that the monomers retain their catalytic capacity [104]. Nevertheless, several studies referring to alkaloid biosynthesis suggested that heterodimers may contribute to the catalysis of new substrate. In keeping with this, complex heterodimers of four OMT proteins from Thalictrum tuberosum showed selectivity to a variety of new substrates from catechols to hydroxycinnamates and alkaloids compared to its corresponding homodimers [105]. A recently discovered heterodimer consisting of PsSOMT2 and PsSOMT3 or Ps6OMT filled the missing step for noscapine biosynthesis in Papaver somniferum [106]. According to such evidence, elucidating the function of the heterodimers of FOMT in planta may lead to the elucidation of unknown biochemical processes.

Concluding Remarks and Future Prospects
Flavonoids are phytochemicals involved in pathogen defense and UV light protection, and they enable plants to interact with their environment. Evolutionary analysis of flavonoid biosynthesis suggests that this pathway originated very early in plant colonization of land. However, the lignin biosynthesis pathway, which is also derived from phenylpropanoid metabolism, as well as flavonoid biosynthesis, is considered to be an essential factor of land colonization of plants. Given that many FOMTs have multiple preferences toward both flavonoid and lignin biosyntheses metabolites, the diversification and convergence of substrate selectivity and physiological functions involving multiple pathways are a topic for future research. Due to the multiple substrate selectivities caused by protein dimerization and the difference in activity performance in planta and in vitro, a comprehensive elucidation covering all substrate specificities and their interaction is very complex. Therefore, a comprehensive approach combining profiling data of endogenous methoxylated/non-methoxylated compounds and a gene expression analysis considering their tissue specificity and stress deducibility of FOMTs is required in future. In this review, we summarize our current knowledge of the chemical diversity and physiological roles of methoxylated flavonoids. Additionally, we provide a cross-species comparison of methoxylated flavonoids and FOMT genes. Such plant-species-wide approaches will give an overview of the diversification of the O-methylation of specialized metabolism in the plant kingdom.