Isolation and Expression of Glucosinolate Synthesis Genes CYP83A1 and CYP83B1 in Pak Choi (Brassica rapa L. ssp. chinensis var. communis (N. Tsen & S.H. Lee) Hanelt)

CYP83A1 and CYP83B1 are two key synthesis genes in the glucosinolate biosynthesis pathway. CYP83A1 mainly metabolizes the aliphatic oximes to form aliphatic glucosinolate and CYP83B1 mostly catalyzes aromatic oximes to synthesis corresponding substrates for aromatic and indolic glucosinolates. In this study, two CYP83A1 genes named BcCYP83A1-1 (JQ289997), BcCYP83A1-2 (JQ289996) respectively and one CYP83B1 (BcCYP83B1, HM347235) gene were cloned from the leaves of pak choi (Brassica rapa L. ssp. chinensis var. communis (N. Tsen & S.H. Lee) Hanelt) “Hangzhou You Dong Er” cultivar. Their ORFs were 1506, 1509 and 1500 bp in length, encoding 501, 502 and 499 amino acids, respectively. The predicted amino acid sequences of CYP83A1-1, CYP83A1-2 and CYP83B1 shared high sequence identity of 87.65, 86.48 and 95.59% to the corresponding ones in Arabidopsis, and 98.80, 98.61 and 98.80% to the corresponding ones in Brassica pekinensis (Chinese cabbage), respectively. Quantitative real-time PCR analysis indicated that both CYP83A1 and CYP83B1 expressed in roots, leaves and petioles of pak choi, while the transcript abundances of CYP83A1 were higher in leaves than in petioles and roots, whereas CYP83B1 showed higher abundances in roots. The expression levels of glucosinolate biosynthetic genes were consistent with the glucosinolate profile accumulation in shoots of seven cultivars and three organs. The isolation and characterization of the glucosinolate synthesis genes in pak choi would promote the way for further development of agronomic traits via genetic engineering.


Introduction
Pak choi (Brassica rapa L. ssp. chinensis var. communis (N. Tsen & S.H. Lee) Hanelt) is a very important vegetable crop in China and Northeast Asia because of its high yield and nutritional value [1]. It contains a number of bioactive components such as folates, vitamin C, carotenoids, polyphenols and glucosinolates which have anti-carcinogenic effects [2][3][4]. Glucosinolates are a group of plant secondary metabolites derived from different kinds of amino acids and can be hydrolyzed by myrosinase to produce a variety of compounds which can have beneficial effects on human health including anti-carcinogenic, cholesterol-reducing and other pharmacological effects [5]. The glucosinolates-myrosinase-system is also considered to be an efficient defense system for plant against herbivorous generalists and microorganisms [6][7][8].
To date, more than 132 individual glucosinolates have been detected, and they are grouped into aliphatic, aromatic and indole glucosinolates depending on the structure of their side-chain [9]. In recent years, significant advances have been made in our understanding of the biosynthetic pathway of glucosinolates. The formation of glucosinolates can be divided into three separate phases including the chain elongation, formation of the core structure and secondary modifications of side chain and glucose moiety of the parent glucosinolates [10]. Since the completion of the Arabidopsis genome sequencing, a significant progress on glucosinolate research has been made especially in the identification of the genes and regulators for glucosinolate biosynthesis. MAM (Methylthioalkylmalate synthase) genes are responsible for the chain-elongation. CYP79 enzymes catalyze the first committed step of glucosinolates core structure formation as the conversion of amino acids to aldoximes. CYP83 enzymes catalyze the key step for the core structure formation as the conversion of aromatic oximes, and the conversion of thiohydroximic acids to glucosinolates [11][12][13][14][15][16][17][18].
Of the two oxime-metabolizing enzymes CYP83A1 and CYP83B1, major progress has been made by using Arabidopsis as model plant. Aliphatic oximes derived from chain-elongated homologs of methionine are efficiently metabolized by CYP83A1 enzyme, whereas CYP83B1 enzyme metabolizes these substrates with very low efficiency. Aromatic oximes derived from phenylalanine, tryptophan, and tyrosine are metabolized by both enzymes, although CYP83B1 has higher affinity for these substrates than CYP83A1 [19,20]. However, most of these researches are carried out in the model plant Arabidopsis. As glucosinolates have been proved to have many beneficial effects on human health and Arabidopsis is not edible, it is necessary to focus on glucosinolates-containing crops which are important for human consumption. In the present study, two glucosinolates biosynthesis genes BcCYP83A1 and BcCYP83B1 were isolated from pak choi and characterized and their expression pattern in different organs and cultivars were also characterized.

Phylogenetic Analysis of CYP83A1 and CYP83B1
We compared the isolated sequences with the corresponding ones in Arabidopsis thaliana (NM117451, NM119299) and Brassica pekinensis (FJ376049, FJ376050, FJ376051) by multiple alignments, and reconstructed the phylogenetic tree by ClustalW analysis program. The results showed that the homologs of CYP83A1 and CYP83B1 from the Brassicaceae family are highly conserved, with low variation in DNA sequence. As shown in the phylogenetic tree, the glucosinolate synthesis genes of Brassica chinensis and Brassica pekinensis clustered first with each other and then with the genes of Arabidopsis (Figure 3), which demonstrated the highly congruent relationship among different sources of glucosinolate synthesis genes.

Expression of CYP83A1 and CYP83B1 in Different Cultivars and Various Organs
Quantitative real-time PCR (qRT-PCR) was employed to confirm the expression patterns of the two genes in seven cultivars, as well as in different organs. Actin was used as an internal reference control for total RNA input. In our analysis, the gene-specific primers used for amplification of CYP83A1 recognized and amplified both CYP83A1-1 and CYP83A1-2 genes. Both CYP83A1 and CYP83B1 were detected in the leaves of all the seven cultivars and had the same expression pattern except for cultivar "ZYYDE". The expression patterns of the two genes in leaves of seven cultivars are illustrated in Figure 4. CYP83A1 was presented in higher abundance compared with CYP83B1 in most of the cultivars, whereas the CYP83A1 and CYP83B1 transcripts in "ZYYDE" cultivar were nearly same. Both the aliphatic oxime-metabolizing enzyme and aromatic oxime-metabolizing enzyme were identified in the three organs (roots, leaves and petioles) of pak choi (cv. HZYDE). Though CYP83A1 and CYP83B1 exist in all those organs, they displayed spatial expression patterns ( Figure 5). The expression level of CYP83A1 was higher in leaves than in roots or petioles, whereas the CYP83B1 showed higher mRNA abundance in roots compared with another two.

Glucosinolate Profile Concentrations in Different Cultivars and Various Organs
The concentrations of aliphatic, indole and aromatic glucosinolates in different pak choi cultivars and in different organs of "Hangzhou You Dong Er" cultivar were analyzed. As shown in Table 1, the aliphatic glucosinolate concentrations in leaves of most of the cultivars were higher than indole and aromatic glucosinolates. The glucosinolate profiles in different organs varied differently. As leaf and petiole had particularly higher aliphatic glucosinolates, however, root had much higher indole and aromatic glucosinolates.

Discussion
In this study, we isolated two CYP83A1 genes and one CYP83B1 gene from pak choi "Hangzhou You Dong Er" cultivar. Amino acid sequence analysis and sequence alignment with their homologs from Brassica pekinensis and Arabidopsis showed the two key glucosinolate synthetic genes CYP83A1 and CYP83B1 within Brassicaceae family are highly conserved. As shown in Figures 1 and 2, the genes isolated from pak choi have both high DNA and amino acid sequence conservation with their Brassica pekinensis counterparts. The identity of DNA sequence and deduced amino acid sequence of CYP83B1 between the two species was 99.47% and 98.80%. The two copies of CYP83A1 namely BcCYP83A1-1 and BcCYP83A1-2 from pak choi were also highly similar to the corresponding ones of Brassica pekinensis, whereas there was only one copy of CYP83A1 in Arabidopsis. This is consistent with the previous finding by Zang et al. that identities within Brassica rapa were usually higher than those with Arabidopsis [21]. They isolated and analysed the glucosinolate synthesis genes in Brassica pekinensis by genome-wide identification [21]. The result was also accordant to the notion that Brassica species evolved and were triplicated after divergence from Arabidopsis lineage [22][23][24]. The observed glucosinolate synthesis sequence similarities among the Brassicas (mostly >90%) were higher than those between Brassica and Arabidopsis (mostly 80-90%) in this study also supported the presumptive evolution order.
Consistent with the results of previous studies in Arabidopsis, both the CYP83B1 and CYP83A1 transcripts were expressed in all tested organs (roots, leaves and petioles) of pak choi [25,26]. Mizutani reported that CYP83A1 expression levels in Arabidopsis was highest in leaves and significantly higher in roots and stems than in flowers and siliques, while CYP83B1 was the most highly expressed in roots [25]. In our study, the expression level of CYP83A1 was also highest in leaves, and the CYP83B1 expressed higher in leaves and roots than in petioles. These results were consistent with the previous finding that CYP83A1 expressed highest in the leaves and CYP83B1 were preferentially expressed in the roots [25,26]. The gene expression pattern in the organs was consistent with the glucosinolate profile accumulations as aliphatic glucosinolates were higher in leaves and petioles while indole and aromatic glucosinolates were higher in roots. The results were similar to Brown's finding that aliphatic glucosinolates are typically higher in the leaves compared to the roots while indole glucosinolates are abundant in the roots [27]. When compared within an appointed organ, we found that the expression patterns of the two genes were different. CYP83A1 transcripts were almost two-folds of CYP83B1 transcripts in leaves and petioles, in contrast, the expression levels of the CYP83A1 were relatively lower than CYP83B1 in roots. Our finding was also accordant to the conclusion that the CYP83A1 catalyzed aliphatic glucosinolates biosynthesis and accumulation in leaves and the CYP83B1 catalyzed aromatic and indole glucosinolates mostly found in roots [15].
The expression patterns of the two genes in leaves of seven cultivars in our study were similar except "ZYYDE" cultivar and the gene expression levels were consistent with the glucosinolate profile accumulation. The expression levels of CYP83A1 in leaves of six of seven cultivars were higher than CYP83B1 and the aliphatic glucosinolates of the six CYP83A1 higher expression cultivars were higher than another two glucosinolate profiles, whereas CYP83A1 and CYP83B1 expressed nearly identical in "ZYYDE" cultivar. While focused on the specific gene expression and glucosinolate accumulation in individual cultivar, "SYM" cultivar had high CYP83A1/CYP83B1 transcript level ratio and high aliphatic glucosinolates/indole glucosinolates ratio. And "CGB" cultivar had the higher CYP83B1 transcript level and relative higher aromatic glucosinolates accumulation.

Plant Materials
The seeds of seven pak choi cultivars (Brassica rapa L. ssp. chinensis var. communis (N. Tsen & S.H. Lee) Hanelt) with different characters were sown in plastic pot with vermiculite in a growth chamber with 26/22 °C (12 h light/12 h dark) for 2 weeks. The seven commonly plant and consumed pak choi cultivars in southern China used are "Hangzhou You Dong Er" (HZYDE), "Zhou Ye You Dong Er" (ZYYDE), "Si Yue Man" (SYM), "Shanghai Qing" (SHQ), "Nanjing Zhong Gan Bai" (NJZGB), "Chang Geng Bai" (CGB) and "Ai Jiao Huang" (AJH). The young leaves from two-week old plants of seven pak choi cultivars were collected as experimental materials for quantitative real-time PCR (qRT-PCR) analysis. And three organs including roots, petioles and leaves of "Hangzhou You Dong Er" cultivar were collected for the expression analysis.

Isolation of Total RNA and Synthesis of Full-Length cDNA Sequence
Total RNA was extracted from samples using TRIzol reagent RNAisoTM Plus (Takara, D9108A) following the manufacturer's protocols. Approximately 0.1 g of tissue was extracted in 1 mL of extraction buffer. After melted in RNA-free water, RNA was quantified by UV spectroscopy and its integrity was visually assessed on ethidium bromide stained agarose gels. The cDNA was synthesized from 500 ng of DNA-free RNA with a PrimeScript RT reagent Kit (Takara, DRR037A) following the manufacturer's protocol.
Degenerate primers allowing amplification were designed based on sequences corresponding to highly conserved peptide regions of CYP83A1 and CYP83B1 of Arabidopsis and Brassica pekinensis. The cDNAs were amplified by PCR using the degenerate primers, then cloned into the pMD18-T Easy vector (Takara, Code D101A) and sequenced (Invitrogen). Rapid amplification of cDNA ends (RACE) was performed to obtain the 3′ and 5′ ends of the genes with the SMART RACE cDNA Amplification Kit (Clontech) according to the manufacturer's instructions. Sequence analysis of open reading fragment (ORF) and encoded amino acid sequence of genes were deduced by BioXM 2.6 (Msight, Guangzhou, China, 2007). Physicochemical properties of the deduced protein were predicted by Protparam [28].