Evolutionary Conservation and Expression Patterns of Neutral/Alkaline Invertases in Solanum

The invertase gene family in plants is composed of two subfamilies of enzymes, namely, acid- and neutral/alkaline invertases (cytosolic invertase, CIN). Both can irreversibly cleave sucrose into fructose and glucose, which are thought to play key roles in carbon metabolism and plant growth. CINs are widely found in plants, but little is reported about this family. In this paper, a comparative genomic approach was used to analyze the CIN gene family in Solanum, including Solanum tuberosum, Solanum lycopersicum, Solanum pennellii, Solanum pimpinellifolium, and Solanum melongena. A total of 40 CINs were identified in five Solanum plants, and sequence features, phylogenetic relationships, motif compositions, gene structure, collinear relationship, and expression profile were further analyzed. Sequence analysis revealed a remarkable conservation of CINs in sequence length, gene number, and molecular weight. The previously verified four amino acid residues (D188, E414, Arg430, and Ser547) were also observed in 39 out of 40 CINs in our study, showing to be deeply conserved. The CIN gene family could be distinguished into groups α and β, and α is further subdivided into subgroups α1 and α2 in our phylogenetic tree. More remarkably, each species has an average of four CINs in the α and β groups. Marked interspecies conservation and collinearity of CINs were also further revealed by chromosome mapping. Exon–intron configuration and conserved motifs were consistent in each of these α and β groups on the basis of in silico analysis. Expression analysis indicated that CINs were constitutively expressed and share similar expression profiles in all tested samples from S. tuberosum and S. lycopersicum. In addition, in CIN genes of the tomato and potato in response to abiotic and biotic stresses, phytohormones also performed. Overall, CINs in Solanum were encoded by a small and highly conserved gene family, possibly reflecting structural and functional conservation in Solanum. These results lay the foundation for further expounding the functional characterization of CIN genes and are also significant for understanding the evolutionary profiling of the CIN gene family in Solanum.

In addition, analysis of the crystal structures of CINs (InvA and InvB) in a cyanobacterium revealed overall structural similarity of InvA and InvB invertases [22,23]. They share an (α/α)6-barrel core structure composed of 12 α-helices and an insertion of three helices. Two putative catalytic residues, Asp188 and Glu414 in InvA, and InvA' stringent substrate specificity towards the α1, 2-glycosidic bond of sucrose were further proposed [22]. In InvB, the Arg430 residue possesses different conformation, likely facilitating the deprotonation of catalytic residue Glu415 [23]. CINs are evolutionarily and functionally more stable compared with CWINs and VINs according to the phylogenetic relationship and functional genomic analysis, which may reflect their roles in maintaining cytosolic sugar homeostasis for cellular function [24].
To date, CIN gene families have been investigated in several plant species, such as Oryza sativa, Arabidopsis, Capsicum annuum, Populus trichocarpa, Manihot esculenta, Vitis vinifera, and Lotus japonicus, as well as Hevea brasiliensis [20,[25][26][27][28]. These previous studies have expanded our knowledge on the CIN gene family and presented several common characteristics. For example, the CIN gene family can be distinguished into two distinct groups, namely, the α and β subgroups; CIN genes from the α group differ in gene structure from those of the β group; CIN genes of the β group are localized in the cytosol, and α CINs are localized in plastids or mitochondria.
Here, to investigate the functional and structural characterizations of CINs, systematically comparative genome analysis was performed on the CIN gene family in five Solanum plant species, namely, Solanum tuberosum, Solanum lycopersicum, Solanum pennellii, Solanum pimpinellifolium, and Solanum melongena. On the basis of the bioinformatics method, we characterized sequence features, phylogenetic relationship, chromosomal localization, exon-intron structures, and expression profiles. These analyses provide valuable information for elucidating the evolutionary relationship and function of the CIN gene family in the near future.

Identification of CIN Genes in Solanum
All protein sequences from five solanum species were downloaded from three public databases: S. tuberosum from the Potato Genome Sequencing Consortium (PGSC, http://solanaceae.plantbiology. msu.edu/pgsc_download.shtml); S. pennellii, S. lycopersicum, and S. pimpinellifolium from the Sol Genomics Network (SGN, Release 2.5, https://solgenomics.net/); and S. melongena from the Eggplant Genome DataBase (ftp://ftp.kazusa.or.jp/pub/eggplant/). Subsequently, a local database was constructed by employing Bioedit 7.0 software [29]. The Hidden Markov Model (HMM) profile of the Glyco_hydro_100 (GH100) conserved domain (PF12899) was downloaded from the Pfam protein family database (http://pfam.sanger.ac.uk/). The AT1G35580 gene in Arabidopsis was considered as a neutral/alkaline invertase [21]. We searched the amino acid sequence of the AT1G35580 gene using the NCBI website (https://www.ncbi.nlm.nih.gov/). A BlastP search was carried out to identify CIN genes across five plant species using AT1G35580 as a query, and the E-value was set at e −5 . All predicted CIN genes were initially identified, and these candidates were then sent to the Pfam database to examine the existence of the GH100 domain. The protein sequences with conserved domain were retained, and sequences with a partial domain were manually removed.
The protein length and molecular weight of the CIN gene family were calculated by using the ProtParam server (http://web.expasy.org/protparam/). Chromosome location, genomic sequences, coding sequences (CDS), and intron numbers were retrieved from the SGN online database (http: //solgenomics.net/search/locus). The divergence dates of Solanum plants were also calculated using a website (http://www.timetree.org/).

Multiple Sequence Alignment and Phylogenetic Analysis
Multiple sequence alignment of each amino acid sequence from five Solanum species was conducted using the ClustalX1.83 software with default settings [30]. Then, similar protein sequences were highlighted with different colors using the Bioedit 7.0 software. On the basis of the sequence-alignment results, a maximum-likelihood phylogenetic tree was constructed by using MEGA7.0 tool [31]. The identity level of the CIN gene family was calculated by using the DNAMAN software (Lynnon Biosoft, San Ramon, CA, USA).

Conserved Motifs and Gene Structure of CIN Genes
The MEME server (http://meme-suite.org/tools/meme) and Tbtools were employed to analyze the motif composition of CINs [32]. A total of 20 different motifs, namely, motifs 1 to 20, were acquired. Intron-exon distribution and intron-phase patterns were illustrated by aligning the cDNA sequences and genomic sequences of CIN genes by using the online Gene Structure Display Server 2.0 (http://gsds.cbi.pku.edu.cn/).

Mapping CIN Members on Chromosomes
CIN genes mapped on the chromosomes of three species (S. lycopersicum, S. pennellii, and S. tuberosum) were visualized in the MapDraw 2.1 software [33]. The tandem duplicated gene pairs were determined on the basis of previously reported methods [34,35], and confirmed with the following criteria: (1) an array of 2 or more CIN candidates within a range of 100 kb distance; (2) alignment had a coverage rate of more than 70% of that of the longer gene; and (3) the identity of the aligned region was no less than 70%. Segmental duplication was investigated on the online Plant Genome Duplication Database (PGDD, http://chibba.agtec.uga.edu/duplication/).

Tissue-Specific Expression of CINs in Tomato and Potato
Data concerning CIN gene transcription were extracted from the tomato functional genomics database (TFGD, http://ted.bti.cornell.edu/) and the PGSC (http://solanaceae.plantbiology.msu.edu/), respectively. The Fragments Per Kilobase Million mapped reads (FPKM) values of CIN genes were submitted to the MultiExperiment Viewer 4.9 (MeV4.9) software and then log 2 -transformed [36]. Two heat maps of the CIN gene family in different tissue types in the tomato and potato were obtained.

Expression Profiling of CINs in Response to Different Stresses
In order to investigate the expression profiles of the CIN gene family in the potato plant, all available data of the regulating factor were downloaded from the PGSC (http://solanaceae.plantbiology.msu.edu/). The expression values of CIN genes were processed as log 2 transformation fold change in control vs. treatments. Heat maps were created by using MeV4.9 [34]. The expression value of each CIN gene that showed more than twofold up-or downregulation (p < 0.05) was regarded as differently expressed.

Plant Materials and Stress Treatment
Plants of the tomato cultivar (zhefen702), produced by the Zhejiang Academy of Agricultural Sciences, were used in our experiment. Seeds were placed on water-saturated filter paper to be germinated. These seedlings, with a condition of 25/18 • C 12/12 h day/night cycle, were planted in a growth chamber. At the age of 4-6 of fully expanded young tomato leaves, the tomato plants were treated under chemical stresses with gibberellins (GA 3 ; 100 µM), 6-benzylaminopurine (6-BA) (100 µM), salicylsalicylic acid (SA) (100 µM), abscisic acid (ABA) (100 µM), naphthylacetic acid (NAA; 100 µM), and jasmonic acid (JA; 100 µM). Leaf samples were harvested at 0, 1.5, 3, 6, 12, and 24 h after treatments, respectively. These collected samples were immediately frozen with liquid nitrogen for total RNA extraction. We performed three independently biological replications on each sample type.

Total RNA Extraction and Quantitative Real-Time PCR (qRT-PCR)
Total RNA was isolated using a Total RNA kit (Tiangen Biotech, Beijing, China), and the quality and concentration of RNA were evaluated by ultra-micro-nucleic acid protein tester (NanoDrop One).
Reverse transcription was further conducted using a FastQuant RT Kit (Tiangen Biotech, Beijing, China) on the basis of the manufacturer's procedure. A total of 20 µL volume, containing 6.8 µL of distilled water, 10 µL of 2× SYBR Green PCR MasterMix, 1.2 µL of each primer, and 2 µL of template (10× diluted cDNA from samples), was included for qRT-PCR analysis. QRT-PCR reactions were carried out using an ABI StepOne real-time fluorescence quantitative PCR instrument (Applied Biosystems, Foster City, CA, USA). Melt curves were used to evaluate the quality and specificity of primers. Three technical replicates were conducted for each of the CIN genes. The expression values of CIN genes were calculated by the 2 −∆∆ct method [37].

Conservation in CIN Gene Family: Gene Number, Sequence Length, and Molecular Weight
Neutral/alkaline invertases were identified in Solanum, including S. tuberosum, S. lycopersicum, S. pennellii, S. pimpinellifolium, and S. melongena, which made it possible to perform comparative genomic analysis for the CIN gene family. A total of 58 putative CINs were preliminarily predicted in these five fully sequenced plant genomes (Table 1). Among the 58 sequences, however, each of the six CIN genes from the three species, namely, S. lycopersicum, S. pimpinellifolium, and S. melongena, was manually excluded because of the absence of a significant portion of their coding regions, which probably encode truncated proteins, or could be nonfunctional or pseudogenes. Twelve genes existing as alternative splicing in the potato genome were also eliminated. Thus, 18 redundant CIN members from four different species were filtered out. In the end, a total of 40 CIN sequences from Solanum were selected for further analysis. Interestingly, each of the five Solanum plants possesses eight CIN genes. Previous studies have suggested that seven to eleven CINs were identified through a systematic survey in several plant genomes (such as maize, Arabidopsis, rice, pepper, and cassava) [26,27], whereas these were eight in all the surveyed Solanum species, indicating stability in the number of this small gene family among various plant species.
The CIN gene family in plants can be classified into two major classes, the α and β subfamilies, on the basis of their sequence homology [20,[25][26][27][28]. Combined with currently phylogenetic analysis, we found that the protein length and molecular weight of CIN genes dramatically varied between the α and β groups, but they were relatively stable in the corresponding group ( Figure 1 and Table S1). In the α group, protein length ranged from 547 (SmCIN07) to 678 (StCIN08) amino acids (AAs), and molecular weight was between 69.1 (SmCIN07) and 76.6 (StCIN08) kD, with a mean of 643 AAs and 73kD (Table S1). In the β group, variations of protein length were from 412 (SmCIN04 and SpeCIN08) to 605 (SpiCIN06) AA, and molecular weight from 47.3 to 69.1 kD, with a mean of 543 protein sequences and 62 kD (Table S1). Of these finally selected 40 CIN genes, SmCIN08 belonging to the β group only encoded 249 amino acid residues, which is possibly due to a genomic sequencing error or annotation error (Table S1). Therefore, shorter sequences from eggplant genome were manually eliminated from this statistical analysis. On the basis of length and molecular weight, the CIN gene family was well-conserved in the corresponding group.

Potential Functional Conservation: Four Key Amino Acid Residues
It was recently reported that three amino acid residues (D188, E414, and Arg430) were found to be associated with catalytic activity, established in crystal-structure comparison and enzymatic assays of CINs (InvA and InvB), which is isolated from the cyanobacterium Anabaena [22,23]. In the current paper, these three conserved residues were also observed in all CIN genes within five Solanum plants on the basis of sequence alignment, except that SmCIN08 was missing D188, and SmCIN07 lacked the E414 and Arg430 residues ( Figure 2 and Table S1). Phosphorylation of an amino acid residue (Ser547) in the CIN gene family by calcium-dependent kinases enables binding to 14-3-3 proteins to raise CIN activity [21]. In our study, residue Ser547 was extensively detected at the extreme C-terminus of CIN protein sequence in all of the analyzed CIN genes except for SmCIN07. More interestingly, between these four completely conserved amino acid residues, three of them are localized in the C-terminal portion of CIN genes (Table S1). Thus, we speculated that the C-terminus of the CIN gene family tends to have higher sequence similarity than the N-terminal region, and C-terminal sequences might be involved in the catalytic activity or substrate binding of CINs. It was believed that shared sequence fragments harbor conserved functions and structural attributes that could also reveal previously undiscovered protein families, as well as new members to existing ones [38]. In this study, we further analyzed the sequence traits of CIN genes by multiple sequence alignment. These CIN genes shared a large number of the same fragments or a high degree of sequence homology, especially in the C-terminal region (Table S2). These results revealed the conservation of the structure and function of the CIN gene family. In addition, the similarity level of the CIN genes showed that the lowest and highest identity were 43.1% and 100%, respectively, and average pairwise identity was 70.86% (Table S3). We also found a higher average pairwise identity for CIN members in the β group than that of those in the α group. The former was 90.55%, ranging from 80.5% to 100%, whereas the latter was 72.89%, ranging from 42.6% to 100%. This result indicated that these CINs in the β group possessed higher sequence identity compared with the α group members. In other words, the β group is ascribed to have higher conservation, and it is possible to reflect functional conservation for β CINs in a common ancestor.

Evolutionary Conservation of CIN Genes Based on Phylogenetic Relationship
To clarify the evolutionary relationship of the CIN gene family, Solanum CINs were aligned with members of Arabidopsis CINs, and then maximum-likelihood (ML) and maximum-parsimony (MP) phylogenetic trees were constructed using MEGA7.0 (Figure 1). These two methods essentially gave the same trees except for some minor details ( Figure S1). According to the dendrogram, these genes could be separated into two primary groups, corresponding to the α and β groups, and the α group was further divided into the α1 and α2 subgroups. Interestingly, between the α and β group, each conformably included four CIN members that derived from each out of five species (Figures 1 and 3). These results revealed that the CIN gene family was also extremely conserved in terms of the number of intragroup genes. The phylogenetic separation of thee α and β groups is statistically supported by bootstrap analysis and an exon-intron structure, and classification is in exact accordance with earlier reports on this topic [20,[25][26][27][28]. Hence, our phylogenetic analysis for the CIN gene family has high confidence. Analyses also showed that the CIN gene family from the β group had higher sequence similarity compared with the α group within Solanum from the point of view of the bar scale in the phylogenetic tree ( Figure 2). The β group CINs were identified to be cytosol-targeted, while the latter were inferred to be organelle-targeted (such as plastids or mitochondria) [39]. Our results and previous reports further hint that the CIN genes of cytosol might be more stable than CIN genes in organelles in the evolution of the Solanum plants, possibly suggesting that cytosolic CINs are more conserved than organelle-localized CINs in the α group.
We further studied the evolutionary time between five wholly sequenced plants, and searched the genome size of each species on the basis of a previous report [40]. A species tree and the number of genes were determined between the α and β groups in each plant (Figure 3). The eggplant, with approximately 1.3 Gb genome size, appeared earliest between five species, and divergence date was about 14.5 million years ago (MYA). The potato and three tomato species were separated by 7.7 MYA, and the potato genome was estimated to be around 0.84 Gb. The three tomato species are S. pennellii, which emerged approximately 1.8 MYA with a 1.2 Gb genome; and S. lycopersicum and S. pimpinellifolium, which diverged 0.7 MYA, and have 0.95 and 0.8 Gb genomes, respectively. These results showed a large span of species-differentiation time in these five Solanum genomes, but their genomic size was relatively stable. More interestingly, the same number of CINs was found in each Solanum species, and an equal number of CIN genes were present in each species of the α and β group (Figures 1 and 3), suggesting that the CIN gene family was stable and did not expand during the evolution of Solanum plants. It was also shown that gene number is not directly related to the genome size of the CIN gene family that was conserved throughout Solanum evolution, despite a previous author making the claim that gene number and genome size are actually correlated [41].

Conservation in CIN Motif Compositions and Gene Structures
With the aim of determining motif types and locations, we carried out an investigation of conserved motifs in each of the CIN proteins out using MEME and TBtool search software. A total of 15 distinct motifs were identified in the CIN gene family in Solanum, and the presence of the motifs in representative CINs from different subgroups is provided in Figure 4. Most CIN genes comprise a mass of completely conserved motifs. For example, 32 out of 40 CIN genes (80%) share 12 identical motifs, namely, motifs 8, 6, 10, 7, 3, 9, 1, 5, 12, 2, 11, and 4. That is to say, CIN genes within the α and β groups appeared highly conserved on the whole motif level in Solanum. Seven CIN genes (SmCIN03/04/07/08, SpeCIN08 SlCIN08, SpiCIN08) are missing some conserved motifs in the N-or C-terminus due to containing shorter amino acids. Differently colored boxes represent different types of motifs. Specific motifs in β, α1, and α2 subgroups are labeled with red boxes.
Relatively poor conservation was also observed in two subgroups (α1 and α2) among the CIN genes, showing subgroup-specific intergroup features. Three diverse motifs (motifs 13-15) were unique to each subgroup (Figure 4). Motif 13 is unique to β CIN members in the phylogenetic tree, while motifs 14 and 15 are specific to α1 and α2 CINs, respectively. These motifs were used to distinguish between the β, α1, and α2 subgroups. Overall, our analysis suggested the CIN gene family was highly conserved, and intergroup specificity was also observed during the evolution process of the Solanum species. These CIN proteins contain completely conserved motifs within each of these two groups, which may potentially share similar functional characteristics [42]. On the contrary, functional differentiation also occurred between the α and β groups of the CIN gene family [39]. However, the function of these highly conserved or intergroup-specific motifs still needs further studies in the near future.
We also found that α CINs have longer protein sequences than those of β CINs in the N-terminal region (Figure 4), whereas N-terminal sequences experienced larger sequence divergence with a longer C-terminal than that of the β group (Table S1). This led us to speculate that CINs of the α group may retain more signal peptide sequences in the process of plant evolution, which is probably the reason for the multilocalization of α CINs.
Many genes in the CIN gene family are typically encoded by four or six exons in different classes [25][26][27]. In our paper, two groups of CIN genes in five different plants also differed in exon features ( Figure 5). In line with previous reports, CINs from β group usually possess four exons, and α CIN members are encoded by six exons. Within the β subfamily, 15 CIN genes consist of four exons, while the four remaining members (SpeCIN08, SpiCIN06, SmCIN03, and SmCIN04) encoded irregular exons ( Figure 5). The intron phases of these β CIN members showed a 0-0-1 phase pattern, but four CINs from the potato genome presented a 2-0-0 pattern. Within the α branch, a total of 15 CIN members were predicted to be encoded by six exons (excluding SlCIN07/08 and SmCIN05/06/07), and the intron phase showed 0-1-0-2-0. Interestingly, each of the 14 CIN genes contained one shorter exon, as if it was erroneously predicted by an intron or had originated within other exons. The integrated gene-structure models of CINs in Solanum were very similar to those of CIN members from pepper, grapevine, cassava, and rice, as reported previously [25][26][27]43]. On the basis of the exon-intron structure and intron-phase pattern, the CIN gene family in Solanum was highly conserved in each corresponding group. That led us to suspect that the CIN gene family from the α and β groups had functional differentiation occurring during evolution. Taken together, analysis of the conserved motifs and gene structures again shed light on high conservation among CIN genes in Solanum, which could be used to verify functional conservation. Highly conserved motifs and group-specific gene structures were also in accordance with the results of phylogenetic analysis and classification mentioned above.

Strong Observed Collinearity in CIN Gene Family
After the evaluation of the motifs and gene structure, we continued our analysis with a focus on the chromosomal distribution of CINs in three species, S. lycopersicum, S. pennellii, and S. tuberosum. As shown in Figure 6, a total of 24 CIN members (each species contained eight genes) were randomly distributed on their chromosomes. Chromosomes 01 and 11 from each of three species harbored three CIN genes, while chromosomes 04 and 06 of each species had only one orphan CIN. We also found that most CIN genes are placed on the distal ends of chromosomes. It was previously reported that collinearity, a more specific form of synteny, was originally defined as a set of genes from two species located on the same chromosome that contained a conserved gene order during evolution [44,45]. When comparing the gene distribution between S. lycopersicum, S. pennellii, and S. tuberosum, a high degree of collinearity conservation was detected in these three species (Figure 6). Collinearity of the gene order was further illustrated when chromosome-by-chromosome comparison was made between the three Solanum species (Figure 7). A total of eight groups of collinear gene pairs were found, as shown in Figure 7. From them, multiple groups of CIN genes, forming orthologous genes to each other, are intimately clustered at one node of phylogenetic tree. These orthologous genes usually share a common ancestor. For example, SlCIN01, SpeCIN01, and StCIN01 are orthologous genes of the β group; SlCIN02, SpeCIN02, and StCIN02 belonging to the α group are also orthologous genes. Strong collinearity and abundance of orthologous gene pairs again support the idea that the CIN gene family was highly conservative in the process of evolution. Collinearity can be regarded as an 'agent for the conservation or constraint of gene function', and it can provide insight into the evolution of genomes [45][46][47]. Such strong collinearity in the CIN gene family can not only provide a clue to reveal conservation CINs, but also contributes to understanding the evolutionary and functional characterization of CIN genes. Particularly, the presence of massive collinearity indicated that CIN genes in the Solanum genome arose before the divergence of these different species. Additionally, no evidence was found for gene-duplication events (such as segmental and tandem duplication) in these 24 CIN genes.

Cross-Species Transcript Profiling Reveals High Constitutive Expression of CIN Genes in S. lycopersicum and S. tuberosum
Digital expression profiling is a particularly powerful and efficient method for large-scale gene-expression analysis [48][49][50][51][52][53][54][55], and it contributes to understanding gene functions. We therefore analyzed the expression profiles of the CIN gene family at different types of tissue and developmental stages from S. lycopersicum and S. tuberosum on the basis of in silico analysis (Figure 8). In S. lycopersicum, six out of eight paralog CIN genes were constitutively or broadly expressed in all analyzed samples (leaf, root, fully opened flower, unopened flower bud, 1-3 cm fruit, mature green fruit, and breaker + 10 fruit; Figure 8A). Correspondingly, a similar scenario was also observed in the CIN gene family of the potato genome ( Figure 8B). It was previously reported that some genes that displayed broad expression generally exhibit higher conservation than that of those expressed in specific types of tissue for plant and mammal species [56,57]. Hence, the conservation of the CIN gene family was verified by constitutive expression profiling once again. Significantly, two CIN orthologs (SlCIN06 and StCIN05) were unexpressed or hardly detected, and were phylogetically grouped into the same β subclade (Figures 1 and 8). These findings hint that β CINs located on the cytosol might behave similarly in functional expression across species compared with the α group. By contrast, an expression level of SlCIN08 was not detected in the tomato genome, while the StCIN08 potato gene was broadly expressed in all examined samples compared with SlCIN08. Both of them were clustered in the same subclade belonging to the α group. These observations also support the view mentioned above that CIN genes of the β group have higher stability and sequence similarity than those of the α group.

Expression profiles of tomato CIN genes in response to phytohormones with qRT-PCR
qRT-PCR analysis was used to investigate expression levels of tomato CIN genes under different phytohormones (including GA 3 , 6-BA, SA, ABA, NAA and JA; Table S5). The SlCIN06 gene was not selected for analysis due to invalid primers. Of the seven remaining genes, after treatment with different phytohormones, the SlCIN01 gene was highly induced at 12 h, and transcription levels were steeply downregulated at 24 h ( Figure 10). On the other hand, the SlCIN08 gene was expressed at certain stages of GA 3 , 6-BA, and SA induction. The remaining genes (SlCIN02/03/04/05/07) were barely affected by phytohormones or were slightly induced at specific stages.

Conclusions
In the current study, the whole genomes of five Solanum species (S. tuberosum, S. lycopersicum, S. pennellii, S. pimpinellifolium, and S. melongena) were chosen to identify the CIN gene family. A total of 40 putative CIN genes were identified in the five plants, and each species contained a small CIN gene family with eight members. Subsequently, comparative analysis involving protein sequences, phylogenetic relationships, cis-acting elements, genome organization, and expression profile was performed at the CIN family level. On the basis of our results, an abundance of functional genomic evidence showed that the CIN gene family is evolutionarily and functionally conserved. In summary, our study not only provides a comprehensive understanding of the CIN gene family existing in Solanum, but also established a basis for further working out the function of CIN candidates in the whole Solanaceae family in the future.
Supplementary Materials: The following are available online at http://www.mdpi.com/2218-273X/9/12/763/s1. Figure S1: Phylogenetic relationship of Solanum CINs with Arabidopsis. Phylogenetic tree of CIN proteins was constructed with the maximum-parsimony method using MEGA 7.0 software. Support values shown in each line in the phylogenetic tree. Putative CIN genes divided into two subfamilies (α and β) on the basis of their in silico prediction of subcellular localization; α subfamily further divided into α1 and α2. Table S1: Characterization of CIN gene family in α and β groups. Table S2: Multiple sequence alignment of CIN genes in Solanum. Table S3: Pairwise identity of CIN genes in Solanum. Table S4: Expression data of potato CIN genes under various stress conditions. Table S5: Primers used to detect expression of tomato invertase genes in this study.

Conflicts of Interest:
The authors declare no conflict of interest.