- freely available
Int. J. Mol. Sci. 2013, 14(11), 23160-23187; doi:10.3390/ijms141123160
Published: 21 November 2013
Abstract: Genome-wide experiments are routinely used to increase the understanding of the biological processes involved in the development and maintenance of a variety of pathologies. Although the technical feasibility of this type of experiment has improved in recent years, data analysis remains challenging. In this context, gene set analysis has emerged as a fundamental tool for the interpretation of the results. Here, we review strategies used in the gene set approach, and using datasets for the pig cardiocirculatory system as a case study, we demonstrate how the use of a combination of these strategies can enhance the interpretation of results. Gene set analyses are able to distinguish vessels from the heart and arteries from veins in a manner that is consistent with the different cellular composition of smooth muscle cells. By integrating microRNA elements in the regulatory circuits identified, we find that vessel specificity is maintained through specific miRNAs, such as miR-133a and miR-143, which show anti-correlated expression with their mRNA targets.
Genome-wide experiments on RNA expression typically provide lists of differentially expressed genes (DEGs) [1,2] that represent the starting point of a highly challenging process of result interpretation in which the gene-by-gene approach is often used. The lists obtained are highly dependent on the statistical tests adopted and on the threshold used to declare a gene significant. This variability has raised substantial criticism concerning the reproducibility of array experiments. Several studies have demonstrated greater consistency of array results using gene set approaches, rather than single gene approaches , indicating that there is greater reproducibility of the main biological themes than of their single elements. A gene set is defined as a set of genes that are functionally related. Gene sets are usually identified based on a priori biological knowledge (see, for example, Gene Ontology “GO” ( http://www.geneontology.org/ (accessed on 13 November 2013)) and the Kyoto Encyclopedia of Genes and Genomes “KEGG” ( http://www.genome.jp/kegg/ (accessed on 13 November 2013))). In this regard, several new bioinformatics tools have been developed that allow the integration of information such as gene location [4–6], ontological annotations [7–10], or sequence features . These methods can be broadly divided into supervised and unsupervised approaches. Supervised methods use a priori information on the functional relationships among genes to identify the processes involved in an experimental condition, while unsupervised approaches attempt to reconstruct functional associations among genes without relying on external information. In the following, we will briefly review these strategies, focusing specifically on their pros and cons; in addition, we will apply these strategies to a case study.
1.1. Supervised Approaches: Pathway Analysis
The integration of gene expression profiles with additional information on pathway annotations is called pathway analysis. The pathway analysis approach evaluates gene expression profiles among related genes, looking for coordinated changes in their expression levels. Several implementations of pathway analysis are now available, from the widely used algorithm developed by Subramanian and colleagues (Gene Set Enrichment Analysis; GSEA) , with its improvements [10,12], to more sophisticated implementations that exploit the topology of the pathway [13,14] (for a comprehensive review of existing methods, see ). Pathway analysis methods can be divided into (i) methods based on enrichment analysis and performed on a list of genes selected through a gene-level test; and (ii) methods based on global and multivariate approaches that define a model based on the whole gene set. With the first class of methods, the primary concerns are the assumption that genes are independent and the use of a threshold value for the selection of differentially expressed genes. Due to the latter, many genes with moderate but meaningful expression changes are discarded based on the strict cut-off value, leading to a reduction in statistical power. On the other hand, global and multivariate approaches relax the assumption of independence among genes belonging to the same gene sets and identify moderate but coordinated expression changes that cannot be detected by the enrichment analysis approach .
From this perspective, we recently developed three novel algorithms that can be used to perform gene set and pathway analysis. Graphite, a Bioconductor package , is a computational framework that can be used to manage, interpret, and convert pathway annotations to gene-gene networks, while STEPath  integrates expression levels and chromosome positioning to identify regional gene activation and CliPPER [14,19] explores the topology of a pathway, highlighting the portions most involved in its deregulation. We have implemented most of these analyses in a new web tool called GraphiteWeb .
One of the major drawbacks associated with these approaches is the limitation of pathway annotation. Pathway annotation is a highly challenging procedure that exploits the efforts of many researchers, who manually curate each single pathway based on information available in the literature. Pathways are often thought of as the elementary functional and evolutionary building blocks of the complete metabolic network, with each pathway representing a “self-contained” elementary biochemical process. To partition the reaction network of an organism into a set of (possibly overlapping) metabolic pathways requires arbitrary decisions as to where such partitions should be made and how pathway variants should be described . For these reasons, only a portion (in humans, approximately one-third) of known genes are currently annotated in at least one pathway.
In KEGG , the metabolic pathways—called “maps”—are subparts of the overall reaction graph. Reactions within a map are connected by their constituent metabolites, which also provide links to reactions in other maps. KEGG metabolic maps are described without reference to a particular species, and each map includes the reactions belonging to all known variants of a particular pathway. MetaCyc is a database of non-redundant, experimentally elucidated metabolic pathways that are found in many species  while, in the smaller Reactome database , the human database is used as the reference for predicting reactions and pathways in other organisms.
1.2. Unsupervised Approaches: Reverse Engineering Approach
A different approach to dealing with biological networks is the ab initio strategy: using genome-wide expression values, these algorithms try to infer the best network of interactions satisfying specific conditions. Unlike the pathway analysis approach, here, all known genes can be taken into consideration. Several methods have been proposed for the reconstruction of gene regulatory networks (GRNs) from experimental data; these include Bayesian Networks (BN) , Relevance Networks (RN) , and Graphical Gaussian Models (GGM) [27,28]. While BN and GGM distinguish between direct and indirect edges, RN does not. It is worth noting that although BN and GGM are able to infer edge direction this does not necessarily imply an ability to identify biological causality.
BN and GGM function poorly in cases involving thousands of genes and a small number of replicates, while RN has the ability to address such cases. RN uses association measures between two expression profiles, such as correlation and mutual information, to rank gene-gene interactions according to their strengths; the higher the association measure, the greater the probability of a functional interaction between the two genes. All of these approaches produce a large number of false positives (false interactions). The seminal paper of Basso et al., 2005 , extends RN, introducing an algorithm based on Data Processing Inequality (DPI) for removing indirect edges. Their approach, called ARACNE (Algorithm for the Reconstruction of Accurate Cellular Networks) , has been successfully used to reconstruct the sub-network of the MYC gene in human B cells.
In this context, we developed a new R package, parmigene, that performs network inference by implementing an unbiased estimation of the mutual information between expression profiles, thus yielding more precise results than existing software at strikingly less computational cost .
Apart from their low specificity, a significant issue raised by the last network inference challenge (DREAM 5) is that no single network inference method performs optimally across all data sets. In contrast, integration of predictions from multiple inference methods through a consensus network shows robust and high performance across diverse data sets .
Apart from the algorithm used, once the whole network has been inferred, the classical approach to dealing with large amounts of interactions is identifying small-connected components as a means of testing their enrichment in specific biological processes.
1.3. The Missing Element: MicroRNAs (miRNAs)
Although highly innovative, the supervised and unsupervised approaches described so far do not take miRNAs into consideration. Many efforts have been made to predict miRNA/mRNA interactions, first by developing various target prediction algorithms and then by introducing new experimental techniques to isolate miRNA/mRNA complexes [33–36]. Computational target prediction is still widely used, although it is characterized by many false positives. For exhaustive reviews on miRNA discovery algorithms and in silico target prediction [37,38].
The integration of target predictions with miRNA and gene expression profiles has recently been proposed as a means of computationally improving and refining miRNA-target predictions. As miRNAs act predominantly through target degradation, the expression profiles of miRNAs and those of their target genes are expected to be inversely correlated [39,40].
Although the key role of miRNA in post-transcriptional regulation is universally recognized, few attempts have been made to use combinations of miRNA elements in developing gene set approaches. The only such attempt was described by Nam and colleagues , who performed GSEA on the mRNA targets of de-regulated miRNAs.
1.4. Case Study: The Pig as a Model Organism
Considering the advantages and disadvantages of the approaches described above, here we propose a consensus strategy based on the integration of pathway analysis, relevance networks and miRNA expression using as a model organism the pig and its cardiocirculatory system.
The size of organs, as well as various anatomical features, general physiology, and features of organ development, are very similar in pigs and humans. This permits the use of the pig as a model in the study of a number of pathologies, such as those affecting eyes , muscle , organ transplantation [44,45], and the gastrointestinal , nervous , and cardiovascular  systems. The coronary artery distribution in the pig is more similar to that of humans than is that of other animals. In addition, pigs present very similar cardiac output to humans; they possess a vaso vasorum in the aorta, and the left azygous vein empties into the coronary sinus instead of into the precava. Blood pressure (145–160/105 BP), heart rate (100–150 BPM) and pulmonary pressure are higher in pigs than in humans.
Despite the medical importance of the pig as a species for study, our knowledge of the genome organization, gene expression regulation, and the molecular mechanisms underlying the pathophysiological processes of the pig is far less than the knowledge we have acquired of the mouse and rat. More than 90% of the porcine genome has been sequenced by the Swine Genome Sequencing Consortium . The availability of detailed information on the porcine genome, together with emerging transgenic technologies, will enhance our ability to create specific and useful pig models. Recently, an atlas of DNA methylomes in porcine adipose and muscle tissues was published , and a great effort was made to combine genome sequence information with our knowledge of gene expression. Many of these studies focused on the swine immune system [51–54], while a genome-wide expression analysis in different tissues was described in Freeman’s paper . Recently, using sequencing approaches, a compendium of small non-coding RNAs was identified in various pig tissues (e.g., skeletal muscle [56–62], kidney , tooth , intestinal tract , brain , testis, ovary, sperm, and embryo [67–71] and pituitary gland ). Li and colleagues demonstrated that a complex regulatory network of porcine subcutaneous fat development is reflected in a great diversity of miRNA composition and expression between muscle and adipose tissue .
Here, we generate new custom mRNA and miRNA platforms that can be used to dissect the transcriptomic changes and regulatory circuits that are involved in the maintenance of veins and arteries in the pig. An integrative approach, combining pathway analysis and de novo network reconstruction, was used to expand our current knowledge of these regulatory circuits and to integrate miRNA activity into these circuits demonstrating their role in vessel specification. We show that vessel specificity can be maintained through different miRNAs (e.g., miR-133a and miR-143), the expression of which is inversely correlated with that of their mRNA targets.
2. Results and Discussion
The integration and analysis of gene and miRNA expression profiles across different tissues is fundamental to our understanding of tissue-specific processes. Here, we focus our analysis on differences in gene and miRNA expression among different tracts of the circulatory system: the two largest veins of the body (superior and inferior vena cava), the aorta (ascending and descending), the pulmonary artery, and the coronary artery. To achieve this goal, we created mRNA and miRNA  platforms, the latter based on the RAKE (RNA primed–array-based Klenow enzyme assay) method [75,76], to quantify coding and non-coding gene expression in pig tissues. After quantifying miRNA and mRNA expression, we used a combination of supervised and unsupervised approaches to detect transcriptional and post-transcriptional differences among different tracts of the circulatory system.
Ensembl transcripts (Ver. 56; EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, UK) and UniGene (Ver. 38; National Center for Biotechnology Information, U.S. National Library of Medicine, Bethesda, MD, USA) pig sequences were used to produce a dedicated microarray platform for monitoring mRNA expression. On the basis of sequence similarity, UniGene features that overlapped more than 40% with an Ensembl transcript were discarded. After this filter, we obtained 40,267 UniGene clusters and 19,603 Ensembl transcripts (protein coding + pseudogenes + retrotransposed elements). For this selected collection of sequences, we designed microarray probes with different specificities and located at different distances from the 3′ ends of specific transcripts using six different algorithms. The two best probes for each sequence, as determined by the reliability of the prediction algorithm and by the probe’s vicinity to the 3′-end, were experimentally tested in a hybridization trial performed with a pool of mRNA populations independently prepared from 20 pig tissues (GEO: GSE28636). For each transcript with a replicated probe, we selected the probe that was the most responsive and specific on the basis of the intensity of fluorescence in the hybridization test, as suggested by Kronick . The resulting pig whole-genome microarray, which was used in the gene expression analysis, is composed of: (i) 17,048 replicated probes and 963 single probes specific for the Ensembl transcripts; (ii) 11,363 replicated probes specific for the UniGene clusters of lengths between 778 nt and 1348 nt; and (iii) 28,790 single probes specific for the remaining UniGene clusters. Our analysis was not able to identify specific probes for 114 UniGene clusters and 1592 Ensembl transcripts. A limitation we faced in working with gene expression in pig was the poor gene annotation available. The number of annotated features on the array was increased by mining description and protein annotations to associate gene names with our probe symbols. Basically, for genes for which the HUGO (Human Genome Organisation) symbol was not present, we mined the description available from the Unigene database and retrieved additional gene or protein IDs, if present. All IDs were manually curated (ArrayExpress ID: A-MEXP-2351).
Recently, a new microarray platform based on 52,355 expressed sequences comprising miRNAs in miRBase Ver. 15 (Wellcome Trust Sanger Institute, Cambridge, UK) for pigs, cows, humans, and mice was described . Unlike this new platform, which was constructed by spanning 22 probes along the transcripts, the platform we developed detects the 3′-UTR of each transcript; therefore, we are able to distinguish mRNA isoforms. This feature is fundamental because the activity of miRNAs is predominantly based on their interactions with the 3′-UTR region of mRNAs.
The identification of miRNAs was described in . Briefly, bioinformatic analyses were performed on the pig genome for the identification of putative pre-miRNAs. These were experimentally tested using six independent RAKE experiments to identify 5′ and 3′ miRNA boundaries. After this experimental confirmation, all the pre-miRNAs identified as responsive (1235 hairpins) were tested for the presence of mature miRNA through RNA sequencing experiments. RNA sequencing experiments identified 343 hairpins coding for miRNAs. However, using PCR we were able to validate several miRNAs that were not confirmed by RNA sequencing. Therefore, we decided to produce an miRNA microarray platform (Array Express ID: A-MEXP-2348) containing all miRNAs detected by RAKE experiments. In the following analysis, we will discuss only miRNAs that were confirmed in sequencing experiments. Each specific probe is flanked by a background probe that was used to subtract the corresponding background fluorescence signal in the analysis (Figure 1).
The short length of miRNAs makes complementary probe selection and the identification of optimized PCR primers a challenging task. While miRNA microarrays permit massive parallel and accurate relative measurement of all known miRNAs, they have been less useful for absolute quantification. We developed a new method that integrates the hybridization of miRNAs with an enzymatic elongation reaction that can take place only following a perfect match between the miRNA and the probe. Moreover, we introduced oligonucleotide spikes into the hybridization-enzymatic reaction, permitting the quantification of miRNAs over the linear dynamic range of 10−18 moles to 10−14 moles and avoiding biases related to sequence, labeling, or hybridization .
2.1. Differences between Arteries and Veins
We compared different tracts of the circulatory system: the two largest veins (the superior and inferior vena cava), the aorta (ascending and descending tracts), the pulmonary artery, and the coronary artery. As expected, the ascending and descending aorta and the coronary artery display similar gene expression profiles that are distinct from those of the superior and inferior vena cava (Figure 2A), while the pulmonary artery has an intermediate expression profile (Figure 2A). Arteries and veins are structurally different in terms of their relationship to the heart. Arteries receive blood directly from the heart and are therefore characterized by high pressure; in contrast, veins receive blood from peripheral body regions, and low pressure characterizes them. For this reason, some of the blood in the veins may not return to the heart but instead may back up or collect in these vessels. Veins transport de-oxygenated blood, while arteries transport oxygenated blood (with the exception of the pulmonary artery, which transports de-oxygenated blood to the lungs for oxygenation). The difference in blood pressure in arteries and veins is reflected in the different structures of these vessels. Arteries and arterioles have thicker walls than veins and venules; specifically, they possess an increased amount of smooth muscle that provides extra strength and elasticity to withstand surges of blood from the heart. Moreover, the thinner the vessel, the lower its innervation.
In accordance with the increased number of smooth muscle cells in arteries, the aorta expresses more smooth muscle-specific transcripts than the vena cava (Figure 2B). Genes that are up-regulated in the aorta include genes related to biological structures such as adherence junctions and processes such as nerve function and blood circulation (Table S1). This is consistent with the significantly higher level of innervation of arteries than of veins. Up-regulated genes in the vena cava are enriched in genes coding for proteins involved in the formation of the extracellular matrix (Table S1). These findings may be associated with the differences in elasticity between veins and arteries (veins have less elastic tissue than arteries).
A major component of the vessel walls of large arteries and veins is the extracellular matrix (ECM), which consists of collagens, elastin, and proteoglycans. The smooth muscle cells of the aorta and vena cava synthezise different amounts of collagen. As expected, our data show that collagen synthesis is four-fold higher in venous than in arterial ; collagen type I (COL1A2) is the most highly expressed extracellular matrix component.
Procollagen C-endopeptidase enhancer 2 (PCOLCE2) and P4HA1 prolyl 4-hydroxylase, α polypeptide I (P4H4) genes were found to be up-regulated in the vena cava. PCOLCE2 binds to the C-terminal propeptide of type I and II procollagens and may enhance the cleavage of their propeptides, while P4H4 is a key enzyme in collagen synthesis. Moreover, we found type VIII collagen (COL8A1), which is typical of the endothelium lining vessels, and type VI collagen (COL6A3), a subendothelial constituent , to be highly expressed in the vena cava.
2.2. Pathway Analysis
Using multivariate pathway analysis methods such as GSEA, we overcame the major limitation of the classical enrichment approach, cut-off-based gene selection, focusing instead on coordinated changes in gene expression. Using this method, we were able to identify gene pathways that are specifically expressed in arteries and veins (Table 1). Among the activated pathways in arteries are those associated with smooth muscle contraction, calcium-calmodulin-dependent events, genome stability and regulation of intracellular signaling cascades. This finding is consistent with the presence of a thicker smooth muscle ring in arteries than in veins. Among the activated pathways in veins, we find the complement cascade, arachidonic acid metabolism, cell surface interactions at the vascular wall, and extracellular matrix metabolism (glycosaminoglycan metabolism and keratin/keratan sulphate metabolism). Arachidonic acid metabolism is involved in the control of various processes within the cardiocirculatory system, including vasoconstriction  and vasodilation [81,82]. The two most highly expressed genes related to arachidonic acid metabolism were prostaglandin-endoperoxide synthase 2 (PTGS2 or COX-2) and γ-glutamyltransferase 5 (GGT5). COX-2 and endothelial nitric oxide synthase (eNOS) are primarily expressed in endothelial cells and are considered important regulators of vascular function. Under normal conditions, laminar flow induces COX-2 expression and synthesis of PGI2, which in turn stimulates eNOS activity . GGT expression was also localized in the endothelium . As blood normally flows more slowly through veins than through arteries, thromboses are more common in veins than in arteries. This could be the reason for the control of vasodilation and vasoconstriction through metabolites of arachidonic acid.
In support of the up-regulation of elements of the complement cascade in veins, it is known that inflammation is more readily induced in venous than in arterial epithelium due to the conditions of the venous circulation. We checked for the presence of an inflammatory process by analyzing the expression of complement components in 19 tissues (Figure 3). We find that not all complement components are up-regulated in veins, while most are highly expressed in lymph nodes, spleen, and liver. This is in accordance with complement system synthesis and laundering. The complement system consists of a dozen circulating proteins, most of which are synthesized by the liver, that have the ability to bind to cellular membranes. The spleen and the liver are able to remove immune complexes composed of complement elements linked to erythrocyte membranes .
Finally, it is worth noting that pathways describing mucopolysaccharidosis syndromes such as Hurler, Sanfilippo, and Morquio syndromes were found to be significantly expressed in veins. Altered glycosaminoglycan metabolism is a key feature of these pathologies. Glycosaminoglycans are proteoglycans that bind to a varying degree water, electrolytes and macromolecules, such as collagen, within the connective tissue. The lining of veins and arteries comprises a substantial amount of the body’s connective tissue. The outer layer of vessels (tunica adventitia) consists chiefly of connective tissue and is the thickest layer of the vein.
2.3. De Novo Pathway Reconstruction: Topological Parameters
Pathway analysis fails to consider many known genes and miRNAs that are not annotated in any pathway. To fill these gaps, we used de novo network reconstruction using both mRNA and miRNA profiles. Using a correlation measure with a permutation-based threshold of 0.9 of mutual information (0.9 was the maximum value of mutual information of the network generated by the permuted expression matrix), we generated a network with 7762 nodes (7647 genes and 115 miRNAs) and 44,092 edges (Figure 4). The global architecture of the network is characterized by two large clusters, which are shown as the blue and violet nodes in Figure 4. As expected (Figure 2A), these two clusters are composed of genes prevalently expressed in heart (the most different tissue) and in blood vessels (Figure S1). Thus, we separated these two clusters to create a vessel-specific and a heart-specific network.
To gain insight into the structure of complex networks of this type, various topological parameters were calculated (Table 2). The heart network is sparser and less connected than the vessel network. This is reflected by a larger number of connected components, a higher diameter and a smaller number of neighborhood genes of the heart network.
The degree of a node, also referred to as its connectivity, is the number of edges connected to the node. Based on this definition, the nodes with the highest connectivities are called hubs. In general, hub genes are master regulators and play important roles in the biology of the cell. In our networks, we define as hubs the top 5% of genes in the connectivity distribution. We found 162 and 128 hubs in the vessel and heart networks, respectively. The hub genes of the vessel network encode proteins that participate in two main processes: RNA processing and the regulation of apoptotic events (Table S2). During normal development as well as in pathology, the formation of new vessels and the regression of pre-existing ones depend on the balance between endothelial cell proliferation and endothelial cell apoptosis. In mature vessels, endothelial cell turnover is also under the control of these tightly regulated phenomena. Among the hubs of the heart network, we identified genes involved in cell membrane structure and signal transduction through MAPK activity as well as genes encoding various ion transporters (e.g., Na2+, K+) (Table S2). The members of the MAPK family are involved in the regulation of many cellular processes, including cell growth, differentiation, development, the cell cycle, death, and survival. Activation of genes in the MAPK family plays a key role in the pathogenesis of various processes in the heart, including myocardial hypertrophy and its transition to heart failure, ischemic and reperfusion injury, and cardioprotection conferred by ischemia- or drug-induced preconditioning .
The de novo reconstructed network (Figure 4) is characterized by the presence of different miRNAs (Table S3) that are responsible for the regulation of vessel specificity. Figure 5 represents the sub-network of the neighboring genes of miRNAs. Interestingly, the central part of the network (the densely connected portion of the sub-network) is characterized by genes involved in smooth muscle contraction (Table S4) that show differential expression in arteries and veins (Figure 6). As discussed previously, a thicker ring of smooth muscle is present in arteries than in veins (see Section 2.2). Our results suggest that this difference may be regulated by specific miRNAs that display anti-correlated expression with their putative targets (Figure 6).
Specifically, the α 2-actin (ACTA2) smooth muscle gene in aorta (ENSSSCG00000010447) is regulated by a specific miRNA (prediction_15_14390446_14390503_-_3p) that is down-regulated in the aorta and up-regulated in venous tissue (Figure 6). Defects in ACTA2 are the cause of aortic aneurysm familial thoracic type 6 (AAT6) [MIM:611788]. AATs are characterized by permanent dilation of the thoracic aorta, usually due to degenerative changes in the aortic wall. RHOB (Ssc#S35170885), an important gene involved in vasoconstriction, is also regulated by miR-133a (Figure 6). RHO gene family is involved in vascular morphogenesis , and miR-133a contributes to the phenotypic state of smooth muscle cells both in vitro and in vivo, suggesting a potential for therapeutic application of this miRNA in vascular disease . In fact, miR-133a, in association with miR143/145, is fundamental for the maintenance of the contractile smooth muscle cell phenotype . The expression of miRNAs prediction_15_14390446_14390503_-_3p and miR-133 and their targets ACAT2 and RHOB was confirmed by qRT-PCR (Figure 6C).
2.4. Integration of Supervised and Unsupervised Approaches
Supervised and unsupervised approaches gave similar results in terms of biological processes involved in tissue specificity. However, their complementary behavior might be better exploited through the use of an integrative approach. Specifically, our aim is to combine the topology of the discovered pathways with that of the de novo reconstructed network. The advantage of combining the topologies obtained in sections 2.2 and 2.3 is two-fold: (i) it allows the expansion of pathway definitions to include genes currently without pathway annotation; and (ii) it permits the inclusion of miRNAs. Using the topological structure of the pathway as a backbone, we include new genes in the pathway, following two rules: (i) a gene/miRNA is added only if it presents an edge in the de novo network with at least one gene in the pathway; and (ii) additional miRNAs are included if they share an edge with previously added non-annotated genes. Here, we will use this strategy to discuss one of the most interesting pathways significantly activated in arteries: the smooth muscle contraction pathway (Figure 7A). The genes used to expand this pathway (the γ isoform of the catalytic subunit of protein phosphatase 1 (PPP1CC), transgelin (TAGLN), and smooth muscle and non-muscle myosin light chain 6 (Myl6), among others) are primarily involved in membrane and actin filament organization, actomyosin function and responses to specific stimuli (NF-κB binding and response to unfolded protein) (Table S5), reflecting their functional congruence with the smooth muscle contraction pathway. Indeed, the membrane organization category includes the organismation of the sarcoplasmic reticulum, which is involved in the regulation of intracellular Ca2+ concentration (Figure 7). All of these genes are prevalently expressed in smooth muscle; in particular, TAGLN was purified from bovine aorta . Moreover, we added 61 miRNAs that putatively regulate genes involved, directly or indirectly in the smooth muscle contraction pathway (Figure 7A). Interestingly, 23 miRNAs are involved in the regulation of the original genes of the pathway (core genes). Among these miRNAs, miR-542 (ENSSSCT00000021275), which was shown in a previous work to be involved in the epithelial-mesenchymal transition , was found to be associated with vimentin (VIM) regulation (Figure 7B). Finally, it is worth noting that many other miRNAs important for vascular remodeling and smooth muscle phenotypic control, such as miR-133 , miR-143 , miR-99b , miR-23a , miR-138 (ENSSSCT00000021566) , miR-29c , miR-125a (ENSSSCT00000020936) , and miR-24 ), are included in this network.
3. Experimental Section
3.1. Sample Preparation
RNA samples (total RNA and small RNAs) were extracted from the analyzed tissues of three non-inbred pigs and kept at −80 °C until use. Before the experiments were performed, the three samples from the same tissues were pooled, and miRNA was selected using a flashPAGE instrument (Ambion, Carlsbad, CA, USA). RNA extraction was performed using TRIzol (Invitrogen, Carlsbad, CA, USA) according to the manufacturer’s protocol. The PureLink Isolation Kit (Invitrogen, Carlsbad, CA, USA) was used to separate long RNA from short (<200 nt, after use in the flashPAGE instrument). All samples were quantitated using a NanoDrop ND-1000 spectrophotometer; RNA quality was then analyzed using the Agilent Bioanalyser 2100 (Agilent, Santa Clara, CA, USA) (Agilent RNA 6000 nano kit; RIN at least 7 accepted) and for the presence of miRNA using the Agilent small RNA kit.
3.2. Microarray Platforms
For this study, we synthesized two different types of microarray platforms: (a) 4 × 2 K Combimatrix microarrays for miRNA expression profiling (ArrayExpress ID: A-MEXP-2348); (b) 90 K Combimatrix microarrays (ArrayExpress ID: A-MEXP-2351) for mRNA expression profiling. All microarrays were synthesized using the Combimatrix oligonucleotide synthesizer station (Combimatrix, Mukilteo, WA, USA), which allows in situ synthesis of oligonucleotide probes through phosphoramidite chemistry. All synthesized microarray platforms were tested for uniformity of the probes as suggested by the manufacturer.
The 4 × 2 K microarrays contain specific probes for miRNAs. Each specific probe is flanked by a background probe that is used in the analysis to subtract the corresponding background fluorescence signal (Figure 1). The background probes were derived from a previous RAKE experiment aimed at the identification of specific ends of miRNAs in which a tiling microarray was used for the scope (Figure S2) .
3.3. Microarray mRNA and miRNA Gene Expression and qRT-PCR
Pooled RNA (1 μg; three samples from the same tissue) was linearly amplified and labeled by the addition of biotinylated nucleotides according to the procedure described in the Ambion MessageAmp™ II aRNA Amplification kit (Ambion, Carlsbad, CA, USA). The procedure includes reverse transcription with an oligo-dT primer carrying a T7 promoter to produce the first-strand cDNA. After second-strand synthesis and clean-up, the cDNA is used as template in an in vitro transcription reaction to generate a large quantity of antisense RNA (aRNA). Biotinylated UTPs were incorporated into the aRNA during the in vitro transcription reaction. Following purification, 18 μg of aRNA was fragmented using the Ambion Fragmentation Kit (Ambion, Carlsbad, CA, USA). Intact and fragmented aRNAs were tested on an Agilent Bioanalyzer 2100 (Agilent, Santa Clara, CA, USA) using the RNA 6000 Nano LabChip (Agilent, Santa Clara, CA, USA). The size of intact aRNAs ranged from 300 to 4000 nucleotides, while that of fragmented aRNAs ranged from 50 to 250 nucleotides. Fragmented aRNA was hybridized to pre-hybridized 90 K Combimatrix microarrays. The pre-hybridization step was performed for 2 h at 42 °C in a solution containing 5× Denhardt’s solution, 100 ng/μL salmon sperm DNA and 0.05% SDS in 1× Hyb solution prepared as suggested by Combimatrix. Hybridizations were carried out with 4.8 μg of fragmented aRNA in 25% DI formamide, 100 ng/μL salmon sperm DNA and 0.04% SDS in 1× Hybridization solution at 42 °C for 18 h with constant mixing. After hybridization, the microarray platforms were washed with the following:
6× SSPET (SSPE added with 0.05% of Tween-20) preheated at 42 °C for 5 min;
3× SSPET for 1 min at room temperature;
0.5× SSPET for 1 min at room temperature; and
PBST for 1 min at room temperature.
The microarray chamber was then filled with biotin blocking solution (0.1% Tween-20 and 10 mg/mL BSA in 2× PBS) and incubated at room temperature for 1 h. Labeling was performed by incubating the microarray with dye labeling solution (0.1% Tween-20, 10 mg/mL BSA and 1.6 ng of Cy3-streptavidin (Amersham, Little Chalfont, UK) in 2× PBS) for 1 h at room temperature. After the washing steps (PBST for 1 min at room temperature two times; PBS for 1 min at room temperature), microarrays were scanned at 3 μm resolution with the VersArray ChiprRaderTM (BioRad, Hercules, CA, USA) (ArrayExpress ID: E-MTAB-1941).
A sample of the miRNA pool (350 ng) was hybridized for 20 h at 37 °C in a static hybridization oven in hybridization buffer consisting of 6× SSPE, 8 mg/mL BSA, 700 ng of small RNAs and spike-in. After hybridization, the microarrays were washed with the following stringent procedure:
1 min at room temperature with 6× SSPET (SSPE containing 0.05% Tween-20);
1 min at room temperature with 3× SSPET;
1 min at room temperature with 2× PBS;
1 min at room temperature with 1× Buffer 2 (the buffer for the Klenow enzyme).
The RAKE reaction was performed at 36.5 °C by incubating the microarray for 1.5 h in 1× Buffer 2 containing 16 μM biotin-14-dATP (Invitrogen, Carlsbad, CA, USA) and 0.25 U/μL Klenow fragment (3′→5′ exo−) (NEB, Ipswich, MA, USA). The microarrays were washed two times in 1× Buffer 2 and incubated in biotin blocking solution for 1 h at room temperature. Extended miRNAs (primers) were labeled by incubating the microarray in the dye labeling solution for 1 h at room temperature. The microarrays were rinsed in PBST (0.1% Tween-20 in 2× PBS) for 1 min at room temperature and in 2× PBS for 1 min at room temperature and scanned (ArrayExpress ID: E-MTAB-1938).
qRT-PCR was used to validate the expression of miRNAs and mRNAs. For mRNA, the SYBR green approach was used in association with the Power SYBR® Green PCR Master Mix (Applied Biosystems, Carlsbad, CA, USA); for miRNA, the NCode™ SYBR® Green miRNA qRT-PCR Kit (Life Technologies, Carlsbad, CA, USA) was used according to the manufacturer’s specifications. The primers used were GCATGCAGAAGGAGATCACA (left) and GCTGGAAGGTGGACAGA GAG (right) for ACTA2, TATGTGCTTCTCGGTGGACA (left) and CGAGGTAGTCGTA GGCTTGG (right) for RHOB, and GGTTCCCAGGCTAGGGGTCG (specific) for prediction_15_14390446_14390503_-_3p and CAGCTGGTTGAAGGGGACCA for miR-133a. The reference genes used were GAPDH for mRNA and snU6 for miRNA. The results shown are normalized to the expression of histone H3.
3.4. Data Analysis
Images of hybridized mRNA microarrays were quantitated using the Combimatrix imaging software. The raw data were normalized using the quantile method. The goal of the quantile method is to normalize the distribution of probe intensities across a set of microarrays. After normalization, the fluorescence intensities of probe spots presenting values lower than the average of the medians of all negative control probes were set as missing values (NA). The negative control probes were used to calculate the background value (filter). Probe spots presenting NA in more than six experiments were excluded from data analysis. Before performing the analysis, the intensity values of the replicated probes were averaged. Differentially expressed genes were identified using the MeV suite  and applying PCA (Principal Components Analysis)  and SAM (Significance Analysis of Microarrays)  analysis. COA (Correspondence Analysis) analysis  was used to determine the specificity of the de novo reconstructed network. Gene enrichment was performed using the DAVID web application ; pathway analysis was performed using GraphiteWeb .
miRNA data were pre-processed as previously described except that cyclic lowess normalization was applied . After inter-array normalization, the fluorescence intensity of the specific miRNA probe was subtracted from the corresponding background fluorescence and used to extrapolate the miRNA concentration from the spike-in-derived curve. The spike-in curve was extrapolated using spline interpolation .
Pig gene symbols from Ensembl were converted to human gene symbols using the Ensembl orthologous database through the BioMart service. For UniGene clusters, we extracted the most similar protein or gene curated by NCBI ( http://www.ncbi.nlm.nih.gov/ (accessed on 13 November 2013)) based on sequence similarity and then used the NCBI HomoloGene database to translate the protein or gene to its human homolog. This method is commonly used to map genes to pathways in non-model organisms or to map genes that are poorly annotated in model organisms ; it is also common to use the well-curated human pathways to extrapolate pathways for non-model organisms. GSEA  was then performed using the GraphiteWeb web tool .
Mutual information (MI) between all pairs of genes and miRNAs was estimated using the parmigene Bioconductor package ; miRNA-miRNA interactions have been removed. To assess MI significance, we estimated the null distribution using a permutational approach. The expression profiles of miRNAs and mRNAs were randomly shuffled, and MI was then estimated on the shuffled matrices. To generate the global network, we included only interactions with MI that were greater than the maximum MI value obtained from the null distribution, which was 0.9 (corresponding to quantile 0.999 in the empirical distribution).
The topologies of the most interesting pathways derived from pathway annotation (graphite Bioconductor package) were integrated with the topology of the de novo reconstructed network. The combination was performed using the pathway topology as backbone; new genes/miRNAs were then added based on fulfilment of one of the following criteria: (i) if the new gene/miRNA shares an edge in the de novo network with at least one gene in the pathway; and (ii) if an miRNA shares an edge in the de novo network with at least one previously added gene.
Gene set analyses have been shown to provide better insights and more robust results in array experiments than classical gene-by-gene approaches. Here, we reviewed various strategies used in gene set analysis and showed how to address their integration. We combined genome and pathway information with expression data and applied this approach to a case study, the analysis of the pig cardiocirculatory system. Two new platforms for pig transcriptome analysis (mRNA and miRNA) were presented and applied to the study of tissue specificity. Different expression patterns were identified in heart and vessels; within these, arteries show distinct profiles from those of veins. These findings seem to be associated with the functional and structural composition of the vessels. In agreement with histochemical evidence, pathway analysis revealed the greater importance of smooth muscle in arteries than in veins. We showed that miRNAs participate in the definition of arterial and venous pathways; specifically, for smooth muscle, our data indicate the importance of miR-133a in regulating the RHOB gene. The use of a combination of supervised and unsupervised approaches allowed us to expand the compositions of known pathways to include new genes involved in membrane and actin filament organization, actomyosin function and response to stimuli and new miRNAs, most of which are known to be associated with vascular remodeling and control of the smooth muscle phenotype. These results demonstrate the feasibility and usefulness of combining these two approaches in identifying new candidate genes whose expression is associated with specific experimental conditions.
|Table 1. Summary of Gene Set Enrichment Analysis (GSEA) analysis based on the Reactome database ( http://www.reactome.org/ (accessed on 13 Novembre 2013)). Set size refers to the dimension of the pathway, and NTK (Normalized T-test of the kth gene set) is the observed value of the statistic as defined in the Graphite web tool . Negative NTK values indicate pathways activated in veins, while positive values indicate pathways activated in arteries. It is worth noting that GSEA is known to have low statistical power; the suggested Q-value cut-off for identification of significant pathways is 0.25.|
|Arachidonic acid metabolism||11||−3.09||0.044912281|
|MPS I—Hurler syndrome||54||−3.09||0.044912281|
|MPS II—Hunter syndrome||54||−3.09||0.044912281|
|MPS IIIA—Sanfilippo syndrome A||54||−3.09||0.044912281|
|MPS IIIB—Sanfilippo syndrome B||54||−3.09||0.044912281|
|MPS IIIC—Sanfilippo syndrome C||54||−3.09||0.044912281|
|MPS IIID—Sanfilippo syndrome D||54||−3.09||0.044912281|
|MPS IV—Morquio syndrome A||54||−3.09||0.044912281|
|MPS IV—Morquio syndrome B||54||−3.09||0.044912281|
|Cell surface interactions at the vascular wall||54||−2.75||0.106666667|
|Keratan sulfate/keratin metabolism||20||−2.46||0.205977011|
|G α (12/13) signaling events||35||−2.37||0.24|
|Antigen presentation: Folding, assembly and peptide loading of class I MHC||11||−2.33||0.250980392|
|Golgi associated vesicle biogenesis||29||−2.29||0.247017544|
|Phase II conjugation||23||−2.26||0.249756098|
|EGFR interacts with phospholipase C-γ||17||2.12||0.273710692|
|Calmodulin induced events||14||2.14||0.262564103|
|Cell-extracellular matrix interactions||15||2.2||0.254184397|
|PLCG1 events in ERBB2 signaling||18||2.23||0.252121212|
|DAG and IP3 signaling||15||2.29||0.247017544|
|RNA polymerase I promoter opening||18||2.65||0.131282051|
|Deposition of new CENPA-containing nucleosomes at the centromere||21||2.88||0.077575758|
|Packaging of telomere ends||12||3.09||0.044912281|
|Striated muscle contraction||21||4.76||0|
|Smooth muscle contraction||19||6.13||0|
|Table 2. Summary of the principal topological parameters estimated for the de novo reconstructed network.|
|Topological parameters||Heart network||Vessels network|
|Average clustering coefficient||0.195||0.234|
|Avg. number of neighbors||6.329||15.611|
The authors acknowledge the CARIPARO Foundation (Project for Excellence 2012: “Role of coding and non-coding RNA in chronic myeloproliferative neoplasms: from bioinformatics to translational research”) and the CRIBI Center for high-performance computing resources funded by the Regione Veneto (RISIB project SMUPR n. 4145). The authors wish to thank the University of Padova for support of this work (CPDR075919 and CPDA119031 to Chiara Romualdi; CPDR070805 to Gabriele Sales) and MicroCribi service for microarray synthesis support ( http://microcribi.cribi.unipd.it).
Conflicts of Interest
The authors declare no conflict of interest.
- Smyth, G.K. Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat. Appl. Genet. Mol. Biol 2004, 3, 1–28. [Google Scholar]
- Tusher, V.G.; Tibshirani, R.; Chu, G. Significance analysis of microarrays applied to the ionizing radiation response. Proc. Natl. Acad. Sci. USA 2001, 98, 5116–5121. [Google Scholar]
- Shen, K.; Tseng, G.C. Meta-analysis for pathway enrichment analysis when combining multiple genomic studies. Bioinformatics 2010, 26, 1316–1323. [Google Scholar]
- Callegaro, A.; Basso, D.; Bicciato, S. A locally adaptive statistical procedure (LAP) to identify differentially expressed chromosomal regions. Bioinformatics 2006, 22, 2658–2666. [Google Scholar]
- Toedling, J.; Schmeier, S.; Heinig, M.; Georgi, B.; Roepcke, S. MACAT—Microarray chromosome analysis tool. Bioinformatics 2005, 21, 2112–2113. [Google Scholar]
- Turkheimer, F.E.; Roncaroli, F.; Hennuy, B.; Herens, C.; Nguyen, M.; Martin, D.; Evrard, A.; Bours, V.; Boniver, J.; Deprez, M. Chromosomal patterns of gene expression from microarray data: Methodology, validation and clinical relevance in gliomas. BMC Bioinform 2006, 7, 526. [Google Scholar]
- Barry, W.T.; Nobel, A.B.; Wright, F.A. Significance analysis of functional categories in gene expression studies: A structured permutation approach. Bioinformatics 2005, 21, 1943–1949. [Google Scholar]
- Goeman, J.J.; van de Geer, S.A.; de Kort, F.; van Houwelingen, H.C. A global test for groups of genes: Testing association with a clinical outcome. Bioinformatics 2004, 20, 93–99. [Google Scholar]
- Subramanian, A.; Tamayo, P.; Mootha, V.K.; Mukherjee, S.; Ebert, B.L.; Gillette, M.A.; Paulovich, A.; Pomeroy, S.L.; Golub, T.R.; Lander, E.S.; et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 2005, 102, 15545–15550. [Google Scholar]
- Tian, L.; Greenberg, S.A.; Kong, S.W.; Altschuler, J.; Kohane, I.S.; Park, P.J. Discovering statistically significant pathways in expression profiling studies. Proc. Natl. Acad. Sci. USA 2005, 102, 13544–13549. [Google Scholar]
- Levin, A.M.; Ghosh, D.; Cho, K.R.; Kardia, S.L. A model-based scan statistic for identifying extreme chromosomal regions of gene expression in human tumors. Bioinformatics 2005, 21, 2867–2874. [Google Scholar]
- Efron, B.; Tibshirani, R. On testing the significance of sets of genes. Ann. Appl. Stat 2007, 1, 107–129. [Google Scholar]
- Tarca, A.L.; Draghici, S.; Khatri, P.; Hassan, S.S.; Mittal, P.; Kim, J.S.; Kim, C.J.; Kusanovic, J.P.; Romero, R. A novel signaling pathway impact analysis. Bioinformatics 2009, 25, 75–82. [Google Scholar]
- Martini, P.; Sales, G.; Massa, M.S.; Chiogna, M.; Romualdi, C. Along signal paths: An empirical gene set approach exploiting pathway topology. Nucleic Acids Res 2013, 41, e19. [Google Scholar]
- Ackermann, M.; Strimmer, K. A general modular framework for gene set enrichment analysis. BMC Bioinform 2009, 10, 47. [Google Scholar]
- Nam, D.; Kim, S.Y. Gene-set approach for expression pattern analysis. Brief. Bioinform 2008, 9, 189–197. [Google Scholar]
- Sales, G.; Calura, E.; Cavalieri, D.; Romualdi, C. Graphite—A Bioconductor package to convert pathway topology to gene network. BMC Bioinform 2012, 13, 20. [Google Scholar]
- Martini, P.; Risso, D.; Sales, G.; Romualdi, C.; Lanfranchi, G.; Cagnin, S. Statistical Test of Expression Pattern (STEPath): A new strategy to integrate gene expression data with genomic information in individual and meta-analysis studies. BMC Bioinform 2011, 12, 92. [Google Scholar]
- Massa, M.S.; Chiogna, M.; Romualdi, C. Gene set analysis exploiting the topology of a pathway. BMC Syst. Biol 2010, 4, 121. [Google Scholar]
- Sales, G.; Calura, E.; Martini, P.; Romualdi, C. Graphite Web: Web tool for gene set analysis exploiting pathway topology. Nucleic Acids Res 2013, 41, W89–W97. [Google Scholar]
- Morgat, A.; Coissac, E.; Coudert, E.; Axelsen, K.B.; Keller, G.; Bairoch, A.; Bridge, A.; Bougueleret, L.; Xenarios, I.; Viari, A. UniPathway: A resource for the exploration and annotation of metabolic pathways. Nucleic Acids Res 2012, 40, D761–D769. [Google Scholar]
- Kanehisa, M.; Goto, S.; Furumichi, M.; Tanabe, M.; Hirakawa, M. KEGG for representation and analysis of molecular networks involving diseases and drugs. Nucleic Acids Res 2010, 38, D355–D360. [Google Scholar]
- Caspi, R.; Foerster, H.; Fulcher, C.A.; Kaipa, P.; Krummenacker, M.; Latendresse, M.; Paley, S.; Rhee, S.Y.; Shearer, A.G.; Tissier, C.; et al. The MetaCyc Database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases. Nucleic Acids Res 2008, 36, D623–D631. [Google Scholar]
- Joshi-Tope, G.; Gillespie, M.; Vastrik, I.; D’Eustachio, P.; Schmidt, E.; de Bono, B.; Jassal, B.; Gopinath, G.R.; Wu, G.R.; Matthews, L.; et al. Reactome: A knowledgebase of biological pathways. Nucleic Acids Res 2005, 33, D428–D432. [Google Scholar]
- Friedman, N. Inferring cellular networks using probabilistic graphical models. Science 2004, 303, 799–805. [Google Scholar]
- Butte, A.J.; Tamayo, P.; Slonim, D.; Golub, T.R.; Kohane, I.S. Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networks. Proc. Natl. Acad. Sci. USA 2000, 97, 12182–12186. [Google Scholar]
- Schafer, J.; Strimmer, K. An empirical Bayes approach to inferring large-scale gene association networks. Bioinformatics 2005, 21, 754–764. [Google Scholar]
- Markowetz, F.; Spang, R. Inferring cellular networks—A review. BMC Bioinform 2007, 8, S5. [Google Scholar]
- Basso, K.; Margolin, A.A.; Stolovitzky, G.; Klein, U.; Dalla-Favera, R.; Califano, A. Reverse engineering of regulatory networks in human B cells. Nat. Genet 2005, 37, 382–390. [Google Scholar]
- Margolin, A.A.; Nemenman, I.; Basso, K.; Wiggins, C.; Stolovitzky, G.; Dalla Favera, R.; Califano, A. ARACNE: An algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context. BMC Bioinform 2006, 7, S7. [Google Scholar]
- Sales, G.; Romualdi, C. Parmigene—A parallel R package for mutual information estimation and gene network reconstruction. Bioinformatics 2011, 27, 1876–1877. [Google Scholar]
- Marbach, D.; Costello, J.C.; Kuffner, R.; Vega, N.M.; Prill, R.J.; Camacho, D.M.; Allison, K.R.; Consortium, D.; Kellis, M.; Collins, J.J.; et al. Wisdom of crowds for robust gene network inference. Nat. Methods 2012, 9, 796–804. [Google Scholar]
- Macias, S.; Plass, M.; Stajuda, A.; Michlewski, G.; Eyras, E.; Caceres, J.F. DGCR8 HITS-CLIP reveals novel functions for the Microprocessor. Nat. Struct. Mol. Biol 2012, 19, 760–766. [Google Scholar]
- Thomson, D.W.; Bracken, C.P.; Goodall, G.J. Experimental strategies for microRNA target identification. Nucleic Acids Res 2011, 39, 6845–6853. [Google Scholar]
- Hafner, M.; Landthaler, M.; Burger, L.; Khorshid, M.; Hausser, J.; Berninger, P.; Rothballer, A.; Ascano, M., Jr.; Jungkamp, A.C.; Munschauer, M.; et al. Transcriptome-wide identification of RNA-binding protein and microRNA target sites by PAR-CLIP. Cell 2010, 141, 129–141. [Google Scholar]
- Chi, S.W.; Zang, J.B.; Mele, A.; Darnell, R.B. Argonaute HITS-CLIP decodes microRNA-mRNA interaction maps. Nature 2009, 460, 479–486. [Google Scholar]
- Yousef, M.; Showe, L.; Showe, M. A study of microRNAs in silico and in vivo: Bioinformatics approaches to microRNA discovery and target identification. FEBS J 2009, 276, 2150–2156. [Google Scholar]
- Witkos, T.M.; Koscianska, E.; Krzyzosiak, W.J. Practical aspects of microRNA target prediction. Curr. Mol. Med 2011, 11, 93–109. [Google Scholar]
- Sales, G.; Coppe, A.; Bisognin, A.; Biasiolo, M.; Bortoluzzi, S.; Romualdi, C. MAGIA, a web-based tool for miRNA and Genes Integrated Analysis. Nucleic Acids Res 2010, 38, W352–W359. [Google Scholar]
- Bisognin, A.; Sales, G.; Coppe, A.; Bortoluzzi, S.; Romualdi, C. MAGIA2: From miRNA and genes expression data integrative analysis to microRNA-transcription factor mixed regulatory circuits (2012 update). Nucleic Acids Res 2012, 40, W13–W21. [Google Scholar]
- Nam, S.; Li, M.; Choi, K.; Balch, C.; Kim, S.; Nephew, K.P. MicroRNA and mRNA integrated analysis (MMIA): A web tool for examining biological functions of microRNA expression. Nucleic Acids Res 2009, 37, W356–W362. [Google Scholar]
- Ross, J.W.; Fernandez de Castro, J.P.; Zhao, J.; Samuel, M.; Walters, E.; Rios, C.; Bray-Ward, P.; Jones, B.W.; Marc, R.E.; Wang, W.; et al. Generation of an inbred miniature pig model of retinitis pigmentosa. Investig. Ophthalmol. Vis. Sci 2012, 53, 501–507. [Google Scholar]
- Maxmen, A. Model pigs face messy path. Nature 2012, 486, 453. [Google Scholar]
- Sandrin, M.S.; Loveland, B.E.; McKenzie, I.F. Genetic engineering for xenotransplantation. J. Card. Surg 2001, 16, 448–457. [Google Scholar]
- Ekser, B.; Rigotti, P.; Gridelli, B.; Cooper, D.K. Xenotransplantation of solid organs in the pig-to-primate model. Transpl. Immunol 2009, 21, 87–92. [Google Scholar]
- Zhang, Q.; Widmer, G.; Tzipori, S. A pig model of the human gastrointestinal tract. Gut Microbes 2013, 4, 193–200. [Google Scholar]
- Kragh, P.M.; Nielsen, A.L.; Li, J.; Du, Y.; Lin, L.; Schmidt, M.; Bogh, I.B.; Holm, I.E.; Jakobsen, J.E.; Johansen, M.G.; et al. Hemizygous minipigs produced by random gene insertion and handmade cloning express the Alzheimer’s disease-causing dominant mutation APPsw. Transgenic Res 2009, 18, 545–558. [Google Scholar]
- Granada, J.F.; Kaluza, G.L.; Wilensky, R.L.; Biedermann, B.C.; Schwartz, R.S.; Falk, E. Porcine models of coronary atherosclerosis and vulnerable plaque for imaging and interventional research. EuroIntervention 2009, 5, 140–148. [Google Scholar]
- Groenen, M.A.; Archibald, A.L.; Uenishi, H.; Tuggle, C.K.; Takeuchi, Y.; Rothschild, M.F.; Rogel-Gaillard, C.; Park, C.; Milan, D.; Megens, H.J.; et al. Analyses of pig genomes provide insight into porcine demography and evolution. Nature 2012, 491, 393–398. [Google Scholar]
- Li, M.; Wu, H.; Luo, Z.; Xia, Y.; Guan, J.; Wang, T.; Gu, Y.; Chen, L.; Zhang, K.; Ma, J.; et al. An atlas of DNA methylomes in porcine adipose and muscle tissues. Nat. Commun 2012, 3, 850. [Google Scholar]
- Fairbairn, L.; Kapetanovic, R.; Beraldi, D.; Sester, D.P.; Tuggle, C.K.; Archibald, A.L.; Hume, D.A. Comparative analysis of monocyte subsets in the pig. J. Immunol 2013, 190, 6389–6396. [Google Scholar]
- Martins, R.P.; Lorenzi, V.; Arce, C.; Lucena, C.; Carvajal, A.; Garrido, J.J. Innate and adaptive immune mechanisms are effectively induced in ileal Peyer’s patches of Salmonella typhimurium infected pigs. Dev. Comp. Immunol 2013, 41, 100–104. [Google Scholar]
- Hulst, M.; Smits, M.; Vastenhouw, S.; de Wit, A.; Niewold, T.; van der Meulen, J. Transcription networks responsible for early regulation of Salmonella-induced inflammation in the jejunum of pigs. J. Inflamm 2013, 10, 18. [Google Scholar]
- Adler, M.; Murani, E.; Brunner, R.; Ponsuksili, S.; Wimmers, K. Transcriptomic response of porcine PBMCs to vaccination with tetanus toxoid as a model antigen. PLoS One 2013, 8, e58306. [Google Scholar]
- Freeman, T.C.; Ivens, A.; Baillie, J.K.; Beraldi, D.; Barnett, M.W.; Dorward, D.; Downing, A.; Fairbairn, L.; Kapetanovic, R.; Raza, S.; et al. A gene expression atlas of the domestic pig. BMC Biol 2012, 10, 90. [Google Scholar]
- McDaneld, T.G.; Smith, T.P.; Harhay, G.P.; Wiedmann, R.T. Next-generation sequencing of the porcine skeletal muscle transcriptome for computational prediction of microRNA gene targets. PLoS One 2012, 7, e42039. [Google Scholar]
- Zhou, B.; Liu, H.L.; Shi, F.X.; Wang, J.Y. MicroRNA expression profiles of porcine skeletal muscle. Anim. Genet 2010, 41, 499–508. [Google Scholar]
- Liu, Y.; Li, M.; Ma, J.; Zhang, J.; Zhou, C.; Wang, T.; Gao, X.; Li, X. Identification of differences in microRNA transcriptomes between porcine oxidative and glycolytic skeletal muscles. BMC Mol. Biol 2013, 14, 7. [Google Scholar]
- Siengdee, P.; Trakooljul, N.; Murani, E.; Schwerin, M.; Wimmers, K.; Ponsuksili, S. Transcriptional profiling and miRNA-dependent regulatory network analysis of longissimus dorsi muscle during prenatal and adult stages in two distinct pig breeds. Anim. Genet 2013, 44, 398–407. [Google Scholar]
- McDaneld, T.G.; Smith, T.P.; Doumit, M.E.; Miles, J.R.; Coutinho, L.L.; Sonstegard, T.S.; Matukumalli, L.K.; Nonneman, D.J.; Wiedmann, R.T. MicroRNA transcriptome profiles during swine skeletal muscle development. BMC Genomics 2009, 10, 77. [Google Scholar]
- Huang, T.H.; Zhu, M.J.; Li, X.Y.; Zhao, S.H. Discovery of porcine microRNAs and profiling from skeletal muscle tissues during development. PLoS One 2008, 3, e3225. [Google Scholar]
- Shen, H.; Liu, T.; Fu, L.; Zhao, S.; Fan, B.; Cao, J.; Li, X. Identification of microRNAs involved in dexamethasone-induced muscle atrophy. Mol. Cell. Biochem 2013, 381, 105–113. [Google Scholar]
- Timoneda, O.; Balcells, I.; Nunez, J.I.; Egea, R.; Vera, G.; Castello, A.; Tomas, A.; Sanchez, A. miRNA expression profile analysis in kidney of different porcine breeds. PLoS One 2013, 8, e55402. [Google Scholar]
- Li, A.; Song, T.; Wang, F.; Liu, D.; Fan, Z.; Zhang, C.; He, J.; Wang, S. MicroRNAome and expression profile of developing tooth germ in miniature pigs. PLoS One 2012, 7, e52256. [Google Scholar]
- Sharbati, S.; Friedlander, M.R.; Sharbati, J.; Hoeke, L.; Chen, W.; Keller, A.; Stahler, P.F.; Rajewsky, N.; Einspanier, R. Deciphering the porcine intestinal microRNA transcriptome. BMC Genomics 2010, 11, 275. [Google Scholar]
- Podolska, A.; Kaczkowski, B.; Kamp Busk, P.; Sokilde, R.; Litman, T.; Fredholm, M.; Cirera, S. MicroRNA expression profiling of the porcine developing brain. PLoS One 2011, 6, e14494. [Google Scholar]
- Zhou, Y.; Tang, X.; Song, Q.; Ji, Y.; Wang, H.; Jiao, H.; Ouyang, H.; Pang, D. Identification and characterization of pig embryo microRNAs by Solexa sequencing. Reprod. Domest. Anim 2013, 48, 112–120. [Google Scholar]
- Lian, C.; Sun, B.; Niu, S.; Yang, R.; Liu, B.; Lu, C.; Meng, J.; Qiu, Z.; Zhang, L.; Zhao, Z. A comparative profile of the microRNA transcriptome in immature and mature porcine testes using Solexa deep sequencing. FEBS J 2012, 279, 964–975. [Google Scholar]
- Li, M.; Liu, Y.; Wang, T.; Guan, J.; Luo, Z.; Chen, H.; Wang, X.; Chen, L.; Ma, J.; Mu, Z.; et al. Repertoire of porcine microRNAs in adult ovary and testis by deep sequencing. Int. J. Biol. Sci 2011, 7, 1045–1055. [Google Scholar]
- Curry, E.; Safranski, T.J.; Pratt, S.L. Differential expression of porcine sperm microRNAs and their association with sperm morphology and motility. Theriogenology 2011, 76, 1532–1539. [Google Scholar]
- Luo, L.; Ye, L.; Liu, G.; Shao, G.; Zheng, R.; Ren, Z.; Zuo, B.; Xu, D.; Lei, M.; Jiang, S.; et al. Microarray-based approach identifies differentially expressed microRNAs in porcine sexually immature and mature testes. PLoS One 2010, 5, e11744. [Google Scholar]
- Li, H.; Xi, Q.; Xiong, Y.; Cheng, X.; Qi, Q.; Yang, L.; Shu, G.; Wang, S.; Wang, L.; Gao, P.; et al. A comprehensive expression profile of microRNAs in porcine pituitary. PLoS One 2011, 6, e24883. [Google Scholar]
- Li, H.Y.; Xi, Q.Y.; Xiong, Y.Y.; Liu, X.L.; Cheng, X.; Shu, G.; Wang, S.B.; Wang, L.N.; Gao, P.; Zhu, X.T.; et al. Identification and comparison of microRNAs from skeletal muscle and adipose tissues from two porcine breeds. Anim. Genet 2012, 43, 704–713. [Google Scholar]
- Martini, P.; Sales, G.; Brugiolo, M.; Gandaglia, A.; Naso, F.; De Pitta’, C.; Spina, M.; Gerosa, G.; Romualdi, C.; Cagnin, S.; et al. Tissue-specific expression and regulatory networks of pig microRNAome. PLoS One 2013. unpublished work. [Google Scholar]
- Nelson, P.T.; Baldwin, D.A.; Kloosterman, W.P.; Kauppinen, S.; Plasterk, R.H.; Mourelatos, Z. RAKE and LNA-ISH reveal microRNA expression and localization in archival human brain. RNA 2006, 12, 187–191. [Google Scholar]
- Nelson, P.T.; Baldwin, D.A.; Scearce, L.M.; Oberholtzer, J.C.; Tobias, J.W.; Mourelatos, Z. Microarray-based, high-throughput gene expression profiling of microRNAs. Nat. Methods 2004, 1, 155–161. [Google Scholar]
- Kronick, M.N. Creation of the whole human genome microarray. Expert Rev. Proteomics 2004, 1, 19–28. [Google Scholar]
- Wong, A.P.; Nili, N.; Strauss, B.H. In vitro differences between venous and arterial-derived smooth muscle cells: Potential modulatory role of decorin. Cardiovasc. Res 2005, 65, 702–710. [Google Scholar]
- Ross, J.M.; McIntire, L.V.; Moake, J.L.; Rand, J.H. Platelet adhesion and aggregation on human type VI collagen surfaces under physiological flow conditions. Blood 1995, 85, 1826–1835. [Google Scholar]
- Smedegard, G.; Hedqvist, P.; Dahlen, S.E.; Revenas, B.; Hammarstrom, S.; Samuelsson, B. Leukotriene C4 affects pulmonary and cardiovascular dynamics in monkey. Nature 1982, 295, 327–329. [Google Scholar]
- Pawloski, J.R.; Chapnick, B.M. Antagonism of LTD4-evoked relaxation in canine renal artery and vein. Am. J. Physiol 1993, 265, H980–H985. [Google Scholar]
- Brink, C.; Dahlen, S.E.; Drazen, J.; Evans, J.F.; Hay, D.W.; Nicosia, S.; Serhan, C.N.; Shimizu, T.; Yokomizo, T. International Union of Pharmacology XXXVII. Nomenclature for leukotriene and lipoxin receptors. Pharmacol. Rev 2003, 55, 195–227. [Google Scholar]
- Inoue, H.; Taba, Y.; Miwa, Y.; Yokota, C.; Miyagi, M.; Sasaguri, T. Transcriptional and posttranscriptional regulation of cyclooxygenase-2 expression by fluid shear stress in vascular endothelial cells. Arterioscler. Thromb. Vasc. Biol 2002, 22, 1415–1420. [Google Scholar]
- Dahboul, F.; Leroy, P.; Maguin Gate, K.; Boudier, A.; Gaucher, C.; Liminana, P.; Lartaud, I.; Pompella, A.; Perrin-Sarrado, C. Endothelial γ-glutamyltransferase contributes to the vasorelaxant effect of S-nitrosoglutathione in rat aorta. PLoS One 2012, 7, e43190. [Google Scholar]
- Yousaf, N.; Howard, J.C.; Williams, B.D. Studies in the rat of antibody-coated and N-ethylmaleimide-treated erythrocyte clearance by the spleen. I. Effects of in vivo complement activation. Immunology 1986, 59, 75–79. [Google Scholar]
- Ravingerova, T.; Barancik, M.; Strniskova, M. Mitogen-activated protein kinases: A new therapeutic target in cardiac pathology. Mol. Cell. Biochem 2003, 247, 127–138. [Google Scholar]
- Howe, G.A.; Addison, C.L. RhoB controls endothelial cell morphogenesis in part via negative regulation of RhoA. Vasc. Cell 2012, 4, 1. [Google Scholar]
- Torella, D.; Iaconetti, C.; Catalucci, D.; Ellison, G.M.; Leone, A.; Waring, C.D.; Bochicchio, A.; Vicinanza, C.; Aquila, I.; Curcio, A.; et al. MicroRNA-133 controls vascular smooth muscle cell phenotypic switch in vitro and vascular remodeling in vivo. Circ. Res. 2011, 109, 880–893. [Google Scholar]
- Kobayashi, R.; Kubota, T.; Hidaka, H. Purification, characterization, and partial sequence analysis of a new 25-kDa actin-binding protein from bovine aorta: A SM22 homolog. Biochem. Biophys. Res. Commun 1994, 198, 1275–1280. [Google Scholar]
- Rhodes, L.V.; Tilghman, S.L.; Boue, S.M.; Wang, S.; Khalili, H.; Muir, S.E.; Bratton, M.R.; Zhang, Q.; Wang, G.; Burow, M.E.; et al. Glyceollins as novel targeted therapeutic for the treatment of triple-negative breast cancer. Oncol. Lett 2012, 3, 163–171. [Google Scholar]
- Cordes, K.R.; Sheehy, N.T.; White, M.P.; Berry, E.C.; Morton, S.U.; Muth, A.N.; Lee, T.H.; Miano, J.M.; Ivey, K.N.; Srivastava, D. miR-145 and miR-143 regulate smooth muscle cell fate and plasticity. Nature 2009, 460, 705–710. [Google Scholar]
- Ikeda, S.; Pu, W.T. Expression and function of microRNAs in heart disease. Curr. Drug Targets 2010, 11, 913–925. [Google Scholar]
- Chhabra, R.; Dubey, R.; Saini, N. Cooperative and individualistic functions of the microRNAs in the miR-23a~27a~24-2 cluster and its implication in human diseases. Mol. Cancer 2010, 9, 232. [Google Scholar]
- Li, S.; Ran, Y.; Zhang, D.; Chen, J.; Zhu, D. MicroRNA-138 plays a role in hypoxic pulmonary vascular remodelling by targeting Mst1. Biochem. J 2013, 452, 281–291. [Google Scholar]
- Park, C.; Yan, W.; Ward, S.M.; Hwang, S.J.; Wu, Q.; Hatton, W.J.; Park, J.K.; Sanders, K.M.; Ro, S. MicroRNAs dynamically remodel gastrointestinal smooth muscle cells. PLoS One 2011, 6, e18628. [Google Scholar]
- Talasila, A.; Yu, H.; Ackers-Johnson, M.; Bot, M.; van Berkel, T.; Bennett, M.; Bot, I.; Sinha, S. Myocardin regulates vascular response to injury through miR-24/-29a and platelet-derived growth factor recepto-β. Arterioscler. Thromb. Vasc. Biol 2013, 33, 2355–2365. [Google Scholar]
- Saeed, A.I.; Sharov, V.; White, J.; Li, J.; Liang, W.; Bhagabati, N.; Braisted, J.; Klapa, M.; Currier, T.; Thiagarajan, M.; et al. TM4: A free, open-source system for microarray data management and analysis. Biotechniques 2003, 34, 374–378. [Google Scholar]
- Raychaudhuri, S.; Stuart, J.M.; Altman, R.B. Principal components analysis to summarize microarray experiments: Application to sporulation time series. Pac. Symp. Biocomput 2000, 5, 455–466. [Google Scholar]
- Fellenberg, K.; Hauser, N.C.; Brors, B.; Neutzner, A.; Hoheisel, J.D.; Vingron, M. Correspondence analysis applied to microarray data. Proc. Natl. Acad. Sci. USA 2001, 98, 10781–10786. [Google Scholar]
- Huang da, W.; Sherman, B.T.; Lempicki, R.A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc 2009, 4, 44–57. [Google Scholar]
- Risso, D.; Massa, M.S.; Chiogna, M.; Romualdi, C. A modified LOESS normalization applied to microRNA arrays: A comparative evaluation. Bioinformatics 2009, 25, 2685–2691. [Google Scholar]
- Spath, H. Two Dimensional Spline Interpolation Algorithms; A K Peters/CRC Press: Wellesley, MA, USA, 1995. [Google Scholar]
- Kanehisa, M.; Goto, S.; Kawashima, S.; Okuno, Y.; Hattori, M. The KEGG resource for deciphering the genome. Nucleic Acids Res 2004, 32, D277–D280. [Google Scholar]
- Saito, R.; Smoot, M.E.; Ono, K.; Ruscheinski, J.; Wang, P.L.; Lotia, S.; Pico, A.R.; Bader, G.D.; Ideker, T. A travel guide to Cytoscape plugins. Nat. Methods 2012, 9, 1069–1076. [Google Scholar]
- Doncheva, N.T.; Assenov, Y.; Domingues, F.S.; Albrecht, M. Topological analysis and interactive visualization of biological networks and protein structures. Nat. Protoc 2012, 7, 670–685. [Google Scholar]
© 2013 by the authors; licensee MDPI, Basel, Switzerland This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).