Characterization of Alternative Splicing Events in Porcine Skeletal Muscles with Different Intramuscular Fat Contents

Hao, Wanjun; Yang, Zewei; Sun, Yuanlu; Li, Jiaxin; Zhang, Dongjie; Liu, Di; Yang, Xiuqin

doi:10.3390/biom12020154

Open AccessArticle

Characterization of Alternative Splicing Events in Porcine Skeletal Muscles with Different Intramuscular Fat Contents

by

Wanjun Hao

¹,

Zewei Yang

¹,

Yuanlu Sun

¹,

Jiaxin Li

¹,

Dongjie Zhang

²

,

Di Liu

^2,* and

Xiuqin Yang

^1,*

¹

College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, China

²

Institute of Animal Husbandry, Heilongjiang Academy of Agricultural Sciences, Harbin 150086, China

^*

Authors to whom correspondence should be addressed.

Biomolecules 2022, 12(2), 154; https://doi.org/10.3390/biom12020154

Submission received: 3 December 2021 / Revised: 11 January 2022 / Accepted: 14 January 2022 / Published: 18 January 2022

(This article belongs to the Special Issue Omics Approaches to Understanding Skeletal Muscle Biology)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Meat quality is one of the most important economic traits in pig breeding and production. Intramuscular fat (IMF) is a major factor that improves meat quality. To better understand the alternative splicing (AS) events underlying meat quality, long-read isoform sequencing (Iso-seq) was used to identify differential (D)AS events between the longissimus thoracis (LT) and semitendinosus (ST), which differ in IMF content, together with short-read RNA-seq. Through Iso-seq analysis, we identified a total of 56,789 novel transcripts covering protein-coding genes, lncRNA, and fusion transcripts that were not previously annotated in pigs. We also identified 456,965 AS events, among which 3930 were DAS events, corresponding to 2364 unique genes. Through integrative analysis of Iso-seq and RNA-seq, we identified 1174 differentially expressed genes (DEGs), among which 122 were DAS genes, i.e., DE-DAS genes. There are 12 overlapped pathways between the top 20 DEGs and DE-DAS genes, as revealed by KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis, indicating that DE-DAS genes play important roles in the differential phenotype of LT and ST. Further analysis showed that upregulated DE-DAS genes are more important than downregulated ones in IMF deposition. Fatty acid degradation and the PPAR (peroxisome proliferator-activated receptor) signaling pathway were found to be the most important pathways regulating the differential fat deposition of the two muscles. The results update the existing porcine genome annotations and provide data for the in-depth exploration of the mechanisms underlying meat quality and IMF deposition.

Keywords:

meat quality; intramuscular fat content; alternative splicing; differentially expressed genes

1. Introduction

Meat quality is emerging as one of the most important economic traits in pig breeding and production, owing to the demands of consumers. Meat quality is a comprehensive indicator composed of the intramuscular fat (IMF) content, pH, tenderness, meat color, water-holding capacity, muscle fiber composition, etc. Most traits of meat quality exhibit low to moderate heritability [1,2], and they are expensive to measure. Selecting a meat quality using traditional methods, such as best linear unbiased prediction, is difficult. As such, molecular biology techniques have been developed to improve meat quality through molecular breeding strategies, for example, molecular marker-assisted selection. Many studies have focused on revealing the mechanisms underlying meat quality, which are the prerequisite for the molecular selection of the trait.

Now, high-throughput characterization of genetic mechanisms underlying meat quality can be performed efficiently with the development of next-generation sequencing technology. Many efforts have been directed at mRNA, LncRNA, miRNA, quantitative trait loci, and single-nucleotide polymorphisms, as well at the genome-wide level, with the aim of elucidating genetic factors affecting meat quality [3,4,5,6], which have revealed key genes and pathways implicated in meat quality. However, no efforts have been devoted to determining the effect of alternative splicing (AS) on meat quality.

AS is a ubiquitous phenomenon in mammals that leads to production of multiple mRNA species from a single gene [7,8]. AS can be grouped into five basic patterns: exon skipping (ES), mutually exclusive exons (MEE), alternative acceptor site (AAS), alternative donor site (ADS), and intron retention (IR) [9,10,11]. It is now well-established that, regardless of whether novel functional protein isoforms are produced, AS is an important mechanism regulating gene expression. It not only increases transcriptome and proteome diversity, but modulates the major mRNA levels by producing multiple mRNAs from the same pre-mRNA. Additionally, AS can regulate the role of other RNAs by competing with them for the same regulators [12].

Through genome-wide analysis, it has been estimated that 90–95% of multi-exon genes undergo AS [10,13]. At present, the human coding genes are annotated with four unique transcripts, on average, in the reference set [14], whereas studies suggested that, on average, at least 10 AS transcripts exist for each human coding gene [15,16]. Many AS transcripts remain to be identified. It was shown that about two-thirds of AS events are tissue-specific in humans [13], and the major isoform varies according to tissue and cell lines [17], indicating the importance of AS in tissue differentiation. AS events have been widely implicated in physiological processes, including tissue identity acquisition, organ development, etc. [18]. Dysregulation of AS can be related to a vast repertoire of diseases [19].

Long-read isoform sequencing (Iso-seq) technology, developed by Pacific Biosciences (PacBio), is capable of precisely and conveniently characterizing AS events through directly sequencing full-length transcripts [20,21,22]. Iso-seq analyses have been performed in humans, chicken, and pigs [23,24,25], in which numerous AS events were newly identified, and the regulation of AS was shown to be rigorous. However, the physiological relevance of these AS events is largely unknown. Additionally, reports on the AS with Iso-seq analysis in pigs were mainly performed on mixed tissues [23,26]; no separate analysis was performed of the skeletal muscles. The role of AS in IMF content remains to be identified.

Longissimus thoracis (LT) and semitendinosus (ST), known to differ in biochemical and histological composition, including connective tissue content, proportions of muscle fiber types, etc., are widely used in studies of meat quality [27,28,29]. They differ in many indicators of meat quality. Compared to ST, LT is more tender, lower in moisture content, and higher in lightness value [30]. The IMF content is also higher in LT than in ST, and lipid deposition occurs earlier and is higher in the intramuscular preadipocytes of LT than those of ST in pigs [27,28,31]. The IMF content is positively correlated with tenderness, juiciness, and flavor [32], and, thus, is important for improving meat quality. Although differences in other factors such as muscle fiber composition can affect pork quality, it is interesting to focus on the different IMF contents between LT and ST and to reveal its underlying mechanisms.

Here, we profiled AS events and identified genome-wide novel transcripts in porcine skeletal muscles using the Iso-seq technique. We characterized the differential (D)AS between LT and ST by combined Iso-seq and short-read RNA sequencing (RNA-seq) analyses, and we highlighted the functional pathways involved in IMF deposition. Our results provide valuable data for further revealing the mechanisms underlying meat quality.

2. Materials and Methods

2.1. Animals, Tissues, and RNA

LT and ST muscles were sampled from three 210-day-old Min pigs, a Chinese indigenous pig breed. The pigs were provided by the Institute of Animal Husbandry, Heilongjiang Academy of Agricultural Sciences, Harbin, China. The pigs were raised under the same conditions including food, housing, etc. Our animal treatment protocol was approved by the Animal Care Committee of Northeast Agricultural University. LT and ST muscles were collected immediately after slaughter and frozen in liquid nitrogen. Total RNA was isolated with TRIzol reagent (Invitrogen, CA, USA) from each tissue and assessed with a NanoPhotometer^® spectrophotometer (IMPLEN, CA, USA) and an Agilent 2100 Bioanalyzer (Agilent Technologies, CA, USA). RNA concentration was measured with a Qubit 3.0 Fluorometer (Life Technologies, CA, USA).

2.2. PacBio Library Construction and Sequencing

The PacBio library was constructed with equally mixed RNA from LT and ST muscles (n = 3). The full-length cDNA was synthesized with a SMARTerTM PCR cDNA Synthesis Kit (Clonetech, CA, USA) and Oligo dT primer from 1 µg RNA. PCR amplification was carried out with a KAPA HiFi PCR Kit (Roche, Shanghai, China), and primers were added into the end of full-length cDNA in the reverse transcription. The resultant products were size-selected with AMPure PB magnetic beads (PacBio, CA, USA) to obtain fragments not less than 1 kb. End repair was performed for full-length cDNA, and then the single molecular real-time sequencing (SMRT) dumbbell connector was connected. After digesting the unconnected fragments with exonuclease, the cDNA was purified with PB magnetic beads (PacBio) to obtain the sequencing library. The quality of the Iso-seq library was controlled with Qubit 3.0 (Life Technologies) for accurate quantification and an Agilent 2100 (Agilent Technologies) for size detection. The library was sequenced on the sequel II platform (PacBio) by Frasergen Inc. (Wuhan, China).

2.3. Raw Read Processing

The SMRTlink program (http://www.pacb.com/, accessed on 1 December 2021) was used for processing the raw reads to obtain full-length nonconcatemer (FLNC) reads with 15,000 ≥ length ≥ 50, passes number ≥ 3, and predicted accuracy ≥ 99%. Briefly, the raw polymerase reads generated by PacBio sequencing were first filtered and trimmed to produce subreads, and then error correction was carried out to obtain circular consensus sequences (CCSs). The nonconcatemer reads containing 5′ and 3′ adapters were regarded as FLNC reads, amongst which those having a poly(A) tail, FLNC_poly(A), were used for further analysis. Iterative clustering for error correction was applied to obtain consensus sequences. Errors in FLNC reads were further corrected with the high-quality Illumina short reads using LoRDEC with the options of -k 21 -s 3 [33]. The FLNC reads before and after error correction were mapped to the pig reference genome assembly (Sus Scrofa 11.1) using GMAP [34] with parameters of -no-chimeras -n 10, and those reads with a high percent of identity (PID) were used for annotation of loci and isoforms.

2.4. Loci and Isoform Annotation

Two sequences that had an overlap of ≥20% at the origin of alignment, at least one exon with ≥20% overlap, and the same transcriptional direction were characterized as the same loci transcript. Isoforms were identified in the same loci through removal of redundant and false-positive gene structure with the following criteria: (i) isoform with inner splice sites identical to the longer one; (ii) those missing the 5′ end; (iii) structures aligned by only one FLNC. In this situation, if all the junctions were annotated in the reference genome or supported by RNA-seq data, the isoforms were retained.

2.5. Novel Gene Identification

The annotated results were analyzed to determine the novel isoforms of annotated or novel genes through comparison with data in the reference genomes. The isoforms of novel genes were regarded as those that had an overlap of less than 20% or an overlap of more than 20% but transcribed in a different direction than the annotated genes. The novel isoforms of annotated genes were those having novel splice sites compared to the annotated ones. Additionally, if one of the two aligned sequences was a single exon and the other was not, the FLNC was also regarded as a novel isoform of the annotated gene.

The protein-coding potential was analyzed through screening novel isoforms against the NR, GO, KO, KOG, and Swiss-Prot databases with the BLASTX program using an e-value ≤ 1 × 10⁻⁵; the remaining ones were analyzed with the CPAT program (version 1.2.2) [35] to identify lncRNA with default parameters.

Fusion transcripts were identified with the following criteria: (i) the FLNC mapped to different annotated gene loci; (ii) more than 20% overlap at each locus; (iii) the junction site was supported by Illumina reads.

2.6. Alternative Splicing Analysis

AS events in Iso-seq data were first classified using Astalavista with default parameters [36]. Then, Illumina reads were aligned with Iso-seq data to compare the AS events between LT and ST muscles with rMATS software, and DAS events were identified using junction counts and reads on target. Each of the basic AS events, ES, IR, MEE, AD and AE, includes two isoforms, of which one is longer; thus, the variable sequences could be identified as an included/excluded exon. The expression ratio of exon inclusion (EI) events to both events was applied to evaluate the AS level, and the difference (ΔEI) between the two muscles was calculated to determine the changes in the splicing of all five AS events. The DAS events were characterized based on an absolute ΔEI > 5% and FDR < 0.05.

2.7. RNA Sequencing and Data Processing

RNA-seq was performed on the Illumina HiSeq 4000 PE150 platform at Frasergen Inc. (Wuhan) to quantify gene/isoform expression in muscles. A total of 6 libraries were constructed with RNAs from LT and ST of three individuals. The samples were the same as those for Iso-seq. We used 3 µg RNA for each library construction. After removing ribosomal RNA with a Ribo-zeroTM rRNA Removal Kit (Epicentre, Wisconsin, USA) and purifying with ethanol precipitation, libraries were constructed with an NEBNext^® UltraTM Directional RNA Library Prep Kit for Illumina^® (NEB, Ipswich, MA, USA) according to the manufacturer’s recommendations.

Raw reads were cleaned using SOAPnuke [37]. The high-quality clean reads were mapped to the porcine reference genome (Sus scrofa 11.1) using HISAT2 [38]. Reads mapped perfectly or with one mismatch were employed to assemble transcripts with StringTie under the default parameters [16]. The resultant transcripts were annotated with the gffcompare program. The expression levels were measured using the fragments per kilobase per million bases (FPKM) method using the PacBio GTF annotation file. The gene transcription was compared between two muscles using DESeq2, and those having an absolute log2 fold-change > 1 and FDR < 0.05 were identified as differentially expressed genes (DEGs).

2.8. Reverse-Transcription PCR and Real-Time Quantitative PCR

Total RNA was isolated as described above and treated with DNase I to eliminate genomic DNA contamination. Reverse transcription (RT) was performed with the PrimeScriptTM RT Reagent Kit (TaKaRa, Dalian, China). PCR and real-time quantitative PCR (qPCR) was carried out as described previously [39]. RT-PCR products were visualized on 2% agarose gels or 6.5% polyacrylamide gel electrophoresis. Primer information was shown in Table S1.

3. Results

3.1. Overview of PacBio Iso-Seq Data

In total, we detected 555,753 polymerase reads representing more than 53 G bases, with an average length of >9 kb. After processing, 36,499,249 filtered subread reads with an average length of 1383 bp and 395,182 circular consensus sequences (CCSs) with an average length of 1769 bp were obtained. CCSs were further classified into non-FL (88,282), FL (306,900), FLNC (293,556), and FLNC_polyA (293,459) reads based on the presence of 5′ adaptor, 3′ adaptor, and poly(A) tails (Figure 1A). FLNC_polyA reads have an average length of 1496 bp (Figure 1B). The data were then corrected with the high-quality short reads of RNA-seq, and a total of 281,522 high-quality FLNC_polyA reads (95.93%) were obtained for further analysis.

After filtering redundancies and false positives, the high-quality FLNC_polyA reads covered 59,075 transcripts (Table S2-a) and were aligned to 31,243 loci, of which 9623 were annotated in a reference genome (Figure 1C). Based on the mapping results, the transcripts were classified into three groups: known (2286) and novel transcripts (34,605) of annotated loci and novel transcripts (22,184) that were mapped to unannotated loci (Figure 1A). Of 59,075 transcripts, 34,188 (57.87%) were multi-exon involved in 9518 loci and 24,887 (42.13%) were single exon.

Among the multi-exon genes, 5502 loci had more than one transcript, producing a total of 29,803 transcripts, accounting for 50.45% of the total transcripts obtained. The relationships between the numbers of transcripts and exon or length were analyzed, and we found that genes having more transcripts contain more exons and are longer amongst the multi-exon genes (Figure 1D,E). Full-length frequency analysis showed that 82.93% (170,454/205,544) of annotated multi-exon FLNC_polyA reads had the same initial splice donor sites as the mRNAs in the reference database, indicating the high integrity of our Iso-seq data in the structure. The degree of transcript integrity is similar to previous Iso-seq data in humans and pigs [23,40].

3.2. Identification of Novel Gene

A total of 56,789 novel transcripts (34,605 + 22,184) were discovered in this study by Iso-seq. Among the 22,184 novel isoforms of unannotated loci, 20,427 were single exon (Table S2-b), accounting for 94.46%, which is far higher than the whole level of Iso-seq data (42.13%). We found that 268 unannotated loci had more than one transcript covering 827 isoforms. Studies have shown that protein-coding genes are usually multi-exon and overwhelmingly alternatively spliced [10,20]. The results suggested that the proportion of protein-coding transcripts in the 22,184 novel isoforms was low.

The novel genes (22,184) were screened against public databases, and the results showed that 11,987 (54.03%), 1168 (5.27%), 1406 (6.34%), 478 (2.15%), and 4254 (19.18%) of them could be found in the NR, GO, KO, KOG, and Swiss-Prot databases, respectively. A total of 266 novel isoforms had significant hits in the five databases (Figure 2A). A total of 10,166 (45.83%) isoforms were not found in any of the protein databases. After filtering out potential lncRNAs amongst them, TransDecoder (http://transdecoder.sf.net, accessed on 1 December 2021) was used to predict coding sequences for the remaining isoforms, of which 80 were identified as novel protein-coding genes (Table S2-c). These 80 novel transcripts might be pig-specific or low-conserved amongst species.

A total of 11,371 lncRNAs were identified (Table S2-d). The distribution of lncRNAs in the genome is shown in Figure 2B. We also found 19 fusion transcripts that contain sequences from different annotated gene loci (Table S2-e). Of the fusion genes, 18 occur interchromosomally and the remaining one is produced by an intrachromosome event, indicating that the interchromosome event was the main source of the fusion gene (Figure 2B). RT-PCR was used to validate the novel genes identified. A total of five isoforms of unannotated loci, four novel isoforms of annotated loci, three lncRNAs, and three fusion transcripts were confirmed (Figure 2C), demonstrating that these are bona fide isoforms/loci.

3.3. Alternative Splicing Events

A total of 456,965 AS events were obtained by our Iso-seq analysis, and all of the five basic AS types were covered (Figure 3A). Among the basic AS types, IR and ES were predominant, with IR being the most abundant and MEE being the least, which is consistent with previous studies in animals [23,25]. Moreover, a large number of AS events, such as those containing multiple basic AS patterns, were grouped into other types of AS, indicating the complexity of AS. The AS events of six genes, including one lncRNA and five protein-coding genes, were validated with RT-PCR (Figure 3B), demonstrating the accuracy and potential of Iso-seq in detecting AS events. Among the multi-exon genes, the gene number gradually decreased with transcript number per increasing gene. Overall, the number of multi-exon genes detected with Iso-seq analysis was lower than that in the reference genome; the reason might be that only one differentiated tissue, muscle, was used in the Iso-seq. However, the number of genes with ≥10 transcripts was much higher than that provided by the reference genome (Figure 3C).

Many genes were found to have more transcripts than deposited in the genome database, indicating Iso-seq is an effective tool for the characterization of AS variants. For example, the creatine kinase M-type (CKM) gene (ENSSSCG00000036132) has three transcripts in reference annotation, but 743 isoforms were found in Iso-seq data. Additionally, a protein-coding gene annotated as novel without a name in the ensemble database (ENSSSCG00000010190) was found to have 1095 transcripts by Iso-seq, although only three transcripts were annotated. Li et al. [23] found the largest number of transcripts, ~337, existing in the Sus scrofa actin alpha 1, skeletal muscle (ACTA1) gene through Iso-seq of a pooled set of 38 tissues of Yorkshire pigs in which the skeletal muscle content was trace. Min pigs and the single tissue of skeletal muscle were used here. Thus, AS is breed- and tissue-specific; more isoforms remain to be identified in the pig genome.

3.4. Differential Alternative Splicing Events

Through analyzing the ΔEI, a total of 3930 DAS events were identified, among which 2016 have increased EI and 1914 have decreased EI in ST compared to LT (Table S3). ES and MEE are the top two abundant DAS types, accounting for over 80%, indicating their important role in the differentiation and development of the two muscle tissues, whereas IR events, which are the predominant AS type in the pooled tissues of LT and ST, have much less DAS between these two tissues. The distribution of increased and decreased EI events is balanced in each kind of AS event (Figure 4A). Two DAS events were selected and validated with RT-PCR (Figure 4B).

After integrating the DAS events belonging to the same gene, 3930 DAS events were found to correspond to 2364 unique genes (Table S4-a). In this study, transcripts with an expression level of FPKM ≥ 0.1 in at least one of the six samples were defined as expressed as reported in previous studies [41,42]. The expression level of DAS genes is much more abundant compared to that of the total genes identified (Figure 4C, Tables S4-a and 4-b). Most of the DAS genes have similar expression levels between the two muscles (Figure 4D).

The DAS genes were annotated into various GO terms of all three functional categories, including cellular component (CC), biological process (BP), and molecular function (MF). In the BP category, biological regulation and metabolic process are among the most highly enriched GO terms with over 1000 genes (Figure 4E, Table S4-c). KEGG classification showed that DAS genes are involved in various pathways, and the results were provided on a level 2 hierarchy (Figure 4F, Table S4-d). A total of 37 genes are involved in lipid metabolism. On the level 3 hierarchy (KEGG pathway), various pathways are related to fat formation. For example, in the signal transduction catalog, 15 pathways are involved in adipogenesis, including the Hedgehog, FoxO, Wnt, Apelin, Hippo, Jak-STAT, MAPK, PI3K-Akt, cAMP, cGMP-CKG, TGF-β, mTOR, Notch, and Phospholipase D signaling pathways. These pathways include 202 unique genes, accounting for 76.81% of the catalog. In the endocrine system catalog, there are the PPAR (peroxisome proliferator-activated receptor) signaling pathway, adipocytokine signaling pathway, and regulation of lipolysis in adipocyte. Additionally, in the carbohydrate metabolism catalog, some pathways are involved in short-chain fatty acid metabolism, such as butanoate and propanoate.

3.5. Integrated Analysis of Differential Alternative Splicing Events and Differentially Expressed Genes

To further evaluate the role of DAS genes in the differential phenotype of LT and ST, the genome-wide mRNA expression profile was compared between the two muscles through integrated analysis of Iso-seq and RNA-seq data. A total of 1174 DEGs (683 upregulated and 491 downregulated) were identified in ST compared to LT (Figure 5A, Table S5-a). Among the DEGs, 122 are DAS genes, i.e., DE-DAS genes, of which 70 are upregulated and 52 are downregulated in ST compared to LT (Figure 5B,C, Table S5-b). Seven DE-DAS genes were validated with real-time quantitative PCR (Figure 5D). Functional enrichment analysis of DE-DAS genes suggested that they are enriched in 23 BP terms, 14 MF terms, and 9 CC terms (Table S5-c). The most highly enriched BP terms, including biological regulation, signaling, and regulation of biological process, are shown in Figure 5E.

DEGs are involved in various KEGG pathways, with many being fat-related: among the top 20 most significant enrichments, 6 are related to fat formation (Figure 6A). DE-DAS genes are also mainly involved in fat-related pathways: 6 of the top 20 significantly enriched pathways are associated with fat deposition (Figure 6B). Although only 122 out of 1174 DEGs are differentially spliced, there are various KEGG pathways significantly enriched by both DEGs and DE-DAS genes; among the top 20 pathways, 12 are shared, with 5 being fat-related, indicating DAS of DEGs plays major roles in the differential phenotype of the two muscles.

Further analysis showed that upregulated DE-DAS genes are involved in more pathways related to fat formation. Among the top 20 significant enrichments, 7 are fat-related. Four out of the top five pathways are fat-related (Figure 6C). Only two fat-related pathways are significantly enriched by the downregulated DE-DAS genes (Figure 6D). Additionally, upregulated DE-DAS genes share five fat-related pathways with all the DE-DAS genes, whereas no fat-related pathways overlap between downregulated and all DE-DAS genes. These indicated that upregulated DE-DAS genes are the key factors leading to the difference in IMF content between LT and ST. The fatty acid degradation and PPAR signaling pathway are the most important pathways regulating the differential fat deposition of the two muscles.

3.6. Transcription Factors in DE-DAS Genes

Among the top 20 up- and downregulated DEGs, 18 are annotated in NR databases and 8 are transcription factors (TFs) (Table 1); additionally, a total of 111 DEGs were identified as TFs through searching the AnimalTFDB database (http://bioinfo.life.hust.edu.cn/AnimalTFDB/, accessed on 1 December 2021) with hmmscan program (Table S5-d). There are 71 upregulated and 40 downregulated TFs in ST compared to in LT. These TFs belong to various families, with the most being from the homeobox family (Figure 7A). Of these differentially expressed TFs (DE-TFs), 16 are DAS genes (DE-DAS-TFs) (Figure 7B,C). PPI analysis was performed to identify the interaction of the DE-DAS-TFs with DEGs, and the results showed that ACTN2 and RNF41 play a key role in regulating DEGs (Figure 7D).

4. Discussion

In this study, we used Iso-seq combined with RNA-seq techniques to analyze AS events in the ST and LT muscles of pig and found that 122 DE-DAS genes are key regulators in the differential phenotype of the two muscles. Additionally, a large number of AS events and novel transcripts that were not previously annotated in pigs covering protein-coding genes, lncRNA, and fusion transcripts were obtained using PacBio sequencing. The results contribute to further revealing the mechanisms underlying meat quality and are of value for the refinement of the porcine genome annotation as well.

This study was designed to characterize AS in muscles, which is different from previous studies aiming at maximizing transcript diversity with multiple tissues via the SMRT methodology in pigs [25]. Thus, the number of transcripts obtained here is relatively low, but, in agreement with previous studies, our results further emphasize the complexity of the porcine transcriptome and the universality of AS in animals. The percent of AS in multi-exon genes is 57.81% (5502/9518) in our Iso-seq data, much lower than the values reported in previous studies (>90%) [10,13,25]. This finding might also be explained by only one tissue, skeletal muscle, being used here.

A total of 34,605 novel isoforms, previously unannotated, were identified with Iso-seq in 9623 known loci, with an average number of 3.6 per locus, which is a notable increase in the identified transcripts in the porcine genome. Moreover, a huge number of unannotated loci was found. These sequences were validated with short-read data of RNA-seq, homologous sequences in other species, and RT-PCR. Tissue-specific splicing and the subsequent transcripts exist extensively in animals. In a study involving nine porcine tissues, 44% of all detected transcripts were tissue-specific [26]. At the protein level, evidence for tissue-specific splicing was also reported by Rodriguez et al. [43] in multiple tissue groups, including nervous, muscle, blood, digestive, urinary, liver, reproductive, respiratory, endocrine, and placenta tissues. Thus, the many novel transcripts obtained here might be muscle-specific and related to the development, differentiation, and function of muscles. Additionally, Min pigs are an indigenous Chinese pig breed with higher IMF content than that in western pig breeds and were first used here for Iso-seq analysis. There should be a large number of transcripts specific for Min pigs, which might be valuable for revealing the difference in meat quality between Min and other pig breeds. Moreover, the low level of expression and/or conservation among species (such as lncRNA) might be another reason why they were not identified before. Nevertheless, these newly identified sequences not only update the pig transcriptome, but offer clues for the identification of skeletal muscle-specific isoforms, which will contribute to revealing mechanisms underlying skeletal muscle development.

Iso-seq technology can produce full-length transcripts without the aid of assembly, thus providing superior evidence for differentiating AS events. IR events were identified as the most abundant in the mixed samples of LT and ST muscles with Iso-seq. IR was thought to be the most prevalent AS type in various studies [25,44,45]. However, there were different results. For example, AD was the most common form with 44% of the total AS events analyzed, including ES, AD, AA, and IR, while IR accounted for approximately 19% in 34 different tissues, including muscle from Duroc pigs [42]. Additionally, AA was identified as the most prevalent AS type in muscle from a single White cross-bred pig [26]. These results showed that AS is common, complicated, and highly regulated, suggesting its importance in physiological processes. Nevertheless, IR has been emerging as a mechanism underlying gene regulation. It has been found to be strongly regulated and involved in early chick embryo development [25], granulocyte differentiation [46], terminal erythropoiesis [47], and germ cell differentiation [48]. Recent genome-wide splicing analyses showed that increases in IR are age-associated and conserved across species [49,50,51]. The prevalence of IR in muscle of Min pigs implicates its involvement in skeletal muscle development.

However, ES and MEE were characterized as the top two events that differentially splice between two muscles using integrated Iso-seq and RNA-seq analysis, indicating ES and MEE are critical splicing regulatory mechanisms for the different LT and ST phenotypes. ES and MEE are likely to cause amino acid deletion/insertion (indel) in the polypeptide without frameshift or premature termination codon (PTC), which are often present in IR events and result in truncated protein [25]. PTC-containing transcripts might be degraded by the nonsense-mediated mRNA decay mechanism [52,53]; thereafter, IR events function mainly in regulating the levels of productive mRNA and the corresponding protein. The novel isoforms produced by amino acid indels might be a major reason for the differences between LT and ST.

AS plays critical roles in phenotype determination, tissue differentiation, and development [12]. DEG analyses have been performed in porcine skeletal muscle previously, and key genes and pathways related to skeletal muscle development and growth [54,55] and meat quality [56,57] were identified. However, RNA splicing in skeletal muscle was not deeply investigated. Here, mechanisms underlying the differential phenotype of LT and ST were analyzed on the AS level, and pathways involved in IMF deposition were highlighted. Through integrated analysis of Iso-seq and RNA-seq, DAS and DE-DAS genes were identified and were found to be involved in various fat-related pathways, indicating AS plays an important role in the different IMF contents of the two muscles. Although much lower than DEGs in number (122 DE-DAS genes vs. 1177 DEGs), DE-DAS genes share the most pathways (12 out of the top 20) with DEGs and play similar roles in fat deposition, as revealed by KEGG analysis, which suggested that the DAS of DEGs is the major factor affecting the IMF contents in LT and ST muscles. These results indicated that DE-DAS genes are critical for the differential phenotype of ST and LT muscles, especially in fat deposition. The AS of these genes, especially upregulated ones, should be considered first in future studies on meat quality.

5. Conclusions

In conclusion, we obtained a full-length pig transcriptome using PacBio Iso-seq. Numerous novel transcripts covering protein-coding genes, lncRNA, and fusion transcripts that were not previously annotated in pigs were identified. A total of 456,965 AS events, of which 3930 are DAS, corresponding to 2364 unique genes, were obtained in LT and ST muscles. The results update the existing genome annotations and revealed the breed and tissue specificity of AS and isoforms. Furthermore, DAS of DEGs were identified as the important factors influencing the IMF contents in LT and ST muscles, and fatty acid degradation and PPAR signaling pathway were found to be the most important pathways regulating the differential deposition of two muscles. The findings provide a valuable basis for the in-depth exploration of the mechanisms underlying meat quality and the IMF deposition.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/biom12020154/s1, Table S1. Primers used in this study. Table S2. Transcripts identified by Iso-seq. Table S2-a. All transcripts identified by Iso-seq. Table S2-b. Novel gene loci identified by Iso-seq. Table S2-c. Novel protein-coding genes identified by Iso-seq. Table S2-d. Novel lncRNA identified by Iso-seq. Table S2-e. Fusion transcripts identified by Iso-seq. Table S3. Differential alternative splicing events. Table S4. Characterization of DAS genes. Table S4-a. DAS genes in semitendinosus compared to longissimus thoracis muscle. Table S4-b. Total genes identified in semitendinosus and longissimus thoracis muscles. Table S4-c. GO classification of DAS genes. Table S4-d. KEGG classification of DAS genes. Table S5. Identification of DE-DAS genes. Table S5-a. DEGs in semitendinosus compared to longissimus thoracis muscle. Table S5-b. DE-DAS genes in semitendinosus compared to longissimus thoracis muscle. Table S5-c. GO classification of DE-DAS genes. Table S5-d. Differentially expressed TFs in semitendinosus compared to longissimus thoracis muscle.

Author Contributions

Conceptualization, X.Y. and D.L.; investigation, W.H., Z.Y., and Y.S.; resources, D.L.; supervision, X.Y. and D.L.; funding acquisition, D.Z. and X.Y.; validation, W.H., J.L., and Z.Y.; writing—original draft, D.Z. and X.Y.; writing—review and editing, X.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (31741114) and the Heilongjiang Science Fund for Distinguished Youth Scholars (JQ2020C005).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All the relevant data are provided along in the manuscript as Supplementary Files.

Conflicts of Interest

The authors declare no conflict of interest.

References

Van Wijk, H.J.; Arts, D.J.; Matthews, J.O.; Webster, M.; Ducro, B.J.; Knol, E.F. Genetic parameters for carcass composition and pork quality estimated in a commercial production chain. J. Anim. Sci. 2005, 83, 324–333. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Miar, Y.; Plastow, G.S.; Moore, S.S.; Manafiazar, G.; Charagu, P.; Kemp, R.A.; Van Haandel, B.; Huisman, A.E.; Zhang, C.Y.; McKay, R.M.; et al. Genetic and phenotypic parameters for carcass and meat quality traits in commercial crossbred pigs. J. Anim. Sci. 2014, 92, 2869–2884. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Markljung, E.; Braunschweig, M.H.; Karlskov-Mortensen, P.; Bruun, C.S.; Sawera, M.; Cho, I.C.; Hedebro-Velander, I.; Josell, A.; Lundström, K.; von Seth, G.; et al. Genome-wide identification of quantitative trait loci in a cross between Hampshire and Landrace II: Meat quality traits. BMC Genet. 2008, 9, 22. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Yu, K.; Shu, G.; Yuan, F.; Zhu, X.; Gao, P.; Wang, S.; Wang, L.; Xi, Q.; Zhang, S.; Zhang, Y.; et al. Fatty acid and transcriptome profiling of longissimus dorsi muscles between pig breeds differing in meat quality. Int. J. Biol. Sci. 2013, 9, 108–118. [Google Scholar] [CrossRef]
Cho, I.C.; Yoo, C.K.; Lee, J.B.; Jung, E.J.; Han, S.H.; Lee, S.S.; Ko, M.S.; Lim, H.T.; Park, H.B. Genome-wide QTL analysis of meat quality-related traits in a large F2 intercross between Landrace and Korean native pigs. Genet. Sel. Evol. 2015, 47, 7. [Google Scholar] [CrossRef] [Green Version]
Wang, Y.; Thakali, K.; Morse, P.; Shelby, S.; Chen, J.; Apple, J.; Huang, Y. Comparison of Growth Performance and Meat Quality Traits of Commercial Cross-Bred Pigs versus the Large Black Pig Breed. Animals 2021, 11, 200. [Google Scholar] [CrossRef]
Maniatis, T.; Tasic, B. Alternative pre-mRNA splicing and proteome expansion in metazoans. Nature 2002, 418, 236–243. [Google Scholar] [CrossRef]
Nilsen, T.W.; Graveley, B.R. Expansion of the eukaryotic proteome by alternative splicing. Nature 2010, 463, 457–463. [Google Scholar] [CrossRef] [Green Version]
Black, D.L. Mechanisms of alternative pre-messenger RNA splicing. Annu. Rev. Biochem. 2003, 72, 291–336. [Google Scholar] [CrossRef] [Green Version]
Pan, Q.; Shai, O.; Lee, L.J.; Frey, B.J.; Blencowe, B.J. Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nat. Genet. 2008, 40, 1413–1415. [Google Scholar] [CrossRef]
Sammeth, M.; Foissac, S.; Guigó, R.A. general definition and nomenclature for alternative splicing events. PLoS Comput. Biol. 2008, 4, e1000147. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Kalsotra, A.; Cooper, T.A. Functional consequences of developmentally regulated alternative splicing. Nat. Rev. Genet. 2011, 12, 715–729. [Google Scholar] [CrossRef]
Wang, E.T.; Sandberg, R.; Luo, S.; Khrebtukova, I.; Zhang, L.; Mayr, C.; Kingsmore, S.F.; Schroth, G.P.; Burge, C.B. Alternative isoform regulation in human tissue transcriptomes. Nature 2008, 456, 470–476. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Frankish, A.; Diekhans, M.; Ferreira, A.M.; Johnson, R.; Jungreis, I.; Loveland, J.; Mudge, J.M.; Sisu, C.; Wright, J.; Armstrong, J.; et al. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 2019, 47, D766–D773. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Hu, Z.; Scott, H.S.; Qin, G.; Zheng, G.; Chu, X.; Xie, L.; Adelson, D.L.; Oftedal, B.E.; Venugopal, P.; Babic, M.; et al. Revealing Missing Human Protein Isoforms Based on Ab Initio Prediction, RNA-seq and Proteomics. Sci. Rep. 2015, 5, 10940. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Pertea, M.; Shumate, A.; Pertea, G.; Varabyou, A.; Breitwieser, F.P.; Chang, Y.C.; Madugundu, A.K.; Pandey, A.; Salzberg, S.L. CHESS: A new human gene catalog curated from thousands of large-scale RNA sequencing experiments reveals extensive transcriptional noise. Genome Biol. 2018, 19, 208. [Google Scholar] [CrossRef] [PubMed]
Gonzàlez-Porta, M.; Frankish, A.; Rung, J.; Harrow, J.; Brazma, A. Transcriptome analysis of human tissues and cell lines reveals one dominant transcript per gene. Genome Biol. 2013, 14, R70. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Baralle, F.E.; Giudice, J. Alternative splicing as a regulator of development and tissue identity. Nat. Rev. Mol. Cell Biol. 2017, 18, 437–451. [Google Scholar] [CrossRef]
Bhadra, M.; Howell, P.; Dutta, S.; Heintz, C.; Mair, W.B. Alternative splicing in aging and longevity. Hum. Genet. 2020, 139, 357–369. [Google Scholar] [CrossRef]
Abdel-Ghany, S.E.; Hamilton, M.; Jacobi, J.L.; Ngam, P.; Devitt, N.; Schilkey, F.; Ben-Hur, A.; Reddy, A.S. A survey of the sorghum transcriptome using single-molecule long reads. Nat. Commun. 2016, 7, 11706. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Wang, B.; Tseng, E.; Regulski, M.; Clark, T.A.; Hon, T.; Jiao, Y.; Lu, Z.; Olson, A.; Stein, J.C.; Ware, D. Unveiling the complexity of the maize transcriptome by single-molecule long-read sequencing. Nat. Commun. 2016, 7, 11708. [Google Scholar] [CrossRef] [Green Version]
Kuo, R.I.; Tseng, E.; Eory, L.; Paton, I.R.; Archibald, A.L.; Burt, D.W. Normalized long read RNA sequencing in chicken reveals transcriptome complexity similar to human. BMC Genom. 2017, 18, 323. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Li, Y.; Fang, C.; Fu, Y.; Hu, A.; Li, C.; Zou, C.; Li, X.; Zhao, S.; Zhang, C.; Li, C. A survey of transcriptome complexity in Sus scrofa using single-molecule long-read sequencing. DNA Res. 2018, 25, 421–437. [Google Scholar] [CrossRef] [PubMed]
Chen, H.; Gao, F.; He, M.; Ding, X.F.; Wong, A.M.; Sze, S.C.; Yu, A.C.; Sun, T.; Chan, A.W.; Wang, X.; et al. Long-Read RNA Sequencing Identifies Alternative Splice Variants in Hepatocellular Carcinoma and Tumor-Specific Isoforms. Hepatology 2019, 70, 1011–1025. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Ren, J.; Sun, C.; Clinton, M.; Yang, N. Dynamic Transcriptional Landscape of the Early Chick Embryo. Front. Cell Dev. Biol. 2019, 7, 196. [Google Scholar] [CrossRef] [PubMed]
Beiki, H.; Liu, H.; Huang, J.; Manchanda, N.; Nonneman, D.; Smith, T.; Reecy, J.M.; Tuggle, C.K. Improved annotation of the domestic pig genome through integration of Iso-Seq and RNA-seq data. BMC Genom. 2019, 20, 344. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Realini, C.E.; Vénien, A.; Gou, P.; Gatellier, P.; Pérez-Juan, M.; Danon, J.; Astruc, T. Characterization of Longissimus thoracis, Semitendinosus and Masseter muscles and relationships with technological quality in pigs. 1. Microscopic analysis of muscles. Meat Sci. 2013, 94, 408–416. [Google Scholar] [CrossRef] [PubMed]
Realini, C.E.; Pérez-Juan, M.; Gou, P.; Díaz, I.; Sárraga, C.; Gatellier, P.; García-Regueiro, J.A. Characterization of Longissimus thoracis, Semitendinosus and Masseter muscles and relationships with technological quality in pigs. 2. Composition of muscles. Meat Sci. 2013, 94, 417–423. [Google Scholar] [CrossRef]
Ortiz, A.; Tejerina, D.; García-Torres, S.; González, E.; Morcillo, J.F.; Mayoral, A.I. Effect of Animal Age at Slaughter on the Muscle Fibres of Longissimus thoracis and Meat Quality of Fresh Loin from Iberian × Duroc Crossbred Pig under Two Production Systems. Animals 2021, 11, 2143. [Google Scholar] [CrossRef] [PubMed]
Listrat, A.; Gagaoua, M.; Normand, J.; Gruffat, D.; Andueza, D.; Mairesse, G.; Mourot, B.P.; Chesneau, G.; Gobert, C.; Picard, B. Contribution of connective tissue components, muscle fibres and marbling to beef tenderness variability in longissimus thoracis, rectus abdominis, semimembranosus and semitendinosus muscles. J. Sci. Food Agric. 2020, 100, 2502–2511. [Google Scholar] [CrossRef] [PubMed]
Chen, F.F.; Wang, Y.Q.; Tang, G.R.; Liu, S.G.; Cai, R.; Gao, Y.; Sun, Y.M.; Yang, G.S.; Pang, W.J. Differences between porcine longissimus thoracis and semitendinosus intramuscular fat content and the regulation of their preadipocytes during adipogenic differentiation. Meat Sci. 2019, 147, 116–126. [Google Scholar] [CrossRef] [PubMed]
Fernandez, X.; Monin, G.; Talmant, A.; Mourot, J.; Lebret, B. Influence of intramuscular fat content on the quality of pig meat—2. Consumer acceptability of m. longissimus lumborum. Meat Sci. 1999, 53, 67–72. [Google Scholar] [CrossRef]
Salmela, L.; Rivals, E. The algorithm, the software, and its performances are described in LoRDEC: Accurate and efficient long read error correction. Bioinformatics 2014, 30, 3506–3514. [Google Scholar] [CrossRef]
Wu, T.D.; Watanabe, C.K. GMAP: A genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics 2005, 21, 1859–1875. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Wang, L.; Park, H.J.; Dasari, S.; Wang, S.; Kocher, J.P.; Li, W. CPAT: Coding-Potential Assessment Tool using an alignment-free logistic regression model. Nucleic Acids Res. 2013, 41, e74. [Google Scholar] [CrossRef] [PubMed]
Foissac, S.; Sammeth, M. ASTALAVISTA: Dynamic and flexible analysis of alternative splicing events in custom gene datasets. Nucleic Acids Res. 2007, 35, W297–W299. [Google Scholar] [CrossRef] [Green Version]
Chen, Y.; Chen, Y.; Shi, C.; Huang, Z.; Zhang, Y.; Li, S.; Li, Y.; Ye, J.; Yu, C.; Li, Z.; et al. SOAPnuke: A MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. Gigascience 2018, 7, 1–6. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Kim, D.; Langmead, B.; Salzberg, S.L. HISAT: A fast spliced aligner with low memory requirements. Nat. Methods 2015, 12, 357–360. [Google Scholar] [CrossRef] [Green Version]
Yang, X.Q.; Zhao, X.L.; Yu, H.; Zhang, J.; Han, L.X.; Liu, D. Speckled 100 kDa gene in pigs: Alternative splicing, subcellular localization, and response to interferon-α stimulation. Gene 2021, 791, 145710. [Google Scholar] [CrossRef] [PubMed]
Sharon, D.; Tilgner, H.; Grubert, F.; Snyder, M. A single-molecule long-read survey of the human transcriptome. Nat. Biotechnol. 2013, 31, 1009–1014. [Google Scholar] [CrossRef]
Melé, M.; Ferreira, P.G.; Reverter, F.; DeLuca, D.S.; Monlong, J.; Sammeth, M.; Young, T.R.; Goldmann, J.M.; Pervouchine, D.D.; Sullivan, T.J.; et al. Human genomics. The human transcriptome across tissues and individuals. Science 2015, 348, 660–665. [Google Scholar] [CrossRef] [Green Version]
Feng, W.; Zhao, P.; Zheng, X.; Hu, Z.; Liu, J. Profiling Novel Alternative Splicing within Multiple Tissues Provides Useful Insights into Porcine Genome Annotation. Genes 2020, 11, 1405. [Google Scholar] [CrossRef]
Rodriguez, J.M.; Pozo, F.; di Domenico, T.; Vazquez, J.; Tress, M.L. An analysis of tissue-specific alternative splicing at the protein level. PLoS Comput. Biol. 2020, 16, e1008287. [Google Scholar] [CrossRef]
Braunschweig, U.; Barbosa-Morais, N.L.; Pan, Q.; Nachman, E.N.; Alipanahi, B.; Gonatopoulos-Pournatzis, T.; Frey, B.; Irimia, M.; Blencowe, B.J. Widespread intron retention in mammals functionally tunes transcriptomes. Genome Res. 2014, 24, 1774–1786. [Google Scholar] [CrossRef] [PubMed]
Ni, T.; Yang, W.; Han, M.; Zhang, Y.; Shen, T.; Nie, H.; Zhou, Z.; Dai, Y.; Yang, Y.; Liu, P.; et al. Global intron retention mediated gene regulation during CD4+ T cell activation. Nucleic Acids Res. 2016, 44, 6817–6829. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Wong, J.J.; Ritchie, W.; Ebner, O.A.; Selbach, M.; Wong, J.W.; Huang, Y.; Gao, D.; Pinello, N.; Gonzalez, M.; Baidya, K.; et al. Orchestrated intron retention regulates normal granulocyte differentiation. Cell 2013, 154, 583–595. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Pimentel, H.; Parra, M.; Gee, S.L.; Mohandas, N.; Pachter, L.; Conboy, J.G. A dynamic intron retention program enriched in RNA processing genes regulates gene expression during terminal erythropoiesis. Nucleic Acids Res. 2016, 44, 838–851. [Google Scholar] [CrossRef] [Green Version]
Naro, C.; Jolly, A.; Di Persio, S.; Bielli, P.; Setterblad, N.; Alberdi, A.J.; Vicini, E.; Geremia, R.; De la Grange, P.; Sette, C. An Orchestrated Intron Retention Program in Meiosis Controls Timely Usage of Transcripts during Germ Cell Differentiation. Dev. Cell 2017, 41, 82–93. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Heintz, C.; Doktor, T.K.; Lanjuin, A.; Escoubas, C.; Zhang, Y.; Weir, H.J.; Dutta, S.; Silva-García, C.G.; Bruun, G.H.; Morantte, I.; et al. Splicing factor 1 modulates dietary restriction and TORC1 pathway longevity in C. elegans. Nature 2017, 541, 102–106. [Google Scholar] [CrossRef] [PubMed]
Tabrez, S.S.; Sharma, R.D.; Jain, V.; Siddiqui, A.A.; Mukhopadhyay, A. Differential alternative splicing coupled to nonsense-mediated decay of mRNA ensures dietary restriction-induced longevity. Nat. Commun. 2017, 8, 306. [Google Scholar] [CrossRef] [Green Version]
Adusumalli, S.; Ngian, Z.K.; Lin, W.Q.; Benoukraf, T.; Ong, C.T. Increased intron retention is a post-transcriptional signature associated with progressive aging and Alzheimer’s disease. Aging Cell 2019, 18, e12928. [Google Scholar] [CrossRef]
Nagy, E.; Maquat, L.E. A rule for termination-codon position within intron-containing genes: When nonsense affects RNA abundance. Trends Biochem. Sci. 1998, 23, 198–199. [Google Scholar] [CrossRef]
Jung, H.; Lee, D.; Lee, J.; Park, D.; Kim, Y.J.; Park, W.Y.; Hong, D.; Park, P.J.; Lee, E. Intron retention is a widespread mechanism of tumor-suppressor inactivation. Nat. Genet. 2015, 47, 1242–1248. [Google Scholar] [CrossRef] [PubMed]
Wang, J.; Ren, Q.; Hua, L.; Chen, J.; Zhang, J.; Bai, H.; Li, H.; Xu, B.; Shi, Z.; Cao, H.; et al. Comprehensive Analysis of Differentially Expressed mRNA, lncRNA and circRNA and Their ceRNA Networks in the Longissimus Dorsi Muscle of Two Different Pig Breeds. Int. J. Mol. Sci. 2019, 20, 1107. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Li, X.; Xie, S.; Qian, L.; Cai, C.; Bi, H.; Cui, W. Identification of genes related to skeletal muscle growth and development by integrated analysis of transcriptome and proteome in myostatin-edited Meishan pigs. J. Proteom. 2020, 213, 103628. [Google Scholar] [CrossRef]
Li, X.J.; Zhou, J.; Liu, L.Q.; Qian, K.; Wang, C.L. Identification of genes in longissimus dorsi muscle differentially expressed between Wannanhua and Yorkshire pigs using RNA-sequencing. Anim. Genet. 2016, 47, 324–333. [Google Scholar] [CrossRef]
Liu, Y.; Yang, X.; Jing, X.; He, X.; Wang, L.; Liu, Y.; Liu, D. Transcriptomics Analysis on Excellent Meat Quality Traits of Skeletal Muscles of the Chinese Indigenous Min Pig Compared with the Large White Breed. Int. J. Mol. Sci. 2017, 19, 21. [Google Scholar] [CrossRef] [PubMed] [Green Version]

Figure 1. Analysis of Iso-seq data. (A) Classification of raw Iso-seq reads; (B) length distribution of full-length nonconcatemer reads; (C) Venn diagram depicting overlapped genes between Iso-seq and reference genome data; (D) analysis of gene length amongst genes generating different numbers of transcripts; (E) analysis of exon numbers among genes generating different numbers of transcripts.

Figure 2. Identification of novel genes by Iso-seq analysis. (A) Venn diagram depicting novel protein-coding isoforms among NR, GO, KO, KOG, and Swiss-Prot databases; (B) CIRCOS visualization of data identified at the chromosomal level. ① Pig chromosomes; ② distribution of gene in reference genome; ③ distribution of gene identified by Iso-seq; ④ transcript distribution; ⑤ alternative splicing event distribution; ⑥ lncRNA distribution; ⑦ fusion transcripts: inter-chromosome (yellow), intro-chromosome (blue); (C) validation of novel isoforms with RT-PCR. Arrow, primer position; @, junction of two genes; NC, negative control.

Figure 3. Identification of alternative splicing (AS) events by Iso-seq analysis. (A) Distribution of five basic AS events detected, and a schematic illustration of the five AS models; (B) RT-PCR validation of alternative splicing events; (C) distribution of transcript number per gene among multi-exon genes. M, DL2000 marker; TB, target band; NC, negative control.

Figure 4. Characterization of differential alternative splicing (DAS) events through integrated analysis of Iso-seq and Illumina-seq. (A) Statistics of differential DAS events; (B) RT-PCR validation of DAS genes; (C) distribution of DAS genes with different expression levels compared to that of total genes identified; (D) comparison of expression level of DAS genes between longissimus thoracis and semitendinosus; (E) top 10 GO terms enriched by DAS genes in each functional category; (F) KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways enriched with over 20 DAS genes.

Figure 5. Characterization of differentially expressed (DE) differential alternative splicing (DAS) genes in semitendinosus compared to longissimus thoracis muscle. (A) Volcano plot of DE genes; (B) Venn diagram depicting DE-DAS and DE genes; (C) heatmap of DE-DAS genes; (D) real-time PCR validation of DE-DAS genes; (E) top 10 biological process categories classified by DE-DAS genes. The gene number is given in the chart. 1, Regulation of biological process; 2, multicellular organismal process; 3, cellular component organization or biogenesis.

Figure 6. Functional characterization of differentially expressed (DE) differential alternative splicing (DAS) genes. (A) Top 20 KEGG pathways significantly enriched by DE genes (DEGs); (B) top 20 KEGG pathways significantly enriched by DE-DAS genes; (C) top 20 pathways significantly enriched by upregulated DE-DAS genes; (D) all pathways significantly enriched by downregulated DE-DAS genes. The pathways involved in fat formation are indicated in red, and those shared by the top 20 of DEGs and DE-DAS are underlined; all pathways overlapped between enrichments of all and upregulated or downregulated DE-DAS genes are indicated in italics.

Figure 7. Characterization of differentially expressed transcription factors (DETFs) among differential alternative splicing (DAS) genes. (A) family analysis of DETFs; (B) Venn diagram of DETFs and DAS genes; (C) heatmap of DAS-DETFs; (D) protein–protein analysis of DAS-DETFs with differentially expressed genes in semitendinosus compared to longissimus thoracis muscles. Pink, upregulated genes; blue, downregulated genes; yellow, DAS-DETFs.

Table 1. Top 20 up- and downregulated differentially expressed genes.

Gene ID	Log2FC	NR_Protein_Accession	NR_Defination
7.1204	22.98	*	*
5.156	10.99	XP_013853022.1	^△^# homeobox protein Hox-C9 (Sus scrofa)
ENSSSCG00000036741	10.19	XP_019277494.1	^△^# pituitary homeobox 1 (Panthera pardus)
ENSSSCG00000016698	8.79	XP_003134898.2	^△^# homeobox protein Hox-A11 (Sus scrofa)
ENSSSCG00000029666	8.70	XP_014964404.1	^△^# homeobox protein Hox-A13 isoform X2 (Ovis aries musimon)
18.79	7.16	ABR01162.1	endonuclease/reverse transcriptase (Sus scrofa)
3.205	6.87	*	*
ENSSSCG00000022980	6.62	XP_005669066.1	^△^# T-box transcription factor TBX4 isoform X1 (Sus scrofa)
10.351	6.27	XP_012920268.1	^△^# tigger transposable element-derived protein 1 (Mustela putorius furo)
ENSSSCG00000033532	6.16	XP_003356130.2	^△ serine/threonine-protein kinase SBK2 (Sus scrofa)
ENSSSCG00000015069	−7.06	NP_001002801.1	apolipoprotein C-III precursor (Sus scrofa)
ENSSSCG00000040910	−6.54	XP_003131314.1	^△ beta-2-glycoprotein 1 isoform X1 (Sus scrofa)
ENSSSCG00000023686	−6.47	NP_999377.1	transthyretin precursor (Sus scrofa)
ENSSSCG00000011799	−6.44	XP_005652426.1	^△ alpha-2-HS-glycoprotein (Sus scrofa)
ENSSSCG00000011692	−6.40	XP_003358647.1	^△^# zinc finger protein ZIC 1 (Sus scrofa)
ENSSSCG00000048779	−6.10	XP_013843157.1	^△^# zinc finger protein 646 isoform X1 (Sus scrofa)
ENSSSCG00000032321	−5.86	NP_001001859.1	alpha-1,3-mannosyl-glycoprotein 4-beta-N-acetylglucosaminyltransferase C (Sus scrofa)
ENSSSCG00000005488	−5.83	XP_005660428.1	^△ alpha-1-acid glycoprotein isoform X1 (Sus scrofa)
ENSSSCG00000042542	−5.78	ABR01162.1	endonuclease/reverse transcriptase (Sus scrofa)
ENSSSCG00000005485	−5.76	NP_001157478.1	protein AMBP precursor (Sus scrofa)

FC, fold change; *, sequence was not identified in the NR database; ^△, sequence was predicted by automated computational analysis in the NR database; ^#, sequence is a transcription factor.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Hao, W.; Yang, Z.; Sun, Y.; Li, J.; Zhang, D.; Liu, D.; Yang, X. Characterization of Alternative Splicing Events in Porcine Skeletal Muscles with Different Intramuscular Fat Contents. Biomolecules 2022, 12, 154. https://doi.org/10.3390/biom12020154

AMA Style

Hao W, Yang Z, Sun Y, Li J, Zhang D, Liu D, Yang X. Characterization of Alternative Splicing Events in Porcine Skeletal Muscles with Different Intramuscular Fat Contents. Biomolecules. 2022; 12(2):154. https://doi.org/10.3390/biom12020154

Chicago/Turabian Style

Hao, Wanjun, Zewei Yang, Yuanlu Sun, Jiaxin Li, Dongjie Zhang, Di Liu, and Xiuqin Yang. 2022. "Characterization of Alternative Splicing Events in Porcine Skeletal Muscles with Different Intramuscular Fat Contents" Biomolecules 12, no. 2: 154. https://doi.org/10.3390/biom12020154

APA Style

Hao, W., Yang, Z., Sun, Y., Li, J., Zhang, D., Liu, D., & Yang, X. (2022). Characterization of Alternative Splicing Events in Porcine Skeletal Muscles with Different Intramuscular Fat Contents. Biomolecules, 12(2), 154. https://doi.org/10.3390/biom12020154

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Characterization of Alternative Splicing Events in Porcine Skeletal Muscles with Different Intramuscular Fat Contents

Abstract

1. Introduction

2. Materials and Methods

2.1. Animals, Tissues, and RNA

2.2. PacBio Library Construction and Sequencing

2.3. Raw Read Processing

2.4. Loci and Isoform Annotation

2.5. Novel Gene Identification

2.6. Alternative Splicing Analysis

2.7. RNA Sequencing and Data Processing

2.8. Reverse-Transcription PCR and Real-Time Quantitative PCR

3. Results

3.1. Overview of PacBio Iso-Seq Data

3.2. Identification of Novel Gene

3.3. Alternative Splicing Events

3.4. Differential Alternative Splicing Events

3.5. Integrated Analysis of Differential Alternative Splicing Events and Differentially Expressed Genes

3.6. Transcription Factors in DE-DAS Genes

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI