Article

Full-Length Transcriptome Reconstruction Reveals the Genetic Mechanisms of Eyestalk Displacement and Its Potential Implications on the Interspecific Hybrid Crab (Scylla serrata ♀ × S. paramamosain ♂)

1 Guangdong Provincial Key Laboratory of Marine Biotechnology, Shantou University, Shantou 515063, China
2 STU-UMT Joint Shellfish Research Laboratory, Shantou University, Shantou 515063, China
3 Institute of Tropical Aquaculture and Fisheries, Universiti Malaysia Terengganu, Kuala Nerus, Terengganu 21030, Malaysia
* Author to whom correspondence should be addressed.
Biology 2022, 11(7), 1026; https://doi.org/10.3390/biology11071026
Submission received: 12 June 2022 / Revised: 26 June 2022 / Accepted: 27 June 2022 / Published: 7 July 2022
(This article belongs to the Special Issue The Application of Genetic and Genomic Biotechnology in Aquaculture)


Simple Summary

The eyestalk is a key organ in crustaceans that produces neurohormones and regulates a range of physiological functions. Eyestalk displacement was discovered in some first-generation (F1) offspring of the novel interspecific hybrid crab (Scylla serrata ♀ × S. paramamosain ♂). To uncover the genetic mechanism underlying eyestalk displacement and its potential implications, a high-quality transcriptome was reconstructed using single-molecule real-time (SMRT) sequencing. A total of 37 significant differential alternative splicing (DAS) events (17 up-regulated and 20 down-regulated) and 1475 significant differentially expressed transcripts (DETs) (492 up-regulated and 983 down-regulated) were detected in hybrid crabs with displaced eyestalks (DH). The most significant DAS event and DET were annotated as endoplasmic reticulum chaperone BiP and leucine-rich repeat protein lrrA-like isoform X2, respectively. In addition, the top ten significant gene ontology (GO) terms were related to the cuticle or chitin. Overall, this study highlights the underlying genetic mechanisms of eyestalk displacement and provides useful knowledge for mud crab (Scylla spp.) crossbreeding.

Abstract

The lack of high-quality juvenile crabs is the greatest impediment to the growth of the mud crab (Scylla paramamosain) industry. To obtain high-quality hybrid offspring, a novel hybrid mud crab (S. serrata ♀ × S. paramamosain ♂) was successfully produced in our previous study. During that work, an interesting phenomenon was discovered: the eyestalks of some first-generation (F1) hybrid offspring were displaced during crablet stage I. To uncover the genetic mechanism underlying eyestalk displacement and its potential implications, both single-molecule real-time (SMRT) and Illumina RNA sequencing were implemented. Using a two-step collapsing strategy, three high-quality reconstructed transcriptomes were obtained from purebred mud crabs (S. paramamosain) with normal eyestalks (SPA), hybrid crabs with normal eyestalks (NH), and hybrid crabs with displaced eyestalks (DH). In total, 37 significant differential alternative splicing (DAS) events (17 up-regulated and 20 down-regulated) and 1475 significant differentially expressed transcripts (DETs) (492 up-regulated and 983 down-regulated) were detected in DH. The most significant DAS event and DET were annotated as endoplasmic reticulum chaperone BiP and leucine-rich repeat protein lrrA-like isoform X2, respectively. In addition, the top ten significant GO terms were related to the cuticle or chitin. Overall, high-quality reconstructed transcriptomes were obtained for the novel interspecific hybrid crab, and they provide valuable insights into the genetic mechanisms of eyestalk displacement in mud crab (Scylla spp.) crossbreeding.

1. Introduction

The mud crab (Scylla spp.) is a commercially important aquaculture species in Southeast Asian countries, mainly distributed along the coasts of India and the Western Pacific. In the last decade, to meet the high human demand for protein, the production of mud crab farming has increased rapidly, but the majority of mud crab farming still relies on wild-caught seed crabs [1]. However, both the quantity and quality of mud crab seeds in the wild have decreased dramatically due to over-exploitation and environmental deterioration. Currently, the lack of high-quality juvenile crabs is considered the main challenge in the expansion of the mud crab (S. paramamosain) industry. Furthermore, the genetic improvement of mud crabs (S. paramamosain) is still in its infancy compared with other aquatic species. Most studies focus on intraspecies classification [2,3], sex determination [4,5,6], sex identification [7,8], and nutritional composition [9,10]. Therefore, it is critical to speed up the genetic improvement of the mud crab (S. paramamosain).
Interspecific hybridization is a common and effective method for genetic improvement and breeding of aquatic animals [11] because hybrid offspring usually have more biomass, faster growth, and higher fertility than both parents [12]. To obtain high-quality hybrid offspring, a novel hybrid mud crab (S. serrata ♀ × S. paramamosain ♂) was successfully developed in our previous study [13,14]. However, interspecific hybridization in the mud crab (Scylla spp.) brings not only hybrid vigor but also hybrid inferiority, because some hybrid offspring had eyestalk displacement during crablet stage I. Compared with normal eyestalks, displaced eyestalks were located in the center of the head and extended forward close to the antenna (detailed morphological features and photographs were shown in our previous study [15]). The eyestalk is an important neuroendocrine organ complex that controls a variety of physiological processes in crustaceans, including the central pacemakers of circadian rhythms [16], osmotic regulation [17], molting [18], and reproduction [19]. Most importantly, molting and reproduction are the essential processes for improving productivity in crustacean aquaculture. As the awareness of crustacean endocrinology has increased, eyestalk ablation has become an effective method for promoting molting and reproduction in crustaceans, based on the removal of the gonad- and molt-inhibiting hormones [20,21]. Eyestalks therefore play an important role in crustacean aquaculture, and it is essential to investigate and understand the genetic mechanisms behind eyestalk displacement and its potential implications for the novel hybrid mud crab.
Over the past decade, with the rapid development of high-throughput sequencing technology, transcriptome analysis based on next-generation sequencing (NGS) has been widely implemented to identify the genetic mechanisms underlying complex biological processes in crustacean studies [22]. However, the read lengths of NGS approaches are relatively short and limit assembly in complex regions, especially for species without a reference genome (such as hybrids) [23]. To address this issue, single-molecule real-time (SMRT) sequencing technology has been developed to increase read length while retaining high accuracy [24]. Moreover, SMRT sequencing occurs in real time and does not need PCR amplification during sequencing and library preparation, which reduces PCR-related bias. Full-length transcriptome sequencing has now been successfully implemented in crustaceans, such as Litopenaeus vannamei [25] and S. paramamosain [26], to investigate the gene expression patterns of specific tissues, life stages, and conditions. Therefore, using full-length transcriptome sequencing data to reconstruct the transcriptomes of the hybrid crabs would help uncover the underlying genetic mechanisms of eyestalk displacement.
In this study, the transcriptomes of hybrid crabs with displaced eyestalks (DH), hybrid crabs with normal eyestalks (NH), and purebred mud crabs (S. paramamosain) with normal eyestalks (SPA) were reconstructed using PacBio sequencing data. Following that, novel genes, transcript isoforms, and alternative splicing (AS) events were identified and annotated. Furthermore, the differentially expressed transcripts (DETs) and differential alternative splicing (DAS) events between DH and NH were identified using Illumina RNA sequencing data, and their enrichment analysis was performed. Finally, several DETs and DAS events were selected for further validation of the RNA-seq data. These results help to highlight the underlying genetic mechanisms of eyestalk displacement and provide useful knowledge for mud crab (Scylla spp.) crossbreeding.

2. Materials and Methods

2.1. Sample Collection, RNA Extraction and Sequencing

The F1 hybrid offspring were developed in our previous study using S. paramamosain as the male parent and S. serrata as the female parent [13,14]. In brief, the male S. paramamosain was obtained from the local shore (Guangdong province, China), and the female S. serrata was bought from a local market in Shantou, China. The artificial mating of Scylla was performed following the method described in our previous study [13]. After artificial mating, female crabs were transferred to a rearing tank until the eggs hatched. The F1 hybrid larvae were hatched in circular fiberglass tanks (0.9 m diameter, 1.0 m height), and then transferred to concrete rearing tanks (5.8 m × 4.8 m × 1.8 m) for growing seedlings. Culture conditions were ambient temperature (almost 30 °C), a natural photoperiod, and a salinity of approximately 30 ppt. Samples of DH, NH, and SPA were collected for SMRT and Illumina sequencing at the first crablet stage. Briefly, these juvenile crabs were anaesthetized in ice-cold water for 5 min before being snap-frozen in liquid nitrogen and stored at −80 °C for RNA extraction. The total RNA was then extracted from the whole body using RNA isoPlus (TaKaRa, Shiga, Japan) following the manufacturer’s instructions. Furthermore, the RNA quality was assessed in terms of integrity, purity, and concentration using an Agilent 2100 Bioanalyzer (Agilent Technologies, CA, USA) and a Nanodrop 2000 (Thermo Fisher Scientific, CA, USA). Finally, only high-quality RNA was used for library construction and sequencing. The library preparation and sequencing were carried out at the Beijing Novogene Bioinformatics Technology Co. Ltd., China. For SMRT sequencing, three libraries (one each for DH, NH, and SPA) were constructed and sequenced on the PacBio Sequel platform. Each SMRT library included pooled RNA from three crab samples.
For Illumina sequencing, three libraries of the DH group (one abnormal hybrid crab per library) and three libraries of the NH group (two normal hybrid crabs per library) were constructed and sequenced on the Illumina HiSeq 2500 platform to generate 150 bp paired-end reads.

2.2. SMRT Sequencing Data Processing

To obtain transcript isoforms, the subread sequence data were processed in the following five steps: (1) the subreads were processed by the ccs v4.2.0 software with the parameters --minLength 50, --maxLength 15,000, and --minPasses 1 to generate circular consensus sequences (CCS); (2) the full-length (FL) reads were obtained from the CCS by primer removal and demultiplexing using the lima v1.11.0 software with the parameters --dump-clips and --peek-guess; (3) the noise of the FL reads was removed using the refine module of the isoseq3 v3.3.0 software with the parameters --require-polya and --min-polya-length 20; (4) the consensus sequences from the same transcript were clustered to generate unpolished transcripts using the cluster module of the isoseq3 v3.3.0 software with default parameters; and finally (5) the unpolished transcripts were polished to yield high-quality and low-quality isoforms using the polish module of the isoseq3 v3.3.0 software with default parameters.
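The five processing steps can be laid out as a short Python sketch that only assembles the command lines and does not run them. The file names are hypothetical, the flag spellings are copied from the text (they may differ between tool versions and should be checked against each tool's --help output), and step (4) is assumed to use the cluster subcommand of isoseq3.

```python
# Sketch only: builds the five Iso-Seq processing commands as argument lists.
# Nothing is executed. File names are hypothetical; flag spellings follow the
# text and should be verified against the installed ccs/lima/isoseq3 versions.

def isoseq_commands(subreads="subreads.bam", primers="primers.fasta"):
    return [
        # (1) circular consensus sequences (CCS) from subreads
        ["ccs", subreads, "ccs.bam",
         "--minLength", "50", "--maxLength", "15000", "--minPasses", "1"],
        # (2) primer removal and demultiplexing to obtain full-length reads
        ["lima", "ccs.bam", primers, "fl.bam", "--dump-clips", "--peek-guess"],
        # (3) noise removal, requiring a poly(A) tail of at least 20 bases
        ["isoseq3", "refine", "fl.bam", primers, "flnc.bam",
         "--require-polya", "--min-polya-length", "20"],
        # (4) clustering reads from the same transcript (assumed subcommand)
        ["isoseq3", "cluster", "flnc.bam", "unpolished.bam"],
        # (5) polishing into high-quality and low-quality isoforms
        ["isoseq3", "polish", "unpolished.bam", subreads, "polished.bam"],
    ]


if __name__ == "__main__":
    for step, command in enumerate(isoseq_commands(), start=1):
        print(f"({step})", " ".join(command))
```

Keeping the commands as data makes it easy to log exactly which parameters were used for each of the three libraries before handing them to a job scheduler.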

2.3. Collapsing Redundant Transcripts Isoforms

To eliminate the redundant transcript isoforms, a two-step collapsing strategy was used in this study. In short, the high-quality isoforms were first aligned to the mud crab (S. paramamosain) reference genome [28] using minimap2 (version 2.18) [27] with default parameters, and the alignments were sorted to generate SAM files. Based on the mapping results, redundant isoforms were collapsed by the cDNA cupcake software (https://github.com/Magdoll/cDNA_Cupcake, accessed on 21 June 2021) with the parameters --min_aln_coverage 0.95, --min_aln_identity 0.85, and --dun-merge-5-shorter. In addition, unmapped transcripts were also collapsed using the Cogent v8.0 software (https://github.com/Magdoll/Cogent, accessed on 21 June 2021) and the cDNA cupcake software with default parameters. In this process, different gene families were first discovered from these unmapped transcripts. Then, a “fake genome” was created by concatenating all Cogent unassigned contigs. Using the “fake genome” as the reference genome, these unmapped transcripts were collapsed by the cDNA cupcake software according to the abovementioned steps. Finally, CD-HIT (version 4.8.1) [29] was used to eliminate highly identical sequences from both mapped and unmapped transcript isoforms for further analysis.
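The routing logic of the two-step strategy can be sketched in Python as follows. The Alignment record and the function names are hypothetical, and treating isoforms that fail the coverage or identity thresholds as candidates for the second ("fake genome") step is an assumption made for illustration, not a description of how cDNA cupcake works internally.

```python
from dataclasses import dataclass


@dataclass
class Alignment:
    """Hypothetical summary of one isoform-to-genome alignment."""
    query_id: str
    coverage: float  # fraction of the isoform covered by the alignment
    identity: float  # fraction of matching bases within the aligned region


def passes_collapse_filter(aln, min_coverage=0.95, min_identity=0.85):
    """Apply the coverage and identity thresholds quoted in the text."""
    return aln.coverage >= min_coverage and aln.identity >= min_identity


def split_for_two_step_collapse(alignments, all_ids):
    """Return (ids for genome-based collapsing, ids left for the second step)."""
    mapped = {a.query_id for a in alignments if passes_collapse_filter(a)}
    remaining = [i for i in all_ids if i not in mapped]
    return sorted(mapped), remaining


if __name__ == "__main__":
    alns = [
        Alignment("t1", 0.99, 0.97),  # passes both thresholds
        Alignment("t2", 0.90, 0.99),  # coverage too low
        Alignment("t3", 0.96, 0.80),  # identity too low
    ]
    print(split_for_two_step_collapse(alns, ["t1", "t2", "t3", "t4"]))
```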

2.4. Completeness and Characteristics Analysis of Reconstructed Transcriptomes

To evaluate the quality and completeness of the full-length transcriptomes, benchmarking universal single-copy orthologs (BUSCO) analysis was performed using the BUSCO v5.1.3 software [30] in transcriptome mode with the Arthropoda OrthoDB dataset (arthropoda_odb10) [31] for the full-length transcripts from DH, NH, and SPA. After determining the completeness, full-length transcripts were classified by comparing them to the reference genome annotation using the gffcompare v0.12.2 software [32]. In this step, full-length transcripts were classified into 15 classes, including annotated (coded as “=” or “c”), novel isoform (coded as “j” or “k”), retained intron (coded as “m” or “n”), novel antisense (coded as “x”), and novel intronic/intergenic (coded as “i” or “u”). The detailed information can be found on the official website of the gffcompare software (http://ccb.jhu.edu/software/stringtie/gffcompare.shtml, accessed on 6 February 2021). In addition, the transcriptomes of DH, NH, and SPA were compared pairwise to identify the common and unique transcripts using blastn v2.11.0+ with the following parameters: -evalue 1 × 10⁻¹⁰ and -perc_identity 0.95. In this process, the common and unique transcripts between two transcriptomes were identified by using one transcriptome as the BLAST database and the other transcriptome as the query.
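As an illustration of the pairwise comparison, the following Python sketch classifies query transcripts as common or unique from BLAST tabular output. It assumes the default 12-column tabular layout (output format 6), in which column 3 is percent identity and column 11 is the E-value; reading the 0.95 in the text as 95% identity is also an assumption. The transcript identifiers are invented.

```python
def common_and_unique(query_ids, blast_lines, max_evalue=1e-10, min_pident=95.0):
    """Split query transcripts into those with and without a qualifying hit.

    blast_lines: tab-separated BLAST records in the default 12-column tabular
    layout (qseqid, sseqid, pident, ..., evalue, bitscore). Assumed format.
    """
    with_hit = set()
    for line in blast_lines:
        if not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        qseqid, pident, evalue = fields[0], float(fields[2]), float(fields[10])
        if evalue <= max_evalue and pident >= min_pident:
            with_hit.add(qseqid)
    common = [q for q in query_ids if q in with_hit]
    unique = [q for q in query_ids if q not in with_hit]
    return common, unique


if __name__ == "__main__":
    lines = [
        "DH.1\tNH.7\t99.1\t1500\t10\t0\t1\t1500\t1\t1500\t0.0\t2700",
        "DH.2\tNH.9\t90.0\t800\t80\t0\t1\t800\t1\t800\t1e-50\t900",   # identity too low
        "DH.3\tNH.2\t98.0\t60\t1\t0\t1\t60\t1\t60\t1e-5\t80",         # E-value too high
    ]
    print(common_and_unique(["DH.1", "DH.2", "DH.3", "DH.4"], lines))
```

Because the comparison is run from the point of view of the query set, the counts of common transcripts need not be identical when the roles of database and query are swapped.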

2.5. Gene Functional Annotation

To gain a better understanding of the biological context of the full-length transcripts, gene functional annotation was conducted. In summary, TransDecoder was used to extract open reading frames (ORFs) from the full-length transcripts using default parameters. If multiple ORFs were found in a single transcript, the first ORF was selected for further analysis. The resulting ORFs were annotated using eggNOG-mapper (v2.1.4) [33] with default parameters to obtain functional annotation information, such as the clusters of orthologous groups (COG), gene ontology (GO), Kyoto encyclopedia of genes and genomes (KEGG), and protein families database (Pfam). Additionally, the resulting ORFs were also searched against four databases (Swiss-Prot, TrEMBL, Uniref90, and NR) using the blastp function of diamond (v2.0.4.142) with the following parameters: -outfmt 6, -max_target_seqs 1, and -evalue 1 × 10⁻⁵. Finally, all ORF annotation results were integrated and reported as a tab-delimited summary file.

2.6. Alternative Splicing (AS) Events Analysis

The alternative splicing (AS) events were evaluated using the SUPPA2 software [34] with the GTF annotation files obtained from DH, NH, and SPA to investigate the differences in gene expression patterns between purebred and crossbred mud crabs. Using the generateEvents function of the SUPPA2 software with default parameters, seven AS event types were generated from the GTF annotation file, including SE (skipped exon), MX (mutually exclusive exon), A5 (alternative 5′ splice site), A3 (alternative 3′ splice site), RI (retained intron), AF (alternative first exon), and AL (alternative last exon). The distribution of AS events and the common overlaps were identified and visualized to compare the differences among DH, NH, and SPA. In addition, the GO enrichment analysis of AS events was performed by clusterProfiler using the results of eggNOG-mapper annotation.
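Tallying the seven event types from a list of event identifiers might look like the Python sketch below. It assumes identifiers of the form "<gene_id>;<type>:<seqname>:<coordinates>:<strand>", which should be checked against the actual SUPPA2 output; the example identifiers and coordinates are invented.

```python
from collections import Counter

# The seven event types listed in the text.
EVENT_TYPES = {
    "SE": "skipped exon",
    "MX": "mutually exclusive exon",
    "A5": "alternative 5' splice site",
    "A3": "alternative 3' splice site",
    "RI": "retained intron",
    "AF": "alternative first exon",
    "AL": "alternative last exon",
}


def count_event_types(event_ids):
    """Count events per type, skipping identifiers that do not parse."""
    counts = Counter()
    for event_id in event_ids:
        # Assumed layout: "<gene_id>;<type>:<seqname>:<coordinates>:<strand>"
        _, _, rest = event_id.partition(";")
        event_type = rest.split(":", 1)[0]
        if event_type in EVENT_TYPES:
            counts[event_type] += 1
    return counts


if __name__ == "__main__":
    ids = [
        "PB.1;RI:chr1:100:150-200:300:+",
        "PB.2;SE:chr1:100-200:300-400:+",
        "PB.3;RI:chr2:10:20-30:40:-",
    ]
    for code, n in count_event_types(ids).most_common():
        print(code, EVENT_TYPES[code], n)
```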

2.7. Quantification of Identified Transcripts

To analyze the differential alternative splicing events between NH and DH, the subread sequences of DH and NH were merged and processed to generate the reconstructed transcriptomes for the interspecific hybrid crab (S. serrata ♀ × S. paramamosain ♂) following the abovementioned steps. Using the reconstructed transcriptomes, the expression of transcripts and genes was quantified using Salmon in mapping-based mode [35]. In this step, the mapping-based index of the reference transcript sequences was first constructed using an auxiliary k-mer hash over k-mers of length 31. Then, all clean RNA-seq data were aligned to the reconstructed transcriptomes quickly and accurately using the quant module of Salmon with the parameters -l IU and --validateMappings. Finally, the quantmerge module of Salmon was used to obtain the transcripts per million (TPM) values of each sample, which were then used to calculate the inclusion levels (PSIs).
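The relationship between TPM values and an inclusion level can be illustrated with a minimal Python sketch. It assumes the usual definition of PSI for an event as the summed TPM of the transcripts that include the event divided by the summed TPM of all transcripts that define it; the transcript identifiers and TPM values are invented.

```python
import math


def psi(tpm, inclusion_transcripts, all_event_transcripts):
    """Inclusion level (PSI) of one event from transcript TPM values.

    tpm: dict mapping transcript id -> TPM in one sample.
    Returns NaN when none of the event's transcripts is expressed.
    """
    total = sum(tpm.get(t, 0.0) for t in all_event_transcripts)
    if total == 0:
        return math.nan
    included = sum(tpm.get(t, 0.0) for t in inclusion_transcripts)
    return included / total


if __name__ == "__main__":
    sample_tpm = {"PB.1.1": 30.0, "PB.1.2": 10.0, "PB.1.3": 60.0}
    # PB.1.1 retains the intron; PB.1.2 splices it out; PB.1.3 is unrelated.
    print(psi(sample_tpm, ["PB.1.1"], ["PB.1.1", "PB.1.2"]))
```

In this toy case the retained-intron form accounts for 30 of the 40 TPM assigned to the event, so the inclusion level is 0.75.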

2.8. Differential Alternative Splicing (DAS) Events, Differential Expressed Transcripts (DETs), and Their Enrichment Analysis

To discover the differential alternative splicing (DAS) events between NH and DH, the inclusion levels (PSIs) of each AS event were determined by the psiPerEvent function of the SUPPA2 software using the TPM values and the AS events. Significant differential splicing between NH and DH was then identified by the diffSplice function of the SUPPA2 software using the criteria |ΔPSI| ≥ 0.15 and p value < 0.05. In addition, differential expression analysis between NH and DH was performed using DESeq2 [36] with the transcripts’ expression levels (TPM values). Significant differentially expressed transcripts (DETs) were identified using the thresholds |log2(Fold Change)| ≥ 1 and p value < 0.05. Finally, to determine the functions of the significant DAS events and DETs, GO and KEGG pathway enrichment analyses were performed. For these analyses, the annotations of the related genes were first extracted from the eggNOG-mapper results, and the GO and KEGG enrichment analyses of these genes were then performed by clusterProfiler [37]. Only the GO terms or pathways with p < 0.05 were considered significant.
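The two significance filters can be written out as a small Python sketch. The function names are hypothetical, the splicing criterion is read as a threshold on the PSI difference between groups, and the direction convention (DH relative to NH) is an assumption; the statistics themselves would come from SUPPA2 and DESeq2.

```python
def classify_det(log2_fold_change, p_value, min_abs_lfc=1.0, alpha=0.05):
    """Label a transcript as up, down, or not significant (DH vs. NH assumed)."""
    if p_value >= alpha or abs(log2_fold_change) < min_abs_lfc:
        return "not significant"
    return "up" if log2_fold_change > 0 else "down"


def classify_das(psi_nh, psi_dh, p_value, min_abs_dpsi=0.15, alpha=0.05):
    """Label a splicing event using the PSI difference between DH and NH."""
    delta_psi = psi_dh - psi_nh
    if p_value >= alpha or abs(delta_psi) < min_abs_dpsi:
        return "not significant"
    return "up" if delta_psi > 0 else "down"


if __name__ == "__main__":
    print(classify_det(1.0, 0.01))        # meets both thresholds
    print(classify_det(3.0, 0.05))        # p value not below 0.05
    print(classify_das(0.20, 0.50, 0.01)) # PSI difference of 0.30
    print(classify_das(0.50, 0.40, 0.01)) # PSI difference too small
```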

2.9. Validation of AS Events and Differential Expressed Genes

To validate the transcriptome sequencing results, three genes with AS events were selected for RT-PCR and four genes from the differential expression analysis were selected for qRT-PCR. To summarize, total RNA was first used to synthesize cDNA using the GoScript™ Reverse Transcription System (Promega, Madison, WI, USA). Primer pairs for these genes were designed using Primer Premier 6.0 Software (Table S1). For validation of AS events, the reverse transcription products of the three genes were subjected to PCR, and the PCR products were analyzed by agarose gel electrophoresis. To validate the differentially expressed genes between NH and DH, the qRT-PCR of the four genes was performed according to the manufacturer’s protocol in a LightCycler® 480 system (Roche Applied Science, Indianapolis, IN, USA) using a miRcute Plus miRNA qPCR Kit (SYBR Green) (TIANGEN Biotech, Beijing, China) and a Talent qPCR Premix (SYBR Green) kit (TIANGEN Biotech, Beijing, China). In this process, 18S rRNA was used as the internal control (reference gene), each gene was amplified in three biological replicates and three technical replicates, the relative fold-change was calculated using the 2^−ΔΔCT method [38], and a Student’s t-test was used to determine the statistical significance (p < 0.05) using R software.
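The comparative CT calculation cited as [38] can be written out as a short Python sketch, in which the target gene is normalized to the reference gene (18S rRNA in the text) and the test group is compared with the control group. The CT values below are invented for illustration, and treating DH as the test group and NH as the control is an assumption.

```python
def relative_fold_change(ct_target_test, ct_ref_test, ct_target_ctrl, ct_ref_ctrl):
    """Relative expression by the comparative CT method.

    delta_ct       = CT(target) - CT(reference), computed within each group
    delta_delta_ct = delta_ct(test) - delta_ct(control)
    fold change    = 2 ** (-delta_delta_ct)
    """
    delta_ct_test = ct_target_test - ct_ref_test
    delta_ct_ctrl = ct_target_ctrl - ct_ref_ctrl
    delta_delta_ct = delta_ct_test - delta_ct_ctrl
    return 2 ** (-delta_delta_ct)


if __name__ == "__main__":
    # Hypothetical mean CT values: target gene and reference gene in each group.
    fold = relative_fold_change(
        ct_target_test=24.0, ct_ref_test=12.0,   # test group (e.g., DH)
        ct_target_ctrl=26.0, ct_ref_ctrl=12.0,   # control group (e.g., NH)
    )
    print(fold)
```

With these values the target crosses the threshold two cycles earlier in the test group after normalization, which corresponds to a four-fold higher relative expression.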

3. Results

3.1. Summary of PacBio Iso-Seq Data

We used PacBio SMRT sequencing on RNA samples extracted from NH, DH, and SPA at crablet stage I to examine the expression patterns of interspecific hybrid crabs and their potential contribution to eyestalk displacement. A total of 12,705,473 (20.8 GB), 15,425,110 (23.0 GB), and 15,459,279 (22.22 GB) subreads were recovered for SPA, NH, and DH, respectively. These subreads yielded 265,745, 420,743, and 367,638 circular consensus sequences (CCS) for SPA, NH, and DH, respectively. Following the IsoSeq3 refinement, clustering, and polishing steps, a total of 11,643 (14), 16,587 (12), and 10,336 (9) high-quality (low-quality) isoforms with average lengths of 1780.5 (1883.8), 1630.5 (857.1), and 1661.6 (604.1) bp were obtained for SPA, NH, and DH, respectively. Because the proportion of low-quality isoforms was minimal compared with the high-quality isoforms, they were excluded from further analysis (Table 1).

3.2. Collapsing Redundant Isoforms

After SMRT sequencing data processing, the high-quality isoforms still included a considerable number of redundant isoforms. In this study, a two-step collapsing strategy was applied to collapse redundant isoforms. Based on the results of the reference genome mapping, redundant isoforms were collapsed, generating 9427, 11,639, and 7858 unique isoforms in SPA, NH, and DH, respectively (Table 2). The remaining unmapped sequences were utilized to construct the “fake genome” to collapse redundant isoforms, yielding 466, 752, and 611 unique isoforms in SPA, NH, and DH, respectively (Table 2). Only a fraction of transcripts remained unmapped after collapsing redundant isoforms with a two-step strategy (Table 2). Finally, all unique isoforms were merged using CD-HIT to further collapse redundant isoforms, generating 9872, 12,382, and 8508 isoforms with average lengths of 1798.9, 1663.6, and 1685.5 bp for SPA, NH, and DH, respectively (Table 2). In general, a two-step collapsing strategy generates more unique isoforms in comparison with collapsing redundant isoforms based only on the reference genome.

3.3. Evaluation of Reconstructed Transcriptomes

The completeness and quality of the transcriptomes for SPA, NH, and DH are important prerequisites for further analysis. In this study, both completeness and characteristics analyses of the reconstructed transcriptomes for SPA, NH, and DH were performed, and the results are shown in Figure 1. The BUSCO assessment showed that the numbers of complete and single-copy transcripts were 328 (32.4%), 302 (30.0%), and 315 (31.1%), duplicated transcripts were 161 (15.9%), 124 (12.2%), and 76 (7.5%), fragmented transcripts were 14 (1.4%), 19 (1.9%), and 16 (1.6%), and missing transcripts were 510 (50.3%), 568 (56.1%), and 606 (59.8%) for SPA, NH, and DH, respectively (Figure 1A). When overlapping transcripts across these three reconstructed transcriptomes were examined, the number of overlapping transcript isoforms was 6983 for DH compared with SPA, 7087 for DH compared with NH, and 8823 for SPA compared with NH (Figure 1B). The number of unique transcript isoforms was 1525 or 1421 when DH was compared to NH or SPA, 5399 or 3559 when NH was compared to DH or SPA, and 2785 or 1421 when SPA was compared to DH or NH (Figure 1B). In short, the number and diversity of transcript isoforms in NH samples were greater than in the other samples, and most transcript isoforms of DH (82%) and SPA (89%) were similar to those of NH. Moreover, these reconstructed transcriptomes had numerous isoforms, with the number of transcripts with more than two isoforms being 1756, 2050, and 1393 in SPA, NH, and DH, respectively (Figure 1C). In comparison to the reference genome annotation, a total of 1992, 2781, and 2566 potentially novel isoforms (coded as j) and a total of 3032, 4171, and 3297 unknown isoforms (coded as u) were annotated in the reconstructed transcriptomes of DH, NH, and SPA, respectively, showing that the reconstructed transcriptomes contain many novel isoforms or transcripts. In addition, the number of transcript isoforms annotated as other same-strand overlaps with reference exons (coded as o) was 469 (3.8%) in NH, which was higher than in DH (235, 2.8%) and SPA (240, 2.5%).

3.4. Functional Annotation

To obtain the biological context of the reconstructed transcripts, functional annotation was carried out for SPA, NH, and DH, respectively. Before functional annotation, a total of 8207, 9126, and 6323 ORFs were retrieved and selected from full-length transcripts in SPA, NH, and DH, respectively. That is, approximately 83.1%, 73.7%, and 74.3% of the transcript isoforms were potential protein-encoding segments of SPA, NH, and DH, respectively. The functional annotation results showed that the number of annotated transcript isoforms acquired from various databases varied, ranging from 5334 to 7856 in SPA (Figure 2A), 5273 to 8177 in NH (Figure 2B), and 3980 to 6023 in DH (Figure 2C). The number of transcript isoforms annotated at least once by Swiss-Prot, TrEMBL, Uniref90 or NR were 7871, 8210, and 6040 in SPA, NH, and DH, respectively (Figure 2D–F). The number of overlapping annotated transcript isoforms among Swiss-Prot, TrEMBL, Uniref90, and NR were 6163, 5823, and 4562 in SPA, NH, and DH, respectively (Figure 2D–F). In addition, transcript isoforms were classified into several COG categories to acquire a better understanding of gene functions enriched in SPA, NH, and DH (Figure 2G). The proportion of signal transduction mechanisms (COG category T) and cytoskeleton (COG category Z) in DH was comparable to that of NH but greater than SPA. NH also had a smaller proportion of carbohydrate transport and metabolism (COG category G) than DH and SPA. However, a contrasting pattern was observed in extracellular structures (COG category W) (Table S2).

3.5. Alternative Splicing (AS) Events

AS events play critical roles in the transcriptome diversity and complexity of eukaryotes. To explore the different AS events in purebred and crossbred mud crabs, AS analyses were performed in SPA, NH, and DH (Figure 3). The results revealed that the trend of the proportion of AS event types was similar between NH and SPA, with RI (retained intron) being the most abundant, followed by A3 (alternative 3′ splice site), A5 (alternative 5′ splice site), AF (alternative first exon), SE (skipped exon), MX (mutually exclusive exon), and AL (alternative last exon) (Figure 3A). In DH, however, the proportion of A5 was lower than that of SE and AF (Figure 3A). Additionally, the total number of AS events in DH (1224) fell well below that in NH (1784) and SPA (1802). A total of 215, 232, and 233 overlapping AS events were found for DH compared with NH, DH compared with SPA, and SPA compared with NH, respectively, and 132 of these were common AS events among SPA, NH, and DH (Figure 3B). Moreover, there were 909, 1468, and 1486 unique AS events detected in DH, NH, and SPA, respectively (Figure 3B, Table S3). To gain an insight into the function of these AS events, gene ontology (GO) enrichment analyses of all genes with AS events were performed in SPA, NH, and DH. In SPA, six of the top seven significant biological process (BP) terms obtained from the GO enrichment analysis were related to metabolic processes such as the NAD metabolic process, NADH metabolic process, pyridine-containing compound metabolic process, regulation of the viral process, pyridine nucleotide metabolic process, and nicotinamide nucleotide metabolic process (Figure 3C). However, for NH and DH, the top ten significant BP categories were mainly muscle-related, including muscle cell differentiation, muscle contraction, and the muscle system process (Figure 3D,E).
In addition, when the top ten significant BP categories in NH and DH were examined, we found that chitin-based cuticle development and cuticle development were significantly enriched only in NH, suggesting that these processes may potentially contribute to abnormal eyestalks (Figure 3D,E).

3.6. Differential Alternative Splicing (DAS) Events, Differential Expressed Transcripts (DETs), and Their Enrichment Analysis

To identify and understand the potential implications of eyestalk displacement on the novel hybrid mud crab, both DAS event and DET analyses between NH and DH were performed in this study. In the DAS event analysis, a total of 37 significant DAS events were found between NH and DH, with 17 up-regulated and 20 down-regulated in DH (Figure 4A). All these significant AS events were expressed by 28 protein-coding genes and included 8 A3, 7 A5, 1 AF, and 20 RI events (Table S4). The top three significant DAS genes experienced RI events, including PB.2110 (endoplasmic reticulum chaperone BiP), PC.398 (troponin T), and PB.5116 (signal peptidase complex subunit 1-like). More detailed information is provided in Table S4. In the DET analysis, a total of 1475 significant DETs were found between NH and DH, comprising 492 up-regulated and 983 down-regulated transcript isoforms in DH (Figure 4B). The top ten significant transcripts based on p-values were PB.4294.2 (leucine-rich repeat protein lrrA-like isoform X2), PB.2110.3 (endoplasmic reticulum chaperone BiP), PB.821.4 (mite allergen Der f 3), PB.1808.5 (xylose isomerase-like isoform X2), PB.1547.11 (myosin heavy chain, muscle), PB.4122.4 (lysosomal Pro-X carboxypeptidase), PB.68.20 (cryptocyanin 1, partial), PB.5911.24 (hypothetical protein FQN60_009376), PB.2165.1 (mitochondrial-processing peptidase subunit alpha), and PB.1741.15 (aconitate hydratase, mitochondrial-like). However, the top ten significant transcripts based on fold change were PB.768.16, PB.1546.30 (myosin heavy chain, muscle), PB.3351.46 (tropomyosin slow-tonic isoform), PB.5762.10 (epidermal growth factor receptor kinase substrate 8), PB.3351.61 (tropomyosin slow-tonic isoform), PB.5203.12 (hypothetical protein GWK47_043851), PB.4579.4 (pro-resilin), PC.514.1 (glutathione S-transferase D7), PB.4574.4 (pro-resilin), and PB.753.1.
Among these, PB.5762.10 (epidermal growth factor receptor kinase substrate 8), PB.4579.4 (pro-resilin), and PB.4574.4 (pro-resilin) were closely linked to epidermal development. More detailed information can be found in Table S5. Furthermore, between the significant DAS events and DETs, there were 21 overlapping and seven non-overlapping genes (Tables S4 and S5). To gain further insight into their underlying molecular mechanisms, GO and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were performed using both DAS events and DETs. We found that the majority of the top ten significantly enriched GO terms were related to the cuticle or chitin, including cuticle development, chitin-based cuticle development, structural constituent of cuticle, structural constituent of chitin-based larval cuticle, structural constituent of chitin-based cuticle, and chitin binding (Figure 4C and Table S6). Furthermore, we found that these DAS events and DETs were primarily involved in RNA polymerase, protein digestion and absorption, exosome [BR:ko04147], peptidases and inhibitors [BR:ko01002], DNA repair and recombination proteins [BR:ko03400], and transcription machinery [BR:ko03021] (Figure 4D and Table S7). Compared with the other pathways, the exosome pathway had the most genes (80), with a significant enrichment level (adjusted p-value = 0.0001).
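The logic behind an over-representation p-value for a GO term or KEGG category can be illustrated with a minimal Python sketch. It assumes the one-sided hypergeometric test that clusterProfiler is documented to use for this kind of enrichment; the gene counts in the example are invented and are not taken from Tables S6 or S7.

```python
from math import comb


def hypergeom_enrichment_p(n_background, n_annotated, n_selected, n_overlap):
    """One-sided over-representation p-value, P(X >= n_overlap).

    n_background: annotated genes in the background set
    n_annotated:  background genes carrying the term of interest
    n_selected:   genes in the list being tested (e.g., genes with DETs)
    n_overlap:    genes in the list that carry the term
    """
    upper = min(n_annotated, n_selected)
    total = comb(n_background, n_selected)
    tail = sum(
        comb(n_annotated, k) * comb(n_background - n_annotated, n_selected - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total


if __name__ == "__main__":
    # Toy numbers: 10 background genes, 4 carry the term, 3 genes selected,
    # 2 of the selected genes carry the term.
    print(hypergeom_enrichment_p(10, 4, 3, 2))
```

In practice the resulting p-values are usually corrected for multiple testing across all terms, which is what the adjusted p-value quoted for the exosome category reflects.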

3.7. Validation of Alternative Splicing (AS) Events and Significant Differentially Expressed Transcripts (DETs)

To validate the AS events predicted by SMRT sequencing, two genes with SE events (PB.3871 and PB.5095) and one gene with an RI event (PB.293) were randomly selected for reverse transcription PCR (RT-PCR). The primers were designed in overlapping exons for the genes with SE events and in the retained intron for the gene with the RI event. The results of agarose gel electrophoresis showed that these AS events were present in DH, NH, and SPA, and that PB.5095 had a distinct transcript isoform in NH (Figure 5A). To validate the Illumina sequencing results, four genes (PB.3297, PB.3654, PB.2760 and PB.480) from the differential transcript expression analysis between NH and DH were selected for quantitative real-time PCR (qRT-PCR) (Figure 5B). The results showed that PB.3297 had a similar expression level in DH and NH (p = 0.84), the expression level of PB.3654 was significantly higher in DH than in NH (p = 0.0001), and the expression levels of PB.2760 and PB.480 were significantly lower in DH than in NH (p = 0.01 and 0.02, respectively) (Figure 5B and Table S1). In general, the expression patterns from the qRT-PCR results were consistent with the Illumina sequencing results.
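For readers unfamiliar with how relative expression is typically derived from qRT-PCR data, the following Python sketch shows the 2^(-ΔΔCt) calculation described in reference [38]. It is an illustration under stated assumptions: that this method applies to the comparison above, that a single reference gene is used, and that the Ct values are invented for the example. The function name `relative_expression` is hypothetical.

```python
from statistics import mean


def relative_expression(ct_target_test, ct_ref_test, ct_target_ctrl, ct_ref_ctrl):
    """Fold change of a target gene in the test group relative to the control
    group, normalised to a reference gene, using the 2^(-ddCt) method."""
    d_ct_test = mean(ct_target_test) - mean(ct_ref_test)  # dCt in test group
    d_ct_ctrl = mean(ct_target_ctrl) - mean(ct_ref_ctrl)  # dCt in control group
    dd_ct = d_ct_test - d_ct_ctrl
    return 2 ** (-dd_ct)


# Hypothetical Ct values (three technical replicates each), not measured data.
fold = relative_expression(
    ct_target_test=[24.0, 24.0, 24.0],  # target gene, DH
    ct_ref_test=[18.0, 18.0, 18.0],     # reference gene, DH
    ct_target_ctrl=[26.0, 26.0, 26.0],  # target gene, NH
    ct_ref_ctrl=[18.0, 18.0, 18.0],     # reference gene, NH
)
print(fold)  # 4.0
```

A fold value above 1 corresponds to higher expression in the test group (here DH), and a value below 1 to lower expression.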

4. Discussion

Interspecific hybridization, which is well known for developing hybrid offspring with greater biomass, speed of development, and fertility, is considered one of the effective breeding technologies in aquatic animals [11]. In our earlier work, we successfully developed a novel hybrid mud crab (S. serrata ♀ × S. paramamosain ♂) to obtain high-quality hybrid offspring for genetic improvement [13,14]. Meanwhile, we found that the eyestalks of some F1 hybrid offspring were displaced at crablet stage I. However, the genetic mechanisms of eyestalk displacement and its potential impact on the physiological development of the novel interspecific hybrid crab (S. serrata ♀ × S. paramamosain ♂) remain unclear. In this study, we constructed PacBio and Illumina HiSeq libraries to reconstruct high-quality transcriptomes and to detect and analyze novel genes, transcript isoforms, and AS events among SPA, NH, and DH. Moreover, DAS and DET analyses between NH and DH, together with their enrichment analyses, were performed to examine the genetic mechanisms of eyestalk displacement and its potential implications on the novel interspecific hybrid crab.

4.1. Reconstructed Transcriptomes Based on the Non-Hybrid Correction Methods and Two-Step Collapsing Strategy

SMRT sequencing data have been widely used to reconstruct transcriptomes and improve genome annotation in several species [26,39,40]. In comparison to NGS technology, SMRT sequencing can precisely capture each full-length transcript, which eliminates assembly errors and enables the identification of novel transcript isoforms and AS events [41,42]. Although SMRT sequencing has proved effective in capturing transcript structure, it has two shortcomings: a high sequencing error rate (approximately 15%) and low sequencing throughput [43,44]. Previously, the hybrid correction method (combining the strengths of SMRT and Illumina RNA sequencing) was commonly used to correct the high SMRT sequencing error rate [45]. More recently, numerous non-hybrid correction methods that exclusively use long reads have been proposed for SMRT sequencing data processing [46]. In this study, the non-hybrid correction method was used to correct SMRT sequencing errors, and only 14, 12, and 9 low-quality isoforms remained in SPA, NH, and DH, respectively (Table 1), indicating that the non-hybrid correction method was also suitable for SMRT sequencing error correction. Even after polishing, the set of high-quality isoforms still contained a large number of redundant transcript isoforms. Therefore, to obtain high-quality transcriptomes of SPA, NH, and DH, a two-step collapsing strategy was deployed to collapse redundant high-quality isoforms. In comparison to collapsing isoforms with the reference genome only, this strategy might boost the coverage and diversity of the reconstructed transcriptomes (Table 2). Moreover, a total of 9872, 12,382, and 8508 isoforms with N50 values of 2079, 2041, and 2161 bp were obtained for SPA, NH, and DH, respectively (Table 2), which were longer than those in previous studies on S. paramamosain that reconstructed transcriptomes using NGS technologies [47,48]. 
However, the mean length and N50 of the transcript isoforms from this study were much shorter than those in previous SMRT sequencing studies in mud crabs (S. paramamosain) [26,49,50]. One potential reason is the use of the two-step strategy for collapsing redundant transcript isoforms, which resulted in the retention of many short transcript isoforms (Table 2). Differences in sampling stages and tissues among these studies may be another reason, because gene expression is usually tissue-specific and spatiotemporally specific [51,52]. In summary, both the non-hybrid correction method and the two-step collapsing strategy were suitable for SMRT sequencing data processing and would provide more distinct isoforms.
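Because N50 is the main length statistic compared here, a minimal Python sketch of its standard definition may be helpful: the length L such that isoforms of length L or longer account for at least half of the total bases. The function name `n50` and the toy lengths are illustrative and are not taken from Table 2.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length at which the cumulative sum reaches half the total.
        if running >= half_total:
            return length
    raise ValueError("n50() requires at least one length")


# Toy example: total = 1500, half = 750; 500 + 400 = 900 >= 750.
print(n50([100, 200, 300, 400, 500]))  # 400
```

This also shows why retaining many short isoforms lowers the statistic: they add to the total length while contributing little to the cumulative sum of the longest sequences.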

4.2. The Genetic Mechanisms of Eyestalk Displacement and Its Potential Implications on the Novel Interspecific Hybrid Crab

The eyestalk is a key organ in crustaceans that produces neurohormones and regulates a range of physiological functions. In this study, both DAS event and DET analyses were deployed to explore the genetic mechanisms of eyestalk displacement and its potential implications on the novel interspecific hybrid crab. The most significant DET was annotated as leucine-rich repeat protein lrrA-like isoform X2. Leucine-rich repeats are protein interaction motifs of 20 to 29 residues that include a high proportion of leucine residues [53]. Most studies on leucine-rich repeat motif-containing proteins in crustaceans, such as S. serrata [54], Penaeus monodon [55], and Litopenaeus vannamei [56], have focused on the immunological response. In other species, leucine-rich repeat motif-containing proteins are also involved in cytoskeleton remodeling (Dictyostelium discoideum) [57], cell morphogenesis (fission yeast) [58], and segment morphogenesis (Drosophila) [59]. We therefore infer that the leucine-rich repeat protein lrrA-like isoform X2 may play a critical role in eyestalk displacement of the interspecific hybrid crab.
The most significant DAS event was annotated as endoplasmic reticulum chaperone binding immunoglobulin protein (BiP), also known as glucose regulatory protein 78 (GRP78) or HSPA5. It is the major family member of heat shock protein 70 (Hsp70) and is required for protein folding and quality control in the endoplasmic reticulum [60]. BiP may stimulate the folding of freshly synthesized polypeptides and repair misfolded proteins to avoid their aggregation in the endoplasmic reticulum [61]. These findings suggest that the down-regulated expression of endoplasmic reticulum chaperone BiP may result in the aggregation of misfolded proteins and hence may lead to eyestalk displacement. In addition, previous studies showed that BiP plays an important role in the response to environmental stress [62,63] and in immune function [64,65], suggesting that the down-regulated expression of endoplasmic reticulum chaperone BiP may impact adaptability and disease resistance in DH. In the enrichment analysis, we found that most of the top ten significantly enriched GO terms were linked to cuticle-related or chitin-related functions. Similar results were also found in our previous study using whole-transcriptome RNA sequencing [15]. Cuticle protein is considered the major component of the exoskeleton in crustaceans. Similarly, chitin is a key component of the cuticle that protects from external threats. A previous study has shown that cuticle-related genes play an essential role in normal wing morphogenesis in the migratory locust [66]. Therefore, the expression of cuticle-related or chitin-related genes, such as chitinase 7, chitin synthase, cuticle protein 21, cuticle protein 7-like, and early cuticle protein 2 (Table S5), may play an essential role in eyestalk displacement. In addition, the downregulation of cuticle-related or chitin-related genes in DH would affect molting and, consequently, growth and mating. 
One possible reason is that cuticular chitin synthase and chitinase are involved in the degradation of the old cuticle and the synthesis of the new cuticle during molting [67,68]. Moreover, apart from the top significantly enriched pathways involved in transcription, protein synthesis, protein transportation, and protein degradation (Figure 4D and Table S7), the phototransduction pathway was also significantly enriched in the KEGG enrichment analysis (p = 0.006), suggesting that eyestalk displacement would affect phototransduction function in DH. This is consistent with the eyestalk being an essential phototransduction organ that receives light signals in crustaceans [69]. Overall, our findings provide valuable insights into the genetic mechanisms of eyestalk displacement and its potential implications on the novel interspecific hybrid crab, which would help the genetic improvement of the mud crab (S. paramamosain) through interspecific hybridization.
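The enrichment p-values discussed above come from over-representation analysis. As a rough illustration only, and assuming a test of the hypergeometric kind that is commonly used for GO and KEGG over-representation, the Python sketch below computes the probability of observing at least k annotated genes among n selected genes. The function name `hypergeom_pvalue` and all counts in the example are hypothetical; they are not values from Tables S6 or S7, and the authors' exact settings (background set, multiple-testing correction) are not reproduced here.

```python
from math import comb


def hypergeom_pvalue(k, n, K, N):
    """P(X >= k) for X ~ Hypergeometric(N, K, n).

    N: genes in the background set
    K: background genes annotated to the term or pathway
    n: genes in the selected (e.g., differential) set
    k: selected genes annotated to the term or pathway
    """
    upper = min(n, K)
    favourable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favourable / comb(N, n)


# Hypothetical counts: 12 of 100 selected genes hit a term that covers
# 50 of 1000 background genes (expected by chance: 5).
p = hypergeom_pvalue(k=12, n=100, K=50, N=1000)
print(f"p = {p:.4g}")
```

In practice such raw p-values are adjusted for the number of terms tested, which is why adjusted p-values are reported alongside the enriched pathways.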

4.3. The Transcriptomes Difference between Purebred (SPA) and Crossbred (NH and DH) Mud Crabs

The completeness and quality of the reconstructed transcriptome are an important prerequisite for revealing the genetic changes in novel interspecific hybrid crabs. In this study, the complete transcripts in BUSCO ranged from 38.7% to 48.4% for SPA, NH, and DH (Figure 1A), whereas the reference transcriptome was ~72.3% [28]. Moreover, the proportion of complete transcripts in BUSCO was 82.2% higher than in the previous SMRT sequencing-based transcriptome in mud crabs (S. paramamosain) [49]. The high percentage of missing transcripts in this study might be attributed to tissue specificity and spatiotemporal specificity [51,52], because all the samples originated from the same life stage (the first stage of the crablet). In addition, the proportion of encoded fragmented proteins in SPA, NH, and DH ranged from 1.4% to 1.9% (Figure 1A), which was lower than in the reference transcriptome (3.0%) [28] and the previous SMRT sequencing-based transcriptome (3.2%) in mud crabs (S. paramamosain) [49], indicating the high quality of SMRT sequencing and data processing in this study. The characteristics analysis showed a high number of isoforms in these three reconstructed transcriptomes, especially in hybrid crabs (Figure 1C). The proportion of transcripts with more than two isoforms in purebred mud crabs (57.7% in SPA) was lower than in the hybrid crabs (67.9% in DH and 75.5% in NH), highlighting a high degree of transcriptome complexity in the hybrids (Figure 1C). Another study reported that a high degree of transcriptome complexity was the genetic basis of yield heterosis [70]. However, more abundant isoforms in hybrid crabs did not result in more alternative splicing events (Figure 3B), which contrasts with a recent study reporting that more alternative splicing events were detected in hybrids under heat stress and may contribute to heterosis in abalone [71]. 
These findings suggest that, in our study, hybridization might increase the isoforms of certain genes while decreasing the isoforms of others. For example, the number of genes with more than six isoforms was 351 in NH, far more than in SPA (191) (Figure 1C). Furthermore, the comparison between the reconstructed transcriptomes and the reference genome annotation indicated that the reconstructed transcriptomes contain more potentially novel or unknown transcript isoforms (Figure 1D). These results suggest that genes with increasing or decreasing isoforms may play an important role in the heterosis of hybrid crabs.

5. Conclusions

The use of SMRT and Illumina RNA sequencing to reconstruct transcriptomes based on the two-step collapsing strategy is an efficient way to identify candidate genes in the novel interspecific hybrid crab (S. serrata ♀ × S. paramamosain ♂). With the aim of disentangling the mechanisms related to eyestalk displacement, a total of 37 significant DAS events (17 up-regulated and 20 down-regulated) and 1475 significant DETs (492 up-regulated and 983 down-regulated) in DH were identified by differential expression analysis based on the reconstructed transcriptome. The most significant DAS event and DET were annotated as endoplasmic reticulum chaperone BiP and leucine-rich repeat protein lrrA-like isoform X2, respectively. Furthermore, the majority of the top ten significant GO terms were associated with cuticle or chitin development, suggesting that the expression of cuticle-related or chitin-related genes plays an essential role in eyestalk displacement. Overall, our findings provide valuable insights into the genetic mechanisms of eyestalk displacement and its potential impacts on the novel interspecific hybrid crab, which would help the genetic improvement of mud crabs (Scylla spp.) through interspecific hybridization.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/biology11071026/s1, Table S1: Primer information for RT-PCR and qRT-PCR; Table S2: The detailed information of gene function annotation in different samples; Table S3: The detailed information of alternative splicing (AS) events in different samples; Table S4: The detailed information of differential alternative splicing (DAS) events between DH and NH; Table S5: The detailed information of differential expressed transcripts (DETs) between DH and NH; Table S6: Detailed results of GO analysis; Table S7: Detailed results of KEGG analysis.

Author Contributions

S.Y. participated in the experimental design, investigations, data analyses, and interpretation, and drafted the manuscript. X.Y., H.C., Y.Z., Q.W., H.T. and J.S. participated in the experimental design, investigations, and sample collection. A.F. and H.S.A.S. participated in language editing. M.I. participated in the manuscript modification. H.M. participated in the experimental design and manuscript modification, and provided all materials and reagents. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Leading Talent Project of Special Support Plan of Guangdong Province (2019TX05N067), the STU Scientific Research Foundation for Talents (No. NTF21023), the National Natural Science Foundation of China (42076133), and the Special Projects in Key Fields of Colleges and Universities in Guangdong Province (2020ZDZX1001).

Institutional Review Board Statement

Ethical review and approval were waived for this study, due to the animal subjects used in the present study being crabs, which are invertebrates and are exempt from this requirement. S. paramamosain is not an endangered or protected species. No specific permissions are required to work with invertebrates in China. All animal work has been conducted according to the relevant national and international guidelines.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The SMRT sequencing data of SPA, NH, and DH can be found in NCBI under the accession numbers SRX9982023, SRX10000714, and SRX10000713, respectively. The RNA-seq data are also stored in NCBI under the accession number PRJNA805205. In addition, the reconstructed transcriptomes of SPA, NH, DH, and the novel interspecific hybrid crab have been uploaded to the figshare repository (https://doi.org/10.6084/m9.figshare.19144988.v1, accessed on 9 February 2022).

Acknowledgments

We very much appreciate the feedback from the editor and two anonymous reviewers, whose useful suggestions and thoughtful comments helped us to improve the manuscript considerably.

Conflicts of Interest

The authors declare that they have no conflict of interest with the contents of this article. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Waiho, K.; Fazhan, H.; Quinitio, E.T.; Baylon, J.C.; Fujaya, Y.; Azmie, G.; Wu, Q.; Shi, X.; Ikhwanuddin, M.; Ma, H. Larval rearing of mud crab (Scylla): What lies ahead. Aquaculture 2018, 493, 37–50.
  2. Ma, H.; Ma, L.; Ma, C.; Cui, H. Novel Polymorphic Microsatellite Markers in Scylla paramamosain and Cross-Species Amplification in Related Crab Species. J. Crustacean Biol. 2010, 30, 441–444.
  3. Fazhan, H.; Waiho, K.; Quinitio, E.; Baylon, J.C.; Fujaya, Y.; Rukminasari, N.; Azri, M.F.D.; Shahreza, M.S.; Ma, H.; Ikhwanuddin, M. Morphological descriptions and morphometric discriminant function analysis reveal an additional four groups of Scylla spp. PeerJ 2020, 8, e8066.
  4. Farhadi, A.; Fang, S.; Zhang, Y.; Cui, W.; Fang, H.; Ikhwanuddin, M.; Ma, H. The significant sex-biased expression pattern of Sp-Wnt4 provides novel insights into the ovarian development of mud crab (Scylla paramamosain). Int. J. Biol. Macromol. 2021, 183, 490–501.
  5. Waiho, K.; Shi, X.; Fazhan, H.; Li, S.; Zhang, Y.; Zheng, H.; Liu, W.; Fang, S.; Ikhwanuddin, M.; Ma, H. High-density genetic linkage maps provide novel insights into ZW/ZZ sex determination system and growth performance in mud crab (Scylla paramamosain). Front. Genet. 2019, 10, 298.
  6. Shi, X.; Waiho, K.; Li, X.; Ikhwanuddin, M.; Miao, G.; Lin, F.; Zhang, Y.; Li, S.; Zheng, H.; Liu, W.; et al. Female-specific SNP markers provide insights into a WZ/ZZ sex determination system for mud crabs Scylla paramamosain, S. tranquebarica and S. serrata with a rapid method for genetic sex identification. BMC Genom. 2018, 19, 981.
  7. Shi, X.; Lu, J.; Wu, Q.; Waiho, K.; Aweya, J.J.; Fazhan, H.; Zhang, Y.; Li, S.; Zheng, H.; Lin, F.; et al. Comparative analysis of growth performance between female and male mud crab Scylla paramamosain crablets: Evidences from a four-month successive growth experiment. Aquaculture 2019, 505, 351–362.
  8. Cui, W.; Fang, S.; Lv, L.; Huang, Z.; Lin, F.; Wu, Q.; Zheng, H.; Li, S.; Zhang, Y.; Ikhwanuddin, M.; et al. Evidence of Sex Differentiation Based on Morphological Traits During the Early Development Stage of Mud Crab Scylla paramamosain. Front. Vet. Sci. 2021, 8, 712942.
  9. Wu, Q.; Shi, X.; Fang, S.; Xie, Z.; Guan, M.; Li, S.; Zheng, H.; Zhang, Y.; Ikhwanuddin, M.; Ma, H. Different biochemical composition and nutritional value attribute to salinity and rearing period in male and female mud crab Scylla paramamosain. Aquaculture 2019, 513, 734417.
  10. Wu, Q.; Waiho, K.; Huang, Z.; Li, S.; Zheng, H.; Zhang, Y.; Ikhwanuddin, M.; Lin, F.; Ma, H. Growth performance and biochemical composition dynamics of ovary, hepatopancreas and muscle tissues at different ovarian maturation stages of female mud crab, Scylla paramamosain. Aquaculture 2020, 515, 734560.
  11. Wang, S.; Tang, C.; Tao, M.; Qin, Q.; Zhang, C.; Luo, K.; Zhao, R.; Wang, J.; Ren, L.; Xiao, J.; et al. Establishment and application of distant hybridization technology in fish. Sci. China Life Sci. 2019, 62, 22–45.
  12. Chen, Z.J. Molecular mechanisms of polyploidy and hybrid vigor. Trends Plant Sci. 2010, 15, 57–71.
  13. Ma, H.; Wu, Q.; Tan, H.; Lin, F. Establishment of Inter-Specific Hybridization Technique and Identification of Phenotypic and Genotypic Characters of Hybrids in Mud Crab (Scylla paramamosain and S. serrata). J. Shantou Univ. 2021, 36, 59–66.
  14. Cui, W.; Guan, M.; Sadek, M.A.; Wu, F.; Wu, Q.; Tan, H.; Shi, X.; Ikhwanuddin, M.; Ma, H. Construction of a genetic linkage map and QTL mapping for sex indicate the putative genetic pattern of the F1 hybrid Scylla (Scylla serrata ♀ × S. paramamosain ♂). Aquaculture 2021, 545, 737222.
  15. Farhadi, A.; Lv, L.; Song, J.; Zhang, Y.; Ye, S.; Zhang, N.; Zheng, H.; Li, S.; Zhang, Y.; Ikhwanuddin, M.; et al. Whole-transcriptome RNA sequencing revealed the roles of chitin-related genes in the eyestalk abnormality of a novel mud crab hybrid (Scylla serrata ♀ × S. paramamosain ♂). Int. J. Biol. Macromol. 2022, 208, 611–626.
  16. Han, Z.; Li, X.; Li, X.; Xu, W.; Li, Y. Circadian rhythms of melatonin in haemolymph and optic lobes of Chinese mitten crab (Eriocheir sinensis) and Chinese grass shrimp (Palaemonetes sinensis). Biol. Rhythm Res. 2018, 50, 400–407.
  17. Li, Y.; Han, Z.; She, Q.; Zhao, Y.; Wei, H.; Dong, J.; Xu, W.; Li, X.; Liang, S. Comparative transcriptome analysis provides insights into the molecular basis of circadian cycle regulation in Eriocheir sinensis. Gene 2019, 694, 42–49.
  18. Mykles, D.L.; Chang, E.S. Hormonal control of the crustacean molting gland: Insights from transcriptomics and proteomics. Gen. Comp. Endocrinol. 2020, 294, 113493.
  19. Magana-Gallegos, E.; Arevalo, M.; Cuzon, G.; Gaxiola, G. Effects of using the biofloc system and eyestalk ablation on reproductive performance and egg quality of Litopenaeus vannamei (Boone, 1931) (Decapoda: Dendrobranchiata: Penaeidae). Anim. Reprod. Sci. 2021, 228, 106749.
  20. Chang, E.S. Comparative Endocrinology of Molting and Reproduction: Insects and Crustaceans. Annu. Rev. Entomol. 1993, 38, 161–180.
  21. Amankwah, B.K.; Wang, C.; Zhou, T.; Liu, J.; Shi, L.; Wang, W.; Chan, S. Eyestalk Ablation, a Prerequisite for Crustacean Reproduction: A review. Isr. J. Aquac. Bamidgeh 2019, 71, 14.
  22. Nguyen, T.V.; Jung, H.; Rotllant, G.; Hurwood, D.; Mather, P.; Ventura, T. Guidelines for RNA-seq projects: Applications and opportunities in non-model decapod crustacean species. Hydrobiologia 2018, 825, 5–27.
  23. Pop, M.; Salzberg, S.L. Bioinformatics challenges of new sequencing technology. Trends Genet. 2008, 24, 142–149.
  24. Wenger, A.M.; Peluso, P.; Rowell, W.J.; Chang, P.-C.; Hall, R.J.; Concepcion, G.T.; Ebler, J.; Fungtammasan, A.; Kolesnikov, A.; Olson, N.D.; et al. Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome. Nat. Biotechnol. 2019, 37, 1155–1162.
  25. Zhang, X.; Li, G.; Jiang, H.; Li, L.; Ma, J.; Li, H.; Chen, J. Full-length transcriptome analysis of Litopenaeus vannamei reveals transcript variants involved in the innate immune system. Fish. Shellfish Immunol. 2019, 87, 346–359.
  26. Cui, W.; Yang, Q.; Zhang, Y.; Farhadi, A.; Fang, H.; Zheng, H.; Li, S.; Zhang, Y.; Ikhwanuddin, M.; Ma, H. Integrative Transcriptome Sequencing Reveals the Molecular Difference of Maturation Process of Ovary and Testis in Mud Crab Scylla paramamosain. Front. Mar. Sci. 2021, 8, 658091.
  27. Li, H. Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics 2018, 34, 3094–3100.
  28. Zhao, M.; Wang, W.; Zhang, F.; Ma, C.; Liu, Z.; Yang, M.H.; Chen, W.; Li, Q.; Cui, M.; Jiang, K.; et al. A chromosome-level genome of the mud crab (Scylla paramamosain Estampador) provides insights into the evolution of chemical and light perception in this crustacean. Mol. Ecol. Resour. 2021, 21, 1299–1317.
  29. Huang, Y.; Niu, B.; Gao, Y.; Fu, L.; Li, W. CD-HIT Suite: A web server for clustering and comparing biological sequences. Bioinformatics 2010, 26, 680–682.
  30. Manni, M.; Berkeley, M.R.; Seppey, M.; Simao, F.A.; Zdobnov, E.M. BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes. Mol. Biol. Evol. 2021, 38, 4647–4654.
  31. Kriventseva, E.V.; Kuznetsov, D.; Tegenfeldt, F.; Manni, M.; Dias, R.; Simão, F.A.; Zdobnov, E.M. OrthoDB v10: Sampling the diversity of animal, plant, fungal, protist, bacterial and viral genomes for evolutionary and functional annotations of orthologs. Nucleic Acids Res. 2018, 47, 807–811.
  32. Pertea, G.; Pertea, M. GFF Utilities: GffRead and GffCompare. F1000Research 2020, 9, 304.
  33. Huerta-Cepas, J.; Forslund, K.; Coelho, L.P.; Szklarczyk, D.; Jensen, L.J.; von Mering, C.; Bork, P. Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper. Mol. Biol. Evol. 2017, 34, 2115–2122.
  34. Trincado, J.L.; Entizne, J.C.; Hysenaj, G.; Singh, B.; Skalic, M.; Elliott, D.J.; Eyras, E. SUPPA2: Fast, accurate, and uncertainty-aware differential splicing analysis across multiple conditions. Genome Biol. 2018, 19, 40.
  35. Patro, R.; Duggal, G.; Love, M.I.; Irizarry, R.A.; Kingsford, C. Salmon provides fast and bias-aware quantification of transcript expression. Nat. Methods 2017, 14, 417–419.
  36. Love, M.I.; Huber, W.; Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014, 15, 550.
  37. Yu, G.; Wang, L.G.; Han, Y.; He, Q.Y. clusterProfiler: An R package for comparing biological themes among gene clusters. Omics-A J. Integr. Biol. 2012, 16, 284–287.
  38. Livak, K.J.; Schmittgen, T.D. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods 2001, 25, 402–408.
  39. Feng, S.; Xu, M.; Liu, F.; Cui, C.; Zhou, B. Reconstruction of the full-length transcriptome atlas using PacBio Iso-Seq provides insight into the alternative splicing in Gossypium australe. BMC Plant Biol. 2019, 19, 365.
  40. Ali, A.; Thorgaard, G.H.; Salem, M. PacBio Iso-Seq Improves the Rainbow Trout Genome Annotation and Identifies Alternative Splicing Associated with Economically Important Phenotypes. Front. Genet. 2021, 12, 683408.
  41. Wang, B.; Kumar, V.; Olson, A.; Ware, D. Reviving the Transcriptome Studies: An Insight into the Emergence of Single-Molecule Transcriptome Sequencing. Front. Genet. 2019, 10, 384.
  42. Sedlazeck, F.J.; Rescheneder, P.; Smolka, M.; Fang, H.; Nattestad, M.; von Haeseler, A.; Schatz, M.C. Accurate detection of complex structural variations using single-molecule sequencing. Nat. Methods 2018, 15, 461–468.
  43. Koren, S.; Schatz, M.C.; Walenz, B.P.; Martin, J.; Howard, J.T.; Ganapathy, G.; Wang, Z.; Rasko, D.A.; McCombie, W.R.; Jarvis, E.D.; et al. Hybrid error correction and de novo assembly of single-molecule sequencing reads. Nat. Biotechnol. 2012, 30, 693–700.
  44. Eid, J.; Fehr, A.; Gray, J.; Luong, K.; Lyle, J.; Otto, G.; Peluso, P.; Rank, D.; Baybayan, P.; Bettman, B.; et al. Real-time DNA sequencing from single polymerase molecules. Science 2009, 323, 133–138.
  45. Fu, S.; Wang, A.; Au, K.F. A comparative evaluation of hybrid error correction methods for error-prone long reads. Genome Biol. 2019, 20, 26.
  46. Zhang, H.; Jain, C.; Aluru, S. A comprehensive evaluation of long read error correction methods. BMC Genom. 2020, 21, 889.
  47. Yang, X.; Ikhwanuddin, M.; Li, X.; Lin, F.; Wu, Q.; Zhang, Y.; You, C.; Liu, W.; Cheng, Y.; Shi, X.; et al. Comparative Transcriptome Analysis Provides Insights into Differentially Expressed Genes and Long Non-Coding RNAs between Ovary and Testis of the Mud Crab (Scylla paramamosain). Mar. Biotechnol. 2018, 20, 20–34.
  48. Liu, S.; Chen, G.; Xu, H.; Zou, W.; Yan, W.; Wang, Q.; Deng, H.; Zhang, H.; Yu, G.; He, J.; et al. Transcriptome analysis of mud crab (Scylla paramamosain) gills in response to Mud crab reovirus (MCRV). Fish. Shellfish Immunol. 2017, 60, 545–553.
  49. Wan, H.; Jia, X.; Zou, P.; Zhang, Z.; Wang, Y. The Single-molecule long-read sequencing of Scylla paramamosain. Sci. Rep. 2019, 9, 12401.
  50. Lin, J.L.; Shi, X.; Fang, S.B.; Zhang, Y.; You, C.H.; Ma, H.Y.; Lin, F. Comparative transcriptome analysis combining SMRT and NGS sequencing provides novel insights into sex differentiation and development in mud crab (Scylla paramamosain). Aquaculture 2019, 513, 734447.
  51. Johnson, B.R.; Atallah, J.; Plachetzki, D.C. The importance of tissue specificity for RNA-seq: Highlighting the errors of composite structure extractions. BMC Genom. 2013, 14, 586.
  52. Naumova, O.Y.; Lee, M.; Rychkov, S.Y.; Vlasova, N.V.; Grigorenko, E.L. Gene expression in the human brain: The current state of the study of specificity and spatiotemporal dynamics. Child. Dev. 2013, 84, 76–88.
  53. Ikegami, A.; Honma, K.; Sharma, A.; Kuramitsu, H.K. Multiple functions of the leucine-rich repeat protein LrrA of Treponema denticola. Infect. Immun. 2004, 72, 4619–4627.
  54. Vidya, R.; Makesh, M.; Purushothaman, C.S.; Chaudhari, A.; Gireesh-Babu, P.; Rajendran, K.V. Report of leucine-rich repeats (LRRs) from Scylla serrata: Ontogeny, molecular cloning, characterization and expression analysis following ligand stimulation, and upon bacterial and viral infections. Gene 2016, 590, 159–168.
  55. Sriphaijit, T.; Senapin, S. High expression of a novel leucine-rich repeat protein in hemocytes and the lymphoid organ of the black tiger shrimp Penaeus monodon. Fish. Shellfish Immunol. 2007, 22, 264–271.
  56. Zhang, H.; Li, S.; Wang, F.; Xiang, J.; Li, F. Identification and functional study of an LRR domain containing membrane protein in Litopenaeus vannamei. Dev. Comp. Immunol. 2020, 109, 103713.
  57. Liu, C.I.; Cheng, T.L.; Chen, S.Z.; Huang, Y.C.; Chang, W.T. LrrA, a novel leucine-rich repeat protein involved in cytoskeleton remodeling, is required for multicellular morphogenesis in Dictyostelium discoideum. Dev. Biol. 2005, 285, 238–251.
  58. Kume, K.; Kubota, S.; Koyano, T.; Kanai, M.; Mizunuma, M.; Toda, T.; Hirata, D. Fission yeast leucine-rich repeat protein Lrp1 is essential for cell morphogenesis as a component of the morphogenesis Orb6 network (MOR). Biosci. Biotechnol. Biochem. 2013, 77, 1086–1091.
  59. Graham, P.L.; Anderson, W.R.; Brandt, E.A.; Xiang, J.; Pick, L. Dynamic expression of Drosophila segmental cell surface-encoding genes and their pair-rule regulators. Dev. Biol. 2019, 447, 147–156.
  60. Wang, J.; Lee, J.; Liem, D.; Ping, P. HSPA5 Gene encoding Hsp70 chaperone BiP in the endoplasmic reticulum. Gene 2017, 618, 14–23.
  61. Gething, M.J. Role and regulation of the ER chaperone BiP. Semin. Cell Dev. Biol. 1999, 10, 465–472.
  62. Li, L.; Wang, P.; Zhao, C.; Qiu, L. The anti-stresses capability of GRP78 in Penaeus monodon: Evidence from in vitro and in vivo studies. Fish. Shellfish Immunol. 2018, 72, 132–142.
  63. Fan, L.; Wang, A.; Miao, Y.; Liao, S.; Ye, C.; Lin, Q. Comparative proteomic identification of the hepatopancreas response to cold stress in white shrimp, Litopenaeus vannamei. Aquaculture 2016, 454, 27–34.
  64. Luan, W.; Li, F.; Zhang, J.; Wang, B.; Xiang, J. Cloning and expression of glucose regulated protein 78 (GRP78) in Fenneropenaeus chinensis. Mol. Biol. Rep. 2009, 36, 289–298.
  65. Xi, X.-Z.; Ma, K.-S. Molecular cloning and expression analysis of glucose-regulated protein 78 (GRP78) gene in silkworm Bombyx mori. Biologia 2013, 68, 559–564.
  66. Zhao, X.; Gou, X.; Liu, W.; Ma, E.; Moussian, B.; Li, S.; Zhu, K.; Zhang, J. The wing-specific cuticular protein LmACP7 is essential for normal wing morphogenesis in the migratory locust. Insect Biochem. Mol. Biol. 2019, 112, 103206.
  67. Rocha, J.; Garcia-Carreño, F.L.; Muhlia-Almazán, A.; Peregrino-Uriarte, A.B.; Yépiz-Plascencia, G.; Córdova-Murueta, J.H. Cuticular chitin synthase and chitinase mRNA of whiteleg shrimp Litopenaeus vannamei during the molting cycle. Aquaculture 2012, 330–333, 111–115.
  68. Hardardottir, H.M.; Male, R.; Nilsen, F.; Eichner, C.; Dondrup, M.; Dalvin, S. Chitin synthesis and degradation in Lepeophtheirus salmonis: Molecular characterization and gene expression profile during synthesis of a new exoskeleton. Comp. Biochem. Physiol. A-Mol. Integr. Physiol. 2019, 227, 123–133.
  69. Barriga-Montoya, C.; de la O-Martínez, A.; Fuentes-Pardo, B.; Gomez-Lagunas, F. Desensitization and recovery of crayfish photoreceptors. Dependency on circadian time, and pigment-dispersing hormone. Comp. Biochem. Physiol. Part A Mol. Integr. Physiol. 2017, 203, 297–303.
  70. Paschold, A.; Jia, Y.; Marcon, C.; Lund, S.; Larson, N.B.; Yeh, C.T.; Ossowski, S.; Lanz, C.; Nettleton, D.; Schnable, P.S.; et al. Complementation contributes to transcriptome complexity in maize (Zea mays L.) hybrids relative to their inbred parents. Genome Reserch 2012, 22, 2445–2454. [Google Scholar] [CrossRef] [Green Version]
  71. Xiao, Q.; Huang, Z.; Shen, Y.; Gan, Y.; Wang, Y.; Gong, S.; Lu, Y.; Luo, X.; You, W.; Ke, C. Transcriptome analysis reveals the molecular mechanisms of heterosis on thermal resistance in hybrid abalone. BMC Genom. 2021, 22, 650. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Completeness and characteristics of the reconstructed transcriptomes. (A) BUSCO assessment results of the transcripts after collapsing redundant isoforms. The Y-axis represents the reconstructed transcriptome of each sample. Both the X-axis and the box colors represent the proportion of each category: complete and single-copy, complete and duplicated, fragmented, or missing. (B) Common and unique transcripts among the different transcriptomes. The diagonal, lower-triangle, and upper-triangle values are the number of transcript isoforms in the database, the number of unique transcript isoforms in the database (query sequences), and the number of common transcript isoforms between the query sequences and the database, respectively. (C) Distribution of gene loci by number of transcript isoforms in SPA, NH, and DH. The Y-axis represents the number of genes. The X-axis represents the number of isoforms per gene locus. (D) Comparison of the reconstructed transcriptomes with the reference genome annotation using gffcompare v0.12.2. The Y-axis represents the number of genes. The X-axis and the different colors represent the different categories.
Figure 2. Summary of gene functional annotation using different databases. (A–C) Statistics of isoform annotation results for SPA, NH, and DH, respectively, using different databases including NR, Uniref90, Swiss-Prot, TrEMBL, COG, GO, KEGG, and PFAMs. The Y-axis represents the number of annotated isoforms. The X-axis represents the different databases. (D–F) Venn diagrams showing the overlapping isoform annotation results obtained using the different databases for SPA, NH, and DH, respectively. (G) COG profiles of transcript isoforms in SPA, NH, and DH.
Figure 3. Summary of alternative splicing (AS) event profiling in SPA, NH, and DH. SPA, NH, and DH are indicated by different colors (red, yellow, and green, respectively). (A) The proportion of each AS type in SPA, NH, and DH. The Y-axis represents the proportion of each AS event type. The X-axis represents the AS event types: SE (skipped exon), MX (mutually exclusive exon), A5 (alternative 5′ splice site), A3 (alternative 3′ splice site), RI (retained intron), AF (alternative first exon), and AL (alternative last exon). (B) Common and specific AS events identified among SPA, NH, and DH. The barplots on the left represent the size of the SPA, NH, and DH datasets. Dots and vertical lines indicate the overlapping AS events in the respective comparison. Barplots in the top panel represent the number of AS events. (C–E) The top ten significant biological processes (BP) obtained from Gene Ontology (GO) enrichment analysis using genes with AS events in SPA, NH, and DH, respectively. The Y-axis represents the BP categories. The X-axis represents the corresponding −log10-transformed p-value.
Figure 4. Differential expression analysis and enrichment analysis between DH and NH. (A) Volcano plot of the minus log10-transformed p-values of AS events (Y-axis) against the corresponding difference in inclusion level (∆PSI) of each AS event (X-axis). The horizontal gray dotted line represents the significance threshold (0.05). The red, blue, and gray points represent up-regulated, down-regulated, and non-regulated AS events in the DH group, respectively. (B) Volcano plot of the minus log10-transformed p-values of genes (Y-axis) against the corresponding log2(|fold change|) of each gene (X-axis). (C) The top ten significant gene ontology (GO) terms obtained from GO enrichment analysis using genes with DAS events or DETs in SPA, NH, and DH, respectively. The Y-axis represents the GO term categories. The X-axis represents the proportion of significantly expressed genes in the list of corresponding GO terms (GeneRatio). The size and color of each circle represent the number of significantly expressed genes and the corresponding adjusted p-value of the GO term. (D) The top ten significant pathways obtained from KEGG enrichment analysis using genes with DAS events or DEGs in SPA, NH, and DH, respectively. The Y-axis represents the pathway categories. The X-axis represents the number of significantly expressed genes in the corresponding pathway. The colors represent the adjusted p-value of each pathway.
Figure 5. Validation of alternative splicing (AS) events and significantly differentially expressed genes. (A) Validation of AS events by RT-PCR and agarose gel electrophoresis. (B) The relative expression levels of PB.3297, PB.3654, PB.2760, and PB.480 in NH and DH.
Table 1. Summary of PacBio sequencing data in SPA, NH, and DH.
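| Sample | Types | Numbers of Sequences | Length of Isoforms: Min | Length of Isoforms: Mean | Length of Isoforms: Max | N50 ⁹ |
|---|---|---|---|---|---|---|
| SPA ¹ | Subreads | 12,705,473 | 51 | 1636.8 | 211,052 | 1962 |
| | CCS ⁴ | 265,745 | 104 | 1904 | 14,128 | 2177 |
| | FL ⁵ | 226,748 | 99 | 1814.2 | 10,492 | 2109 |
| | FLNC ⁶ | 221,222 | 68 | 1754.5 | 9608 | 2051 |
| | HQ ⁷ | 11,643 | 72 | 1780.5 | 6442 | 2053 |
| | LQ ⁸ | 143211883.8 | | | 3922 | 1839 |
| NH ² | Subreads | 15,425,110 | 50 | 1491.4 | 115,220 | 1656 |
| | CCS | 420,743 | 67 | 1755.3 | 11,556 | 1939 |
| | FL | 345,977 | 102 | 1613.7 | 9605 | 1735 |
| | FLNC | 308,364 | 51 | 1495.2 | 9573 | 1534 |
| | HQ | 16,587 | 70 | 1630.5 | 6778 | 2012 |
| | LQ | 12452857.1 | | | 1617 | 951 |
| DH ³ | Subreads | 15,459,279 | 50 | 1437.4 | 118,301 | 1590 |
| | CCS | 367,638 | 70 | 1782.2 | 14,497 | 1994 |
| | FL | 253,545 | 104 | 1683.4 | 11,344 | 1926 |
| | FLNC | 221,371 | 50 | 1568.1 | 9804 | 1769 |
| | HQ | 10,336 | 69 | 1661.6 | 6526 | 2132 |
| | LQ | 9115604.1 | | | 1880 | 654 |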
¹ SPA = Scylla paramamosain with normal eyestalk; ² NH = hybrid crabs with normal eyestalk; ³ DH = hybrid crabs with displaced eyestalk; ⁴ CCS = circular consensus sequence; ⁵ FL = full-length; ⁶ FLNC = full-length non-chimeric; ⁷ HQ = high-quality isoforms; ⁸ LQ = low-quality isoforms; ⁹ N50 = the length such that sequences of this length or longer contain at least 50% of the total bases.
Table 2. Summary of features of transcript isoforms after collapsing redundant isoforms with cDNA cupcake, cogent, and CD-HIT.
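| Samples | Isoforms after Collapsing: Reference Genome | Isoforms after Collapsing: Fake Genome | Isoforms after Collapsing: Unmapped | Isoforms after Collapsing: Merge | Length: Min | Length: Max | Length: Mean | N50 ⁴ |
|---|---|---|---|---|---|---|---|---|
| SPA ¹ | 9427 | 466 | 2 | 9872 | 92 | 6442 | 1798.9 | 2079 |
| NH ² | 11,639 | 752 | 3 | 12,382 | 118 | 6778 | 1663.6 | 2041 |
| DH ³ | 7858 | 661 | 3 | 8508 | 80 | 6526 | 1685.5 | 2161 |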
¹ SPA = Scylla paramamosain with normal eyestalk; ² NH = hybrid crabs with normal eyestalk; ³ DH = hybrid crabs with displaced eyestalk; ⁴ N50 = the length such that sequences of this length or longer contain at least 50% of the total bases.
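The N50 values in Tables 1 and 2 follow the usual sequence-length statistic: sort the lengths from longest to shortest and report the length at which the running total first reaches half of all bases. The sketch below is a minimal illustration under that assumption. The function name `n50` and the toy lengths are hypothetical and are not taken from the authors' pipeline or data.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    contain at least half of the total number of bases.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)  # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy lengths (not data from this study).
toy_lengths = [2, 2, 2, 3, 3, 4, 8, 8]
print(n50(toy_lengths))
```

Because the statistic is weighted by length, a few long isoforms can raise the N50 well above the mean length.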
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Ye, S.; Yu, X.; Chen, H.; Zhang, Y.; Wu, Q.; Tan, H.; Song, J.; Saqib, H.S.A.; Farhadi, A.; Ikhwanuddin, M.; et al. Full-Length Transcriptome Reconstruction Reveals the Genetic Mechanisms of Eyestalk Displacement and Its Potential Implications on the Interspecific Hybrid Crab (Scylla serrata ♀ × S. paramamosain ♂). Biology 2022, 11, 1026. https://doi.org/10.3390/biology11071026


Note that from the first issue of 2016, this journal uses article numbers instead of page numbers.
