Next Article in Journal
Continuum Models of Membrane Fusion: Evolution of the Theory
Next Article in Special Issue
Characterization of miRNAs in Cultured Atlantic Salmon Head Kidney Monocyte-Like and Macrophage-Like Cells
Previous Article in Journal
Time-course Transcriptome of Parageobacillus thermoglucosidasius DSM 6285 Grown in the Presence of Carbon Monoxide and Air
Previous Article in Special Issue
Identification and Regulation of Interleukin-17 (IL-17) Family Ligands in the Teleost Fish European Sea Bass
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Transcriptome Analysis of Maternal Gene Transcripts in Unfertilized Eggs of Misgurnus anguillicaudatus and Identification of Immune-Related Maternal Genes

1
Department of Marine Bio-Materials and Aquaculture, College of Fisheries Sciences, Pukyong National University, 45 Yongso-ro, Nam-gu, Busan 48513, Korea
2
Bioinformatics Team, DNA Link Inc., 150 Bugahyeon-ro, Seodaemun-gu, Seoul 03759, Korea
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2020, 21(11), 3872; https://doi.org/10.3390/ijms21113872
Submission received: 30 April 2020 / Revised: 28 May 2020 / Accepted: 28 May 2020 / Published: 29 May 2020
(This article belongs to the Special Issue Fish Immunology)

Abstract

:
Maternal genes are important in directing early development and determining egg quality in fish. We here report the de novo transcriptome from four tissue libraries of the cyprinid loach, Misgurnus anguillicaudatus, and for the first time identified maternal gene transcripts in unfertilized eggs and suggest their immune system involvement. Expression profiles and functional enrichment revealed a total 24,116 transcripts were expressed as maternal transcripts in unfertilized eggs, which were involved in a wide range of biological functions and pathways. Comparison expression profiles and analysis of tissue specificity revealed that the large numbers of maternal transcripts were stored in unfertilized eggs near the late phase of ovarian maturation and before ovulation. Functional classification showed a total of 279 maternal immune-related transcripts classified with immune system process GO term and immune system KEGG pathway. qPCR analysis showed that transcript levels of identified maternal immune-related candidate genes were dynamically modulated during development and early ontogeny of M. anguillicaudatus. Taken together, this study could not only provide knowledge on the protective roles of maternal immune-related genes during early life stage of M. anguillicaudatus but could also be a valuable transcriptomic/genomic resource for further analysis of maternally provisioned genes in M. anguillicaudatus and other related teleost fishes.

1. Introduction

Before embryos begin to utilize newly synthesized products derived from the zygotic genome, maternally inherited components such as RNA transcripts and proteins direct virtually all aspects of early development in animal embryos [1,2,3,4]. Maternal components that are stored into an egg during oogenesis are particularly important in most oviparous teleost fish, as they undergo external fertilization and development, without close interconnection with their parents [5,6,7]. Besides, it was proposed that maternal components are a critical determinant of fish egg quality, which serves as a key challenging issue in the reproductive control of aquacultured fish [8,9,10,11,12].
Fish embryos and early larvae are often exposed to various potential pathogens in the environment. Accordingly, a need to protect themselves from pathogenic invaders is crucial during this early life interval. The immune system in vertebrates including teleost fishes utilizes complex and highly organized gene sets of molecular signaling pathways that protect against environmental invasions and regulate dynamic changes in an organism [13,14]. This complex and organized immune system would make to intertwine and share certain common features in developmental mechanisms and operational modes. It is not surprising that many immune-related genes co-opt onto the developmental mechanisms of animals. For an instance, the membrane receptor Toll and the related Toll-like receptors (TLR) are well known for their function in innate immunity, however, these molecules were originally found in studies investigating embryonic development [15,16]. Maternal immunity is believed to be the principal means of fish embryo and larval protection until they have developed a complete immune system [5]. It is of interest, therefore, to identify maternal immune-related genes/transcripts to extend insights into dynamic changes during the early embryonic development process. To date, most studies on the maternal immunity of fish have mainly focused on immune effector molecules such as peptides/proteins: antimicrobial peptides, immunoglobulins/antibodies, complement components, lectins, and lysozyme [5,17,18]. However, in contrast to the rather rich studies on immune factors, a study that builds a comprehensive understanding of maternal immune gene transcripts in fish species remains limited. Recently, next-generation sequencing (NGS) technologies have revolutionized our ability to analyze the dynamic gene transcripts in a specific cell, tissue, or organism at a given developmental stage [11,19,20,21]. As the analysis of massive quantities of genes has allowed gene discovery and expression profiling, recent transcriptome data for several fishes have provided notable insights into the distinctive features in oocyte development and early embryonic development [19,22,23,24,25].
The cyprinid loach, Misgurnus anguillicaudatus (Actinopterygii; Cypriniformes; and Cobitidae), also known as oriental weather loach, is widely distributed in freshwater areas. M. anguillicaudatus prefers a habitat with muddy bottoms as a spawning ground, which is known to contain abundant and diverse microbial populations. The loach has been considered a valuable source of food owing to its high nutritional values in East Asia including Korea and, has become an important freshwater aquaculture species (https://www.fishbase.de/summary/Misgurnus-anguillicaudatus.html). Moreover, this species has many advantageous merits as an experimental model organism, particularly with respect to developmental and reproductive researches. They include short generation time, year-round multiple spawning under controlled laboratory conditions, high fecundity, and fast and transparent embryonic development [26,27,28,29]. These attractive characteristics make this fish a suitable model organism to study the contribution of maternally provisioned immune-related gene transcripts to development and host defense function in the early life stages of teleosts.
Here, we report the M. anguillicaudatus transcriptome newly constructed from RNA-Seq libraries derived from unfertilized eggs, ovary, testis, and muscle tissues. We employed a paired-end Illumina HiSeq 2500 sequencing technology and the subsequent de novo assembly, to generate a comprehensive set of reference transcripts. The transcriptome assembly was used to analyze expression profiles and identify maternal gene transcripts related to immune functions. Further, we analyzed transcriptional levels of identified maternal immune-related candidate genes during the development of M. anguillicaudatus. This study is the first transcriptome analysis of maternal gene transcripts associated with immune function in the loach, M. anguillicaudatus.

2. Results and Discussion

2.1. Sequencing, De Novo Assembly, and Completeness

The RNA-Seq libraries from muscle, testis, ovary, and unfertilized eggs were constructed to carry out paired-end sequencing using the Illumina HiSeq 2500 platform. The statistics for the sequencing and de novo assembly are shown in Table 1. Illumina sequencing produced a total of 243,267,764 raw reads from the four different cDNA libraries. The maximum number of raw reads generated from the ovary library and the minimum number of raw reads generated from the muscle library were 63,490,556 and 56,211,235, respectively. After trimming adaptor sequences and ambiguous and low-quality reads via quality filtering, a total of 237,869,744 clean reads (approximately 118 million paired-end clean reads) were obtained using the Trimmomatic software [30]. Trinity assembler was then used to pool and assemble the clean reads into 281,866 contigs, with a total length of 218,148,233 bp [31]. Redundant sequences were removed from the assembled contigs using the CD-HITest platform [32]. This yielded a total of 267,111 non-redundant transcripts with a total length of 197,693,642 bp, a mean length of 740.12 bp, weighted median length (N50) of 1533 bp, and a GC (guanine+cytosine) ratio of 42.90% in a size range of 201–23,427 bp. Although the N50 statistic is not entirely appropriate for de novo transcriptome assemblies, it is still a traditionally valuable measurement for the assessment of continuity in transcriptome assembly [33,34].
Thus, the N50 value (1533 bp) of our transcriptome assembly was comparable to those obtained for published de novo transcriptomes of M. anguillicaudatus [21,35,36,37]. The detailed length distribution of de novo assembled transcriptome is shown in Figure S1. Of these non-redundant transcripts, the majority of sequences (157,612, 60.0%) ranged from 200 to 400 bp, 55,464 (20.8%) from 400 to 800 bp, and 49,802 (18.6 %) were longer than 1 kb. Completeness of the assembly was evaluated using BUSCO against the eukaryote (eukaryote_odb9) lineages [38,39], which revealed that 100% of conserved genes across all eukaryotes were present; 70.6% complete and single-copy BUSCOs (214 out of 303) and 29.4% complete and duplicated BUSCOs (89 out of 303). Taken together with the sufficient number of sequencing clean reads mentioned above, the transcriptome assembly from the four cDNA libraries of the loach M. anguillicaudatus achieved a reasonable degree of completion and an acceptable coverage and quality of protein-coding transcripts for subsequent analysis.

2.2. Functional Annotation of the Reference Transcriptome

To find homologous proteins and the collection of biological information from various sources, all of 267,111 transcripts were used to search homology against several public databases based on the BLAST algorithm. Overall, 161,091 (60.3%) transcripts were successfully annotated to a least one database and 1394 (0.5%) transcripts shared annotation in all databases used (Figure 1a). Of the annotated transcripts, 159,361 (98.9%), 76,332 (47.4%), 62,946 (39.1%), 46,897 (29.1%), 18,465 (11.5%), and 15,288 (9.5%) of transcripts had BLAST hit to NR, Swiss-Prot/UniproKB, EggNOG, HMMER/Pfam, GO, and KEGG databases, respectively (Figure 1a and Table S1). Moreover, a total of 70,203 ORFs were predicted from 64,767 (24.2%) transcripts using TransDecoder, of which 4,481 (6.4%) and 12,597 (17.9%) ORFs contained a signal peptide and transmembrane region by SignalP and tmHMM, respectively. Meanwhile, remaining sequences (106,020, 39.7%) resulted in non-significant hits. This can be explained by a lack of sequences conservation across species associated with incomplete gene information on non-model species in public databases, or because it contains non-coding RNA in sequences of the transcripts. Alternatively, short length transcripts obtained through de novo assembly would reduce the BLAST annotation efficiency. The e-value distribution of the transcripts in the NR BLAST results revealed that 48,916 (30.69%) annotated transcripts showed significant homology (less than 1e-60) and 70,848 (44.46%) transcripts were annotated with e-value of greater than 1e-15 (Figure 1b). The ratio of the similarity distributions revealed 67.43% of the NR annotated transcripts had a similarity over 61% (Figure 1c). The species distribution analysis showed that 159,361 transcripts were distributed in 7,871 species. The top-hit species belonged to the genus Sinocyclocheilus, which include S. rhinocerous (19,676, 12.3%), S. anshuiensis (17,053, 10.7%), and S. grahami (14,974, 9.4%). Then, these species were followed by Danio rerio (13,709, 8.6%), and Cyprinus carpio (13,206, 8.3%; Figure 1d). These species belong to the Cyprinidae family along with M. anguillicaudatus, and their annotated genomes are available in a database (https://www.ncbi.nlm.nih.gov/genome/annotation_euk/). In contrast, a previous study reported that transcripts of M. anguillicaudatus skin transcriptome showed relatively high homology to D. rerio [37]. This may be ascribed to several reasons including the relatively small number of transcripts obtained from one specific tissue and a restricted use of the database (i.e., NR protein and D. rerio transcriptome database) [37].
The GO annotation is a standardized system for functional classification and inference of the biological significance of genomic/transcriptomic datasets [40]. Therefore, the NR BLAST results were submitted to the Blast2GO program [41], yielding 18,465 (11.6%) transcripts that were categorized into a total 92,581 GO terms because of the large numbers of transcripts annotated with more than one GO term (Table S2). The GO annotation result was represented by three main categories at level 2 including biological process (BP, 50 subcategories), molecular function (MF, 23 subcategories), and cellular component (CC, 21 subcategories) using the WEGO tool (Figure 2 and Table S2) [42]. In the BP category, the most represented GO terms were the metabolic process (10,837, GO:0008152), cellular process (9646, GO:0009987), biological regulation (4461, GO:0065007), regulation of biological process (4461, GO:0050789), and response to stimulus (3127, GO:0050896). The immune system process (GO:0002376) in BP contained 418 transcripts. For the MF category, transcripts were mainly associated to binding (9836, GO:0005488), and catalytic activity (8819, GO:0003824). In the CC category, cell (6253, GO:0005623), cell part (6253, GO:0044464), membrane (4113, GO:0016020), organelles (3752, GO:0043226), and membrane part (3038, GO:0044425) were the most represented categories. Direct comparisons between the number of transcripts in each GO category of our transcriptome assembly and previously published transcriptomic data for M. anguillicaudatus are somewhat limited. Nevertheless, the representation of GO distribution was consistent between our transcriptome and previous transcriptome assemblies for all three main categories [35,36,37,43].
The KEGG pathway provides an important insight into different biological processes and interactions across different tissues or conditions within an animal, enabling further understanding of the biological function and interaction of the transcripts in M. anguillicaudatus [44,45]. Consequently, 15,288 transcripts (8.4%) were assigned to 4413 unique KEGG orthology (KO) identifiers and grouped into 402 KEGG signaling pathways (Table S2). The annotated pathways were represented in five main categories: metabolism (3412 KOs), genetic information processing (1175 KOs), environmental information processing (3283 KOs), cellular processes (1962 KOs), and organismal systems (4888 KOs; Figure 3 and Table S2). In these main categories, the signal transduction pathways had the largest number of KOs (2666), followed by global and overview maps (1561), endocrine system (1370), and immune system (1275). These functional classifications could provide comprehensive information for the understanding of the biological significance and function of transcripts in the M. anguillicaudatus transcriptome.

2.3. Expression Analysis and Characterization of Maternal Transcripts

To gain insight into the composition and complexity of transcripts that were particularly expressed in the transcriptome of M. anguillicaudatus unfertilized eggs, we evaluated the expression levels and classified the transcripts from each RNA-seq library. Overall, more than 89% of transcript sequences for each library were mapped to the reference transcriptome assembly, of which 95% of transcript sequences in unfertilized eggs were mapped (see Table 1 mapping ratio). Expression levels of transcripts in each sample were calculated and shown in Table S3. With a cut-off value of 1 trimmed mean normalization of M-values (TMM), a total of 98,222 (36.8%) transcripts were detected in at least one tissue, with 45,603 (46.4%) in muscle, 73,683 (75.0%) in testis, 22,233 (22.6%) in ovary, and 24,116 (24.6%) in unfertilized eggs (Figure 4a). All detected transcripts in each library were considered as an expressed transcript and, in particular, we defined the transcripts expressed in the unfertilized eggs library as “maternal transcripts”. Functional classification and enrichment analysis of the maternal transcripts revealed a wide distribution of GO terms and KEGG pathways and 27 GO terms and 39 KEGG pathways being enriched (p < 0.05; Figure 2 and Figure 3 and Table S2). In detail, most enriched GO terms included the cellular process (1441 transcripts) in the BP category, catalytic activity (989 transcripts) in the MF category, and cell and cell part (1261 transcripts) in the CC category (Figure 2). While, signal transduction (377 KOs), global and overview maps (236 KOs), endocrine system (187 KOs), and immune system (171 KOs) were the most enriched KEGG pathways (Figure 3). These results indicate that various transcripts involved in a wide range of biological functions and pathways were stored during oogenesis, and these transcripts will be contributed in early embryonic developmental competence until the onset of zygotic transcription [46]. The expression profile with expression abundance distribution and pattern revealed that the transcripts expressed in unfertilized eggs were highly correlated with those in the ovary (yolk-laden mature ovary) (Figure 4b,c), reflecting the maternal RNA transcripts are either produced by the oocyte or stored by surrounding cells into the oocyte during oocyte development [2,47,48]. Nevertheless, there were 3248 (3.3%) transcripts that were only detected in the unfertilized eggs library (Figure 4a). Recent transcriptome studies of zebrafish follicles using serial analysis of the gene expression (SAGE) method provided an extensive data set on maternal mRNA stored in follicles near the end of oogenesis [49,50]. Consistently, in the context of the understanding of ovary development and oogenesis, the transcripts detected only in unfertilized eggs suggests that a number of transcripts were stored in unfertilized eggs near the end of ovarian development and before ovulation of M. anguillicaudatus. Meanwhile, there were 38,163 (38.9%) transcripts were coexpressed in more than two tissues, of which 20,868 (86.5%) transcripts were coexpressed between unfertilized eggs and other tissues (Figure 4a).
Pleiotropy is a common phenomenon caused by a single molecular function involved in multiple biological processes, which could affect two or more distinct and seemingly unrelated traits [51,52,53]. A previous study showed that a number of maternally loaded RNA transcripts in unfertilized eggs were linked to male function in Tribolium castaneum, suggesting a pleiotropic function of maternal transcripts [53]. Moreover, high-through RNA interference data from Caenorhabditis elegans has shown that pleiotropic genes were involved in an important role in early embryological life and functioned as a potential bridge between different protein complexes and pathways [54]. In this respect, we believed that function of maternal transcripts can be deduced from their tissue-specific coexpressed transcripts of known function [55,56], and therefore, we further classified and characterized transcripts that were coexpressed in the unfertilized eggs with other tissues using the TissueEnrich tool [57]. Among maternal transcripts coexpressed in other tissues, 2806, 4384, 5349, and 6281 were grouped into specific transcripts (tissue enriched and group enriched transcripts) with muscle, testis, ovary, and unfertilized eggs, respectively, while 6050 and 5358 transcripts were classified into expressed in all and mixed group (Figure 4d and Table S4). The GO annotation by WEGO revealed that a number of the maternal transcripts linked with tissue specificity was significantly different in several GO terms (p < 0.05; Figure 4e). As such, it had been found that not only large numbers of the maternal transcript were linked to functions with ovarian specificity, but large numbers of the maternal transcripts were also linked to functions with testicular and muscle specificity, implying that these transcripts would have a pleiotropic function linked with tissue-specificity and affect multiple phenotypic traits. Surprisingly, we found that, in each tissue-specific coexpressed transcript, highly expressed transcripts (TMM value > 50) with high percentages (99.1% muscle-, 86.2% testis-, and 94.8% ovary-specific) appeared in maternal transcripts (Figure 4d). To date, many studies have made an effort to identify genes/transcripts associated with egg quality as the ability of the egg to be fertilized and subsequently develop into a normal embryo [8]. Nevertheless, the information of genes and biological mechanisms as a determinant of egg quality is still poorly understood and there is no study of maternal transcripts associated with the proportion and abundance of tissue-specific transcripts. Although there are many biological questions and debates about the proportion of maternal transcripts linking with tissue-specificity, it could be valuable to examine potential interrelationship between contents of tissue-specific material transcripts and indices of egg quality.

2.4. Identification of Maternal Gene Transcripts Related to Immune Functions

To further investigate maternal transcripts associated with immune function during early embryo development in M. anguillicaudatus, which may also have a pleiotropic function as mentioned above or share the function to the developmental mechanism in early embryonic development, we identified candidate transcripts related to immune functions. We focused on the GO term of immune system process (GO:0002367) and the KEGG pathway of immune system (5.1), which was closely associated with immune functions, processes, and interactions and often used to identify genes/transcripts associated with the immune functions. A total of 61 maternal transcripts were classified into 16 different GO terms (level 3) in the GO immune system process and 218 maternal transcripts with 171 KOs were assigned into 19 signaling pathways in the KEGG immune system (Figure 5). The top three GO terms that most frequently represent maternal gene transcripts in the immune system process are the immune system development (GO:0002520, 38 transcripts from 33 genes), the immune response (GO:0006955, 24 transcripts from 15 genes), and the regulation of the immune system process (GO:0002682, 20 transcripts from 14 genes; Figure 5a). Moreover, in the immune system KEGG pathway, the three pathways with the largest number of maternal gene transcripts were the chemokine signaling pathway (21 maternal transcripts with 16 KOs), the Fc gamma R-mediated phagocytosis (23 transcripts with 12 KOs), and the leukocyte transendothelial migration (15 transcripts with 12 KOs; Figure 5b). Among them, ten maternal gene transcripts were involved in multiple immunological systems with high expression (TMM value > 50), which seem to correlate with diverse cellular functions during early embryonic development of M. anguillicaudatus (Table 2). Detailed information on the maternal transcripts involved in immunological functions is shown in Table S5. Overall, our results indicate that a variety of maternal immune-related gene transcripts were stored during oocyte development, which can facilitate effective defense mechanisms against invading pathogens during embryonic development before the maturation of immunocompetence in M. anguillicaudatus embryos.

2.5. Transcriptional Levels of Maternal Immune-Related Candidate Genes during Development and Early Ontogeny of M. anguillicaudatus

Seven maternal genes involved in multiple immunological systems with high expression (TMM value > 50) were further studied with qPCR analysis. They included genes, RAC1, MAPK3, MAPK14, IL1B, CTNB1, FADD, and ITB1. Preferentially, we validated the expression profile of the selected candidate genes from RNA-Seq analysis using qPCR. The qPCR results revealed that expression of the candidate genes was consistent with expression profiles from RNA-Seq analysis (Figure S2), indicating that our RNA-Seq data was accurate and effective and that expression profiles of maternal gene transcripts can be used for further studies, such as ontogenetic expression where the adaptive immune system is still far from being completely developed.
A study on zebrafish miR430 has suggested that a correctly timed degradation of maternal transcripts is critical for embryogenesis to transfer developmental control from the maternal transcripts to the zygotic genome [58]. In this context, evaluation of transcriptional levels of maternal candidate genes can provide an understanding of the immune system and embryogenesis involvement of maternal immune-related genes during embryogenesis and larval ontogeny. Consequently, we evaluated the transcription levels of identified maternal immune-related candidate genes during the development of M. anguillicaudatus (Figure 6a). Expression of all maternal immune-related candidate genes, except for FADD, exhibited a drastic increase after first hatching out, which has an agreement that development of the immune function starts after hatching in M. anguillicaudatus (Figure 6) [59]. The expression patterns of RAC1, CTNB1, and ITB1 were similar, which the transcripts retained until the blastula stage, then began to increase at the beginning of gastrulation, and peaked at the end of gastrulation (Figure 6b,f,h). Consistent with this, expression of RAC1 mRNA in zebrafish was found in 30% epiboly that occurred during the blastula period after the yolk syncytial layer forms (YSL) [60] and the zygotic expression of RAC1 mRNA clearly showed the involvement in the host immune response by microbial infection in fish [60,61,62]. CTNB1 is well-known as a maternal gene, which mediates the canonical Wnt signaling pathway and plays a crucial role in the generation of the body plan in zebrafish embryo [63,64]. Thus, the result indicated that RAC1, CTNB1, and ITB1 seem to be “typical” for maternal transcripts that could be degraded during midblastula transition (MBT). These transcripts are responsible for defense against pathogen invasion and/or embryonic development of M. anguillicaudatus. MAPKs generally are known as a protein kinase group and are involved in directing a diverse array of cellular functions including proliferation, gene expression, differentiation, mitosis, and apoptosis. A knock-down experiment in zebrafish showed MAPK3 and MAPK14 mRNA expression was required at an early embryonic stage (from the 2,4 cell stage to 50% epiboly) in the gilthead sea bream (Sparus aurata) [65,66]. Zygotic expression of MAPK3 and MAPK14 were upregulated by the immune challenge in several fish species [65,67]. Expression of maternal MAPK14 in M. anguillicaudatus declined around the gastrulation stage. Meanwhile, the expression pattern of MAPK3 was steady, so we were unable to determine maternal to zygotic transition, suggesting the expression of MAPK3, whether maternal or zygotic expression, is required above a certain transcriptional level for immune function and embryonic development (Figure 6c,d). Studies on IL1B functions involved in the control of inflammatory responses have been extensively conducted in several fish species [68,69,70,71] and maternal expression of IL1B was evident in rohu (Labeo rohita) and brown trout (Salmo trutta) [72,73]. Expression of maternal IL1B in M. anguillicaudatus gradually declined until the end of gastrulation. Furthermore, there was no change during the myotome stages (Figure 6e). This indicates the expression of maternal IL1B is required for early embryonic development rather than the myotomes stage. FADD is an adaptor protein to form the death-inducing signaling complex (DISC) during apoptosis [74]. The function of the FADD gene as a maternal gene is unknown, but the FADD knockout mouse model demonstrated that the absence of FADD is lethal [75]. We did not observe changes in maternal FADD during our study (Figure 6g). Many maternal immune-related genes seem to correlate with diverse cellular functions during early embryonic development. However, how these genes function in early immune development is unclear. Further studies on the role of these maternal gene transcripts in early embryonic and immune system development in M. anguillicaudatus and other fish need to be conducted.

2.6. Data Availability

The cleaned short read sequences from the RNA-Seq data were deposited in the NCBI Sequence Read Archive (SRA, http://www.ncbi.nlm.nih.gov/sra) under the accession numbers SRX4452604, SRX4452605, SRX4452606, and SRX4452607 for ovary, testis, muscle, and unfertilized eggs, respectively. The Transcriptome Shotgun Assembly (TSA) project has been deposited in the DDBJ/EMBL/GenBank under the accession GGUH00000000. The version described in this paper is the first version, GGUH01000000.

3. Materials and Methods

3.1. Fish Rearing and Sample Collection

The cyprinid loach, Misgurnus anguillicaudatus, used in this study was a laboratory stock maintained at the Institute of Marine Living Modified Organism, Pukyong National University, Busan, Korea. Fishes were reared in water recirculating aquarium tanks filled with sand-filtered groundwater at 25 ± 1 °C and continuous aeration (dissolved oxygen level = 5 ± 1 mg/L). Fishes were fed with a commercial diet (50% crude protein) on an ad libitum basis. For tissue sampling, 2-year-old mature adult fishes were euthanized with tricaine methanesulfonate (MS-222; 500 mg/L). Gonads (yolk-laden mature ovary or milky white-colored testis) were surgically dissected from 3 females (average body weight (BW) = 24.3 ± 3.5 g) and 3 males (average BW = 12.3 ± 1.4 g). In addition, muscles (dorsal skeletal muscles) were obtained from the same individuals above in order to increase transcriptome coverage of the gene expressed in fertilized eggs during de novo transcriptome assembly. Collected samples were immediately frozen on dry ice and stored at −80 °C until RNA extraction. Unfertilized eggs were obtained by hormone-induced spawning, as described previously [27]. Mature females (n = 3; same-aged and similar-weighed as above) were given an intraperitoneal injection of carp pituitary extract (Argent Aquaculture Inc., Redmond, WA, USA) at a dose level of 10 μg/g body weight. Thirteen hours after injection at 25 °C, eggs were hand stripped and directly subjected to RNA extraction. This study was approved by the Animal Ethics Committee of Pukyong National University (Approval #2018-16; approval date: 30 Aug. 2018) and performed according to the guidelines for the care and use of laboratory animals.

3.2. RNA Isolation, Library Construction, and Sequencing

Total RNAs were extracted from individual tissue and egg batch samples using the Trizol reagent (Invitrogen, CA, USA) and purified using the RNeasy Plus Mini Kit (Qiagen, Hilden, Germany) according to the manufacturer’s instruction, including a genomic DNA elimination step. RNA purity, quality, and quantity were determined by NanoDrop-2000 spectrophotometer (Thermo Scientific, Waltham, MA, USA), ethidium bromide staining of 28S and 18S ribosomal bands on a 1% MOPS-formaldehyde agarose gel, and Bioanalyzer 2100 system (Agilent Technologies Inc., Stanta Clara, CA, USA). Only high-quality RNAs (A260/A280 ratio = 1.8–2.0 and A260/A230 ratio ≥ 2.0, and RIN > 8) were pooled within a given sample type (1 μg of total RNA each from individual sample) to construct cDNA libraries. Briefly, 1 μg of total RNA was purified, fragmented, and primed to synthesize the first- and second-strand cDNA according to the TruSeq Stranded Total RNA Sample Pre protocol (Illumina, San Diego, CA, USA). Synthesized second-strand cDNA was purified, end-repaired, and ligated to index adapters. The ligated products were separated to select suitable sequencing templates that were amplified by PCR to generate cDNA libraries. Finally, the libraries were sequenced using the Illumina HiSeq 2500 (Illumina, Inc., San Diego, CA, USA) platform as paired-end reads to 100 bp in length.

3.3. De Novo Assembly and Completeness of Transcriptome

After sequencing was completed, the Trimmomatic v0.32 software was used to process Illumina paired-end raw reads to obtain clean reads through trimming the adaptor sequences, plus trimming of sequences with an ambiguous base (N) ratio >5% and low-quality [30]. Subsequently, the integrity of the remaining raw reads was verified with FastQC v0.10.1 by running Perl program [76] with the clean reads stored in the FASTQ format [77]. In the absence of a reference genome for the loach M. anguillicaudatus in this study, the clean reads were pooled together to maximize the chance of obtaining as many transcripts as was possible, and de novo assembly used to create a reference transcriptome using Trinity Assembler v2.2.0 with default parameters [31]. The assembly was clustered by removing the redundant sequences using a cluster software, CD-HITest v4.6.1, with a defined parameter (-c 0.99 -T 8 -G 0 -aL 0.90 -AL 100 -aS 0.99 -AS 30) [32]. Completeness of the de novo transcriptome assembly, as a reference for downstream analysis, was assessed using Benchmarking Universal Single-Copy Orthologs v3.0.2 software (BUSCO, http://busco.ezlab.org/) against the eukaryote (eukaryote_odb9) lineages with default parameters in the CyVerse Discovery Environment (https://de.cyverse.org/) [38,39,78].

3.4. Functional Annotation

Initially, transcripts were annotated by homology search using BLASTx algorithm against NCBI NR protein database. In addition, Trinotate v3.0.1 (https://trinotate.github.io/) pipeline was used to search for any additional homologies of the transcripts: the open reading frames (ORF) for each transcript were predicted by TransDecoder v3.0.1 (https://github.com/TransDecoder) [79]. Nucleotide sequences and the corresponding ORF proteins were queried against the Swiss-Prot/UniprotKB database. Protein domains were identified using HMMER v.3.1 against the HMMER/Pfam protein database [80]. Orthologous groups were searched against the EggNOG database [81]. Furthermore, prediction for the presence of the signal peptide and transmembrane regions were performed using SignalP v4.1 [82] and tmHMM v2.0c [83]. The NR BLAST results were imported to the Blast2GO v3.0.11 program for mapping and retrieving gene ontology (GO) [41]. After retrieving the GO annotation result, the WEGO 2.0 tool was used for classification and graphical representation of GO terms at the macro level [42]. GO terms with a corrected p-value < 0.05 were defined as significantly enriched GO terms. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway assignments were performed on the KEGG Automatic Annotation Server (KAAS) [44]. KEGG annotations were filtered to keep KEGG orthology (KO) and those were categorized using the KEGG Mapper (https://www.kegg.jp/kegg/tool/map_pathway.html). Enrichment analysis was conducted by comparison of a transcript set against the reference transcriptome background using the Pearson Chi-Square test in R package. GO terms and KEGG pathways with a p-value less than 0.05 were considered to be enriched significantly.

3.5. Expression Analysis and Specificity Classification

Expression levels of transcripts from four tissue libraries (muscle, ovary, testis, and unfertilized eggs) were calculated by mapping to the reference transcriptome that was created by the de novo assembly mentioned above. To obtain the expression levels of each transcript in each tissue, the transcript abundances were quantified for each library using Kallisto v0.43.0 [84]. Low abundance transcripts with less than two count across four tissue libraries were removed. To normalize the expression of transcripts across tissues, the trimmed mean normalization of M-values (TMM) was calculated using the Bioconductor package edgeR v3.8.5 [85] and used to classify tissue specificity of transcripts. A cut-off value of 1 TMM was used as the detection limit [86]. To determine the distribution of tissue specificity, transcripts were further classified into six categories using the TissueEnrich tool [57]: (1) “Not expressed”—< 1 TMM value across all the tissues; (2) “Tissue enriched”—five-fold higher TMM value in one tissue compared to all other tissues; (3) “Group enriched”—five-fold higher TMM value in a group of 2-3 tissues compared to all other tissues; (4) five-fold higher TMM value in one tissue compared to the average TMM value in all other tissues; (5) “Expressed all”—1 ≥ TMM value across all of the tissues; and (6) “Mixed”—transcripts that were not assigned to any of the five categories. Transcripts from “Tissue enriched”, “Group enriched”, and “Tissue enhanced” groups were classified as tissue-specific transcripts [57]. For the classification of transcripts where a log2-scale of the data was used, pseudo-counts of +1 were added to the data set.

3.6. qPCR Analysis and Measurement of Transcriptional Levels of Identified Candidate Genes during Ontogeny of M. anguillicaudatus

Seven genes from the RNA-Seq analysis and 18s ribosomal RNA (18S rRNA, accession no. EU120032) as an endogenous control [87] were analyzed by qPCR to validate RNA-Seq data and to evaluate ontogenetic expression profile. For RNA-Seq data validation, the total RNAs from each tissue were prepared from three replicate pools of tissues (three individuals per pool within a given tissue type) independently of the RNA-Seq analysis. To evaluate the transcriptional levels of identified candidate genes during ontogeny, developmental samples at 25 °C were collected from 12 developmental stages (i.e., unfertilized eggs, immediately after fertilization, 8-cells, 64-cells, morula, blastula, beginning of gastrulation, end of gastrulation, 3-4 myotomes, 30 myotomes, first hatch out, and yolk sac absorption) according to the study on ontogeny of M. anguillicaudatus that has been previously described [28]. Three independent spawning trials were performed to prepare biological replications and 60 embryos were sampled at each developmental stage from each spawning batch. Total RNAs were extracted and purified according to the RNA isolation described in Section 3.2. Subsequently, cDNA was synthesized from 2 μg total RNA using the Omniscript reverse transcription kit (Qiagen) with a 1–9 mixing ratio of oligo dT primer and random nonamer primers (Bioneer, Daejeon, Korea), according to the manufacturer’s instruction. RT-qPCR was performed in triplicate using the LightCycler 480 Real-Time PCR system and LightCycler 480 SYBR Green I Master (Roche Applied Science, Penzberg, Germany). In brief, the amplification was carried out in a 20 μL reaction mixture containing 1 × SYBR green I master, 5 μM forward and reverse primers, and 2 μL of 20 times diluted cDNA template and nuclease-free water. The thermal profile conditions were: 90 °C for 5 min, 40 cycles consisting of 95 °C for 10 s, 60 °C for 20 s, and 72 °C for 20 s, with fluorescence recording at the end of each cycle. Melt curve analysis was performed to ensure product specificity over the temperature range of 65–90 °C. The sequences of oligonucleotide primers used in qPCR are listed in Table S7. Amplicons were analyzed on agarose gels to confirm the product size. The levels of gene expression were calculated from the threshold cycle according to the 2-ΔΔCT method [88]. The results were analyzed one-way analysis of variance (ANOVA) with a Bonferroni’s multiple comparison test using GraphPad Prism (v7.0 for Windows; GraphPad Software). A p < 0.05 was considered statistically significant.

4. Conclusions

Our study presented the transcriptome expression profiles of the loach, M. anguillicaudatus, and for the first time, maternal gene transcripts and their immune system involvement were identified from unfertilized eggs of M. anguillicaudatus. These findings are needed to understand maternal gene transcripts in early embryonic development and the immunological understanding of maternal immune-related gene transcripts before zygotic gene expression. This information can be utilized to assess developmental competence concerning cellular mechanism and/or immune system in embryonic development, which can be associated with the egg quality of teleost fish. The data generated could be a valuable transcriptomic/genomic resource for further analysis of maternal genes involved in the cellular mechanisms and/or immune system in early embryonic development of M. anguillicaudatus and other teleost fishes.

Supplementary Materials

Supplementary materials can be found at https://www.mdpi.com/1422-0067/21/11/3872/s1.

Author Contributions

C.-H.K. and Y.K.N. designed the study, data analysis, and wrote the paper. E.J.K. performed samples preparation and collection and qPCR. C.S. performed RNA-Seq data collection and analysis. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the grant from the Korea Institute of Marine Science & Technology (KIMST) funded by the Ministry of Oceans and Fisheries (Project #20170327).

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

RINRNA integrity number
BUSCOBenchmarking Universal Single-Copy Orthologs
KEGGKyoto Encyclopedia of Genes and Genomes
GOGene ontology
TMMTrimmed mean normalization of M-values

References

  1. Bowerman, B. Maternal control of pattern formation in early Caenorhabditis elegans embryos. Curr. Top. Dev. Biol. 1998, 39, 73–117. [Google Scholar] [PubMed]
  2. Pelegri, F. Maternal factors in zebrafish development. Dev. Dynamics Off Pub. Am. Assoc. Anatomists 2003, 228, 535–554. [Google Scholar] [CrossRef] [PubMed]
  3. Minami, N.; Suzuki, T.; Tsukamoto, S. Zygotic gene activation and maternal factors in mammals. J. Reprod. Dev. 2007, 53, 707–715. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. White, J.A.; Heasman, J. Maternal control of pattern formation in Xenopus laevis. J. Exp. Zool. Part. B Mol. Dev. Evol. 2008, 310, 73–84. [Google Scholar] [CrossRef] [PubMed]
  5. Swain, P.; Nayak, S.K. Role of maternally derived immunity in fish. Fish. Shellfish Immunol. 2009, 27, 89–99. [Google Scholar] [CrossRef]
  6. Lyman-Gingerich, J.; Pelegri, F. Maternal factors in fish oogenesis and embryonic development. In The Fish Oocyte; Springer: Berlin/Heidelberg, Germany, 2007. [Google Scholar]
  7. Abrams, E.W.; Mullins, M.C. Early zebrafish development: It’s in the maternal genes. Curr. Opin. Genet. Dev. 2009, 19, 396–403. [Google Scholar] [CrossRef] [Green Version]
  8. Brooks, S.; Tyler, C.R.; Sumpter, J.P. Egg quality in fish: What makes a good egg? Rev. Fish. Biol. Fish. 1997, 7, 387–416. [Google Scholar] [CrossRef]
  9. Bobe, J.; Labbé, C. Egg and sperm quality in fish. Gen. Comp. Endocrinol. 2010, 165, 535–548. [Google Scholar] [CrossRef]
  10. Mommens, M.; Fernandes, J.M.O.; Bizuayehu, T.T.; Bolla, S.L.; Johnston, I.A.; Babiak, I. Maternal gene expression in Atlantic halibut (Hippoglossus hippoglossus L.) and its relation to egg quality. BMC Res. Notes 2010, 3, 138. [Google Scholar] [CrossRef] [Green Version]
  11. Sullivan, C.V.; Chapman, R.W.; Reading, B.J.; Anderson, P.E. Transcriptomics of mRNA and egg quality in farmed fish: Some recent developments and future directions. Gen. Comp. Endocrinol. 2015, 221, 23–30. [Google Scholar] [CrossRef] [Green Version]
  12. Wargelius, A.; Furmanek, T.; Montfort, J.; Le Cam, A.; Kleppe, L.; Juanchich, A.; Edvardsen, R.B.; Taranger, G.L.; Bobe, J. A comparison between egg trancriptomes of cod and salmon reveals species-specific traits in eggs for each species. Mol. Reprod. Dev. 2015, 82, 397–404. [Google Scholar] [CrossRef]
  13. Subramanian, N.; Torabi-Parizi, P.; Gottschalk, R.A.; Germain, R.N.; Dutta, B. Network representations of immune system complexity. Wiley Interdiscip. Rev. Syst. Biol. Med. 2015, 7, 13–38. [Google Scholar] [CrossRef] [PubMed]
  14. Rauta, P.R.; Nayak, B.; Das, S. Immune system and immune responses in fish and their role in comparative immunity study: A model for higher organisms. Immunol. Lett. 2012, 148, 23–33. [Google Scholar] [PubMed]
  15. Akira, S.; Takeda, K. Toll-like receptor signalling. Nat. Rev. Immunol. 2004, 4, 499–511. [Google Scholar] [CrossRef] [PubMed]
  16. Nüsslein-Volhard, C.; Wieschaus, E. Mutations affecting segment number and polarity in Drosophila. Nature 1980, 287, 795–801. [Google Scholar] [CrossRef] [PubMed]
  17. Mulero, I.; García-Ayala, A.; Meseguer, J.; Mulero, V. Maternal transfer of immunity and ontogeny of autologous immunocompetence of fish: A minireview. Aquaculture 2007, 268, 244–250. [Google Scholar]
  18. Zhang, S.; Wang, Z.; Wang, H. Maternal immunity in fish. Dev. Comp. Immunol. 2013, 39, 72–78. [Google Scholar]
  19. Harvey, S.A.; Sealy, I.; Kettleborough, R.; Fenyes, F.; White, R.; Stemple, D.; Smith, J.C. Identification of the zebrafish maternal and paternal transcriptomes. Development 2013, 140, 2703–2710. [Google Scholar] [CrossRef] [Green Version]
  20. Rauwerda, H.; Wackers, P.; Pagano, J.F.B.; de Jong, M.; Ensink, W.; Dekker, R.; Nehrdich, U.; Spaink, H.P.; Jonker, M.; Breit, T.M. Mother-specific signature in the maternal transcriptome composition of mature, unfertilized zebrafish eggs. PLoS ONE 2016, 11, e0147151. [Google Scholar] [CrossRef]
  21. Zhou, H.; Xu, Q.-Z.; Zhang, R.; Zhuang, Z.-X.; Ma, Y.-Q.; Wang, W.; Ma, T.-Y.; Sui, Y.; Liu, Y.; Cao, X. Gonadal transcriptome analysis of hybrid triploid loaches (Misgurnus anguillicaudatus) and their diploid and tetraploid parents. PLoS ONE 2018, 13, e0198179. [Google Scholar]
  22. Lanes, C.F.C.; Bizuayehu, T.T.; de Oliveira Fernandes, J.M.; Kiron, V.; Babiak, I. Transcriptome of Atlantic cod (Gadus morhua L.) early embryos from farmed and wild broodstocks. Mar. Biotechnol. 2013, 15, 677–694. [Google Scholar] [PubMed] [Green Version]
  23. Kleppe, L.; Edvardsen, R.B.; Furmanek, T.; Taranger, G.L.; Wargelius, A. Global transcriptome analysis identifies regulated transcripts and pathways activated during oogenesis and early embryogenesis in Atlantic cod. Mol. Reprod. Dev. 2014, 81, 619–635. [Google Scholar]
  24. Rauwerda, H.; Pagano, J.F.B.; de Leeuw, W.C.; Ensink, W.; Nehrdich, U.; de Jong, M.; Jonker, M.; Spaink, H.P.; Breit, T.M. Transcriptome dynamics in early zebrafish embryogenesis determined by high-resolution time course analysis of 180 successive, individual zebrafish embryos. BMC Genom. 2017, 18, 287. [Google Scholar]
  25. Romney, A.L.; Podrabsky, J.E. Transcriptomic analysis of maternally provisioned cues for phenotypic plasticity in the annual killifish, Austrofundulus limnaeus. EvoDevo 2017, 8, 6. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Kim, D.S.; Nam, Y.K.; Park, I.-S. Survival and karyological analysis of reciprocal diploid and triploid hybrids between mud loach (Misgurnus mizolepis) and cyprinid loach (Misgurnus anguillicaudatus). Aquaculture 1995, 135, 257–265. [Google Scholar]
  27. Nam, Y.K.; Park, I.-S.; Kim, D.S. Triploid hybridization of fast-growing transgenic mud loach Misgurnus mizolepis male to cyprinid loach Misgurnus anguillicaudatus female: The first performance study on growth and reproduction of transgenic polyploid hybrid fish. Aquaculture 2004, 231, 559–572. [Google Scholar] [CrossRef]
  28. Fujimoto, T.; Kataoka, T.; Sakao, S.; Saito, T.; Yamaha, E.; Arai, K. Developmental stages and germ cell lineage of the loach (Misgurnus anguillicaudatus). Zool. Sci. 2006, 23, 977–989. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. Arai, K.; Fujimoto, T. Genomic constitution and atypical reproduction in polyploid and unisexual lineages of the Misgurnus loach, a teleost fish. Cytogenet. Genome Res. 2013, 140, 226–240. [Google Scholar] [CrossRef] [Green Version]
  30. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar]
  31. Grabherr, M.G.; Haas, B.J.; Yassour, M.; Levin, J.Z.; Thompson, D.A.; Amit, I.; Adiconis, X.; Fan, L.; Raychowdhury, R.; Zeng, Q. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 2011, 29, 644. [Google Scholar] [CrossRef] [Green Version]
  32. Li, W.; Godzik, A. Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 2006, 22, 1658–1659. [Google Scholar] [PubMed] [Green Version]
  33. Schliesky, S.; Gowik, U.; Weber, A.P.M.; Bräutigam, A. RNA-Seq assembly—Are we there yet? Front. Plant. Sci. 2012, 3, 220. [Google Scholar] [PubMed] [Green Version]
  34. Kumar, S.; Blaxter, M.L. Comparing de novo assemblers for 454 transcriptome data. BMC Genom. 2010, 11, 571. [Google Scholar]
  35. Luo, W.; Cao, X.; Xu, X.; Huang, S.; Liu, C.; Tomljanovic, T. Developmental transcriptome analysis and identification of genes involved in formation of intestinal air-breathing function of Dojo loach, Misgurnus anguillicaudatus. Sci. Rep. 2016, 6, 31845. [Google Scholar] [PubMed]
  36. Luo, W.; Liu, C.; Cao, X.; Huang, S.; Wang, W.; Wang, Y. Transcriptome profile analysis of ovarian tissues from diploid and tetraploid loaches Misgurnus anguillicaudatus. Int. J. Mol. Sci. 2015, 16, 16017–16033. [Google Scholar] [PubMed] [Green Version]
  37. Long, Y.; Li, Q.; Zhou, B.; Song, G.; Li, T.; Cui, Z. De novo assembly of mud loach (Misgurnus anguillicaudatus) skin transcriptome to identify putative genes involved in immunity and epidermal mucus secretion. PLoS ONE 2013, 8, e56998. [Google Scholar] [CrossRef] [Green Version]
  38. Waterhouse, R.M.; Seppey, M.; Simão, F.A.; Manni, M.; Ioannidis, P.; Klioutchnikov, G.; Kriventseva, E.V.; Zdobnov, E.M. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol. Biol. Evol. 2018, 35, 543–548. [Google Scholar] [CrossRef] [Green Version]
  39. Simão, F.A.; Waterhouse, R.M.; Ioannidis, P.; Kriventseva, E.V.; Zdobnov, E.M. BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 2015, 31, 3210–3212. [Google Scholar]
  40. Ashburner, M.; Ball, C.A.; Blake, J.A.; Botstein, D.; Butler, H.; Cherry, J.M.; Davis, A.P.; Dolinski, K.; Dwight, S.S.; Eppig, J.T. Gene Ontology: Tool for the unification of biology. Nat. Genet. 2000, 25, 25. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  41. Conesa, A.; Götz, S.; García-Gómez, J.M.; Terol, J.; Talón, M.; Robles, M. Blast2GO: A universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 2005, 21, 3674–3676. [Google Scholar]
  42. Ye, J.; Fang, L.; Zheng, H.; Zhang, Y.; Chen, J.; Zhang, Z.; Wang, J.; Li, S.; Li, R.; Bolund, L. WEGO: A web tool for plotting GO annotations. Nucleic Acids Res. 2006, 34 (Suppl. 2), W293–W297. [Google Scholar] [CrossRef] [PubMed]
  43. Zhang, Y.; Li, Y.; Liang, X.; Cao, X.; Huang, L.; Yan, J.; Wei, Y.; Gao, J. Hepatic transcriptome analysis and identification of differentially expressed genes response to dietary oxidized fish oil in loach Misgurnus anguillicaudatus. PLoS ONE 2017, 12, e0172386. [Google Scholar] [CrossRef]
  44. Moriya, Y.; Itoh, M.; Okuda, S.; Yoshizawa, A.C.; Kanehisa, M. KAAS: An automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 2007, 35 (Suppl. 2), W182–W185. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Kanehisa, M.; Goto, S.; Sato, Y.; Kawashima, M.; Furumichi, M.; Tanabe, M. Data, information, knowledge and principle: Back to metabolism in KEGG. Nucleic Acids Res. 2014, 42, D199–D205. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Schier, A.F. The maternal-zygotic transition: Death and birth of RNAs. Science 2007, 316, 406–407. [Google Scholar] [CrossRef] [PubMed]
  47. Lubzens, E.; Bobe, J.; Young, G.; Sullivan, C.V. Maternal investment in fish oocytes and eggs: The molecular cargo and its contributions to fertility and early development. Aquaculture 2017, 472, 107–143. [Google Scholar] [CrossRef] [Green Version]
  48. Picchietti, S.; Taddei, A.R.; Scapigliati, G.; Buonocore, F.; Fausto, A.M.; Romano, N.; Mazzini, M.; Mastrolia, L.; Abelli, L. Immunoglobulin protein and gene transcripts in ovarian follicles throughout oogenesis in the teleost Dicentrarchus labrax. Cell Tissue Res. 2004, 315, 259–270. [Google Scholar] [CrossRef]
  49. Knoll-Gellida, A.; André, M.; Gattegno, T.; Forgue, J.; Admon, A.; Babin, P.J. Molecular phenotype of zebrafish ovarian follicle by serial analysis of gene expression and proteomic profiling, and comparison with the transcriptomes of other animals. BMC Genom. 2006, 7, 46. [Google Scholar] [CrossRef]
  50. Knoll-Gellida, A.; Babin, P.J. Zebrafish ovarian follicle transcriptome. In The Fish Oocyte; Springer: Berlin/Heidelberg, Germany, 2007. [Google Scholar]
  51. He, X.; Zhang, J. Toward a molecular understanding of pleiotropy. Genetics 2006, 173, 1885–1891. [Google Scholar] [CrossRef] [Green Version]
  52. Dudley, A.M.; Janse, D.M.; Tanay, A.; Shamir, R.; Church, G.M. A global view of pleiotropy and phenotypically derived gene function in yeast. Mol. Syst. Biol. 2005, 1. [Google Scholar] [CrossRef] [Green Version]
  53. Preuss, K.M.; Lopez, J.A.; Colbourne, J.K.; Wade, M.J. Identification of maternally-loaded RNA transcripts in unfertilized eggs of Tribolium castaneum. BMC Genom. 2012, 13, 671. [Google Scholar] [CrossRef]
  54. Zou, L.; Sriswasdi, S.; Ross, B.; Missiuro, P.V.; Liu, J.; Ge, H. Systematic analysis of pleiotropy in C. elegans early embryogenesis. PLoS Comput. Biol. 2008, 4, e1000003. [Google Scholar]
  55. Sonawane, A.R.; Platig, J.; Fagny, M.; Chen, C.-Y.; Paulson, J.N.; Lopes-Ramos, C.M.; DeMeo, D.L.; Quackenbush, J.; Glass, K.; Kuijjer, M.L. Understanding tissue-specific gene regulation. Cell Rep. 2017, 21, 1077–1088. [Google Scholar] [CrossRef] [Green Version]
  56. Watanabe, K.; Stringer, S.; Frei, O.; Mirkov, M.U.; de Leeuw, C.; Polderman, T.J.C.; van der Sluis, S.; Andreassen, O.A.; Neale, B.M.; Posthuma, D. A global overview of pleiotropy and genetic architecture in complex traits. Nat. Genet. 2019, 51, 1339–1348. [Google Scholar] [CrossRef] [PubMed]
  57. Jain, A.; Tuteja, G. TissueEnrich: Tissue-specific gene enrichment analysis. Bioinformatics 2018, 35, 1966–1967. [Google Scholar]
  58. Giraldez, A.J.; Mishima, Y.; Rihel, J.; Grocock, R.J.; Van Dongen, S.; Inoue, K.; Enright, A.J.; Schier, A.F. Zebrafish MiR-430 promotes deadenylation and clearance of maternal mRNAs. Science 2006, 312, 75–79. [Google Scholar] [PubMed] [Green Version]
  59. Lee, J.W.; Kim, J.E.; Goo, I.B.; Hwang, J.-A.; Im, J.H.; Choi, H.-S.; Lee, J.-H. Expression of immune-related genes during loach (Misgurnus anguillicaudatus) embryonic and early larval development. Dev. Reprod. 2015, 19, 181. [Google Scholar] [CrossRef]
  60. Salas-Vidal, E.; Meijer, A.H.; Cheng, X.; Spaink, H.P. Genomic annotation and expression analysis of the zebrafish Rho small GTPase family during development and bacterial infection. Genomics 2005, 86, 25–37. [Google Scholar]
  61. Han, F.; Wang, X.; Yang, Q.; Cai, M.; Wang, Z.Y. Characterization of a RacGTPase up-regulated in the large yellow croaker Pseudosciaena crocea immunity. Fish. Shellfish Immunol. 2011, 30, 501–508. [Google Scholar] [CrossRef]
  62. Jia, A.; Zhang, X.-H. cDNA cloning, characterization, and expression analysis of the Rac1 gene from Scophthalmus maximus. Comp. Biochem. Physiol. Part. B Biochem. Mol. Biol. 2009, 154, 80–84. [Google Scholar]
  63. Lu, F.-I.; Thisse, C.; Thisse, B. Identification and mechanism of regulation of the zebrafish dorsal determinant. Proc. Natl. Acad. Sci. USA 2011, 108, 15876–15880. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  64. Zhang, M.; Zhang, J.; Lin, S.-C.; Meng, A. β-Catenin 1 and β-catenin 2 play similar and distinct roles in left-right asymmetric development of zebrafish embryos. Development 2012, 139, 2009–2019. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  65. Chiou, M.-J.; Wang, Y.-D.; Kuo, C.-M.; Chen, J.-C.; Chen, J.-Y. Functional analysis of mitogen-activated protein kinase-3 (MAPK3) and its regulation of the promoter region in zebrafish. Dna Cell Biol. 2007, 26, 781–790. [Google Scholar] [CrossRef] [PubMed]
  66. Roufidou, C.; Feidantsis, K.; Mente, E.; Sarropoulou, E.; Antonopoulou, E. Heat shock protein (HSP) expression and mitogen-activated protein kinase (MAPK) phosphorylation during early embryonic developmental stages of the Gilthead sea bream (Sparus aurata). Mediterr. Mar. Sci. 2018, 19, 240–247. [Google Scholar]
  67. Umasuthan, N.; Bathige, S.D.N.K.; Noh, J.K.; Lee, J. Gene structure, molecular characterization and transcriptional expression of two p38 isoforms (MAPK11 and MAPK14) from rock bream (Oplegnathus fasciatus). Fish. Shellfish Immunol. 2015, 47, 331–343. [Google Scholar]
  68. Hong, S.; Zou, J.; Collet, B.; Bols, N.C.; Secombes, C.J. Analysis and characterisation of IL-1β processing in rainbow trout, Oncorhynchus mykiss. Fish. Shellfish Immunol. 2004, 16, 453–459. [Google Scholar] [CrossRef]
  69. Jiang, S.; Zhang, D.; Li, J.; Liu, Z. Molecular characterization, recombinant expression and bioactivity analysis of the interleukin-1β from the yellowfin sea bream, Acanthopagrus latus (Houttuyn). Fish. Shellfish Immunol. 2008, 24, 323–336. [Google Scholar]
  70. Mathew, J.A.; Guo, Y.X.; Goh, K.P.; Chan, J.; Verburg-van Kemenade, B.M.L.; Kwang, J. Characterisation of a monoclonal antibody to carp IL-1β and the development of a sensitive capture ELISA. Fish. Shellfish Immunol. 2002, 13, 85–95. [Google Scholar]
  71. Ogryzko, N.V.; Hoggett, E.E.; Solaymani-Kohal, S.; Tazzyman, S.; Chico, T.J.A.; Renshaw, S.A.; Wilson, H.L. Zebrafish tissue injury causes upregulation of interleukin-1 and caspase-dependent amplification of the inflammatory response. Dis. Models Mech. 2014, 7, 259–264. [Google Scholar] [CrossRef] [Green Version]
  72. Cecchini, S.; Paciolla, M.; Biffali, E.; Borra, M.; Ursini, M.V.; Lioi, M.B. Ontogenetic profile of innate immune related genes and their tissue-specific expression in brown trout, Salmo trutta (Linnaeus, 1758). Fish. Shellfish Immunol. 2013, 35, 988–992. [Google Scholar] [CrossRef]
  73. Dash, P.; Patel, S.; Dixit, A.; Garg, L.C.; Sahoo, P.K. Four pro-inflammatory cytokines of rohu (Labeo rohita) during early developmental stages, their tissue distribution and expression by leucocytes upon in-vitro stimulation. Fish. Shellfish Immunol. 2015, 47, 913–922. [Google Scholar] [CrossRef] [PubMed]
  74. Sakamaki, K.; Nozaki, M.; Kominami, K.; Satou, Y. The evolutionary conservation of the core components necessary for the extrinsic apoptotic signaling pathway, in Medaka fish. BMC Genom. 2007, 8, 141. [Google Scholar] [CrossRef] [Green Version]
  75. Yeh, W.-C.; de La Pompa, J.L.; McCurrach, M.E.; Shu, H.-B.; Elia, A.J.; Shahinian, A.; Ng, M.; Wakeham, A.; Khoo, W.; Mitchell, K. FADD: Essential for embryo development and signaling from some, but not all, inducers of apoptosis. Science 1998, 279, 1954–1958. [Google Scholar] [CrossRef] [PubMed]
  76. Patel, R.K.; Jain, M. NGS QC Toolkit: A Toolkit for quality control of next generation sequencing data. PLoS ONE 2012, 7, e30619. [Google Scholar] [CrossRef] [PubMed]
  77. Cock, P.J.A.; Fields, C.J.; Goto, N.; Heuer, M.L.; Rice, P.M. The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants. Nucleic Acids Res. 2010, 38, 1767–1771. [Google Scholar] [CrossRef] [Green Version]
  78. Devisetty, U.K.; Kennedy, K.; Sarando, P.; Merchant, N.; Lyons, E. Bringing your tools to cyverse discovery environment using docker. F1000Research 2016, 5. [Google Scholar]
  79. Haas, B.J.; Papanicolaou, A.; Yassour, M.; Grabherr, M.; Blood, P.D.; Bowden, J.; Couger, M.B.; Eccles, D.; Li, B.; Lieber, M.; et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 2013, 8, 1494. [Google Scholar] [CrossRef]
  80. Finn, R.D.; Clements, J.; Eddy, S.R. HMMER web server: Interactive sequence similarity searching. Nucleic Acids Res. 2011, 39 (Suppl. 2), W29–W37. [Google Scholar] [CrossRef] [Green Version]
  81. Powell, S.; Szklarczyk, D.; Trachana, K.; Roth, A.; Kuhn, M.; Muller, J.; Arnold, R.; Rattei, T.; Letunic, I.; Doerks, T.; et al. eggNOG v3. 0: Orthologous groups covering 1133 organisms at 41 different taxonomic ranges. Nucleic Acids Res. 2011, 40, D284–D289. [Google Scholar] [CrossRef]
  82. Petersen, T.N.; Brunak, S.; von Heijne, G.; Nielsen, H. SignalP 4.0: Discriminating signal peptides from transmembrane regions. Nat. Methods 2011, 8, 785–786. [Google Scholar] [CrossRef]
  83. Krogh, A.; Larsson, B.; Von Heijne, G.; Sonnhammer, E.L. Predicting transmembrane protein topology with a hidden markov model: Application to complete genomes1. J. Mol. Biol. 2001, 305, 567–580. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  84. Bray, N.L.; Pimentel, H.; Melsted, P.; Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 2016, 34, 525–527. [Google Scholar] [CrossRef] [PubMed]
  85. Robinson, M.D.; Oshlack, A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 2010, 11, R25. [Google Scholar] [PubMed] [Green Version]
  86. MacManes, M.D. Establishing evidenced-based best practice for the de novo assembly and evaluation of transcriptomes from non-model organisms. BioRxiv 2016, 035642. [Google Scholar]
  87. Xia, X.; Huo, W.; Wan, R.; Xia, X.; Du, Q.; Chang, Z. Identification of housekeeping genes as references for quantitative real-time RT-PCR analysis in Misgurnus anguillicaudatus. J. Genet. 2017, 96, 895–904. [Google Scholar] [CrossRef] [PubMed]
  88. Livak, K.J.; Schmittgen, T.D. Analysis of relative gene expression data using real-time quantitative PCR and the 2− ΔΔCT method. Methods 2001, 25, 402–408. [Google Scholar] [CrossRef]
Figure 1. Functional annotation of transcripts from the de novo assembled M. anguillicaudatus transcriptome: (a) Venn diagram of all annotated transcripts from the reference transcriptome against non-redundant (NR), Swiss-Prot, Pfam, non-supervised orthologous groups (eggNOG), gene ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases; (b) E-value distribution; (c) similarity distribution; and (d) species distribution.
Figure 1. Functional annotation of transcripts from the de novo assembled M. anguillicaudatus transcriptome: (a) Venn diagram of all annotated transcripts from the reference transcriptome against non-redundant (NR), Swiss-Prot, Pfam, non-supervised orthologous groups (eggNOG), gene ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases; (b) E-value distribution; (c) similarity distribution; and (d) species distribution.
Ijms 21 03872 g001
Figure 2. GO classification for the reference transcriptome and enrichment analysis of maternal gene transcripts using the Pearson Chi-Square test. Horizontal axis displays the percentage of significant transcripts in each column of which the number at the end indicates the number of transcripts. Vertical axis displays the detailed GO term corresponding to each functional type. Star marks in front of terms indicate significantly enriched GO terms (p < 0.05).
Figure 2. GO classification for the reference transcriptome and enrichment analysis of maternal gene transcripts using the Pearson Chi-Square test. Horizontal axis displays the percentage of significant transcripts in each column of which the number at the end indicates the number of transcripts. Vertical axis displays the detailed GO term corresponding to each functional type. Star marks in front of terms indicate significantly enriched GO terms (p < 0.05).
Ijms 21 03872 g002
Figure 3. KEGG pathway annotation for the reference transcriptome and enrichment analysis of maternal gene transcripts using the Pearson Chi-Square test. Star marks in front of pathways indicate significantly enriched KEGG pathway (p < 0.05).
Figure 3. KEGG pathway annotation for the reference transcriptome and enrichment analysis of maternal gene transcripts using the Pearson Chi-Square test. Star marks in front of pathways indicate significantly enriched KEGG pathway (p < 0.05).
Ijms 21 03872 g003
Figure 4. Expression analysis of transcripts in different tissues of M. anguillicaudatus and characterization of maternal transcripts: (a) overlapping comparison of expressed transcripts with a cut-off trimmed mean normalization of M-values (TMM) value < 1 in each tissue; (b) the number of transcripts with different expression abundances in each tissue; (c) heatmap of maternal transcripts with substantial expression differences across tissues; and (d) the number and abundance of transcripts with tissue-specificity by the TissueEnrich tool, which were coexpressed in more than two tissues. “Maternal” in the horizontal axis represented transcripts that were coexpressed in the unfertilized eggs with other tissues; (e) GO terms (p < 0.05) that contained significantly different numbers of maternal transcript with tissue-specificity.
Figure 4. Expression analysis of transcripts in different tissues of M. anguillicaudatus and characterization of maternal transcripts: (a) overlapping comparison of expressed transcripts with a cut-off trimmed mean normalization of M-values (TMM) value < 1 in each tissue; (b) the number of transcripts with different expression abundances in each tissue; (c) heatmap of maternal transcripts with substantial expression differences across tissues; and (d) the number and abundance of transcripts with tissue-specificity by the TissueEnrich tool, which were coexpressed in more than two tissues. “Maternal” in the horizontal axis represented transcripts that were coexpressed in the unfertilized eggs with other tissues; (e) GO terms (p < 0.05) that contained significantly different numbers of maternal transcript with tissue-specificity.
Ijms 21 03872 g004
Figure 5. Identification of maternal immune-related gene transcripts based on GO and KEGG pathway annotations: (a) Distribution of maternal gene transcripts in the immune system process (GO:0002367) at GO level 3 and (b) distribution of maternal gene transcripts in the immune system pathway subcategories.
Figure 5. Identification of maternal immune-related gene transcripts based on GO and KEGG pathway annotations: (a) Distribution of maternal gene transcripts in the immune system process (GO:0002367) at GO level 3 and (b) distribution of maternal gene transcripts in the immune system pathway subcategories.
Ijms 21 03872 g005
Figure 6. Transcriptional levels of identified maternal immune-related candidate genes during ontogeny of M. anguillicaudatus: (a) the ontogeny of M. anguillicaudatus, time for each developmental stage was represented by HPF (hours post-fertilization) and DPH (days post-hatching) and transcriptional levels of (b) RAC1; (c) MAPK3; (d) MAPK14; (e) IL1B; (f) CTNB1; (g) FADD; and (h) ITB1. The results analyzed in statistics are shown in Table S6.
Figure 6. Transcriptional levels of identified maternal immune-related candidate genes during ontogeny of M. anguillicaudatus: (a) the ontogeny of M. anguillicaudatus, time for each developmental stage was represented by HPF (hours post-fertilization) and DPH (days post-hatching) and transcriptional levels of (b) RAC1; (c) MAPK3; (d) MAPK14; (e) IL1B; (f) CTNB1; (g) FADD; and (h) ITB1. The results analyzed in statistics are shown in Table S6.
Ijms 21 03872 g006
Table 1. Statistics for sequencing, de novo assembly, and completeness assessment of the M. anguillicaudatus transcriptome.
Table 1. Statistics for sequencing, de novo assembly, and completeness assessment of the M. anguillicaudatus transcriptome.
Sequencing Stats (Illumina HiSeq 2500; paired-end, 2 × 101 bp)
SequencingOvaryTestisMuscleUnfertilized eggsTotal
No. of raw reads63,490,55660,095,44956,211,23563,470,524243,267,764
No. of clean reads62,078,59758,749,28055,006,88962,034,978237,869,744
Q20 of clean read100%100%100%100%100%
De novoAssembly Stats (Trinity v2.2.0)
No. of total genesNo. of total transcriptsTotal size (bp)Mean size (bp)N50GC (%)
202,709281,866218,148,238773.94169242.96
After Removing Non-Redundant sequzences (CD-HITest v4.6.1)
No. of total genesNo. of total transcriptsTotal size (bp)Mean size (bp)N50GC (%)
193,289267,111197,693,642740.12153342.90
Expression Profiling (Kallisto v0.43.0 and edgeR v3.8.5)
DescriptionOvaryTestisMuscleUnfertilized eggsAverage
Mapping ratio (%)9592899592.8
No. of TMM ≥ 122,23373,68345,60324,116
Table 2. Maternal genes involved in multiple immune systems with high expression levels (TMM value > 50).
Table 2. Maternal genes involved in multiple immune systems with high expression levels (TMM value > 50).
GeneDescriptionNo. of Involved Immune SystemTMM Value
RAC1Ras-related C3 botulinum toxin substrate 11587.41
MAPK3mitogen-activated protein kinase 31360.25
MAPK14mitogen-activated protein kinase 141261.73
IL1Binterleukin-1 beta7216.87
CTNB1catenin beta-15138.05
FADDFAS-associated death domain protein565.82
ITB1Integrin beta 12166.79
LEF1lymphoid enhancer-binding factor 1261.20
MELKmaternal embryonic leucine zipper kinase276.41
PDPK13-phosphoinositide-dependent protein kinase 12127.54

Share and Cite

MDPI and ACS Style

Kim, C.-H.; Kim, E.J.; Seo, C.; Nam, Y.K. Transcriptome Analysis of Maternal Gene Transcripts in Unfertilized Eggs of Misgurnus anguillicaudatus and Identification of Immune-Related Maternal Genes. Int. J. Mol. Sci. 2020, 21, 3872. https://doi.org/10.3390/ijms21113872

AMA Style

Kim C-H, Kim EJ, Seo C, Nam YK. Transcriptome Analysis of Maternal Gene Transcripts in Unfertilized Eggs of Misgurnus anguillicaudatus and Identification of Immune-Related Maternal Genes. International Journal of Molecular Sciences. 2020; 21(11):3872. https://doi.org/10.3390/ijms21113872

Chicago/Turabian Style

Kim, Chan-Hee, Eun Jeong Kim, Chaehwa Seo, and Yoon Kwon Nam. 2020. "Transcriptome Analysis of Maternal Gene Transcripts in Unfertilized Eggs of Misgurnus anguillicaudatus and Identification of Immune-Related Maternal Genes" International Journal of Molecular Sciences 21, no. 11: 3872. https://doi.org/10.3390/ijms21113872

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop