Next Article in Journal
Screening out microRNAs and Their Molecular Pathways with a Potential Role in the Regulation of Parvovirus B19 Infection Through In Silico Analysis
Previous Article in Journal
Characterization of the 172 SNPs Included in the ForenSeq™ DNA Signature Prep Kit in a Population from Northeast Italy
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Chromosome-Level Genome Assembly and Genomic Analysis of the Hybrid Grouper ShanHu (Epinephelus fuscoguttatus ♀ × Epinephelus polyphekadion ♂)

by
Yiqun Liu
1 and
Yunxiang Mao
1,2,*
1
Key Laboratory of Tropical Aquatic Germplasm of Hainan Province, Sanya Oceanographic Institute, Ocean University of China, Sanya 572000, China
2
Yazhou Bay Innovation Institute, Hainan Tropical Ocean University, Sanya 572022, China
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2025, 26(11), 5036; https://doi.org/10.3390/ijms26115036
Submission received: 28 March 2025 / Revised: 7 May 2025 / Accepted: 9 May 2025 / Published: 23 May 2025
(This article belongs to the Section Molecular Biology)

Abstract

Groupers are important aquaculture species, and hybridization is an effective breeding method for genetic improvement and to enhance production efficiency in groupers. The ShanHu grouper (Epinephelus fuscoguttatus ♀ × Epinephelus polyphekadion ♂) is a hybrid grouper with potential for aquaculture development and research value. Using Illumina and PacBio sequencing platforms, as well as PacBio SMRT technology and Hi-C auxiliary mounting technology, the whole genome sequencing and assembly of the ShanHu grouper were completed, resulting in a chromosome-level genome information for this hybrid grouper. The genome assembly has a total length of 1.17 Gb with a scaffold N50 of 46.12 Mb, and 171 contigs were anchored into 24 chromosomes. Additionally, its repeat sequences and non-coding RNAs were annotated and 26,102 genes were predicted. Through comparative genomic analysis of the hybrid species ShanHu grouper and its parents, we found that comparative genomic analyses revealed centric inversion structural variations on the chromosomes of the hybrid ShanHu grouper in relation to the brown-marbled grouper and the camouflage grouper. Furthermore, the gene families of the hybrid species have expanded in pathways related to immunity and growth development. This study is the first to provide complete genomic information for a hybrid grouper, offering its full genetic information, exploring the genetic variations in the genomes of hybrid offspring, and providing data references for foundational theoretical research and grouper production practices.

1. Introduction

Groupers (Epinephelidae, Perciformes) are important marine aquaculture fish consisting of 16 genera and approximately 160 species [1], among which approximately 47 species are currently farmed [2]. Currently, China’s annual marine aquaculture production of groupers has exceeded 200,000 tons [3]. Groupers also serve as good models for the study of marine ecological systems, social relations, and sex inversion [4].
Currently, the genomic research of groupers is progressing rapidly and the complete genome information of various economic groupers has been decoded. Examples are: Epinephelus lanceolatus, E. akaara, Plectropomus leopardus, E. tukula, Cromileptes altivelis, E. cyanopodus, E. moara, and E. polyphekadion [5,6,7,8,9,10,11,12]. The brown-marbled grouper (E. fuscoguttatus) and the camouflage grouper (E. polyphekadion) are two species with relatively mature aquaculture techniques. The brown-marbled grouper is known for its high disease resistance and stress tolerance [13]. Additionally, it has an extended spawning season of 8–9 months per year, with a stable spawning period and high-quality, high-volume egg production. Thus, it is considered an excellent maternal parent for hybridization with other grouper species, such as E. fuscoguttatus ♀ × E. lanceolatus ♂ and E. fuscoguttatus ♀ × potato grouper (E. tukula) ♂ [14,15]. The camouflage grouper, on the other hand, is characterized by its docile nature, high protein content, and delicious taste [16]. Hybridization is an effective breeding method used in fish farming to improve genetic traits and production efficiency. It is also one of the most widely used breeding technologies in fish breeding, producing offspring with traits superior to their parents through genetic recombination [17]. The ShanHu grouper (E. fuscoguttatus ♀ × E. polyphekadion ♂), created by using the brown-marbled grouper as the maternal parent and the camouflage grouper as the paternal parent, is an example of a well-developed hybrid grouper species with great commercial development potential and value for foundational theoretical research [18]. As a hybrid grouper, the ShanHu grouper integrates the advantageous traits of its parental generations. It demonstrates the characteristics of strong stress resistance and rapid growth rate from its paternal parent, as well as the features of high protein content and delicious flavor from its maternal parent. The ShanHu grouper represents enormous economic value for the industry. In the aspect of scientific research, the ShanHu is also an excellent model for studying the hybridization mechanism of groupers [19].
Whole genome sequencing represents a complete set of genetic information for an organism. A high-quality genome sequence is a crucial foundation and valuable resource for genomics, functional genomics, comparative genomics, species evolution, and gene function research [20]. Using third-generation PacBio SMRT sequencing technology and Hi-C high-throughput chromatin conformation we completed the chromosome-level whole genome assembly and annotation of the hybrid ShanHu grouper, acquiring a high-quality and complete set of genetic information for the hybrid species. On the whole-genome level we explored the differences between the brown-marbled grouper, the camouflage grouper, and their hybrid the ShanHu grouper. The whole genome sequencing results of the hybrid grouper provide a comprehensive genetic reference for future research on groupers.

2. Results

2.1. Genome Assembly and Quality Assessment

Using Illumina libraries and paired-end (PE) sequencing based on the Illumina Hiseq platform, we assessed the genome size, GC content, heterozygosity, repetitive sequences, and potential foreign contamination of the hybrid ShanHu groupers. First, a total of 59.30 Gb of raw data were obtained from the hybrid ShanHu groupers second-generation sequencing, with a sequence efficacy of 85.16%, an error rate of 0.04%, and Q20 and Q30 values of 96.72% and 91.52%, respectively. The GC content was found to be 41.38%. K-mer analysis was performed to evaluate the genome size and heterozygosity of the ShanHu groupers, setting the K-mer length to 17 bp (Figure A1). The predicted genome size of the hybrid ShanHu groupers was 1125.98 Mbp, with a heterozygosity of 1.7% and a repeat rate of 41.12%. Additionally, after assembling the data of the ShanHu groupers, the correlation results of the GC content and sequencing depth for the contigs are shown in Figure A2. The GC content of the contigs did not exhibit separation, indicating that the ShanHu groupers genome is free from foreign contamination.
The sequencing amount for ShanHu groupers based on PacBio technology was 100 Gb. Based on the estimated genome size of 1125 Mb from the second-generation sequencing data, the sequencing coverage depth was calculated to be 88.89×. After quality control of the sequencing data, we obtained the genome sequence of ShanHu groupers and conducted a preliminary assembly of its genome. The assembled genome sequence contained 153 contigs, with a contig N50 of 46.12 Mb and a total genome size of 1.17 Gb where the longest contig was 70.4 Mb.
Once the genome assembly of ShanHu groupers was completed, we assessed the quality of the assembly using BUSCO analysis to evaluate the genome integrity. The BUSCO evaluation results indicated that, using a single-copy ortholog gene set of 3354 genes, 98.5% of complete BUSCO genes were assembled, with 3135 complete single-copy genes successfully assembled from the ortholog gene set (Figure A3).

2.2. Pseudochromosome Construction

HI-C assisted assembly was performed on the ShanHu groupers genome to achieve chromosome-level resolution (Figure 1). The chromosome-level genome assembly results are as follows (Table 1): the total length of contigs is 1,176,882,344 bp, with a contig N50 length of 45,365,500 bp; the total length of scaffolds is 1,176,892,644 bp, with a scaffold N50 length of 46,120,958 bp; and the total length of sequences anchored to the chromosomes is 1,083,187,597 bp, with 92.04% of the sequences anchored to the 24 chromosomes of the ShanHu groupers (Table A1).

2.3. Genome Annotation

Based on the assembled chromosome-level genome of the hybrid ShanHu groupers, we completed the annotation of repetitive sequences, non-coding RNAs, and protein-coding genes, as well as predicted related functions (Figure 2). For protein-coding genes, the ShanHu groupers genome predicted a total of 26,102 genes (Table 2) (Figure A4 and Figure A5), and based on comparisons with known protein databases 98.5% of the genes were able to have their functions predicted (Figure A6). Repeat sequences were identified using homologous and de novo methods, revealing a total of 552.11 Mbp of repetitive sequences in the ShanHu groupers genome, which accounts for 46.91% of the genome (Table 3). For the non-coding RNAs in the ShanHu groupers, a total of 3438 miRNAs, 2415 tRNAs, 8374 rRNAs, and 1671 snRNAs were identified (Table 4).

2.4. Phylogenetic Analysis of Economically Epinephelus

Using E. fuscoguttatus and E. polyphekadion as target species, and E. lanceolatus, E. akaara, E. moara, E. awoara, E. tukula, P. leopardus, and C. altivelis as comparative species, along with the model organism Danio rerio as an outgroup, orthologs of the 10 fish species were predicted. A total of 253,963 genes were clustered into 21,710 gene families, of which 5457 were common single-copy gene families (Figure A7 and Figure A8). The shared single-copy genes were used to construct a phylogenetic tree with divergence times (Figure 3). Based on the 5457 shared single-copy gene families between the target species and comparative species, we analyzed the phylogenetic relationships and evolutionary history at the genomic level among the brown-marbled groupers, the camouflage groupers, and several other economically important groupers. The estimated divergence times indicate that the ancestor of groupers diverged from Danio rerio approximately 221.3 million years ago (204.3–252.1 million years). The divergence time among economically important groupers is more recent, with their ancestor beginning to diverge around 49 million years ago (40.8–59.0 million years) and leading to the formation of P. leopardus and other groupers. After the first divergence among groupers, approximately 23.5 million years ago (19.5–27.9 million years), the ancestors of the E. awoara and the E. akaara began to diverge, grouping them together. The genetic evolutionary relationship between the brown-marbled grouper and the camouflage grouper is the most recent among economically important groupers, with their ancestor diverging about 2.5 million years ago (2.0–3.2 million years).

2.5. Comparative Analyses of the Genome Structure

A collinearity analysis was conducted on the ShanHu groupers genome in comparison with the brown-marbled groupers and camouflage groupers genomes to understand the structural differences in chromosomes between the hybrid ShanHu groupers and their maternal and paternal parents. (Figure 4). Between the genomes of hybrid ShanHu grouper and the maternal brown-marbled grouper, a total of 342 homologous gene collinear blocks were identified, with 73.87% of the genes located within these collinear blocks. While between the hybrid grouper and the paternal camouflage grouper genomes, 340 homologous gene collinear blocks were identified, with 76.43% of the genes falling within these blocks. The hybrid ShanHu grouper displayed a high proportion of homologous genes compared to both parental genomes (Figure A9 and Figure A10).
Both the ShanHu grouper vs. the maternal brown-marbled grouper and the ShanHu grouper vs. the paternal camouflage grouper exhibited vertical collinear regions, as well as collinear regions showing central inversion phenomena. The number of collinear genes between the ShanHu grouper and the camouflage grouper is higher than that between the ShanHu grouper and the brown-marbled grouper. Both the ShanHu grouper and the brown-marbled grouper, as well as the ShanHu grouper and the camouflage grouper, have vertical collinear regions and collinear regions with central inversion phenomena. There is a phenomenon of vertical collinearity in the structures of chromosomes No. 1, 3, 4, 5, 6, 7, 12, 13, 14, 16, 17, 20, 21, 22, and 23 of the ShanHu grouper and chromosomes No. 1, 2, 3, 5, 6, 7, 8, 9, 11, 16, 17, 19, 20, 22, and 24 of the brown-marbled grouper. Additionally, there is a phenomenon of vertical collinearity in the structures of chromosomes No. 1, 9, 12, 19, and 21 of the ShanHu grouper and chromosomes No. 5, 11, 13, 14, and 15 of the camouflage grouper. When comparing the hybrids, the structural similarity of the ShanHu groupers’ chromosomes to the maternal brown-marbled grouper was higher, while the proportion of homologous genes was greater for the ShanHu grouper compared to the paternal camouflage groupers despite exhibiting larger structural differences.
Comparative genomic analysis was conducted between the ShanHu grouper and the brown-marbled grouper and the camouflaged grouper. After hybridization, it was found that 1545 gene families had undergone expansion. GO functional enrichment was performed and multiple enriched pathways related to immunity and growth development were identified (Figure 5). The expanded genes were analyzed using protein–protein interaction (PPI) and maximal clique centrality (MCC) methods. Through the constructed PPI network it was found that several genes related to growth and immunity, such as acta1b, hsp70.3, hspa8, cdc42, and cxcr4b, are closely connected to other expanded genes. Furthermore, several hub genes, such as cxc4, ptprc, and ccr2, were identified as playing important roles among the expanded genes (Figure 6).

3. Discussion

The grouper is an important aquaculture fish species. Hybridization is an effective breeding method for improving the genetics and increasing production efficiency in fish farming practices, and it is also one of the most widely used breeding technologies in fish breeding. Currently, regarding grouper hybridization experiments, numerous research teams have conducted extensive studies and screened many promising grouper hybrid combinations, with the ShanHu grouper considered a hybrid grouper with significant development potential [18,21]. Whole genome sequencing represents the complete genetic material of an organism and is the most important means of understanding that organism’s information [22]. Deciphering the genome sequence of the ShanHu grouper can provide comprehensive reference data for studying the genetic mechanisms of grouper hybridization. Additionally, it can accelerate the foundational research on this species, which is substantially meaningful and practically valuable with respect to genetic trait analysis, genetic breeding, germplasm resource conservation, and aquaculture.
Firstly, based on the PacBio and Illumina sequencing platforms, as well as the PacBio SMRT technology and Hi-C assisted scaffolding technology, the whole-genome sequencing and assembly of the ShanHu grouper were carried out. The whole-genome size of the ShanHu grouper is 1.17 Gb, which is slightly larger than the genome size of its male parent, E. fuscoguttatus, at 1.08 Gb, and that of its female parent, E. polyphekadion, at 1.09 Gb. The scaffold N50 is 46.12 Mb and 92.04% of the sequences can be successfully anchored to the 24 chromosomes of the ShanHu grouper. On the basis of the completed genome assembly of the ShanHu grouper, we annotated its repetitive sequences, non-coding RNAs, and predicted 26,102 genes. Through the processes of sequencing, assembly, and annotation of the ShanHu grouper, complete whole-genome information of this hybrid grouper has been obtained, enabling a deeper understanding of the genetic basis of the ShanHu grouper which is a hybrid grouper with high economic value.
At the same time, a phylogenetic tree of economical groupers was constructed at the whole-genome level to understand the evolutionary relationships among groupers. It was found that the genetic evolutionary relationship between the brown-marbled grouper and the camouflage grouper is the closest among economical groupers, providing theoretical evidence for the high hybridization potential between them. Hybridization is recognized as a potentially creative force contributing to adaptation and species diversification [23]. The ShanHu grouper has a recombinant genome with the same ploidy level as its parental species. The ShanHu grouper is formed through homoploid hybrid speciation between the E. fuscoguttatus and the E. polyphekadion. It may take many forms with respect to its genomic makeup, ranging from the introgression of a single or few genes into a foreign genomic background to balanced genomic contributions from both parent lineages [24]. Synteny analysis between the ShanHu grouper and its maternal brown-marbled grouper and paternal camouflage grouper revealed a high proportion of homologous genes and structural variations, with centric inversions in the chromosomes of both the ShanHu grouper and the brown-marbled grouper as well as the camouflage grouper. There are 24,031 homologous genes between the ShanHu and the brown-marbled grouper, accounting for 92.0%, and 24,077 homologous genes between the ShanHu and the camouflage grouper, accounting for 92.2%. Between the genomes of the hybrid ShanHu grouper and the maternal brown-marbled grouper a total of 342 homologous gene collinear blocks were identified, with 73.87% of the genes located within these collinear blocks. While between the hybrid grouper and the paternal camouflage grouper genomes a total of 340 homologous gene collinear blocks were identified, with 76.43% of the genes falling within these blocks. More vertical synteny chromosomes were observed between the ShanHu grouper and the brown-marbled grouper compared to the camouflage grouper, whereas more centric inversion structural variations occurred between the ShanHu grouper and the camouflage grouper than between the ShanHu grouper and the brown-marbled grouper. The 8th, 9th, 10th, 11th, 15th, 18th, and 19th chromosomes of the ShanHu grouper showed structural central inversions with the 4th, 10th, 12th, 13th, 14th, 15th, and 18th chromosomes of the brown-marbled grouper. Additionally, the 3rd, 4th, 5th, 6th, 7th, 8th, 10th, 11th, 13th, 15th, 16th, 17th, 18th, 20th, 22nd, and 23rd chromosomes of the ShanHu grouper displayed central inversions with the 1st, 2nd, 3rd, 4th, 6th, 7th, 8th, 9th, 10th, 12th, 16th, 17th, 18th, 19th, 20th, and 21st chromosomes of the camouflage grouper. The chromosomal structure of the ShanHu grouper exhibits a greater degree of similarity to that of its maternal parent. Intriguingly, with respect to crucial aquaculture traits that are of primary concern for commercially valuable fish species, such as growth patterns, developmental characteristics, and morphological features, the ShanHu grouper also demonstrates a closer resemblance to its maternal parent. During the hybridization process of the ShanHu grouper’s parents, gene recombination occurs. The drastic changes in the genome will prompt the replication of some genes, thereby increasing the gene copy number. Based on the whole genome information, it was found that multiple genes related to immunity and growth development had expanded in the hybrid ShanHu grouper compared to in the parents. We speculate that during the hybridization process, gene recombination has led to the expansion of genes related to immunity and growth in the ShanHu grouper. This is an important reason why the ShanHu grouper demonstrates stronger resistance and growth advantages during the farming process.
The ShanHu grouper, as a new variety of groupers in the field of grouper aquaculture, has a very broad aquaculture prospect and is of far-reaching significance for our study of the genetic mechanism of grouper hybridization. Third-generation sequencing technology has been used to conduct whole-genome sequencing of the ShanHu grouper so as to obtain a whole-genome sequence of the hybrid grouper at the chromosomal level. Currently, many model organisms have completed whole genomes at the telomere-to-telomere level through the use of ultra-long sequencing technology, which enables more accurate and comprehensive coverage of the entire chromosomal region to obtain more complete assembled genome sequences and decipher complex genetic structures. With the improvement of sequencing technology and the popularization of ultra-long sequencing methods, in the future more complete whole genomes at the telomere-to-telomere level will emerge in the field of aquaculture to serve the development of the industry.

4. Materials and Methods

4.1. Sample Collection, Library Construction, and Sequencing

A ShanHu grouper was obtained by hybridizing the brown-marbled grouper as the maternal parent and the camouflage grouper as the paternal parent. The hybrid parents were sourced from Hainan Chenhai Aquatic Co., Ltd. (Lingshui), China. The hybrid groupers were raised to one year of age, and their heart, liver, spleen, kidney, skin, fins, gills, and muscle tissues were collected and immediately frozen in liquid nitrogen. Genomic DNA of the ShanHu grouper was extracted from dorsal muscle tissue using a QIAGEN DNeasy Kit (QIAGEN, Shanghai, China). For the Illumina platform (NEB, Ipswich, MA, USA), the short fragment library with an insertion size of 350 bp was generated using a NEB Next® Ultra™ DNA Library Prep Kit. Subsequently, HiFi SMRTbell Libraries with an insert size of 20 kb were generated using the SMRTbell Template PrepKit 2.0 (PacBio, Menlo Park, CA, USA), and the library was sequenced in circular consensus sequencing (CCS) on the PacBio Sequel II platform. The Hi-C library was constructed following the standard protocol described previously with certain modifications and was sequenced using the Illumina NovaSeq 6000 platform [25]. In total, 59 Gb of Illumina data and 100 Gb of PacBio data were generated, and low-quality reads and sequencing-adaptor-contaminated reads were removed. RNA isolation was performed using RNAsimple Total RNA Kit (TIANGEN, Beijing, China). RNA-sequencing libraries for 8 tissues (spleen, gill, skin, muscle, liver, heart, kidney, and fin) were constructed using the NEBNext Ultra RNA Library Prep Kit following the manufacturer’s protocol and also sequenced on the Illumina NovaSeq 6000 platform.

4.2. Genome Size Estimation, Assembly, and Quality Assessment

First, the constructed library was sequenced using Illumina Novaseq for paired-end (PE) sequencing. To ensure the quality of the data analysis, raw reads were filtered to obtain clean reads. K-mer frequency distribution analysis was then used to estimate the genome characteristics using Jellyfish (v2.2.7), based on a 17-mer [26]. The estimated genome size was 1125.98 Mb, the heterozygosity rate was approximately 1.7%, and the repeat content was 41.12%. The genome was then assembled using SOAPdenovo2 (r242) software with a 41-mer to obtain the preliminary assembly results for the ShanHu grouper genome [27]. Based on the pre-assembly results, sequencing was performed on the PacBio platform with a coverage depth of 88.89×. Quality control was conducted on the raw sequencing data to obtain HiFi reads, and the genome was assembled using Hifiasm (v0.19.5) software [28]. Finally, the completeness of the genome assembly was assessed using Benchmarking Universal Single-Copy Orthologs (BUSCO) (v4.1.2) [29].

4.3. Pseudochromosome Construction

Following the standard protocol described previously with certain modifications, we constructed Hi-C libraries using the original sample as input [30]. Hi-C sequencing was performed using muscle tissue from the ShanHu grouper. Based on the principle that inter-chromosomal interactions are weaker than intra-chromosomal interactions and that long-range interactions are weaker than short-range ones, we used high-throughput chromosome conformation capture (Hi-C) technology to cluster contigs into chromosome groups. The order and orientation of the contigs within each chromosome group were then determined to construct the chromosome-level whole genome of the ShanHu grouper. Based on the sequencing results and the obtained Hi-C reads, the contig sequences assembled were mapped to the chromosome level using Allhic (v0.9.8) software [31].

4.4. Genome Prediction and Annotation

We conducted repeat annotation, non-coding RNA annotation, protein-coding gene prediction, and functional annotation on the ShanHu grouper genome. For repeat annotation, repetitive sequences were detected using a combined approach of homology alignment and de novo searches. Homologous sequences comparison was conducted using RepeatMasker (v4.1.0) and RepeatProteinMask based on Repbase database [32]. And a de novo candidate database of repetitive elements was constructed by LTR (v1.0.6), RepeatScout (v1.0.5), and RepeatModeler (v2.0.1) software [33,34,35]. Additionally, tandem repeat sequences were predicted using the software TRF (v4.09). Non-coding RNA annotations, including rRNA, snRNA, and microRNA, were identified using Infernal (v1.1) and tRNAs were predicted using tRNAscan- SE (v2.0) [36,37]. A combination of homology-based prediction, de novo prediction, and transcriptome-based prediction was used to predict the protein-coding genes within the ShanHu grouper genome. First, protein sequences from other fish, including E. fuscoguttatus, E. lanceolatus, E. moara, Danio rerio, Gasterosteus aculeatus, and Takifugu rubripes, were aligned against the ShanHu grouper genome using Augustus (v3.2.3), Glimmer (v3.0.4), and Genscan (v1.0) software [38,39,40]. Next, solar software (v0.9.6) was used to conjoin with BLAST (v2.2.26) to generate a hit. GeneWise (v2.4.1) was then used to predict the exact gene structure of the corresponding genomic region on each BLAST hit [41,42]. Additionally, high-quality RNA-seq data were assembled by Trinity (v2.11.0), and the sequences were aligned against the ShanHu grouper genome to assemble spliced alignment [43,44]. Furthermore, RNA-seq reads were mapped to the ShanHu grouper genome using Tophat (v2.0.13) and then assembled into gene models by Cufflinks (v2.1.1) [45]. Finally, a non-redundant and more complete gene set was integrated via EvidenceModeler (EVM) (v1.1.1) using various gene models. Furthermore, the predicted protein-coding genes in the ShanHu grouper genome were annotated based on public databases including SwissProt, NR database, InterPro, and KEGG pathway [46,47,48].

4.5. Phylogenetic Analysis

Using the brown-marbled grouper and the camouflage grouper as target species, a phylogenetic tree for groupers was constructed to understand the evolutionary relationships between the brown-marbled grouper and the camouflage grouper in relation to other economically important grouper species. First, orthologous relationships between the genes of E. lanceolatus, E. akaara, E. moara, E. awoara, E. tukula, P. leopardus, C. altivelis, and D. rerio were inferred through all-against-all protein sequence similarity searches with OthoMCL (v1.4), retaining only the longest predicted transcript per locus, with an E-value threshold set to 1 × 10−5 [49,50]. Then, for each gene family, an alignment was produced using Muscle (v3.8.31), ambiguously aligned positions were trimmed using Gblocks (v0.91b), and the tree was inferred using RAxML (v8.2.12) [51,52]. Divergence times between species were calculated using the MCMCtree (v1.1) program implemented in PAML [53].

4.6. Comparative Genomic Analyses

Based on the whole-genome information of the brown-marbled grouper, the camouflage grouper, and the hybrid ShanHu grouper, synteny analysis was conducted using MCscanX software (E-value ≤ 1 × 10−5, match size ≥ 5) with default parameters [54]. We identified the orthologous groups among these three species using all-to-all blast (E-value ≤ 1 × 10−5, identity ≥ 80%) and identified the expanded families using café [55]. Enrichment analyses based on GO annotations were performed to identify functional implications of the expanded family (Fisher’s exact test, adjusted p-value < 0.05). A PPI network was conducted using STRING v11.0 with default parameters [56]. Additionally, the maximal clique centrality (MCC) method of the cytoHubba module in Cytoscape (v3.10.3) software was performed to determine the hub genes in the networks [57].

Author Contributions

Y.L. carried out the hybridization work of ShanHu and completed tasks such as its cultivation and sampling. Y.M. conceived this paper. Y.L. and Y.M. drafted and revised the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Project of Sanya Yazhou Bay Science and Technology City, grant number SKJC-2020-02-009 and the Open Project Program of Yazhou Bay Innovation Institute of Hainan Tropical Ocean University, grant number No. 2022CXYKFKT06.

Institutional Review Board Statement

We declare here that this study was approved by the College of Marine Life Sciences, Ocean University of China Institutional Animal Care and Use Committee on 10 October 2018 (Project Identification Code: 20181010).

Data Availability Statement

Whole Genome Shotgun project for ShanHu has been deposited at GenBank under the accession GCA_047716675.1 under BioProject number PRJNA1210092.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Table A1. Scaffold Cluster Number and Length of Each Chromosome in ShanHu.
Table A1. Scaffold Cluster Number and Length of Each Chromosome in ShanHu.
Sequeues IDCluster NumberSequeues Length (bp)
Chr1144,551,120
Chr2237,351,789
Chr3153,015,742
Chr4147,211,449
Chr5148,719,712
Chr6153,326,279
Chr7148,243,466
Chr8146,120,958
Chr9145,365,500
Chr10148,494,318
Chr11143,829,164
Chr12143,297,142
Chr13147,782,323
Chr14124,826,122
Chr15148,046,878
Chr16152,374,887
Chr17141,935,179
Chr18158,196,232
Chr19145,048,601
Chr20145,100,459
Chr21246,795,679
Chr22239,857,385
Chr232541,020,352
Chr2410032,676,861

Appendix B

Figure A1. Distribution graph of K-mer for ShanHu. The horizontal axis represents the depth when K-mer = 17 and the vertical axis represents the frequency at different depths.
Figure A1. Distribution graph of K-mer for ShanHu. The horizontal axis represents the depth when K-mer = 17 and the vertical axis represents the frequency at different depths.
Ijms 26 05036 g0a1
Figure A2. GC-depth distribution graph for ShanHu.
Figure A2. GC-depth distribution graph for ShanHu.
Ijms 26 05036 g0a2
Figure A3. BUSCO assessment results for ShanHu.
Figure A3. BUSCO assessment results for ShanHu.
Ijms 26 05036 g0a3
Figure A4. Comparison of gene structure components between ShanHu and annotated species.
Figure A4. Comparison of gene structure components between ShanHu and annotated species.
Ijms 26 05036 g0a4
Figure A5. Venn diagram of gene structure annotation for ShanHu based on de novo prediction methods, homology prediction methods, and transcriptome data.
Figure A5. Venn diagram of gene structure annotation for ShanHu based on de novo prediction methods, homology prediction methods, and transcriptome data.
Ijms 26 05036 g0a5
Figure A6. Venn diagram of gene function annotation statistics for ShanHu based on known protein databases.
Figure A6. Venn diagram of gene function annotation statistics for ShanHu based on known protein databases.
Ijms 26 05036 g0a6
Figure A7. Statistical results for Epinephelus gene family.
Figure A7. Statistical results for Epinephelus gene family.
Ijms 26 05036 g0a7
Figure A8. Venn diagram of Epinephelus gene family clustering.
Figure A8. Venn diagram of Epinephelus gene family clustering.
Ijms 26 05036 g0a8
Figure A9. Whole genome alignment between ShanHu and E. fuscoguttatus. The vertical axis represents the E. fuscoguttatus and the horizontal axis represents the ShanHu.
Figure A9. Whole genome alignment between ShanHu and E. fuscoguttatus. The vertical axis represents the E. fuscoguttatus and the horizontal axis represents the ShanHu.
Ijms 26 05036 g0a9
Figure A10. Whole genome alignment between ShanHu and E. polyphekadion. The vertical axis represents the E. polyphekadion and the horizontal axis represents the ShanHu.
Figure A10. Whole genome alignment between ShanHu and E. polyphekadion. The vertical axis represents the E. polyphekadion and the horizontal axis represents the ShanHu.
Ijms 26 05036 g0a10

References

  1. Zhuang, X.; Qu, M.; Zhang, X.; Ding, S. A Comprehensive Description and Evolutionary Analysis of 22 Grouper (Perciformes, Epinephelidae) Mitochondrial Genomes with Emphasis on Two Novel Genome Organizations. PLoS ONE 2013, 8, e73561. [Google Scholar] [CrossRef] [PubMed]
  2. Rimmer, M.A.; Glamuzina, B. A Review of Grouper (Family Serranidae: Subfamily Epinephelinae) Aquaculture from a Sustainability Science Perspective. Rev. Aquac. 2019, 11, 58–87. [Google Scholar] [CrossRef]
  3. Agriculture and Rural Affairs Department. China Fishery Statistical Yearbook. In 2–2 Aquaculture; China Agriculture Press: Beijing, China, 2023; p. 22. [Google Scholar]
  4. Hennersdorf, P.; Mrotzek, G.; Abdul-Aziz, M.A.; Saluz, H.P. Metagenomic Analysis between Free-Living and Cultured Epinephelus fuscoguttatus under Different Environmental Conditions in Indonesian Waters. Mar. Pollut. Bull. 2016, 110, 726–734. [Google Scholar] [CrossRef]
  5. Zhou, Q.; Gao, H.; Zhang, Y.; Fan, G.; Xu, H.; Zhai, J.; Xu, W.; Chen, Z.; Zhang, H.; Liu, S.; et al. A Chromosome-level Genome Assembly of the Giant Grouper (Epinephelus lanceolatus) Provides Insights into Its Innate Immunity and Rapid Growth. Mol. Ecol. Resour. 2019, 19, 1322–1332. [Google Scholar] [CrossRef]
  6. Ge, H.; Lin, K.; Shen, M.; Wu, S.; Wang, Y.; Zhang, Z.; Wang, Z.; Zhang, Y.; Huang, Z.; Zhou, C.; et al. De Novo Assembly of a Chromosome-level Reference Genome of Red-spotted Grouper (Epinephelus akaara) Using Nanopore Sequencing and Hi-C. Mol. Ecol. Resour. 2019, 19, 1461–1469. [Google Scholar] [CrossRef]
  7. Zhou, Q.; Guo, X.; Huang, Y.; Gao, H.; Xu, H.; Liu, S.; Zheng, W.; Zhang, T.; Tian, C.; Zhu, C.; et al. De Novo Sequencing and Chromosomal-scale Genome Assembly of Leopard Coral Grouper, Plectropomus leopardus. Mol. Ecol. Resour. 2020, 20, 1403–1413. [Google Scholar] [CrossRef]
  8. Wang, L.; Li, Z.; Liu, Y.; Chen, S.; Li, L.; Duan, P.; Wang, X.; Li, W.; Wang, Q.; Zhai, J.; et al. A Chromosome-Level Genome Assembly of the Potato Grouper (Epinephelus tukula). Genomics 2022, 114, 110473. [Google Scholar] [CrossRef]
  9. Yang, Y.; Wu, L.; Weng, Z.; Wu, X.; Wang, X.; Xia, J.; Meng, Z.; Liu, X. Chromosome Genome Assembly of Cromileptes altivelis Reveals Loss of Genome Fragment in Cromileptes Compared with Epinephelus Species. Genes 2021, 12, 1873. [Google Scholar] [CrossRef]
  10. Cao, X.; Zhang, J.; Deng, S.; Ding, S. Chromosome-Level Genome Assembly of the Speckled Blue Grouper (Epinephelus cyanopodus) Provides Insight into Its Adaptive Evolution. Biology 2022, 11, 1810. [Google Scholar] [CrossRef]
  11. Zhou, Q.; Gao, H.; Xu, H.; Lin, H.; Chen, S. A Chromosomal-Scale Reference Genome of the Kelp Grouper Epinephelus moara. Mar. Biotechnol. 2021, 23, 12–16. [Google Scholar] [CrossRef]
  12. Li, H.; Liu, Y.; Mao, M.; Mao, Y. Chromosome-Level Genome Assembly and Annotation of the Camouflage Grouper (Epinephelus polyphekadion). Sci. Data 2024, 11, 1353. [Google Scholar] [CrossRef]
  13. Sun, Y.; Guo, C.-Y.; Wang, D.-D.; Li, X.F.; Xiao, L.; Zhang, X.; You, X.; Shi, Q.; Hu, G.-J.; Fang, C.; et al. Transcriptome Analysis Reveals the Molecular Mechanisms Underlying Growth Superiority in a Novel Grouper Hybrid (Epinephelus fuscogutatus♀ × E. lanceolatus♂). BMC Genet. 2016, 17, 24. [Google Scholar] [CrossRef] [PubMed]
  14. Rimmer, M.A.; Thampisamraj, Y.C.; Jayagopal, P.; Thineshsanthar, D.; Damodar, P.N.; Toledo, J.D. Spawning of Tiger Grouper Epinephelus fuscoguttatus and Squaretail Coralgrouper Plectropomus areolatus in Sea Cages and Onshore Tanks in Andaman and Nicobar Islands, India. Aquaculture 2013, 410–411, 197–202. [Google Scholar] [CrossRef]
  15. Cheng, M.; Tian, Y.; Li, Z.; Wang, L.; Wu, Y.; Zhang, J.; Pang, Z.; Ma, W.; Zhai, J. The Complete Mitochondrial Genome of the Hybrid Offspring Epinephelus fuscoguttatus ♀ × Epinephelus tukula ♂. Mitochondrial DNA Part B 2019, 4, 2717–2718. [Google Scholar] [CrossRef]
  16. Cao, L.; Chen, P.; Hou, X.; Ma, J.; Yang, N.; Xu, Y.; Zhang, Y.; Zhao, A.; Zhang, J.; Li, X.; et al. Genetic Characteristics and Growth Patterns of the Hybrid Grouper Derived from the Hybridization of Epinephelus fuscoguttatus (Female) × Epinephelus polyphekadion (Male). J. Fish Biol. 2023, 102, 328–339. [Google Scholar] [CrossRef]
  17. Huang, Y.S.; Li, Z.B.; Ning, Y.F.; Shangguan, J.B.; Yuan, Y.; Mao, X.Q.; Li, B.B. Isolation and Characterization of Microsatellite Loci in Hybrid Giant Tiger Grouper. Genet. Mol. Res. 2015, 14, 14706–14710. [Google Scholar] [CrossRef]
  18. Ou, G.; Xie, R.; Huang, J.; Huang, J.; Wen, Z.; Li, Y.; Jiang, X.; Ma, Q.; Chen, G. Effects of Dietary Alpha-Lipoic Acid on Growth Performance, Serum Biochemical Indexes, Liver Antioxidant Capacity and Transcriptome of Juvenile Hybrid Grouper (Epinephelus fuscoguttatus♀ × Epinephelus polyphekadion♂). Animals 2023, 13, 887. [Google Scholar] [CrossRef]
  19. Amenyogbe, E.; Chen, G.; Wang, Z. Identification, Characterization, and Expressions Profile Analysis of Growth Hormone Receptors (GHR1 and GHR2) in Hybrid Grouper (Epinephelus fuscoguttatus ♀ × Epinephelus polyphekadion ♂). Genomics 2020, 112, 1–9. [Google Scholar] [CrossRef]
  20. Quail, M.; Smith, M.E.; Coupland, P.; Otto, T.D.; Harris, S.R.; Connor, T.R.; Bertoni, A.; Swerdlow, H.P.; Gu, Y. A Tale of Three next Generation Sequencing Platforms: Comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq Sequencers. BMC Genom. 2012, 13, 341. [Google Scholar] [CrossRef]
  21. Cao, L.; Ma, J.; Chen, P.; Hou, X.; Yang, N.; Lu, Y.; Huang, H. Exploring the Influence of DNA Methylation and Single Nucleotide Polymorphisms of the Myostatin Gene on Growth Traits in the Hybrid Grouper (Epinephelus fuscoguttatus (Female) × Epinephelus polyphekadion (Male)). Front. Genet. 2024, 14, 1277647. [Google Scholar] [CrossRef]
  22. Nugent, C.M.; Kess, T.; Brachmann, M.K.; Langille, B.L.; Duffy, S.J.; Lehnert, S.J.; Wringe, B.F.; Bentzen, P.; Bradbury, I.R. Whole-genome Sequencing Reveals Fine-scale Environment-associated Divergence near the Range Limits of a Temperate Reef Fish. Mol. Ecol. 2023, 32, 4742–4762. [Google Scholar] [CrossRef]
  23. Elgvin, T.O.; Trier, C.N.; Tørresen, O.K.; Hagen, I.J.; Lien, S.; Nederbragt, A.J.; Ravinet, M.; Jensen, H.; Sætre, G.-P. The Genomic Mosaicism of Hybrid Speciation. Sci. Adv. 2017, 3, e1602996. [Google Scholar] [CrossRef]
  24. The Heliconius Genome Consortium. Butterfly Genome Reveals Promiscuous Exchange of Mimicry Adaptations among Species. Nature 2012, 487, 94–98. [Google Scholar] [CrossRef]
  25. Rao, S.S.P.; Huntley, M.H.; Durand, N.C.; Stamenova, E.K.; Bochkov, I.D.; Robinson, J.T.; Sanborn, A.L.; Machol, I.; Omer, A.D.; Lander, E.S.; et al. A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping. Cell 2014, 159, 1665–1680. [Google Scholar] [CrossRef]
  26. Marçais, G.; Kingsford, C. A Fast, Lock-Free Approach for Efficient Parallel Counting of Occurrences of k -Mers. Bioinformatics 2011, 27, 764–770. [Google Scholar] [CrossRef]
  27. Luo, R.; Liu, B.; Xie, Y.; Li, Z.; Huang, W.; Yuan, J.; He, G.; Chen, Y.; Pan, Q.; Liu, Y.; et al. SOAPdenovo2: An Empirically Improved Memory-Efficient Short-Read De novo Assembler. Gigascience 2012, 1, 18. [Google Scholar] [CrossRef]
  28. Burton, J.N.; Adey, A.; Patwardhan, R.P.; Qiu, R.; Kitzman, J.O.; Shendure, J. Chromosome-Scale Scaffolding of de Novo Genome Assemblies Based on Chromatin Interactions. Nat. Biotechnol. 2013, 31, 1119–1125. [Google Scholar] [CrossRef] [PubMed]
  29. Waterhouse, R.M.; Seppey, M.; Simão, F.A.; Manni, M.; Ioannidis, P.; Klioutchnikov, G.; Kriventseva, E.V.; Zdobnov, E.M. BUSCO Applications from Quality Assessments to Gene Prediction and Phylogenomics. Mol. Biol. Evol. 2018, 35, 543–548. [Google Scholar] [CrossRef]
  30. Belton, J.-M.; McCord, R.P.; Gibcus, J.H.; Naumova, N.; Zhan, Y.; Dekker, J. Hi–C: A Comprehensive Technique to Capture the Conformation of Genomes. Methods 2012, 58, 268–276. [Google Scholar] [CrossRef]
  31. Zhang, X.; Zhang, S.; Zhao, Q.; Ming, R.; Tang, H. Assembly of Allele-Aware, Chromosomal-Scale Autopolyploid Genomes Based on Hi-C Data. Nat. Plants 2019, 5, 833–845. [Google Scholar] [CrossRef]
  32. Bao, W.; Kojima, K.K.; Kohany, O. Repbase Update, a Database of Repetitive Elements in Eukaryotic Genomes. Mob. DNA 2015, 6, 11. [Google Scholar] [CrossRef] [PubMed]
  33. Xu, Z.; Wang, H. LTR_FINDER: An Efficient Tool for the Prediction of Full-Length LTR Retrotransposons. Nucleic Acids Res. 2007, 35, W265–W268. [Google Scholar] [CrossRef]
  34. Abrusán, G.; Grundmann, N.; DeMester, L.; Makalowski, W. TEclass—A Tool for Automated Classification of Unknown Eukaryotic Transposable Elements. Bioinformatics 2009, 25, 1329–1330. [Google Scholar] [CrossRef]
  35. Price, A.L.; Jones, N.C.; Pevzner, P.A. De Novo Identification of Repeat Families in Large Genomes. Bioinformatics 2005, 21 (Suppl. 1), i351–i358. [Google Scholar] [CrossRef]
  36. Nawrocki, E.P.; Eddy, S.R. Infernal 1.1: 100-Fold Faster RNA Homology Searches. Bioinformatics 2013, 29, 2933–2935. [Google Scholar] [CrossRef]
  37. Chan, P.P.; Lin, B.Y.; Mak, A.J.; Lowe, T.M. tRNAscan-SE 2.0: Improved Detection and Functional Classification of Transfer RNA Genes. Nucleic Acids Res. 2021, 49, 9077–9096. [Google Scholar] [CrossRef]
  38. Stanke, M.; Keller, O.; Gunduz, I.; Hayes, A.; Waack, S.; Morgenstern, B. AUGUSTUS: Ab Initio Prediction of Alternative Transcripts. Nucleic Acids Res. 2006, 34, W435–W439. [Google Scholar] [CrossRef]
  39. Majoros, W.H.; Pertea, M.; Salzberg, S.L. TigrScan and GlimmerHMM: Two Open Source Ab initio Eukaryotic Gene-Finders. Bioinformatics 2004, 20, 2878–2879. [Google Scholar] [CrossRef]
  40. Burge, C.; Karlin, S. Prediction of Complete Gene Structures in Human Genomic DNA. J. Mol. Biol. 1997, 268, 78–94. [Google Scholar] [CrossRef]
  41. Yu, X.-J.; Zheng, H.-K.; Wang, J.; Wang, W.; Su, B. Detecting Lineage-Specific Adaptive Evolution of Brain-Expressed Genes in Human Using Rhesus Macaque as Outgroup. Genomics 2006, 88, 745–751. [Google Scholar] [CrossRef]
  42. Birney, E.; Clamp, M.; Durbin, R. GeneWise and Genomewise. Genome Res. 2004, 14, 988–995. [Google Scholar] [CrossRef] [PubMed]
  43. Grabherr, M.G.; Haas, B.J.; Yassour, M.; Levin, J.Z.; Thompson, D.A.; Amit, I.; Adiconis, X.; Fan, L.; Raychowdhury, R.; Zeng, Q.; et al. Full-Length Transcriptome Assembly from RNA-Seq Data without a Reference Genome. Nat. Biotechnol. 2011, 29, 644–652. [Google Scholar] [CrossRef] [PubMed]
  44. Haas, B.J.; Salzberg, S.L.; Zhu, W.; Pertea, M.; Allen, J.E.; Orvis, J.; White, O.; Buell, C.R.; Wortman, J.R. Automated Eukaryotic Gene Structure Annotation Using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 2008, 9, R7. [Google Scholar] [CrossRef] [PubMed]
  45. Trapnell, C.; Roberts, A.; Goff, L.; Pertea, G.; Kim, D.; Kelley, D.R.; Pimentel, H.; Salzberg, S.L.; Rinn, J.L.; Pachter, L. Differential Gene and Transcript Expression Analysis of RNA-Seq Experiments with TopHat and Cufflinks. Nat. Protoc. 2012, 7, 562–578. [Google Scholar] [CrossRef]
  46. Apweiler, R. UniProt: The Universal Protein Knowledgebase. Nucleic Acids Res. 2004, 32, D115–D119. [Google Scholar] [CrossRef]
  47. Finn, R.D.; Attwood, T.K.; Babbitt, P.C.; Bateman, A.; Bork, P.; Bridge, A.J.; Chang, H.-Y.; Dosztányi, Z.; El-Gebali, S.; Fraser, M.; et al. InterPro in 2017—Beyond Protein Family and Domain Annotations. Nucleic Acids Res. 2017, 45, D190–D199. [Google Scholar] [CrossRef]
  48. Kanehisa, M.; Sato, Y.; Kawashima, M.; Furumichi, M.; Tanabe, M. KEGG as a Reference Resource for Gene and Protein Annotation. Nucleic Acids Res. 2016, 44, D457–D462. [Google Scholar] [CrossRef]
  49. Li, L.; Stoeckert, C.J.; Roos, D.S. OrthoMCL: Identification of Ortholog Groups for Eukaryotic Genomes. Genome Res. 2003, 13, 2178–2189. [Google Scholar] [CrossRef]
  50. Li, H. TreeFam: A Curated Database of Phylogenetic Trees of Animal Gene Families. Nucleic Acids Res. 2006, 34, D572–D580. [Google Scholar] [CrossRef]
  51. Edgar, R.C. MUSCLE: Multiple Sequence Alignment with High Accuracy and High Throughput. Nucleic Acids Res. 2004, 32, 1792–1797. [Google Scholar] [CrossRef]
  52. Darriba, D.; Taboada, G.L.; Doallo, R.; Posada, D. ProtTest 3: Fast Selection of Best-Fit Models of Protein Evolution. Bioinformatics 2011, 27, 1164–1165. [Google Scholar] [CrossRef] [PubMed]
  53. Benton, M.J.; Donoghue, P.C.J. Paleontological Evidence to Date the Tree of Life. Mol. Biol. Evol. 2006, 24, 26–53. [Google Scholar] [CrossRef] [PubMed]
  54. Wang, Y.; Tang, H.; DeBarry, J.D.; Tan, X.; Li, J.; Wang, X.; Lee, T.-h.; Jin, H.; Marler, B.; Guo, H.; et al. MCScanX: A Toolkit for Detection and Evolutionary Analysis of Gene Synteny and Collinearity. Nucleic Acids Res. 2012, 40, e49. [Google Scholar] [CrossRef]
  55. De Bie, T.; Cristianini, N.; Demuth, J.P.; Hahn, M.W. CAFE: A Computational Tool for the Study of Gene Family Evolution. Bioinformatics 2006, 22, 1269–1271. [Google Scholar] [CrossRef]
  56. Szklarczyk, D.; Gable, A.L.; Lyon, D.; Junge, A.; Wyder, S.; Huerta-Cepas, J.; Simonovic, M.; Doncheva, N.T.; Morris, J.H.; Bork, P.; et al. STRING V11: Protein–Protein Association Networks with Increased Coverage, Supporting Functional Discovery in Genome-Wide Experimental Datasets. Nucleic Acids Res. 2019, 47, D607–D613. [Google Scholar] [CrossRef]
  57. Chin, C.-H.; Chen, S.-H.; Wu, H.-H.; Ho, C.-W.; Ko, M.-T.; Lin, C.-Y. cytoHubba: Identifying Hub Objects and Sub-Networks from Complex Interactome. BMC Syst. Biol. 2014, 8, S11. [Google Scholar] [CrossRef]
Figure 1. Hi-C contact map of the ShanHu genome. The color bar indicates contact density from yellow (low) to red (high).
Figure 1. Hi-C contact map of the ShanHu genome. The color bar indicates contact density from yellow (low) to red (high).
Ijms 26 05036 g001
Figure 2. Genetic information characteristic diagram of the ShanHu genome. From the outer circle to the inner circle: 1 represents chromosomes of the ShanHu genome; 2 GC content; 3 gene density (yellow); 4 genes on the forward strand (red); 5 genes on the reverse strand (green); 6 non-coding RNA on the forward strand (blue); 7 non-coding RNA on the reverse strand (purple); 8 repeat sequence content (orange).
Figure 2. Genetic information characteristic diagram of the ShanHu genome. From the outer circle to the inner circle: 1 represents chromosomes of the ShanHu genome; 2 GC content; 3 gene density (yellow); 4 genes on the forward strand (red); 5 genes on the reverse strand (green); 6 non-coding RNA on the forward strand (blue); 7 non-coding RNA on the reverse strand (purple); 8 repeat sequence content (orange).
Ijms 26 05036 g002
Figure 3. Phylogenetic tree of economically important Epinephelus. The numbers at the nodes represent the divergence time of the grouper ancestors in millions of years, with the numbers in parentheses indicating the confidence range of that divergence time.
Figure 3. Phylogenetic tree of economically important Epinephelus. The numbers at the nodes represent the divergence time of the grouper ancestors in millions of years, with the numbers in parentheses indicating the confidence range of that divergence time.
Ijms 26 05036 g003
Figure 4. Collinearity comparison between the chromosomes of ShanHu, E. fuscoguttatus, and E. polyphekadion.
Figure 4. Collinearity comparison between the chromosomes of ShanHu, E. fuscoguttatus, and E. polyphekadion.
Ijms 26 05036 g004
Figure 5. Enrichment of GO annotations with the expanded gene families in ShanHu (p < 0.05).
Figure 5. Enrichment of GO annotations with the expanded gene families in ShanHu (p < 0.05).
Ijms 26 05036 g005
Figure 6. (a) Protein–protein interaction network analysis of expanded genes. (b) Identification of hub genes in terms of maximal clique centrality (MCC) score.
Figure 6. (a) Protein–protein interaction network analysis of expanded genes. (b) Identification of hub genes in terms of maximal clique centrality (MCC) score.
Ijms 26 05036 g006
Table 1. Genome assembly statistics for the hybrid ShanHu grouper.
Table 1. Genome assembly statistics for the hybrid ShanHu grouper.
TypeE fuscoguttatusE. polyphekadionShanHu
Total length of Contigs (bases)1,082,718,1031,099,728,8021,176,882,344
Number of Contigs396164171
Max length Contig (bases)52,327,45653,203,86758,196,232
Contig N50 (bases)37,367,96440,804,24045,365,500
Contig N90 (bases)9,232,28512,950,42815,571,449
Total length of Scaffolds (bases)1,082,726,7031,099,733,6021,176,892,644
Number of Scaffolds31011668
Scaffold N50 (bases)44,785,04847,393,75246,120,958
Scaffold N90 (bases)39,728,48039,964,76024,826,122
Table 2. Statistics of gene predictions in the hybrid ShanHu genome.
Table 2. Statistics of gene predictions in the hybrid ShanHu genome.
TypeGene SetNumberAverage
Transcript Length (bp)
Average
CDS Length (bp)
Average Exons Per GeneAverage Exon
Length (bp)
Average
Intron Length (bp)
De novoAugustus39,87810,427.621197.886.60181.451647.61
SNAP41,33639,332.491185.358.12145.915355.06
HomologE.fuscoguttatus25,12317,952.511712.519.69176.791869.54
E.lanceolatus24,33617,919.251698.199.68175.491869.42
E.moara24,92717,676.761694.959.63176.051852.45
T. rubripes21,74017,560.211688.499.49177.941869.64
G. aculeatus22,29617,691.721702.109.56178.121868.87
D.rerio22,0915,746.661541.848.50181.391893.95
RNA-seqTranscripts106,294100.74898.002.50151.121749.27
PASA21,7237242.56715.034.73177.731876.96
EVMEVM35,88713,394.271320.967.43177.731876.96
Pasaupdatepasaupdate35,73913,629.091326.577.47177.641902.11
Final setFinal26,10217,514.951626.809.43172.601885.81
Table 3. Annotation results of repetitive sequences in ShanHu genome.
Table 3. Annotation results of repetitive sequences in ShanHu genome.
TypeNumberLength (bp)Percentage (%)
SINEs11,410905,3880.08
LINEs328,95791,778,6267.80
LTRs563,312103,517,6348.80
DNA1,733,660316,822,57826.92
Unknown106,73916,787,6221.43
Total 552,112,95146.91
Table 4. Annotation results for non-coding RNA in ShanHu genome.
Table 4. Annotation results for non-coding RNA in ShanHu genome.
Type Copy (w)Total Length (bp)% in Genome
miRNA 3438471,9170.040%
tRNA 2415180,8260.015%
rRNArRNA41871,069,9670.091%
18s445279,0490.024%
28s1058474,9920.040%
5.8s14522,4580.002%
5s2539293,4680.025%
snRNASnRNA836121,1440.010%
CD-box17119,6700.002%
HACA-box9514,8420.001%
splicing50474,7080.006%
scaRNA6511,8670.001%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, Y.; Mao, Y. Chromosome-Level Genome Assembly and Genomic Analysis of the Hybrid Grouper ShanHu (Epinephelus fuscoguttatus ♀ × Epinephelus polyphekadion ♂). Int. J. Mol. Sci. 2025, 26, 5036. https://doi.org/10.3390/ijms26115036

AMA Style

Liu Y, Mao Y. Chromosome-Level Genome Assembly and Genomic Analysis of the Hybrid Grouper ShanHu (Epinephelus fuscoguttatus ♀ × Epinephelus polyphekadion ♂). International Journal of Molecular Sciences. 2025; 26(11):5036. https://doi.org/10.3390/ijms26115036

Chicago/Turabian Style

Liu, Yiqun, and Yunxiang Mao. 2025. "Chromosome-Level Genome Assembly and Genomic Analysis of the Hybrid Grouper ShanHu (Epinephelus fuscoguttatus ♀ × Epinephelus polyphekadion ♂)" International Journal of Molecular Sciences 26, no. 11: 5036. https://doi.org/10.3390/ijms26115036

APA Style

Liu, Y., & Mao, Y. (2025). Chromosome-Level Genome Assembly and Genomic Analysis of the Hybrid Grouper ShanHu (Epinephelus fuscoguttatus ♀ × Epinephelus polyphekadion ♂). International Journal of Molecular Sciences, 26(11), 5036. https://doi.org/10.3390/ijms26115036

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop