Next Article in Journal
Morinda citrifolia Essential Oil: A Plant Resistance Biostimulant and a Sustainable Alternative for Controlling Phytopathogens and Insect Pests
Previous Article in Journal
Are Hyperglycemia-Induced Changes in the Retina Associated with Diabetes-Correlated Changes in the Brain? A Review from Zebrafish and Rodent Type 2 Diabetes Models
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Chromosome-Scale Genome of Chitala ornata Illuminates the Evolution of Early Teleosts

1
College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
2
BGI-Qingdao, BGI-Shenzhen, Qingdao 266555, China
3
BGI-Shenzhen, Shenzhen 518083, China
*
Authors to whom correspondence should be addressed.
Biology 2024, 13(7), 478; https://doi.org/10.3390/biology13070478
Submission received: 29 May 2024 / Revised: 20 June 2024 / Accepted: 25 June 2024 / Published: 27 June 2024
(This article belongs to the Section Genetics and Genomics)

Abstract

:

Simple Summary

As the most diverse vertebrate group, the unique adaptive expansion of euryhaline fishes is critical to understanding vertebrate evolution. In particular, the high degree of consistency of unique paired appendage structures across the extremely morphologically diverse group of teleost fishes has become a fascinating scientific question. Early teleost fishes provide a critical window into the study of this large taxon. Therefore, this study constructs high-quality chromosome-level genomes of Osteoglossiformes (Chitala ornata). It also explores the genomic features of early teleost fishes and traces the unique genetic basis of pectoral fin evolution in teleost fishes at the molecular level, which provides an important basis for understanding the evolution of the origin of early teleosts.

Abstract

Teleosts are the most prolific vertebrates, occupying the vast majority of aquatic environments, and their pectoral fins have undergone remarkable physiological transformations throughout their evolution. Studying early teleost fishes, such as those belonging to the Osteoglossiformes order, could offer crucial insights into the adaptive evolution of pectoral fins within this group. In this study, we have assembled a chromosomal-level genome for the Clown featherback (Chitala ornata), achieving the highest quality genome assembly for Osteoglossiformes to date, with a contig N50 of 32.78 Mb and a scaffold N50 of 40.73 Mb. By combining phylogenetic analysis, we determined that the Clown featherback diverged approximately 202 to 203 million years ago (Ma), aligning with continental separation events. Our analysis revealed the intriguing discovery that a unique deletion of regulatory elements is adjacent to the Gli3 gene, specifically in teleosts. This deletion might be tied to the specialized adaptation of their pectoral fins. Furthermore, our findings indicate that specific contractions and expansions of transposable elements (TEs) in teleosts, including the Clown featherback, could be connected to their adaptive evolution. In essence, this study not only provides a high-quality genomic resource for Osteoglossiformes but also sheds light on the evolutionary trajectory of early teleosts.

Graphical Abstract

1. Introduction

Teleost fishes are currently the most diverse group of vertebrates on Earth, with a total of more than 30,000 species. Osteoglossiformes, an early teleost group encompassing Arapaimidae, Gymnarchidae, Osteoglossidae, Pantodontidae, Notopteridae, and Mormyridae, is of particular interest. This group dates to the Jurassic Period, making it a valuable link to the past. Recent research indicates that Elopomorpha and Osteoglossomorpha form a sister clade to all other teleosts [1]. Consequently, studying the genetic basis of physiological traits in Osteoglossiformes is important for exploring the macroevolutionary history of teleosts.
The pectoral fin, a crucial locomotive organ in fishes, traces its origins back to the chondrichthyans [2]. However, following the divergence of Sarcopterygii and Actinopterygii, Sarcopterygii’s plesiomorphy [3,4]. In contrast, the early ray-finned fishes (Actinopterygii) retained numerous skeletal features, including the humerus [5,6]. With the emergence of teleost fishes, the pectoral fin underwent further specialization, abandoning its original accessory structures and evolving into a distinct pectoral fin form [7].
The Clown featherback, a species belonging to the Notopteridae family [8], is predominantly found in Asia, particularly Southeast Asia. Due to the lack of a high-quality genome for the Clown featherback, studying the genetic evolution of this species has been challenging. In this study, we de novo assembled the first chromosome-level genome of the Clown featherback. By combining this genome with published genomic data from other teleosts, we aim to comprehensively detail the genomic features of the early teleosts. Furthermore, we seek to unravel clues pertaining to the evolution of teleost pectoral fins.

2. Materials and Methods

2.1. Library Construction and Sequencing

The adult specimens of C. ornata were bought from an ornamental fish market in Qingdao, China, and their muscle tissues were immediately stored in liquid nitrogen. Then, the genomic DNA was extracted from the muscles of the fishes using NucleoBond HMW DNA KIT (MACHEREYNAGEL, Dueren, Germany). The Agilent 4200 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA) was used to determine the integrity of the DNA. To construct the PacBio high-fidelity circular consensus sequencing (HiFi-CCS) library, eight micrograms of genomic DNA were sheared and concentrated with AMPure PB magnetic beads (Pacific Biosciences, Menlo Park, CA, USA). Using Pacific Biosciences SMRTbell Template Prep Kit v1.0, each SMRT bell library was constructed. The constructed library was selected by Sage ELF for molecules 11–15 kb in size, followed by primer annealing and binding of SMRT bell templates to polymerases with the DNA Polymerase Binding Kit (Pacific Biosciences, Menlo Park, CA, USA). For HiFi-CCS data assembly, we assembled them using hifiasm version 0.9 and converted them to fasta genome files using gfatools. Sequencing was performed on the Pacific Bioscience Sequel II for 30 h by Annoroad Gene Technology, Beijing, China. We sequenced 43.44 Gb long reads (~50× coverage, N50 read length 18.13 Kb) using HiFi. For Hi-C library sequencing, approximately 1 g living muscle tissue was utilized for DNA extraction and library contraction, according to Wang’s method [9]. Sequencing was performed on a BGISEQ-500 sequencer (BGI, Shenzhen, China), generating 102 Gb of clean Hi-C data.

2.2. Genome Assessment

For HiFi-CCS data, we assembled them using hifiasm [10] software and converted them to fasta genome files using gfatools. The final genome size was 837,258,286 bp, and Contig N50 was 32.78 Mb. To generate a chromosomal-level genome assembly of C. ornata, high-quality Hi-C data were used for further assembly. Firstly, we used HiC-Pro software (version 2.8.0_devel) [11] with default parameters to obtain valid sequencing data. Then, Juicer (version 1.5), an open-source tool for analyzing Hi-C datasets [12], and the 3D de novo assembly pipeline were used to connect the contigs to chromosomes. To evaluate the genome assembly of C. ornata, we aligned sequencing data filtered previously using SOAPaligner (version 2.2) [13]. We also calculated its GC depth to rule out possible biases during sequencing or possible contaminations. Then, the genome completeness was estimated with Benchmarking Universal Single-copy Orthologs (BUSCO, version 3.0.1).

2.3. Genome Annotation

For the newly assembled genome, genome annotation was carried out to seek protein-coding genes in two ways: (a) the ab initio gene prediction and (b) the homology-based annotation. For ab initio gene prediction approaches, Augustus [14] and GlimmerHMM [15] were used with Danio rerio as the species of HMM model to predict gene models. For homology-based annotation, six species, including Scleropages formosus, Baufortia kweichowensis, Danio rerio, Oryzias latipes, Takifugu rubripes, and Gasterosteus aculeatus, were aligned against the genome assembly using BLAT software (version 0.36) [16] and GeneWise software (version 2.4.1) [17]. Protein-coding genes were obtained by combining the different evidences using Glean software (version 1.0) [18]. In the final gene models, the average length of 28,404 genes was 9244.86 bp. The average length of coding sequences, exons, and introns was 1347.22 bp, 209.59 bp, and 1455.04 bp. To better understand the evolutionary dynamics of genes, gene family expansion and contraction analysis was performed using Cafe (v3.1) software [19].

2.4. Identification of Repetitive Sequences

To ensure the comparability of repeat element annotation between different species, we used two approaches to identify repeat elements in the genome: homolog-based prediction and de novo prediction. For homology-based approaches, different types of transposable element sequences in Repbase (version 16.02) [20] were aligned against the assembly using RepeatMasker (version 3.3.0) [21] with parameters “-q -nolow -no_is -norna -engine ncbi”. We used Tandem Repeats Finder (v4.07) to find tandem repeats. RepeatModeler [22] and LTR-Finder (version 1.0.6) with parameters “-C -w 2” and danRer7-tRNAs.fa as tRNA database [23] were used to perform de novo prediction of repeat sequences, and the results were combined as the library for RepeatMasker to identify and classify repeat elements.

2.5. Construct Whole Genome Alignments (WGAs)

We used the C. ornata genome as a reference, and aligned it with the genomes of 11 other species (Erpetoichthys calabaricus, Amia calva, Lepisosteus oculatus, Scleropages formosus, Esox lucius, Arapaima gigas, Anguilla anguilla, Oryzias latipes, Danio rerio, Megalops atlanticus, and Gasterosteus aculeatuss) using LastZ version 1.1 [24] with the following parameters: H = 2000, Y = 9400, L = 3000, and K = 3000. All genomes were masked softly before alignment.

2.6. Phylogeny Reconstruction

The resulting LastZ result files were combined into a single multiple genome alignment by MultiZ [25], from which we then obtained a total of 1,909,881 bp with no gap sites across all species. We used alignment blocks including all 12 species to construct phylogenetic trees by Raxml [26] using GTRGAMMA mode with 100 bootstrap replicates.

2.7. Analysis of Conserved Elements

We intercepted all sequences of the Gli3 gene in the upstream and downstream 1000 bp range of the human genome (GRCh38.p14), totaling 278,260 bp sequences (Chr7: 41,959,949–42,238,209). We used two different LastZ comparison models. Close alignment was used in lobe-finned fishes (Latimeria chalumnae, Xenopus laevis, and Mus musculus), and distance alignment was used in cartilaginous and ray-finned fishes (Callorhinchus milii, Erpetoichthys calabaricus, Polypterus senegalus, Acipenser ruthenus, Polyodon spathula, Atractosteus spatula, Lepisosteus oculatus, Amia calva, C. ornata, Megalops atlanticus, Danio rerio, Esox lucius, Oryzias latipes, and Takifugu rubripe). The distant alignment parameters are H = 2000, Y = 3400, L = 6000, and K = 2200, and the close alignments parameters are H = 2000, Y = 9400, L = 3000, and K = 3000 [7]. The alignments of CNEs referred to in the manuscript were manually checked and plotted with VISTA (v1.4.26) [27]. The functional analysis of CNE comes from the website (https://www.encodeproject.org/, accessed on 1 April 2023.).

3. Results and Discussion

3.1. Genome Assembly and Annotation of a Chromosome-Level C. ornata

To sequence and assemble the C. ornata genome, a total of 43.44 Gb (∼50×) PacBio HiFi CCS data were used to assemble 837.26 Mb genomic sequences, containing 126 contigs with a contig N50 of 32.78 Mb and a GC content of 41.92% (Table 1). To anchor the contig sequences to chromosomes, we constructed a Hi-C library and sequenced ∼102 Gb of Hi-C data. About 794 Mb sequences (94.78% of contig-level assembly) were anchored to 21 chromosomes (Figure 1a and Table 1), which was consistent with the previous report on the C. ornata karyotype [28]. Finally, by using BUSCO (Benchmarking Universal Single-Copy Orthologs), we found that ∼96.4% of the complete vertebrate BUSCO genes were covered by our assembly (Table 1), providing further evidence for the fine quality of the assembled genome. Compared to previously published fish genomes of Osteoglossiformes, we found that the presently published C. ornata genome is the highest quality genome to date (Figure 1b and Table S1).
We predicted protein-coding genes with combinational annotation methods (de novo prediction and homology-based prediction) in this genome. In the final gene models, the average length was 9244.86 bp, with an average of six exons. The average length of coding sequences, exons, and introns was 1347.22 bp, 209.59 bp, and 1455.04 bp. We identified 129 contracted gene families and 110 expanded gene families (p < 0.05) in C. ornata (Figure S1a). We further enriched the functions of these expanded genes, which were related to immune, redox responses (Figure S1b and Table S3), suggesting that evolutionary changes in these gene families were related to adaptations in the early survival environment of C. ornata.

3.2. Phylogenetic and Divergence Time Analysis of Early Teleosts

There are many different hypotheses about the phylogenetic relationships of teleost fishes, especially between Osteoglossomorpha, Elopomorpha, and Clupeocephala. The relationship between Osteoglossomorpha, Elopomorpha, and Clupeocephala has been supported by large-scale transcriptomic data [29], but recent studies have shown that Osteoglossomorpha + Elopomorpha acted as a sister group to Clupeocephala to form the Eloposteoglossocephala, which is consistent with the findings of the Asian arowana [30]. Whole-genome comparison data provides invaluable insights and serves as solid evidence for determining accurate phylogenetic relationships among species [31,32,33,34]. In this study, we performed phylogenetic analyses using genome-wide, specific age data. Our results support the latest hypothesis, the Eloposteoglossocephala hypothesis.
We constructed phylogenetic relationships using comprehensive genome-wide comparison data from 12 species, including C. ornata (as well as Erpetoichthys calabaricus, Amia calva, Lepisosteus oculatus, Scleropages formosus, Esox lucius, Arapaima gigas, Anguilla anguilla, Oryzias latipes, Danio rerio, Megalops atlanticus, and Gasterosteus aculeatus). Furthermore, our analysis confirms that Osteoglossomorpha and Elopomorpha constitute sister groups to Clupeocephala (Figure 2a), consistent with previous research [1]. By integrating differentiation time analysis, we determined that teleost fishes originated around the Permian period (~269 million years ago), while the divergence of Osteoglossiformes took place during the Jurassic (~203 million years ago). Previous studies have shown that plate drift affects speciation to some extent. For example, continental drift has led to the speciation of mammals [33,35]. The combination of our data and paleogeological data suggests that plate movement may be associated with the occurrence of teleost fishes (Figure 2b). Therefore, by using genome-wide data analysis, we determined the evolutionary history of Osteoglossiformes, which provides more references for understanding the evolutionary history of early teleost fishes.

3.3. Genomic Repetitive Sequences and Conserved Features

We then carried out genome annotation to identify repeats of C. ornata. A total of 123,340,324 bp transposable elements (TEs) were predicted, accounting for 16.58% of the genome (Table S4). DNA transposons were dominated in TEs with a proportion of 8.70% genome assembly, followed by 6.54% long interspersed elements (LINEs) and 4.00% long terminal repeats (LTRs) of the genome (Table S4). We found that repetitive sequences showed significant contraction in teleost fishes compared to cartilaginous fishes. It has been previously reported that repetitive sequence can effectively mediate genome expansion in lungfish [36,37]. The specific contraction of TEs is consistent with the fact that the genome of teleosts is smaller, and we also found low LINEs content and LTRs content in early teleosts (Figure 3a,b and Table S4), which we speculate may be the result of the adaptive evolution of teleosts.
Previous studies of vertebrates have shown that the covariance of early ray-finned fishes with modern teleosts is not conserved, suggesting that the teleosts genome underwent a complex and dramatic process of diversification [7,38]. For example, spotted gar are more obviously co-linear with chickens than zebrafish [38]. Based on genome-wide comparison data, we analyzed the conserved evolution of the chromosomes of C. ornata. By analyzing conserved fragments of early ray-finned fishes (Reedfish) and modern teleosts (Zebrafish), we found that the C. ornata and Reedfish have a good collinearity relationship, which is consistent with their evolutionary status as primitive teleosts (Figure 3c,d), suggesting that the C. ornata has more ancestrally conserved genomic features than modern teleost fishes.

3.4. Ancient Genetic Regulation Associated with Pectoral Fin Evolution

Various non-coding conserved elements (CNEs) have been reported to contribute to morphological diversification during evolution [7,32,37,39,40]. One example is an CNE downstream of the Hand2 gene critical for heart development [7]. In addition, CNE, which is located between the exons of genes, also plays an important role. In the study of the genome of African lungfish [37], it was found that the CNE of the Foxp1 intergene region plays an important regulatory role and is essential for vertebrate lung function. Previous studies have shown that the pectoral fins generate thrust through movement, which affects the swimming behavior of fishes [41]. The pectoral fins of fish are a homologous organ of the forelimbs of tetrapods [42,43,44,45,46,47]. Tetrapods evolved from fins to limbs through the inheritance of limb enhancers from their ancestors and a series of genetic innovations, which laid a crucial foundation for their adaptation to terrestrial environments [7,36,37,48]. However, teleosts lost some of their original pectoral-fin structure, which can otherwise be observed in the fins of non-teleost ray-finned fishes and extant lobe-finned fishes (Figure 4a). Specifically, the pectoral fins of early vertebrates are divided into three parts: propterygium, mesopterygium, and metapterygium [42,49,50]. This structure is still found in cartilaginous fish and some early ray-finned fishes. But lobe-finned fish retained only the metapterygium and developed into the forelimbs of tetrapods, while teleosts completely discarded the metapterygium to form a minimalist fin [42,49]. We hypothesize that other limb skeletal regulatory elements from ancestors may also be present in basal fishes. The Gli3 gene encodes a protein that belongs to the C2H2-type zinc finger proteins subclass of the Gli family. The Gli3 gene has been reported to be associated with limb development in mice and fin development in fishes [51,52,53] (Figure 4b). Based on our genome-wide comparison dataset, we observed a specific conserved non-coding element located between the eleventh and twelfth exons of the Gli3 gene, which is present in all gnawed vertebrates except teleost fishes (Figure 4c). To determine CNE function, we validated it in combination with DNase-seq data and found that the CNE overlaps with the open reading frame (ORF) region of Gli3 (Figure 4d), and our data suggest a regulatory role for this CNE.
Thus, our results suggest that, with the help of a comprehensive comparative analysis of the genome of early teleost fishes, it can be suggested that the loss of metapterygium in teleosts may be related to the differentiation of certain regulatory elements.

4. Conclusions

We have assembled a chromosome-level genome of C. ornata, a crucial species of early teleosts, with a genome size of 837.26 Mb. This assembly marks the highest quality chromosome-level genome achieved thus far within the Osteoglossiformes order, offering a reference-level genomic resource for the investigation of early teleosts.
Our phylogenetic analysis indicates that teleosts originated approximately 269 million years ago, with C. ornata diverging around 203 million years ago. This timing suggests a potential correlation with paleotectonic plate movements, hinting at a possible influence of these geological events on the speciation of Osteoglossiformes.
Furthermore, our study revealed specific expansions and contractions of TEs linked to teleost fishes. These changes in TEs might have contributed to the emergence of a more compact genome structure in teleosts. Additionally, we observed the loss of certain CNEs associated with the Gli3 gene, specifically in teleosts. Binding experiments further demonstrated that Gli3 suggests that the loss of metapterygium in teleost fishes may be related to the differentiation of certain regulatory elements. Overall, our comprehensive dataset provides invaluable resources for delving deeper into the evolutionary history of early teleosts.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/biology13070478/s1: Figure S1: Comparative genomic analysis visualization of C. ornata; Table S1: Statistics of genome assembly quality in fishes; Table S2: Statistics on the distribution and proportion of TEs in fishes.; Table S3: Analysis of the top 20 extended gene enrichment results; Table S4: Genome feature statistics of 235 fishes.

Author Contributions

X.L. and G.F. were the leaders and designers of the research; Z.Y., Y.S. and Y.C. conducted field sample collection and prepared samples for sequencing; Z.Y. conducted bioinformatics analysis. Z.Y., S.Z. and M.X. manually checked the genes under study and wrote the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Shandong Province Key Research and Development Plan (No. 2023CXPT057, to G.Y.F.).

Institutional Review Board Statement

All experimental animal treatments in this study have been verified and identified by taxonomic experts and museum taxonomists according to the guidelines approved by the instituted Review Board of Bioethics and Biosafety (BGI-IRB, ethical permit ID: BGI-IRB A20007-T1).

Informed Consent Statement

Not applicable.

Data Availability Statement

The genome assemblies have been deposited in the China National GeneBank DataBase (CNGB) under BioProject CNP0004025.

Conflicts of Interest

All the authors were employed by the company BGI. The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Parey, E.; Louis, A.; Montfort, J.; Bouchez, O.; Roques, C.; Iampietro, C.; Lluch, J.; Castinel, A.; Donnadieu, C.; Desvignes, T. Genome structures resolve the early diversification of teleost fishes. Science 2023, 379, 572–575. [Google Scholar] [CrossRef]
  2. Coates, M.I.; Cohn, M.J. Fins, limbs, and tails: Outgrowths and axial patterning in vertebrate evolution. BioEssays 1998, 20, 371–381. [Google Scholar] [CrossRef]
  3. Coates, M.I. The origin of vertebrate limbs. Development 1994, 1994, 169–180. [Google Scholar] [CrossRef]
  4. Don, E.K.; Currie, P.D.; Cole, N.J. The evolutionary history of the development of the pelvic fin/hindlimb. J. Anat. 2013, 222, 114–133. [Google Scholar] [CrossRef] [PubMed]
  5. Cass, A.N.; Elias, A.; Fudala, M.L.; Knick, B.D.; Davis, M.C. Conserved mechanisms, novel anatomies: The developmental basis of fin evolution and the origin of limbs. Diversity 2021, 13, 384. [Google Scholar] [CrossRef]
  6. Tanaka, Y.; Kudoh, H.; Abe, G.; Yonei-Tamura, S.; Tamura, K. Evo-Devo of the Fin-to-Limb Transition. In Evolutionary Developmental Biology: A Reference Guide; Springer: Berlin/Heidelberg, Germany, 2021; pp. 907–920. [Google Scholar]
  7. Bi, X.; Wang, K.; Yang, L.; Pan, H.; Jiang, H.; Wei, Q.; Fang, M.; Yu, H.; Zhu, C.; Cai, Y. Tracing the genetic footprints of vertebrate landing in non-teleost ray-finned fishes. Cell 2021, 184, 1377–1391.e14. [Google Scholar] [CrossRef]
  8. Fricke, R. Eschmeyer’s catalog of fishes: Genera/species by family/subfamily. Recuperado 2021, 11, 1–230. [Google Scholar]
  9. Wang, O.; Chin, R.; Cheng, X.; Wu, M.K.Y.; Mao, Q.; Tang, J.; Sun, Y.; Anderson, E.; Lam, H.K.; Chen, D. Efficient and unique cobarcoding of second-generation sequencing reads from long DNA molecules enabling cost-effective and accurate sequencing, haplotyping, and de novo assembly. Genome Res. 2019, 29, 798–808. [Google Scholar] [CrossRef] [PubMed]
  10. Cheng, H.; Concepcion, G.T.; Feng, X.; Zhang, H.; Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat. Methods 2021, 18, 170–175. [Google Scholar] [CrossRef]
  11. Servant, N.; Varoquaux, N.; Lajoie, B.R.; Viara, E.; Chen, C.-J.; Vert, J.-P.; Heard, E.; Dekker, J.; Barillot, E. HiC-Pro: An optimized and flexible pipeline for Hi-C data processing. Genome Biol. 2015, 16, 259. [Google Scholar] [CrossRef]
  12. Durand, N.C.; Shamim, M.S.; Machol, I.; Rao, S.S.; Huntley, M.H.; Lander, E.S.; Aiden, E.L. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 2016, 3, 95–98. [Google Scholar] [CrossRef] [PubMed]
  13. Luo, R.; Liu, B.; Xie, Y.; Li, Z.; Huang, W.; Yuan, J.; He, G.; Chen, Y.; Pan, Q.; Liu, Y. SOAPdenovo2: An empirically improved memory-efficient short-read de novo assembler. Gigascience 2012, 1, 18. [Google Scholar] [CrossRef] [PubMed]
  14. Stanke, M.; Diekhans, M.; Baertsch, R.; Haussler, D. Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics 2008, 24, 637–644. [Google Scholar] [CrossRef] [PubMed]
  15. Majoros, W.H.; Pertea, M.; Salzberg, S.L. TigrScan and GlimmerHMM: Two open source ab initio eukaryotic gene-finders. Bioinformatics 2004, 20, 2878–2879. [Google Scholar] [CrossRef] [PubMed]
  16. Kent, W.J. BLAT—The BLAST-like alignment tool. Genome Res. 2002, 12, 656–664. [Google Scholar] [PubMed]
  17. Birney, E.; Clamp, M.; Durbin, R. GeneWise and genomewise. Genome Res. 2004, 14, 988–995. [Google Scholar] [CrossRef] [PubMed]
  18. Elsik, C.G.; Mackey, A.J.; Reese, J.T.; Milshina, N.V.; Roos, D.S.; Weinstock, G.M. Creating a honey bee consensus gene set. Genome Biol. 2007, 8, R13. [Google Scholar] [CrossRef] [PubMed]
  19. De Bie, T.; Cristianini, N.; Demuth, J.P.; Hahn, M.W. CAFE: A computational tool for the study of gene family evolution. Bioinformatics 2006, 22, 1269–1271. [Google Scholar] [CrossRef] [PubMed]
  20. Jurka, J.; Kapitonov, V.V.; Pavlicek, A.; Klonowski, P.; Kohany, O.; Walichiewicz, J. Repbase Update, a database of eukaryotic repetitive elements. Cytogenet. Genome Res. 2005, 110, 462–467. [Google Scholar] [CrossRef]
  21. Chen, N. Using Repeat Masker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinform. 2004, 5, 4.10.1–4.10.14. [Google Scholar] [CrossRef]
  22. Flynn, J.M.; Hubley, R.; Goubert, C.; Rosen, J.; Clark, A.G.; Feschotte, C.; Smit, A.F. RepeatModeler2 for automated genomic discovery of transposable element families. Proc. Natl. Acad. Sci. USA 2020, 117, 9451–9457. [Google Scholar] [CrossRef] [PubMed]
  23. Xu, Z.; Wang, H. LTR_FINDER: An efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res. 2007, 35, W265–W268. [Google Scholar] [CrossRef]
  24. Harris, R.S. Improved Pairwise Alignment of Genomic DNA; The Pennsylvania State University: University Park, PA, USA, 2007. [Google Scholar]
  25. Blanchette, M.; Kent, W.J.; Riemer, C.; Elnitski, L.; Smit, A.F.; Roskin, K.M.; Baertsch, R.; Rosenbloom, K.; Clawson, H.; Green, E.D. Aligning multiple genomic sequences with the threaded blockset aligner. Genome Res. 2004, 14, 708–715. [Google Scholar] [CrossRef]
  26. Stamatakis, A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 2014, 30, 1312–1313. [Google Scholar] [CrossRef] [PubMed]
  27. Frazer, K.A.; Pachter, L.; Poliakov, A.; Rubin, E.M.; Dubchak, I. VISTA: Computational tools for comparative genomics. Nucleic Acids Res. 2004, 32, W273–W279. [Google Scholar] [CrossRef]
  28. Supiwong, W.; Tanomtong, A.; Khakhong, S.; Silawong, K.; Aoki, S.; Sanoamuang, L.-O. The first chromosomal characteristics of nucleolar organizer regions and karyological analysis of clown knife fish, Chitala ornata (Osteoglossiformes, Notopteridae) by T-lymphocyte cell culture. Cytologia 2012, 77, 393–399. [Google Scholar] [CrossRef]
  29. Hughes, L.C.; Ortí, G.; Huang, Y.; Sun, Y.; Baldwin, C.C.; Thompson, A.W.; Arcila, D.; Betancur, R.; Li, C.; Becker, L. Comprehensive phylogeny of ray-finned fishes (Actinopterygii) based on transcriptomic and genomic data. Proc. Natl. Acad. Sci. USA 2018, 115, 6249–6254. [Google Scholar] [CrossRef] [PubMed]
  30. Hao, S.; Han, K.; Meng, L.; Huang, X.; Cao, W.; Shi, C.; Zhang, M.; Wang, Y.; Liu, Q.; Zhang, Y. African Arowana genome provides insights on ancient teleost evolution. Iscience 2020, 23, 101662. [Google Scholar] [CrossRef] [PubMed]
  31. Chen, L.; Qiu, Q.; Jiang, Y.; Wang, K.; Lin, Z.; Li, Z.; Bibi, F.; Yang, Y.; Wang, J.; Nie, W. Large-scale ruminant genome sequencing provides insights into their evolution and distinct traits. Science 2019, 364, eaav6202. [Google Scholar] [CrossRef]
  32. Christmas, M.J.; Kaplow, I.M.; Genereux, D.P.; Dong, M.X.; Hughes, G.M.; Li, X.; Sullivan, P.F.; Hindle, A.G.; Andrews, G.; Armstrong, J.C. Evolutionary constraint and innovation across hundreds of placental mammals. Science 2023, 380, eabn3943. [Google Scholar] [CrossRef]
  33. Foley, N.M.; Mason, V.C.; Harris, A.J.; Bredemeyer, K.R.; Damas, J.; Lewin, H.A.; Eizirik, E.; Gatesy, J.; Karlsson, E.K.; Lindblad-Toh, K. A genomic timescale for placental mammal evolution. Science 2023, 380, eabl8189. [Google Scholar] [CrossRef] [PubMed]
  34. Stiller, J.; Feng, S.; Chowdhury, A.-A.; Rivas-González, I.; Duchêne, D.A.; Fang, Q.; Deng, Y.; Kozlov, A.; Stamatakis, A.; Claramunt, S. Complexity of avian evolution revealed by family-level genomes. Nature 2024, 629, 851–860. [Google Scholar] [CrossRef] [PubMed]
  35. Phylogenetics, U.B. Resolution of the Early Placental Mammal Radiation. Science 1998, 282, 1871. [Google Scholar]
  36. Meyer, A.; Schloissnig, S.; Franchini, P.; Du, K.; Woltering, J.M.; Irisarri, I.; Wong, W.Y.; Nowoshilow, S.; Kneitz, S.; Kawaguchi, A. Giant lungfish genome elucidates the conquest of land by vertebrates. Nature 2021, 590, 284–289. [Google Scholar] [CrossRef] [PubMed]
  37. Wang, K.; Wang, J.; Zhu, C.; Yang, L.; Ren, Y.; Ruan, J.; Fan, G.; Hu, J.; Xu, W.; Bi, X. African lungfish genome sheds light on the vertebrate water-to-land transition. Cell 2021, 184, 1362–1376.e18. [Google Scholar] [CrossRef] [PubMed]
  38. Braasch, I.; Gehrke, A.R.; Smith, J.J.; Kawasaki, K.; Manousaki, T.; Pasquier, J.; Amores, A.; Desvignes, T.; Batzel, P.; Catchen, J. The spotted gar genome illuminates vertebrate evolution and facilitates human-teleost comparisons. Nat. Genet. 2016, 48, 427–437. [Google Scholar] [CrossRef] [PubMed]
  39. Peng, C.; Wu, D.-D.; Ren, J.-L.; Peng, Z.-L.; Ma, Z.; Wu, W.; Lv, Y.; Wang, Z.; Deng, C.; Jiang, K. Large-scale snake genome analyses provide insights into vertebrate development. Cell 2023, 186, 2959–2976.e22. [Google Scholar] [CrossRef]
  40. Seki, R.; Li, C.; Fang, Q.; Hayashi, S.; Egawa, S.; Hu, J.; Xu, L.; Pan, H.; Kondo, M.; Sato, T. Functional roles of Aves class-specific cis-regulatory elements on macroevolution of bird-specific features. Nat. Commun. 2017, 8, 14229. [Google Scholar] [CrossRef] [PubMed]
  41. Wilga, C.; Lauder, G.V. Locomotion in sturgeon: Function of the pectoral fins. J. Exp. Biol. 1999, 202, 2413–2432. [Google Scholar] [CrossRef]
  42. Hawkins, M.B.; Henke, K.; Harris, M.P. Latent developmental potential to form limb-like skeletal structures in zebrafish. Cell 2021, 184, 899–911.e13. [Google Scholar] [CrossRef]
  43. Stewart, T.A.; Lemberg, J.B.; Taft, N.K.; Yoo, I.; Daeschler, E.B.; Shubin, N.H. Fin ray patterns at the fin-to-limb transition. Proc. Natl. Acad. Sci. USA 2020, 117, 1612–1620. [Google Scholar] [CrossRef] [PubMed]
  44. Zhu, M.; Yu, X. Stem sarcopterygians have primitive polybasal fin articulation. Biol. Lett. 2009, 5, 372–375. [Google Scholar] [CrossRef] [PubMed]
  45. Woltering, J.M.; Irisarri, I.; Ericsson, R.; Joss, J.M.; Sordino, P.; Meyer, A. Sarcopterygian fin ontogeny elucidates the origin of hands with digits. Sci. Adv. 2020, 6, eabc3510. [Google Scholar] [CrossRef] [PubMed]
  46. Tulenko, F.J.; Augustus, G.J.; Massey, J.L.; Sims, S.E.; Mazan, S.; Davis, M.C. HoxD expression in the fin-fold compartment of basal gnathostomes and implications for paired appendage evolution. Sci. Rep. 2016, 6, 22720. [Google Scholar] [CrossRef] [PubMed]
  47. Freitas, R.; Zhang, G.; Cohn, M.J. Biphasic Hoxd gene expression in shark paired fins reveals an ancient origin of the distal limb domain. PLoS ONE 2007, 2, e754. [Google Scholar] [CrossRef] [PubMed]
  48. Amemiya, C.T.; Alföldi, J.; Lee, A.P.; Fan, S.; Philippe, H.; MacCallum, I.; Braasch, I.; Manousaki, T.; Schneider, I.; Rohner, N. The African coelacanth genome provides insights into tetrapod evolution. Nature 2013, 496, 311–316. [Google Scholar] [CrossRef] [PubMed]
  49. Thompson, A.W.; Hawkins, M.B.; Parey, E.; Wcisel, D.J.; Ota, T.; Kawasaki, K.; Funk, E.; Losilla, M.; Fitch, O.E.; Pan, Q. The bowfin genome illuminates the developmental evolution of ray-finned fishes. Nat. Genet. 2021, 53, 1373–1384. [Google Scholar] [CrossRef]
  50. Marlétaz, F.; de la Calle-Mustienes, E.; Acemel, R.D.; Paliou, C.; Naranjo, S.; Martínez-García, P.M.; Cases, I.; Sleight, V.A.; Hirschberger, C.; Marcet-Houben, M. The little skate genome and the evolutionary emergence of wing-like fins. Nature 2023, 616, 495–503. [Google Scholar] [CrossRef] [PubMed]
  51. Tanaka, M. Fins into limbs: Autopod acquisition and anterior elements reduction by modifying gene networks involving 5′Hox, Gli3, and Shh. Dev. Biol. 2016, 413, 1–7. [Google Scholar] [CrossRef]
  52. Wang, C.; Rüther, U.; Wang, B. The Shh-independent activator function of the full-length Gli3 protein and its role in vertebrate limb digit patterning. Dev. Biol. 2007, 305, 460–469. [Google Scholar] [CrossRef]
  53. Zhang, J.; Wagh, P.; Guay, D.; Sanchez-Pulido, L.; Padhi, B.K.; Korzh, V.; Andrade-Navarro, M.A.; Akimenko, M.-A. Loss of fish actinotrichia proteins and the fin-to-limb transition. Nature 2010, 466, 234–237. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Chromosome-level genome assemblies of C. ornata. (a) The interaction map of Hi-C data. (b) Genomic quality statistics of the Osteoglossiformes.
Figure 1. Chromosome-level genome assemblies of C. ornata. (a) The interaction map of Hi-C data. (b) Genomic quality statistics of the Osteoglossiformes.
Biology 13 00478 g001
Figure 2. Phylogenetic and divergence time analysis. (a) Phylogenetic relationship analysis of C. ornata. (b) Analysis of paleogeologic changes and divergence times.
Figure 2. Phylogenetic and divergence time analysis. (a) Phylogenetic relationship analysis of C. ornata. (b) Analysis of paleogeologic changes and divergence times.
Biology 13 00478 g002
Figure 3. TEs and synteny analysis across multiple species. (a) Comparative analysis of TE composition in different fishes. (b) Total proportion and genomic characterization of TEs in 235 fishes. (c) Genome synteny of the Reedfish–Clown featherback. (d) Genome synteny of Zebrafish–Clown featherback.
Figure 3. TEs and synteny analysis across multiple species. (a) Comparative analysis of TE composition in different fishes. (b) Total proportion and genomic characterization of TEs in 235 fishes. (c) Genome synteny of the Reedfish–Clown featherback. (d) Genome synteny of Zebrafish–Clown featherback.
Biology 13 00478 g003
Figure 4. Identification of conserved elements of the pectoral fin. (a) Hypothetical transitions in pectoral fin evolution. (b) Diagram of Gli3 contributing to limb development [51]. (c) VISTA plot showing the presence of a limb-related CNE intro of the Gli3 gene across all jawed vertebrates except the teleost. Peaks (blue, exons; red, non-coding regions) indicate regions with conserved sequences compared to their human counterparts. The Gli3-CNE is highlighted in pale yellow. (d) Human DNase I hypersensitivity site (DHS) data suggest CNE as a potential regulatory element (data from https://www.encodeproject.org/ and accessed on 1 April 2023).
Figure 4. Identification of conserved elements of the pectoral fin. (a) Hypothetical transitions in pectoral fin evolution. (b) Diagram of Gli3 contributing to limb development [51]. (c) VISTA plot showing the presence of a limb-related CNE intro of the Gli3 gene across all jawed vertebrates except the teleost. Peaks (blue, exons; red, non-coding regions) indicate regions with conserved sequences compared to their human counterparts. The Gli3-CNE is highlighted in pale yellow. (d) Human DNase I hypersensitivity site (DHS) data suggest CNE as a potential regulatory element (data from https://www.encodeproject.org/ and accessed on 1 April 2023).
Biology 13 00478 g004
Table 1. Summary of the species and genome data in this study.
Table 1. Summary of the species and genome data in this study.
Scientific NameChitala ornata
English nameClown featherback
Contig number126
Contig N50 (bp)32,781,493
Contig N90 (bp)10,454,454
Chromosome number21
Scaffold N50 (bp)40,727,490
Scaffold N90 (bp)22,642,485
Hi-C-anchored ratio94.78%
Assembled genome size (bp)837,264,786
GC content41.92%
Genome-complete BUSCOs (C)96.40%
Complete and single-copy BUSCOs (S)93.10%
Complete and duplicated BUSCOs (D)3.30%
Fragmented BUSCOs (F)1.40%
Missing BUSCOs (M)2.20%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yuan, Z.; Song, Y.; Zhang, S.; Chen, Y.; Xu, M.; Fan, G.; Liu, X. The Chromosome-Scale Genome of Chitala ornata Illuminates the Evolution of Early Teleosts. Biology 2024, 13, 478. https://doi.org/10.3390/biology13070478

AMA Style

Yuan Z, Song Y, Zhang S, Chen Y, Xu M, Fan G, Liu X. The Chromosome-Scale Genome of Chitala ornata Illuminates the Evolution of Early Teleosts. Biology. 2024; 13(7):478. https://doi.org/10.3390/biology13070478

Chicago/Turabian Style

Yuan, Zengbao, Yue Song, Suyu Zhang, Yadong Chen, Mengyang Xu, Guangyi Fan, and Xin Liu. 2024. "The Chromosome-Scale Genome of Chitala ornata Illuminates the Evolution of Early Teleosts" Biology 13, no. 7: 478. https://doi.org/10.3390/biology13070478

APA Style

Yuan, Z., Song, Y., Zhang, S., Chen, Y., Xu, M., Fan, G., & Liu, X. (2024). The Chromosome-Scale Genome of Chitala ornata Illuminates the Evolution of Early Teleosts. Biology, 13(7), 478. https://doi.org/10.3390/biology13070478

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop