The Development of Plant Genome Sequencing Technology and Its Conservation and Application in Endangered Gymnosperms

Genome sequencing is widely recognized as a fundamental pillar in genetic research and legal studies of biological phenomena, providing essential insights for genetic investigations and legal analyses of biological events. The field of genome sequencing has experienced significant progress due to rapid improvements in scientific and technological developments. These advancements encompass not only significant improvements in the speed and quality of sequencing but also provide an unparalleled opportunity to explore the subtle complexities of genomes, particularly in the context of rare species. Such a wide range of possibilities has successfully supported the validation of plant gene functions and the refinement of precision breeding methodologies. This expanded scope now includes a comprehensive exploration of the current state and conservation efforts of gymnosperm gene sequencing, offering invaluable insights into their genomic landscapes. This comprehensive review elucidates the trajectory of development and the diverse applications of genome sequencing. It encompasses various domains, including crop breeding, responses to abiotic stress, species evolutionary dynamics, biodiversity, and the unique challenges faced in the conservation and utilization of gymnosperms. It highlights both ongoing challenges and the unveiling of forthcoming developmental trajectories.


Introduction
Recent modern biotechnology, such as plant genome sequencing, has emerged as a pivotal tool for studying plant biology and ecology.The sequencing technique serves as plant structural organization and functional complexities and functions as a valuable resource for uncovering the plant molecular mechanisms underlying adaptive evolution, stress resilience, and agricultural characteristics [1,2].
For instance, the identification of two distinct Sichuan pepper (Zanthoxylum armatum and Zanthoxylum bungeanum) genomes has enabled the exploration of evolutionary relationships, phenotypic variations, and the interaction of adaptive evolution within the genomes of Chinese Sichuan pepper [3].Plant genome sequencing expanded, encompassing diverse species, revealing hidden genetics influencing responses to the environment and pathogen resilience, boosting agricultural productivity [4].Genome sequencing technology significantly impacts essential areas like genetic optimization and crop breeding advancement.A recent study had scholars conduct genome sequencing on nine wild tomato species (Solanum lycopersicum) and two cultivated variants, thoroughly analyzing genetic diversity and structural variations in these tomato genomes [5].Nevertheless, the ongoing progression of technological innovation has introduced a variety of new challenges, namely in the areas of data analysis and the complex annotation of gene functionality [6].Genome sequencing provides valuable resources for the utilization of plant resources and the conservation of endangered plants.However, the majority of genome sequencing efforts have been focused on angiosperms, making genome sequencing of gymnosperms of significant scientific and practical importance [7].The importance of gymnosperm genome sequencing is evident in several aspects.Firstly, gymnosperms represent an ancient group of plants, and their genome research contributes to unveiling their long evolutionary history and mechanisms of adaptation to diverse environments [8].Furthermore, gymnosperms play a significant role in ecology and ecosystem management, particularly in the stability of forest ecosystems [9][10][11][12].Research in this field also provides the foundation for enhancing the quality and sustainable utilization of forest products, including timber, pulp, and medicinal plants [13,14].Thus, gymnosperm genome sequencing holds profound significance for scientific research, ecosystem conservation, and economic resource development.In this review, we will delve into the developmental journey of plant genome sequencing and the latest research findings in the field of gymnosperm genome sequencing.We will explore the applications of genome sequencing technology and its implications for the conservation of gymnosperms.Through this review, we aim to stimulate greater interest in gymnosperm genome sequencing research, encouraging more scientists to engage in research in this field, and providing additional knowledge and wisdom for the development of an ecological civilization.

Advancements in Genome Sequencing Technology and Its Evolution
A faster processing speed and lower cost are responsible for the major advances in genomics.Many academics have spent the last 50 years working to create the tools and processes needed to sequence DNA and RNA molecules.This era has observed significant advancements in sequencing technology, transitioning from the analysis of individual bases to the processing of millions, and evolving from efforts to decipher the coding sequence of isolated genes to the efficient and cost-effective pursuit of whole-genome level sequencing [15].The trajectory began with the initiation of DNA sequencing in 1977 [16], followed by the advent of Next-Generation Sequencing (NGS) in the early 2000s [17].The early 2010s witnessed the emergence of third-generation sequencing technology, along with the progression of technological advancements [18].The advancement of technology has not only propelled the pursuit of corresponding experimental analyses but has also catalyzed rapid progress across the entire scientific domain (Figure 1).A comparison of the advantages and disadvantages of first-generation sequencing technology, secondgeneration sequencing technology, and third-generation sequencing technology is shown in Table 1.
Table 1.Comparison of the Advantages and Disadvantages of First-generation Sequencing Technology, Second-generation Sequencing Technology, and Third-generation Sequencing Technology.

Pioneering Generation Genome Sequencing Technology
Sanger sequencing represents a pioneering example of first-generation genome sequencing technology [37].It predominantly employs the dideoxynucleotide chain termination method in conjunction with the Sodium Dodecyl Sulfate Poly Acrylamide Gel Electrophoresis (SDS-PAGE).The basic idea is that dideoxyribonucleoside tri-phosphates (ddNTPs) are incorporated into the newly manufactured DNA strand and prevent the creation of phosphodiester linkages, which stops nucleotide inclusion.As a result, many DNA fragments with distinct endpoints and a similar beginning point are produced.Selectively, these pieces end at adenine (A), cytosine (C), guanine (G), or thymine (T) nucleotide bases.After the fragments are separated using polyacrylamide gel electrophoresis, the original template's nucleotide sequence can be inferred using auto-radiography.(Figure 2).

Pioneering Generation Genome Sequencing Technology
Sanger sequencing represents a pioneering example of first-generation genome sequencing technology [37].It predominantly employs the dideoxynucleotide chain termination method in conjunction with the Sodium Dodecyl Sulfate Poly Acrylamide Gel Electrophoresis (SDS-PAGE).The basic idea is that dideoxyribonucleoside tri-phosphates (ddNTPs) are incorporated into the newly manufactured DNA strand and prevent the creation of phosphodiester linkages, which stops nucleotide inclusion.As a result, many DNA fragments with distinct endpoints and a similar beginning point are produced.Selectively, these pieces end at adenine (A), cytosine (C), guanine (G), or thymine (T) nucleotide bases.After the fragments are separated using polyacrylamide gel electrophoresis, the original template's nucleotide sequence can be inferred using auto-radiography.(Figure 2).
An extended read length, modest throughput, accuracy, and dependability made Sanger sequencing the gold standard for DNA sequencing for a considerable amount of time.However, its equipment is expensive and time-consuming, which limits its widespread large-scale deployment [38].An extended read length, modest throughput, accuracy, and dependability made Sanger sequencing the gold standard for DNA sequencing for a considerable amount of time.However, its equipment is expensive and time-consuming, which limits its widespread large-scale deployment [38].

Next Generation Sequencing
Genome sequencing technology is endlessly progressing in terms of cost reduction, increased throughput, and enhanced speed [39].The development of second-generation sequencing technology followed the introduction of large-scale dideoxy sequencing, which brought to light the shortcomings of first-generation sequencing technology in satisfying the demands of high-throughput and high-quality large-scale genome sequencing.NGS is a type of high-throughput sequencing that allows for the simultaneous examination of millions of nucleic acid molecular sequences, revolutionizing sequencing methods.The next generation of genome sequencing technologies has a strong foundation thanks to this innovation.Through continuous technological refinement, second-generation sequencing technology emerged, such as Roche's 454 technology [40], Illumina's Solexa technology [41], and ABI's Solid technology [42].Unlike its predecessor, this technology does not rely on nucleotide consistency inference through pre-electrophoretic visualization using radioactive or fluorescently tagged dNTPs or oligonucleotides.

Roche 454 Sequencing Technology
The 454 sequencing technology, also known as pyrosequencing, employs a methodology known as "sequencing by synthesis" (SBS).This technique encompasses the utilization of "water-in-oil PCR" and "pyrosequencing technology".The current methodology involves conducting polymerase chain reactions (PCR) within confined compartments, referred to as small wells.Each cycle of the PCR reaction entails the addition of a single dNTP based on the sequence.When dNTP and sequence couple well, a pyrophosphate group is released and reacts with ATP sulfate chemase to produce adenosine triphosphate (ATP).When luciferase and freshly created ATP oxidize, light is released and recorded for sequencing using a CCD camera.The length of homomers can be difficult to measure exactly with present technology, which could result in errors and inaccuracies.Roche, who invented second-generation sequencing, shut out its 454 business in 2013 because of accuracy problems that limited platform updates and application scope [43].

Illumina Sequencing and Solexa Technology
A multitude of parallel sequencing techniques surfaced with the introduction of 454 sequencing.Notably, the Solexa method was very significant; it was later acquired by Illumina.The "SBS" method used by Solexa sequencing entails the "reversible terminal termination reaction" and "DNA clustering" processes.The Solexa method links DNA fragments with cleaved connectors by using complementary oligonucleotides that stick to the flow pool, unlike bead-based PCR.By using arching duplicated DNA, the solidphase PCR creates clusters from primitive flow cell-bound DNA strands, which starts the "bridge amplification" process.Solexa achieves self-sequencing utilizing fluorescent "reversible terminator" dNTPs.When the fluorophore is in the 3' hydroxyl position, these inhibit binding.For simultaneous sequencing, the fluorophore is removed prior to polymerization.Modified dNTPs and DNA polymerases are washed cyclically in singlestranded cell-binding clusters.CCD monitors nucleotide identity via activated fluorophores, before enzymatic removal of the blocked fluorescent part, progressing to the next site (Figure 3).
Illumina released Solexa sequencing for sale in 2006.The first-generation product was the Initial Genome Analyzer (GA).Hiseq thereafter released models such as the X-ten system, Hiseq2000, Hiseq2500, Hiseq3000, and Hiseq4000.Illumina continued to grow by introducing the compact, lower throughput Miseq system and the high throughput Nextseq platform.The Miseq platform has a number of advantages, including lower costs, quicker processing, and longer read lengths [44].Illumina released Solexa sequencing for sale in 2006.The first-generation produ was the Initial Genome Analyzer (GA).Hiseq thereafter released models such as the X-te system, Hiseq2000, Hiseq2500, Hiseq3000, and Hiseq4000.Illumina continued to grow b introducing the compact, lower throughput Miseq system and the high throughp Nextseq platform.The Miseq platform has a number of advantages, including lower cost quicker processing, and longer read lengths [44].

ABI SOLiD Sequencing Technology
"Sequencing while connecting" is the basic idea behind SOLiD (Sequencing by O gonucleotide Ligation and Detection) sequencing technology.This is accomplished by em ploying "water-in-oil PCR" and "ligase sequencing" techniques.The library configuratio

ABI SOLiD Sequencing Technology
"Sequencing while connecting" is the basic idea behind SOLiD (Sequencing by Oligonucleotide Ligation and Detection) sequencing technology.This is accomplished by employing "water-in-oil PCR" and "ligase sequencing" techniques.The library configuration is similar to the Solexa technique, which uses hybridization to attach DNA molecules to beads.Beads, PCR components, and oil are used to create separated reaction environments, which improves the sequencing template amplification process.Sequencing beads are attached to a SOLiD slide's surface in order to facilitate the detection of sequence information.In order to decode SOLiD, a fluorescent DNA probe must connect to the target sequence in order to release signals that indicate the sequence order.The method's 99.99% accuracy surpasses that of previous second-generation techniques.Dual reading and linked technology, which successfully addresses PCR limits in high GC regions, are used to achieve this.According to a prior study, the SOLiD sequencing tool performs better in samples with a high GC content [45].
DNA sequencer capabilities are advancing at a rate that is significantly faster than the traditional Moore's Law trend.According to Moore's Law, the number of transistors per unit cost, which measures the degree of complexity displayed by microchips, tends to double every two years.However, the evolution of DNA sequencing capabilities presents a remarkable contrast, with a striking doubling frequency occurring every five months from 2004 to 2015 [46].These various fields of technological advancement exhibit differences in their functional capabilities, chemical makeup, and technical attributes.Consequently, they provide a range of unique technical platforms that researchers can customize to meet their own experimental requirements.

Third−Generation Genome Sequencing Technology (TGS)
While Oxford Nanopore Technologies (ONT) and Pacific Biosciences offer long-read sequencing (more than 500 base pairs), Illumina and Thermo Fisher offer short-read sequencing platforms (between 100 and 400 base pairs).Another benefit of third-generation sequencing technologies is this [47].The basic principles of third-generation genome sequencing technology can be seen in Figure 4.
Plants 2023, 12, x FOR PEER REVIEW 7 of 27 accuracy surpasses that of previous second-generation techniques.Dual reading and linked technology, which successfully addresses PCR limits in high GC regions, are used to achieve this.According to a prior study, the SOLiD sequencing tool performs better in samples with a high GC content [45].DNA sequencer capabilities are advancing at a rate that is significantly faster than the traditional Moore's Law trend.According to Moore's Law, the number of transistors per unit cost, which measures the degree of complexity displayed by microchips, tends to double every two years.However, the evolution of DNA sequencing capabilities presents a remarkable contrast, with a striking doubling frequency occurring every five months from 2004 to 2015 [46].These various fields of technological advancement exhibit differences in their functional capabilities, chemical makeup, and technical attributes.Consequently, they provide a range of unique technical platforms that researchers can customize to meet their own experimental requirements.

Third−Generation Genome Sequencing Technology (TGS)
While Oxford Nanopore Technologies (ONT) and Pacific Biosciences offer long-read sequencing (more than 500 base pairs), Illumina and Thermo Fisher offer short-read sequencing platforms (between 100 and 400 base pairs).Another benefit of third-generation sequencing technologies is this [47].The basic principles of third-generation genome sequencing technology can be seen in Figure 4.The two leading companies in third-generation sequencing are ONT and Pacific Biosciences (PacBio).With the introduction of PacBio's SMRT sequencing, a third-generation innovation that permits single-molecule sequencing, doing away with PCR, and permitting infinite nucleic acid sequencing, technological breakthroughs have reached a significant milestone in genome sequencing [48].Third-generation sequencing offers several benefits, chief among them the remarkable capacity to drastically cut down on single-base mistakes, thereby circumventing earlier constraints.Because this approach is unbiased and unaffected by palindromic sequences, it can accurately identify mutations and avoid false positives.It avoids PCR, is resistant to changes caused by humans, provides long readings, guarantees even coverage, and finds methylation directly.
The SMRT technique from PacBio, which is renowned for its precision and adaptability, is revolutionizing genome sequencing.It makes use of nanostructures called Zero-Mode Waveguides (ZMWs), which have tiny holes in them to allow real-time DNA polymerization.This discovery could completely change the field of genetic research.In ZMW exposed to light, DNA polymerase elongates strands one base at a time using tagged dNTPs in real time.DNA molecule sequencing is accelerated by detectable fluorescence from recently inserted nucleotides; label removal cuts off the signal.
There are a number of benefits that set the PacBio sequencing series unique from other commercial technologies.The ability of SMRT sequencing to produce kinetic data during polymerase-driven sequencing is a crucial technical feature.This information facilitates base modification detection, which is an essential tool for de novo genome assembly [49,50].This novel method not only speeds up the sequencing of individual molecules but also offers insightful information about genetic changes and structure.In the 1980s, the idea of nanopore sequencing first surfaced [51].Nanopore technology analyzes alterations when biomolecules move through microscopic pores.Because of this, single-molecule sensing and analysis is possible using nanopore technology.The first iteration of ONT, called MinION, a single-molecule sequencing technology based on nanopores, was made available in 2014 [52].Many alignments and tools for base detection, data processing, read mapping, de novo assembly, and variant discovery have now been developed by Nanopore technology [53].Poretools is a data processing tool for MinION nanopore sequencing that supports quality control, format conversion, and data exploration [54].Genopo is an android application for nanopore sequencing, bringing nanopore sequencing analysis to smartphones for the first time, making genetic research more convenient [55].SquiggleKit is a toolkit for manipulating and querying nanopore data, simplifying file handling, data extraction, visualization, and signal processing [56].A flexible toolset for analyzing DNA and RNA modifications is called ModPhred.It supports many alteration types, incorporates modification information into FASTQ and BAM files, and makes it easier to see and analyze modification data [57].Because nanopore sequencing allows the study of genetic and epigenetic alterations and their function in gene expression easier and more accessible, it has several uses.PacBio HiFi and ONT each have their own strengths and weaknesses, as shown in Table 2.

Applications of Genome Sequencing Technology
The PacBio RS II sequencer has been effectively utilized to generate a 1.27 Gb genome assembly of Dendrobium officinale [70].By utilizing advanced sequencing technologies such as Illumina HiSeq, Nanopore, PacBio, and Hi-C, the results have revealed remarkable N50 values of 44 Mb and 65.35 Mb for Gardenia jasminoides and Chimonanthus praecox, respectively, surpassing previously perceived limitations [71,72].Additionally, to date, it is worth noting that a substantial number of plant reference genome sequences, exceeding 800, have been officially released and are accessible to the public [73].This surge has created new opportunities to enhance the efficiency of plant genetic research at the molecular level, enabling a deeper understanding of plant genome structure, gene composition, functionality, and the evolutionary processes of different species (Figure 5).Complex data processing needs more computing resources and tools.
Relatively shorter read lengths are unsuitable for some long-read studies. [68,69]

Applications of Genome Sequencing Technology
The PacBio RS II sequencer has been effectively utilized to generate a 1.27 Gb genome assembly of Dendrobium officinale [70].By utilizing advanced sequencing technologies such as Illumina HiSeq, Nanopore, PacBio, and Hi-C, the results have revealed remarkable N50 values of 44 Mb and 65.35 Mb for Gardenia jasminoides and Chimonanthus praecox, respectively, surpassing previously perceived limitations [71,72].Additionally, to date, it is worth noting that a substantial number of plant reference genome sequences, exceeding 800, have been officially released and are accessible to the public [73].This surge has created new opportunities to enhance the efficiency of plant genetic research at the molecular level, enabling a deeper understanding of plant genome structure, gene composition, functionality, and the evolutionary processes of different species (Figure 5).

Enhancing Crop Quality through Molecular Breeding
A high-throughput sequencing technique called genotyping by sequencing (GBS) greatly broadens the pool of molecular markers that are available for crop genetics

Enhancing Crop Quality through Molecular Breeding
A high-throughput sequencing technique called genotyping by sequencing (GBS) greatly broadens the pool of molecular markers that are available for crop genetics research.It is possible to use the association between these Single Nucleotide Polymorphisms (SNPs) and pertinent agronomic factors to validate trait-associated haplotypes in crops or to aid in marker-assisted breeding.Unlike other genotyping techniques, like simple sequence repeats (SSR) or restricted fragment length polymorphism (RFLP), GBS may identify a wide variety of SNPs [74].In genotyping scenarios, including populations such as recombinant inbred lines (RILs) in rice (Oryza sativa), maize (Zea mays), barley (Hordeum vulgare), and double haploid (DH) populations in wheat (Triticum aestivum), this technology has demonstrated success [75][76][77].Furthermore, the application of MutMap technology [78] has successfully identified the pathogenic gene OsRR22 in the rice salt-tolerant mutant hst1.The precise insights into gene transcript abundance are provided by RNA-seq-assisted expression profiling, which makes it easier to identify specific symptoms [79].Pearl millet (Pennisetum glaucum), a cereal crop that is widely recognized for its exceptional heat tolerance, has been studied using a graph-based pan-genome approach in the context of genetic sequencing.This has revealed genomic variations linked to heat resilience and opened the door to cultivating more resilient crops in the face of changing climatic conditions [80].Furthermore, the genome of the CIMBL55 maize drought-resistant germplasm resource has been assembled and annotated to a high standard thanks to the use of third-generation PacBio long-read sequencing technology, Hi-C technology, and optical mapping [81].

Exploration of Epigenetic Regulatory Mechanisms
Plant methylation is divided into two types: Cytosine methylation (C-methylation) and Adenine methylation (A-methylation) [82].Joint-snhmC-seq is a novel technique for accurately determining 5-methylcytosine (5mC) and 5-hydroxylmethylcytosine (5hmC) levels in DNA at the single-cell level, offering advantages of low sample input, high accuracy, and simultaneous detection of both modifications [83].Recently developed software, such as DeepSignal-plant (version 1.2.0), combines deep learning and nanopore sequencing to detect methylation information in plant genomes [84].Analysis of Methylation-Sensitive Amplified Fragment Length Polymorphism (MS-AFLP or MSAP) is commonly used to assess changes in response to methylated cytosine under various stimuli and has recently been applied in ecological studies of wild plant populations [85].
A-methylation is a common modification in RNA, including N6-methyladenosine (m 6 A) and N1-methyladenosine (m 1 A) [86].Among these two, m 6 A is considered the most common, abundant, and evolutionarily conserved internal transcription modification in eukaryotic mRNA [87].Nanopore sequencing is a method that does not require heavy sulfuric acid conversion or immunoprecipitation enrichment experiments.It enables the direct simultaneous detection of C5-methylcytosine and m6A during sequencing [88].In recent times, the utilization of the ONT platform for direct RNA sequencing (DRS) has been considered a promising alternative method for studying m 6 A [89].
In Phyllostachys edulis, the application of DRS for the modification of circular RNA has successfully achieved the precise detection of m 6 A, revealing the presence of m 6 A modifications in circular RNA and their distribution within exonic circular RNA [90].Through the use of an improved rice genome and SMRT sequencing technology, researchers have successfully identified genome-wide m 6 A sites in both indica and japonica rice genomes at a single-nucleotide resolution.Concurrently, they observed a positive correlation between m 6 A and the expression of key genes associated with heat stress.Subsequently, they conducted further screening for potential mutants related to epigenetics [91].In petunia petals, a comprehensive transcriptome analysis of RNA containing an m1A modification was conducted by combining LC-MS/MS, dot blotting, and methylated RNA immunoprecipitation sequencing (MeRIP-seq or m 1 A-seq).The study revealed that ethylene treatment decreased the overall mRNA m1A peak in the petals [92].These studies have provided valuable information for a deeper understanding of the mechanistic roles of m 6 A and m1A in different biological systems.
Techniques for detecting DNA methylation fall into three categories.Whole-genome investigations utilizing a variety of methods, including nanopore sequencing, wholegenome bisulfite sequencing (WGBS), DNA methylation immunoprecipitation (MeDIP), and reduced representation bisulfite (RRBS) [93].Advances in sequencing technologies have led to a significant expansion in the field of studying plant epigenetics in a variety of contexts.Significantly enhancing our understanding of how plants epigenetically respond to environmental signals has, in turn, facilitated plant adaptation.
Bisulfite pyrosequencing is a quick gold standard for methylation, and NGS, like WGBS, facilitates comprehensive genome-wide analysis of DNA methylation changes [94].WGBS, a robust molecular experimental technique, enables the precise examination of methylation status at individual CpG sites with high resolution [95].WGBS treats DNA with sodium bisulfite, converting unmethylated cytosines to uracils while preserving methylated cytosines.High-throughput sequencing compares results with the reference genome to identify methylated sites and levels [96].In Morus alba, the WGBS methylation analysis unveiled genomic modifications in response to drought stress, thereby enhancing our comprehension of the intricate relationship between DNA methylation and the regulation of gene expression under abiotic stress conditions [97].Similarly, the application of analytical methods such as WGBS revealed a correlation between tanshinone accumulation and the methylation levels of key enzyme genes, underscoring the significance of CHH methylation in the regulation of tanshinone biosynthesis [98].In Fragaria vesca, WGBS studies brought to light that FDM1 regulates gene expression through CHH methylation, particularly at the promoter and 3' end, exerting an impact on DNA methylation levels and influencing both plant height and fruit size [99].In Glycine max, the utilization of WGBS demonstrated the heritability of DNA methylation variation [100].However, due to short read lengths, WGBS is unable to analyze repetitive genomic regions or regions with 5mC in PCR-biased areas.
MeDIP is an additional accurate technique for analyzing DNA methylation.Methylationsensitive restriction enzyme sequencing (MRE-seq) can be combined with MeDIP to improve the precision of methylation investigations [101].The combination of MeDIP and MREseq serves to further refine the precision of methylation studies [102].In Prunus avium, DNA methylation levels were analyzed through MeDIP, revealing that PavMADS1 and PavMADS2 are intriguing candidate genes involved in regulating flowering.Additionally, they play a crucial role in regulating dormancy in sweet cherries [103].Global MeDIP-Seq in young and aging Gossypium hirsutum revealed lower DNA methylation in aging cotton leaves.Reduced DNA methyltransferase activity is key to regulating secondary metabolites [104].
RRBS utilizes a methylation-sensitive restriction endonuclease to cleave unmethylated DNA into fragments enriched in high GC-density CpG sites.Following additional processing and selection steps, these fragments undergo bisulfite conversion, PCR amplification, and sequencing [96].RRBS is extendable for ecological experimental designs, applicable to organisms without a reference genome, and exhibits higher resolution compared to previous marker-based methods [105].Platt et al. employed RRBS in Quercus lobata, revealing stronger population differentiation at SMPs than SNPs, suggesting epigenetic heritability [106].RRBS methylation quantification has been widely employed in large-scale sample analyses of plant methylation profiles, providing evidence for Epigenome-Wide Association Studies (EWAS) [107].Schmitz et al. conducted RRBS research on 83 soybean Recombinant Inbred Lines (RILs) and their parents, aiming to identify patterns of methylation variation and heritability.The study sought to gain a deeper understanding of how methylation variation contributes to phenotypic diversity [108].
Studying the methylation of mammalian cells helps regulate genes, better understand diseases, and direct the creation of treatments.In order to ensure genomic integrity, silence genes, and promote development, DNA methylation is essential [109].The analysis of 15,000 samples from 348 mammal species revealed a close link between DNA methylation patterns and genetic evolution [110].Longer-lived species exhibit distinct methylation peaks and valleys in their genomes [111].Furthermore, researchers can learn more about the function of methylation in gene regulation by comparing methylation patterns under various physiological or pathological conditions.DNA methylation in mammals may be one of several elements preventing cancer in large, long-lived species [112].For example, by suppressing repetitive DNA elements that threaten genome integrity or by limiting the developmental plasticity of differentiated cells [113].

Evolutionary Analysis of the Origin of Species
Through the lens of comparative genomics, the emergence of new genomic resources made possible by the development of genome sequencing technology has not only improved our comprehension of the developmental processes in land plants but also revealed the ancient evolutionary origins of plants.Using genome evolution analysis and transcriptome sequencing in a phylogenetic context, some fascinating discoveries have been made.Recent findings have shown that among the existing land plants (embryonic plants), there are a large number of novel species of vascular plants that are all related to a common ancestor [114].
Notably, a comparative analysis of the genomes of deciduous and evergreen trees has produced some interesting results.Interestingly, Siberian Larch (Larix sibirica) has been found to harbor prominent expression of the genes regulating EXL2 and DRM1 proteins, while evergreen trees have been found to harbor an overabundance of genes regulating immune receptor proteins [115].The advancement of sequencing technology has not only produced essential references for interpreting the genomes of Hordeum vulgare and Triticum aestivum, but it has also established a key framework for addressing outstanding questions regarding the genomics of the domestication of wheat and barley [116].The genome of a polyploid cultivar of Chrysanthemum morifolium was successfully deciphered in a recent study, which provides the first report of a segmental allo-polyploid genome worldwide and provides extensive insight into the history of breeding and origin of cultivated Chrysanthemum morifolium [117].The genome of Chimonanthus praecox offers insights into the molecular mechanisms that drive magnolia evolution and petal color development [118].Moreover, by the utilization of large-scale genetic data and complex data analysis techniques, the research has fully resolved the mysteries surrounding the genesis, domestication, and migration of grapes.By addressing many debates within the scholarly grape community, this research has created a cohesive viewpoint on the origin and migration of grapes.Consequently, the current storyline found in grape research textbooks has been updated [119].Recent research initiatives have drawn special attention to the in-depth historical account of the careful human selection and nurturing of individual wheat grains in a variety of environmental conditions.This thorough investigation has brought to light its importance as a valuable collection of genetic variants and as a crucial tool in the field of wheat breeding techniques [120].

Biodiversity
The first reference genome assembly for the high-extinction-risk Qinling serow has been made possible in large part by the use of HiFi sequencing technology in the field of animal conservation.This historic achievement has since laid the groundwork for investigating the evolutionary history of the Qin-ling serow and clarifying the fundamental causes of its elevated extinction risk [121].Technological innovation resulting from the genetic revolution has profound consequences for the conservation of plant resources and their inherent diversity, as well as for the protection of animal resources [122].Noteworthy research has underscored the capacity of individual plant genes to exert influence over the species diversity within entire ecosystems [123].Beyond its impact on the diversity of other species, plant sequencing carries profound ramifications for the internal diversity of plant species themselves, as well as their conservation endeavors.
Genome sequencing is becoming a powerful method for identifying genetic variety in plant populations.One such instance is the correlation between comprehensive wholegenome SNP data and phenotypic information obtained from ginkgo seed nuclei.In order to preserve and utilize the domesticated germplasm of this living fossil plant, this integration has effectively identified correlations between 54 SNPs and a variety of ginkgo properties [124].Furthermore, an insightful theory regarding the development of plastid genome diversity, which is influenced by intricate interactions with the nuclear genome has been put forth by a study that harmonizes Illumina and PacBio sequencing datasets [125].
Prominent accomplishments in this field include the painstaking assembly of the hawthorn genome, which has yielded new understandings of the dynamics of the hawthorn plant variety and its evolutionary adaptation.As such, this accomplishment serves as a crucial point of reference for future post-genomic investigations in the hawthorn genus [126].Researchers who have used genotype-by-sequencing (GBS) technology have examined the germplasm of four different Brassica oleracea subspecies in detail.This meticulous project has shown that the allelic diversity of the original broccoli species is higher than that of its hybrid equivalents by a ratio of 4.8, highlighting the possibility of maintaining this variety to improve the quality of broccoli [127].Notably, the fields of genomics and species conservation have been effectively linked by the invention of seedeR, a predictive tool that makes use of genetic offsets and allele frequency transformation functions.This instrument can predict the genetic similarity between known sources of germplasm and particular target locus [128].
Furthermore, the application of genome sequencing has become a powerful tool for protecting threatened plant species.This species is in danger of going extinct because of adaptive gene degradation linked to stress responses and habitat expansion, which has been brought about by the Circaeasteraceae Ranunculales' dependence on stress-free habitats, as revealed by the genetic analysis [129].A thorough re-sequencing analysis has revealed that climatic change plays a critical role in Davidia involucrata's susceptibility.As such, climate sensitivity is one of the most important factors that must be carefully taken into account in the effort to protect this species [130].The critically endangered Rhododendron griersonianum genome was sequenced using PacBio and Illumina technologies, along with Hi-C-assisted genome assembly and population genetics analysis.This case study serves as an example of how important it is for conservation initiatives to limit opportunities for inbreeding in order to ensure the maintenance and perpetuation of the population [131].Ostrya rehderana's re-sequencing has provided complementary findings that highlight the urgent need to prevent inbreeding from causing sharp declines in population genetic diversity, as this constitutes a concealed threat to the species' capacity to adapt to shifting environmental conditions [132].

Abiotic Stress and Biotic Stress
Pennisetum glaucum's pangenome construction has produced important genetic resource information for future study in related domains.Furthermore, a thorough comprehension of the molecular processes underlying this species' ability to withstand heat has been attained, providing insight into the role that structural variations play in heat stress reactions [80].A total of 102 genotypes of Zea mays were resequenced under both control and heat stress conditions.The results showed alterations in gene expression and genomic regulatory regions associated with responses to heat stress [133].Furthermore, the examination of reference genome sequences from both farmed and wild varieties of Phaseolus vulgaris revealed insights into the mechanisms underlying their ability to recover from moderate heat stress and the decrease in the reservoir of genes associated with disease resistance [134].

Genes/QTL and Plant Stress
One or two key genes/QTL that give high resistance often govern resistance.For plants to respond to biotic and abiotic challenges, gene/quantitative trait locus (QTL) must be identified, localized, and stacked [135].A basis for comprehending significant phenotypic and genetic linkages connected to early-stage drought resistance in Triticum aestivum was established by GWAS and BPP QTL co-mapping [136].Rice resilience is increased by the enhanced Tapaswini rice variety, which has six gene/QTL for resistance to both biotic and abiotic stressors and four BB-resistant genes stacked [137], utilizing conventional gene mapping techniques to find and locate disease-resistance genes in order to create high-yielding, stress-tolerant Pisum sativum cultivars [138].Creating molecular genetic markers and applying these markers to QTL analysis is a method that is being used more and more frequently in crop breeding programs to enable complicated quantitative trait selection [139].It is now feasible to quickly and effectively create DNA markers, fine map, and identify candidate genes for biotic resistance in mung beans thanks to the reference genome sequence of the bean and modern advanced sequencing technologies [140].

DNA Methylation and Plant Stress
Plants exhibit three distinct forms of methylation, namely mCG, mCHG, and mCHH [119].While WGBS is not appropriate for repeating sequences, it is mostly used to detect cytosine methylation.Methylation detection may be impacted by bisulfite treatment and DNA degradation.Sequential methylation can be directly detected by Nanopore and Pacbio SMRT sequencing, which works well for repetitive sequences [84].
DNA methylation is a highly significant epigenetic modification [141].Unexpectedly, DNA methylation controls the expression of some plant defense genes in response to biotic stress.Distinctive differential methylation patterns can be triggered by various stress circumstances [142].DNA methylation influences chromatin accessibility and histone modifications, which in turn controls the expression of nearby and distant genes throughout the domestication process of rice.Domesticated rice experiences a decrease in the DNA methylation linked to stress tolerance, but weedy rice, which can withstand more severe stress, may show an increase in this link [143].The drought response regulatory network of Fragaria nilgerrensis was revealed by means of a thorough examination of gene expression profiles, whole-genome DNA methylation maps, and physiological parameters at four distinct time points under drought stress treatment [144].The methylation-sensitive amplified polymorphisms (MSAP) technique is a key technology for studying the dynamic changes in DNA methylation [145].A study that looked at how cold stress affected the DNA methylation in maize seedlings discovered that a quick and beneficial epigenetic response of the plants to the stress is the demethylation of particular genes, which advances our knowledge of their adaptive mechanisms [146].In gymnosperms, high levels of methylation enable the cycad to maintain stability and integrity throughout its evolutionary process [147].DNA methylation studies show that genes linked to seed growth, such as those involved in lipid and cell wall synthesis, are found in demethylated regions of ginkgo seed genomes.These modifications may also have an impact on the production of energy [8].In the genome of Pinus tabuliformis, there are numerous repetitive sequences, and significantly expanded gene families are primarily associated with stress responses [148].A significant portion of these sequences are transposons that come from archaic viruses and can be harmful to the genome.Chinese pine therefore depends on high methylation levels to inhibit their function [149].In ginkgo, differential expression of DNA methylation-related genes may be related to the gender determination of ginkgo leaves [150].In Pinus radiata, high-temperature stress results in reduced methylation in embryogenic callus and somatic plants [151].

The Synthesis of Secondary Metabolites in Plants
An increasing number of plant genomes and pan-genomes have been thoroughly characterized thanks to the continuous progress in sequencing technology.Comprehensive datasets like these are important resources for revealing the genetic foundations of metabolic variation [152].Utilizing Pueraria lobata as a case study, researchers employed PacBio and Hi-C sequencing methodologies to attain a superior genome assembly, thereby revealing its complex attributes.The biosynthesis pathways of essential secondary metabolites have been studied using multi-omics techniques, providing insights into resource efficiency and the possibility of improving genetics through breeding [153].Researchers have successfully used multi-omics techniques to shed light on the intricate biosynthesis of terpenoids and flavonoids found in the dimorphic floral structures of the biennial Sinoswertia tetraptera that thrives in high-altitude domains [154].ONT and Illumina sequencing.along with sophisticated assembly techniques, were used in the Catharanthus roseus investigation.This all-inclusive method investigated chromatin interactions within a well-assembled genomic structure using Hi-C data.The results showed how different metabolite synthesis patterns are guided by chromosomal conformation, which controls the expression of genes specific to organs [155].

Current Status of Gene Sequencing in Gymnosperms
Only in the last ten years have entire genome assemblies for gymnosperms been accomplished, owing to their remarkably enormous genome sizes [7].Early in 2013, the Pinus taeda assembly was first suggested, representing the first gymnosperm species genome draft [156].With the advancement of technology, an increasing number of gymnosperm whole genomes have been sequenced.Below is a list of currently available gymnosperm whole genome assemblies (Table 3).Gymnosperms often show paternal inheritance for chloroplasts (pollen flow and trace gene flow direction) and maternal inheritance for mitochondria (seed flow and population migration history).Currently, Illumina sequencing is used to sequence the genomes of both chloroplasts and mitochondria in the majority of gymnosperms [173].The chloroplast genome provides insights into the exploration of the evolution and relationships among gymnosperms.The recent phylogenetic relationship between Tsuga longibracteata and Tsuga chinensis is of particular interest [174].An increasing number of gymnosperms have been the subject of chloroplast genome research, such as Cycas Szechuanensis [175], Cycas ferruginea [176], Ginkgo biloba [177], and Podocarpus imbricatus [178].Large repeat sequences play a crucial role in the evolution and recombination of chloroplast genomes in gymnosperms, as confirmed in chloroplast genome studies of Picea schrenkiana [179], Picea rubens [180], and Pinus wangii [181].Cycas hongheensis is ranked as an extremely endangered species by the International Union for Conservation of Nature (IUCN) Red List of Endangered Species.The chloroplast genome will facilitate the investigation of the phylogeography of Cycadaceae plants and enable more comparative research of chloroplast genomes within the Cycadaceae family [182].Combining PacBio long-read sequencing data with Illumina short-read sequencing data from a single haploid megagametophyte allowed for the assembly of the first mitochondrial genome of Abies alba.Analysis of the mitochondrial genome sequencing showed significant structural and compositional heterogeneity [183].

Gene Sequencing Technology and Endangered Gymnosperms
The national key protected wild plant list, as of 2021, includes seven gymnosperm families: Cycadaceae, Ginkgoaceae, Podo-carpaceae, Cupressaceae, Taxaceae, Pinaceae, and Ephedraceae.Based on the examination of DNA sequences and information from 115 microsatellite locus (SSR), certain genetic characteristics have been identified in Cycas simplicipinna, an endangered species in the Cycadaceae family.These include a marked genetic structure, a notable level of genetic variation among groupings, and recent population decreases.These findings can provide guidance for the conservation of this endangered species in order to prevent its extinction [162].After resequencing 545 ginkgo genomes worldwide, three refugia and the ancient genetic components of ginkgo were discovered.This has made it feasible to draw inferences about the genetic composition and interactions amongst ginkgo populations, providing invaluable genomic resources for addressing a variety of issues related to living fossil species [163].Genomic research has uncovered the strategies used by ginkgo biloba to expand its genome and has also identified important genes involved in the formation of the seed flagellum.With the exception of ginkgo and horsetail plants, these genes have vanished from all seed plants, providing insight into the evolution of gymnosperms [146].Pherosphaera hookeriana, a plant of the Podocarpaceae family, is endangered.Utilizing SSR markers as a tool to assess genetic diversity and organization, the findings revealed evidence of a post-glacial genetic bottleneck in P. hookeriana.Nonetheless, some of its diversity has survived, thanks, in part, to the maintenance of gender differences, geographic range, and population continuity after the ice age.One of the biggest challenges to the survival of this species is still preventing fires [164].In order to better understand Cupressus chengiana, 884.82 high-quality SNPs from 266 different Cupressus chengiana samples were identified, identifying this endangered species in the Cupressaceae family.Analyzing the population genetics of the species across its whole range was the aim of this assessment.In population genomics, high-throughput sequencing (HTS) can reconstruct the complex evolutionary history of mountainous species that are endangered, providing crucial data for conservation initiatives and advancing our comprehension of the enormous biodiversity found in mountainous regions [184].
Genetic diversity among wild populations of Taxus wallichiana was measured using Random Amplified Polymorphic DNA (RAPD) and Amplified Fragment Length Polymorphism (AFLP).The study's findings demonstrated a rough correlation between population distribution patterns and genetic differentiation.The optimal conservation strategy for T. wallichiana is to safeguard all populations off-site while also safeguarding select populations on-site.APD and AFLP, or amplified fragment length polymorphism and random amplified polymorphic DNA, were employed to quantify genetic variation in Taxus wallichiana wild populations [166].The study's findings demonstrated a rough correlation between population distribution patterns and genetic differentiation.The optimum conservation strategy for T. wallichiana is to protect all populations off-site while also safeguarding certain populations on-site [162].Using Direct Amplification of Mini-Satellite DNA (DAMD) and Inter-Simple Sequence Repeat (ISSR) markers, the genetic variability and population structure of Ephedra foliate were evaluated [185].The results of the study imply that the high levels of genetic differentiation and moderate gene flow seen in the species may be caused by elements such as geographic isolation, regional climate conditions, overexploitation, and incorrect seed harvesting [186].

Discussion
First-generation sequencing techniques are no longer as useful in modern genome sequencing projects; they are now mostly useful for smaller-scale sequencing jobs like point mutation detection and clone verification [187].Especially in more specialized applications, conventional methods such as Sanger sequencing, TA cloning, and online tools for DNA methylation detection have been used [188].
The rising use of second-generation sequencing can be ascribed to its benefits, which include high throughput and affordability.The read lengths of second-generation sequencing are often shorter, varying between 35 and 700 base pairs.Along with Insertions/Deletions (InDels), these short-read sequencing methods have been essential in advancing innovative reference genome assembly, deciphering the complexities of population structure, and identifying SNPs [189,190].
Third-generation sequencing systems, on the other hand, such as PacBio and Oxford Nanopore, provide long-read sequencing technologies that can reach thousands of base pairs, surpassing the limitations of short-read sequencing and providing researchers with creative alternatives.By avoiding the biases associated with reverse transcription and amplification, long-read sequencing technology allows for the direct sequencing of individual molecules, opening up new avenues for functional genomics research in the field of plant biology [48].
Through the combination of Hi-C and Third-Generation Sequencing (TGS), recent studies have successfully assembled Telomere-to-Telomere (T2T) genomes for a number of plant species [191].The research team led by Dr. Su Xiaohua used a hybrid strategy that combined second and third-generation sequencing technologies to achieve a reliable chromosome-level genome assembly for Populus Koreana [192].
The selection of different sequencing technologies holds the potential for beneficial outcomes across multiple fields of study.A software tool has been developed for designing Small Guide RNAs (sgRNAs) applicable to all sequenced species [193].Additionally, a novel plant gene functional annotation software, GFAP (3.8 version), has been introduced, demonstrating the ability to efficiently annotate more than 2000 genes with GO, KEGG, and Pfam information in just 4.5 s.This efficiency surpasses the performance of current mainstream functional annotation tools [194].The future is expected to witness the emergence of additional software solutions tailored to gene sequencing, facilitating enhanced research endeavors for scholars in the field.
Plant genome sequencing still has certain challenges, especially when it comes to creating comprehensive, intricate, and pan-genomes for plants.Large genome sizes, heterozygosity, and polyploidy are only a few of the factors that continue to be very difficult.Plant genome decoding is a challenging task because of the enormous variations in genome size [195].For instance, the genome of the lily family's Fritillaria exceeds 100 Gb, despite spiral algae (Spirogyra) having only 63 Mb.This discrepancy in size is partly explained by the presence of large amounts of non-coding DNA sequences [6].
While cross-species comparative genomics is becoming more and more popular, access to many population genomics datasets is limited, which hinders additional research.In order to encourage data sharing, researchers in the field of population genomics are calling for a more transparent and cooperative approach [75].Furthermore, the discourse presently encompasses perspectives on the preservation endeavors and utilization of gene sequencing in gymno-sperms, tackling their distinct obstacles and inputs to the wider domain [73].

Conclusions
To summarize, the increasing adoption of genome sequencing technology has emerged as a crucial pillar of modern plant science.The ongoing development of sequencing methods portends a time when plant genome sequencing precision will rise to previously unheard-of levels, providing dramatic improvements in the reliability and quality of data.This trajectory has the potential to offer more precise and all-encompassing support to researchers, enabling the investigation of the complex structure, functional dynamics, and composition of plant genomes.
Advances in technology are shedding light on mysterious features found in plant genomes and providing a greater knowledge of genetic information.These significant realizations provide a strong scientific basis for tackling pressing issues in ecology, agriculture, and environmental preservation.With the advancement of genome sequencing technology, important aspects such as stress tolerance, plant adaptability, evolutionary processes, and the complex interactions of genetic diversity will soon be explored.Consequently, this creates a strong basis for upcoming sustainable development projects.
Throughout this evolutionary process, a clear pattern appears that points to the widespread use and ongoing improvement of plant genome sequencing.The possibility of accurately modifying plant genomes is enhanced by the continuous development of various gene editing and synthetic biology technologies, opening up new avenues for breeding and development.Concurrently, techniques controlling the analysis and interpretation of genome sequencing data are constantly improved and optimized, guaranteeing the precise and significant insights that are extracted from large datasets.
Recognizing the unique contributions made by genome sequencing technology to the preservation and use of genetic data in gymnosperms is crucial in this regard.The use of gene sequencing methods in gymnosperm conservation raises the bar for the profession and emphasizes how crucial it is to comprehend and maintain the genetic diversity present in these ancient plant species.To summarize, genome sequencing technology is going to have a significant and long-lasting influence on the direction of plant science, offering solid and essential backing for investigating mysterious features of the plant kingdom.With technology developing at a breakneck pace, our understanding of plant genomes will inevitably grow, providing a solid scientific foundation that will enable humanity to map out a brighter, more enlightened future.

Figure 1 .
Figure 1.The development history of gene sequencing technology.

Figure 2 .
Figure 2. The fundamental principle of Sanger sequencing.Different colors represent different types of nucleic acids.

Figure 2 .
Figure 2. The fundamental principle of Sanger sequencing.Different colors represent different types of nucleic acids.

Figure 4 .
Figure 4.The principle of third−generation sequencing technology.

Figure 4 .
Figure 4.The principle of third−generation sequencing technology.

Plants 2023 ,
12, x FOR PEER REVIEW 9 of 27 Compact and lightweight device.

Figure 5 .
Figure 5.The application of genetic sequencing technology.

Figure 5 .
Figure 5.The application of genetic sequencing technology.

Table 2 .
Comparison of the Advantages and Disadvantages of PacBio HiFi and ONT.

Table 3 .
List of currently available gymnosperm whole-genome assemblies.