Next Article in Journal
Anthropomorphizing with Critical Reflexivity: The Danger and Potential of Anthropomorphizing in Equine-Facilitated Learning and Psychotherapy
Previous Article in Journal
Integrated Microbiome and Metabolomics Analysis Reveals Altered Aggressive Behaviors in Broiler Chickens Showing Different Tonic Immobility
Previous Article in Special Issue
Resequencing Composite Kazakh Whiteheaded Cattle: Insights into Ancestral Breed Contributions, Selection Signatures, and Candidate Genetic Variants
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Chromosome-Level Genome Assembly of the Meishan Pig and Insights into Its Domestication Mechanisms

National Key Laboratory for Swine Genetic Improvement and Germplasm Innovation, Ministry of Science and Technology of China, Jiangxi Agricultural University, Nanchang 330029, China
*
Author to whom correspondence should be addressed.
Animals 2025, 15(4), 603; https://doi.org/10.3390/ani15040603
Submission received: 3 January 2025 / Revised: 14 February 2025 / Accepted: 18 February 2025 / Published: 19 February 2025
(This article belongs to the Special Issue Genomic Prediction in Livestock)

Simple Summary

This study assembled a high-quality chromosome-level genome of the Meishan pig using Illumina and PacBio sequencing technologies. By integrating this new genome with existing pig genome assemblies, we contribute to developing a pig pan-genome, facilitating research on structural variations. Selective sweep analysis between the Chinese wild boar and the Meishan pig revealed key candidate genes, such as TBX19 and PGR, linked to domestication traits. These findings provide valuable genetic resources for studying Meishan pig biology and offer a foundation for improving breeding strategies and conservation efforts.

Abstract

Pigs are essential agricultural animals, and among the various breeds, the Meishan pig, a native breed of China, is renowned for its high reproductive performance. This breed has been introduced to many countries to enhance local pig breeding programs. However, there have been limited genomic and population genetics studies focusing on Meishan pigs. We created a chromosomal-level genomic assembly using high-depth PacBio sequencing and Illumina sequencing data collected from a Meishan pig. Additionally, we analyzed whole-genome sequencing (WGS) data from Chinese boars and Meishan pigs to identify domestication selection signals within the Meishan breed. The assembled genome of the Meishan pig (MSjxau) was found to be 2.45 Gb in size, with a scaffold length of 139.17 Mb. The quality value was 37.06, and the BUSCO score was 96.2%, indicating good completeness, continuity, and accuracy. We annotated transposable elements, segmental duplication, and genes in the MSjxau genome. By combining these data with 28 publicly available genomes, we provide a high-quality structural variants resource for pigs. Furthermore, we identified 716 selective sweep intervals between Chinese wild pigs and Meishan pigs, where the selected gene PGR may be linked to the high fertility observed in Meishan pigs. Our study offers valuable genomic and variation resources for pig breeding and identifies several genes associated with the domestication of the Meishan pig. This lays the groundwork for further investigation into the genetic mechanisms behind complex traits in pigs.

1. Introduction

The pig (Sus scrofa) is not only one of the most important farm animals in the world, but also a near-perfect biomedical model animal [1,2]. Genomics and population genetics studies are critical to understanding the genetic mechanisms of complex traits in pigs [3,4]. The advent of low-cost second-generation sequencing technology has led to a rise in large-scale whole genome sequencing (WGS) based population genetics studies in pigs, with several recent research efforts involving sample sizes of over 1000 [5,6,7]. However, in the field of swine genetics, only a limited number of causal genes or variants have been thoroughly identified, leaving the genetic mechanisms behind most phenotypic traits still largely unresolved [8,9]. One reason is that the NGS sequencing data cannot accurately determine the genotype of all genetic variants due to sequence duplication or sequencing bias, resulting in a loss of heritability [10,11]. Advances in long-read sequencing technologies and assembly algorithms have revolutionized genome assembly [12,13,14], enabling the construction of chromosome-level assemblies for multiple pig breeds [15,16,17]. Long-read sequencing platforms, such as PacBio and Oxford Nanopore, combined with Hi-C scaffolding and optical mapping, have significantly improved contiguity and completeness, leading to high-quality genome assemblies for breeds like the Bamaxiang [18], Ningxiang [19], Luchang [20], and Korean Nanchukmacdon pigs [21]. These resources have provided insights into structural variations (SVs), evolutionary history, and trait-associated genes [22,23]. To date, only 39 high-quality pig genomes have been reported in the NCBI database, representing 29 different pig breeds. The genomic resources available for pig populations are still limited in comparison to those for humans [24].
The domestication of pigs in China dates back over 9000 years, and indigenous breeds such as the Meishan (MS) pig are the most well-known domesticated pigs in the Taihu Lake region. Meishan pigs are renowned for their high reproductive efficiency, early maturity, and superior meat quality, making them one of the most important maternal lines in modern pig breeding [25]. Although a chromosome-level genome of the Meishan pig has been reported previously, this resource primarily represents individual animals [26], and little is known about the genetic structure at the population level. Additionally, while the domestication history of pigs from wild boars (Sus scrofa) has been studied [27,28], the genetic mechanisms that underlie the domestication of the Meishan breed are not well understood [29]. Understanding the genetic differences between Meishan pigs and Chinese wild pigs, particularly in terms of adaptation and selective breeding, is critical for both conservation and breeding efforts aimed at enhancing desirable traits [30].
Recent advances in structural variant (SV) research have highlighted the importance of population-level SV catalogs as essential resources for understanding genomic diversity and its functional implications [31]. High-quality, population-scale genome assembly data are essential for accurately detecting of large fragments and complex structural variants, which significantly contribute to the genetic variation underlying complex traits and diseases [32,33,34]. Furthermore, the algorithms like Graphtyper [35] and Paragraph [36] enable graph genome-based structural variant typing of NGS data with a sample size from hundreds to millions. Recently, a study of porcine graph-based genome analyzed 11 high-quality pig genomes [37], which may not be entirely representative of the rich genetic diversity of pigs; therefore, it is important to construct structural variant sets based on larger genomic datasets.
Here, we aim to generate high-quality chromosomal level genome assembly for Meishan pigs using a comprehensive approach combining PacBio long read sequencing and Illumina short read sequencing. The genome generated in this study is an important complement to existing pig genome resources. At the same time, we also reveal the selective domestication signal between Meishan pigs and Chinese wild boars based on the large-scale and high depth WGS resources. Our findings will contribute to a broader understanding of the genetic mechanisms of pig domestication and adaptation, with implications for breeding and conservation strategies.

2. Materials and Methods

2.1. Sample Collection

A male Meishan pig from Shanghai, China, was used for de novo whole-genome assembly. Ear tissue samples from this individual were collected, immediately frozen in liquid nitrogen, and then stored at −80 °C until DNA extraction. All animal procedures were conducted in accordance with the “Guidelines for the Care and Use of Laboratory Animals” (GB/T 27416-2014, Quality and Capability Requirements for Laboratory Animal Institutions) and were approved by the National Standards of the People’s Republic of China as well as the Ethics Committee of Jiangxi Agricultural University.

2.2. DNA Extraction and Sequencing of the Genome

Genomic DNA was extracted from ear tissues using Qiagen’s DNeasy Blood & Tissue Kit (Manufactured by Qiagen, Hilden, Germany). DNA concentrations were measured using Qubit 3.0 Fluorometer and Nanodrop-1000 instrument (Thermo Scientific, Waltham, MA, USA), while DNA purity and integrity were assessed by agarose (0.8%) gel electrophoresis. Paired-end genomic DNA libraries with insert sizes of 300–400 bp were constructed using the NEBNext® Ultra™ DNA Library Prep Kit for Illumina (NEB, San Diego, CA, USA) according to the manufacturer’s instructions and sequenced on an Illumina Novaseq6000 platform using the paired-end 2 × 150 bp mode at Novogene Biotech, Beijing, China. FastQC [38] was used to assess the quality of the original raw data, and fastp [39] was used to remove the sequencing connectors and delete low-quality, single-ended reads to form clean Illumina data. The data sizes were 192.16 Gb, and the average sequencing depth was about 74x.
Full-length genomic DNA were extracted from ear tissue based on the standard operating manual from PacBio official. Femto Pulse System and Qubit 3 Fluorometer were used to assess the quality of DNA sequence. An above 10 kb SMRTbell CLR (continuous long reads) DNA library was constructed following the manufacturer’s protocol and then sequenced on 2 SMRT (Single Molecule Real Time) cells within a PacBio Sequel platform (Pacific Biosciences of California, Menlo Park, CA, USA) with P6/C4 chemistry at Novogene Biotech, Beijing, China. On average, this generated 23.85 million subreads, with a subread N50 of 11.73 kb. Please note that the average sequencing depth was about 87x.

2.3. Genome Size Estimation and De Novo Assembly

We performed quality control on Illumina data using fastp, and then used KMC (v3.2.4) [40] software to construct a k-mer frequency distribution map with k = 21. Based on the above k-mer data, we used Genomescope2 [41] software to estimate the genome size and heterozygosity of the Meishan pig.
The de novo genome assembly was performed using PacBio subreads with the Flye (v2.9.5) [42] software. The preliminary assembly was polished using the filtered Illumina paired-end reads via Pilon [43]. One round of iterative error correction was conducted to ensure the accuracy of the assembly. As a result, highly accurate contigs were identified. These high-accuracy contigs were then mounted to the chromosome level based on Sscrofa11.1 using RagTag [43], successfully constructing the chromosome-level Meishan pig genome. Subsequently, gaps in the chromosomes were filled using long PacBio reads (>10 kb) with TGS-GapCloser (v1.2.1) [44]. Finally, a final round of polishing was performed with second-generation data, completing the entire genome assembly process.

2.4. Genome Quality Assessment

To assess the integrity and accuracy of the newly assembled Meishan pig genome, we performed the following validation steps. First, we aligned the whole-genome sequencing short reads of the Meishan pig to the genome using BWA software(v2.2.1) [45] to estimate the accuracy of the single-base assembly. Additionally, we evaluated the quality value (QV) scores for each chromosome using yak. We also assessed the quality of the generated genome using BUSCO (v5.0.0) [46] software based on the mammalia_odb10 lineage dataset (created on 20 November 2019). Finally, we compared the synteny between MSjxau and Susscrofa11.1 using doplotly.

2.5. Distribution of Repetitive Elements and Genome Annotation

We used RepeatModeler (v.2.0.1) [47] to construct a de novo repeat sequence library, which was then merged with a pig-specific repeat sequence library extracted from the RepeatMasker [48] repeat database. Then, using default parameters and the constructed repeat sequence library, we annotated the repetitive elements in the assembly using RepeatMasker (v.4.0.5). The repeat landscape was visualized using tools provided by RepeatMasker, specifically the scripts calcDivergenceFromAlign.pl and createRepeatLandscape.pl. Additionally, we annotated the fragment repeats in MSjxau using seadef, filtering the results based on the following criteria: SD fragment size of at least 1 kb, and sequence similarity among two SDs greater than 90%. Furthermore, we used liftover [49] to map the gene annotation information from Susscrofa11.1 v110 to MSjxau, annotating a total of 21,042 genes.

2.6. Genomic Structural Variation Analysis

We downloaded nanopore/PacBio sequencing data for 11 pigs from public databases [50,51] and assembled the data into contigs using Flye [42]. Additionally, we obtained genome assembly data for 17 pigs based on third-generation sequencing data from public sources [20,26,52,53,54,55]. We aligned the MSjxau genome and the 28 aforementioned genomes to the Sscrofa11.1 reference genome, and used DipCall [56] to detect structural variations (SVs) in each genome. The structural variants from all individuals were then merged using JasmineSV [57] to form a non-redundant SV set. DipCall also reports small fragments of SNPs and indels between genome comparisons. Therefore, we used SnpEff [58] to annotate the SNPs and indels between the MSjxau and Sscrofa11.1 genomes.

2.7. Domestication Signal Analysis of Meishan Pigs

We downloaded whole-genome sequencing (WGS) data from 44 Meishan pigs (MS) and 45 wild boars from public databases, with sequencing depths of at least 9x, which allows for accurate identification of genetic variations. We scanned the entire genome for domestication regions using a 20 kb sliding window with a 10 kb step size. We calculated the average Fst and nucleotide diversity (Pi) within each 20 kb window using vcftools [59]. Significant differentiated regions were defined as the top 1% of Fst regions, which also had a Pi(wild boar) to Pi(Meishan pig) ratio greater than 2. GO functional enrichment analysis was performed using the R package clusterProfiler [60].

3. Results

3.1. Genome Assembly and Evaluation of the Meishan Pig

With the assistance of high-depth PacBio CLR and NGS sequencing data, we de novo assembled the Meishan pig genome (MSjxau). The detailed genome assembly workflow is shown in Figure 1A. We generated a total of 227.88 Gb of raw PacBio data (with a read N50 of 11.73 Kb), corresponding to a sequencing depth of 87.64x, and 192.16 Gb of whole-genome NGS data, with a sequencing depth of 73.91x. We estimated the MS genome characteristics using Illumina data, and under a K-mer threshold of 31, the chromosome size was estimated to be 2.24 Gb with a heterozygosity rate of 0.689% (Figure 1B), which is close to the size of the Sscrofa11.1 genome. Using PacBio CLR reads, we assembled the MS contigs with Flye, resulting in a total length of 2,443,415,892 bp, consisting of 1921 contigs, with a contig N50 size of 15,622,572 bp. After scaffolding with Ragtag, gap filling using TGS data, and polishing with NGS data, we obtained the final MSjxau genome assembly. The MSjxau genome is 2,447,343,390 bp in size, with 98.42% of the sequences anchored to 20 chromosomes, and the scaffold N50 is 139.17 Mb (Table 1).
We conducted a comprehensive evaluation of the quality and completeness of the MSjxau genome assembly using multiple approaches. First, mapping with minimap2 revealed that 90.55% of the PacBio reads could be remapped to MSjxau, with regions supported by at least 15 PacBio reads covering 100% of the MSjxau genome. Similarly, 98.2% of the Illumina reads could be remapped to MSjxau, with regions supported by at least 15 Illumina reads covering 99% of the genome. Secondly, the Meishan pig assembly demonstrated overall high base accuracy. The base accuracy of the genome was estimated using Illumina read data, with a QV score of 37.06. The QV scores for each chromosome ranged from 30.09 to 37.80, indicating good accuracy. Furthermore, each chromosome was composed of several large contigs, suggesting good continuity (Figure 1C). Finally, BUSCO analysis of genome completeness showed that approximately 96.2% of the core conserved mammalian genes were present in the Meishan pig assembly, exceeding the 95.8% found in the Sscrofa11.1 reference genome (Figure 1D). This also indicates that the genome assembly is close to complete. These results highlight the superior quality and potential applications of the MSjxau assembly in genomic research.

3.2. Genome Annotation of the Meishan Pig

Based on the gene annotation file of the reference genome Sscrofa11.1, we used Liftoff to annotate 21,472 protein-coding genes in MSjxau, with an average of 2.1 transcripts per gene. Additionally, we observed that regions with high gene density in the genome generally also exhibited higher GC content (Figure 2A). We also annotated repeat elements in the Meishan pig assembly. Overall, 42.06% of the regions in the Meishan pig assembly were marked as repetitive. Consistent with previous studies, LINEs were the most abundant repeat type, accounting for 19.40% of the Meishan pig assembly, followed by SINEs at 13.84%, LTRs at 4.72%, and DNA transposons at 2.33% (Figure 2B). We also annotated SD sequences in MSjxau, identifying a total of 56,257,287 SD sequences, of which 37.82% had repeat units greater than 10 kb. Furthermore, we compared the synteny between MSjxau and Sscrofa11.1 using dopplotly, revealing a good level of sequence consistency (Figure 2C), which supports the overall accuracy of the assembly. However, there were also 10,634,385 SNPs and 2,565,379 indels between the two genomes. Among these, 41,477 SNPs were predicted by SnpEff to be harmful mutations, and 4856 indels were predicted as frameshift variants. These variations may represent significant phenotypic differences between MS pigs and Duroc (DRC) pigs.

3.3. Comparison of Genomic Structural Variation

The MSjxau genome, along with 28 other publicly available genomes, was compared to the Sscrofa11.1 genome to detect genomic structural variations (SVs) (Table 2). Compared to Sscrofa11.1, an average of 37,440 SVs (18,658 insertions and 18,782 deletions) were identified for each genome. More SV loci were detected in the outbred pig genomes compared to the Asian and European pigs (Figure 3A). Subsequently, we merged the SVs from the 29 genomes to obtain a non-redundant set of SVs. A total of 282,372 SVs were identified, covering 78.75 Mb of the genome. We found that 11,327 SVs were shared across the Asian, European, and outbred pig genomes, while the majority of SVs were population-specific (Figure 3B). Finally, by comparing our results with recent studies on pig pan-genome SV genotyping, we identified more variation sites, with a total of 196,785 SVs. This provides additional valuable information for future SV analysis based on pan-genomes [61] (Figure 3C).

3.4. The Selective Domestication Between Chinese Wild Boar and Meishan Pig

We downloaded second-generation sequencing data for 44 Meishan pigs and 45 Chinese wild boars from public databases, and based on the Sscrofa11.1 reference genome, we identified a total of 7,671,328 genetic variants across 89 samples. Selective sweep analysis was performed using both FST (fixation index) and nucleotide diversity (π) methods [62], identifying 716 selective regions, which involved 408 protein-coding genes (Figure 4A). GO functional analysis revealed that these genes were significantly enriched in pathways related to chromatin binding, transferase activity, and protein kinase activity (Figure 4B). The candidate genes most strongly associated with Meishan pig domestication include TMEM116, TBX19, SFT2D2, PTAR1, SMCHD1, PGR, and SH3RF1. Among them, TBX19, a transcription factor involved in developmental process regulation, has been shown in several studies to be significantly associated with body size. The PGR gene, involved in multiple processes including the response to gonadotropin stimulation, plays a key role in significantly enhancing reproductive capacity, suggesting its potential link to the high reproductive traits of Meishan pigs (Figure 4C, Table 3).
Next, we collected a total of 203 high-depth NGS datasets from 16 pig breeds as part of a previous project in our lab [5]. Our analysis revealed that TBX19 and PRG-related selective haplotypes are specific to Asian pig breeds (Figure 5A,B). Notably, the PRG haplotypes are consistently present only in certain Asian breeds, such as the Erhualian pig. Interestingly, some individuals of the Landrace breed also exhibit genotypes that align with those found in the Meishan pig (Figure 5B).

4. Discussion

In this study, we constructed a chromosome level genome assembly for Meishan pigs with good accuracy, continuity, and completeness. A strong collinear relationship exists between MSjxau and the reference genome Sscrofa11.1, although structural variation and sequence differences due to different breeds are also observed. Currently, only 17 porcine chromosome level genomes have been assembled in the NCBI database using the PacBio or Nanopore sequencing data. Among these, A genome of the MS pig chromosome based on multiple sequencing techniques has also been reported recently [26]. The MSjxau we assembled is close to the quality of the genome previously assembled. The two MS genomes reflect the genetic diversity between different individuals of the same breed. Therefore, providing additional chromosome-level genome assemblies is essential for advancing domestic pig genomics research. Notably, one application of chromosomal-level genome is to construct a graphical pan-genome, which is conducive to fully and accurately mining structural variation. In this study, the MSjxau genome was integrated with 28 other genomes, and a total of 122,306 SVs were identified of which with 69.68% were new, which is significant for the analysis of using graph-based genome for genotyping of NGS data [61].
High-quality NGS data are the basis for accurate detection of population-level genetic variation. Here, we analyzed 45 CWB and 44 MS samples to precisely identify genetic variation between the two populations. Through selective sweep analysis, we identified 716 significantly differentiated regions containing numerous candidate genes under selection. Including TBX19, a gene that has been identified as being associated with the domestication history of Chinese native pigs in our previous study [63]. We also focused on the PGR gene, which encodes the progesterone receptor and plays key roles in reproductive tissues and thus impacts mammalian fertility [64]. Previous research has shown that progesterone receptor alleles derived from Neanderthals promote modern human fertility [65]. Therefore, we hypothesize that the PGR gene contributes to the high reproductive performance of the Meishan pig [66], which may have significant implications for the breeding and improvement of reproductive traits in domestic pigs. Additionally, the biological functions of selected genes are also involved in neurodevelopment, cell signaling, gene regulation, and lipid metabolism. For instance, NRXN1 is closely linked to synaptic connectivity and neural function. mutations in this gene can lead to neurodevelopmental disorders, which may be associated with the docile behavior of domesticated pigs [67]. SFT2D2 contributes to cell membrane stability and lipid metabolism [68], while SMCHD1 is associated with chromosome stability and gene silencing, playing a key role in X chromosome inactivation [69]. Furthermore, SH3RF1 affects cell growth by regulating cell signaling and protein degradation [70]. The gene referenced as TMEM116 is also crucial for cell membrane stability and material transport [71]. The functions of all these genes can significantly impact various physiological processes, particularly in how cells adapt to environmental changes.
Due to issues such as inbreeding, the genetic diversity of Meishan pigs has decreased, leading to a narrow gene pool. This reduction in genetic diversity has negatively impacted their adaptability to environmental changes and resistance to diseases [72]. Furthermore, the rampant spread of African swine fever has caused some local Chinese pig breeds to approach extinction, and the Meishan pig is facing a similarly severe situation [73]. The rapid growth rate and high feed conversion efficiency of foreign superior pig breeds (such as Landrace and Duroc) have allowed these breeds to dominate the pig farming market, further limiting the breeding opportunities for local Chinese breeds like the Meishan pig. To address this, we recommend that breeders adopt genomic selection methods to identify and further propagate individuals with exceptional performance. Specifically, emphasis should be placed on incorporating favorable haplotypes, such as those associated with improved growth and reproductive traits, including the PGR and TBX19 haplotypes identified in this study. Furthermore, government departments need to invest more in policies and funding to support the conservation of local breeds. At the same time, through policy guidance, local governments and farming enterprises should be encouraged to participate in breed conservation efforts. Efforts should also be made to promote the unique advantages of Meishan pigs, such as high reproductive capacity and strong adaptability, in order to secure a unique position in a broader market.
This study also has some limitations; for example, the MSjxau genome was scaffolded based on Sscrofa11.1 [1], without considering for large structural variations between breeds, which may have introduced a small number of structural errors. Additionally, we only achieved a chromosome-level assembly, leaving many gaps and assembly errors in highly complex regions. Several species have already completed T2T genome assemblies using HiFi sequencing technology [74,75], a standard we have not yet met. Therefore, future population-level T2T genome analysis will be an important research direction.

5. Conclusions

In conclusion, the Meishan pig genome assembly represents an important genetic resource for Chinese indigenous pigs. The high-quality chromosome-level genome enhances our ability to capture and characterize structural variations (SVs), particularly those carrying important genes. Importantly, selective sweep analysis between the Chinese wild boar and the Meishan pig revealed several key candidate genes, such as TBX19 and PGR, linked to agronomic traits. Therefore, this study provides a foundation for improving breeding strategies and conservation of domestic pigs.

Author Contributions

Conceptualization, H.D. and Z.W.; methodology, H.D. and Z.W.; software, H.D. and Z.W.; validation, H.D. and Z.W.; formal analysis, H.D.; investigation, J.H.; resources, Z.Z.; data curation, Z.W.; writing—original draft preparation, H.D.; writing—review and editing, H.D. and Z.W.; visualization, H.D. and Z.W.; supervision, Z.W.; project administration, Z.Z. and Z.W.; funding acquisition, Z.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Key Research and Development Program of China (2021YFD1200801), Major Scientific and Technological R&D Projects of Jiangxi Provincial Department of Science and Technology (Grant No. 20213AAF01010), and the Postdoctoral Fellowship Program of CPSF under Grant Number (GZC20240620).

Institutional Review Board Statement

All procedures involving animals followed the guidelines for the care and use of experimental animals (GB/T 27416-2014, Laboratory animal institutions requirements for quality and competence) were approved by the National Standard of the People’s Republic of China, and also the Ethics Committee of Jiangxi Agricultural University with number JXAULL- 2023-11-15.

Informed Consent Statement

Not applicable.

Data Availability Statement

The assembly of the MSjxau genome can be obtained from https://github.com/jxlabWzZ/MS_jxau_genome_assembly.

Acknowledgments

We sincerely thank the Laboratory for these data resources, computational resources and scientific guidance. We appreciate all colleagues in the National Key Laboratory for Swine Genetic Improvement and Germplasm Innovation, Jiangxi Agricultural University for their assistance in experiments and data analysis.

Conflicts of Interest

The authors declare no competing interests.

References

  1. Warr, A.; Affara, N.; Aken, B.; Beiki, H.; Bickhart, D.M.; Billis, K.; Chow, W.; Eory, L.; Finlayson, H.A.; Flicek, P. An improved pig reference genome sequence to enable pig genetics and genomics research. Gigascience 2020, 9, giaa051. [Google Scholar] [CrossRef] [PubMed]
  2. Ye, R.-S.; Li, M.; Chen, T.; Wei, X.-C.; Qi, Q.-E.; Cheng, X.; Li, C.-Y.; Jiang, Q.-Y.; Xi, Q.-Y.; Zhang, Y.-L. miRNAome, mRNAome and degradome analysis of Tibetan minipigs anterior pituitary. Gen. Comp. Endocrinol. 2018, 259, 104–114. [Google Scholar] [CrossRef] [PubMed]
  3. Zak, L.J.; Gaustad, A.H.; Bolarin, A.; Broekhuijse, M.L.; Walling, G.A.; Knol, E.F. Genetic control of complex traits, with a focus on reproduction in pigs. Mol. Reprod. Dev. 2017, 84, 1004–1011. [Google Scholar] [CrossRef]
  4. Chen, K.; Baxter, T.; Muir, W.M.; Groenen, M.A.; Schook, L.B. Genetic resources, genome mapping and evolutionary genomics of the pig (Sus scrofa). Int. J. Biol. Sci. 2007, 3, 153. [Google Scholar] [CrossRef]
  5. Tong, X.; Chen, D.; Hu, J.; Lin, S.; Ling, Z.; Ai, H.; Zhang, Z.; Huang, L. Accurate haplotype construction and detection of selection signatures enabled by high quality pig genome sequences. Nat. Commun. 2023, 14, 5126. [Google Scholar] [CrossRef]
  6. Du, H.; Zhou, L.; Liu, Z.; Zhuo, Y.; Zhang, M.; Huang, Q.; Lu, S.; Xing, K.; Jiang, L.; Liu, J.-F. The 1000 Chinese Indigenous Pig Genomes Project provides insights into the genomic architecture of pigs. Nat. Commun. 2024, 15, 10137. [Google Scholar] [CrossRef] [PubMed]
  7. Yang, L.; Yin, H.; Bai, L.; Yao, W.; Tao, T.; Zhao, Q.; Gao, Y.; Teng, J.; Xu, Z.; Lin, Q. Mapping and functional characterization of structural variation in 1060 pig genomes. Genome Biol. 2024, 25, 116. [Google Scholar] [CrossRef] [PubMed]
  8. Yang, H.; Wu, J.; Huang, X.; Zhou, Y.; Zhang, Y.; Liu, M.; Liu, Q.; Ke, S.; He, M.; Fu, H. ABO genotype alters the gut microbiota by regulating GalNAc levels in pigs. Nature 2022, 606, 358–367. [Google Scholar] [CrossRef] [PubMed]
  9. Ma, J.; Yang, J.; Zhou, L.; Ren, J.; Liu, X.; Zhang, H.; Yang, B.; Zhang, Z.; Ma, H.; Xie, X. A splice mutation in the PHKG1 gene causes high glycogen content and low meat quality in pig skeletal muscle. PLoS Genet. 2014, 10, e1004710. [Google Scholar] [CrossRef] [PubMed]
  10. Olson, N.D.; Wagner, J.; Dwarshuis, N.; Miga, K.H.; Sedlazeck, F.J.; Salit, M.; Zook, J.M. Variant calling and benchmarking in an era of complete human genome sequences. Nat. Rev. Genet. 2023, 24, 464–483. [Google Scholar] [CrossRef] [PubMed]
  11. Wu, Z.; Gong, H.; Zhou, Z.; Jiang, T.; Lin, Z.; Li, J.; Xiao, S.; Yang, B.; Huang, L. Mapping short tandem repeats for liver gene expression traits helps prioritize potential causal variants for complex traits in pigs. J. Anim. Sci. Biotechnol. 2022, 13, 8. [Google Scholar] [CrossRef]
  12. Espinosa, E.; Bautista, R.; Larrosa, R.; Plata, O. Advancements in long-read genome sequencing technologies and algorithms. Genomics 2024, 116, 110842. [Google Scholar] [CrossRef] [PubMed]
  13. Logsdon, G.A.; Vollger, M.R.; Eichler, E.E. Long-read human genome sequencing and its applications. Nat. Rev. Genet. 2020, 21, 597–614. [Google Scholar] [CrossRef]
  14. Hu, T.; Chitnis, N.; Monos, D.; Dinh, A. Next-generation sequencing technologies: An overview. Hum. Immunol. 2021, 82, 801–811. [Google Scholar] [CrossRef]
  15. Du, H.; Lu, S.; Huang, Q.; Zhou, L.; Liu, J.-F. Chromosome-level genome assembly of Huai pig (Sus scrofa). Sci. Data 2024, 11, 1072. [Google Scholar] [CrossRef] [PubMed]
  16. Wy, S.; Kwon, D.; Park, W.; Chai, H.-H.; Cho, I.-C.; Kim, J. A chromosome-level genome assembly of the Korean minipig (Sus scrofa). Sci. Data 2024, 11, 840. [Google Scholar] [CrossRef]
  17. Cao, C.; Miao, J.; Xie, Q.; Sun, J.; Cheng, H.; Zhang, Z.; Wu, F.; Liu, S.; Ye, X.; Zhang, Z. The first near-complete genome assembly of pig: Enabling more accurate genetic research. bioRxiv 2024. [Google Scholar] [CrossRef]
  18. Zhang, L.; Huang, Y.; Wang, M.; Guo, Y.; Liang, J.; Yang, X.; Qi, W.; Wu, Y.; Si, J.; Zhu, S. Development and genome sequencing of a laboratory-inbred miniature pig facilitates study of human diabetic disease. iScience 2019, 19, 162–176. [Google Scholar] [CrossRef] [PubMed]
  19. Jain, C.; Rhie, A.; Hansen, N.F.; Koren, S.; Phillippy, A.M. Long-read mapping to repetitive reference sequences using Winnowmap2. Nat. Methods 2022, 19, 705–710. [Google Scholar] [CrossRef] [PubMed]
  20. Liu, Y.; Fu, Y.; Yang, Y.; Yi, G.; Lian, J.; Xie, B.; Yao, Y.; Chen, M.; Niu, Y.; Liu, L. Integration of multi-omics data reveals cis-regulatory variants that are associated with phenotypic differentiation of eastern from western pigs. Genet. Sel. Evol. 2022, 54, 62. [Google Scholar] [CrossRef] [PubMed]
  21. Kwon, D.; Park, N.; Wy, S.; Lee, D.; Chai, H.-H.; Cho, I.-C.; Lee, J.; Kwon, K.; Kim, H.; Moon, Y. A chromosome-level genome assembly of the Korean crossbred pig Nanchukmacdon (Sus scrofa). Sci. Data 2023, 10, 761. [Google Scholar] [CrossRef] [PubMed]
  22. Gabur, I.; Chawla, H.S.; Snowdon, R.J.; Parkin, I.A. Connecting genome structural variation with complex traits in crop plants. Theor. Appl. Genet. 2019, 132, 733–750. [Google Scholar] [CrossRef] [PubMed]
  23. Chen, S.; Fang, J.; Wang, Y.; Wang, P.; Zhang, S.; Liao, Z.; Lu, H.; Zhang, X. Evolutionary genomics of structural variation in the tea plant, Camellia sinensis. Trop. Plants 2022, 1, 1–11. [Google Scholar] [CrossRef]
  24. Liao, W.-W.; Asri, M.; Ebler, J.; Doerr, D.; Haukness, M.; Hickey, G.; Lu, S.; Lucas, J.K.; Monlong, J.; Abel, H.J. A draft human pangenome reference. Nature 2023, 617, 312–324. [Google Scholar] [CrossRef]
  25. Yang, B.; Cui, L.; Perez-Enciso, M.; Traspov, A.; Crooijmans, R.P.; Zinovieva, N.; Schook, L.B.; Archibald, A.; Gatphayak, K.; Knorr, C. Genome-wide SNP data unveils the globalization of domesticated pigs. Genet. Sel. Evol. 2017, 49, 71. [Google Scholar] [CrossRef] [PubMed]
  26. Zhou, R.; Li, S.T.; Yao, W.Y.; Xie, C.D.; Chen, Z.; Zeng, Z.J.; Wang, D.; Xu, K.; Shen, Z.J.; Mu, Y. The Meishan pig genome reveals structural variation-mediated gene expression and phenotypic divergence underlying Asian pig domestication. Mol. Ecol. Resour. 2021, 21, 2077–2092. [Google Scholar] [CrossRef] [PubMed]
  27. Albarella, U.; Dobney, K.; Rowley-Conwy, P. The Domestication of the Pig (Sus scrofa): New Challenges and Approaches; University of California Press: Oakland, CA, USA, 2006. [Google Scholar]
  28. Wehr, N.H. Historical range expansion and biological changes of Sus scrofa corresponding to domestication and feralization. Mammal Res. 2021, 66, 1–12. [Google Scholar] [CrossRef]
  29. Zhao, P.; Yu, Y.; Feng, W.; Du, H.; Yu, J.; Kang, H.; Zheng, X.; Wang, Z.; Liu, G.E.; Ernst, C.W. Evidence of evolutionary history and selective sweeps in the genome of Meishan pig reveals its genetic and phenotypic characterization. Gigascience 2018, 7, giy058. [Google Scholar] [CrossRef]
  30. Collingbourne, S.J. Conservation Genetics of Traditional and Commercial Pig Breeds, and Evaluation of Their Crossbreeding Potential for Productivity Improvement; University of Essex: Wivenhoe Park, UK, 2019. [Google Scholar]
  31. Gong, Y.; Li, Y.; Liu, X.; Ma, Y.; Jiang, L. A review of the pangenome: How it affects our understanding of genomic variation, selection and breeding in domestic animals? J. Anim. Sci. Biotechnol. 2023, 14, 73. [Google Scholar] [CrossRef] [PubMed]
  32. Ho, S.S.; Urban, A.E.; Mills, R.E. Structural variation in the sequencing era. Nat. Rev. Genet. 2020, 21, 171–189. [Google Scholar] [CrossRef] [PubMed]
  33. Cao, H.; Hastie, A.R.; Cao, D.; Lam, E.T.; Sun, Y.; Huang, H.; Liu, X.; Lin, L.; Andrews, W.; Chan, S. Rapid detection of structural variation in a human genome using nanochannel-based genome mapping technology. GigaScience 2014, 3, 34. [Google Scholar] [CrossRef] [PubMed]
  34. De Coster, W.; Weissensteiner, M.H.; Sedlazeck, F.J. Towards population-scale long-read sequencing. Nat. Rev. Genet. 2021, 22, 572–587. [Google Scholar] [CrossRef] [PubMed]
  35. Eggertsson, H.P.; Kristmundsdottir, S.; Beyter, D.; Jonsson, H.; Skuladottir, A.; Hardarson, M.T.; Gudbjartsson, D.F.; Stefansson, K.; Halldorsson, B.V.; Melsted, P. GraphTyper2 enables population-scale genotyping of structural variation using pangenome graphs. Nat. Commun. 2019, 10, 5402. [Google Scholar] [CrossRef] [PubMed]
  36. Chen, S.; Krusche, P.; Dolzhenko, E.; Sherman, R.M.; Petrovski, R.; Schlesinger, F.; Kirsche, M.; Bentley, D.R.; Schatz, M.C.; Sedlazeck, F.J. Paragraph: A graph-based structural variant genotyper for short-read sequence data. Genome Biol. 2019, 20, 291. [Google Scholar] [CrossRef]
  37. Du, H.; Zhuo, Y.; Lu, S.; Li, W.; Zhou, L.; Sun, F.; Liu, G.; Liu, J.-F. Pangenome Reveals Gene Content Variations and Structural Variants Contributing to Pig Characteristics. Genom. Proteom. Bioinform. 2024, qzae081. [Google Scholar] [CrossRef]
  38. Andrews, S. FastQC: A Quality Control Tool for High Throughput Sequence Data; Babraham Bioinformatics: Cambridge, UK, 2010. [Google Scholar]
  39. Chen, S.; Zhou, Y.; Chen, Y.; Gu, J. fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 2018, 34, i884–i890. [Google Scholar] [CrossRef] [PubMed]
  40. Kokot, M.; Długosz, M.; Deorowicz, S. KMC 3: Counting and manipulating k-mer statistics. Bioinformatics 2017, 33, 2759–2761. [Google Scholar] [CrossRef] [PubMed]
  41. Ranallo-Benavidez, T.R.; Jaron, K.S.; Schatz, M.C. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nat. Commun. 2020, 11, 1432. [Google Scholar] [CrossRef] [PubMed]
  42. Freire, B.; Ladra, S.; Paramá, J.R. Memory-efficient assembly using Flye. IEEE/ACM Trans. Comput. Biol. Bioinform. 2021, 19, 3564–3577. [Google Scholar] [CrossRef]
  43. Walker, B.J.; Abeel, T.; Shea, T.; Priest, M.; Abouelliel, A.; Sakthikumar, S.; Cuomo, C.A.; Zeng, Q.; Wortman, J.; Young, S.K. Pilon: An integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE 2014, 9, e112963. [Google Scholar] [CrossRef]
  44. Xu, M.; Guo, L.; Gu, S.; Wang, O.; Zhang, R.; Peters, B.A.; Fan, G.; Liu, X.; Xu, X.; Deng, L. TGS-GapCloser: A fast and accurate gap closer for large genomes with low coverage of error-prone long reads. GigaScience 2020, 9, giaa094. [Google Scholar] [CrossRef] [PubMed]
  45. Jung, Y.; Han, D. BWA-MEME: BWA-MEM emulated with a machine learning approach. Bioinformatics 2022, 38, 2404–2413. [Google Scholar] [CrossRef]
  46. Seppey, M.; Manni, M.; Zdobnov, E.M. BUSCO: Assessing genome assembly and annotation completeness. In Gene Prediction: Methods and Protocols; Springer: Berlin/Heidelberg, Germany, 2019; pp. 227–245. [Google Scholar]
  47. Flynn, J.M.; Hubley, R.; Goubert, C.; Rosen, J.; Clark, A.G.; Feschotte, C.; Smit, A.F. RepeatModeler2 for automated genomic discovery of transposable element families. Proc. Natl. Acad. Sci. USA 2020, 117, 9451–9457. [Google Scholar] [CrossRef] [PubMed]
  48. Chen, N. Using Repeat Masker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinform. 2004, 5, 4–10. [Google Scholar] [CrossRef]
  49. Luu, P.-L.; Ong, P.-T.; Dinh, T.-P.; Clark, S.J. Benchmark study comparing liftover tools for genome conversion of epigenome sequencing data. NAR Genom. Bioinform. 2020, 2, lqaa054. [Google Scholar] [CrossRef] [PubMed]
  50. Jiang, Y.-F.; Wang, S.; Wang, C.-L.; Xu, R.-H.; Wang, W.-W.; Jiang, Y.; Wang, M.-S.; Jiang, L.; Dai, L.-H.; Wang, J.-R. Pangenome obtained by long-read sequencing of 11 genomes reveal hidden functional structural variants in pigs. iScience 2023, 26, 106119. [Google Scholar] [CrossRef]
  51. Niu, Y.; Fan, X.; Yang, Y.; Li, J.; Lian, J.; Wang, L.; Zhang, Y.; Tang, Y.; Tang, Z. Haplotype-resolved assembly of a pig genome using single-sperm sequencing. Commun. Biol. 2024, 7, 738. [Google Scholar] [CrossRef]
  52. Ma, H.; Jiang, J.; He, J.; Liu, H.; Han, L.; Gong, Y.; Li, B.; Yu, Z.; Tang, S.; Zhang, Y. Long-read assembly of the Chinese indigenous Ningxiang pig genome and identification of genetic variations in fat metabolism among different breeds. Mol. Ecol. Resour. 2022, 22, 1508–1520. [Google Scholar] [CrossRef]
  53. Zhang, Y.; Fan, G.; Liu, X.; Skovgaard, K.; Sturek, M.; Heegaard, P.M. The genome of the naturally evolved obesity-prone Ossabaw miniature pig. iScience 2021, 24, 103081. [Google Scholar] [CrossRef] [PubMed]
  54. Xie, H.-B.; Yan, C.; Adeola, A.C.; Wang, K.; Huang, C.-P.; Xu, M.-M.; Qiu, Q.; Yin, X.; Fan, C.-Y.; Ma, Y.-F. African suid genomes provide insights into the local adaptation to diverse African environments. Mol. Biol. Evol. 2022, 39, msac256. [Google Scholar] [CrossRef]
  55. Wang, Y.; Gou, Y.; Yuan, R.; Zou, Q.; Zhang, X.; Zheng, T.; Fei, K.; Shi, R.; Zhang, M.; Li, Y. A chromosome-level genome of Chenghua pig provides new insights into the domestication and local adaptation of pigs. Int. J. Biol. Macromol. 2024, 270, 131796. [Google Scholar] [CrossRef] [PubMed]
  56. Li, H.; Feng, X.; Chu, C. The design and construction of reference pangenome graphs with minigraph. Genome Biol. 2020, 21, 265. [Google Scholar] [CrossRef] [PubMed]
  57. Kirsche, M.; Prabhu, G.; Sherman, R.; Ni, B.; Aganezov, S.; Schatz, M.C. Jasmine: Population-scale structural variant comparison and analysis. bioRxiv 2021. [Google Scholar] [CrossRef]
  58. Cingolani, P.; Platts, A.; Wang, L.L.; Coon, M.; Nguyen, T.; Wang, L.; Land, S.J.; Lu, X.; Ruden, D.M. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 2012, 6, 80–92. [Google Scholar] [CrossRef] [PubMed]
  59. Danecek, P.; Auton, A.; Abecasis, G.; Albers, C.A.; Banks, E.; DePristo, M.A.; Handsaker, R.E.; Lunter, G.; Marth, G.T.; Sherry, S.T. The variant call format and VCFtools. Bioinformatics 2011, 27, 2156–2158. [Google Scholar] [CrossRef]
  60. Yu, G.; Wang, L.-G.; Han, Y.; He, Q.-Y. clusterProfiler: An R package for comparing biological themes among gene clusters. Omics J. Integr. Biol. 2012, 16, 284–287. [Google Scholar] [CrossRef]
  61. Li, X.; Liu, Q.; Fu, C.; Li, M.; Li, C.; Li, X.; Zhao, S.; Zheng, Z. Characterizing structural variants based on graph-genotyping provides insights into pig domestication and local adaption. J. Genet. Genom. 2024, 51, 394–406. [Google Scholar] [CrossRef]
  62. Khan, M.M.H.; Rafii, M.Y.; Ramlee, S.I.; Jusoh, M.; Al Mamun, M.; Halidu, J. DNA fingerprinting, fixation-index (Fst), and admixture mapping of selected Bambara groundnut (Vigna subterranea [L.] Verdc.) accessions using ISSR markers system. Sci. Rep. 2021, 11, 14527. [Google Scholar]
  63. Zhu, Y.; Li, W.; Yang, B.; Zhang, Z.; Ai, H.; Ren, J.; Huang, L. Signatures of selection and interspecies introgression in the genome of Chinese domestic pigs. Genome Biol. Evol. 2017, 9, 2592–2603. [Google Scholar] [CrossRef]
  64. Dinh, D.T.; Breen, J.; Nicol, B.; Foot, N.J.; Bersten, D.C.; Emery, A.; Smith, K.M.; Wong, Y.Y.; Barry, S.C.; Yao, H.H. Progesterone receptor mediates ovulatory transcription through RUNX transcription factor interactions and chromatin remodelling. Nucleic Acids Res. 2023, 51, 5981–5996. [Google Scholar] [CrossRef]
  65. Zeberg, H.; Kelso, J.; Pääbo, S. The Neandertal progesterone receptor. Mol. Biol. Evol. 2020, 37, 2655–2660. [Google Scholar] [CrossRef] [PubMed]
  66. Chen, X.; Fu, J.; Wang, A. Expression of genes involved in progesterone receptor paracrine signaling and their effect on litter size in pigs. J. Anim. Sci. Biotechnol. 2016, 7, 31. [Google Scholar] [CrossRef] [PubMed]
  67. Hu, Z.; Xiao, X.; Zhang, Z.; Li, M. Genetic insights and neurobiological implications from NRXN1 in neuropsychiatric disorders. Mol. Psychiatry 2019, 24, 1400–1414. [Google Scholar] [CrossRef]
  68. Bonnet, A.; Le Cao, K.-A.; Sancristobal, M.; Benne, F.; Robert-Granié, C.; Law-So, G.; Fabre, S.; Besse, P.; De Billy, E.; Quesnel, H. In vivo gene expression in granulosa cells during pig terminal follicular development. Reproduction 2008, 136, 211–224. [Google Scholar] [CrossRef]
  69. Massah, S.; Jubene, J.; Lee, F.; Beischlag, T.; Prefontaine, G. The effects of the DNA Demethylating reagent, 5-azacytidine on SMCHD1 genomic localization. BMC Genet. 2020, 21, 3. [Google Scholar] [CrossRef] [PubMed]
  70. Kim, T.W.; Kang, Y.K.; Park, Z.Y.; Kim, Y.-H.; Hong, S.W.; Oh, S.J.; Sohn, H.A.; Yang, S.-J.; Jang, Y.J.; Lee, D.C. SH3RF2 functions as an oncogene by mediating PAK4 protein stability. Carcinogenesis 2014, 35, 624–634. [Google Scholar] [CrossRef]
  71. Wesoly, J.; Pstrąg, N.; Derylo, K.; Michalec-Wawiórka, B.; Derebecka, N.; Nowicka, H.; Kajdasz, A.; Kluzek, K.; Srebniak, M.; Tchórzewski, M. Structural, topological, and functional characterization of transmembrane proteins TMEM213, 207, 116, 72 and 30B provides a potential link to ccRCC etiology. Am. J. Cancer Res. 2023, 13, 1863. [Google Scholar]
  72. Tong, S.-f.; Mo, Z.; Rui, X.; Li, D.-f.; Zhang, L.-f.; Yang, L. Genome-wide detection for runs of homozygosity analysis in three pig breeds from Chinese Taihu Basin and Landrace pigs by SLAF-seq data. J. Integr. Agric. 2022, 21, 3293–3301. [Google Scholar] [CrossRef]
  73. Lander, B.; Schneider, M.; Brunson, K. A history of pigs in China: From curious omnivores to industrial pork. J. Asian Stud. 2020, 79, 865–889. [Google Scholar] [CrossRef]
  74. Wu, H.; Luo, L.-Y.; Zhang, Y.-H.; Zhang, C.-Y.; Huang, J.-H.; Mo, D.-X.; Zhao, L.-M.; Wang, Z.-X.; Wang, Y.-C.; He-Hua, E. Telomere-to-telomere genome assembly of a male goat reveals variants associated with cashmere traits. Nat. Commun. 2024, 15, 10041. [Google Scholar] [CrossRef] [PubMed]
  75. Liu, J.; Li, Q.; Hu, Y.; Yu, Y.; Zheng, K.; Li, D.; Qin, L.; Yu, X. The complete telomere-to-telomere sequence of a mouse genome. Science 2024, 386, 1141–1146. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The genome assembly of the Meishan pig. (A) Workflow for the genome assembly of Meishan pig. (B) The frequency distribution of k-mer for Meishan pig genome (k = 31). (C) The distribution of contigs on the chromosome, with green and gray representing different contigs. (D) BUSCO evaluates the completeness of genome assembly.
Figure 1. The genome assembly of the Meishan pig. (A) Workflow for the genome assembly of Meishan pig. (B) The frequency distribution of k-mer for Meishan pig genome (k = 31). (C) The distribution of contigs on the chromosome, with green and gray representing different contigs. (D) BUSCO evaluates the completeness of genome assembly.
Animals 15 00603 g001
Figure 2. Genome annotation of Meishan pig assembly. (A) Distribution of features in the Meishan pig genome. For the outer to inner regions, each circle represents the GC content, transposable elements number, gene number and collinearity of repetitive sequences in the 500 kb nonoverlapping windows. (B) Sequence divergence of repetitive elements in the Meishan pig assembly. (C) Sequence consistency between Meishan pigs and Sscrofa11.1.
Figure 2. Genome annotation of Meishan pig assembly. (A) Distribution of features in the Meishan pig genome. For the outer to inner regions, each circle represents the GC content, transposable elements number, gene number and collinearity of repetitive sequences in the 500 kb nonoverlapping windows. (B) Sequence divergence of repetitive elements in the Meishan pig assembly. (C) Sequence consistency between Meishan pigs and Sscrofa11.1.
Animals 15 00603 g002
Figure 3. Statistical analysis of structural variations (SVs) in Meishan pigs. (A) The distribution of the number of structural variations detected in different pig breeds. (B) The statistical distribution of the number of structural variations in outgroup, Asian, and European populations. (C) Comparison and statistical analysis of the number of structural variations. PHA, Phacochoerus africanus; POT, Potamochoerus porcus; BMX, Bamaxiang pig; WZS, Wuzhishan pig; LC, Luchuan pig; DNXE, Diannan small-ear pig; TT, Tibetan pig; MS, Meishan pig; SWT, Shawutou pig; TC, Tunchang pig; RC, Rongchang pig; NX, Ningxiang pig; HUAI, Huai pig; LWH, Laiwu pig; CH, Chenghua pig; AQLB, Anqingliubai pig; MIN, Min pig; KORE, Korean minipig; NANC, Nanchukmacdon pig; OSS, Ossabaw minipig; DU, Duroc pig; DLY, Duroc × Landrace × Yorkshire pig; BBH, Babraham pig; LR, Landrace pig; LW, Largewhite pig; EWB, European wild pig; EUP, European pig; ASP, Asian pig; OUT, outgroup.
Figure 3. Statistical analysis of structural variations (SVs) in Meishan pigs. (A) The distribution of the number of structural variations detected in different pig breeds. (B) The statistical distribution of the number of structural variations in outgroup, Asian, and European populations. (C) Comparison and statistical analysis of the number of structural variations. PHA, Phacochoerus africanus; POT, Potamochoerus porcus; BMX, Bamaxiang pig; WZS, Wuzhishan pig; LC, Luchuan pig; DNXE, Diannan small-ear pig; TT, Tibetan pig; MS, Meishan pig; SWT, Shawutou pig; TC, Tunchang pig; RC, Rongchang pig; NX, Ningxiang pig; HUAI, Huai pig; LWH, Laiwu pig; CH, Chenghua pig; AQLB, Anqingliubai pig; MIN, Min pig; KORE, Korean minipig; NANC, Nanchukmacdon pig; OSS, Ossabaw minipig; DU, Duroc pig; DLY, Duroc × Landrace × Yorkshire pig; BBH, Babraham pig; LR, Landrace pig; LW, Largewhite pig; EWB, European wild pig; EUP, European pig; ASP, Asian pig; OUT, outgroup.
Animals 15 00603 g003
Figure 4. Selective regions on SVs between the CWB and the Meishan genome assembly. (A) Statistic Fst was plotted for selected SVs with threshold Fst value ≥ 0.01. (B) Pathway enrichment of a gene set. (C) Local display of Fst and nucleotide diversity ratio in chr9:31.9-32.2 Mb region. Note that the PGR gene overlaps with this selective sweep region.
Figure 4. Selective regions on SVs between the CWB and the Meishan genome assembly. (A) Statistic Fst was plotted for selected SVs with threshold Fst value ≥ 0.01. (B) Pathway enrichment of a gene set. (C) Local display of Fst and nucleotide diversity ratio in chr9:31.9-32.2 Mb region. Note that the PGR gene overlaps with this selective sweep region.
Animals 15 00603 g004
Figure 5. The degree of haplotype sharing in pairwise comparisons among 16 populations. (A) Haplotype sharing in interval chr4:82,660,001-82,730,000; (B) Haplotype sharing in interval chr9:32,050,001-32,070,000. AWB, Asian wild pig; MS, Meishan pig; WZS, Wuzhishan pig; LC, Luchuan pig; BMX, Bamaxiang pig; LA, Lean pig; EHL, Erhualian pig; TT, Tibetan pig; LWU, Laiwu pig; MIN, Min pig; HT, Heitao pig; EWB, European wild pig; PT, Pietrain pig; LW, Largewhite pig; LR, Landrace pig; DU, Duroc pig.
Figure 5. The degree of haplotype sharing in pairwise comparisons among 16 populations. (A) Haplotype sharing in interval chr4:82,660,001-82,730,000; (B) Haplotype sharing in interval chr9:32,050,001-32,070,000. AWB, Asian wild pig; MS, Meishan pig; WZS, Wuzhishan pig; LC, Luchuan pig; BMX, Bamaxiang pig; LA, Lean pig; EHL, Erhualian pig; TT, Tibetan pig; LWU, Laiwu pig; MIN, Min pig; HT, Heitao pig; EWB, European wild pig; PT, Pietrain pig; LW, Largewhite pig; LR, Landrace pig; DU, Duroc pig.
Animals 15 00603 g005
Table 1. Statistics of the Meishan pig assembly and the reference genome assembly of the pig (Sscrofa11.1).
Table 1. Statistics of the Meishan pig assembly and the reference genome assembly of the pig (Sscrofa11.1).
AssemblyMSjxauSscrofa11.1
Total length of assembly (Gb)2.452.5
Min length (bp)100815,096
Max length (Mb)292.79274.33
Number of scaffolds1144613
Number of gaps466544
Quality values (QV)37.06NA
Scaffold N50 (Mb)139.17138.97
Contig N50 (Mb)15.6248.23
Table 2. The basic classification of 28 public available pig genomes and MSjxau genome.
Table 2. The basic classification of 28 public available pig genomes and MSjxau genome.
AssemblyGroupsLevelsAssemblyGroupsLevels
PHA_SAMN15809474OutgroupChromosomeLC_CNA0017742AsianContig
PHA_SAMC146517OutgroupChromosomeTT_CRR307872AsianContig
POT_SAMC146518OutgroupChromosomeMS_CRR307869AsianContig
BMX_SAMN09531794AsianChromosomeRC_CRR307871AsianContig
DNXE_SAMC146519AsianChromosomeLWU_CRR307868AsianContig
MS_SAMN14422843AsianChromosomeAQLB_CRR307865AsianContig
MS_jxauAsianChromosomeMIN_CRR307870AsianContig
SWT_PRJCA017882AsianChromosomeOSS_SAMN28857594EuropeanChromosome
TC_PRJCA017882AsianChromosomeDU_SAMN15501053EuropeanChromosome
NX_SAMN12123497AsianChromosomeDLY_SAMN07325927EuropeanChromosome
HUAI_SAMN43144123AsianChromosomeBBH_SAMN37151398EuropeanChromosome
CH_SAMN36411884AsianChromosomeLR_SRR24831861EuropeanContig
KORE_SAMN41069951AsianChromosomeLW_CRR307874EuropeanContig
NANC_SAMN34578002AsianChromosomeLW_CRR307874EuropeanContig
WZS_CRR307873AsianContig
Table 3. A fraction of domesticated genes in MS pigs based on Fst and Pi statistics.
Table 3. A fraction of domesticated genes in MS pigs based on Fst and Pi statistics.
RegionGeneTop FstDescription
chr1:223030001-223100000PTAR10.841Protein prenyltransferase activity
chr3:90190001-90220000NRXN10.827Nervous development
chr4:82660001-82730000TBX190.848Developmental processes
chr4:82660001-82730000SFT2D20.856lipid metabolism
chr6:103930001-103970000SMCHD10.814Epigenetic silencing
chr9:32050001-32070000PGR0.801Progesterone Receptor
chr14:20450001-20470000SH3RF10.841Protein ligase activity
chr14:39710001-39780000TMEM1160.906Transmembrane protein
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Du, H.; Hu, J.; Zhang, Z.; Wu, Z. Chromosome-Level Genome Assembly of the Meishan Pig and Insights into Its Domestication Mechanisms. Animals 2025, 15, 603. https://doi.org/10.3390/ani15040603

AMA Style

Du H, Hu J, Zhang Z, Wu Z. Chromosome-Level Genome Assembly of the Meishan Pig and Insights into Its Domestication Mechanisms. Animals. 2025; 15(4):603. https://doi.org/10.3390/ani15040603

Chicago/Turabian Style

Du, Huipeng, Jianchao Hu, Zhiyan Zhang, and Zhongzi Wu. 2025. "Chromosome-Level Genome Assembly of the Meishan Pig and Insights into Its Domestication Mechanisms" Animals 15, no. 4: 603. https://doi.org/10.3390/ani15040603

APA Style

Du, H., Hu, J., Zhang, Z., & Wu, Z. (2025). Chromosome-Level Genome Assembly of the Meishan Pig and Insights into Its Domestication Mechanisms. Animals, 15(4), 603. https://doi.org/10.3390/ani15040603

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop