Next Article in Journal
Impacts of Climate Change and Inter-Specific Competition on the Spatial Distribution of Elliot’s Pheasant (Syrmaticus ellioti, Swinhoe, 1872) in Huzhou City, China
Previous Article in Journal
Ecological Roles of Lichens as Monitors of a Changing Global Environment
Previous Article in Special Issue
A Multi-Breed GWAS for Carcass Weight in Jeju Black Cattle and Hanwoo × Jeju Black Crossbreds
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genomic Evaluation of the Genetic Structure and Analysis of Selective Evolutionary Signatures of Xupu Goose

1
College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
2
Key Laboratory for Evaluation and Utilization of Livestock and Poultry Resources (Poultry), Ministry of Agriculture and Rural Affairs, Beijing 100125, China
*
Author to whom correspondence should be addressed.
Biology 2026, 15(6), 479; https://doi.org/10.3390/biology15060479
Submission received: 13 February 2026 / Revised: 10 March 2026 / Accepted: 12 March 2026 / Published: 17 March 2026
(This article belongs to the Special Issue Advances in Animal Functional Genomics)

Simple Summary

The Xupu goose is an economically valuable indigenous breed in China, celebrated for its large body size and exceptional fatty liver capacity. However, as an “at-risk” population, it requires urgent conservation efforts. In this study, we conducted whole-genome sequencing on 15 Xupu geese to evaluate their genetic diversity and uncover the genomic basis of their unique traits. We found that while the breed retains a moderate genetic reservoir, it carries a legacy of historical inbreeding driven by past population bottlenecks. Importantly, our results indicate that current conservation practices have successfully prevented severe recent inbreeding, despite the natural emergence of distinct family lineages. Furthermore, we identified key candidate genes strongly linked to lipid metabolism and muscle development. These findings provide crucial scientific guidance for optimizing future mating strategies and lay the genomic groundwork for the precision breeding of this vital waterfowl.

Abstract

As an elite indigenous poultry breed under national protection in China, the Xupu goose is renowned for its large body size, superior fatty liver production, premium meat quality, and high tolerance to roughage. To elucidate its genomic architecture, genetic diversity, and evolutionary selection signatures, we conducted whole-genome resequencing on 15 purposively selected, unrelated male Xupu geese. An average of 6.79 Gb of high-quality sequence data was generated per individual, yielding approximately 4.27 million single-nucleotide polymorphisms (SNPs) with a transition/transversion (Ti/Tv) ratio of 2.49. Population genomic analyses revealed that while the population retains a moderate genetic reservoir (HE = 0.298), it exhibits a distinct heterozygote deficit (HO = 0.217) and a moderate genomic inbreeding coefficient FROH = 0.204). This structural pattern underscores the genetic impact of historical ex situ closed-flock conservation and the consequent formation of cryptic family lineages. Furthermore, genome-wide integrated haplotype score (iHS) scans detected distinct regions under recent positive selection. Functional annotation of these regions highlighted candidate genes tightly associated with the breed’s hallmark traits, specifically lipid metabolism and hepatic fat deposition (ACSS2, ACSS3, PECR), alongside muscle development (CMYA5, MTPN, LEPR). Conclusively, this study delineates a comprehensive genomic landscape of the Xupu goose, providing a robust foundational resource for future germplasm conservation, molecular marker development, and precision breeding programs.

1. Introduction

China’s vast territory and diverse ecological types have nurtured abundant indigenous goose genetic resources with distinct characteristics [1,2]. These indigenous breeds are widely distributed across China and have developed strong adaptability to complex local environments through long-term natural domestication and artificial selection [3]. Among them, the Xupu goose, originating from the hilly regions of Xupu County in Hunan Province and recognized as one of the “Three Famous Geese of China”, is renowned for its large body size, exceptional fatty liver performance, high-quality down, and tolerance to coarse fodder. Studies have shown that its meat is rich in unsaturated fatty acids, and the average weight of its fatty liver can exceed 606 g [4]. Furthermore, conservation assessments by Chen et al. classify the Xupu goose as “at-risk” among Chinese indigenous waterfowl resources [5]. In recognition of its exceptional breed characteristics and urgent conservation value, the breed has been accorded high-level national protection status. Specifically, the Xupu goose was inscribed in the National List of Livestock and Poultry Genetic Resources for Conservation in February 2014 and was subsequently designated as a National Geographical Indication Protected Product in December of that year.
Whole-genome resequencing (WGS), relying on reference genomes, utilizes high-throughput technology to accurately identify single-nucleotide polymorphisms (SNPs), insertions/deletions (Indels), and structural variations (SVs) within the genome [6,7]. These genetic variation data are instrumental in revealing the genetic diversity, selective evolutionary signals, and domestication processes of goose populations [8,9]. In studies of breed origins, Wen et al. [10] confirmed the dual-origin model of domestic geese—originating from Swan Geese (Anser cygnoides) and Greylag Geese (Anser anser)—through resequencing analysis of domestic geese and their wild ancestors, and elucidated the impact of gene introgression, such as IGF-1, on body size evolution. Regarding phenotypic trait dissection, Ren et al. [11] identified 26 plumage color regulatory genes, including KITLG, using Pool-Seq technology. Sun et al. [12] mapped 88 SNP loci significantly associated with body size through GWAS, revealing the regulatory mechanisms of genes like THADA in skeletal development. For reproductive traits, Zhao et al. [13] screened 107 egg-production-related candidate genes and found them significantly enriched in signaling pathways such as PI3K-Akt. In summary, whole-genome data serves not only as the cornerstone for genetic resource conservation but also as a key driver for transitioning the goose industry toward molecular precision breeding.
The unique lineage of the Xupu goose is deeply rooted in its geographic and cultural history. As documented in the 1869 edition of the Xupu County Annals, the breed emerged through over a century of localized, closed-herd breeding, with residents rarely transferring populations outside the hilly regions of Hunan Province. While this unique breeding history has shaped its superior breed characteristics, it also exposes the population to the potential risk of reduced genetic diversity. However, a systematic genomic evaluation of the Xupu goose—particularly regarding its genetic diversity, inbreeding levels (ROH), and selective signatures—remains insufficient. This study aims to conduct a whole-genome resequencing analysis of Xupu geese to deeply explore their genetic variation characteristics, thereby providing systematic theoretical support for the conservation and molecular breeding of this breed.

2. Materials and Methods

2.1. Ethics Statement

To ensure the highest level of Animal Ethics and welfare, this study was conducted under the rigorous oversight of the committee at Yangzhou University. All protocols were officially authorized under Permit Number 2023004742, ensuring that every step of the research met established safety and humanitarian guidelines. Beyond institutional approval, the project maintained full Regulatory Compliance with the “Regulations on the Administration of Laboratory Animal Affairs” (Yangzhou University, 2012) and “the specific Management Standards for Experimental Practices” (Jiangsu, China, 2008) established in Jiangsu, China. By integrating these legal frameworks, the study guarantees that all animal management procedures were performed with professional integrity and in accordance with modern scientific standards.

2.2. Experimental Animals

In this genetic study, male Xupu geese were obtained from the National Waterfowl Gene Bank (Taizhou, China), where the conservation population is maintained under a closed management system utilizing rotational mating. To maximize breed representativeness and minimize kinship bias, experimental individuals were randomly selected from distinct, unrelated families based on comprehensive pedigree records. Venous blood samples (2 mL) were collected from the brachial vein of 15 male geese (90 days old) using sterile heparinized syringes. Blood collection was performed by licensed veterinarians. To ensure animal welfare, geese were gently restrained using a calm, manual holding technique to minimize stress. No anesthesia was required for the minimally invasive wing vein puncture, and hemostasis was immediately confirmed after sampling to prevent infection. Collected blood samples were processed for genomic DNA extraction and subsequent genetic analyses. Prior to extraction, samples were temporarily stored at −20 °C. Following extraction, the purified genomic DNA was stored at −80 °C to preserve integrity for downstream sequencing.

2.3. Genomic DNA Extraction and Quality Assessment

Genomic DNA was extracted from blood samples using a DNA extraction kit (DP304, Tiangen, China). The integrity and fragmentation of the extracted genomic DNA were verified using 1% agarose gel electrophoresis, and the precise concentration was determined using a NanoDrop-2000 micro-spectrophotometer (Menlo Park, CA, USA) to ensure all samples met the stringent requirements for library construction and sequencing. Only samples meeting stringent quality criteria (concentration > 200 ng/µL; OD260/280: 1.75–2.0) were selected for library construction and sequencing. Qualified DNA samples were shipped to Nanjing Jisihuiyuan Biotechnology Co., Ltd. (Nanjing, China) for resequencing library construction and sequenced on the Illumina NovaSeq 6000 platform. The Anser cygnoides (Swan goose) reference genome (ASM4018256v1; NCBI Accession: GCF_040182565.1) was utilized for read mapping and downstream genomic analyses. SNP calling was performed using the Genome Analysis Toolkit (GATK). After stringent filtering, the obtained SNP markers were utilized for haplotype phasing and integrated Haplotype Score (iHS) analysis [14].

2.4. Data Quality Control and Variant Calling

Raw data were acquired using a high-throughput sequencing platform (Illumina NovaSeq 6000, San Diego, CA, USA) and converted into raw reads. Subsequently, the sequencing reads were aligned to the reference genome using BWA. On this basis, PCR duplicates were removed using Picard tools (V2.17.0) (Mark Duplicates) to eliminate the effects of amplification bias. Subsequent variant detection was primarily implemented using the GATK (v4.1.4.1) toolkit. Specifically, variant calling was first performed to identify candidate SNPs and InDel loci, followed by Variant Quality Score Recalibration and stringent data filtering to construct the final set of high-confidence SNP loci.

2.5. Analysis of Population Genetic Diversity, Linkage Disequilibrium (LD), and Runs of Homozygosity (ROH)

To conduct an in-depth analysis of the genetic structure of the Xupu goose population, the --het and --freq functions in PLINK 1.9 software were employed post-filtering to accurately calculate key indices of genetic variation, including expected heterozygosity (HE) and observed heterozygosity (HO), which were subsequently used to assess the inbreeding coefficient (F) and intra-population genetic distance (Dst) [15].
Population genetic structure is a crucial approach for investigating internal evolutionary processes and phylogenetic relationships within a species. The top 10 eigenvalues and eigenvectors were calculated using the --pca parameter in PLINK v1.9 software. Subsequently, the first two principal components (PC1 and PC2) were visualized using the ggplot2 package in R v4.4.0 to explore the clustering relationships and genetic consistency within the conserved population.
The characterization of population genetic structure is an essential approach for elucidating intraspecific evolutionary dynamics and phylogenetic relationships. To this end, Principal Component Analysis (PCA) was conducted using PLINK v1.9 (with the --pca parameter) to extract the top 10 principal components. Subsequently, the first two principal components (PC1 and PC2) were visualized using the ggplot2 package in R v4.4.0 to assess spatial clustering patterns and genetic homogeneity within the conserved population.
Linkage Disequilibrium (LD) analysis was performed using PopLDdecay (https://github.com/BGI-shenzhen/PopLDdecay, accessed on 14 February 2026) with the following specific parameters: -MaxDist 500, -Het 0.1, -Miss 0.3, and -OutPairLD 5. The chromosomal distribution of LD was visualized via LD decay plots, which characterize the relationship between the LD decay rate and physical or genetic distance [16].
To identify Runs of Homozygosity (ROH) within the Xupu goose genome, a systematic scan of the autosomes was performed using PLINK V1.9 software [17,18]. The detection process employed a sliding window approach, with stringent filtering parameters established based on the studies by Yu et al. [19] and Sun et al. [20] to ensure the reliability of the results. The detection parameters were strictly defined as follows: a sliding window of 50 SNPs was employed (--homozyg-window-snp 50), allowing for a maximum of one heterozygous genotype (--homozyg-window-het 1) and five missing genotypes (--homozyg-window-missing 5) per window, with a threshold of 0.05 (--homozyg-window-threshold 0.05). To be classified as an ROH, a segment was required to span at least 100 kb (--homozyg-kb 100), contain a minimum of 10 SNPs (--homozyg-snp 10), and maintain a minimum density of one SNP per 10 kb (--homozyg-density 10), with a maximum gap of 100 kb between adjacent markers (--homozyg-gap 100). Statistical analyses were conducted on the frequency, length, and distribution of ROH within the Xupu goose population. The genomic inbreeding coefficient based on FROH (was calculated using the following formula [21]:
F R O H = L R O H L A U T O
where L R O H represents the total length of all ROH segments detected in each individual, and L A U T O is the total length of the autosomal genome covered by the genotype data.

2.6. Detection of Selection Signatures Using iHS

To identify genomic regions under selection (significant signatures) within the Xupu goose population, the Integrated Haplotype Score (iHS) method was employed. The standardized iHS score is typically calculated as follows [22]:
i H S = l n ( i H H A i H H D ) E p [ l n ( i H H A i H H D ) ] S D p l n ( i H H A i H H D )
where i H H A and i H H D represent the integrated Extended Haplotype Homozygosity (EHH)scores for the ancestral and derived core alleles, respectively. E p [ l n ( i H H A i H H D ) ] and S D p l n ( i H H A i H H D ) denote the expected value (mean) and standard deviation calculated within allele frequency bin p. iHS scores were calculated and visualized as Manhattan plots using the rehh package (v3.2.1) in R. Candidate genes located within genomic regions exhibiting significant selection signatures were identified based on a threshold of |iHS| > 2 (p < 0.05) to filter potential false positives. Subsequently, functional enrichment analyses, including Gene Ontology (GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, were performed using the Metascape platform [23]. Statistically significant terms were identified using a Benjamini–Hochberg corrected p-value threshold of < 0.05.

3. Results

3.1. Statistics and Quality Assessment of Sample Sequencing Data

In this study, whole-genome resequencing was performed on the Xupu goose, a distinctive indigenous breed, followed by a rigorous quality control pipeline for the acquired sequencing data. Data analysis results indicated that the average raw data volume per sample reached 6.87 Gb, with an average of 22,896,147 raw reads (Table 1). After stringent filtering, the average effective data volume was 6.79 Gb, with an average of 22,660,317 clean reads, yielding a high effective data utilization rate of 98.97% (Figure 1A). Regarding sequencing quality assessment, the average values for Q20 and Q30 reached 97.23% and 91.32%, respectively, with an average GC content of 43.77%. Furthermore, the GC content distribution across all samples was balanced, ranging from 43.31% to 44.52%. In summary, the sequencing data exhibited excellent accuracy and stability, fully meeting the technical requirements for genome-wide SNP variant detection and subsequent population genetics analysis, thus providing a solid data foundation.

3.2. Analysis of SNP Variant Detection and Annotation Results

Whole-genome SNP variant detection was performed on all samples in this study. Statistical results indicate some fluctuation in SNP counts across samples (Table 2), with an average of 4,269,107, ranging from 3,877,534 in XP.15 to 4,629,469 in XP.12. These inter-individual differences may stem from subtle variations in sequencing depth or the inherent heterogeneity of genetic backgrounds.
In terms of heterozygosity analysis, the average number of heterozygous SNPs (2,312,216) was significantly higher than that of homozygous SNPs 1,959,778, with an average Heterozygous variant proportion of approximately 54% (Figure 1A). Inter-individual analysis showed that sample XP.12 had the highest number of heterozygous loci (2,661,174), suggesting strong individual genetic polymorphism; conversely, sample XP.15 had a relatively lower number (2,008,960), suggesting this individual may have accumulated more homozygous segments.
Regarding variant types, the transition/transversion (Ti/Tv) ratio across all samples showed high consistency, ranging between 2.47 and 2.50 (mean 2.49). In molecular evolution, due to the cytosine methylation deamination effect, the biochemical probability of transitions is significantly higher than that of transversions. The ratio of 2.49 obtained in this study aligns with the typical characteristics of most vertebrate genomes, usually between 2.0 and 2.5, indicating not only the high accuracy and reliability of the variant detection but also the exclusion of potential systematic bias during sequencing. Further analysis revealed that C:G > T:A transitions were the most abundant, followed by T:A > C:G, while T:A > A:T transversions were the least frequent (Figure 1B).
SNP functional annotation results revealed the distribution patterns of variants across different genomic functional regions (Table 3). The vast majority of SNPs were enriched in non-coding regions: intronic regions contained an average of 1,821,812 SNPs (accounting for 56.4%), and intergenic regions contained an average of 1,223,203 SNPs (accounting for 37.9%). This aligns with the “Neutral Theory” of molecular evolution, which posits that non-coding regions are subject to less selection pressure and possess a greater capacity to tolerate variations. In gene regulation-related regions, upstream and downstream areas contained an average of 66,370 and 71,264 SNPs, respectively, suggesting they may participate in gene expression regulation by influencing transcription initiation or post-transcriptional modifications; meanwhile, the average number of SNPs in upstream/downstream overlapping regions was 10,517. In coding regions, synonymous mutations averaged 35,063; while not altering amino acid sequences, they may affect translation efficiency. Notably, non-synonymous mutations averaged 12,458. Although non-synonymous mutations account for a very small proportion, because they directly result in amino acid changes, they are often eliminated by strong purifying selection. Therefore, these retained non-synonymous mutations are often key potential factors driving phenotypic variation (such as growth rate and meat quality traits) and disease susceptibility, serving as important candidate loci for subsequent functional gene mining.

3.3. Demographic History and Genetic Status

In this study, the average total length of ROH in the Xupu goose genome was 230.2 Mb, ranging from 195.1 to 272.9 Mb. Analysis of the segment size distribution (Figure 1E) revealed that short fragments (0.1–0.2 Mb) were the most abundant, totaling 7056 segments and accounting for 54.13% of the total, whereas long segments (>1 Mb) comprised only 1.76% (Supplementary Table S1). This dominance of short segments suggests the influence of ancient common ancestors or historical bottleneck effects rather than recent inbreeding—a conclusion further supported by the rapid decrease in frequency as ROH length increases. Consequently, the calculated average genomic inbreeding coefficient ( F R O H ) was 0.204 (range: 0.173–0.241) (Figure 1D) (Supplementary Table S2). Collectively, these results demonstrate that while the Xupu goose population faces moderate inbreeding pressure. This moderate level of inbreeding is consistent with the breed’s history of closed-population conservation. Given the limitations of historical pedigree depth, the F R O H was calculated to provide a precise estimate of individual autozygosity and population inbreeding levels. Regarding heterozygosity and inbreeding levels (Figure 2B), the observed heterozygosity ( H O ) ranged from 0.194 to 0.249 (mean 0.217), whereas the expected heterozygosity ( H E ) remained stable, varying narrowly between 0.297 and 0.298 (mean 0.298). The average H E was significantly higher than the average H O . Furthermore, linear regression analysis (Figure 2D) revealed a distinct negative correlation between H O and F. Collectively, these findings indicate a degree of heterozygote deficiency and inbreeding within the population.
The genomic relationship matrix (G-matrix) heatmap (Figure 2A) visualizes the kinship structure among the 15 sequenced individuals. Visual inspection of the kinship heatmap revealed a generally moderate background relatedness across the population, characterized by predominant dark blue coloration. However, distinct clusters of elevated genomic similarity (lighter hues) were observed among specific individuals. To further validate this observed substructure, a Principal Component Analysis (PCA) was conducted (Supplementary Figure S1). The PCA results strikingly corroborated the G-matrix patterns, with the first principal component (PC1) accounting for an exceptionally high 66.23% of the total genetic variance. Rather than forming a single panmictic cloud, the 15 individuals exhibited clear stratification along the PC1 axis, aggregating into corresponding sub-clusters.
The decay pattern of Linkage Disequilibrium (LD) reflects the population’s recombination history and serves as a critical determinant of the resolution of association mapping. Analysis of LD decay (Figure 2C) reveals that the linkage disequilibrium coefficient (r2) exhibits a rapid decline as physical distance increases. Notably, the curve drops precipitously within the short-range interval of 0–100 kb, after which it plateaus and stabilizes at a relatively low level.

3.4. Genome-Wide Detection of Selection Signatures Using iHS

To investigate signatures of recent positive selection associated with the domestication of the Xupu goose, a genome-wide scan was performed using the Integrated Haplotype Score (iHS) method. First, the raw iHS values were standardized to facilitate comparison. The resulting genome-wide frequency distribution (Figure 3B) revealed that the bulk of the observed iHS values (blue curve) aligned closely with the theoretical standard normal distribution (red curve), exhibiting a typical bell-shaped pattern. This distributional pattern is critical for establishing statistical significance, as extreme values serve as indicators of potential targets of selection. Subsequently, a Manhattan plot (Figure 3A) was constructed to visualize the spatial distribution of these candidate selection signatures. Using a significance threshold of |iHS| > 2, multiple genomic regions enriched for selection signals were identified. Notably, distinct signal peaks were observed on autosomes 1, 2, and 4 (Chr1, Chr2, Chr4), and on the sex chromosome (ChrZ).

3.5. Gene Annotation and GO/KEGG Pathway Enrichment Analysis

To further elucidate the biological functions of these selected regions, the identified significant SNPs were mapped to the reference genome for gene annotation. A total of 3235 potential candidate genes were identified. Subsequently, functional enrichment analysis was performed on these genes using the Gene Ontology (GO) and KEGG databases.
GO functional enrichment analysis revealed that the identified candidate genes were significantly enriched across three main categories (Figure 3C): Biological Process (BP), Cellular Component (CC), and Molecular Function (MF). The top five most significantly enriched GO terms within each category are detailed (Table 4). In the BP category, the genes were predominantly involved in DNA recombination, smooth muscle contraction, and fatty acid metabolic process, along with other processes such as axonemal dynein complex assembly, transmembrane transport, and cell growth. Regarding the CC category, the candidate gene products were mainly localized in the cell junction, postsynaptic membrane, and cilium, while also showing significant enrichment in muscle-related components like myofibril and myosin filament. Within the MF category, ATP binding exhibited the highest level of significance and the largest gene count, followed by protein serine/threonine kinase activity and motor activity. Furthermore, enrichment in terms such as ionotropic glutamate receptor activity, sphingolipid transporter activity, and structural constituent of muscle. The Xupu goose population possesses specific genetic characteristics related to fat deposition, muscle development, and neural regulation. KEGG pathway enrichment analysis revealed (Figure 3D) that the candidate genes under selection were primarily involved in Glycosphingolipid biosynthesis, Neuroactive ligand-receptor interaction, Selenocompound metabolism, and ABC transporters. The top ten most significantly enriched pathways are listed in detail (Table 5).
KOG functional classification results revealed that the candidate genes were distributed across all 25 functional categories (Figure 3E). Among them, signal transduction mechanisms, General function prediction only, and Function unknown represented the three most enriched categories. Notably, the number of genes involved in Signal transduction mechanisms reached 75, indicating their predominant role in the identified selection signals. In addition, a significant number of genes were associated with Lipid transport and metabolism, Posttranslational modification, protein turnover, chaperones, and the cytoskeleton. These findings suggest that the evolutionary selection process in this population is highly concentrated on pathways related to cellular signaling, protein regulation, and structural development.
Protein–protein interaction network analysis revealed tight interactions among proteins encoded by the candidate genes (Figure 3F), forming multiple distinct functional modules. Within the network structure, the GRIA gene family exhibited high connectivity, constituting one of the core modules. This finding aligns perfectly with the significant GO term “ionotropic glutamate receptor activity”, suggesting that neural signaling and endocrine regulation play pivotal roles in defining the breed’s characteristics. Concurrently, the ATR protein emerged as another major hub node, interacting closely with FANCA and CHEK2, thereby forming a functional sub-network governing DNA repair and cell cycle surveillance.

4. Discussion

4.1. Genome-Wide Variation Patterns

Whole-genome resequencing has emerged as a pivotal tool for characterizing the genetic structure of livestock populations and evaluating germplasm resources [24]. In this study, we leveraged high-throughput WGS data to investigate the genetic diversity and signatures of selection within the Xupu goose population. It is important to note that the average sequencing depth of 7× may limit the detection of rare variants. However, stringent filtering criteria were applied to ensure that the common SNPs used for population structure and selection signature analyses are high-confidence variants. Post-quality control (QC) analysis revealed that the average GC content (~45%) and mapping rate (~98%) were consistent with those reported for other goose breeds [11,20], ensuring the reliability and accuracy of the data for subsequent analyses. Single-Nucleotide Polymorphisms (SNPs) serve as primary markers for assessing genetic diversity and are crucial for reconstructing evolutionary trajectories [25]. In the present study, statistical analysis identified an average of approximately 4.272 million SNPs per individual in the Xupu goose samples. The Ti/Tv ratio of 2.49 observed in this study falls well within the typical range reported for vertebrate genomes. This consistency not only attests to the high accuracy and reliability of the variant detection results, but also effectively rules out potential systematic biases introduced during the sequencing process. The genomic distribution of SNPs is consistent with the findings of Joanna Grzegorczyk et al. [26], being primarily concentrated in intergenic and intronic regions. Analysis of zygosity distribution reveals that the average number of heterozygous sites in Xupu goose samples generally exceeds that of homozygous sites; the observed proportion of heterozygous variants of 54% is within a reasonable range. Typically, long-term high-intensity artificial selection leads to a rapid increase in population homozygosity [27,28]; however, as a superior indigenous breed, the relatively high heterozygosity of the Xupu goose suggests that the population has not experienced a severe genetic bottleneck, thereby possessing potential for environmental adaptation and breeding plasticity. Although non-synonymous mutations account for only a minute proportion of variations, they are often subject to strong purifying selection and eliminated because such mutations can directly lead to amino acid alterations [29].
The length and frequency of ROH reflect individual relatedness; longer and more frequent ROH segments indicate a higher probability of consanguinity, with different segment lengths corresponding to ancient and recent inbreeding histories, respectively [30,31]. This study revealed that ROH in the Xupu goose genome were predominantly short segments (0.1–0.2 Mb), with their frequency decreasing rapidly as segment length increased. The extremely low proportion of long ROH segments (>1 Mb) was observed in the Xupu goose (accounting for only 1.76%). The findings suggest that the observed genomic homozygosity is likely attributable to ancient common ancestry or historical population bottlenecks [32].
Genetic diversity is the core for evaluating the value of livestock and poultry germplasm resources, with expected ( H E ) and observed ( H O ) heterozygosity serving as key indicators for measuring population variation levels [33,34]. Meanwhile, the genomic inbreeding coefficient ( F R O H ) calculated based on runs of homozygosity (ROH) sensitively reflects the loss of population heterozygosity and the depletion trend of genetic variation [35]. For context, Huang et al. [36] reported an average H O of 0.261 and F R O H of 0.223 in the Shitou goose population. Similarly, Zhang et al. [37] observed an average H O of 0.345 alongside a notably elevated F R O H of 0.352 in the Landes breed. In contrast, the Xupu goose population in this study exhibited a lower average H O (0.217), while H E remained at a higher level (0.298), with an average F R O H of 0.204. The genomic inbreeding level of the Xupu goose is comparable to that of other conserved breeds [38], yet remains significantly lower than that observed in the intensively selected Landes breed. Notably, although an F R O H of 0.204 suggests moderate inbreeding pressure, its genomic landscape is characterized by the absolute predominance of short ROH segments and a low proportion of non-synonymous mutations ( P n = 0.313). This unique combination of “high F R O H , predominant short segments, and low P n indicates that the population’s homozygosity primarily stems from long-term historical genetic drift or ancient ancestral bottleneck effects, rather than recent high-intensity inbreeding. This finding aligns with the “managed balanced population” concept proposed by Cendron et al. [39], which maintains breed characteristics while effectively mitigating the risk of recent inbreeding depression. Furthermore, the gap between the high H E level and H O in Xupu geese reveals a unique “heterozygosity potential space”, indicating for future breeding improvement. As stated by Groeneveld et al. [40], indigenous breeds often experience a phased increase in inbreeding coefficients during conservation breeding, which not only elucidates the formation mechanism of the current moderate inbreeding level in Xupu geese but also provides a scientific basis for subsequent marker-assisted selection and sustainable utilization of germplasm resources.
To further elucidate the genetic architecture underlying these metrics, population structure was assessed using a genomic relationship matrix (G-matrix) and Principal Component Analysis (PCA). The G-matrix heatmap revealed specific clusters of elevated genomic similarity. This structural stratification was strongly corroborated by the PCA projection, where the first principal component (PC1) accounted for a substantial 66.23% of the total genetic variance, grouping individuals into distinct sub-clusters. This dual line of evidence confirms the presence of cryptic family-lineage substructures within the ex situ conservation flock. Complementing these findings, Linkage Disequilibrium (LD) decay analysis demonstrated a rapid decline in r 2 over short physical distances—a pattern highly concordant with the predominance of short ROH segments. This reinforces the conclusion that while distinct family substructures exist, the overarching inbreeding signature primarily reflects historical genetic drift rather than recent, intensive consanguineous mating.

4.2. Genomic Signatures of Selection and Biological Functions

In this study, GO and KEGG enrichment analyses of genomic regions under selection, filtered by specific thresholds, revealed that genes such as GRIA1, GRIA4, LEPR and GABRA were significantly enriched in the “Neuroactive ligand-receptor interaction” pathway. AMPA receptors are tetrameric ion channels formed by the combinatorial assembly of GRIA1, GRIA2, GRIA3, and GRIA4 subunits; they function as the primary mediators of fast excitatory neurotransmission in the central nervous system (CNS), facilitating rapid synaptic signal transduction [41]. Furthermore, the trafficking and expression of AMPA receptors play pivotal roles in long-term potentiation (LTP) and long-term depression (LTD), mechanisms that are intrinsically linked to learning, memory, and environmental adaptability in animals [42]. Relevant experiments in chicks have demonstrated that the interaction between NMDA and AMPA receptors plays a significant role in the regulation of food intake [43]. The GABRA gene family encodes the primary inhibitory receptors for gamma-aminobutyric acid (GABA), which are ubiquitously expressed and play critical roles in the mammalian central nervous system (CNS) [44]; dysfunction or loss of GABRA significantly impairs neural development [45]. The leptin receptor belongs to the Class I cytokine receptor family and is primarily responsible for mediating the majority of the biological effects of leptin (LEP) in both the central nervous system and peripheral tissues [46]. Studies have demonstrated that specific haplotype blocks within the LEPR gene are significantly associated with body weight at 49 and 70 days of age, and feed intake in broiler chickens, thereby confirming its pivotal role in regulating growth and appetite in poultry [47,48]. These neural pathways likely facilitated the domestication process by modulating feeding behavior and neural regulation, thereby enabling the geese to better adapt to the environmental conditions of captivity.
Subsequently, during the identification of traits associated with fat deposition and muscle growth, the ACSS2, ACSS3, and PECR genes were found to be enriched in the fatty acid metabolism pathway. Existing studies have indicated that ACSS2 is primarily responsible for the conversion of acetate into acetyl-CoA [49]. In mouse models, ACSS2 deficiency has been shown to significantly attenuate body weight gain under high-fat diet conditions and ameliorate hepatic steatosis, confirming its role in optimizing systemic fat storage and utilization through the selective regulation of genes related to lipid metabolism [50]. Similar to ACSS2, ACSS3 is a pivotal gene in lipid metabolism regulation, functioning primarily to drive propionate catabolism by converting it into propionyl-CoA [51]. Furthermore, studies on PRKAA2 have observed that its expression is significantly upregulated in ducks as dietary energy levels increase, suggesting its potential involvement in fatty acid regulation, lipid biosynthesis, and transport processes [52]. Concurrently, other findings have demonstrated a significant correlation between this gene and meat quality traits, specifically muscle tenderness and pH value [53]. CMYA5 and MTPN, which are involved in the myosin filament pathway and the cell growth pathway, are also significant genes influencing muscle growth. MTPN promotes protein synthesis and induces cellular hypertrophy by upregulating the expression of various cardiac marker genes in cardiomyocytes. Regarding skeletal muscle, Shohei Shiraishi et al. [54] found that MTPN significantly increases the content of structural proteins in murine muscle, while a study by Wang et al. [55] demonstrated that MTPN promotes porcine myocyte differentiation and myotube hypertrophy, results which are consistent with previous reports. Concurrently, MTPN has also been identified as a crucial candidate gene regulating skeletal muscle growth and development in beef cattle. Furthermore, in studies on pork quality, CMYA5 is regarded as a key candidate gene, with research showing it is significantly correlated with drip loss and intramuscular fat content [56,57].
However, lacking a comparative control breed renders the associations of these single-population sweeps putative. Therefore, future research should prioritize expanding the sample size and conducting cross-breed comparative analyses, complemented by multi-omics approaches and functional validation to fully elucidate these complex molecular networks.

5. Conclusions

This study provides a foundational whole-genome resequencing dataset for the “protected indigenous breed” Xupu goose. Population genomics analysis indicates that while the breed retains moderate genetic potential, it currently faces significant historical inbreeding pressure. Crucially, dual evidence from the G-matrix and PCA reveals the presence of a cryptic family substructure within the ex situ conserved population. This structural stratification accurately accounts for the observed heterozygote deficiency and confirms that the population’s homozygosity stems from historical background drift and closed-herd management rather than recent extreme inbreeding. Through intra-population Integrated Haplotype Score (iHS) scanning, we identified multiple candidate genes associated with the breed’s unique phenotypic traits. Genes such as ACSS2, ACSS3 and PECR were identified as candidates for lipid metabolism and fatty liver deposition. Meanwhile, CMYA5, MTPN and LEPR were found to be associated with muscular development. While these single-population associations require future cross-breed and multi-omics validation, these foundational genomic insights provide critical resources and theoretical guidance for optimizing conservation strategies, marker-assisted selection, and sustainable breeding programs for this indispensable waterfowl resource.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/biology15060479/s1. Figure S1: PCA Analysis of XP Population; Table S1: Statistics of ROH Length Distribution; Table S2: Statistics of Individual Inbreeding Coefficients (Froh).

Author Contributions

Conceptualization, G.C. and K.Z.; methodology, K.Z. and Z.A.; formal analysis, K.Z. and Y.C. (Yuchun Cai); validation, K.Z., Y.L.; data curation, K.Z., Y.C. (Yuchun Cai) and Y.C. (Yuhang Cheng); writing—original draft preparation, K.Z. and Y.Z.; writing—review and editing, G.C., K.Z. and Y.Z.; visualization, W.Z.; supervision, Y.Z. and G.C.; project administration, G.C.; funding acquisition, Y.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key Research and Development Program of China [2021YFD1200302].

Institutional Review Board Statement

All animal experimental protocols used in the present study were approved by the Yangzhou Institutional Animal Committee (Approval Number: 2023004742, Date, April 2023). The procedures were performed in accordance with the Regulations for the Administration of Affairs Concerning Experimental Animals (Yangzhou University, China, 2012) and the Standards for the Administration of Experimental Practices (Jiangsu, China, 2008).

Informed Consent Statement

Written informed consent has been obtained from the owner of the animals involved in this study.

Data Availability Statement

All data generated or analyzed during this study are included in this published paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Li, H.-F.; Chen, K.-W.; Yang, N.; Song, W.-T.; Tang, Q.-P. Evaluation of Genetic Diversity of Chinese Native Geese Revealed by Microsatellite Markers. World’s Poult. Sci. J. 2007, 63, 381–390. [Google Scholar] [CrossRef]
  2. Li, H.F.; Zhu, W.Q.; Chen, K.W.; Xu, W.J.; Song, W. Two Maternal Origins of Chinese Domestic Goose. Poult. Sci. 2011, 90, 2705–2710. [Google Scholar] [CrossRef] [PubMed]
  3. Eda, M.; Itahashi, Y.; Kikuchi, H.; Sun, G.; Hsu, K.-H.; Gakuhari, T.; Yoneda, M.; Jiang, L.; Yang, G.; Nakamura, S. Multiple lines of evidence of early goose domestication in a 7000-y-old rice cultivation village in the lower Yangtze River, China. Proc. Natl. Acad. Sci. USA 2022, 119, e2117064119. [Google Scholar] [CrossRef] [PubMed]
  4. Liu, X.; Xu, M.; Qu, X.; Guo, S.; Liu, Y.; He, C.; He, J.; Liu, W. Molecular Cloning, Characterisation, and Expression Analysis of Adipocyte Fatty Acid Binding Protein Gene in Xupu Goose (Anser Cygnoides Domesticus). Br. Poult. Sci. 2019, 60, 659–665. [Google Scholar] [CrossRef]
  5. Chen, G.; Liu, J.; Xu, Q. Current status and prospect of conservation and utilization of goose genetic resources in China. Guide Chin. Poult. 2022, 39, 6–12. [Google Scholar]
  6. Bentley, D.R.; Balasubramanian, S.; Swerdlow, H.P.; Smith, G.P.; Milton, J.; Brown, C.G.; Hall, K.P.; Evers, D.J.; Barnes, C.L.; Bignell, H.R.; et al. Accurate Whole Human Genome Sequencing Using Reversible Terminator Chemistry. Nature 2008, 456, 53–59. [Google Scholar] [CrossRef]
  7. 1000 Genomes Project Consortium. A Map of Human Genome Variation from Population Scale Sequencing. Nature 2010, 467, 1061–1073. [Google Scholar] [CrossRef] [PubMed]
  8. Huang, M.; Sun, J.; Wang, J.; Ye, X.; Chen, Z.; Zhao, X.; Zhang, K.; Ma, L.; Xue, J.; Luo, Y.; et al. Goose multi-omics database: A comprehensive multi-omics database for goose genomics. Poult. Sci. 2025, 104, 104842. [Google Scholar] [CrossRef]
  9. Gao, G.; Zhang, H.; Ni, J.; Zhao, X.; Zhang, K.; Wang, J.; Kong, X.; Wang, Q. Insights into Genetic Diversity and Phenotypic Variations in Domestic Geese through Comprehensive Population and Pan-Genome Analysis. J. Anim. Sci. Biotechnol. 2023, 14, 150. [Google Scholar] [CrossRef]
  10. Wen, J.; Li, H.; Wang, H.; Yu, J.; Zhu, T.; Zhang, J.; Li, X.; Jiang, Z.; Ning, Z.; Qu, L. Origins, Timing and Introgression of Domestic Geese Revealed by Whole Genome Data. J. Anim. Sci. Biotechnol. 2023, 14, 26. [Google Scholar] [CrossRef]
  11. Ren, S.; Lyu, G.; Irwin, D.M.; Liu, X.; Feng, C.; Luo, R.; Zhang, J.; Sun, Y.; Shang, S.; Zhang, S.; et al. Pooled Sequencing Analysis of Geese (Anser Cygnoides) Reveals Genomic Variations Associated with Feather Color. Front. Genet. 2021, 12, 650013. [Google Scholar] [CrossRef]
  12. Sun, G.; Zhao, H.; Mu, X.; Li, X.; Wang, J.; Zhao, M.; Ji, R.; Lv, H.; Li, Y.; Chen, C.; et al. Whole-Genome Re-Sequencing Association Study on Body Size Traits at 10-Weeks of Age in Chinese Indigenous Geese. Front. Vet. Sci. 2024, 11, 1506471. [Google Scholar] [CrossRef]
  13. Zhao, H.; Sun, G.; Mu, X.; Li, X.; Wang, J.; Zhao, M.; Zhang, G.; Ji, R.; Chen, C.; Gao, G.; et al. Genome-Wide Selective Signatures Mining the Candidate Genes for Egg Laying in Goose. BMC Genom. 2023, 24, 750. [Google Scholar] [CrossRef]
  14. Tang, M.; Wang, T.; Zhang, X. A Review of SNP Heritability Estimation Methods. Brief. Bioinform. 2022, 23, bbac067. [Google Scholar]
  15. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.R.; Bender, D.; Maller, J.; Sklar, P.; de Bakker, P.I.W.; Daly, M.J.; et al. PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef] [PubMed]
  16. Liang, H.; Sedillo, J.C.; Schrodi, S.J.; Ikeda, A. Structural Variants in Linkage Disequilibrium with GWAS-Significant SNPs. Heliyon 2024, 10, e32053. [Google Scholar] [CrossRef]
  17. Kim, J.; Macharia, J.K.; Kim, M.; Heo, J.M.; Yu, M.; Choo, H.J.; Lee, J.H. Runs of Homozygosity Analysis for Selection Signatures in the Yellow Korean Native Chicken. Anim. Biosci. 2024, 37, 1683. [Google Scholar] [CrossRef]
  18. Chang, C.C.; Chow, C.C.; Tellier, L.C.; Vattikuti, S.; Purcell, S.M.; Lee, J.J. Second-Generation PLINK: Rising to the Challenge of Larger and Richer Datasets. Gigascience 2015, 4, 7. [Google Scholar] [CrossRef]
  19. Yu, H.; Zhang, K.; Cheng, G.; Mei, C.; Wang, H.; Zan, L. Genome-Wide Analysis Reveals Genomic Diversity and Signatures of Selection in Qinchuan Beef Cattle. BMC Genom. 2024, 25, 558. [Google Scholar] [CrossRef] [PubMed]
  20. Sun, X.; Guo, J.; Li, L.; Zhong, T.; Wang, L.; Zhan, S.; Lu, J.; Wang, D.; Dai, D.; Liu, G.E.; et al. Genetic Diversity and Selection Signatures in Jianchang Black Goats Revealed by Whole-Genome Sequencing Data. Animals 2022, 12, 2365. [Google Scholar] [CrossRef]
  21. McQuillan, R.; Leutenegger, A.-L.; Abdel-Rahman, R.; Franklin, C.S.; Pericic, M.; Barac-Lauc, L.; Smolej-Narancic, N.; Janicijevic, B.; Polasek, O.; Tenesa, A.; et al. Runs of Homozygosity in European Populations. Am. J. Hum. Genet. 2008, 83, 359–372. [Google Scholar] [CrossRef]
  22. Sun, X.; Niu, Q.; Jiang, J.; Wang, G.; Zhou, P.; Li, J.; Chen, C.; Liu, L.; Xu, L.; Ren, H. Identifying Candidate Genes for Litter Size and Three Morphological Traits in Youzhou Dark Goats Based on Genome-Wide SNP Markers. Genes 2023, 14, 1183. [Google Scholar] [CrossRef]
  23. Bu, D.; Luo, H.; Huo, P.; Wang, Z.; Zhang, S.; He, Z.; Wu, Y.; Zhao, L.; Liu, J.; Guo, J.; et al. KOBAS-i: Intelligent Prioritization and Exploratory Visualization of Biological Functions for Gene Enrichment Analysis. Nucleic Acids Res. 2021, 49, W317–W325. [Google Scholar] [CrossRef]
  24. Bello, S.F.; Lawal, R.A.; Adeola, A.C.; Nie, Q. The Study of Selection Signature and Its Applications on Identification of Candidate Genes Using Whole Genome Sequencing Data in Chicken—A Review. Poult. Sci. 2023, 102, 102657. [Google Scholar] [CrossRef] [PubMed]
  25. Vignal, A.; Milan, D.; SanCristobal, M.; Eggen, A. A Review on SNP and Other Types of Molecular Markers and Their Use in Animal Genetics. Genet. Sel. Evol. 2002, 34, 275–305. [Google Scholar] [CrossRef] [PubMed]
  26. Grzegorczyk, J.; Gurgul, A.; Oczkowicz, M.; Szmatoła, T.; Fornal, A.; Bugno-Poniewierska, M. Single Nucleotide Polymorphism Discovery and Genetic Differentiation Analysis of Geese Bred in Poland, Using Genotyping-by-Sequencing (GBS). Genes 2021, 12, 1074. [Google Scholar] [CrossRef] [PubMed]
  27. Muir, W.M.; Wong, G.K.-S.; Zhang, Y.; Wang, J.; Groenen, M.A.M.; Crooijmans, R.P.M.A.; Megens, H.-J.; Zhang, H.; Okimoto, R.; Vereijken, A.; et al. Genome-Wide Assessment of Worldwide Chicken SNP Genetic Diversity Indicates Significant Absence of Rare Alleles in Commercial Breeds. Proc. Natl. Acad. Sci. USA 2008, 105, 17312–17317. [Google Scholar] [CrossRef]
  28. Rubin, C.-J.; Zody, M.C.; Eriksson, J.; Meadows, J.R.S.; Sherwood, E.; Webster, M.T.; Jiang, L.; Ingman, M.; Sharpe, T.; Ka, S.; et al. Whole-Genome Resequencing Reveals Loci under Selection during Chicken Domestication. Nature 2010, 464, 587–591. [Google Scholar] [CrossRef]
  29. Ahad, M.A. The Neutral Theory (Theory of Genetic Drift) and the Nearly Neutral Theory of Molecular Evolution Are Opposite to Evolution. Int. J. Bio-Resour. Stress Manag. 2023, 14, 1016–1027. [Google Scholar] [CrossRef]
  30. Ai, H.; Fang, X.; Yang, B.; Huang, Z.; Chen, H.; Mao, L.; Zhang, F.; Zhang, L.; Cui, L.; He, W.; et al. Adaptation and Possible Ancient Interspecies Introgression in Pigs Identified by Whole-Genome Sequencing. Nat. Genet. 2015, 47, 217–225. [Google Scholar] [CrossRef]
  31. Curik, I.; Ferenčaković, M.; Sölkner, J. Inbreeding and Runs of Homozygosity: A Possible Solution to an Old Problem. Livest. Sci. 2014, 166, 26–34. [Google Scholar] [CrossRef]
  32. Forutan, M.; Ansari Mahyari, S.; Baes, C.; Melzer, N.; Schenkel, F.S.; Sargolzaei, M. Inbreeding and Runs of Homozygosity before and after Genomic Selection in North American Holstein Cattle. BMC Genom. 2018, 19, 98. [Google Scholar] [CrossRef] [PubMed]
  33. Kanaka, K.K.; Sukhija, N.; Goli, R.C.; Singh, S.; Ganguly, I.; Dixit, S.P.; Dash, A.; Malik, A.A. On the Concepts and Measures of Diversity in the Genomics Era. Curr. Plant Biol. 2023, 33, 100278. [Google Scholar] [CrossRef]
  34. Qin, H.; Li, X.; Wang, J.; Sun, G.; Mu, X.; Ji, R. Ovarian Transcriptome Profile from Pre-Laying Period to Broody Period of Xupu Goose. Poult. Sci. 2021, 100, 101403. [Google Scholar] [CrossRef]
  35. Rebelato, A.B.; Caetano, A.R. Runs of Homozygosity for Autozygosity Estimation and Genomic Analysis in Production Animals. Pesqui. Agropecu. Bras. 2018, 53, 975–984. [Google Scholar] [CrossRef]
  36. Huang, H.Y.; Zhang, L.Y.; Huang, Z.R.; Zhang, X.T.; Chen, Y.; Wu, Y.B.; Zhao, Y.; Liu, D.L.; Lin, Z.L.; Sun, Y.H.; et al. Population genetic diversity and genome-wide association analysis of body weight and body size in Shitou goose. Acta Vet. Zootech. Sin. 2024, 55, 3914–3924. [Google Scholar]
  37. Zhang, T.; Cheng, S.; Zhang, X.; Ye, F.; Guan, Y.; Qu, K.; Qi, X.; Xiang, H.; Zhang, X. Analysis of genomic genetic structure and ROH assessment in different Landes goose populations. J. Agric. Biotechnol. 2026, 34, 556–565. [Google Scholar]
  38. Qi, S.; Wu, T.; Wu, H.; Liang, Y.; Zhao, W.; Zhang, Y.; Xu, Q.; Chen, G. Whole-genome resequencing reveals the population structure and domestication processes of endemic endangered goose breeds (Anser cygnoides). Poult. Sci. 2025, 104, 105004. [Google Scholar] [CrossRef]
  39. Cendron, F.; Mastrangelo, S.; Tolone, M.; Perini, F.; Lasagna, E.; Cassandro, M. Genome-Wide Analysis Reveals the Patterns of Genetic Diversity and Population Structure of 8 Italian Local Chicken Breeds. Poult. Sci. 2021, 100, 441–451. [Google Scholar] [CrossRef]
  40. Groeneveld, L.F.; Lenstra, J.A.; Eding, H.; Toro, M.A.; Scherf, B.; Pilling, D.; Negrini, R.; Finlay, E.K.; Jianlin, H.; Groeneveld, E.; et al. Genetic Diversity in Farm Animals—A Review. Anim. Genet. 2010, 41, 6–31. [Google Scholar] [CrossRef]
  41. Traynelis, S.F.; Wollmuth, L.P.; McBain, C.J.; Menniti, F.S.; Vance, K.M.; Ogden, K.K.; Hansen, K.B.; Yuan, H.; Myers, S.J.; Dingledine, R. Glutamate Receptor Ion Channels: Structure, Regulation, and Function. Pharmacol. Rev. 2010, 62, 405–496. [Google Scholar] [CrossRef]
  42. Malinow, R.; Malenka, R.C. AMPA Receptor Trafficking and Synaptic Plasticity. Annu. Rev. Neurosci. 2002, 25, 103–126. [Google Scholar] [CrossRef]
  43. Adeli, A.; Zendehdel, M.; Babapour, V.; Panahi, N. Interaction between Leptin and Glutamatergic System on Food Intake Regulation in Neonatal Chicken: Role of NMDA and AMPA Receptors. Int. J. Neurosci. 2020, 130, 713–721. [Google Scholar] [CrossRef] [PubMed]
  44. García-Martín, E.; Esguevillas, G.; Serrador, M.; Alonso-Navarro, H.; Navacerrada, F.; Amo, G.; García-Albea, E.; Agúndez, J.A.G.; Jiménez-Jiménez, F.J. Gamma-Aminobutyric Acid (GABA) Receptors GABRA4, GABRE, and GABRQ Gene Polymorphisms and Risk for Migraine. J. Neural Transm. 2018, 125, 689–698. [Google Scholar] [CrossRef] [PubMed]
  45. Samarut, É.; Swaminathan, A.; Riché, R.; Liao, M.; Hassan-Abdi, R.; Renault, S.; Allard, M.; Dufour, L.; Cossette, P.; Soussi-Yanicostas, N.; et al. γ-Aminobutyric Acid Receptor Alpha 1 Subunit Loss of Function Causes Genetic Generalized Epilepsy by Impairing Inhibitory Network Neurodevelopment. Epilepsia 2018, 59, 2061–2074. [Google Scholar] [CrossRef] [PubMed]
  46. Horev, G.; Einat, P.; Aharoni, T.; Eshdat, Y.; Friedman-Einat, M. Molecular Cloning and Properties of the Chicken Leptin-Receptor (CLEPR) Gene. Mol. Cell. Endocrinol. 2000, 162, 95–106. [Google Scholar] [CrossRef]
  47. Karabağ, K.; Alkan, S.; Karslı, T.; İkten, C.; Şahin, İ.; Mendeş, M. Effects of Selection in Terms of Meat Yield Traits on Leptin Receptor Gene in Japanese Quail Lines. Slov. Vet. Res. 2022, 59, 89–98. [Google Scholar] [CrossRef]
  48. El Moujahid, E.M.; Chen, S.; Jin, S.; Lu, Y.; Zhang, D.; Ji, C.; Yang, N. Association of Leptin Receptor Gene Polymorphisms with Growth and Feed Efficiency in Meat-Type Chickens. Poult. Sci. 2014, 93, 1910–1915. [Google Scholar] [CrossRef]
  49. Huang, Z.; Zhang, M.; Plec, A.A.; Estill, S.J.; Cai, L.; Repa, J.J.; McKnight, S.L.; Tu, B.P. ACSS2 Promotes Systemic Fat Storage and Utilization through Selective Regulation of Genes Involved in Lipid Metabolism. Proc. Natl. Acad. Sci. USA 2018, 115, E9499–E9506. [Google Scholar] [CrossRef]
  50. Chen, N.; Zhao, M.; Wu, N.; Guo, Y.; Cao, B.; Zhan, B.; Li, Y.; Zhou, T.; Zhu, F.; Guo, C.; et al. ACSS2 Controls PPARγ Activity Homeostasis to Potentiate Adipose-Tissue Plasticity. Cell Death Differ. 2024, 31, 479–496. [Google Scholar] [CrossRef]
  51. Jia, Z.; Chen, X.; Chen, J.; Zhang, L.; Oprescu, S.N.; Luo, N.; Xiong, Y.; Yue, F.; Kuang, S. ACSS3 in Brown Fat Drives Propionate Catabolism and Its Deficiency Leads to Autophagy and Systemic Metabolic Dysfunction. Clin. Transl. Med. 2022, 12, e665. [Google Scholar] [CrossRef]
  52. Cho, J.; Jeong, J.; Jeong, Y.; Heo, J.M.; Choi, I. Gene Expression Patterns in Korean Native Ducks (Anas platyrhynchos) with Different Apparent Metabolisable Energy (AME) levels (AME) (Anas platyrhynchos). Livest. Sci. 2017, 202, 67–73. [Google Scholar] [CrossRef]
  53. Talpur, M.Z.; Abdulwahid, A.M.; Ahmed, I. Chicken meat quality; association with different gene expression, physiochemical properties and muscle glycogen. Pak. J. Agric. Sci. 2018, 55, 979–994. [Google Scholar]
  54. Shiraishi, S.; Nakamura, Y.-N.; Iwamoto, H.; Haruno, A.; Sato, Y.; Mori, S.; Ikeuchi, Y.; Chikushi, J.; Hayashi, T.; Sato, M.; et al. S-Myotrophin Promotes the Hypertrophy of Skeletal Muscle of Mice in vivo. Int. J. Biochem. Cell Biol. 2006, 38, 1114–1122. [Google Scholar] [CrossRef] [PubMed]
  55. Wang, L.; Wang, Y. Molecular Characterization, Expression Patterns and Subcellular Localization of Myotrophin (MTPN) Gene in Porcine Skeletal Muscle. Mol. Biol. Rep. 2012, 39, 2733–2738. [Google Scholar] [CrossRef] [PubMed]
  56. Bordbar, F.; Jensen, J.; Du, M.; Abied, A.; Guo, W.; Xu, L.; Gao, H.; Zhang, L.; Li, J. Identification and validation of a novel candidate gene regulating net meat weight in Simmental beef cattle based on imputed next-generation sequencing. Cell Prolif. 2020, 53, e12870. [Google Scholar] [CrossRef] [PubMed]
  57. Xu, X.; Xu, X.; Yin, Q.; Sun, L.; Liu, B.; Wang, Y. The Molecular Characterization and Associations of Porcine Cardiomyopathy Asssociated 5 (CMYA5) Gene with Carcass Trait and Meat Quality. Mol. Biol. Rep. 2011, 38, 2085–2090. [Google Scholar] [CrossRef]
Figure 1. Sequencing quality, SNP variation characteristics, and genomic inbreeding patterns derived from whole-genome resequencing of Xupu geese: (A) Alignment statistics of sequencing reads to the reference genome and average sequencing depth. (B) Distribution of SNP heterozygosity rates across individuals. (C) Statistics of genome-wide SNP variant types. (D) Statistics of genome-wide SNP variant types. Genomic inbreeding coefficients FROH) estimated based on Runs of Homozygosity (ROH). (E) Proportional distribution of ROH across different length categories.
Figure 1. Sequencing quality, SNP variation characteristics, and genomic inbreeding patterns derived from whole-genome resequencing of Xupu geese: (A) Alignment statistics of sequencing reads to the reference genome and average sequencing depth. (B) Distribution of SNP heterozygosity rates across individuals. (C) Statistics of genome-wide SNP variant types. (D) Statistics of genome-wide SNP variant types. Genomic inbreeding coefficients FROH) estimated based on Runs of Homozygosity (ROH). (E) Proportional distribution of ROH across different length categories.
Biology 15 00479 g001
Figure 2. Genetic diversity and population structure analysis of the Xupu goose: (A) Genomic relationship matrix (G-matrix) heatmap. The color gradient represents the genomic kinship coefficient, with lighter blocks indicating higher pairwise relatedness, revealing family substructures within the population. (B) Distribution of heterozygosity. The bar chart compares the expected heterozygosity (HE, red) and observed heterozygosity (HO, blue) for each sample. (C) Linkage disequilibrium (LD) decay. (D) Correlation between observed heterozygosity (HO) and inbreeding coefficients (F). The scatter plot illustrates the linear regression between HO (x-axis) and F (y-axis), with point color intensity corresponding to the magnitude of the F value.
Figure 2. Genetic diversity and population structure analysis of the Xupu goose: (A) Genomic relationship matrix (G-matrix) heatmap. The color gradient represents the genomic kinship coefficient, with lighter blocks indicating higher pairwise relatedness, revealing family substructures within the population. (B) Distribution of heterozygosity. The bar chart compares the expected heterozygosity (HE, red) and observed heterozygosity (HO, blue) for each sample. (C) Linkage disequilibrium (LD) decay. (D) Correlation between observed heterozygosity (HO) and inbreeding coefficients (F). The scatter plot illustrates the linear regression between HO (x-axis) and F (y-axis), with point color intensity corresponding to the magnitude of the F value.
Biology 15 00479 g002
Figure 3. Comprehensive Analysis of Genomic Genetic Diversity and Positive Selection Signatures in Xupu Goose: (A) Manhattan plot of genome-wide iHS selection signals; (B) Genome-wide distribution of iHS statistics: comparison between the score distribution and the theoretical normal distribution; (C) Bar plot of GO functional enrichment; (D) Statistical plot of KEGG pathway enrichment: major metabolic pathways and biological systems associated with candidate genes within selected regions; (E) Statistical plot of KOG functional classification; (F) Statistical plot of KOG functional classification.
Figure 3. Comprehensive Analysis of Genomic Genetic Diversity and Positive Selection Signatures in Xupu Goose: (A) Manhattan plot of genome-wide iHS selection signals; (B) Genome-wide distribution of iHS statistics: comparison between the score distribution and the theoretical normal distribution; (C) Bar plot of GO functional enrichment; (D) Statistical plot of KEGG pathway enrichment: major metabolic pathways and biological systems associated with candidate genes within selected regions; (E) Statistical plot of KOG functional classification; (F) Statistical plot of KOG functional classification.
Biology 15 00479 g003
Table 1. Statistical Summary of Sequencing Data Quality for the Xupu Goose.
Table 1. Statistical Summary of Sequencing Data Quality for the Xupu Goose.
ItemRaw DataRaw ReadsClean DataClean ReadsQ20Q30GC
XP.17.2824,271,6327.2124,034,90997.0190.6143.75
XP.27.8726,227,4497.7825,934,23797.8693.1444.52
XP.37.3624,527,5127.2824,276,56996.5989.7543.59
XP.46.7122,364,5436.6422,145,94096.7689.9843.61
XP.56.3121,029,9776.2420,798,31395.8687.5144.23
XP.65.9619,863,3665.9019,662,64797.7392.7043.81
XP.77.8719,277,1057.7919,073,64397.4191.7343.98
XP.87.8421,761,8927.7621,536,17197.6692.5043.77
XP.98.0923,672,4188.0123,421,84097.3991.6944.06
XP.106.4226,225,7996.3625,957,96297.1691.2743.65
XP.116.3126,142,7986.2525,875,17096.9290.5443.54
XP.125.6026,981,5875.5426,711,71397.6292.3543.68
XP.135.7821,395,2405.7221,184,29697.8092.9943.31
XP.146.5321,048,8076.4620,842,46297.2691.3443.43
XP.157.1018,652,0747.0318,464,47597.3891.6643.62
Average6.8722,896,1476.8022,661,35697.2391.3243.77
Note: Raw Data, total output of raw sequencing data (Gb); Raw Reads, total count of raw reads; Clean Data, high-quality data after filtering (Gb); Clean Reads, total count of clean reads; Q20 and Q30, percentage of bases with Phred quality scores > 20 (99% accuracy) and > 30 (99.9% accuracy), respectively; GC, proportion of Guanine and Cytosine bases.
Table 2. Statistics of SNP variations identified across Xupu goose samples.
Table 2. Statistics of SNP variations identified across Xupu goose samples.
ItemSNPTiTvTi/Tv (%)HeHo
XP.14,489,1913,199,9201,289,2712.482,520,6261,968,565
XP.24,530,1043,236,0551,294,0492.502,658,0061,872,098
XP.34,449,3273,171,5161,277,8112.482,526,2591,923,068
XP.44,226,9563,016,0671,210,8892.492,188,9102,038,046
XP.54,022,6852,874,6881,147,9972.502,106,1881,916,497
XP.64,026,5562,876,1731,150,3832.502,065,9521,960,604
XP.73,971,3162,835,4291,135,8872.502,036,5451,934,771
XP.84,141,7302,956,3491,185,3812.492,155,0201,986,710
XP.94,141,7303,091,0421,239,4092.492,305,1672,025,284
XP.104,516,0933,217,2571,298,8362.482,520,9621,995,131
XP.114,445,8903,165,6131,280,2772.472,366,2192,079,671
XP.124,445,8903,298,7951,330,6742.482,661,1741,968,295
XP.134,224,2633,011,2181,213,0452.482,273,1801,951,083
XP.144,198,3502,993,0751,205,2752.482,290,0771,908,273
XP.153,877,5342,766,1821,111,3522.492,008,9601,868,574
Average4,271,9943,047,2921,224,7022.492,312,2161,959,778
Note: SNP, the total number of single-nucleotide polymorphism sites detected within the sample; Ti (Transition), refers to the substitution between bases of the same chemical type; Tv (Transversion), refers to the substitution between bases of different chemical types; Ti/Tv, the ratio of the frequency of transitions to the frequency of transversions; He, Expected Heterozygosity; Ho, Observed Heterozygosity.
Table 3. Statistics of SNP functional annotation.
Table 3. Statistics of SNP functional annotation.
Annotation TypeMean ± SD
Upstream66,369.93 ± 3090.43
Alternative Splicing143.53 ± 14.30
Synonymous35,062.87 ± 1438.87
Non-synonymous12,458.07 ± 747.76
Intron1,821,812.33 ± 73,183.17
Downstream71,263.67 ± 3329.00
Up/Downstream10,775.33 ± 426.38
Intergenic1,223,202.67 ± 52,416.49
Table 4. Top significantly enriched GO terms of candidate genes in Xupu goose.
Table 4. Top significantly enriched GO terms of candidate genes in Xupu goose.
ItemIDTermsNpGenes Name
BPGO:0006310DNA recombination59.86 × 10−4ACTR8/RECQL5/MND1/RTEL1/LIG3
GO:0070286Axonemal dynein complex assembly42.28 × 10−3ZC3H7A/CCDC62/WDR86/WDR88
GO:0006939Smooth muscle contraction84.26 × 10−3SYCP1/MAD1L1/ERC2/HTR1D/LOC106039360/GOLGA3/ERC1/CCDC73
GO:0055085Transmembrane transport54.41 × 10−3ABCC10/LOC106041486/FLVCR2/ABCC5/SPNS
GO:0006631Fatty acid metabolic process45.54 × 10−3PECR/ACSS3/AASDH/ACSS2
CCGO:0005929Cilium105.44 × 10−3ANKS6/ARL13B/WDR86/NEK1/IFT122/RAB12/WDR88/BBS12/NEK10/CEP131
GO:0030054Cell junction321.02 × 10−2GPHN/LOC106046349/DTNB/GRIA3/GABRA4/GRIA2/GRIA1/SRGAP2/SYNE2/GRIK4/GRIA4, etc.
GO:0032982Myosin filament81.13 × 10−2ERC1/GOLGA3/CCDC73/IQCE/CMYA5/MAD1L1/SYCP1/ERC2
GO:0030016Myofibril91.42 × 10−2ERC1/GOLGA3/IQCE/CCDC73/CMYA5/SYCP1/MAD1L1/ERC2/BDP1
GO:0033017Sarcoplasmic reticulum membrane32.48 × 10−2ATP2A2/FKBP1B/FKBP6
MFGO:0005524ATP binding22.34 × 10−8KASH5
GO:0005234Extracellular glutamate-gated ion channel activity745.16 × 10−6UBE2T/MAP3K15/CSNK1G3/PIK3CD/ABCB7/LOC106043431/DPH6/AASDH/HSP90AB1, etc.
GO:0004970Ionotropic glutamate receptor activity57.71 × 10−5GRIA3/GRIA4/GRIA1/GRIK4/GRIA2
GO:0046624Sphingolipid transporter activity)142.22 × 10−2MAP3K15/CSNK1G3/KALRN/CAMKK2/STK24/ATR/PRKAA2/NEK1/BRSK2/TTBK2/NEK10/ADCK1/CPNE3/MAP3K19
GO:0003774Motor activity106.18 × 10−3ERC1/GOLGA3/CCDC73/MAD1L1/SYCP1/ERC2
Note: GO Category refers to the primary domains of the Gene Ontology (GO) system. It comprises three fundamental classifications: BP (Biological Process), which describes the broad biological objectives or biochemical pathways to which a gene product contributes; CC (Cellular Component), which specifies the precise locations within the cell where a gene product exerts its function; and MF (Molecular Function), defines the fundamental activities of a gene product at the molecular level. Terms provides the specific functional descriptions associated with each GO entry; N indicates the number of candidate genes from this study that are significantly enriched within a specific GO term. Genes Name: Lists the specific symbols of the candidate genes associated with the corresponding GO term; p (p-value), the statistical significance of the enrichment analysis; a value of p < 0.05 typically indicates that the enrichment of candidate genes in that pathway is highly significant and unlikely to have occurred by chance; Genes Name, lists the specific symbols of the candidate genes associated with the corresponding GO term.
Table 5. KEGG pathway enrichment analysis of candidate genes in Xupu goose.
Table 5. KEGG pathway enrichment analysis of candidate genes in Xupu goose.
PathwayGene RatiopNGenes Name
Glycosphingolipid biosynthesis—globo and isoglobo series2.155.25 × 10−33B3GALT5/FUT9/ST3GAL2
Neuroactive ligand-receptor interaction11.12.33 × 10−216CNR1/GABRA4/GRIA1/GRIA2/GRIA3/GRIA4/GRIK4/HTR1D/LEPR/LOC106032793/LOC106039360/LOC106045569/LPAR4/P2RX6/P2RY8/SSTR2
ABC transporters2.873.19 × 10−24ABCB7/ABCC10/ABCC5
Selenocompound metabolism1.455.77 × 10−22MTR/SCLY
Lysine degradation2.875.89 × 10−24KMT2C/LOC106036705/NSD2/PRDM2
Thiamine metabolism1.436.49 × 10−22AK7/LOC106046413
Adrenergic signaling in cardiomyocytes5.036.64 × 10−27ATP2A2/CACNA2D1/CACNB4/LOC106048956/MAPK12/PLCB2/RAPGEF4
Ribosome biogenesis in eukaryotes3.598.13 × 10−25LOC125184964/LOC125184965/RCL1/SPATA5/XRN1
Pentose and glucuronate interconversions 1.448.78 × 10−32CRPPA/XYLB
One carbon pool by folate1.428.78 × 10−32LOC106043431/MTR
Note: Pathway, Refers to the specific name assigned to a metabolic or signaling pathway; Gene Ratio, The ratio of the number of candidate genes enriched in a specific pathway to the total number of candidate genes annotated in the entire dataset; p (p-value), The statistical significance of the enrichment analysis; a value of p < 0.05 typically indicates that the enrichment of candidate genes in that pathway is highly significant and unlikely to have occurred by chance; N, Indicates the number of candidate genes from the dataset that are significantly enriched within a particular pathway; Genes Name, Lists the symbols of candidate genes associated with the specific pathway.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhu, K.; Ai, Z.; Cai, Y.; Li, Y.; Cheng, Y.; Zhang, Y.; Zhao, W.; Chen, G. Genomic Evaluation of the Genetic Structure and Analysis of Selective Evolutionary Signatures of Xupu Goose. Biology 2026, 15, 479. https://doi.org/10.3390/biology15060479

AMA Style

Zhu K, Ai Z, Cai Y, Li Y, Cheng Y, Zhang Y, Zhao W, Chen G. Genomic Evaluation of the Genetic Structure and Analysis of Selective Evolutionary Signatures of Xupu Goose. Biology. 2026; 15(6):479. https://doi.org/10.3390/biology15060479

Chicago/Turabian Style

Zhu, Kairui, Zhenkang Ai, Yuchun Cai, Yonghao Li, Yuhang Cheng, Yang Zhang, Wenming Zhao, and Guohong Chen. 2026. "Genomic Evaluation of the Genetic Structure and Analysis of Selective Evolutionary Signatures of Xupu Goose" Biology 15, no. 6: 479. https://doi.org/10.3390/biology15060479

APA Style

Zhu, K., Ai, Z., Cai, Y., Li, Y., Cheng, Y., Zhang, Y., Zhao, W., & Chen, G. (2026). Genomic Evaluation of the Genetic Structure and Analysis of Selective Evolutionary Signatures of Xupu Goose. Biology, 15(6), 479. https://doi.org/10.3390/biology15060479

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop