Next Article in Journal
Can Different Dietary Protein Sources Influence the Survival, Growth, and Physiology of 0+Marron (Cherax cainii) Exposed to Feed Deprivation?
Previous Article in Journal
Expression of αv Integrin in Feline Injection-Site Sarcoma (FISS): Preliminary Investigations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Development and Validation of a 5K Liquid Chip for Identifying Cashmere Goat Populations in Inner Mongolia Autonomous Region

1
College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, China
2
Grassland Research Institute, Chinese Academy of Agricultural Sciences, Hohhot 010018, China
3
Inner Mongolia Key Laboratory of Sheep & Goat Genetics Breeding and Reproduction, Hohhot 010018, China
4
Key Laboratory of Mutton Sheep & Goat Genetics and Breeding, Ministry of Agriculture And Rural Affairs, Hohhot 010018, China
*
Authors to whom correspondence should be addressed.
Animals 2024, 14(24), 3589; https://doi.org/10.3390/ani14243589
Submission received: 25 October 2024 / Revised: 28 November 2024 / Accepted: 9 December 2024 / Published: 12 December 2024
(This article belongs to the Section Animal Genetics and Genomics)

Simple Summary

Cashmere goats, a characteristic species, are rich in genetic resources. Protecting and rationally utilizing these genetic resources is of great significance for the genetic improvement of cashmere goats. In the Inner Mongolia Autonomous Region, there are five nationally recognized populations of cashmere goats: the Arbas type (ARBS), Erlangshan type (ELS), Alashan type (ALS), Hanshan white cashmere goats (HS), and Ujimqin white cashmere goats (WZMQ), as well as three breeds: Inner Mongolia white cashmere goats, Hanshan white cashmere goats, and Ujimqin white cashmere goats. This study aims to protect and utilize the genetic resources of Inner Mongolia‘s cashmere goats by developing low-density liquid chips. The low-density liquid chip used for identifying cashmere goats in the Inner Mongolia Autonomous Region contains 5002 SNP sites. The sites of this chip are distributed across all chromosomes except the Y chromosome. Most of these SNP sites are located in intergenic and intronic regions. Principal component analysis and phylogenetic tree construction indicated that the ARBS, ELS, and ALS populations cluster into one category, suggesting that the chip cannot accurately distinguish among these three populations. However, this chip can accurately identify the three breeds: Inner Mongolia white cashmere goats, Hanshan white cashmere goats, and Ujimqin white cashmere goats.

Abstract

(1) Background: Cashmere goats, as one of the characteristic species, are rich in genetic resources. Protecting and rationally utilizing these genetic resources is of great significance for the genetic improvement of cashmere goats. (2) Methods: In this study, tissue samples were collected from Inner Mongolia white cashmere goats, which included the Arbas type (ARBS); Erlangshan type (ELS); Alashan type (ALS), Hanshan white cashmere goats (HS), and Ujimqin white cashmere goats (WZMQ). Genomic DNA was extracted and subjected to high-depth genome resequencing. GenoBait technology was used for probe design and site optimization, followed by the synthesis of a low-density liquid phase chip. Finally, a total of 281 individuals from five cashmere goat populations in the Inner Mongolia Autonomous Region were randomly selected to verify the chip. (3) Results: The results showed that a total of 5002 SNP sites were finally screened and retained for the synthesis of the low-density liquid chip for identifying cashmere goats in Inner Mongolia Autonomous Region. Principal component analysis and phylogenetic tree construction indicated that the ARBS, ELS, and ALS populations were clustered into one category. (4) Conclusions: This chip can accurately identify the three breeds: Inner Mongolia white cashmere goats, Hanshan white cashmere goats, and Ujimqin white cashmere goats.

1. Introduction

As one of the oldest domesticated animals, goats trace their origins back to the region near the Zagros Mountains, with a remarkable breeding history spanning approximately 10,000 years [1]. Integral to the development of human civilization, goats are found worldwide and serve as a crucial resource. They provide not only valued meat and nutritious milk, but also precious fiber, significantly contributing to human sustenance and well-being [2]. In China, there are a total of 78 goat breeds, including 60 local breeds and 18 improved breeds. China has the richest resources of cashmere goats, leading the world in both population and cashmere production. Among these, Inner Mongolia cashmere goats and Liaoning cashmere goats are important genetic resources in China, renowned for their cashmere quality and yield, respectively. Both breeds serve as paternal sources for the development of cashmere goat breeds in China. The Inner Mongolia cashmere goats have been included in the first batch of breeds for genetic resources protection of livestock and poultry in China. Within Inner Mongolia Autonomous Region, there are three cashmere goat breeds, namely Inner Mongolia white cashmere goat, Hanshan white cashmere goat, and Ujimqin white cashmere goat. The Inner Mongolia white cashmere goats include three types based on geographical location, the Inner Mongolia Arbas white cashmere goat, Inner Mongolia Erlangshan white cashmere goat, and Inner Mongolia Alashan white cashmere goat. There are notable differences among the cashmere goat breeds, primarily in terms of external characteristics, environmental adaptability, and production performance. The cashmere goats in the Inner Mongolia Autonomous Region are favored by breeders and consumers for their high cashmere production, superior cashmere quality, and delicious meat.
Breed identification in animals involves the application of suitable methods to distinguish individuals of unknown breeds. This process is crucial for the conservation of genetic resources and the enhancement of animal production and reproduction [3]. In the past, due to the limitations of knowledge and technology, breed identification relied primarily on morphological characteristics, such as coat color, body type, ear shape, and other features. For cashmere goats, horn shape was a key criterion. Identifying breeds based on external morphology required an in-depth understanding of the distinctive features of various breeds and extensive experience in classification and breeding, which made this method somewhat restrictive. Moreover, due to the uniformity among animals and the variability in their appearance, this method was prone to significant errors [4,5]. Given the low accuracy of breed identification through external morphology, researchers have turned to physical and chemical indicators, including protein analysis, iris recognition, and infrared spectroscopy technology. Ashoor et al. (1988) successfully identified breeds by analyzing quantitative chromatographic differences in meat slices, achieving promising results. While the accuracy of these technologies has been greatly improved, they remain susceptible to environmental interference and sample quality issues, and the detection process can be cumbersome [6].
Following the 20th century, the rapid development of molecular biology technology has unveiled the potential of DNA molecules in significantly enhancing the precision of animal breed identification. This field encompasses a range of DNA molecular markers, including RFLP, AFLP, SSR, and SNP. In previous studies, microsatellite and ALFP markers were used for such identification [7,8]. However, with the evolution of gene chip technology, SNP loci have become increasingly utilized in breed identification. SNPs are favored for their minimal genotyping error rate, extensive genomic distribution, rapid detection capabilities, and standardization ease. Negrini et al. employed the δ method in breed classification based on the absolute difference in minor allele frequency between breeds [9]. Weir et al. leveraged Wright’s FST method to discriminate breeds by maximizing allele frequency variances between predefined breed groups [10]. However, these methods are limited to distinguishing between pairs of breeds. Paschou et al. introduced a PCA-based approach to extract informative SNP loci, contributing to the elucidation of population structures [11]. Despite the promotion and application of molecular breeding techniques, the majority of research on Inner Mongolia cashmere goats has concentrated on screening candidate genes and analyzing genes expression levels [12,13,14]. The purpose of this study was to develop a low density liquid chip for identification of cashmere goat populations in Inner Mongolia Autonomous Region. This is of great significance for the conservation of cashmere goat breed resources within the region.

2. Materials and Methods

2.1. Data Collection and SNP Calling

Whole-genome resequencing data were collected from 45 individuals in five cashmere goat populations (Inner Mongolia Arbas white cashmere goats, ARBAS; Inner Mongolia Erlangshan white cashmere goats, ELS; Inner Mongolia Alashan white cashmere goats, ALS; Hanshan white cashmere goats, HS; and Ujimqin white cashmere goats, WZMQ) in Inner Mongolia Autonomous Region. Detailed sequencing information can be found in our previous study [15]. According to the outcomes of principal component analysis (PCA) and evolutionary tree analysis, the resequencing data from three individuals (ELS-5, ALS-2, and ALS-9) were removed, leaving data from 42 individuals for site screening. To ensure the quality of the SNPs, vcftools (v0.1.16) was used to filter all SNPs [16]. The filtering criteria were as follows: sequencing depth 5X, missing rate < 10%,heterozygosity rate < 30%.

2.2. Probe Design and Selection of Specific SNP Sites

After quality control, high-quality SNPs were selected based on the following criteria: a length of 110 base pairs, a GC content ranging from 30% to 70%, and no more than five homologous regions. For SNPs that met these conditions and complied with probe design principles, minor allele frequency (MAF) values were calculated for each site across the five cashmere goat populations.
Additionally, the ΔMAF value, representing the difference in MAF between two populations, was determined for each site. The 1500 SNP sites with the highest △MAF value were selected from each population, with duplicate loci removed. This process resulted in a total of 7298 SNP sites being retained. Considering the probe panel capacity and the excess of sex chromosome sites, all autosomal sites and the top sites with the highest mean ΔMAF for sex chromosomes across the five cashmere goat populations were selected. Initially, 6000 sites were retained for testing and analysis. According to the detection results, some sites with low detection rates were deleted, and ultimately, 5002 SNP sites were kept. The probe and chip synthesis tasks were completed by MOLBREEDING Biotechnology Co., Ltd. (Shijiazhuang, China). The ANNOVAR software (version 3) was utilized to perform gene functional annotation for the sites in this chip [17].

3. Validation of 5K Liquid Chip

3.1. DNA Collection and Library Construction

In order to further verify the reliability of the genetic resource identification chip for five cashmere goat breeds in Inner Mongolia Autonomous Region, we tested 300 qualified cashmere goat DNA samples. The samples were from 60 individuals in each population. The genomic DNA was evaluated using 1% (w/v) agarose gel electrophoresis, and its quantity was determined with the Qubit 2.0 Fluorometer (Invitrogen, Shanghai, China). Genomic DNA libraries were prepared utilizing the GenoBauts DNA-seq Library Prep Kit (MolBreeding Biotechnology Co., Shijiazhuang, Hebei, China), adhering to the manufacturer’s guidelines. These libraries were then hybridized with biotin-labeled target probes at 65 °C for 16 h. To capture the hybridized targets and eliminate non-target fragments, Dynabeads MyOne Streptavidin C1 and binding buffer were employed. The captured fragments were subsequently amplified using library amplification primers and DNA polymerase. After two rounds of purification with Beckman AMPure Beads, the libraries were re-quantified using the Qubit 2.0 Fluorometer (Invitrogen, Shanghai, China).

3.2. Sequencing, Quality Control, and Variant Detection

DNA samples were sequenced using the 5K liquid chip. The raw reads obtained by sequencing were for quality control. Reads containing adaptor sequences and those with more than 10% N base content were removed. Additionally, if the proportion of bases with a quality score below 5 in any read exceeded 50% of the total read length, the corresponding paired reads were removed. The Burrows–Wheeler Alignment software (BWA v0.7.17), with parameters MEM-T4-K32-m, was used to align the cleaned reads to the assembled high-quality reference genome utilized by our research team [18]. According to the alignment of clean reads to the reference genome, the Unified Genotyper module in GATK (4.2.6.1) toolkit was applied to detect genetic variants [19]. Then, the Variant Filtration module was used to refine the detected variants.

3.3. Principal Component Analysis and Phylogenetic Tree Construction

The GCTA software (1.94.0beta) was employed to conduct principal component analysis (PCA) on the SNPs of all samples across the five cashmere goat populations using the command gcta64 --grm cashmeregoats_grm --pca 3 --out cashmere goats_pca [20].The three largest eigenvectors generated by PCA analysis were taken as the principal axes, and the plot function in R 4.3.3 was used to draw PCA plots [21]. The genetic distances among populations, based on the filtered high-quality SNPs, were calculated, and then, the Neighbor-Joining (NJ) method was applied with TreeBest software (version 7)to construct a phylogenetic tree of the SNPs. Finally, the population phylogenetic tree was visualized on the itol website (https://itol.embl.de/).

4. Results

4.1. Determination of Specific SNP Sites in the Five Cashmere Goat Breeds

After filtering the resequencing data from 42 individuals, a total of 110,713 high-quality SNPs were obtained. Further probe design was carried out, and the probes of 72,319 SNPs were successfully designed. Taking into account factors such as the maximum ΔMAF value, panel probe capacity, the prevalence of sex chromosome sites, and poor detection rates for sites that complied with probe design principles, 5002 SNP sites were ultimately reserved for the chip synthesis.

4.2. The Distribution of SNP Sites on Chromosomes for the 5K Liquid Chip

For the 5K chip, a 1 Mb window was used to slide along chromosomes, tallying the SNPs within each segment. The distribution and number of SNP sites on each chromosome are shown in Figure 1. Excluding the sex chromosomes, chromosome 8 exhibited the highest density of SNP sites, while chromosome 28 had the sparsest distribution. It was also observed that the number of core regions on each chromosome aligned with the SNP count, suggesting that the final customized chip captured precisely one SNP site per probe. The minimum allele frequency (MAF) distribution of the SNP sites on the 5K chip is presented in Figure 2. A majority of the MAF values are concentrated between 0.2 and 0.5, with 99.4% of the sites falling within this range. Notably, only 28 SNP sites have MAF values below 0.2.

4.3. Annotation of SNPs in the 5K Liquid Chip

The gene structure distribution for all SNP sites included in the 5K chip is shown in Figure 3. The majority of these sites are found in the intergenic and intronic regions, with 3361 SNPs in the intergenic region and 1442 SNPs in the intronic region, collectively accounting for 96.0% of the total SNP sites. Within the exon region, there are 32 SNPs identified. Additionally, 86 SNPs are situated in the upstream 1 Kb region, and 65 SNPs in the downstream 1 Kb region of genes. The numbers of SNPs in the 5′ and 3′ untranslated regions (UTRs) are 4 and 16, respectively. Intriguingly, there are a total of four SNPs in the 5′ UTR of one gene and the 3′ UTR of another gene.

4.4. Validation of the 5K Liquid Chip for Identification of Five Cashmere Goat Populations

In this study, a total of 300 DNA samples were used to evaluate the performance of the 5K liquid chip. Upon evaluating the DNA samples, seven were deemed unqualified and 12 that had been used for preliminary testing were ultimately excluded. Consequently, 281 individuals were retained for identification and verification across five Inner Mongolia cashmere goat populations. The detailed sample information for validation is shown in Table 1. Among the retained samples, two exhibited a low sequencing detection rate due to library preparation issues. The calling rate for the remaining samples ranged from 99.06% to 99.98%, with an overall average detection rate of 99.77%.
The missing and mutation rates of SNPs in each sample are shown in Figure 4 and detailed in Supplementary Table S1. Notably, two individuals stood out with exceptionally high missing rates, at 96.84% and 98.56%, respectively. For the remaining samples, the missing sites varied from 1 to 47, with an average missing rate of 0.23%. The range of homozygous mutation sites was 728~2516, with an average homozygous mutation rate of 26.20%. The heterozygous mutation sites ranged from 1135 to 2549, with an average heterozygous mutation rate of 39.09%. The range of sites consistent with the reference genome was 1115–2386, with an average consistency rate of 34.47%.
The statistical analysis of missing rates among the five cashmere goat populations is presented in Table 2. It can be found that there are minimal differences in missing rates across these populations. The missing rate of Ujimqin white cashmere goats is the lowest, while those of Inner Mongolia cashmere goats (Alashan type) and Inner Mongolia cashmere goats (Arbas type) are the highest. In terms of heterozygous mutation rates, the Inner Mongolia cashmere goat (Erlangshan type) population shows the highest, contrasting with the Hanshan white cashmere goat population, which has the lowest. Conversely, the homozygous mutation rate is highest in the Hanshan white cashmere goat population, and lowest in the Inner Mongolia cashmere goats (Arbas type). The consistency rate with the reference genome is found to be the lowest in the Hanshan white cashmere goat population, and the highest in the Inner Mongolia cashmere goats (Arbas type).

4.5. Principal Component Analysis and Phylogenetic Tree Construction

This study fully utilized the 5K liquid chip sequencing data from 279 cashmere goats across five populations in Inner Mongolia Autonomous Region to conduct principal component analysis and to construct a phylogenetic tree. The results are shown in Figure 5. From this, it can be seen that the three types of Inner Mongolia cashmere goat, including Arbas type, Erlangshan type, and Alashan type, are grouped together (designated as ABC), indicating relatively close genetic distances. At present, it is difficult to distinguish these three populations using the 5K liquid chip alone, and further careful screening of loci information for each population is required to facilitate the identification of the three Inner Mongolia cashmere goats types. However, the 5K liquid chip has proven effective for accurate population identification among the following three breeds: Inner Mongolia cashmere goats, Hanshan white cashmere goats, and Ujimqin white cashmere goats.

5. Discussion

The identification of animal breeds is crucial for the protection of genetic resources and the development of new breeds. The main techniques employed for breed identification include microarray chips and targeted capture sequencing, also known as “liquid phase chips” or genotyping by target sequencing (GBTS) [22]. Niu Anran et al. (2022) used SNP chips to distinguish between Duroc, Large White pigs, and other pig breeds. They successfully identified individuals from the core population of these three breeds by using PCA and population genetic structure analysis, offering excellent genetic resources for the future development of specialized lines and composite lines [23]. Min B R et al. (1995) used random amplified polymorphic DNAs (RAPDs) to differentiate among Korean cattle beef, Holstein beef, and imported beef [24]. Min et al. (1995) identified beef breeds using random amplified polymorphic DNAs [24]. Miao et al. (2023) developed a web tool for the global identification of pig breeds [25]. With advancements in sequencing technology, cost-effective and low-density SNP chips have been used as a reliable method for genetic resource identification, aiding in the conservation efforts, and facilitating the tracing of germplasm sources.
At present, several studies have reported on the use of chips for goat breeding purpose. However, no specialized chip exists specifically for the identification of goat breeds. Tosser et al. pioneered the development of a 52K medium-density goat SNP chip utilizing sequencing data from 97 individuals across six goat breeds [26]. But this chip lacks the inclusion of sequencing data from goats in China, leading to potential inaccuracies in genomic selection, genome-wide association analysis of the economic traits, and genetic resource identification for Chinese goat populations. Qiao Xian et al. (2017) expanded on this by developing a goat 66K SNP capture chip, incorporating resequencing data from domestic breeds such as Inner Mongolia cashmere goats (Arbas type, Erlangshan type, Alashan type) and Liaoning cashmere goats [27]. China has abundant genetic resources in goats and variability among breeds. In order to improve the availability of commercial SNP chips for goats, Wang Fenghong et al. (2021) used resequencing data from 598 individuals representing 85 goat breeds globally and integrated single nucleotide polymorphisms (SNPs) that significantly affect important economic traits, adaptability, and diseases in goats. Consequently, they designed a 70K goat SNP chip [28]. Currently, it has been widely adopted in goat breeding programs.
Thus, it can be seen that multiple commercial goat chips have been released, but there is a notable absence of chips specifically designed for the protection of genetic resources in cashmere goats. This study marks the first instance of screening core SNPs with high inter-population variation frequencies based on resequencing data for population identification, significantly reducing sequencing costs and enhancing the accuracy and convenience of goat breed identification. In line with the design principles of previous chips, the primary criteria for SNP selection are the uniformity of sites and the frequency of SNP variation. Uniformity of SNP sites ensures relatively consistent signal strength across various regions of the chip, thereby minimizing errors in chip manufacturing and data analysis. The mutation frequency of SNPs can impact the detection rate of mutation sites. Therefore, different methods are adopted for SNP site selection depending on the purpose of the chip. Wang Yanyun et al. have identified carp breeds using chip data, and in the screening of SNP sites for this chip, it is crucial to ensure a sufficiently large difference in minor allele frequency (MAF) between breeds [29]. This approach is fundamentally consistent with the chip design concept of this study.
This study strictly selected SNP sites to develop a low-density chip aimed at identifying the genetic resources of cashmere goats in the Inner Mongolia Autonomous Region. The individual DNA samples from the selected cashmere goat populations were obtained from purebred individuals within their respective conservation farms. High-depth resequencing was conducted on 45 individuals from five goat populations in Inner Mongolia Autonomous Region, and the SNP data underwent strict filtering. Individuals with poor clustering performance were excluded based on filtered information and subsequent evolutionary tree analysis, ensuring the uniqueness and accuracy of SNPs for chip design. Firstly, SNP sites with higher minor allele frequency (MAF) were selected to guarantee that variations could be effectively detected in subsequent analyses. On this basis, SNPs with larger inter-breed MAF differences were chosen, and those meeting both criteria were designated as the core sites of the chip. Unrelated background sites were removed, and the total number of sites was reduced to achieve breed identification, thereby reducing the cost associated with using the chip. According to the above method, a total of 5002 core sites were finally retained for inclusion on the chip. Generally, the length of a chromosome is expected to correlate with the number of SNPs on the chromosome, with SNPs being spaced at appropriate intervals. However, in this study, the numbers of SNPs on certain chromosomes (1, 5, 9, 11, 15) were not directly proportional to the length of those chromosomes. Imposing strict uniformity requirements for alignment points might diminish the accuracy of variety identification, so this study did not impose too many restrictions on the uniformity of SNPs in this chip. This is the reason for the uneven ratio of the number of SNPs on each chromosome in the selected SNPs of the chip to the length of the chromosome.
To further evaluate the efficacy of the 5K liquid chip, genotyping was performed on 281 individuals from five cashmere goat populations in Inner Mongolia Autonomous Region, followed by a population difference analysis. The average detection rate for SNPs was found to be 99.77%, suggesting that the chip has good performance, slightly surpassing the detection rate of chips designed for pig breed identification [30]. All SNPs selected for this chip had a coverage of over 5X, with the highest exceeding 100X, making this 5K liquid chip suitable for identifying cashmere goats in Inner Mongolia Autonomous Region. To further verify the effectiveness of the chip, random samples from five cashmere goat populations were genotyped to gather comprehensive core sequencing data, which were then used for principal component analysis and phylogenetic tree construction. The findings indicated that the three Mongolia cashmere goat populations cluster closely together, suggesting that the 5K liquid chip has limitations in distinguishing among these three populations.
A more detailed consideration of the SNP site information for these three populations is necessary to rescreen loci for population identification. However, the chip can be effectively used for precise population identification of three cashmere goat breeds in Inner Mongolia Autonomous Region, including Inner Mongolia white cashmere goats, Hanshan white cashmere goat and the Ujimqin white cashmere goats. It was indicated that the SNP site selection for the 5K liquid chip was stringent and reliable. Compared to the principal components and evolutionary tree results from resequencing, the SNP sites selected for this chip were more specific and representative of the breeds, enabling accurate identification of the three cashmere goat breeds in Inner Mongolia Autonomous Region.

6. Conclusions

This study fully utilized high-depth resequencing data from five cashmere goat populations in Inner Mongolia Autonomous Region to identify polymorphic loci among populations. After a rigorous screening process, 5002 SNP sites were ultimately selected to synthesize a liquid chip for the identification of cashmere goat germplasm resources in Inner Mongolia Autonomous Region. The 5K liquid chip is capable of accurately identifying three distinct breeds: Inner Mongolia cashmere goats, Hanshan white cashmere goats, and Ujimqin white cashmere goats. However, it falls short in accurately discriminating between the three populations of Inner Mongolia cashmere goats, highlighting the need for further refinement in SNP site selection for more precise population-level identification.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ani14243589/s1, Table S1: SNP detection results with the 5K liquid chip.

Author Contributions

T.Z. was responsible for writing the initial draft, proposing research ideas, and clarifying research questions and objectives. Q.X., B.Z. and J.X. reviewed the current research progress in the field, formulated research plans, and conducted data collection. S.Z., J.L. and Q.L. applied techniques in statistics, mathematics, and computer science to extract and analyze data. Y.Z., R.W. and R.S. were responsible for data visualization, methodology, and chart drawing. Z.W. was responsible for revising and further improving the paper to ensure its clear structure. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by National Key Research and Development Program (2022YFE0113300, 2022YFD1300201, 2022YFD1300204), Project for Enhancing Young Teacher’s Research Ability (BR220112), Program for Young Talents of Science and Technology in Universities of Inner Mongolia Autonomous Region (NJYT22038), Project for 2030 Science and Technology Innovation Major (2023ZD0405102), and Program for Inner Mongolia Autonomous Region Higher Education Innovation Team Development (NMGIRT2322).

Institutional Review Board Statement

The animals used in this manuscript has been reviewed by the Laboratory Animal Welfare and Animal Experiment Ethics Inspection Committee of Inner Mongolia Agricultural University, which meets ethical principles, approval code: NO: NND20241121.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Materials, further inquiries can be directed to the corresponding authors.

Conflicts of Interest

The authors declare that they have no competing interests.

References

  1. Froghi, T.; Hosaini, A. Origin and domestication of the goat and history of the Angora goat and the mohair industry. Life Sci. J. 2012, 9, 2615–2618. [Google Scholar]
  2. Panth, B.P.; Bhattarai, N.; Baral, P.; Karki, M.; Bhattarai, A.; Sapkota, S. Factors affecting the profitability from goat farming in Gulmi, Nepal. Cogent Food Agric. 2021, 7, 1963928. [Google Scholar] [CrossRef]
  3. Horst, P. Animal breeding as a factor of influence on the conservation of natural resources and utilisation of genetic resources. Anim. Res. Dev. 1997, 46, 63–74. [Google Scholar] [CrossRef]
  4. Salako, A.E. Application of Morphological Indices in the Assessment of Type and Function in Sheep. Int. J. Morphol. 2006, 24, 13–18. [Google Scholar] [CrossRef]
  5. Ouyang, J.; Wang, L. The Progress and Application of DNA Barcoding in Genetic Resources Conservation for Local Pig. J. Agric. Biotechnol. 2013, 21, 348–354. [Google Scholar]
  6. Ashoor, S.H.; Monte, W.C.; Stiles, P.G. Liquid chromatographic identification of meats. J. Assoc. Off. Anal. Chem. 1988, 71, 397–403. [Google Scholar] [CrossRef]
  7. Dalvit, C.; De Marchi, M.; Targhetta, C.; Gervaso, M.; Cassandro, M. Genetic traceability of meat using microsatellite markers. Food Res. Int. 2008, 41, 301–307. [Google Scholar] [CrossRef]
  8. Weller, J.I.; Seroussi, E.; Ron, M. Estimation of the number of genetic markers required for individual animal identification accounting for genotyping errors. Anim. Genet. 2010, 37, 387–389. [Google Scholar] [CrossRef] [PubMed]
  9. Negrini, R.; Milanesi, E.; Colli, L.; Pellecchia, M.; Nicoloso, L.; Crepaldi, P.; Lenstra, J.A.; Ajmone-Marsan, P. Breed assignment of Italian cattle using biallelic AFLP markers. Anim. Genet. 2010, 38, 147–153. [Google Scholar] [CrossRef]
  10. Weir, B.S.; Cardon, L.R.; Anderson, A.D.; Nielsen, D.M.; Hill, W.G. Measures of human population structure show heterogeneity among genomic regions. Genome Res. 2005, 15, 1468–1476. [Google Scholar] [CrossRef]
  11. Paschou, P.; Ziv, E.; Burchard, E.G.; Choudhry, S.; Rodriguez-Cintron, W.; Mahoney, M.W.; Drineas, P. PCA-Correlated SNPs for Structure Identification in Worldwide Human Populations. PLoS Genet. 2007, 3, 1672–1686. [Google Scholar] [CrossRef] [PubMed]
  12. Li, W.; Yang, L.; Ren, Y.; Yan, X.; Wang, J. Quantification of rib COL1A2 gene expression in healthy and fluorosed Inner Mongolia cashmere goats. Fluoride 2022, 40, 13–18. [Google Scholar]
  13. Wang, R.; Wang, X.; Qi, Y.; Li, Y.; Na, Q.; Yuan, H.; Rong, Y.; Ao, X.; Guo, F.; Zhang, L.; et al. Genetic diversity analysis of Inner Mongolia cashmere goats (Erlangshan subtype) based on whole genome re-sequencing. BMC Genom. 2024, 25, 698. [Google Scholar] [CrossRef]
  14. Zhao, Q.; Huang, Y.; Chen, Q.; Su, Y.; Zhang, Y.; Wang, R.; Su, R.; Xu, H.; Liu, S.; Ma, Y.; et al. Genomic Inbreeding and Runs of Homozygosity Analysis of Cashmere Goat. Animals 2024, 14, 1246. [Google Scholar] [CrossRef]
  15. Zhang, T.; Wang, Z.; Zhang, Y. Genetic diversity and population structure in five Inner Mongolia cashmere goat populations using whole-genome genotyping. Anim. Biosci. 2024, 37, 1168–1176. [Google Scholar] [CrossRef] [PubMed]
  16. Danecek, P.; Auton, A.; Abecasis, G.; Albers, C.A.; Banks, E.; DePristo, M.A.; Handsaker, R.E.; Lunter, G.; Marth, G.T.; Sherry, S.T.; et al. The variant call format and VCFtools. Bioinformatics 2011, 27, 2156–2158. [Google Scholar] [CrossRef]
  17. Wang, K.; Li, M.; Hakonarson, H. ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010, 38, e164. [Google Scholar] [CrossRef]
  18. Shelkovnikov, A.B. Application of the software for BWA systems for analysis and design. In Proceedings of the 12th International Conference Microwave and Telecommunication Technology, Sevastopol, Ukraine, 9–13 September 2002; IEEE: Piscataway, NJ, USA, 2002. [Google Scholar]
  19. Tucci, N.; Cala, J.; Steyn, J.; Missier, P. Design and evaluation of a genomics variant analysis pipeline using GATK Spark tools. In Proceedings of the 26th Italian Symposium in Advanced Database Systems (SEBD 2018), Taranto, Italy, 24–27 June 2018. [Google Scholar]
  20. Yang, J.; Lee, S.H.; Goddard, M.E.; Visscher, P.M. Genome-Wide Complex Trait Analysis (GCTA): Methods, Data Analyses, and Interpretations. Methods Mol. Biol. 2013, 1019, 215–236. [Google Scholar]
  21. José, C.; Jesús, P.; Alberto, F. Data understanding with PCA: Structural and Variance Information plots. Chemom. Intell. Lab. Syst. 2010, 100, 48–56. [Google Scholar]
  22. Zhang, C.; Chen, X.; Cao, B.; Duan, H.; Sun, Q.; Ouyang, F. Tuning the band gap of the InSe monolayer by substitutional doping. Appl. Surf. Sci. 2022, 579, 152190. [Google Scholar] [CrossRef]
  23. Niu, A.; Zhang, X.; Yang, Y.; Yan, Z.; Gong, H.; Ding, R.; Ma, L. Identification of Variety Composition of Large White Pig, Changbai Pig, and Duroc Pig Based on SNP Chip Analysis. Chin. J. Anim. Husb. 2022, 58, 219–223. [Google Scholar]
  24. Min, B.R.; Han, J.Y.; Lee, M.H. The identification of beef breeds (Korean cattle beef, Holstein beef and Imported beef) using random amplified polymorphic DNAs. Korean J. Anim. Sci. 1995, 37, 651–660. [Google Scholar]
  25. Miao, J.; Chen, Z.; Zhang, Z.; Wang, Z.; Wang, Q.; Zhang, Z.; Pan, Y. A web tool for the global identification of pig breeds. Genet. Sel. Evol. 2023, 55, 18. [Google Scholar] [CrossRef]
  26. Tosserklopp, G.; Bardou, P.; Bouchez, O.; Cabau, C.; Crooijmans, R.; Dong, Y.; Donnadieu-Tonon, C.; Eggen, A.; Heuven, H.C.; Jamli, S.; et al. International Goat Genome Consortium Design and Characterization of a 52K SNP Chip for Goats. PLoS ONE 2014, 9, e86227. [Google Scholar]
  27. Qiao, X.; Su, R.; Wang, Y.; Wang, R.; Yang, T.; Li, X.; Chen, W.; He, S.; Jiang, Y.; Xu, Q.; et al. Genome-wide Target Enrichment-aided Chip Design: A 66K SNP Chip for Cashmere Goat. Sci. Rep. 2017, 7, 8621. [Google Scholar] [CrossRef]
  28. Wang, F. Design of SNP Chip for Goats and Genome-Wide Association Analysis and Genome Selection Study of Important Economic Traits in Inner Mongolia Cashmere Goats. Ph.D. Thesis, Inner Mongolia Agricultural University, Hohhot, China, 2021. [Google Scholar]
  29. Wang, Y.; Wang, J.; Liu, X.; Zhao, L.; Yang, S. Identification of Yellow River carp based on SNP markers. J. Aquac. 2022, 35, 7–14. [Google Scholar]
  30. Ramos, A.M.; Crooijmans, R.; Affara, N.A.; Amaral, A.G.; Archibald, A.L.; Beever, J.E.; Bendixen, C.; Churcher, C.; Clark, R.; Dehais, P.; et al. Design of a High Density SNP Genotyping Assay in the Pig Using SNPs Identified and Characterized by Next Generation Sequencing Technology. Public Libr. Sci. 2009, 4, e6524. [Google Scholar] [CrossRef]
Figure 1. The distribution of SNP sites in chromosomes.
Figure 1. The distribution of SNP sites in chromosomes.
Animals 14 03589 g001
Figure 2. Frequency distribution of MAF of sites in the 5K chip.
Figure 2. Frequency distribution of MAF of sites in the 5K chip.
Animals 14 03589 g002
Figure 3. Annotation of all the sites in the 5K chip. Note: (1) Intergenic: The variation is annotated in an intergenic region. (2) Intronic: The variation is annotated in the intron region. (3) Exonic: The variation is annotated in the exon region. (4) Downstream: The 1 Kb downstream region of the gene. (5) Upstream: The 1 Kb upstream region of the gene. (6) UTR3: untranslated area at the 3′ end. (7) UTR5: 5′ end untranslated area. (8) Upstream; downstream: both 1 Kb upstream of one gene and 1 Kb downstream of another gene.
Figure 3. Annotation of all the sites in the 5K chip. Note: (1) Intergenic: The variation is annotated in an intergenic region. (2) Intronic: The variation is annotated in the intron region. (3) Exonic: The variation is annotated in the exon region. (4) Downstream: The 1 Kb downstream region of the gene. (5) Upstream: The 1 Kb upstream region of the gene. (6) UTR3: untranslated area at the 3′ end. (7) UTR5: 5′ end untranslated area. (8) Upstream; downstream: both 1 Kb upstream of one gene and 1 Kb downstream of another gene.
Animals 14 03589 g003
Figure 4. SNP site mutation statistics.
Figure 4. SNP site mutation statistics.
Animals 14 03589 g004
Figure 5. Phylogenetic tree and principal component analysis. Note: ABC: Inner Mongolia cashmere goats (including Arbas type, Alashan type, and Erlangshan type); D: Hanshan white cashmere goats; E: Ujimqin white cashmere goats. Left: Phylogenetic tree for the three cashmere goat breeds; Right: Principal Component Analysis for the three cashmere goat breeds.
Figure 5. Phylogenetic tree and principal component analysis. Note: ABC: Inner Mongolia cashmere goats (including Arbas type, Alashan type, and Erlangshan type); D: Hanshan white cashmere goats; E: Ujimqin white cashmere goats. Left: Phylogenetic tree for the three cashmere goat breeds; Right: Principal Component Analysis for the three cashmere goat breeds.
Animals 14 03589 g005
Table 1. Sample information for cashmere goats in each population.
Table 1. Sample information for cashmere goats in each population.
NameSexNo.Stock
ARBSfemale56Inner Mongolia Yiwei White Cashmere Goat Co., Ltd. (Ordos, China)
ELSfemale57Inner Mongolia Mizhen International Trade Co., Ltd. (Bayannaoer, China)
ALSfemale54Alxa Left Banner Goat Breeding Farm
HSfemale56Chifeng Bahrain Right Banner Hanshan Goat Breeding Farm
WZMQfemale58Ujimqin White Goat Original breeding Farm
Note: ARBAS: Inner Mongolia cashmere goats (Arbas type); ELS: Inner Mongolia cashmere goats (Erlangshan type); ALS: Inner Mongolia cashmere goats (Alashan type); HS: Hanshan white cashmere goats; WZMQ: Ujimqin white cashmere goats.
Table 2. Statistics of SNP site mutation.
Table 2. Statistics of SNP site mutation.
PopulationNumber of
Missing Sites
Missing RateNumber of
Heterozygous Sites
Heterozygous
Mutation Rate
Number of
Homozygous Sites
Homozygous
Mutation Rate
Consistent Number
of Sites
Site Consistency
Rate
ARBAS120.25195439.1696819.40206841.45
ELS120.24209541.97104420.92185137.10
ALS120.25193838.8597419.52207741.63
HS110.23181536.36193738.81123924.83
WZMQ110.21202240.51159231.89137827.60
Mean120.23196539.37130326.11172334.52
Note: ARBAS: Inner Mongolia cashmere goats (Arbas type); ELS: Inner Mongolia cashmere goats (Erlangshan type); ALS: Inner Mongolia cashmere goats (Alashan type); HS: Hanshan white cashmere goats; WZMQ: Ujimqin white cashmere goats.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, T.; Xu, Q.; Zhou, B.; Xiao, J.; Zheng, S.; Li, J.; Lv, Q.; Zhang, Y.; Wang, R.; Su, R.; et al. Development and Validation of a 5K Liquid Chip for Identifying Cashmere Goat Populations in Inner Mongolia Autonomous Region. Animals 2024, 14, 3589. https://doi.org/10.3390/ani14243589

AMA Style

Zhang T, Xu Q, Zhou B, Xiao J, Zheng S, Li J, Lv Q, Zhang Y, Wang R, Su R, et al. Development and Validation of a 5K Liquid Chip for Identifying Cashmere Goat Populations in Inner Mongolia Autonomous Region. Animals. 2024; 14(24):3589. https://doi.org/10.3390/ani14243589

Chicago/Turabian Style

Zhang, Tao, Qi Xu, Bohan Zhou, Junjie Xiao, Shudan Zheng, Jinquan Li, Qi Lv, Yanjun Zhang, Ruijun Wang, Rui Su, and et al. 2024. "Development and Validation of a 5K Liquid Chip for Identifying Cashmere Goat Populations in Inner Mongolia Autonomous Region" Animals 14, no. 24: 3589. https://doi.org/10.3390/ani14243589

APA Style

Zhang, T., Xu, Q., Zhou, B., Xiao, J., Zheng, S., Li, J., Lv, Q., Zhang, Y., Wang, R., Su, R., & Wang, Z. (2024). Development and Validation of a 5K Liquid Chip for Identifying Cashmere Goat Populations in Inner Mongolia Autonomous Region. Animals, 14(24), 3589. https://doi.org/10.3390/ani14243589

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop