Next Article in Journal
Role of MicroRNAs in Acute Myeloid Leukemia
Previous Article in Journal
Development of a BiAD Sensor for Locus-Specific Detection of Cellular Histone Acetylation Dynamics by Fluorescence Microscopy
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Whole-Genome Insights into the Genetic Basis of Conformation Traits in German Black Pied (DSN) Cattle

1
Albrecht Daniel Thaer-Institute for Agricultural and Horticultural Sciences, Animal Breeding and Molecular Genetics, Humboldt-Universität zu Berlin, Invalidenstr. 42, 10115 Berlin, Germany
2
Leibniz Institute for Zoo and Wildlife Research, Alfred-Kowalke-Straße 17, 10315 Berlin, Germany
*
Author to whom correspondence should be addressed.
Genes 2025, 16(4), 445; https://doi.org/10.3390/genes16040445
Submission received: 28 February 2025 / Revised: 3 April 2025 / Accepted: 6 April 2025 / Published: 10 April 2025
(This article belongs to the Section Animal Genetics and Genomics)

Abstract

:
Background: The German Black Pied Dairy (DSN) cattle is an endangered dual-purpose breed considered an ancestor of the modern Holstein population. DSN is known for its high milk yield, favorable milk composition, and good meat quality. Maintaining a functional body structure is essential for ensuring sustained performance across multiple lactations in dual-purpose breeds like DSN. This study aims to identify candidate genes and genetic regions associated with conformation traits in DSN cattle through genome-wide association studies (GWAS). Methods: The analysis utilized imputed whole-genome sequencing data of 1852 DSN cows with conformation data for 19 linear traits and four composite scores derived from these traits. GWAS was performed using linear mixed models. Results: In total, we identified 118 sequence variants distributed across 24 quantitative trait locus (QTL) regions comprising 74 positional candidate genes. Among the most significant findings were variants associated with “Rump width” on chromosome 21 and “Teat length” on chromosome 22, with AGBL1 and SRGAP3 identified as the most likely candidate genes. Additionally, a QTL region on chromosome 15 linked to “Central ligament” contained 39 olfactory receptor genes, and a QTL region on chromosome 23 associated with “Hock quality” included eight immune-related genes, notably, BOLA and TRIM family members. Conclusions: Selective breeding for favorable alleles of the investigated conformation traits may contribute to DSN’s longevity, robustness, and overall resilience. Hence, continuous focus on healthy udders, feet, and legs in herd management contributes to preserving DSN’s positive traits while improving conformation.

1. Introduction

German Black Pied Cattle (DSN, German: Deutsches Schwarzbuntes Niederungsrind) originated from the coastal grasslands along the North Sea in Denmark, northern Germany, and the Netherlands. Today, DSN represents an endangered dual-purpose cattle breed with a small population size (2514 cows and 27 bulls as of 2021) [1]. Despite a relatively limited effective population size (Ne = 106.9), DSN exhibits substantial genetic milk yield potential, ranging from 7000 to over 8000 kg at 4.3% fat and 3.7% protein, and maintains desirable conformation traits, including a harmonious body structure, sturdy hooves, and a generally horned and black-and-white coat pattern [2,3].
Previous genome-wide association studies (GWAS) have investigated conformation traits in various cattle populations, including Chinese Holstein, Fleckvieh, Swiss Brown, and mixed Nordic dairy breeds, uncovering multiple QTL regions and candidate genes related to dairy character, udder conformation, feet and legs, and overall body structure [4,5,6,7]. These studies have identified genes implicated in cellular structure, metabolic pathways, immune responses, mammary gland development, and musculoskeletal integrity. Some candidate genes, such as those associated with dairy character or udder traits, appear to overlap across multiple breeds, while others exhibit breed-specific associations. These patterns reflect the genetic complexity underlying conformation traits and highlight the importance of breed-specific analyses, larger sample sizes, and functional validation to improve health, productivity, and sustainability in diverse cattle populations.
Within the DSN breed, previous GWAS have targeted traits such as milk production, mastitis, fertility, and resilience against endoparasites [8,9,10,11,12]. For instance, the favorable variant of the K232A polymorphism in DGAT1 (diacylglycerol O-acyltransferase 1) is already fixed in DSN, and therefore, selection for the favorable DGAT1 variant cannot further enhance milk yield. Furthermore, the MGST1 (microsomal glutathione-S-transferase 1) gene, as well as genes of the casein gene cluster, both previously identified in Holstein and other breeds, appear relevant for breed improvement in DSN. These studies demonstrate the value of GWAS in uncovering genetic factors that may drive productivity and adaptability, even in endangered and relatively small cattle populations.
The primary goal of this study is to identify genomic regions and candidate genes associated with conformation traits in DSN cattle using whole-genome sequencing (WGS) data. In Germany, DSN cattle are routinely evaluated by trained personnel for 19 linear conformation traits during their first lactation. Understanding the genomic basis of these traits and the composite scores derived from these traits will enable more targeted and effective selection strategies to improve the body conformation and, thereby, the productivity and health of the animals. Ultimately, such insights can provide actionable genetic markers to guide breeding decisions to enhance the sustainability, health, and overall performance of DSN cattle and ensure their continued competitiveness compared to larger commercial dairy populations.

2. Materials and Methods

2.1. Population and Phenotypes

Conformation data of 1852 DSN cows with available genotypes, born between 2005 and 2018, were obtained from the breeding company RBB Rinderproduktion Berlin-Brandenburg GmbH (Groß Kreutz, Germany) as of April 2020. Only farms with at least 20 DSN cows were considered to minimize farm-related environmental influences in the GWAS. The conformation data consisted of 19 linear (LIN) traits contributing to the four composite (COM) scores “Dairy strength”, “Body conformation”, “Legs&Feet”, and “Mammary”, as listed in Table 1. The linear traits describe the external characteristic of each cow assessed once during the first lactation using a classification scheme by a trained member of the German Federal Association of Swine and Cattle (German: Bundesverband Schwein und Rind). All linear traits were measured on a scale from 1 to 9, except for “Stature” (LINSTATURE) which was measured in centimeters (cm). The composite scores, which were derived from the linear traits, are based on a 100-point system, with values ranging from 65 to 88 (for cows in their first lactation, the maximum attainable score is 88). To determine potential relationships among the four composite scores and the individual linear traits, pairwise correlations were calculated using the cor.test() function in R.

2.2. Genotypes

Genotypes of the 1852 DSN cows were available from Illumina BovineSNP50 (Illumina Inc., San Diego, CA, USA), EuroG MD (EuroGenomics, The Hague, The Netherlands), custom DSN200K SNP chips (Thermo Fisher Scientific, Waltham, MA, USA), and whole-genome sequencing [10,13]. Genotypes were imputed to WGS data density using Beagle v5.1 [14]. Genotypes from the DSN200K chip were directly imputed to WGS data positions using a reference panel of 304 sequenced DSN cattle. Genotypes from Illumina BovineSNP50 and EuroGenomics EuroG MD chips were initially imputed to match the DSN200K chip positions using a reference panel of 1580 DSN cattle genotyped with the DSN200K chip, and subsequently imputed to WGS level. Variants with a minor allele frequency (MAF) of < 5% and a variant call rate of < 90% were removed with VCFtools v0.1.17 [15], resulting in 12,024,918 variants suitable for GWAS. The genomic positions of variants provided in this study are based on the Bos taurus genome version ARS-UCD1.2, serving as the reference genome for accurate alignment and annotation [16].

2.3. GWAS Model

For the continuously scaled values of the composite scores (COMDAIRY STRENGTH, COMBODY CONFORMATION, COMLEGS&FEET, COMMAMMARY), as well as for the linear trait LINSTATURE, GWAS were performed using mixed linear regression models using the basic lm() function in R. For the other 18 ordinally scaled linear traits, ordinal regression models using either the clm() function from the R package ordinal v2023.12-4.1 [17] or the polr() function from the R package MASS v7.3-58.2 [18] were used. While clm() was computationally faster, polr() performed better with small fixed-effect groups.
For each trait, the optimal model was determined using the Akaike Information Criterion (AIC) to evaluate whether the inclusion of additional fixed effects improved model fit. Population stratification (ps) was included as a mandatory fixed effect due to its known influence on GWAS results. Additional fixed effects were added one at a time and retained if their inclusion decreased AIC by at least 10 units (ΔAIC ≤ −10). Table S1 provides an overview of the AIC changes and the final model specifications for each trait. Depending on the trait, the final models included combinations of the following fixed effects: farm (fa), birth year (by), birth season (bs), calving year (cy), calving season (cs), age at first calving (fc), and day in lactation (ld). Genotype (gt) and residual error (e) were included in all models. The full model is expressed as follows:
Y = p s + f a + b y + b s + c y + c s + f c + l d + g t + e
Population stratification was evaluated using a concordance test in PLINK v1.9 [19] using a p-value of 0.0000001 identifying 46 family clusters. Birth years ranged from 2005 to 2018 and calving years between 2007 and 2020. To account for low sample sizes in certain categories, grouping was applied: birth years ≤ 2009 and ≥2017, and calving years ≤ 2011 and ≥2019. Days in lactation were categorized into five groups (30, 80, 120, 160, and ≥200). Genotype groups with fewer than 30 animals were excluded from GWAS to ensure stable effect estimation of statistical tests.

2.4. Significant Threshold and QTL Definition

p-values from the GWAS were adjusted for an inflation factor λ = 1.2 on the autosomal chromosomes and chromosome X, separately. Before correction, the λ values for the autosomal chromosomes ranged from 1.04 for LINFOOT_ANGLE to 2.43 for LINREAR_TEAT, and for chromosome X from 1.08 for LINFOOT_ANGLE to 3.71 for LINREAR_TEAT (Table S2). To control the risk of false-positive associations arising from multiple testing, we adjusted the p-values using the Bonferroni correction method. The number of independent tests was estimated through linkage disequilibrium (LD)-based pruning in PLINK v1.9, applying an r2 threshold of 0.6, a window size of 2 Mb, and a step size of 100, resulting in 569,464 independent variants. The significance thresholds were calculated by dividing the chosen significance levels (highly significant: 0.01, significant: 0.05, and suggestive: 0.1) by the number of independent variants corresponding to the thresholds −log10(p) of 7.76, 7.06, and 6.76, respectively. The p-values between genotype groups in the variant effect plots were calculated using the Wilcoxon test. Regions with variants above the suggestive threshold within windows of 2.5 Mb were defined as QTL regions. Distances up- and downstream of the top variant were extended to 150 kb, respectively, if they were smaller than 150 kb.

2.5. QTL Annotation

The identified QTLs were further investigated for candidate genes and variants. Protein-coding genes were obtained from Ensembl release version 110 [20]. Candidate genes in each respective QTL region were analyzed for gene ontology (GO) term enrichment using g:profiler version e111_eg58_p18_30541362 [21]. The g:SCS thresholds with p < 0.05 for significant GO term enrichments were used. Variants within QTL regions were assessed for their impact consequence on gene transcripts using Ensembl Variant Effect Predictor (VEP) [22]. For the VEP investigation, rare variants with an MAF between 1–5% within the identified QTL regions were also considered. Due to the extensive amount of the VEP output, only variants exhibiting moderate or high impact on gene transcripts were retained. Further, the identified QTL regions were examined for overlap with previously published associations and QTLs with the trait categories “Production” or “Exterior” obtained from cattleQTLdb release 54 [23]. Using the R package easyPubMed version 2.13 [24], the corresponding PubMed IDs of publications were obtained. Analyses were conducted using statistical software R version 4.3.1 [25]. Additional R packages used for data preparation, analyses, and visualization were reshape2 v1.4.4, ggpubr v0.6.0, and ggplot2 v3.5.1 [26,27,28].

3. Results

3.1. Correlation Analysis

The values of the four composite scores COMDAIRY STRENGTH, COMBODY CONFORMATION, COMLEGS&FEET, and COMMAMMARY were investigated for correlations with each other. The correlation between COMBODY CONFORMATION and COMDAIRY STRENGTH was moderately positive reaching r = 0.43. All other pairwise correlations were low and positive, ranging between r = 0.12 between COMLEGS&FEET and COMBODY CONFORMATION and r = 0.22 for COMMAMMARY and COMDAIRY STRENGTH.
The pairwise correlation analysis between composite scores and linear traits revealed that, as expected, almost all linear traits belong to their respective composite score reaching moderate (r = 0.30) to high correlations (r = 0.83) (Table 2). For COMDAIRY STRENGTH, a moderate correlation was observed with LINDAIRY_CHARACTER (r = 0.54). Interestingly, a weak positive and negative correlation of COMDAIRY STRENGTH was observed with the traits LINSTATURE (r = 0.34) and LINBCS (r = −0.27), respectively, which are actually linear traits belonging to COMBODY CONFORMATION. For COMBODY CONFORMATION, moderate to high positive correlations with five out of six linear traits belonging to this composite score were observed, except for LINRUMP_ANGLE, which had no correlation (r = −0.06). For COMLEGS&FEET, moderate to high positive correlations were observed with four out of five linear traits belonging to this composite score, but negative correlation with LINREAR_LEG_SIDE (r = −0.30). Additionally, a weak positive correlation of COMLEGS&FEET with LINREAR_UDDER was observed (r = 0.25), a linear trait that actually belongs to COMMAMMARY. Lastly, COMMAMMARY showed moderate to high correlation with six out of seven linear traits belonging to this composite score. LINTEAT_LENGTH was not correlated with COMMAMMARY (r = −0.01), although it contributed to this composite score.

3.2. Overview of GWAS Results for All Composite Scores and Linear Traits

After the correction of p-values for inflation factor λ and Bonferroni, 118 sequence variants above the suggestive threshold (p < 0.1) were identified across 19 chromosomes for 14 conformation traits (Table S3). Among these, six variants were highly significant (p < 0.01) for two traits (LINRUMP_WIDTH and LINTEAT_LENGTH), and 25 variants were significant (p < 0.05) for twelve traits or composite scores (COMDAIRY STRENGTH, LINDAIRY_CHARACTER, COMBODY CONFORMATION, LINBCS, LINRUMP_ANGLE, LINRUMP_WIDTH, COMLEGS&FEET, LINCENTRAL_LIGAMENT, LINFORE_UDDER, LINLOCOMOTION, LINREAR_LEG_SIDE, and LINREAR_UDDER). The 118 variants could be grouped into 24 QTLs, each represented by its top variant (Table 3). In the identified QTLs, 74 positional candidate genes (Table 4) and 451 unique variants with moderate to high impacts on gene transcripts were identified (Table S4). Below, we provide detailed descriptions of exemplary QTL regions that were highly significant (p < 0.01) or had additional insights available, including information on moderate- and high-impact variants, interesting candidate genes, significant GO term enrichment (Table S5), and/or overlap with previously reported associations from cattleQTLdb (Table S6).

3.3. DAIRY STRENGTH—Association for “Dairy Character”

For LINDAIRY_CHARACTER, which contributes to the composite score COMDAIRY STRENGTH, a significant locus on chromosome 20 was identified with the top variant rs381893684 located at 21,627,470 bp (−log10(p) = 7.23, Table 3, Figure 1). In the DSN population, the allele G and the allele A occurred both with a frequency of 50%. The β-effect of 0.43 indicated that DSN cows with the homozygous genotype AA scored 0.86 higher with respect to LINDAIRY_CHARACTER compared to DSN cows with the homozygous genotype GG. The corresponding QTL ranged from 21,477,400 to 21,777,400 bp and did not contain any positional candidate gene (Table 4). Around 250 kb downstream of the top variant rs381893684 is a second non-significant peak at 21,877,877 bp. The two top variants of the adjacent peaks were in low linkage (r2 = 0.35). Between those two peaks, the gene ACTBL2 (Actin β like 2) was located.

3.4. BODY CONFORMATION—Associations for “Rump Width” and “Rump Angle”

For LINRUMP_WIDTH, which belongs to the composite score COMBODY CONFORMATION, highly significant variants were identified on chromosome 21. The top variant rs210390204 was located at 17,993,077 bp (−log10(p) = 8.27, Table 3, Figure 2). The minor allele T of this variant occurred with a frequency of 19% in the examined DSN population. DSN cows with the homozygous genotype TT scored −1.16 lower in the linear description for LINRUMP_WIDTH compared to cows with the homozygous genotype CC (βMA = −0.58). In the respective QTL region ranging from 17,843,077 to 18,143,077 bp, no candidate genes were identified. The nearest gene AGBL1 (AGBL carboxypeptidase 1) was located approximately 261 kb upstream of the QTL region.
For LINRUMP_ANGLE, which is another trait contributing to COMBODY CONFORMATION, significance was reached on chromosome 26. The top variant rs110956215 was located at 21,470,978 bp (−log10(p) = 7.24), and its minor allele A had a frequency of 21% and reduced LINRUMP_ANGLE by 0.55 points (Table 3, Figure 3). In the corresponding QTL region, ranging from 21,320,978 to 21,620,978 bp, the five candidate genes SEC31B (SEC31 homolog B, COPII coat complex component), NDUFB8 (NADH:ubiquinone oxidoreductase subunit B8), HIF1AN (Hypoxia inducible factor 1 subunit α inhibitor), WNT8B (Wnt family member 8B), and PAX2 (Paired box 2) were located (Table 4). In SEC31B, four tolerated missense variants (26:21381141; 26:21,392,002; 26:21,393,52; 26:21,395,038) and one deleterious missense variant (26:21,388,919) were located. The deleterious variant was found in exon 19 out of 26 exons and changes the amino acid from valine to methionine at the protein position 803 out of 1122 amino acids not affecting any protein domain (Table S4). In HIF1AN, a low confidence tolerated missense variant (26:21,422,781) was found changing alanine to proline at protein position 12 out of 349 amino acids located in a disordered region (www.uniprot.org, accessed on 5 February 2025). In PAX2, an 8 bp indel (26:21,591,127–21,591,134) causes a frameshift at positions 24–26 of the 489 amino acid sequence, resulting in a high-impact alteration on the respective gene transcript and encoded protein. Furthermore, GO term enrichment revealed the significant GO term “[protein]-asparagine 3-dioxygenase activity” (GO:0036140) involving HIF1AN (Table S5).

3.5. LEGS&FEET—Associations for “Rear Leg Side” and “Hock Quality”

With regard to the linear traits contributing to the composite score COMLEGS&FEET, the two most interesting loci were found on chromosomes 20 and 23. On chromosome 20, the region was associated with LINREAR_LEG_SIDE with the top variant (no rs-ID) located at 29,327,008 bp (−log10(p) = 7.29) (Table 3, Figure 4). The minor allele G of the top variant occurred with a frequency of 38% and increased LINREAR_LEG_SIDE by 0.44. The QTL region spanned the genomic coordinates from 29,177,008 to 29,477,008 bp encompassing the whole gene HCN1 (hyperpolarization-activated cyclic nucleotide-gated potassium channel 1) with the top variant in the middle of the gene (Table 4). No sequence variants of moderate or high impact were found in this QTL region, but significant GO term enrichments. Involving HCN1, the GO terms “HCN channel complex” (GO:0098855) and “intracellular cAMP-activated cation channel activity involved in the regulation of presynaptic membrane potential” (GO:0140232), and the Reactome pathway “HCN channels” (REAC:R-BTA-1296061) were found significant (Table S5).
The above-identified locus for LINDAIRY_CHARACTER (on chromosome 20 at 21.8 Mb) is around 7.5 Mb upstream of the locus for LINREAR_LEG_SIDE (on chromosome 20 at 29.3 Mb). Despite the proximity, the top variants of those two QTL regions were not in linkage (r2 = 0.07).
On chromosome 23, a QTL for LINHOCK_QUALITY with the top variant rs800639948 located at 28,684,391 bp was identified (−log10(p) = 6.90, Table 3, Figure 5). The minor allele of the indel GA was observed with a frequency of 15% in the examined DSN population and reduced LINHOCK_QUALITY by 0.52, indicating a decrease of −1.04 in the score for LINHOCK_QUALITY for DSN cows with the homozygous genotype GA/GA. The corresponding QTL spanning the genomic region from 28,534,391 to 28,834,391 bp contained eight candidate genes: BOLA (MHC class I heavy chain), TRIM10 (tripartite motif containing 10), TRIM15 (tripartite motif containing 15), BOLA-NC1 (non-classical MHC class I antigen), JSP.1 (MHC Class I JSP.1), TRIM26 (tripartite motif containing 26), TRIM40 (tripartite motif containing 40), and ENSBTAG00000037421, which is a novel gene coding for a TRIM26-like protein (Table 4). The top variant was located within intron 1 of ENSBTAG00000037421, which contains a total of four introns. This QTL region shows also high variability containing 150 variants with moderate impact (107 tolerated and 43 deleterious missense variants) and 21 with high impact on gene transcripts referring to frameshift, splice acceptor or donor, start lost, stop gained, and stop lost variants (Table S4). GO enrichment analysis of the candidate genes within this QTL revealed significant associations with the biological process of “immune response” (GO:0006955) attributed to the genes BOLA, TRIM10, TRIM15, JSP.1, TRIM26, and TRIM40 (Table S5). In particular, the genes BOLA and JSP.1 are included, e.g., in the GO terms “antigen processing and presentation of endogenous peptide antigen via MHC class I via ER pathway” (GO:0002484) and “positive regulation of T cell mediated cytotoxicity” (GO:0001916), and the TRIM genes, e.g., in “negative regulation of viral life cycle” (GO:1903901) and “ubiquitin protein ligase activity” (GO:0061630).

3.6. MAMMARY—Associations for “Central Ligament” and “Teat Length”

The QTL with the highest gene density was found for LINCENTRAL_LIGAMENT, a trait belonging to the composite score COMMAMMARY, on chromosome 15 (Table 3, Figure 6). This QTL ranged from 49,373,191 to 50,164,862 bp and was represented by the intergenic top variant at 50,014,862 bp (−log10(p) = 7.21). The minor allele G of this variant occurred with a frequency of 38% and increased LINCENTRAL_LIGAMENT by 0.38. The 34 genes in this QTL region belonged either to the OR51 (olfactory receptor family 51) or OR52 (olfactory receptor family 52) gene family or were novel (Table 4). The 34 genes contained altogether 12 variants with high and 186 variants with moderate impact on gene transcripts (Table S4). The high-impact variants were frameshift and stop-gained mutations located mainly within the OR51 and OR52 genes. For example, in OR51AG1, insertions at positions 49,538,343 bp and 49,538,344 bp resulted in a frameshift mutation at position 111 out of 315 amino acids of the protein. For OR51L15, another frameshift variant was located at positions 49,597,808 bp and 49,597,809 bp, resulting in a deletion at positions 133–134 out of 319 amino acids of the protein. GO term enrichment analysis revealed significant assignment to e.g., “detection of chemical stimulus involved in sensory perception of smell” (GO:0050911) and “sensory perception of smell” (GO:0007608) (Table S5).
The most significant association for a trait belonging to COMMAMMARY was identified on chromosome 22 for LINTEAT_LENGTH, with the top variant rs378674848 located at 17,411,573 bp (−log10(p) = 7.91, Table 3, Figure 7). The minor allele T of this variant occurred with a frequency of 10% in the investigated DSN population. DSN cows with the heterozygous genotype CT had a reduced LINTEAT_LENGTH by 0.78 in comparison to cows with the homozygous CC genotype. Within the QTL ranging from 17,261,573 to 17,561,573 bp, the top variant was located in the intron of the candidate gene SRGAP3 (SLIT-ROBO Rho GTPase activating protein 3) (Table 4).

4. Discussion

In this study, we explored 19 linear conformation traits and four composite scores derived from these traits in 1852 DSN cattle using GWAS with approximately 12 million sequence variants. Despite limitations imposed by the relatively small DSN population, we identified 118 associated sequence variants mapped to 24 QTLs. These findings highlight the complexity of the genetic architecture underlying conformation traits and underscore the importance of expanding sample sizes in future research.
Our correlation analyses revealed generally low positive pairwise correlations among the four composite scores, supporting their utility as distinct metrics for assessing various aspects of conformation. The moderate correlation between COMBODY CONFORMATION and COMDAIRY STRENGTH suggests a partially shared genetic and/or phenotypic basis, without replacing each other. Examining the linear traits within their respective composite scores showed that most linear traits were appropriately assigned. Exceptions, such as LINRUMP_ANGLE and LINTEAT_LENGTH, which had no correlation to their own or any of the other composite scores, may represent unique trait dimensions valuable for specialized selection strategies. Furthermore, negative correlations between LINREAR_LEG_SIDE and COMLEGS&FEET as well as between LINBCS and COMDAIRY STRENGTH underscore potential trade-offs. The latter correlation aligns with earlier findings of an unfavorable genetic correlation between body condition score and milk yield, emphasizing the need to balance productivity with health and longevity in breeding programs [29]. Similarly, Khmelnychyii et al. (2023) found a negative correlation between LINBCS and COMDAIRY STRENGTH of r = −0.47 in the Ukrainian Brown dairy breed [30]. This suggests that animals selected for high milk production (COMDAIRY STRENGTH) might have lower body condition scores (LINBCS), emphasizing the need to ensure that high milk production does not adversely affect body condition to maintain health and longevity or vice versa. Further, Khmelnychyii et al. (2023) reported a positive correlation of r = 0.33 between “Rear Leg Set” (a combination of “Rear Leg Side View” and “Rear Leg Rear View”) and COMLEGS&FEET [30], while, in that study, the correlation of COMLEGS&FEET with LINREAR_LEG_SIDE was negative (r = −0.30), but with LINREAR_LEG_REAR positive (r = 0.61). This divergence suggests that these two linear traits contribute differently to the composite score COMLEGS&FEET in those breeds, yet the overall results may still align with the reported finding. Overall, correlation coefficients between these traits are valuable for validating the alignment of linear traits with composite scores and assessing whether adjustments or a new classification system may be needed.
The limited population size of DSN cattle, together with complex family structures, contributed to the elevated p-value inflation observed across nearly all conformation traits. In contrast to previously investigated health, fertility, and milk production traits in DSN [8,10,11], the inflation factors reported in this study were notably higher. This suggests additional underlying factors beyond population size and structure alone. For example, the composite scores, which integrate multiple linear traits, may inherently increase variability and thus inflate test statistics. Furthermore, the subjective nature of trait assessment (where trained classifiers apply standardized, yet potentially interpretable criteria) could introduce discrepancies. Such population-specific and methodological nuances highlight the need for careful adjustment and stringent quality control measures. In this study, p-values were adjusted for inflation, and population stratification was considered as a fixed effect in the statistical model, ensuring more reliable and robust GWAS results.
Several genomic regions identified in our study showed overlaps with previously reported QTLs for related traits in other cattle populations (Table S6). Notably, regions affecting “Stature” (LINSTATURE) on chromosomes 2 and 7 have been previously identified in Holstein, Montbéliarde, Simmental, and multiple other breeds [31,32]. For instance, the locus associated with “Rump width” (LINRUMP_WIDTH) on chromosome 21 overlaps with genomic regions previously linked to economically relevant traits such as “Net merit” and “Length of productive life” in Holstein cattle [33]. The QTL we identified for the trait “Rump angle” (LINRUMP_ANGLE) on chromosome 26 aligns with a previously described locus associated with “Residual feed intake”, suggesting potential biological interactions between structural conformation and feed efficiency [34]. The locus on chromosome 20 linked to “Rear leg side view” (LINREAR_LEG_SIDE) overlaps with regions previously associated with “Body weight” and growth traits across multiple beef breeds, including Charolais, Hereford, and Simmental [35,36]. Furthermore, the locus influencing “Hock quality” (LINHOCK_QUALITY) on chromosome 23 coincides with a known QTL for “Feet and leg conformation” in Holstein cattle, supporting its functional relevance for structural integrity [37]. These overlaps support the validity of our findings and reinforce the potential value of these genomic regions for targeted breeding strategies in DSN cattle.
Our identification of QTL regions harboring candidate genes related to cellular structure, immune responses, and developmental pathways provides insight into the genetic basis of conformation traits, reinforcing and extending earlier QTL findings.
For the trait LINDAIRY CHARACTER, which contributes to the composite score COMDAIRY STRENGTH, a QTL region identified on chromosome 20 at 21.6 Mb did not contain any annotated genes. However, the closest gene, ACTBL2, encodes a highly conserved protein involved in cell motility and is expressed in all eukaryotic cells (www.uniprot.org, accessed on 5 February 2025). Although not directly linked to dairy traits, its fundamental role in cellular processes may indirectly influence overall health and performance.
Regarding the traits contributing to the composite score COMBODY CONFORMATION, QTLs were described for LINRUMP_WIDTH on chromosome 21 and LINRUMP_ANGLE on chromosome 26. On chromosome 21, no gene was within the QTL boundaries, but the nearest gene, AGBL1, encodes an enzyme involved in protein modification and function. It has been associated with longevity, productive lifespan, and disease susceptibility in cattle [38]. This region was also previously linked to traits such as “Teat length”, “Length of productive life”, and fertility traits like “Calving ease” and “Pregnancy rate” in Holstein cattle [33]. Such relationships support the idea that wider rumps may facilitate easier calving, thus linking conformation to fertility and longevity. On chromosome 26, five candidate genes were identified, including PAX2, WNT8B, and HIF1AN, all of which are involved in key developmental and physiological processes (www.proteinatlas.org, accessed on 5 February 2025). PAX2, a transcription factor, plays a critical role in kidney cell differentiation and the development of the urogenital tract, eyes, and central nervous system. WNT8B is involved in the development and differentiation of specific forebrain structures, including the hippocampal region. HIF1AN contributes indirectly to developmental processes by regulating cellular responses to oxygen availability, emphasizing its role in tissue adaptation and survival under hypoxic conditions. Additionally, this QTL has been associated with “Residual feed intake” in Holstein cattle [34], underscoring the potential pleiotropic influence of these genes on growth, feed efficiency, and structural traits.
Regarding the traits contributing to the composite score COMLEGS&FEET, QTLs were described for LINREAR_LEG_SIDE on chromosome 20 and LINHOCK_QUALITY on chromosome 23. The QTL on chromosome 20 at 29.3 Mb includes HCN1, an ion channel gene important in heart and nerve cells, which has been previously implicated in breast cancer [39]. This region has also been linked to “Metabolic body weight”, “Average daily gain”, and “Body weight” [35,36]. These associations suggest that cattle with favorable alleles in this region may exhibit enhanced feed efficiency and weight gain, supported by better overall health, including healthy legs. On chromosome 23, the QTL encompassing eight immune-related candidate genes was identified. The genes include BOLA and TRIM family members, which are integral to adaptive and innate immune responses in cattle [40,41,42]. Additionally, this region has been associated with “Feet and leg conformation” in Holstein cattle [37]. The immune function genes could contribute to better feet and leg health, establishing a connection to the LINHOCK_QUALITY trait and emphasizing the importance of immune function in maintaining structural soundness and longevity.
The following QTLs contributing to the composite score COMMAMMARY were described: one affecting LINCENTRAL_LIGAMENT on chromosome 15 and another influencing LINTEAT_LENGTH on chromosome 22. The QTL on chromosome 15 contained 39 olfactory receptor genes, which are involved in olfactory and G protein-coupled receptor activities, transmembrane signaling, and sensory perception. Their expression in various tissues and organs, such as the olfactory epithelium of the nasal cavity [43], liver [44], lung [45], and pancreas [46], suggests a broader physiological role that could impact mammary function and growth. Moreover, this QTL region has been linked to “Body weight gain” in diverse breeds [35], potentially indicating pleiotropic effects on both growth and mammary development. On chromosome 22, the only candidate gene was SRGAP3 encoding a GTPase-activating protein. No moderate- or high-impact variants or GO terms were assigned to this QTL, leaving its functional relevance open for future investigation.
The availability of whole-genome sequencing data of all individuals, which were either sequenced directly or imputed, made the search for causal sequence variants feasible. Some of the top variants were located directly within the coding sequences for proteins. Many of the top variants were located in deserts without genes. Those sequence variants could be a valuable hint on regulatory functions of the causal mutation rather than a distortion of the function of encoded proteins itself. Providing evidence for regulatory effects on gene functions requires additional more comprehensive analysis methods on the molecular level of gene regulation. Evidence for strong effects on gene function through regulatory variants has been shown repeatedly in diverse species [47]. Beyond the identified candidate genes, non-coding genes, regulatory mutations, and structural variants likely play a role in conformation traits but were not fully captured in this study. Non-coding RNAs and microRNAs regulate gene expression and could impact phenotypes without altering coding sequences [48]. Regulatory mutations in enhancers and promoters may influence gene activity, contributing to trait variability. Structural variants can disrupt genes or modify regulatory landscapes, affecting conformation. These elements require further investigation using long-read sequencing and functional genomics to fully understand the impact of genetic variation underlying the examined phenotypes.
The identification of these candidate genes across multiple chromosomes emphasizes the complex genetic architecture underlying conformation traits in cattle. Further research is needed to clarify the specific functions of these genes, particularly with respect to udder conformation and functionality. Such insights could not only advance our understanding of the genetic mechanisms at play but also enhance genetic selection strategies aimed at improving both dairy and dual-purpose cattle breeds. Increasing the sample size in future studies will be crucial for validating these findings and refining their applications in breeding programs.

5. Conclusions

The genetic architecture of conformation traits in DSN cattle is complex, involving multiple QTLs, each potentially influencing a range of structural, physiological, and developmental pathways. Correlation analyses between composite scores and linear conformation traits confirmed that most linear traits were appropriately assigned to their composite scores. However, certain linear traits showed weak or no correlation to their composite scores, suggesting unique trait dimensions that could inform more targeted selection strategies. The candidate genes identified within the QTL regions highlight a wide array of biological pathways influencing conformation traits, including those associated with cellular structure, immune function, and development. These insights open up promising avenues for future functional studies aimed at improving conformation traits in DSN cattle. In practical breeding, selecting animals that exhibit favorable linear traits, particularly “Rear leg side view” and “Hock quality”, as well as “Central ligament” and “Teat length”, could enhance the overall robustness and competitiveness of the DSN breed. However, attention must be given to the negative correlation between traits, ensuring that improvements in one trait do not come at the expense of another. By combining selective breeding with sound management practices focused on maintaining healthy udders and legs, breeders can preserve the positive attributes of the DSN while striving for advancements in conformation and productivity. Further research, including expanded sample sizes, functional validation of candidate genes, and the identification of causal mechanisms leading to functional changes of gene effects, is essential for refining genetic selection strategies.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/genes16040445/s1, Table S1: Selection of fixed effects for GWAS models using Akaike information criterion (AIC); Table S2: Inflation factors λ from GWAS results for the investigated conformation traits; Table S3: Full list of associated variants (p < 0.1) from GWAS with conformation traits in DSN cows; Table S4: List of sequence variant impacts in QTL regions based on Ensembl VEP annotation; Table S5: Functional enrichment analysis of sequence variants in QTL regions using g:Profiler; Table S6: List of publications with GWAS results overlap with loci identified in this study.

Author Contributions

Conceptualization, G.A.B. and P.K.; methodology, P.K.; validation, G.A.B. and P.K.; formal analysis, A.M.; investigation, A.M., P.K. and M.R.; resources, G.A.B.; data curation, M.R. and P.K.; writing—original draft preparation, A.M. and P.K.; writing—review and editing, G.A.B. and P.K.; visualization, A.M. and P.K.; supervision, P.K.; project administration, P.K.; funding acquisition, G.A.B. All authors have read and agreed to the published version of the manuscript.

Funding

The project was supported with funds from the Federal Ministry of Food and Agriculture (BMEL) based on a decision of the parliament of the Federal Republic of Germany via the Federal Office for Agriculture and Food (BLE) under the Federal Program for Ecological Farming and Other Forms of Sustainable Agriculture (Funding number: 2815NA010).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets analyzed in this study can be found in the European Nucleotide Archive (ENA) (https://www.ebi.ac.uk/ena/browser/home, accessed on 5 February 2025) (WGS data: PRJEB45822, DSN200k SNP chip data: PRJEB46861, IlluminaBovineSNP50 BeadChip data: PRJEB42513).

Acknowledgments

The RBB Rinderproduktion Berlin-Brandenburg GmbH and DSN farms supported this project with their expertise in animal selection and by collecting ear tags.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Bundesanstalt für Landwirtschaft und Ernährung (BLE). Einheimische Nutztierrassen in Deutschland Und Rote Liste Gefährdeter Nutztierrassen 2023; BLE: Bonn, Germany, 2023. [Google Scholar]
  2. Adler, B.; Gassan, M.; Thiele, M. Deutsches Schwarzbuntes Niederungsrind—50 Jahre DSN-Genreservezucht in Brandenburg; RBB Rinderproduktion Berlin-Brandenburg GmbH: Groß Kreutz, Germany, 2023. [Google Scholar]
  3. Rinderzuchtverband Berlin-Brandenburg eG (RZB eG). Zuchtprogramm Für Die Rasse Deutsches Schwarzbuntes Niederungsrind; RZB eG: Groß Kreutz, Germany, 2022. [Google Scholar]
  4. Pausch, H.; Emmerling, R.; Schwarzenbacher, H.; Fries, R. A Multi-Trait Meta-Analysis with Imputed Sequence Variants Reveals Twelve QTL for Mammary Gland Morphology in Fleckvieh Cattle. Genet. Sel. Evol. GSE 2016, 48, 14. [Google Scholar] [CrossRef] [PubMed]
  5. Wu, X.; Fang, M.; Liu, L.; Wang, S.; Liu, J.; Ding, X.; Zhang, S.; Zhang, Q.; Zhang, Y.; Qiao, L.; et al. Genome Wide Association Studies for Body Conformation Traits in the Chinese Holstein Cattle Population. BMC Genom. 2013, 14, 897. [Google Scholar] [CrossRef] [PubMed]
  6. Flury, C.; Boschung, C.; Denzler, M.; Bapst, B.; Schnyder, U.; Gredler, B.; Signer-Hasler, H. Genome-wide association study for 13 udder traits from linear type classification in cattle. In Proceedings of the 10th World Congress on Genetics Applied to Livestock Production, Vancouver, BC, Canada, 17–22 August 2014. [Google Scholar] [CrossRef]
  7. Nazar, M.; Abdalla, I.M.; Chen, Z.; Ullah, N.; Liang, Y.; Chu, S.; Xu, T.; Mao, Y.; Yang, Z.; Lu, X. Genome-Wide Association Study for Udder Conformation Traits in Chinese Holstein Cattle. Animals 2022, 12, 2542. [Google Scholar] [CrossRef] [PubMed]
  8. Wolf, M.J.; Yin, T.; Neumann, G.B.; Korkuć, P.; Brockmann, G.A.; König, S.; May, K. Genome-Wide Association Study Using Whole-Genome Sequence Data for Fertility, Health Indicator, and Endoparasite Infection Traits in German Black Pied Cattle. Genes 2021, 12, 1163. [Google Scholar] [CrossRef]
  9. May, K.; Scheper, C.; Brügemann, K.; Yin, T.; Strube, C.; Korkuć, P.; Brockmann, G.A.; König, S. Genome-Wide Associations and Functional Gene Analyses for Endoparasite Resistance in an Endangered Population of Native German Black Pied Cattle. BMC Genom. 2019, 20, 277. [Google Scholar] [CrossRef]
  10. Korkuć, P.; Arends, D.; May, K.; König, S.; Brockmann, G.A. Genomic Loci Affecting Milk Production in German Black Pied Cattle (DSN). Front. Genet. 2021, 12, 640039. [Google Scholar] [CrossRef]
  11. Korkuć, P.; Neumann, G.B.; Hesse, D.; Arends, D.; Reißmann, M.; Rahmatalla, S.; May, K.; Wolf, M.J.; König, S.; Brockmann, G.A. Whole-Genome Sequencing Data Reveal New Loci Affecting Milk Production in German Black Pied Cattle (DSN). Genes 2023, 14, 581. [Google Scholar] [CrossRef]
  12. Meier, S.; Arends, D.; Korkuć, P.; Neumann, G.B.; Brockmann, G.A. A Genome-Wide Association Study for Clinical Mastitis in the Dual-Purpose German Black Pied Cattle Breed. J. Dairy Sci. 2020, 103, 10289–10298. [Google Scholar] [CrossRef]
  13. Neumann, G.B.; Korkuć, P.; Arends, D.; Wolf, M.J.; May, K.; Reißmann, M.; Elzaki, S.; König, S.; Brockmann, G.A. Design and Performance of a Bovine 200 k SNP Chip Developed for Endangered German Black Pied Cattle (DSN). BMC Genom. 2021, 22, 905. [Google Scholar] [CrossRef]
  14. Browning, B.L.; Zhou, Y.; Browning, S.R. A One-Penny Imputed Genome from next-Generation Reference Panels. Am. J. Hum. Genet. 2018, 103, 338–348. [Google Scholar] [CrossRef]
  15. Danecek, P.; Auton, A.; Abecasis, G.; Albers, C.A.; Banks, E.; DePristo, M.A.; Handsaker, R.E.; Lunter, G.; Marth, G.T.; Sherry, S.T.; et al. The Variant Call Format and VCFtools. Bioinformatics 2011, 27, 2156–2158. [Google Scholar] [CrossRef] [PubMed]
  16. Rosen, B.D.; Bickhart, D.M.; Schnabel, R.D.; Koren, S.; Elsik, C.G.; Zimin, A.; Dreischer, C.; Schultheiss, S.; Hall, R.; Schroeder, S.G.; et al. Modernizing the Bovine Reference Genome Assembly. In Proceedings of the 11th World Congress on Genetics Applied to Livestock Production, Auckland, New Zealand, 11–16 February 2018. [Google Scholar]
  17. Christensen, R.H.B. Ordinal—Regression Models for Ordinal Data. R Package Version 2023.12-4.1, 2023. Available online: https://cran.r-project.org/web/packages/ordinal/index.html (accessed on 27 February 2025).
  18. Venables, W.N.; Ripley, B.D. Modern Applied Statistics with S, 4th ed.; Springer: New York, NY, USA, 2002; ISBN 0-387-95457-0. [Google Scholar]
  19. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.R.; Bender, D.; Maller, J.; Sklar, P.; De Bakker, P.I.W.; Daly, M.J.; et al. PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef] [PubMed]
  20. Martin, F.J.; Amode, M.R.; Aneja, A.; Austine-Orimoloye, O.; Azov, A.G.; Barnes, I.; Becker, A.; Bennett, R.; Berry, A.; Bhai, J.; et al. Ensembl 2023. Nucleic Acids Res. 2023, 51, D933–D941. [Google Scholar] [CrossRef] [PubMed]
  21. Raudvere, U.; Kolberg, L.; Kuzmin, I.; Arak, T.; Adler, P.; Peterson, H.; Vilo, J. G:Profiler: A Web Server for Functional Enrichment Analysis and Conversions of Gene Lists (2019 Update). Nucleic Acids Res. 2019, 47, W191–W198. [Google Scholar] [CrossRef]
  22. McLaren, W.; Gil, L.; Hunt, S.E.; Riat, H.S.; Ritchie, G.R.S.S.; Thormann, A.; Flicek, P.; Cunningham, F. The Ensembl Variant Effect Predictor. Genome Biol. 2016, 17, 122. [Google Scholar] [CrossRef]
  23. Hu, Z.L.; Park, C.A.; Reecy, J.M. A Combinatorial Approach Implementing New Database Structures to Facilitate Practical Data Curation Management of QTL, Association, Correlation and Heritability Data on Trait Variants. Database 2023, 2023, 24. [Google Scholar] [CrossRef]
  24. Fantini, D. EasyPubMed: Search and Retrieve Scientific Publication Records from PubMed. R Package Version 2.13, 2019. Available online: https://10.32614/CRAN.package.easyPubMed (accessed on 27 February 2025).
  25. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2023. [Google Scholar]
  26. Wickham, H. ggplot2: Elegant Graphics for Data Analysis; Springer: New York, NY, USA, 2016. [Google Scholar]
  27. Wickham, H. Reshaping Data with the Reshape Package. J. Stat. Softw. 2007, 21, 1–20. [Google Scholar] [CrossRef]
  28. Kassambara, A. ggpubr: “ggplot2” Based Publication Ready Plots. R Package Version 0.6.0, 2020. Available online: https://10.32614/CRAN.package.ggpubr (accessed on 27 February 2025).
  29. Loker, S.; Bastin, C.; Miglior, F.; Sewalem, A.; Schaeffer, L.R.; Jamrozik, J.; Ali, A.; Osborne, V. Genetic and Environmental Relationships between Body Condition Score and Milk Production Traits in Canadian Holsteins. J. Dairy Sci. 2012, 95, 410–419. [Google Scholar] [CrossRef]
  30. Khmelnychyii, L.; Khmelnychyii, S.; Samokhina, Y. Correlation between Descriptive and Group Type Traits in the System of Cow’s Linear Classification of Ukrainian Brown Dairy Breed. Open Agric. 2023, 8, 20220180. [Google Scholar] [CrossRef]
  31. Marete, A.G.; Guldbrandtsen, B.; Lund, M.S.; Fritz, S.; Sahana, G.; Boichard, D. A Meta-Analysis Including Pre-Selected Sequence Variants Associated with Seven Traits in Three French Dairy Cattle Populations. Front. Genet. 2018, 9, 399480. [Google Scholar] [CrossRef]
  32. Bouwman, A.C.; Daetwyler, H.D.; Chamberlain, A.J.; Ponce, C.H.; Sargolzaei, M.; Schenkel, F.S.; Sahana, G.; Govignon-Gion, A.; Boitard, S.; Dolezal, M.; et al. Meta-Analysis of Genome-Wide Association Studies for Cattle Stature Identifies Common Genes That Regulate Body Size in Mammals. Nat. Genet. 2018, 50, 362–367. [Google Scholar] [CrossRef] [PubMed]
  33. Cole, J.B.; Wiggans, G.R.; Ma, L.; Sonstegard, T.S.; Lawlor, T.J.; Crooker, B.A.; Van Tassell, C.P.; Yang, J.; Wang, S.; Matukumalli, L.K.; et al. Genome-Wide Association Analysis of Thirty One Production, Health, Reproduction and Body Conformation Traits in Contemporary U.S. Holstein Cows. BMC Genom. 2011, 12, 408. [Google Scholar] [CrossRef] [PubMed]
  34. Cohen-Zinder, M.; Asher, A.; Lipkin, E.; Feingersch, R.; Agmon, R.; Karasik, D.; Brosh, A.; Shabtay, A. FABP4 Is a Leading Candidate Gene Associated with Residual Feed Intake in Growing Holstein Calves. Physiol. Genom. 2016, 48, 367–376. [Google Scholar] [CrossRef] [PubMed]
  35. Snelling, W.M.; Allan, M.F.; Keele, J.W.; Kuehn, L.A.; McDaneld, T.; Smith, T.P.L.; Sonstegard, T.S.; Thallman, R.M.; Bennett, G.L. Genome-Wide Association Study of Growth in Crossbred Beef Cattle. J. Anim. Sci. 2010, 88, 837–848. [Google Scholar] [CrossRef]
  36. Lu, D.; Miller, S.; Sargolzaei, M.; Kelly, M.; Vander Voort, G.; Caldwell, T.; Wang, Z.; Plastow, G.; Moore, S. Genome-Wide Association Analyses for Growth and Feed Efficiency Traits in Beef Cattle1. J. Anim. Sci. 2013, 91, 3612–3633. [Google Scholar] [CrossRef]
  37. Kolbehdari, D.; Wang, Z.; Grant, J.R.R.; Murdoch, B.; Prasad, A.; Xiu, Z.; Marques, E.; Stothard, P.; Moore, S.S.S. A Whole-Genome Scan to Map Quantitative Trait Loci for Conformation and Functional Traits in Canadian Holstein Bulls. J. Dairy Sci. 2008, 91, 2844–2856. [Google Scholar] [CrossRef]
  38. Igoshin, A.V.; Yudin, N.S.; Romashov, G.A.; Larkin, D.M. A Multibreed Genome-Wide Association Study for Cattle Leukocyte Telomere Length. Genes 2023, 14, 1596. [Google Scholar] [CrossRef]
  39. Do, D.N.; Bissonnette, N.; Lacasse, P.; Miglior, F.; Sargolzaei, M.; Zhao, X.; Ibeagha-Awemu, E.M. Genome-Wide Association Analysis and Pathways Enrichment for Lactation Persistency in Canadian Holstein Cattle. J. Dairy Sci. 2017, 100, 1955–1970. [Google Scholar] [CrossRef]
  40. Babiuk, S.; Horseman, B.; Zhang, C.; Bickis, M.; Kusalik, A.; Schook, L.B.; Abrahamsen, M.S.; Pontarollo, R. BoLA Class I Allele Diversity and Polymorphism in a Herd of Cattle. Immunogenetics 2007, 59, 167–176. [Google Scholar] [CrossRef]
  41. Chen, S.Y.; Oliveira, H.R.; Schenkel, F.S.; Pedrosa, V.B.; Melka, M.G.; Brito, L.F. Using Imputed Whole-Genome Sequence Variants to Uncover Candidate Mutations and Genes Affecting Milking Speed and Temperament in Holstein Cattle. J. Dairy Sci. 2020, 103, 10383–10398. [Google Scholar] [CrossRef]
  42. Tijjani, A.; Utsunomiya, Y.T.; Ezekwe, A.G.; Nashiru, O.; Hanotte, O. Genome Sequence Analysis Reveals Selection Signatures in Endangered Trypanotolerant West African Muturu Cattle. Front. Genet. 2019, 10. [Google Scholar] [CrossRef] [PubMed]
  43. Connor, E.E.; Zhou, Y.; Liu, G.E. The Essence of Appetite: Does Olfactory Receptor Variation Play a Role? J. Anim. Sci. 2018, 96, 1551–1558. [Google Scholar] [CrossRef] [PubMed]
  44. Wu, C.; Jia, Y.; Lee, J.H.; Kim, Y.; Sekharan, S.; Batista, V.S.; Lee, S.J. Activation of OR1A1 Suppresses PPAR-γ Expression by Inducing HES-1 in Cultured Hepatocytes. Int. J. Biochem. Cell Biol. 2015, 64, 75–80. [Google Scholar] [CrossRef]
  45. An, S.S.; Liggett, S.B. Taste and Smell GPCRs in the Lung: Evidence for a Previously Unrecognized Widespread Chemosensory System. Cell. Signal. 2018, 41, 82–88. [Google Scholar] [CrossRef] [PubMed]
  46. Kang, N.; Bahk, Y.Y.; Lee, N.; Jae, Y.; Cho, Y.H.; Ku, C.R.; Byun, Y.; Lee, E.J.; Kim, M.S.; Koo, J. Olfactory Receptor Olfr544 Responding to Azelaic Acid Regulates Glucagon Secretion in α-Cells of Mouse Pancreatic Islets. Biochem. Biophys. Res. Commun. 2015, 460, 616–621. [Google Scholar] [CrossRef]
  47. Krause, F.; Mohebian, K.; Delpero, M.; Hesse, D.; Kühn, R.; Arends, D.; Brockmann, G.A. A Deletion Containing a CTCF-Element in Intron 8 of the Bbs7 Gene Is Partially Responsible for Juvenile Obesity in the Berlin Fat Mouse. Mamm. Genome 2022, 33, 465–470. [Google Scholar] [CrossRef]
  48. Mattick, J.S.; Makunin, I.V. Non-Coding RNA. Hum. Mol. Genet. 2006, 15, R17–R29. [Google Scholar] [CrossRef]
Figure 1. Genome-wide association results for LINDAIRY_CHARACTER on chromosome 20 in DSN cattle. (A) Manhattan plot displaying the genomic positions versus the significance of genetic variants. The horizontal lines represent Bonferroni-corrected thresholds for genome-wide significance (red, p < 0.05) and suggestive association (blue, p < 0.1). (B) Regional association plot highlighting a detailed region around the significant locus, showing positional candidate genes. (C) Variant effect plot illustrating genotype-specific phenotypic effects for the top variant rs381893684 at position 21,627,470 bp. Additionally, p-values between genotype groups and the number of animals in each genotype group are provided.
Figure 1. Genome-wide association results for LINDAIRY_CHARACTER on chromosome 20 in DSN cattle. (A) Manhattan plot displaying the genomic positions versus the significance of genetic variants. The horizontal lines represent Bonferroni-corrected thresholds for genome-wide significance (red, p < 0.05) and suggestive association (blue, p < 0.1). (B) Regional association plot highlighting a detailed region around the significant locus, showing positional candidate genes. (C) Variant effect plot illustrating genotype-specific phenotypic effects for the top variant rs381893684 at position 21,627,470 bp. Additionally, p-values between genotype groups and the number of animals in each genotype group are provided.
Genes 16 00445 g001
Figure 2. Genome-wide association results for LINRUMP_WIDTH on chromosome 21. (A) Manhattan plot displaying the genomic positions versus the significance of genetic variants. The horizontal lines represent Bonferroni-corrected thresholds for genome-wide significance (red, p < 0.05) and suggestive association (blue, p < 0.1). (B) Regional association plot highlighting a detailed region around the significant locus, showing positional candidate genes. (C) Variant effect plot illustrating genotype-specific phenotypic effects for the top variant rs210390204 at 17,993,077 bp. Additionally, p-values between genotype groups and the number of animals in each genotype group are provided.
Figure 2. Genome-wide association results for LINRUMP_WIDTH on chromosome 21. (A) Manhattan plot displaying the genomic positions versus the significance of genetic variants. The horizontal lines represent Bonferroni-corrected thresholds for genome-wide significance (red, p < 0.05) and suggestive association (blue, p < 0.1). (B) Regional association plot highlighting a detailed region around the significant locus, showing positional candidate genes. (C) Variant effect plot illustrating genotype-specific phenotypic effects for the top variant rs210390204 at 17,993,077 bp. Additionally, p-values between genotype groups and the number of animals in each genotype group are provided.
Genes 16 00445 g002
Figure 3. Genome-wide association results for LINRUMP_ANGLE on chromosome 26. (A) Manhattan plot displaying the genomic positions versus the significance of genetic variants. The horizontal lines represent Bonferroni-corrected thresholds for genome-wide significance (red, p < 0.05) and suggestive association (blue, p < 0.1). (B) Regional association plot highlighting a detailed region around the significant locus, showing positional candidate genes. (C) Variant effect plot illustrating genotype-specific phenotypic effects for the top variant rs110956215 at 21,470,978 bp. Additionally, p-values between genotype groups and the number of animals in each genotype group are provided.
Figure 3. Genome-wide association results for LINRUMP_ANGLE on chromosome 26. (A) Manhattan plot displaying the genomic positions versus the significance of genetic variants. The horizontal lines represent Bonferroni-corrected thresholds for genome-wide significance (red, p < 0.05) and suggestive association (blue, p < 0.1). (B) Regional association plot highlighting a detailed region around the significant locus, showing positional candidate genes. (C) Variant effect plot illustrating genotype-specific phenotypic effects for the top variant rs110956215 at 21,470,978 bp. Additionally, p-values between genotype groups and the number of animals in each genotype group are provided.
Genes 16 00445 g003
Figure 4. Genome-wide association results for LINREAR_LEG_SIDE on chromosome 20. (A) Manhattan plot displaying the genomic positions versus the significance of genetic variants. The horizontal lines represent Bonferroni-corrected thresholds for genome-wide significance (red, p < 0.05) and suggestive association (blue, p < 0.1). (B) Regional association plot highlighting a detailed region around the significant locus, showing positional candidate genes. (C) Variant effect plot illustrating genotype-specific phenotypic effects for the top variant at 29,327,008 bp. Additionally, p-values between genotype groups and the number of animals in each genotype group are provided.
Figure 4. Genome-wide association results for LINREAR_LEG_SIDE on chromosome 20. (A) Manhattan plot displaying the genomic positions versus the significance of genetic variants. The horizontal lines represent Bonferroni-corrected thresholds for genome-wide significance (red, p < 0.05) and suggestive association (blue, p < 0.1). (B) Regional association plot highlighting a detailed region around the significant locus, showing positional candidate genes. (C) Variant effect plot illustrating genotype-specific phenotypic effects for the top variant at 29,327,008 bp. Additionally, p-values between genotype groups and the number of animals in each genotype group are provided.
Genes 16 00445 g004
Figure 5. Genome-wide association results for LINHOCK_QUALITY on chromosome 23. (A) Manhattan plot displaying the genomic positions versus the significance of genetic variants. The horizontal lines represent Bonferroni-corrected thresholds for genome-wide significance (red, p < 0.05) and suggestive association (blue, p < 0.1). (B) Regional association plot highlighting a detailed region around the significant locus, showing positional candidate genes. (C) Variant effect plot illustrating genotype-specific phenotypic effects for the top variant rs800639948 at 28,684,391 bp. Additionally, p-values between genotype groups and the number of animals in each genotype group are provided.
Figure 5. Genome-wide association results for LINHOCK_QUALITY on chromosome 23. (A) Manhattan plot displaying the genomic positions versus the significance of genetic variants. The horizontal lines represent Bonferroni-corrected thresholds for genome-wide significance (red, p < 0.05) and suggestive association (blue, p < 0.1). (B) Regional association plot highlighting a detailed region around the significant locus, showing positional candidate genes. (C) Variant effect plot illustrating genotype-specific phenotypic effects for the top variant rs800639948 at 28,684,391 bp. Additionally, p-values between genotype groups and the number of animals in each genotype group are provided.
Genes 16 00445 g005
Figure 6. Genome-wide association results for LINCENTRAL_LIGAMENT on chromosome 15. (A) Manhattan plot displaying the genomic positions versus the significance of genetic variants. The horizontal lines represent Bonferroni-corrected thresholds for genome-wide significance (red, p < 0.05) and suggestive association (blue, p < 0.1). (B) Regional association plot highlighting a detailed region around the significant locus, showing positional candidate genes. (C) Variant effect plot illustrating genotype-specific phenotypic effects for the top variant at 50,014,862 bp. Additionally, p-values between genotype groups and the number of animals in each genotype group are provided.
Figure 6. Genome-wide association results for LINCENTRAL_LIGAMENT on chromosome 15. (A) Manhattan plot displaying the genomic positions versus the significance of genetic variants. The horizontal lines represent Bonferroni-corrected thresholds for genome-wide significance (red, p < 0.05) and suggestive association (blue, p < 0.1). (B) Regional association plot highlighting a detailed region around the significant locus, showing positional candidate genes. (C) Variant effect plot illustrating genotype-specific phenotypic effects for the top variant at 50,014,862 bp. Additionally, p-values between genotype groups and the number of animals in each genotype group are provided.
Genes 16 00445 g006
Figure 7. Genome-wide association results for LINTEAT_LENGTH on chromosome 22. (A) Manhattan plot displaying the genomic positions versus the significance of genetic variants. The horizontal lines represent Bonferroni-corrected thresholds for genome-wide significance (red, p < 0.05) and suggestive association (blue, p < 0.1). (B) Regional association plot highlighting a detailed region around the significant locus, showing positional candidate genes. (C) Variant effect plot illustrating genotype-specific phenotypic effects for the top variant rs378674848 at 17,411,573 bp. Additionally, p-values between genotype groups and the number of animals in each genotype group are provided.
Figure 7. Genome-wide association results for LINTEAT_LENGTH on chromosome 22. (A) Manhattan plot displaying the genomic positions versus the significance of genetic variants. The horizontal lines represent Bonferroni-corrected thresholds for genome-wide significance (red, p < 0.05) and suggestive association (blue, p < 0.1). (B) Regional association plot highlighting a detailed region around the significant locus, showing positional candidate genes. (C) Variant effect plot illustrating genotype-specific phenotypic effects for the top variant rs378674848 at 17,411,573 bp. Additionally, p-values between genotype groups and the number of animals in each genotype group are provided.
Genes 16 00445 g007
Table 1. Overview of the four composite (COM) scores and 19 linear (LIN) traits they consist of in DSN cows. The traits and scores are given with their description, the measurement unit and range, and the calculated minimum (min), mean, standard deviation (sd), and maximum (max) values in the investigated DSN cow population.
Table 1. Overview of the four composite (COM) scores and 19 linear (LIN) traits they consist of in DSN cows. The traits and scores are given with their description, the measurement unit and range, and the calculated minimum (min), mean, standard deviation (sd), and maximum (max) values in the investigated DSN cow population.
TraitDescriptionUnit, RangeMinMeanSdMax
COMDAIRY STRENGTHComposite score based on linear trait belowpoints, 65–887079.13.086
LINDAIRY_CHARACTERRip angle and openness, and bone flatness (angularity)scale, 1–913.41.37
COMBODY CONFORMATIONComposite score based on linear traits belowpoints, 65–886979.33.487
LINBCSArea between ischial tuberosities and lumbar spinescale, 1–936.11.19
LINBODY_DEPTHDistance at last rib from spine (top) to barrel (bottom)scale, 1–925.41.39
LINCHEST_WIDTHWidth between the top of the front legsscale, 1–936.31.19
LINRUMP_ANGLERump structure from hips to pinsscale, 1–924.80.97
LINRUMP_WIDTHDistance between pin bones (outer points)scale, 1–935.71.29
LINSTATUREDistance from spine (top between hips) to groundcm, -128136.82.9145
COMLEGS&FEETComposite score based on linear traits belowpoints, 65–887382.02.887
LINFOOT_ANGLEAngle at the front of the rear right hoofscale, 1–935.11.08
LINHOCK_QUALITYThickness of the ankle jointscale, 1–926.01.49
LINLOCOMOTIONStep length and straightness (movement harmony)scale, 1–915.91.79
LINREAR_LEG_REARDirection of feet (viewed from rear)scale, 1–925.91.59
LINREAR_LEG_SIDEAngle at the front of the hockscale, 1–935.30.98
COMMAMMARYComposite score based on linear traits belowpoints, 65–886878.33.587
LINCENTRAL_LIGAMENTDepth of the cleft at the base of the rear udderscale, 1–914.31.48
LINFORE_UDDERFore udder attachment strength to abdominal wallscale, 1–925.31.39
LINFRONT_TEATFront teat position (relative to udder center)scale, 1–914.01.27
LINREAR_TEATRear teat position (relative to udder center)scale, 1–925.01.18
LINREAR_UDDERDistance from vulva (bottom) to udder relative to heightscale, 1–914.31.48
LINTEAT_LENGTHLength of the front teatscale, 1–924.91.07
LINUDDER_DEPTHDistance from udder (lowest part) to the hockscale, 1–914.41.28
Table 2. Pairwise correlations between composite (COM) scores and linear (LIN) traits in DSN cows. Correlations coefficients with |r| ≥ 0.25 are highlighted in bold. Correlation coefficients with |r| ≥ 0.1 were significant after Bonferroni correction for multiple testing.
Table 2. Pairwise correlations between composite (COM) scores and linear (LIN) traits in DSN cows. Correlations coefficients with |r| ≥ 0.25 are highlighted in bold. Correlation coefficients with |r| ≥ 0.1 were significant after Bonferroni correction for multiple testing.
COMDAIRY STRENGTHCOMBODY CONFORMATIONCOMLEGS&FEETCOMMAMMARY
LINDAIRY_CHARACTER0.54−0.060.100.05
LINBCS−0.270.340.050.01
LINBODY_DEPTH0.200.660.030.07
LINCHEST_WIDTH−0.050.530.010.05
LINRUMP_ANGLE−0.10−0.06−0.04−0.08
LINRUMP_WIDTH0.130.520.050.05
LINSTATURE0.340.600.020.05
LINFOOT_ANGLE0.080.110.350.10
LINHOCK_QUALITY0.13−0.020.430.12
LINLOCOMOTION0.120.090.830.18
LINREAR_LEG_REAR0.150.110.610.17
LINREAR_LEG_SIDE−0.02−0.10−0.30−0.06
LINCENT._LIGAMENT0.120.010.090.36
LINFORE_UDDER0.070.120.110.53
LINFRONT_TEAT0.110.110.150.47
LINREAR_TEAT0.110.130.070.30
LINREAR_UDDER0.200.060.250.54
LINTEAT_LENGTH−0.010.040.01−0.01
LINUDDER_DEPTH0.120.090.030.46
Table 3. Top variants associated with conformation traits in DSN cows. Variants are sorted first by their assignment to composition scores and then by chromosomal (Chr) position. Further, rs-ID, minor allele (MA), MA frequency (MAF), β-effect of the MA (βMA), its standard error (SE), and the −log10(p) are provided. Corrected p-values were classified as highly significant (p < 0.01, −log10(p) = 7.76), significant (p < 0.05, −log10(p) = 7.06), and suggestive (p < 0.1, −log10(p) = 6.76). Variants belonging to loci discussed in detail in this study are highlighted in bold.
Table 3. Top variants associated with conformation traits in DSN cows. Variants are sorted first by their assignment to composition scores and then by chromosomal (Chr) position. Further, rs-ID, minor allele (MA), MA frequency (MAF), β-effect of the MA (βMA), its standard error (SE), and the −log10(p) are provided. Corrected p-values were classified as highly significant (p < 0.01, −log10(p) = 7.76), significant (p < 0.05, −log10(p) = 7.06), and suggestive (p < 0.1, −log10(p) = 6.76). Variants belonging to loci discussed in detail in this study are highlighted in bold.
Trait/ScoreChrPositionrs-IDMAMAFβMASE(βMA)−log10(p)
COMDAIRY STRENGTH
LINDAIRY_CHARACTER2021,627,470rs381893684G0.500.430.077.23
COMDAIRY STRENGTHX11,548,482-C0.30−0.860.127.26
COMBODY CONFORMATION
LINBCS2109,169,387rs378978517C0.16−0.550.097.01
LINRUMP_ANGLE798,433,366rs110898200A0.44−0.460.076.92
LINRUMP_WIDTH815,021,228rs109990310G0.18−0.580.096.93
LINBCS1090,095,068rs385227612ATG0.120.670.106.77
LINRUMP_WIDTH2117,993,077rs210390204T0.19−0.580.088.27
COMBODY CONFORMATION2137,111,947rs132717442A0.15−1.090.157.31
LINBCS2420,358,408rs134912642A0.400.440.077.49
LINRUMP_ANGLE2621,470,978rs110956215A0.21−0.550.087.24
COMLEGS&FEET
COMLEGS&FEET6105,886,046rs133849982C0.19−0.650.127.34
LINLOCOMOTION123,371,041rs42662438G0.220.410.087.25
LINREAR_LEG_SIDE1317,654,151-C0.080.770.136.92
LINREAR_LEG_SIDE1922,241,461rs379037369C0.440.410.076.98
LINREAR_LEG_SIDE2029,327,008-G0.380.440.077.29
LINHOCK_QUALITY2328,684,391rs800639948GA0.15−0.520.096.90
COMLEGS&FEET2410,452,692rs110128208T0.09−0.920.176.78
COMLEGS&FEET265,538,549rs135255414T0.17−0.620.126.80
COMMAMMARY
LINFORE_UDDER421,420,050rs134371955G0.50−0.370.067.11
LINREAR_UDDER615,034,445-T0.05−0.890.157.35
LINCENTRAL_LIGAMENT1550,014,862-G0.380.380.067.21
LINFORE_UDDER1714,049,897rs379498892A0.34−0.390.077.25
LINCENTRAL_LIGAMENT1816,330,666rs110023445G0.38−0.380.067.06
LINTEAT_LENGTH2217,411,573rs378674848T0.10−0.780.117.91
Table 4. QTL regions and positional candidate genes for conformation traits in DSN cows. QTL regions are listed in the same order as their corresponding top variants in Table 3. QTL regions discussed in detail in this study are highlighted in bold.
Table 4. QTL regions and positional candidate genes for conformation traits in DSN cows. QTL regions are listed in the same order as their corresponding top variants in Table 3. QTL regions discussed in detail in this study are highlighted in bold.
ChrPositionQTL StartQTL EndPositional Candidate Genes (Number of Genes)
COMDAIRY STRENGTH
2021,627,47021,477,47021,777,470- (0)
X11,548,48211,398,48211,698,482- (0)
COMBODY CONFORMATION
2109,169,387109,019,387109,319,387- (0)
798,433,36698,283,36698,583,366- (0)
815,021,22814,871,22815,171,228- (0)
1090,095,06889,945,06890,245,068NRXN3 (1)
2117,993,07717,843,07718,143,077- (0)
2137,111,94736,961,94737,261,947- (0)
2420,358,40820,208,40820,508,408FHOD3, TPGS2, ENSBTAG00000048612 (3)
2621,470,97821,320,97821,620,978SEC31B, NDUFB8, HIF1AN, WNT8B, PAX2 (5)
COMLEGS&FEET
6105,886,046105,736,046106,036,046- (0)
123,371,0413,221,0413,521,041ENSBTAG00000051623 (1)
1317,654,15117,504,15117,804,151FBH1, IL15RA, ANKRD26, YME1L1, MASTL, ACBD5, ENSBTAG00000053950 (7)
1922,241,46122,091,46122,391,461RFLNB, GEMIN4, VPS53, RPH3AL, C19H17orf97, TLCD3A, ENSBTAG00000054644 (7)
2029,327,00829,177,00829,477,008HCN1 (1)
2328,684,39128,534,39128,834,391BOLA, TRIM10, TRIM15, BOLA-NC1, JSP.1, TRIM26, TRIM40, ENSBTAG00000037421 (8)
2410,452,69210,302,69210,602,692- (0)
265,538,5495,388,5495,688,549PCDH15 (1)
COMMAMMARY
421,420,05021,270,05022,207,653ETV1 (1)
615,034,44514,884,44515,184,445ENPEP, ENSBTAG00000054468 (2)
1550,014,86249,373,19150,164,862OR51AB4, OR52AB4, OR52AE11, OR51G4, OR52J3D, ENSBTAG00000038578, OR51AG1, ENSBTAG00000039185, OR51P1, OR51V22, OR52Z12, OR52AB9, OR52E58, OR52S27, OR52Z19, OR52AD1, OR52A24, OR51L15, OR51G2, OR51L1B, ENSBTAG00000049986, ENSBTAG00000050365, OR51V1B, ENSBTAG00000051323, OR52J1, OR52S6, OR52E1, OR51S1B, OR51P1B, OR51L22, OR51L1C, OR52S20, OR51A8, OR51H5 (34)
1714,049,89713,899,89714,281,310GYPA, GYPB (2)
1816,330,66616,180,66616,480,666- (0)
2217,411,57317,261,57317,561,573SRGAP3 (1)
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mandel, A.; Reißmann, M.; Brockmann, G.A.; Korkuć, P. Whole-Genome Insights into the Genetic Basis of Conformation Traits in German Black Pied (DSN) Cattle. Genes 2025, 16, 445. https://doi.org/10.3390/genes16040445

AMA Style

Mandel A, Reißmann M, Brockmann GA, Korkuć P. Whole-Genome Insights into the Genetic Basis of Conformation Traits in German Black Pied (DSN) Cattle. Genes. 2025; 16(4):445. https://doi.org/10.3390/genes16040445

Chicago/Turabian Style

Mandel, Amelie, Monika Reißmann, Gudrun A. Brockmann, and Paula Korkuć. 2025. "Whole-Genome Insights into the Genetic Basis of Conformation Traits in German Black Pied (DSN) Cattle" Genes 16, no. 4: 445. https://doi.org/10.3390/genes16040445

APA Style

Mandel, A., Reißmann, M., Brockmann, G. A., & Korkuć, P. (2025). Whole-Genome Insights into the Genetic Basis of Conformation Traits in German Black Pied (DSN) Cattle. Genes, 16(4), 445. https://doi.org/10.3390/genes16040445

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop