Next Article in Journal
Cystoid Macular Lesions in Inherited Retinal Diseases: Prevalence, Characteristics, and Genetic Associations in a Hungarian Cohort
Previous Article in Journal
Genetic Interrelationship Among Newly-Bred Mutant Lines of Wheat Using Diagnostic Simple Sequence Repeat Markers and Phenotypic Traits Under Drought
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Pedigree-Based Estimation of Y-STR Mutation and Male Differentiation Rates: Application to Historical Remains Identification

1
Centre for Genomics and Personalised Health, Genomics Research Centre, School of Biomedical Sciences, Queensland University of Technology (QUT), 60 Musk Avenue, Brisbane, QLD 4059, Australia
2
Unrecovered War Casualties—Army, Royal Australian Army, P.O. Box 7902, Canberra, ACT 2610, Australia
*
Author to whom correspondence should be addressed.
Genes 2025, 16(10), 1211; https://doi.org/10.3390/genes16101211
Submission received: 25 August 2025 / Revised: 12 September 2025 / Accepted: 17 September 2025 / Published: 14 October 2025
(This article belongs to the Special Issue Advances and Challenges in Forensic Genetics)

Abstract

Background/Objectives: High differentiation rates provided by Y-chromosomal short tandem repeats (Y-STRs) are highly advantageous in most forensic and genealogical casework, as they enhance the ability to exclude close or moderately related individuals, refine an individual’s position within a pedigree, and uncover the population substructure in otherwise homogeneous groups. However, the impact for historical remains identification casework is underexplored. Methods: We present a pedigree analysis of 366 males from 183 pedigrees, separated by 4 to 16 meioses at 27 Y-STR loci, from the Yfiler Plus kit. The differentiation rate for a given degree of separation was defined as the proportion of pairs at that specific number of meioses showing at least one allelic difference, relative to the total number of such pairs. Results: Our pedigree-based locus-specific mutation rates were consistent with published father–son values for 22 of 25 loci, with 3 loci (DYS389II, DYS449, and DYS570) being significantly different (p < 0.05). These results were consistent with previous pedigree-based estimates, and the strong agreement between father–son and pedigree-based mutation rates supports the use of pedigrees as a reliable method for estimating mutation rates. The probability of differentiating male relatives reached 60.1%, which is similar to previous studies using the Yfiler Plus kit. Conclusions: This high male differentiation rate is advantageous for distinguishing unrelated individuals within the same population, reducing false inclusions. However, when comparing distantly related individuals, excessive mutations accumulated over many generations may obscure genuine patrilineal relationships, increasing the risk of false exclusions. Our findings are likely to be highly valuable for future interpretation of Y-STR haplotypes from patrilineal relatives across a wide range of applications, with significant relevance to historical remains identification casework.

1. Introduction

Y-chromosome short tandem repeats (Y-STRs) have demonstrated significant utility in various fields such as forensic genetics, molecular anthropology, and genealogical studies because of their strict paternal inheritance and high discriminatory potential among unrelated males [1,2,3]. Y-STRs are located within the non-recombining region of the Y chromosome, and, because of this, are passed down through paternal lineages essentially unchanged, with the only source of variation between generations being mutation [4,5,6]. In forensic casework, Y-STR haplotypes are especially valuable in excluding male suspects as contributors to crime scene material, while matches have traditionally been evaluated through reference to population databases such as the Y Chromosome Haplotype Reference Database (YHRD; [7]). The increasing inclusion of rapidly mutating (RM) Y-STRs in commercial kits, such as Yfiler Plus, has markedly enhanced the discriminatory capacity of Y-STRs, not only between unrelated individuals but also among close paternal relatives [8,9,10].
Mutation rates are a critical parameter in the interpretation of Y-STR profiles, influencing both differentiation between paternal relatives and the statistical weight that can be attached to haplotype comparisons. RM loci exhibit mutation rates up to an order of magnitude higher than conventional markers, increasing the ability to distinguish closely related males and thereby improving outcomes in complex forensic cases and familial searching [4,5,6]. Hence, these markers are especially beneficial in scenarios involving missing persons, or mass disasters, where RM Y-STRs reduce the likelihood of coincidental matches and can offer finer resolution and individual identification [3,11,12,13,14]. However, this increased mutation rate can present interpretative challenges, particularly when comparing profiles from distantly related individuals. Over many generations, these mutations accumulate, meaning that even true paternal relatives may exhibit increased allelic mismatches. A key challenge addressed in this manuscript is understanding how many differences one might expect between two distantly related individuals, as is often encountered in historical forensic identification.
Early estimates of Y-STR mutation rates were almost exclusively based on close relatives, particularly father–son pairs, where the number of separating meioses is known with certainty and paternity can be confirmed with autosomal markers [15,16,17,18]. While this design ensures high confidence in observed mutational events, such studies require very large numbers of father–son pairs to achieve sufficient statistical power, and are often constrained by sample availability, cost, and labour. Pedigree-based studies offer a complementary approach by leveraging deep-rooted family structures in which many meioses can be represented with relatively few sampled males. This allows the accumulation of a larger number of meioses overall, theoretically increasing the precision of mutation rate estimates [19]. However, pedigree-based approaches also carry inherent challenges, including uncertainties from parallel or back mutations, multi-step changes, and misreported familial relationships [5,20,21]. Despite these limitations, recent studies have demonstrated concordance between pedigree-based and father–son-derived estimates, supporting the validity of pedigrees as a framework for mutation rate estimation [19]. Other studies have also extended Y-STR mutation analyses beyond close paternal relationships, examining individuals separated by up to four meioses [22] and, in a more limited sample, up to twenty meioses [4]. These investigations suggest that Y-STR profiles exhibiting two or fewer mutational differences are typically consistent with shared paternal ancestry, whereas higher levels of divergence are more indicative of unrelated male lineages. However, these studies employed earlier-generation Y-STR panels, and analysis was confined to relatively close relatives (≤4 meioses) or skewed towards lower degrees of separation (≤13 meioses).
Although previous investigations have highlighted the enhanced discriminatory power of RM Y-STRs (e.g., [4]), few have evaluated their cumulative mutational dynamics across extended paternal genealogies. Consequently, the interpretation of Y-STR mismatches in the context of distant paternal relationships, such as those encountered in historical human identification efforts, remains an unresolved and critical issue within forensic genetics. In Unrecovered War Casualties—Army (UWC-A) casework, skeletal remains of missing servicemembers are compared against DNA from distant paternal relatives (up to 16 meioses apart [23]), and, as the conflicts in question recede further into the past, the generational gap between potential family reference samples will continue to increase. Assumptions of haplotype concordance can be compromised by mutation events, and therefore high-resolution mutation data are particularly relevant in applied contexts such as the identification of historical or missing persons, where reference DNA samples often come from distant paternal relatives. In such cases, understanding the expected degree of differentiation between male relatives separated by multiple meioses is essential to avoid both false exclusions and overestimation of evidential value. While Ralf et al. [19] reported an analysis of male relative differentiation in distant relatives, additional studies remain limited.
In this study, we investigated Y-STR mutation and male relative differentiation rates using the Yfiler Plus kit across 183 pedigrees, encompassing 366 relatives separated by 4 to 16 meioses. By comparing our estimates with previously published father–son and pedigree-based studies, we aimed to evaluate the utility of pedigrees for mutation rate estimation, assess the discriminatory power of Yfiler Plus loci, and explore the implications of high differentiation rates for forensic casework and historical remains identification.

2. Materials and Methods

2.1. Ethics

This project received ethical approval from Queensland University of Technology and the Department of Defence Human Research Ethics Committees (2021000378 and 328-21 respectively). All analyses were performed in accordance with relevant guidelines and regulations, and all participants provided informed consent for research involvement. The study was conducted in accordance with the Declaration of Helsinki.

2.2. Sample Collection and Preparation

From September 2021 to December 2024, 674 male DNA samples were analysed, and, of these, 20 were excluded because they showed too much variation (i.e., more than 15 mutations) compared to other pedigree members to be reasonably considered paternally related. Another 8 samples were excluded from further analysis because of incomplete genotype data, and 280 were excluded because of the lack of another paternally related member with available genomic data. The remaining 366 males were from 183 pedigrees, separated by 4 to 16 meioses. The genealogical process for recruitment has been described previously [23]. These individuals were drawn from extended family pedigrees where at least two distantly related male descendants were available for Y-STR analysis. In each case, genealogical research indicated the relationship (e.g., 4 meioses could correspond to first cousins or a great-uncle–grand-nephew pair, whereas 16 meioses corresponds to seventh cousins under a symmetric pedigree; ‘once-removed’ relationships add one meiosis per generation offset).
Participants were provided with either a SpeciMAX Stabilised Saliva Collection Kit (Thermo Fisher Scientific, Waltham, MA, USA) or a PAXgene Saliva Collector (Qiagen, Hilden, Germany), which they returned to our laboratory upon collection. DNA was extracted from each saliva kit using prepIT.L2P reagent (DNA Genotek, Ottawa, ON, Canada) according to the manufacturer’s instructions. The quality and quantity of DNA were determined using the NanoDrop Spectrophotometer (Thermo Fisher Scientific) and Qubit dsDNA Quantification Assay (Invitrogen, Waltham, MA, USA).

2.3. Y-STR Genotyping

Samples were genotyped in a single multiplex reaction using the Yfiler Plus PCR Amplification Kit (Applied Biosystems, Waltham, MA, USA) as described previously [23]. This kit includes 27 Y-STR loci widely used in forensic laboratories and encompasses the traditional core Y-STRs with moderate mutation rates as well as several RM markers that have higher per-generation mutation frequencies. PCR products were separated and detected by capillary electrophoresis on an Applied Biosystems 3500 Genetic Analyser with POP-7 polymer (Applied Biosystems) as described previously [23]. Data analysis was performed using the software GeneMapper v6 (Applied Biosystems). All electropherograms and allele calls were carefully reviewed by a second independent analyst to confirm any cases of intermediate variants (e.g., duplicated alleles or null alleles indicating a deletion or primer-binding mutation).

2.4. Mutation Rate Estimation

To estimate Y-STR mutation rates using pedigree data, we adopted a frequentist framework similar to that described by Ralf et al. [19]. Mutation rates (μ) were estimated at each locus using the formula μ = x/N, where x is the number of observed mutations, and N is the total number of meioses. Importantly, in our study, each pedigree consisted of only two individuals; therefore, the number of pairwise meioses and actual meioses were equivalent, eliminating the risk of double-counting transmission events. This analysis was performed individually for each pedigree, after which, mutation counts and meioses were summed, allowing the per-marker rates to be estimated. Confidence intervals (CIs: 95%) were calculated using Epitools, an online tool provided by AusVet Animal Health Services [24]. The programme outputs intervals using five alternative calculation methods, as described by Brown et al. [25]. The Wilson and Clopper–Pearson methods were reported.
When estimating mutations from extended pedigrees (rather than direct father–son comparisons), several assumptions were necessary due to generational gaps or missing data:
  • When two males in a lineage showed no haplotypic differences, we assumed that no mutations occurred along those missing links because no intermediate relatives were available with genotype data.
  • Our analysis used extended pedigrees with no typed father–son pairs, and therefore the meiosis in which a mutation occurred could not be identified. Multi-step differences between patrilineal individuals were interpreted as multiple single-step mutations that were distributed along the connecting lineage. This rule applies only to non-adjacent relatives; in father–son pairs, a genuine multi-step event would be accepted as the only valid scenario.
  • Across all pedigrees, we applied a parsimony principle, always favouring the minimum number of mutational steps required to explain observed haplotypic differences.
These assumptions are expected to hold true for most cases, but we acknowledge that they may lead to errors in some. For example, genuine multi-step single-meiosis events may exist between non-adjacent relatives, and therefore these assumptions may slightly overcount the number of mutations occurring. However, we expect the impact on aggregate rate estimates is small.
For single-copy Y-STR loci, mutation inferences were straightforward. For example, a shift from allele 10 in one individual to allele 12 in another indicated two single-step mutations (as per Assumption 2). However, for multi-copy loci, more complex interpretations were required. In all scenarios, the guiding principle was to minimise inferred mutational steps while considering biological plausibility and data limitations. Statistical significance was determined using Fisher’s exact method.

2.5. Differentiation Rate Estimation

To estimate the Y-STR-based male relative differentiation rate across varying degrees of patrilineal separation, we again applied a frequentist approach following the methodology described by Ralf et al. [19]. Differentiation rates were calculated for relative pairs separated by 4 to 16 meioses. For each pair, we assessed whether at least one allelic difference was present across the tested Y-STR loci. The differentiation rate for a given degree of separation was defined as the proportion of pairs (at that specific number of meioses) showing at least one allelic difference, relative to the total number of such pairs. To quantify the statistical uncertainty of the estimates, 95% confidence intervals were calculated using the Clopper–Pearson method.

3. Results and Discussion

3.1. Summary Statistics

A total of 366 males from 183 pedigrees were analysed in this study, allowing the comparison of 183 paternally related pairs, separated by 4 to 16 meioses. Figure 1 provides a breakdown of the distribution of pairs. To determine the discrimination capacity, we took one individual from each unrelated family and observed 183 unique haplotypes among 183 unrelated men, which resulted in a haplotype discrimination capacity of 100%. This is consistent with previous studies [26]. Microvariants were found at loci DYS627 (20.2), DYS458 (16.2), DYS19 (14.2), and DYS570 (21.3). All variants were confirmed by repeating the analysis.

3.2. Mutation Breakdown

We observed 161 mutations (Figure 2, Table S1), noting the assumptions outlined in Section 2.4. Most of the mutations were single-step mutations (n = 157, 97.5%), with a small proportion being deletions or duplications that were observed in one individual but not the other (n = 4, 2.5%). There were eight instances where multi-step differences between patrilineal individuals were interpreted as multiple single-step mutations, as they were not father–son pairs (in which case, a single multi-step event would have been accepted as the only valid scenario).
Duplications, deletions, and even triplications are well documented within the Y chromosome [27,28]. In addition to the four instances where a duplication or deletion was observed in one individual but not the other (shown in Table S1), we observed three duplications or deletions in our samples which were evident in both individuals of the pair, demonstrating that these allele patterns can be inherited (Table 1). For example, FS_2 and FS_269 displayed a duplication (alleles 15,17 and 15,16, respectively) at DYS19. DYS19 was the first forensically useful Y-STR and has since become ubiquitous in the fields of Y-chromosomal forensic analysis and evolutionary studies. Deletions, duplications, and triplications are well documented within this locus and are reported within the National Institute of Standards and Technology (NIST) STRBase [27,29,30,31].
For FS_375, a deletion was observed in DYS448. Both samples of the pair were re-amplified to confirm the deletion, and the remaining loci in the multiplex had sufficient signal, indicating this was not allelic dropout. However, we note that this may not represent a true deletion but instead a mutation in the primer binding site that prevented amplification. As we did not use alternative primers to verify this, the possibility of a null allele due to primer binding site mutation cannot be excluded. Deletions at DYS448 have also been reported in the NIST STRbase [32].

3.3. Mutation Rate

Pedigree-based mutation rates were estimated and compared to mutation rate reference values which were derived from multiple father–son studies reported by Neuhuber et al. [15], as well as large-scale pedigree-based estimates by Ralf et al. [19] (Figure 3, Table 2). Notably, like Ralf et al. [19], we observed high concordance between mutation rates estimated from pedigree data and those based on father–son pairs obtained from Neuhuber et al. [15] (22 of the 25 Y-STRs analysed in our study, p > 0.05). Three of the markers showed a significant difference (p < 0.05) between the two ways of estimating rates: DYS389II, DYS449, and DYS570.
Like father–son estimates, pedigree-based estimates are not without limitations. For example, their reliability can be reduced by uncertainties arising from parallel mutations, back or forward mutations, and multi-step changes [5,21], and inaccuracies in reported biological relationships within pedigrees may introduce further error [20]. Despite these challenges, our findings showed that, for 22 of the 25 Y-STRs examined, the pedigree-based mutation rate estimates did not differ significantly from the father–son reference rates derived from large datasets. This high level of concordance indicates that pedigrees provide a valid and reliable framework for estimating Y-STR mutation rates.
Only three of the Y-STRs examined showed a significant difference compared to previous father–son rates. It is difficult to determine the exact cause of this; it could have been a result of the method employed to estimate the rate; the result of stochastic effects caused by the small sample size; or the result of the different biogeographic origin of the individuals included in the studies (because different populations exhibit different allelic distributions and therefore may exhibit higher or lower mutability for specific markers) [5,16]. Interestingly, though, for DYS449 and DYS570, our data also showed significant differences between our pedigree-based estimates and those by Ralf et al. [19], suggesting these are not the result of the method employed. Instead, these three loci may be outliers due to random sampling variation given the moderate number of events. Despite this, the strong agreement between the father–son and pedigree-based mutation rate estimates shown here indicates that pedigrees provide a valid framework for estimating mutation rates.

3.4. Male Differentiation Rate

The male relative differentiation rate for a given panel of Y-STRs refers to the rate at which a pair of paternally related males (such as brothers or first cousins) can be distinguished from one another by at least one allelic variation (mutation) across the markers. Across all 183 pairs separated by 4 to 16 meioses, 110 pairs exhibited ≥1 mutation, yielding an overall differentiation rate of 60.1%. This estimate is consistent with previous pedigree-based studies utilising the Yfiler Plus kit, where we observed a differentiation rate of 59.9% for pairs separated by 4 to 13 meiosis, compared with the 66.5% reported by Ralf et al. [19] for the same meiosis range. Among differentiated pairs, 97.3% differed at ≤2 loci and no pair exceeded four mutational differences. Although most pairs showed only a single mutation, forty pairs (21.9%) exhibited two, four pairs (2.2%) exhibited three, and one pair (0.5%) exhibited four. These results reflect the shallow accumulation of Y-STR changes across lineages, in line with observations from other deep pedigree studies [4,21,22,33], although this previous work did not employ Yfiler Plus.
Mutational events were not randomly distributed across loci. Instead, they were concentrated at RM Y-STRs, with DYS627 (38 pairs), DYS576 (35 pairs), DYF387S1 (33 pairs), and DYS518 (32 pairs) accounting for the majority of observed changes. This reinforces earlier findings that these loci, because of their elevated mutation rates, are especially discriminatory in patrilineal differentiation [4]. Our results therefore support previous conclusions that the incorporation of RM Y-STRs into Yfiler Plus substantially enhances the resolution of male relative differentiation (e.g., [10]).

3.5. Historical Remains Identification

Our data provide practical guardrails for interpreting Y-STR results when reference samples are available only from distant paternal relatives, as is common in historical human identification casework [23] and some missing-persons or mass-fatality contexts. Three features of our results are especially consequential: (i) the high overall male relative differentiation rate (~60% at 4–16 meioses), (ii) the concentration of mutational events at RM loci, and (iii) the shallow accumulation of mutational differences across many transmissions, with most differentiated pairs showing only one or two discordant loci and no pair showing more than four.
Our results indicate that distant patrilineal relatives can frequently differ at ≥1 Y-STR locus even when the relationship hypothesis is true. In our study, most differentiated true relatives diverged at only a small number of loci (typically ≤2), and multi-locus discordance >2 was rare. Operationally, this means that observing one to two discordant loci between a set of historical remains and a distant paternal reference should not be treated as definitive evidence against relatedness. Instead, the discordance must be weighed against (a) the number of separating meioses, (b) locus-specific mutation rates (especially for RM loci), and (c) plausible mutation pathways (including parallel and back mutations). Applying rigid ‘zero-mismatch’ expectations to distant relatives inflates the false-exclusion risk and is inconsistent with the empirical distribution we observed and with prior work using earlier panels (e.g., [4,22]).
Most observed changes arose at RM loci, reaffirming their contribution to both differentiation and interpretive complexity. When discordance is confined to one RM locus, particularly at high mutational markers, the evidential impact against relatedness is modest; by contrast, multiple independent differences at conventional (lower mutational) loci are more probative. Analysts should therefore adopt locus-aware weighting rather than treating any single mismatch as equivalent across the panel. Our concordance with father–son rate compilations for most loci (Figure 3) supports using those rates directly in likelihood calculations.
Current Scientific Working Group on DNA Analysis Methods interpretation guidelines [34] do not explicitly address scenarios where multiple discordant Y-STR haplotypes must be compared. In missing-person casework, it is common for several male relatives to be available; however, as shown here, their Y-STR haplotypes may differ because of mutational events (or misreported kinship). Rather than forcing the data into a single ‘consensus’ haplotype or discarding discordant references, casework should explicitly model alternative pedigree placements for each reference and evaluate the evidence jointly. Hence, more advanced interpretive approaches are needed, and likelihood-ratio methods have been proposed as a solution. Ge et al. [35], for example, extended the pedigree likelihood-ratio framework, already used with autosomal loci [36,37], to lineage markers such as Y-STRs and mitochondrial DNA. This method allows analysts to evaluate competing pedigree hypotheses, incorporate mutation rates, and express evidential weight as a likelihood ratio. While promising, such methods are not widely adopted, largely because of limited user-friendly software availability and the absence of detailed operational guidelines. Despite this, based on our data, we recommend moving away from binary ‘match/mismatch’ language toward quantitative likelihood ratios that condition on the proposed degree of relatedness (number of meioses); integrate locus-specific mutation rates; and allow for limited sets of discordances, especially at RM loci, under the relatedness hypothesis. The locus-specific mutation rates reported here can be incorporated into pedigree LR analyses following Ge et al. [35] by modelling a stepwise mutation process across m meioses under H1 (specified patrilineal relationship) and combining with calibrated haplotype probabilities under H2 (unrelated) that accommodate the population substructure (θ). A full LR case series requires population-matched databases and θ sensitivity analyses and is therefore beyond the scope of this rate estimation study.

4. Conclusions

Using 27 Yfiler Plus loci across 183 pedigrees (366 men) separated by 4–16 meioses, we show that pedigree-based estimation yields locus-specific mutation rates that largely agree with father–son datasets (22/25 loci), supporting pedigrees as a robust framework for rate estimation when direct transmissions are unavailable. Mutational changes were concentrated at RM loci (e.g., DYS627, DYS576, DYF387S1, DYS518), reaffirming their inflated contribution to patrilineal discrimination. The male differentiation rate observed in this study was 60.1%, with most differentiated pairs showing ≤2 locus differences and no pair showing more than 4. These dynamics are advantageous for excluding unrelated males within populations, yet they also highlight a key risk in historical or missing-persons casework, namely, that with increasing meioses, accumulated mutations can render true paternal relatives discordant at one or more loci. Accordingly, rigid mismatch thresholds (e.g., no more than three to five differences) are inappropriate for distant kinship inference and may lead to false exclusions. These findings reinforce the importance of contextual interpretation, especially during historical remains identification, and provide a foundation for improved forensic methodologies.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/genes16101211/s1, Table S1: Mutations observed in 183 distantly related pairs.

Author Contributions

Conceptualisation, J.R.C. and L.R.G.; data curation, J.R.C., T.W., T.Z. and L.A.; formal analysis, J.R.C.; funding acquisition, L.R.G.; methodology, J.R.C. and N.M.; project administration, L.R.G.; resources, L.R.G.; supervision, L.R.G.; writing—original draft, J.R.C.; writing—review and editing, J.R.C., T.W., T.Z., L.A., N.M. and L.R.G. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the Department of Defence Innovation Hub (P19-209484).

Institutional Review Board Statement

This project received ethical approval from Queensland University of Technology and the Department of Defence Human Research Ethics Committees (2021000378 and 328-21, respectively). All analyses were performed in accordance with relevant guidelines and regulations, and all participants provided informed consent for research involvement. The study was conducted in accordance with the Declaration of Helsinki.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The datasets presented in this article are not readily available due to privacy and ethical restrictions.

Acknowledgments

We wish to acknowledge the family members of missing servicemembers who generously donated their DNA samples and time to participate in this research study and for the ongoing community support which aids our research. This research was supported by a Defence Innovation Hub contract. We also wish to acknowledge past and present staff of Unrecovered War Casualties—Army, who have provided invaluable support, including Aaron Pegram and Cameron Clarke.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
μ Mutation rates
CIConfidence interval
NIST National Institute of Standards and Technology
RMRapidly mutating
UWC-AUnrecovered War Casualties—Army
YHRD Y Chromosome Haplotype Reference Database
Y-STRsY-chromosome short tandem repeats

References

  1. Kayser, M.; Caglià, A.; Corach, D.; Fretwell, N.; Gehrig, C.; Graziosi, G.; Heidorn, F.; Herrmann, S.; Herzog, B.; Hidding, M.; et al. Evaluation of Y-chromosomal STRs: A multicenter study. Int. J. Leg. Med. 1997, 110, 125–133. [Google Scholar] [CrossRef] [PubMed]
  2. Gusmão, L.; Sánchez-Diz, P.; Calafell, F.; Martín, P.; Alonso, C.A.; Álvarez-Fernández, F.; Alves, C.; Borjas-Fajardo, L.; Bozzo, W.R.; Bravo, M.L.; et al. Mutation rates at Y chromosome specific microsatellites. Hum. Mutat. 2005, 26, 520–528. [Google Scholar] [CrossRef]
  3. Kayser, M. Forensic use of Y-chromosome DNA: A general overview. Hum. Genet. 2017, 136, 621–635. [Google Scholar] [CrossRef]
  4. Ballantyne, K.N.; Goedbloed, M.; Fang, R.; Schaap, O.; Lao, O.; Wollstein, A.; Choi, Y.; van Duijn, K.; Vermeulen, M.; Brauer, S.; et al. Mutability of Y-Chromosomal Microsatellites: Rates, Characteristics, Molecular Bases, and Forensic Implications. Am. J. Hum. Genet. 2010, 87, 341–353. [Google Scholar] [CrossRef]
  5. Claerhout, S.; Vandenbosch, M.; Nivelle, K.; Gruyters, L.; Peeters, A.; Larmuseau, M.H.D.; Decorte, R. Determining Y-STR mutation rates in deep-routing genealogies: Identification of haplogroup differences. Forensic Sci. Int. Genet. 2018, 34, 1–10. [Google Scholar] [CrossRef]
  6. Wu, W.; Ren, W.; Hao, H.; Nan, H.; He, X.; Liu, Q.; Lu, D. Mutation rates at 42 Y chromosomal short tandem repeats in Chinese Han population in Eastern China. Int. J. Leg. Med. 2018, 132, 1317–1319. [Google Scholar] [CrossRef] [PubMed]
  7. Willuweit, S.; Roewer, L. International Forensic Y Chromosome User Group Y chromosome haplotype reference database (YHRD): Update. Forensic Sci. Int. Genet. 2007, 1, 83–87. [Google Scholar] [CrossRef] [PubMed]
  8. Ballantyne, K.N.; Keerl, V.; Wollstein, A.; Choi, Y.; Zuniga, S.B.; Ralf, A.; Vermeulen, M.; de Knijff, P.; Kayser, M. A new future of forensic Y-chromosome analysis: Rapidly mutating Y-STRs for differentiating male relatives and paternal lineages. Forensic Sci. Int. Genet. 2012, 6, 208–218. [Google Scholar] [CrossRef]
  9. Purps, J.; Siegert, S.; Willuweit, S.; Nagy, M.; Alves, C.; Salazar, R.; Angustia, S.M.T.; Santos, L.H.; Anslinger, K.; Bayer, B.; et al. A global analysis of Y-chromosomal haplotype diversity for 23 STR loci. Forensic Sci. Int. Genet. 2014, 12, 12–23. [Google Scholar] [CrossRef]
  10. Gopinath, S.; Zhong, C.; Nguyen, V.; Ge, J.; Lagacé, R.E.; Short, M.L.; Mulero, J.J. Developmental validation of the Yfiler® Plus PCR Amplification Kit: An enhanced Y-STR multiplex for casework and database applications. Forensic Sci. Int. Genet. 2016, 24, 164–175. [Google Scholar] [CrossRef]
  11. Jobling, M.A.; Tyler-Smith, C. The human Y chromosome: An evolutionary marker comes of age. Nat. Rev. Genet. 2003, 4, 598–612. [Google Scholar] [CrossRef]
  12. Balanovsky, O. Toward a consensus on SNP and STR mutation rates on the human Y-chromosome. Hum. Genet. 2017, 136, 575–590. [Google Scholar] [CrossRef]
  13. Calafell, F.; Larmuseau, M.H.D. The Y chromosome as the most popular marker in genetic genealogy benefits interdisciplinary research. Hum. Genet. 2017, 136, 559–573. [Google Scholar] [CrossRef]
  14. Larmuseau, M.H.D.; Ottoni, C. Mediterranean Y-chromosome 2.0—Why the Y in the Mediterranean is still relevant in the postgenomic era. Ann. Hum. Biol. 2018, 45, 20–33. [Google Scholar] [CrossRef] [PubMed]
  15. Neuhuber, F.; Dunkelmann, B.; Grießner, I.; Helm, K.; Kayser, M.; Ralf, A. Improving the differentiation of closely related males by RMplex analysis of 30 Y-STRs with high mutation rates. Forensic Sci. Int. Genet. 2022, 58, 102682. [Google Scholar] [CrossRef]
  16. Otagiri, T.; Sato, N.; Asamura, H.; Parvanova, E.; Kayser, M.; Ralf, A. RMplex reveals population differences in RM Y-STR mutation rates and provides improved father-son differentiation in Japanese. Forensic Sci. Int. Genet. 2022, 61, 102766. [Google Scholar] [CrossRef] [PubMed]
  17. Wang, Y.; Zhang, Y.; Zhang, C.; Li, R.; Yang, Y.; Ou, X.; Tong, D.; Sun, H. Genetic polymorphisms and mutation rates of 27 Y-chromosomal STRs in a Han population from Guangdong Province, Southern China. Forensic Sci. Int. Genet. 2016, 21, 5–9. [Google Scholar] [CrossRef] [PubMed]
  18. Oh, Y.N.; Lee, H.Y.; Lee, E.Y.; Kim, E.H.; Yang, W.I.; Shin, K.-J. Haplotype and mutation analysis for newly suggested Y-STRs in Korean father–son pairs. Forensic Sci. Int. Genet. 2015, 15, 64–68. [Google Scholar] [CrossRef]
  19. Ralf, A.; Montiel González, D.; Zandstra, D.; van Wersch, B.; Kousouri, N.; de Knijff, P.; Adnan, A.; Claerhout, S.; Ghanbari, M.; Larmuseau, M.H.D.; et al. Large-scale pedigree analysis highlights rapidly mutating Y-chromosomal short tandem repeats for differentiating patrilineal relatives and predicting their degrees of consanguinity. Hum. Genet. 2023, 142, 145–160. [Google Scholar] [CrossRef]
  20. Larmuseau, M.H.D.; Vanoverbeke, J.; Van Geystelen, A.; Defraene, G.; Vanderheyden, N.; Matthys, K.; Wenseleers, T.; Decorte, R. Low historical rates of cuckoldry in a Western European human population traced by Y-chromosome and genealogical data. Proc. R. Soc. B Biol. Sci. 2013, 280, 20132400. [Google Scholar] [CrossRef]
  21. Claerhout, S.; Van der Haegen, M.; Vangeel, L.; Larmuseau, M.H.D.; Decorte, R. A game of hide and seq: Identification of parallel Y-STR evolution in deep-rooting pedigrees. Eur. J. Hum. Genet. 2019, 27, 637–646. [Google Scholar] [CrossRef]
  22. Ge, J. Interpretation of Y Chromosome STRs for Missing Persons Cases; Office of Justice Programs, National Institute of Justice, Department of Justice: Washington, DC, USA, 2022. [Google Scholar]
  23. Connell, J.R.; White, T.; Zielke, T.; Mitchell, N.; Griffiths, L.R. Family Reference DNA Database for Identifying Missing Australian Service Members: Policy, Ethical, and Social Implications. Aust. J. Forensic Sci. 2025, 1–12. [Google Scholar] [CrossRef]
  24. AusVet Animal Health Services. Epi Tools—Calculate Confidence Limits for a Sample Proportion. 2016. Available online: http://epitools.ausvet.com.au/content.php?page=CIProportion (accessed on 11 May 2025).
  25. Brown, L.D.; Cai, T.T.; DasGupta, A. Interval estimation for a binomial proportion. Stat. Sci. 2001, 16, 101–133. [Google Scholar] [CrossRef]
  26. Henry, J.; Dao, H.; Scandrett, L.; Taylor, D. Population genetic analysis of Yfiler® Plus haplotype data for three South Australian populations. Forensic Sci. Int. Genet. 2019, 41, e23–e25. [Google Scholar] [CrossRef] [PubMed]
  27. Butler, J.; Decker, A.; Kline, M.; Vallone, P. Chromosomal Duplications Along the Y-Chromosome and Their Potential Impact on Y-STR Interpretation. J. Forensic Sci. 2005, 50, JFS2004481-7. [Google Scholar] [CrossRef]
  28. Jobling, M.A. Y-chromosomal SNP haplotype diversity in forensic analysis. Forensic Sci. Int. 2001, 118, 158–162. [Google Scholar] [CrossRef]
  29. Capelli, C.; Brisighelli, F.; Scarnicci, F.; Blanco-Verea, A.; Brion, M.; Pascali, V.L. Phylogenetic evidence for multiple independent duplication events at the DYS19 locus. Forensic Sci. Int. Genet. 2007, 1, 287–290. [Google Scholar] [CrossRef] [PubMed]
  30. Santos, F.R.; Gerelsaikhan, T.; Munkhtuja, B.; Oyunsuren, T.; Epplen, J.T.; Pena, S.D. Geographic differences in the allele frequencies of the human Y-linked tetranucleotide polymorphism DYS19. Hum. Genet. 1996, 97, 309–313. [Google Scholar] [CrossRef]
  31. DYS19 Variants. 2025. Available online: https://strbase.nist.gov/Human/VariantalleleTable/DYS19 (accessed on 11 September 2025).
  32. DYS448 Variants. 2025. Available online: https://strbase.nist.gov/Human/VariantalleleTable/DYS448 (accessed on 11 September 2025).
  33. Nazir, S.; Adnan, A.; Rehman, R.A.; Al-Qahtani, W.S.; Alsaleh, A.B.; Al-Harthi, H.S.; Safhi, F.A.; Almheiri, R.; Lootah, R.; Alreyami, A.; et al. Mutation Rate Analysis of RM Y-STRs in Deep-Rooted Multi-Generational Punjabi Pedigrees from Pakistan. Genes 2022, 13, 1403. [Google Scholar] [CrossRef]
  34. SWGDAM. Interpretation Guidelines for Y-Chromosome STR Typing by Forensic DNA Laboratories. 2022. Available online: https://www.swgdam.org/_files/ugd/4344b0_bc90bcfef52c43559aa28618ef87c424.pdf (accessed on 13 June 2025).
  35. Ge, J.; Eisenberg, A.; Yan, J.; Chakraborty, R.; Budowle, B. Pedigree likelihood ratio for lineage markers. Int. J. Leg. Med. 2011, 125, 519–525. [Google Scholar] [CrossRef]
  36. Ge, J.; Budowle, B.; Chakraborty, R. DNA identification by pedigree likelihood ratio accommodating population substructure and mutations. Investig. Genet. 2010, 1, 8. [Google Scholar] [CrossRef] [PubMed]
  37. Brenner, C.H. Symbolic kinship program. Genetics 1997, 145, 535–542. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Breakdown of the paternally related pairs separated by 4 to 16 meioses (n = 183).
Figure 1. Breakdown of the paternally related pairs separated by 4 to 16 meioses (n = 183).
Genes 16 01211 g001
Figure 2. Average number of observed mutational counts across 4–16 genetic transmissions (meioses). Vertical whiskers denote the observed min–max range at each meiosis count.
Figure 2. Average number of observed mutational counts across 4–16 genetic transmissions (meioses). Vertical whiskers denote the observed min–max range at each meiosis count.
Genes 16 01211 g002
Figure 3. Mutation rate estimates for Yfiler Plus loci estimated from distantly related pairs separated by 4 to 16 meioses, n = 183 pairs. Our pedigree-derived estimates were compared against father–son estimates by Neuhuber et al. [15], as well as large-scale pedigree-based estimates by Ralf et al. [19]. * p < 0.05, ** p < 0.01.
Figure 3. Mutation rate estimates for Yfiler Plus loci estimated from distantly related pairs separated by 4 to 16 meioses, n = 183 pairs. Our pedigree-derived estimates were compared against father–son estimates by Neuhuber et al. [15], as well as large-scale pedigree-based estimates by Ralf et al. [19]. * p < 0.05, ** p < 0.01.
Genes 16 01211 g003
Table 1. Inherited duplications or deletions.
Table 1. Inherited duplications or deletions.
Family IDSample 1Sample 2MeiosesLocusAllele Sample 1Allele Sample 2Comments
FS_2RS_13RS_298DYS1915, 1715, 17Duplication
mutation (gain/loss of 1 repeat), also observed at DYF387S1
FS_375RS_942RS_9618DYS448NEGNEGDeletion
mutation (duplication in one individual), also observed in DYF387S1
FS_269RS_588RS_5745DYS1915, 1615, 16Duplication
Table 2. Locus-specific mutation rates for Yfiler Plus loci obtained from distantly related pairs separated by 4 to 16 meioses. Significant values are shown in bold. Confidence intervals were calculated using the Clopper–Pearson method. p values were calculated using Fisher’s exact method.
Table 2. Locus-specific mutation rates for Yfiler Plus loci obtained from distantly related pairs separated by 4 to 16 meioses. Significant values are shown in bold. Confidence intervals were calculated using the Clopper–Pearson method. p values were calculated using Fisher’s exact method.
Current StudyNeuhuber et al. [15]Ralf et al. [19]
MarkerTotal MeiosesMutationsMutation RateLower 95% CIUpper 95% CIMutation Ratep ValueMutation Ratep Value
DYF387S11576130.0080.0040.0140.0100.5870.0081.000
DYS19157600.0000.0000.0020.0020.1020.0020.255
DYS385157660.0040.0010.0080.0080.1090.0041.000
DYS389I157600.0000.0000.0020.0020.0700.0020.130
DYS389II157610.0010.0000.0020.0060.0060.0040.077
DYS390157640.0030.0010.0070.0031.0000.0040.564
DYS391157620.0010.0000.0050.0030.5750.0040.193
DYS392157600.0000.0000.0020.0010.6110.0001.000
DYS393157610.0010.0000.0020.0020.5030.0011.000
DYS437157620.0010.0000.0050.0011.0000.0010.597
DYS438157600.0000.0000.0020.0001.0000.0010.503
DYS4391576120.0080.0040.0130.0050.1350.0040.166
DYS448157610.0010.0000.0020.0011.0000.0030.134
DYS449157660.0040.0010.0080.0110.0030.0100.016
DYS456157660.0040.0010.0080.0041.0000.0020.317
DYS4581576110.0070.0040.0130.0090.6580.0080.844
DYS460157680.0050.0020.0100.0040.6780.0060.820
DYS481157680.0050.0020.0100.0050.8420.0020.126
DYS5181576180.0110.0070.0180.0130.6360.0160.185
DYS533157650.0030.0010.0070.0041.0000.0031.000
DYS570157650.0030.0010.0070.0080.0300.0100.008
DYS5761576200.0130.0080.0200.0131.0000.0090.241
DYS6271576210.0130.0080.0200.0150.8220.0141.000
DYS635157650.0030.0010.0070.0040.8280.0031.000
YGATAH4157660.0040.0010.0080.0020.1400.0020.528
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Connell, J.R.; White, T.; Zielke, T.; Armstrong, L.; Mitchell, N.; Griffiths, L.R. Pedigree-Based Estimation of Y-STR Mutation and Male Differentiation Rates: Application to Historical Remains Identification. Genes 2025, 16, 1211. https://doi.org/10.3390/genes16101211

AMA Style

Connell JR, White T, Zielke T, Armstrong L, Mitchell N, Griffiths LR. Pedigree-Based Estimation of Y-STR Mutation and Male Differentiation Rates: Application to Historical Remains Identification. Genes. 2025; 16(10):1211. https://doi.org/10.3390/genes16101211

Chicago/Turabian Style

Connell, Jasmine R., Toni White, Thais Zielke, Luke Armstrong, Natasha Mitchell, and Lyn R. Griffiths. 2025. "Pedigree-Based Estimation of Y-STR Mutation and Male Differentiation Rates: Application to Historical Remains Identification" Genes 16, no. 10: 1211. https://doi.org/10.3390/genes16101211

APA Style

Connell, J. R., White, T., Zielke, T., Armstrong, L., Mitchell, N., & Griffiths, L. R. (2025). Pedigree-Based Estimation of Y-STR Mutation and Male Differentiation Rates: Application to Historical Remains Identification. Genes, 16(10), 1211. https://doi.org/10.3390/genes16101211

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop