DNA Mixture Deconvolution: A Four-Strategy Framework from Physical Separation to Database Searching
Abstract
1. Introduction
2. Physical and Biological Separation: Simplification at the Analytical Front End
2.1. Single-Cell and Micro-Scale Separation
2.1.1. DEPArray™ Digital Microfluidics
2.1.2. Direct Single-Cell Subsampling (DSCS)
2.1.3. Single Sperm Typing and Clustering
2.2. Single-Cell Sequencing
2.3. Applicability and Constraints
3. Novel Genetic Markers: Information Enhancement
3.1. Traditional STR Systems: Capabilities and Limitations
3.2. NGS-STR: Sequence Polymorphism in Traditional Loci
3.3. SNP-Based Markers
3.3.1. Identity-Informative SNPs (iiSNPs)
3.3.2. Forensic Investigative Genetic Genealogy (FIGG) SNPs and MixDeR Tool
3.3.3. A Prospective Framework: Bridging SNP Deconvolution with STR Databases
3.4. Microhaplotypes (MHs): Optimal Markers for Mixture Analysis
3.4.1. Definition and Core Advantages
3.4.2. Locus Design Principles and Standardization
3.5. DIP-STR: Allele-Specific Amplification Markers
3.6. Mini-Haplotypes (MiniHaps): Ultra-High Information Markers
3.7. Marker Strategy Summary
4. Probabilistic Genotyping: Algorithmic Solutions
4.1. From Qualitative to Fully Continuous Models
4.2. Mainstream PG Platforms
4.2.1. STR-Based Systems
4.2.2. Extension to Non-STR Markers
MHs
SNP Mixtures
4.3. Algorithmic Advances in PG Inference
4.3.1. Hamiltonian Monte Carlo (HMC)
4.3.2. Variational Inference (VI)
4.3.3. Deep Learning-Enabled Deconvolution
4.4. Deconvolution and LR Evaluation: Distinct Objectives
4.4.1. The Need for Standardized Deconvolution Metrics
- (i)
- Per-locus genotype error rates (e.g., MHs vs. STR comparisons in Section 3.4.1);
- (ii)
- (iii)
- LR-derived sensitivity and specificity at selected thresholds (e.g., LR > 1 for contributor detection in Section 4.2.2; LR > 106 for database searching in Section 5.1);
- (iv)
- False positive rates against known non-contributors (e.g., <0.005% in TrueAllele casework validation).
4.4.2. The Case for Deconvolution-Specific Algorithms
- (i)
- Simultaneous NOC inference—treating NOC as a model-selection problem rather than a fixed or analyst-guided input, given the substantial impact of NOC misspecification on genotype reconstruction accuracy.
- (ii)
- Multimodal posterior reporting—explicitly presenting multiple high-probability genotype solutions rather than reducing inference to a single “most probable” estimate, particularly in allele-sharing scenarios where several genotype combinations may explain the observed data nearly equally well.
- (iii)
- Optimized mixture proportion estimation—since genotype reconstruction accuracy, especially for minor contributors, is highly sensitive to mixture ratio precision, even when LR magnitude remains comparatively stable.
- (iv)
- Marker-specific noise models—differentiating between STR stutter, NGS sequencing error, and dropout patterns rather than relying on uniform noise assumptions.
- (v)
- Inter-locus dependency modeling—as demonstrated by Yu et al. (2025), modeling between-locus corrections improved genotype reconstruction accuracy by up to 30 percentage points, suggesting that locus-independence assumptions leave recoverable information unused [103].
4.5. Summary of Section 4
5. Maximizing the Application of Deconvolution Results: Evolution of Database Searching Strategies
5.1. LR-Based Direct Searching
5.2. Translating Probabilistic Outputs into Legacy Infrastructure
5.3. Probability-Weighted Similarity Approaches
5.4. Summary of Section 5
6. Conclusions and Prospects
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| Ae | Effective allele number |
| CE-STR | Capillary electrophoresis-based short tandem repeat |
| DBLR | Database Likelihood Ratio |
| DIP | Deletion/insertion polymorphism |
| DSCS | Direct single-cell subsampling |
| FIGG | Forensic investigative genetic genealogy |
| HMC | Hamiltonian Monte Carlo |
| iiSNP | Identity-informative single nucleotide polymorphism |
| LR | Likelihood ratio |
| MAP | Maximum a posteriori |
| MCMC | Markov chain Monte Carlo |
| MH | Microhaplotype |
| MiniHap | Mini-haplotype |
| MLE | Maximum likelihood estimation |
| MNP | Multi-SNP |
| NGS | Next-generation sequencing |
| NOC | Number of contributors |
| PG | Probabilistic genotyping |
| scDNA-seq | Single-cell DNA sequencing |
| SNP | Single nucleotide polymorphism |
| STR | Short tandem repeat |
| VI | Variational inference |
| WGS | Whole-genome sequencing |
References
- Weir, B.S.; Triggs, C.M.; Starling, L.; Stowell, L.I.; Walsh, K.A.; Buckleton, J. Interpreting DNA mixtures. J. Forensic Sci. 1997, 42, 213–222. [Google Scholar] [CrossRef]
- Curran, J.M.; Triggs, C.M.; Buckleton, J.; Weir, B.S. Interpreting DNA mixtures in structured populations. J. Forensic Sci. 1999, 44, 987–995. [Google Scholar] [CrossRef] [PubMed]
- Gill, P.; Benschop, C.; Buckleton, J.; Bleka, Ø.; Taylor, D. A Review of Probabilistic Genotyping Systems: EuroForMix, DNAStatistX and STRmix™. Genes 2021, 12, 1559. [Google Scholar] [CrossRef]
- Buckleton, J.S.; Bright, J.A.; Gittelson, S.; Moretti, T.R.; Onorato, A.J.; Bieber, F.R.; Budowle, B.; Taylor, D.A. The Probabilistic Genotyping Software STRmix: Utility and Evidence for its Validity. J. Forensic Sci. 2019, 64, 393–405. [Google Scholar] [CrossRef]
- NIST IR 8351; DNA Mixture Interpretation: A NIST Scientific Foundation Review. National Institute of Standards and Technology: Gaithersburg, MD, USA, 2024.
- Huffman, K.; Ballantyne, J. Single cell genomics applications in forensic science: Current state and future directions. iScience 2023, 26, 107961. [Google Scholar] [CrossRef]
- Watkins, D.R.L.; Myers, D.; Xavier, H.E.; Marciano, M.A. Revisiting single cell analysis in forensic science. Sci. Rep. 2021, 11, 7054. [Google Scholar] [CrossRef]
- Schulte, J.; Egger, S.; Kron, S.; Scheurer, E.; Schulz, I. Evaluating novel and conventional cell-separation techniques for sexual assault investigations. J. Forensic Sci. 2025, 70, 1704–1720. [Google Scholar] [CrossRef]
- Fokias, K.; Rutten, I.; Ven, K.; Lammertyn, J.; Bekaert, B. SpermFACS: Validation of a Highly Sensitive Sperm Cell Sorting Method for Sexual Assault Casework Analysis. Anal. Chem. 2026, 98, 9741–9752. [Google Scholar] [CrossRef] [PubMed]
- Huffman, K.; Ballantyne, J. Probabilistic Genotyping of Single Cell Replicates from Mixtures Involving First-Degree Relatives Prevents the False Inclusions of Non-Donor Relatives. Genes 2022, 13, 1658. [Google Scholar] [CrossRef]
- Williamson, V.R.; Laris, T.M.; Romano, R.; Marciano, M.A. Enhanced DNA mixture deconvolution of sexual offense samples using the DEPArray™ system. Forensic Sci. Int. Genet. 2018, 34, 265–276. [Google Scholar] [CrossRef] [PubMed]
- Huffman, K.; Hanson, E.; Ballantyne, J. Cell Subsampling Recovers Probative DNA Profile Information from Unresolvable/Undetectable Minor Donors in Mixtures. Genes 2022, 13, 1117. [Google Scholar] [CrossRef]
- Huffman, K.; Hanson, E.; Ballantyne, J. Recovery of single source DNA profiles from mixtures by direct single cell subsampling and simplified micromanipulation. Sci. Justice J. Forensic Sci. Soc. 2021, 61, 13–25. [Google Scholar] [CrossRef]
- Huffman, K.; Ballantyne, J. Validation of Probabilistic Genotyping Software for Single Cell STR Analysis. Genes 2023, 14, 674. [Google Scholar] [CrossRef] [PubMed]
- Theunissen, G.M.G.; Gibb, A.; Lin, P.K.T.; Rolf, B.; Forat, S.; Jäger, R. DNA profiling of single sperm cells after whole genome amplification. Forensic Sci. Int. Rep. 2021, 4, 100240. [Google Scholar] [CrossRef]
- Peters, M.; Huffman, K.; Ballantyne, J.; Hanson, E. Probabilistic genotyping replicate analysis of FaSTR clustered single sperm aSTR haplotypes reconstitutes probative diploid DNA genotypes from complex semen mixtures. J. Forensic Sci. 2025, 70, 1687–1703. [Google Scholar] [CrossRef]
- Kulhankova, L.; Bindels, E.; Kayser, M.; Mulugeta, E. Deconvoluting multi-person biological mixtures and accurate characterization and identification of separated contributors using non-targeted single-cell DNA sequencing. Forensic Sci. Int. Genet. 2024, 71, 103030. [Google Scholar] [CrossRef]
- Feng, L.; Xu, C.; Zeng, X.; Zhang, H.; Yang, F.; Li, W.; Tu, Z.; Li, C.; Hu, L. Y-chromosomal haplotyping of single sperm cells isolated from semen mixtures—A successful identification of three perpetrators in a multi-suspect sexual assault case. Croat. Med. J. 2014, 55, 537–541. [Google Scholar] [CrossRef] [PubMed]
- Hares, D.R.; Kneppers, A.; Onorato, A.J.; Kahn, S. Rapid DNA for crime scene use: Enhancements and data needed to consider use on forensic evidence for State and National DNA Databasing—An agreed position statement by ENFSI, SWGDAM and the Rapid DNA Crime Scene Technology Advancement Task Group. Forensic Sci. Int. Genet. 2020, 48, 102349. [Google Scholar] [CrossRef]
- Kader, F.; Ghai, M.; Olaniran, A.O. Characterization of DNA methylation-based markers for human body fluid identification in forensics: A critical review. Int. J. Leg. Med. 2020, 134, 1–20. [Google Scholar] [CrossRef]
- Sijen, T.; Harbison, S. On the Identification of Body Fluids and Tissues: A Crucial Link in the Investigation and Solution of Crime. Genes 2021, 12, 1728. [Google Scholar] [CrossRef]
- Just, R.S.; Irwin, J.A. Use of the LUS in sequence allele designations to facilitate probabilistic genotyping of NGS-based STR typing results. Forensic Sci. Int. Genet. 2018, 34, 197–205. [Google Scholar] [CrossRef]
- Gettings, K.B.; Aponte, R.A.; Vallone, P.M.; Butler, J.M. STR allele sequence variation: Current knowledge and future issues. Forensic Sci. Int. Genet. 2015, 18, 118–130. [Google Scholar] [CrossRef]
- Dash, H.R.; Kaitholia, K.; Kumawat, R.K.; Singh, A.K.; Shrivastava, P.; Chaubey, G.; Das, S. Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS). Sci. Rep. 2021, 11, 23238. [Google Scholar] [CrossRef] [PubMed]
- Dai, W.; Pan, Y.; Sun, X.; Wu, R.; Li, L.; Yang, D. High polymorphism detected by massively parallel sequencing of autosomal STRs using old blood samples from a Chinese Han population. Sci. Rep. 2019, 9, 18959. [Google Scholar] [CrossRef]
- Bennett, L.; Oldoni, F.; Long, K.; Cisana, S.; Madella, K.; Wootton, S.; Chang, J.; Hasegawa, R.; Lagacé, R.; Kidd, K.K.; et al. Mixture deconvolution by massively parallel sequencing of microhaplotypes. Int. J. Leg. Med. 2019, 133, 719–729. [Google Scholar] [CrossRef]
- Giuffrida, M.; Rodrigues, P.; Köksal, Z.; Jønck, C.G.; Pereira, V.; Børsting, C. Mixture Deconvolution with Massively Parallel Sequencing Data: Microhaplotypes Versus Short Tandem Repeats. Genes 2025, 16, 1105. [Google Scholar] [CrossRef]
- Tao, R.; Yang, Q.; Xia, R.; Zhang, X.; Chen, A.; Li, C.; Zhang, S. A sequence-based 163plex microhaplotype assay for forensic DNA analysis. Front. Genet. 2022, 13, 988223. [Google Scholar] [CrossRef] [PubMed]
- Zhang, J.; Mo, X.; Li, W.; Cheng, C.; Feng, Y.; Zhang, Y.; Li, S. Nanopore sequencing of MiniHap biomarkers for forensic DNA mixture deconvolution: A proof-of-principle study. Forensic Sci. Int. Genet. 2025, 78, 103272. [Google Scholar] [CrossRef]
- Oldoni, F.; Castella, V.; Hall, D. A novel set of DIP-STR markers for improved analysis of challenging DNA mixtures. Forensic Sci. Int. Genet. 2015, 19, 156–164. [Google Scholar] [CrossRef]
- Oldoni, F.; Castella, V.; Grosjean, F.; Hall, D. Sensitive DIP-STR markers for the analysis of unbalanced mixtures from “touch” DNA samples. Forensic Sci. Int. Genet. 2017, 28, 111–117. [Google Scholar] [CrossRef] [PubMed]
- Castella, V.; Gervaix, J.; Hall, D. DIP-STR: Highly sensitive markers for the analysis of unbalanced genomic mixtures. Hum. Mutat. 2013, 34, 644–654. [Google Scholar] [CrossRef] [PubMed]
- Gill, P.; Brenner, C.H.; Buckleton, J.S.; Carracedo, A.; Krawczak, M.; Mayr, W.R.; Morling, N.; Prinz, M.; Schneider, P.M.; Weir, B.S. DNA commission of the International Society of Forensic Genetics: Recommendations on the interpretation of mixtures. Forensic Sci. Int. 2006, 160, 90–101. [Google Scholar] [CrossRef]
- Buckleton, J.S.; Curran, J.M.; Gill, P. Towards understanding the effect of uncertainty in the number of contributors to DNA stains. Forensic Sci. Int. Genet. 2007, 1, 20–28. [Google Scholar] [CrossRef]
- Bright, J.A.; Evett, I.W.; Taylor, D.; Curran, J.M.; Buckleton, J. A series of recommended tests when validating probabilistic DNA profile interpretation software. Forensic Sci. Int. Genet. 2015, 14, 125–131. [Google Scholar] [CrossRef]
- Taylor, D.; Bright, J.A.; Buckleton, J. The interpretation of single source and mixed DNA profiles. Forensic Sci. Int. Genet. 2013, 7, 516–528. [Google Scholar] [CrossRef] [PubMed]
- Ballantyne, K.N.; Goedbloed, M.; Fang, R.; Schaap, O.; Lao, O.; Wollstein, A.; Choi, Y.; van Duijn, K.; Vermeulen, M.; Brauer, S.; et al. Mutability of Y-chromosomal microsatellites: Rates, characteristics, molecular bases, and forensic implications. Am. J. Hum. Genet. 2010, 87, 341–353. [Google Scholar] [CrossRef]
- Roewer, L.; Andersen, M.M.; Ballantyne, J.; Butler, J.M.; Caliebe, A.; Corach, D.; D’Amato, M.E.; Gusmão, L.; Hou, Y.; de Knijff, P.; et al. DNA commission of the International Society of Forensic Genetics (ISFG): Recommendations on the interpretation of Y-STR results in forensic analysis. Forensic Sci. Int. Genet. 2020, 48, 102308. [Google Scholar] [CrossRef] [PubMed]
- Ballantyne, K.N.; Keerl, V.; Wollstein, A.; Choi, Y.; Zuniga, S.B.; Ralf, A.; Vermeulen, M.; de Knijff, P.; Kayser, M. A new future of forensic Y-chromosome analysis: Rapidly mutating Y-STRs for differentiating male relatives and paternal lineages. Forensic Sci. Int. Genet. 2012, 6, 208–218. [Google Scholar] [CrossRef]
- Ralf, A.; Zieger, M.; Kayser, M. Considerations on expanding criminal offender DNA databases with Y-STR profiles. J. Law Biosci. 2024, 11, lsae017. [Google Scholar] [CrossRef] [PubMed]
- Holland, M.M.; McQuillan, M.R.; O’Hanlon, K.A. Second generation sequencing allows for mtDNA mixture deconvolution and high resolution detection of heteroplasmy. Croat. Med. J. 2011, 52, 299–313. [Google Scholar] [CrossRef]
- Wisner, M.; Erlich, H.; Shih, S.; Calloway, C. Resolution of mitochondrial DNA mixtures using a probe capture next generation sequencing system and phylogenetic-based software. Forensic Sci. Int. Genet. 2021, 53, 102531. [Google Scholar] [CrossRef]
- Lutz-Bonengel, S.; Parson, W. No further evidence for paternal leakage of mitochondrial DNA in humans yet. Proc. Natl. Acad. Sci. USA 2019, 116, 1821–1822. [Google Scholar] [CrossRef]
- Laricchia, K.M.; Lake, N.J.; Watts, N.A.; Shand, M.; Haessly, A.; Gauthier, L.; Benjamin, D.; Banks, E.; Soto, J.; Garimella, K.; et al. Mitochondrial DNA variation across 56,434 individuals in gnomAD. Genome Res. 2022, 32, 569–582. [Google Scholar] [CrossRef]
- Parson, W.; Huber, G.; Moreno, L.; Madel, M.B.; Brandhagen, M.D.; Nagl, S.; Xavier, C.; Eduardoff, M.; Callaghan, T.C.; Irwin, J.A. Massively parallel sequencing of complete mitochondrial genomes from hair shaft samples. Forensic Sci. Int. Genet. 2015, 15, 8–15. [Google Scholar] [CrossRef]
- Guo, F.; Yu, J.; Zhang, L.; Li, J. Massively parallel sequencing of forensic STRs and SNPs using the Illumina(®) ForenSeq™ DNA Signature Prep Kit on the MiSeq FGx™ Forensic Genomics System. Forensic Sci. Int. Genet. 2017, 31, 135–148. [Google Scholar] [CrossRef] [PubMed]
- Sharma, V.; van der Plaat, D.A.; Liu, Y.; Wurmbach, E. Analyzing degraded DNA and challenging samples using the ForenSeq™ DNA Signature Prep kit. Sci. Justice J. Forensic Sci. Soc. 2020, 60, 243–252. [Google Scholar] [CrossRef]
- Xavier, C.; Parson, W. Evaluation of the Illumina ForenSeq™ DNA Signature Prep Kit—MPS forensic application for the MiSeq FGx™ benchtop sequencer. Forensic Sci. Int. Genet. 2017, 28, 188–194. [Google Scholar] [CrossRef] [PubMed]
- King, J.L.; Churchill, J.D.; Novroski, N.M.M.; Zeng, X.; Warshauer, D.H.; Seah, L.H.; Budowle, B. Increasing the discrimination power of ancestry- and identity-informative SNP loci within the ForenSeq™ DNA Signature Prep Kit. Forensic Sci. Int. Genet. 2018, 36, 60–76. [Google Scholar] [CrossRef]
- Wendt, F.R.; Novroski, N.M. Identity informative SNP associations in the UK Biobank. Forensic Sci. Int. Genet. 2019, 42, 45–48. [Google Scholar] [CrossRef]
- Davenport, L.; Devesse, L.; Syndercombe Court, D.; Ballard, D. Forensic identity SNPs: Characterisation of flanking region variation using massively parallel sequencing. Forensic Sci. Int. Genet. 2023, 64, 102847. [Google Scholar] [CrossRef]
- Mitchell, R.; Peck, M.; Gorden, E.; Just, R. MixDeR: A SNP mixture deconvolution workflow for forensic genetic genealogy. Forensic Sci. Int. Genet. 2025, 76, 103224. [Google Scholar] [CrossRef]
- Mitchell, R.; Enke, S.; Eskey, K.; Ferguson, T.; Just, R. A method to enable forensic genetic genealogy investigations from DNA mixtures. Forensic Sci. Int. Genet. Suppl. Ser. 2022, 8, 159–161. [Google Scholar] [CrossRef]
- Antunes, J.; Walichiewicz, P.; Forouzmand, E.; Barta, R.; Didier, M.; Han, Y.; Perez, J.C.; Snedecor, J.; Zlatkov, C.; Padmabandu, G.; et al. Developmental validation of the ForenSeq® Kintelligence kit, MiSeq FGx® sequencing system and ForenSeq Universal Analysis Software. Forensic Sci. Int. Genet. 2024, 71, 103055. [Google Scholar] [CrossRef]
- Ballard, D.; Winkler-Galicki, J.; Wesoły, J. Massive parallel sequencing in forensics: Advantages, issues, technicalities, and prospects. Int. J. Leg. Med. 2020, 134, 1291–1303. [Google Scholar] [CrossRef] [PubMed]
- Tillmar, A.; Sjölund, P.; Lundqvist, B.; Klippmark, T.; Älgenäs, C.; Green, H. Whole-genome sequencing of human remains to enable genealogy DNA database searches—A case report. Forensic Sci. Int. Genet. 2020, 46, 102233. [Google Scholar] [CrossRef] [PubMed]
- Lu, J.; Liu, J.; Li, J.; Jiang, L.; Ni, M.; Zhao, W.; Dai, L.; Zhao, C.; Li, C. Forensic investigative genetic genealogy based on low-quality DNA whole genome sequencing data. Forensic Sci. Int. Genet. 2025, 82, 103417. [Google Scholar] [CrossRef]
- Cihlar, J.C.; Woerner, A.E.; King, J.L.; Hawkins, J.B.; Coble, M.D. Developmental validation of a whole genome sequencing workflow for use in a forensic laboratory. Forensic Sci. Int. Genet. 2026, 81, 103380. [Google Scholar] [CrossRef] [PubMed]
- Xie, Q.; Zhao, W.; Liu, W.; Zhao, Y.; Chen, X.; Li, J.; Fang, Z.; Hu, L.; Li, C. Forensic SNP genealogy inference using whole genome sequencing data of varying depths. Forensic Sci. Int. Genet. 2025, 79, 103296. [Google Scholar] [CrossRef] [PubMed]
- Edge, M.D.; Algee-Hewitt, B.F.B.; Pemberton, T.J.; Li, J.Z.; Rosenberg, N.A. Linkage disequilibrium matches forensic genetic records to disjoint genomic marker sets. Proc. Natl. Acad. Sci. USA 2017, 114, 5671–5676. [Google Scholar] [CrossRef] [PubMed]
- Kim, J.; Edge, M.D.; Algee-Hewitt, B.F.B.; Li, J.Z.; Rosenberg, N.A. Statistical Detection of Relatives Typed with Disjoint Forensic and Biomedical Loci. Cell 2018, 175, 848–858.e846. [Google Scholar] [CrossRef]
- Kim, J.; Rosenberg, N.A. Record-matching of STR profiles with fragmentary genomic SNP data. Eur. J. Hum. Genet. 2023, 31, 1283–1290. [Google Scholar] [CrossRef] [PubMed]
- Gjorgjieva, T.; Rosenberg, N.A. Toward minimal SNP sets for record-matching with CODIS STR profiles. Eur. J. Hum. Genet. 2025. ahead of print. [Google Scholar] [CrossRef]
- Yin, Y.; Zhang, P.; Xing, Y. A New Computational Deconvolution Algorithm for the Analysis of Forensic DNA Mixtures with SNP Markers. Genes 2022, 13, 884. [Google Scholar] [CrossRef]
- Kidd, K.K.; Pakstis, A.J.; Speed, W.C.; Lagacé, R.; Chang, J.; Wootton, S.; Haigh, E.; Kidd, J.R. Current sequencing technology makes microhaplotypes a powerful new type of genetic marker for forensics. Forensic Sci. Int. Genet. 2014, 12, 215–224. [Google Scholar] [CrossRef]
- Turchi, C.; Melchionda, F.; Gentile, F.; Marino, A.; Colloca, D.; Pesaresi, M.; Pakstis, A.J.; Kidd, K.K. Microhaplotypes in forensic genetics: From exploration to application in degraded DNA specimens. Forensic Sci. Int. Genet. 2026, 81, 103391. [Google Scholar] [CrossRef]
- Kidd, K.K.; Speed, W.C. Criteria for selecting microhaplotypes: Mixture detection and deconvolution. Investig. Genet. 2015, 6, 1. [Google Scholar] [CrossRef]
- Gandotra, N.; Speed, W.C.; Qin, W.; Tang, Y.; Pakstis, A.J.; Kidd, K.K.; Scharfe, C. Validation of novel forensic DNA markers using multiplex microhaplotype sequencing. Forensic Sci. Int. Genet. 2020, 47, 102275. [Google Scholar] [CrossRef]
- Chen, P.; Yin, C.; Li, Z.; Pu, Y.; Yu, Y.; Zhao, P.; Chen, D.; Liang, W.; Zhang, L.; Chen, F. Evaluation of the Microhaplotypes panel for DNA mixture analyses. Forensic Sci. Int. Genet. 2018, 35, 149–155. [Google Scholar] [CrossRef]
- Kidd, K.K.; Speed, W.C.; Pakstis, A.J.; Podini, D.S.; Lagacé, R.; Chang, J.; Wootton, S.; Haigh, E.; Soundararajan, U. Evaluating 130 microhaplotypes across a global set of 83 populations. Forensic Sci. Int. Genet. 2017, 29, 29–37. [Google Scholar] [CrossRef]
- Chen, A.; Li, L.; Zhou, J.; Li, T.; Yuan, C.; Peng, H.; Li, C.; Zhang, S. Human complex mixture analysis by “FD Multi-SNP Mixture Kit”. Front. Genet. 2024, 15, 1432378. [Google Scholar] [CrossRef] [PubMed]
- Chen, J.; Chen, A.; Tao, R.; Zhu, R.; Zhang, H.; You, X.; Li, C.; Zhang, S. Solution to a case involving the interpretation of trace degraded DNA mixtures. Int. J. Leg. Med. 2024, 138, 2325–2330. [Google Scholar] [CrossRef] [PubMed]
- Podini, D.; Standage, D.S.; Phillips, C.; de la Puente, M.; Børsting, C.; Pereira, V.; Davenport, L.; Ballard, D.; Cavanaugh, S.E.; Young, B.; et al. Defining key criteria for microhaplotype locus selection in forensic genetics: Progress and recommendations by the Microhaplotype Working Group. Forensic Sci. Int. Genet. 2026, 83, 103421. [Google Scholar] [CrossRef]
- Liu, J.; Li, S.; Su, Y.; Wen, Y.; Qin, L.; Zhao, M.; Hui, M.; Jiang, L.; Chen, X.; Hou, Y.; et al. A proof-of-principle study: The potential application of MiniHap biomarkers in ancestry inference based on the QNome nanopore sequencing. Forensic Sci. Int. Genet. 2024, 68, 102947. [Google Scholar] [CrossRef] [PubMed]
- Haned, H.; Slooten, K.; Gill, P. Exploratory data analysis for the interpretation of low template DNA mixtures. Forensic Sci. Int. Genet. 2012, 6, 762–774. [Google Scholar] [CrossRef] [PubMed]
- Gill, P.; Haned, H. A new methodological framework to interpret complex DNA profiles using likelihood ratios. Forensic Sci. Int. Genet. 2013, 7, 251–263. [Google Scholar] [CrossRef]
- Schmidt, M.; Schiller, R.; Anslinger, K.; Wiegand, P.; Weirich, V. Statistefix 4.0: A novel probabilistic software tool. Forensic Sci. Int. Genet. 2021, 55, 102570. [Google Scholar] [CrossRef]
- Perlin, M.W.; Legler, M.M.; Spencer, C.E.; Smith, J.L.; Allan, W.P.; Belrose, J.L.; Duceman, B.W. Validating TrueAllele® DNA mixture interpretation. J. Forensic Sci. 2011, 56, 1430–1447. [Google Scholar] [CrossRef]
- Bright, J.A.; Richards, R.; Kruijver, M.; Kelly, H.; McGovern, C.; Magee, A.; McWhorter, A.; Ciecko, A.; Peck, B.; Baumgartner, C.; et al. Internal validation of STRmix™—A multi laboratory response to PCAST. Forensic Sci. Int. Genet. 2018, 34, 11–24. [Google Scholar] [CrossRef]
- Cheng, K.; Bleka, Ø.; Gill, P.; Curran, J.; Bright, J.A.; Taylor, D.; Buckleton, J. A comparison of likelihood ratios obtained from EuroForMix and STRmix™. J. Forensic Sci. 2021, 66, 2138–2155. [Google Scholar] [CrossRef]
- Perlin, M.W.; Belrose, J.L.; Duceman, B.W. New York State TrueAllele® casework validation study. J. Forensic Sci. 2013, 58, 1458–1466. [Google Scholar] [CrossRef] [PubMed]
- Perlin, M.W.; Dormer, K.; Hornyak, J.; Schiermeier-Wood, L.; Greenspoon, S. TrueAllele casework on Virginia DNA mixture evidence: Computer and manual interpretation in 72 reported criminal cases. PLoS ONE 2014, 9, e92837. [Google Scholar] [CrossRef]
- Greenspoon, S.A.; Schiermeier-Wood, L.; Jenkins, B.C. Establishing the Limits of TrueAllele® Casework: A Validation Study. J. Forensic Sci. 2015, 60, 1263–1276. [Google Scholar] [CrossRef]
- Bauer, D.W.; Butt, N.; Hornyak, J.M.; Perlin, M.W. Validating TrueAllele® Interpretation of DNA Mixtures Containing up to Ten Unknown Contributors. J. Forensic Sci. 2020, 65, 380–398. [Google Scholar] [CrossRef]
- Perlin, M.W.; Hornyak, J.M.; Sugimoto, G.; Miller, K.W. TrueAllele® Genotype Identification on DNA Mixtures Containing up to Five Unknown Contributors. J. Forensic Sci. 2015, 60, 857–868. [Google Scholar] [CrossRef]
- Kelly, H.; Kerr, Z.; Cheng, K.; Kruijver, M.; Bright, J.-A. Developmental validation of a software implementation of a flexible framework for the assignment of likelihood ratios for forensic investigations. Forensic Sci. Int. Rep. 2021, 4, 100231. [Google Scholar] [CrossRef]
- Bleka, Ø.; Prieto, L.; Gill, P. CaseSolver: An investigative open source expert system based on EuroForMix. Forensic Sci. Int. Genet. 2019, 41, 83–92. [Google Scholar] [CrossRef]
- Hoogenboom, J.; Sijen, T.; Benschop, C. ProbRank: An efficient DNA database search method for complex mixtures per a quantitative likelihood ratio model. Forensic Sci. Int. Genet. 2023, 65, 102884. [Google Scholar] [CrossRef]
- Cheng, K.; Bright, J.A.; Kelly, H.; Liu, Y.Y.; Lin, M.H.; Kruijver, M.; Taylor, D.; Buckleton, J. Developmental validation of STRmix™ NGS, a probabilistic genotyping tool for the interpretation of autosomal STRs from forensic profiles generated using NGS. Forensic Sci. Int. Genet. 2023, 62, 102804. [Google Scholar] [CrossRef] [PubMed]
- Bleka, Ø.; Prieto, L.; Gill, P. EFMrep: An extension of EuroForMix for improved combination of STR DNA mixture profiles. Forensic Sci. Int. Genet. 2022, 61, 102771. [Google Scholar] [CrossRef] [PubMed]
- Probabilistic Genotyping Systems for Low-Quality and Mixture Forensic Samples. Available online: https://www.fjc.gov/content/361268/probabilistic-genotyping-systems-low-quality-and-mixture-forensic-samples (accessed on 3 April 2026).
- State of New Jersey v. Corey Pickett. 2021. Available online: https://law.justia.com/cases/new-jersey/appellate-division-published/2021/a4207-19.html (accessed on 3 April 2026).
- Bleka, Ø.; Storvik, G.; Gill, P. EuroForMix: An open source software based on a continuous model to evaluate STR DNA profiles from a mixture of contributors with artefacts. Forensic Sci. Int. Genet. 2016, 21, 35–44. [Google Scholar] [CrossRef]
- President’s Council of Advisors on Science and Technology (PCAST). Forensic Science in Criminal Courts: Ensuring Scientific Validity of Feature-Comparison Methods; Executive Office of the President: Washington, DC, USA, 2016.
- Butler, J.M.; Kline, M.C.; Coble, M.D. NIST interlaboratory studies involving DNA mixtures (MIX05 and MIX13): Variation observed and lessons learned. Forensic Sci. Int. Genet. 2018, 37, 81–94. [Google Scholar] [CrossRef]
- Wang, Y.; Hou, T.; Zhu, Q.; Hu, Y.; Wang, H.; Wei, Y.; Wang, Y.; Zhang, J. A continuous model for interpreting microhaplotype profiles of forensic DNA mixtures. Forensic Sci. Int. Genet. 2025, 78, 103271. [Google Scholar] [CrossRef]
- González-Bao, J.; Mosquera-Miguel, A.; Casanova-Adán, L.; Ambroa-Conde, A.; Ruiz-Ramírez, J.; Cabrejas-Olalla, A.; Boullón-Cassau, M.; Freire-Aradas, A.; Rodríguez-López, A.; Roth, C.; et al. Performance comparison of a previously validated microhaplotype panel and a forensic STR panel for DNA mixture analysis. Forensic Sci. Int. Genet. 2025, 74, 103144. [Google Scholar] [CrossRef] [PubMed]
- Bleka, Ø.; Eduardoff, M.; Santos, C.; Phillips, C.; Parson, W.; Gill, P. Open source software EuroForMix can be used to analyse complex SNP mixtures. Forensic Sci. Int. Genet. 2017, 31, 105–110. [Google Scholar] [CrossRef] [PubMed]
- Hwa, H.L.; Wu, M.Y.; Chung, W.C.; Ko, T.M.; Lin, C.P.; Yin, H.I.; Lee, T.T.; Lee, J.C. Massively parallel sequencing analysis of nondegraded and degraded DNA mixtures using the ForenSeq™ system in combination with EuroForMix software. Int. J. Leg. Med. 2019, 133, 25–37. [Google Scholar] [CrossRef]
- Glynn, C.L. Bridging Disciplines to Form a New One: The Emergence of Forensic Genetic Genealogy. Genes 2022, 13, 1381. [Google Scholar] [CrossRef]
- Susik, M.; Schönborn, H.; Sbalzarini, I.F. Hamiltonian Monte Carlo with strict convergence criteria reduces run-to-run variability in forensic DNA mixture deconvolution. Forensic Sci. Int. Genet. 2022, 60, 102744. [Google Scholar] [CrossRef]
- Susik, M.; Sbalzarini, I.F. Variational inference accelerates accurate DNA mixture deconvolution. Forensic Sci. Int. Genet. 2023, 65, 102890. [Google Scholar] [CrossRef]
- Yu, S.; Mao, Z.; Yang, X.; Xu, Z.; Yang, F.; Zhao, X.; Zeng, L. Intelligent deconvolution algorithm for mixed STR profiles based on locus association modeling. Int. J. Leg. Med. 2025. ahead of print. [Google Scholar] [CrossRef] [PubMed]
- Taylor, D.; Buckleton, J. Combining artificial neural network classification with fully continuous probabilistic genotyping to remove the need for an analytical threshold and electropherogram reading. Forensic Sci. Int. Genet. 2023, 62, 102787. [Google Scholar] [CrossRef]
- Thompson, W.C. Uncertainty in probabilistic genotyping of low template DNA: A case study comparing STRMix™ and TrueAllele™. J. Forensic Sci. 2023, 68, 1049–1063. [Google Scholar] [CrossRef]
- Scientific Working Group on DNA Analysis Methods (SWGDAM). Guidelines for the Validation of Probabilistic Genotyping Systems; Scientific Working Group on DNA Analysis Methods (SWGDAM): Quantico, VA, USA, 2015. [Google Scholar]
- Coble, M.D.; Buckleton, J.; Butler, J.M.; Egeland, T.; Fimmers, R.; Gill, P.; Gusmão, L.; Guttman, B.; Krawczak, M.; Morling, N.; et al. DNA Commission of the International Society of Forensic Genetics: Recommendations on the validation of software programs performing biostatistical calculations for forensic genetics applications. Forensic Sci. Int. Genet. 2016, 25, 191–197. [Google Scholar] [CrossRef]
- Nozownik, S.; Hicks, T.; Basset, P.; Castella, V. Searching national DNA databases with complex DNA profiles: An empirical study using probabilistic genotyping. Forensic Sci. Int. Genet. 2025, 76, 103208. [Google Scholar] [CrossRef] [PubMed]
- Taylor, D.; Bright, J.A.; Scandrett, L.; Abarno, D.; Lee, S.I.; Wivell, R.; Kelly, H.; Buckleton, J. Validation of a top-down DNA profile analysis for database searching using a fully continuous probabilistic genotyping model. Forensic Sci. Int. Genet. 2021, 52, 102479. [Google Scholar] [CrossRef] [PubMed]
- Benschop, C.C.G.; van de Merwe, L.; de Jong, J.; Vanvooren, V.; Kempenaers, M.; Kees van der Beek, C.P.; Barni, F.; Reyes, E.L.; Moulin, L.; Pene, L.; et al. Validation of SmartRank: A likelihood ratio software for searching national DNA databases with complex DNA profiles. Forensic Sci. Int. Genet. 2017, 29, 145–153. [Google Scholar] [CrossRef] [PubMed]
- Benschop, C.C.G.; Slagter, M.; Grol, L.J.W.; Hovers, P.; Nagel, J.H.A.; Smit, S.; Duijs, F.E.; Kneppers, A.L.J. Fast DNA reports for investigative leads in casework practice: An automated workflow for mixture analysis using database searching based on probabilistic genotyping. Forensic Sci. Int. Synerg. 2025, 11, 100632. [Google Scholar] [CrossRef]


| Marker Type | Core Characteristics | Deconvolution Advantages | Limitations/Challenges | Validated Mixture Complexity | Optimal Application Scenarios | Representative Technology/Panel |
|---|---|---|---|---|---|---|
| CE-STR | Length polymorphism; 10–30 core loci; long amplicons (>200 bp) | Mature and standardized; global CODIS databases; established PG software support | Severe stutter artifacts; amplification imbalance; sensitive to degradation and low-template DNA; allele sharing limits deconvolution in balanced mixtures | Routine: 2-person; challenging: 3–4 person | Routine single-source or simple two-person mixtures; mandatory for CODIS searches | PowerPlex® Fusion; GlobalFiler; Investigator® 24plex QS Kit |
| NGS-STR | Sequence-level polymorphism at STR loci; detects 23–30% more alleles than CE-STR [22,23,24,25] | Higher discrimination than CE-STR; distinguishes length-identical alleles by internal sequence; short amplicons (<150 bp) improve degraded DNA performance | Sequence stutter persists; NGS-specific artifacts such as intra-locus noise, and length-dependent amplification imbalance, in addition to the higher computational demands higher cost and more complex data analysis than CE-STR | 2–3 person (improved minor-contributor detection at <5% ratio) | Cases requiring higher discrimination; complement to CE-STR in complex mixtures | ForenSeq™ DNA Signature Prep Kit; Precision ID GlobalFiler™ NGS STR Panel v2; PowerSeq® 46GY System |
| Microhaplotypes (MHs) | 2–6 tightly linked SNPs; short amplicon (<300 bp); length-invariant alleles; no stutter | 96.4% contributor-specific alleles vs. 51.3% for CE-STR [26]; excellent heterozygote balance; degradation-resistant; 4–5× lower genotype error rates than STR deconvolution [27] | No global standard panels yet; per-locus Ae may be lower than STRs; requires NGS infrastructure and specialized PG models | 2–5 person (163-plex panel) [28] | Complex (≥3 person) or balanced mixtures; moderately degraded or low-template DNA | Ion AmpliSeq™ MH-74 Plex; custom panels (87-, 124-, 163-plex) |
| Mini-haplotype (MiniHap) | Haplotypes with ≥5 SNPs; requires long-read sequencing for accurate phasing | Ultra-high polymorphism (average Ae = 10.96 vs. 3–5 for standard MHs); minor-contributor detection at 1:39; combined match probability 4.45 × 10−31 [29] | Proof-of-concept stage; requires nanopore or other long-read platforms; cross-population validation needed | 2-person (1:39); 3-person (1:8:1) [29] | Future ultra-complex mixture analysis requiring maximum per-locus information | Research panels (22-MiniHap panel via nanopore sequencing) |
| DIP-STR | Composite: DIP + adjacent STR; allele-specific amplification | Exceptional sensitivity for extremely unbalanced 2-person mixtures (up to 1:1000); selective amplification of 0.03–0.1 ng minor DNA [30,31] | Limited to 2-person mixtures; STR stutter effects persist; requires specialized primer design | 2-person (up to 1:1000 ratio) [32] | Sexual assault cases (sperm/epithelial); trace contributors in touch DNA | Validated panels (10-plex, 23-plex) |
| Feature | STRmix™ | EuroForMix | TrueAllele™ | Statistefix 4.0 |
|---|---|---|---|---|
| Statistical framework | Bayesian MCMC (Metropolis–Hastings); log-normal peak height model [4] | Maximum likelihood estimation; optional Bayesian mode; gamma peak height model [77] | Bayesian MCMC; hierarchical continuous model [78] | Automated MLE; continuous model |
| License | Commercial (closed-source) | Open-source (R package) | Commercial (closed-source) | Free |
| Key validation Studies | 31 labs, 2825 mixtures [79]; LR agreement with EuroForMix [80] | LRs within 1 order of magnitude of STRmix™ [80]; widely adopted across European labs | 368 casework items (NYSP) [81]; 72 Virginia cases [82]; independent Virginia DFS validation [83] | 3 labs; 2626 references + 7662 casework samples [77] |
| NOC | Routinely 2–4 (versions ≤v2.10); v2.11+ routinely supports 5-person mixtures; v2.6 supports NOC-as-nuisance for two consecutive values [3] | ≤4 unknown contributors; runtime increases substantially beyond 3 | Up to 10 unknown contributors on laboratory-prepared mixtures [84] | Primarily validated for major-contributor identification |
| NOC handling | Analyst-specified; NOC-as-nuisance option [3] | Analyst-specified | Empirically estimated from data; analyst can override [84] | Analyst-specified |
| Deconvolution output | Posterior genotype distributions; MAP genotype for DBLR™ searching | Posterior genotype probabilities per contributor | Reference-free probabilistic genotypes stored in built-in database [81,85] | Automated major-contributor calls |
| Database searching | Via DBLR™ [86] | Via CaseSolver [87] | Built-in TrueAllele Database; automated direct and familial searching [85] | Via ProbRank [88] |
| NGS compatibility | Yes; NGS version with sequence-based stutter models validated [89] | Yes; processes MHs and SNP read-count data; stutter modeling disabled for MH loci | Not explicitly validated for NGS in published literature | Not reported |
| Strengths | Broad court acceptance; large-scale multi-lab validation; NGS version available; NOC-as-nuisance feature | Open-source transparency; EFMrep extension for joint kit analysis and kinship [90]; active community development | Fully automated (no analyst thresholds); reference-free genotype separation; built-in database searching; WTC disaster identification | Free access; automated batch screening; rapid triage of large sample volumes |
| Limitations | Proprietary codebase limits independent scrutiny; source-code review recommended by Federal Judicial Center [91] | Runtime ceiling for >4 unknowns; less extensive court acceptance history | Proprietary codebase; most validations by developer; source code access contested in courts [92] | Higher allele uncertainty vs. established platforms [77]; limited validation scope |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Zhu, Q.; Mao, Z.; Zhang, J. DNA Mixture Deconvolution: A Four-Strategy Framework from Physical Separation to Database Searching. Genes 2026, 17, 434. https://doi.org/10.3390/genes17040434
Zhu Q, Mao Z, Zhang J. DNA Mixture Deconvolution: A Four-Strategy Framework from Physical Separation to Database Searching. Genes. 2026; 17(4):434. https://doi.org/10.3390/genes17040434
Chicago/Turabian StyleZhu, Qiang, Zhigang Mao, and Ji Zhang. 2026. "DNA Mixture Deconvolution: A Four-Strategy Framework from Physical Separation to Database Searching" Genes 17, no. 4: 434. https://doi.org/10.3390/genes17040434
APA StyleZhu, Q., Mao, Z., & Zhang, J. (2026). DNA Mixture Deconvolution: A Four-Strategy Framework from Physical Separation to Database Searching. Genes, 17(4), 434. https://doi.org/10.3390/genes17040434

