Abstract
Generating statistically faithful short-duration gamma-ray spectra from a single long measurement is essential in nuclear safeguards, supporting tasks such as algorithm development and machine-learning applications, especially when list-mode data are unavailable. Existing subsampling methods often distort the statistical characteristics of genuine short-duration measurements, leading to biased or unreliable analytical outcomes and thereby undermining downstream tasks. In this work, we compare five subsampling approaches using a benchmark set of 156 genuine replicate spectra collected with a high-purity germanium detector. We evaluate each method with respect to run-to-run variance, channel-to-channel variance, and preservation of total counts (losslessness). Across a wide range of subsampling ratios, only binomial subsampling without replacement consistently reproduces the statistical properties of genuine short-duration spectra, maintaining proper dispersion even in sparse spectral regions and perfectly preserving total counts. These results provide a mathematically principled and practically validated framework for generating synthetically shortened spectra when true short-duration measurements are unavailable.
1. Introduction
1.1. Background and Motivation
Gamma rays emitted from spontaneous nuclear decay exhibit an energy and intensity spectrum that is characteristic of the decaying nuclear species. These spectra are invaluable for the identification and quantification of radioactive materials and serve as a critical tool across a range of applications. In particular, they are essential in the fields of nuclear safeguards, arms control, and nuclear emergency response, which are primary focus areas for the authors.
A gamma-ray spectrum is typically represented as a histogram that records the number of detected gamma-ray events within discrete energy bins. Common detector materials used to acquire such spectra include Sodium Iodide (NaI), Lanthanum Bromide (LaBr3), Cadmium Zinc Telluride (CZT), and High-Purity Germanium (HPGe). A foundational statistical assumption is that the count in each energy bin follows an independent Poisson distribution, based on the premise that gamma-ray arrivals are random, independent events occurring at a constant average rate. Under this assumption, the statistical conclusions can be generalized across different detector types. For the purposes of this study, we focus exclusively on spectra acquired with HPGe detectors.
In the analysis of gamma-ray spectra, longer counting times are generally desirable to improve statistical precision. When multiple spectra are collected under similar conditions, they can be aggregated to form a composite spectrum with Poisson characteristics equivalent to a single long-duration measurement since the sum of independent Poisson random variables is itself Poisson-distributed. In some situations, however, shorter counting times are preferred or even required. This means subdividing a parent spectrum into shorter duration ‘child’ spectra. There are numerous motivations for generating such subsampled spectra, including: evaluating peak-fitting algorithms in low-count regimes, verifying limit-of-detection (LOD) calculations, testing isotope identification accuracy in Radiation Isotope Identification Devices (RIIDs), generating synthetic source combinations with varying relative intensities, or aligning count times in training datasets. These short-run subsampled spectra, defined to be subsets of the original spectrum where the data acquisition period is shorter than that of the original spectrum, should have statistical properties consistent with what one would expect from genuine shorter-count measurements, both individually and when considered as a set. If the individual gamma-ray events are time-stamped (i.e., recorded in list-mode), this division is straightforward. However, in many practical scenarios, only a long-duration histogram is available, necessitating subsampling techniques to generate synthetic short-duration replicates. In practice, a variety of subsampling algorithms are employed, some of which result in child spectra that are either overdispersed or underdispersed relative to true Poisson behavior.
In this work, we evaluate several candidate subsampling algorithms against a benchmark set of genuine replicate spectra. We identify one method—binomial subsampling without replacement—that satisfies all our statistical criteria and is mathematically well-founded. Our findings and recommendations are presented in the sections that follow.
1.2. Prior Application Examples
Numerous fields can leverage synthetically generated, short-run spectra whenever true list-mode records are unavailable. Examples include the Swift Burst Alert Telescope (BAT) histograms available through the batsurvey package [1] and the XRF dataset of Rosales et al., which reports only energy-channel versus count histograms [2,3], among many others. Our primary interest, however, lies in gamma-ray spectroscopy applications such as those described below.
A U.S. Department of Homeland Security initiative known as the Algorithm Improvement Project (AIP) involved collaboration among several national laboratories, including Los Alamos, Sandia, Brookhaven, and Pacific Northwest National Laboratories [4]. The project worked with the commercial sector to improve the radionuclide identification performance of commercial RIIDs by supplying an extensive library of spectra, including special nuclear material (SNM) examples not readily accessible to commercial entities, and to score performance against consistent criteria. Synthetic child spectra were generated from these datasets using Poisson resampling applied to experimentally measured, deconvolved, regenerated, and smoothed “parent” spectra.
The Gamma Detector Response and Analysis Software (GADRAS) Version 19.5.2, which also played a role in the AIP effort, allows users to generate fully synthetic gamma spectra derived from nuclear data tables and user-specified detector response functions [5,6]. Users may elect to apply Poisson resampling to generate child spectra with proper variance from a synthetic parent, which intrinsically has zero variance. So long as the nuclear data are used appropriately, this is a correct application of Poisson resampling. The fidelity of the resulting spectra depends on the accuracy of the underlying nuclear data, detector response modeling, and radiation transport calculations. More recently, GADRAS has been employed to generate millions of fully synthetic spectra for training machine learning models, including multilayer perceptrons (MLPs), convolutional neural networks (CNNs), Transformers, and long short-term memory networks (LSTMs) [7]. Machine learning tools are particularly sensitive to data quality, necessitating mathematically well-founded subsampling techniques. Another Sandia-developed toolkit, PyRIID, offers functionality for generating synthetic gamma-ray spectra to support radionuclide identification and machine learning studies [8]. PyRIID constructs spectra from template components and allows Poisson noise to be applied to emulate counting statistics. However, like GADRAS, it relies on forward modeling and noise injection rather than subsampling experimentally measured spectra.
Monte Carlo simulation is widely used to generate purely synthetic gamma-ray spectra, but naïve simulated spectra should not be assumed to be interchangeable with experimental measurements. As demonstrated by Kwon et al., standard Geant4 outputs differ substantially from real HPGe spectra in peak heights, high-energy summing regions, and overall spectral shape [9,10]. These discrepancies arise because uncorrected Monte Carlo spectra omit essential detector-physics effects, including dead time, charge-collection dynamics, pulse pile-up, and coincidence summing, that must be modeled explicitly to reproduce experimentally observed behavior. Quantitative comparisons reveal the severity of this gap, which may be greatly reduced through appropriate post-processing, but the result is still not a perfect substitute for real measurement.
Lalor et al. examined the consequences of this simulation–experiment gap for machine learning models used in radioisotope identification. They considered both simulation-to-simulation and simulation-to-experimental domain adaptation scenarios and showed that training a classifier purely on synthetic spectra can lead to substantial misclassification risk when applied to experimental data because of differences in the underlying data distributions, which they describe as the “sim-to-real gap” [7]. Their results indicate that, while purely synthetic spectra are valuable for pretraining, incorporating even limited genuine experimental spectra remains essential for achieving high performance.
The Replicative Assessment of Spectroscopic Equipment (RASE) software Version 2.0, developed for RIID performance validation, utilizes Poisson resampling by default to produce synthetic child spectra from experimentally measured parent spectra [11]. RASE also supports alternative sampling methods, including rejection sampling and inverse transform sampling, which users may select. The motivation for the RASE software and the International Atomic Energy Agency (IAEA)-sponsored study of the same name is discussed in [12]; it is notable that this document specifies selecting random numbers from a “density probability distribution corresponding to the normalized base spectrum,” implying that only sampling with replacement is considered (we believe that sampling without replacement is essential to meet our proposed criteria). A validation study by Flynn et al. [13] demonstrated that the Poisson-resampled spectra were statistically equivalent to experimental replicates only when the simulated acquisition time did not exceed 4% of the parent spectrum’s acquisition time. Exceeding this threshold was shown to introduce statistically significant deviations similar to what we observe in the present work. The three subsampling methods currently implemented in RASE are suitable for the majority of its application space, but binomial subsampling is under consideration as an additional option.
The Fixed-energy Response-function Analysis with Multiple efficiencies (FRAM) software Version 6.1 [14] includes a lesser-known feature allowing simulated spectra to be generated via Poisson subsampling of a measured parent spectrum. Although this feature is not integral to FRAM’s core isotopic analysis functionality, it provides a capability analogous to RASE. A future release of FRAM is expected to include support for binomial subsampling as an additional option.
Burr et al. explore Poisson sampling to augment genuine experimental spectra with synthetic spectra for RIID algorithm testing, demonstrating the adequacy of the technique for radioisotope detection and discussing the bias-variance tradeoff [15], although sparse statistical regions are barely discussed. Burr & Hamada [16] note that formal validation studies comparing synthetic vs. real spectra are lacking and outline some of the difficulties with synthetic spectra.
1.3. Limitations and Scope of Applicability
Our framework and validation tests assume that each channel of the parent spectrum follows an independent Poisson distribution. This assumption holds for raw count data but not for background-subtracted spectra, which deviate from Poisson statistics and may contain negative channel counts. In such cases, both binomial and Poisson subsampling will fail to pass some or all of our tests. Many RASE workflows, for example, operate on background-subtracted parent spectra and require a different set of tools. Additionally, methods involving smoothing or deconvolution to reduce statistical noise can be appropriate in specific contexts but produce parent spectra that are non-Poisson. Applying binomial subsampling to such data yields underdispersed child spectra and is thus invalid.
In genuine replicate measurements of radioactive sources, spectral changes occur over time due to decay and ingrowth of radionuclides. Here, we restrict consideration to conditions where such time-dependent effects are negligible; modeling or reproducing these temporal behaviors lies beyond the present scope. Spectra can also exhibit time-dependent gain drift (energy calibration shifts). While the genuine replicate spectra used here showed negligible drift, our simulated child spectra will, by construction, exhibit zero gain drift, which is an idealization. Radionuclide decay and detector drift, which may be relevant in longer measurement campaigns, are identified as topics for future investigation. Additionally, our evaluation metrics are based on empirical distributions across child spectra and do not assume independence between synthetic runs; however, downstream users should be aware that because we require sampling without replacement, synthetic children derived from a single parent are not strictly independent replicates.
Finally, we did not investigate the effect of adjacent-channel correlations on our results. Because detectors have finite energy resolution, neighboring channels are not strictly independent, which becomes more problematic when gain drift occurs. A model of adjacent-channel correlations could be built by cross-examining binned spectra with list-mode counterparts; however, list-mode data was unavailable for this study, so we leave it as a future exercise.
2. Materials and Methods
2.1. Notation and Criteria
In this section we present methods to generate synthetic child spectra. The required notation is presented here. Let $x_i$ be the number of counts in channel $i$ of a parent spectrum, with $\lambda_i$ the expected counts per unit of time and $T$ the total measurement time. $y_i$ will refer to channel $i$ of a child spectrum generated with desired measurement time $t$. When synthetically generating more than one child spectrum, we will add an additional index $j$ indicating the synthetic run number, such that $y_{ij}$ represents counts in the $i$th channel of the $j$th child spectrum. The letter $m$ will denote the total number of synthetic runs, such that $1 \le j \le m$.
The choice of spectral subsampling method depends on the statistical and operational requirements. In our case, the method must be unbiased and statistically robust in sparse data regions, as a primary use case is to evaluate the accuracy of algorithms used for limit-of-detection (LOD) estimation and peak area uncertainty analysis under low-count conditions. Furthermore, we require that the method should be unbiased across the entire range of child-to-parent count time ratios. To this end, the following criteria are defined:
- Run-to-Run Variance: For a given channel, the variation observed across multiple child spectra should be consistent with that expected from independent experimental replicates. This criterion aligns with key factor (2) from Flynn et al. [13].
- Channel-to-Channel Variance: Within a single child spectrum, variation between adjacent channels (“fuzziness”) should reflect the statistical behavior expected from a truly Poisson-distributed signal.
- Losslessness: The synthetic children should collectively partition the parent spectrum so that the total number of counts in each channel is preserved. All parent counts must be allocated exactly once, with no duplication or omission. Because reusing parent counts violates this requirement and can introduce undesired dependence among children, methods that sample with replacement are unsuitable for strictly lossless applications. We acknowledge that in some contexts, such as generating large sets of synthetic replicates where strict unbiasedness is not essential, the losslessness requirement may be reasonably relaxed. However, this is not appropriate for our use case.
2.2. Method 1A: Poisson Sampling
Choose

$$y_i \sim \mathrm{Poisson}(\hat{\lambda}_i t), \qquad (1)$$

where $\hat{\lambda}_i$ is estimated from the observed number of counts in the parent spectrum as $\hat{\lambda}_i = x_i/T$. This estimate will generally deviate from the true parameter $\lambda_i$, as the estimator has its own variance $\mathrm{Var}(\hat{\lambda}_i) = \lambda_i/T$. This could make the child spectra overdispersed, particularly in sparse regions of the spectrum, but this effect may not be noticeable when $t$ is much smaller than $T$.
Raikov’s theorem, which states that if a Poisson random variable admits a decomposition as the sum of independent random variables, the constituents must themselves be Poisson-distributed, is the mathematical motivation for the application of Method 1A [17]. This method appears to be the most popular method to generate synthetic spectra. It is used by multiple software packages as discussed above.
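As a concrete sketch (not the exact implementation used by any of the packages discussed above), Method 1A can be written in a few lines of NumPy; the function name and toy parent spectrum are purely illustrative:

```python
import numpy as np

rng = np.random.default_rng(42)

def poisson_child(parent_counts, T_parent, t_child):
    """Method 1A sketch: each channel's rate is estimated as x_i / T,
    and a child count is drawn as Poisson(rate * t)."""
    lam_hat = np.asarray(parent_counts, dtype=float) / T_parent
    return rng.poisson(lam_hat * t_child)

# Toy 5-channel parent collected for 3600 s; draw one 300 s child.
parent = np.array([12000, 300, 40, 5, 0])
child = poisson_child(parent, T_parent=3600.0, t_child=300.0)
```

Note that a channel with zero parent counts can never produce child counts under this scheme, since its estimated rate is exactly zero.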
2.3. Method 1B: Variance-Corrected Poisson Sampling
To correct for channel-to-channel overdispersion from Method 1A, we introduce a variance correction by finding a transformation that makes $\mathrm{Var}(y_i) = \lambda_i t$, as would be expected for a properly dispersed spectrum. Choose

$$y_i \sim \mathrm{Poisson}(\hat{\lambda}_i t'). \qquad (2)$$

By construction, $y_i$ is the sum of two correlated random variables, the Poisson fluctuation about $\hat{\lambda}_i t'$ and the random mean $\hat{\lambda}_i t'$ itself, so we can compose the variance as

$$\mathrm{Var}(y_i) = \lambda_i t' + (t')^2\,\mathrm{Var}(\hat{\lambda}_i) = \lambda_i t' + \frac{\lambda_i (t')^2}{T}. \qquad (3)$$

A complete derivation is given in Appendix A.1. After setting (3) equal to $\lambda_i t$, we find the required transformation (to first order in $t/T$):

$$t' = t\left(1 - \frac{t}{T}\right). \qquad (4)$$

The adjusted time, $t'$, will naturally be less than $t$. For example, if the parent spectrum was collected for 1 h but 300 s is desired, then $t'$ would be 275 s.
While this method should correct for channel-to-channel overdispersion, it may also produce run-to-run underdispersion when generating a set of child spectra. Additionally, the rounding necessary to produce integer output introduces a new source of variance that may be noticeable in low-count regions of the spectrum.
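One way to realize Method 1B, assuming the first-order adjustment $t' = t(1 - t/T)$ recovered above (function names and toy values are ours, not the paper's implementation):

```python
import numpy as np

rng = np.random.default_rng(7)

def adjusted_time(t_child, T_parent):
    """Variance-corrected acquisition time, t' = t * (1 - t/T)."""
    return t_child * (1.0 - t_child / T_parent)

def poisson_child_corrected(parent_counts, T_parent, t_child):
    """Method 1B sketch: Poisson sampling at the shortened time t'."""
    t_prime = adjusted_time(t_child, T_parent)
    lam_hat = np.asarray(parent_counts, dtype=float) / T_parent
    return rng.poisson(lam_hat * t_prime)

# The worked example from the text: a 1 h parent and a 300 s child
# give an adjusted time of approximately 275 s.
t_prime = adjusted_time(300.0, 3600.0)
```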
2.4. Method 2: Binomial Sampling
The binomial sampling approach is closely related to a thinning operator that was first introduced by Steutel & van Harn in 1979, who showed that Poisson distributions are closed under thinning and retain their functional form when each event is independently retained with probability $p$ [18]. The operator was later formalized and widely adopted under the name “binomial thinning” in the statistics literature, where it serves as a fundamental variance-preserving transformation for Poisson count processes [19]. Despite its long history in discrete probability theory, we find no prior applications of such an approach to gamma-ray spectra, nor to the problem of generating statistically faithful short-duration spectra or families of spectra from a single long observation. To formalize the method, choose

$$y_i \sim \mathrm{Binomial}(x_i,\, p), \qquad p = \frac{t}{T}.$$
For the case of $p = 1/2$, this can be imagined as flipping a fair coin for each count in the spectrum; if the result is “heads” then the count is assigned to the first of two child spectra, and if the result is “tails” then the count is assigned to the second. We can then use $y_i$ to represent the number of heads in channel $i$ and introduce a second random variable $z_i = x_i - y_i$ to denote the number of tails. In this fashion the run is split into two distinct parts while preserving the expected variance. The technique is mathematically justified, as the marginal distribution of $y_i$ is Poisson with mean $p\lambda_i T = \lambda_i t$ (for proof, see Appendix A.2).
Method 2 can be extended to generate more than two child spectra through cascading binomial sampling, in which repeated applications of the method further subdivide the data. This approach is mathematically equivalent to applying the multinomial distribution. To preserve run-to-run variance, Method 2 will be implemented without replacement—that is, each newly generated child spectrum draws from, and correspondingly depletes, the remaining counts in the parent spectrum until no counts remain.
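A minimal sketch of cascading binomial subsampling without replacement, assuming NumPy and an equal time split across $m$ children (the function name and toy data are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

def binomial_split(parent_counts, m):
    """Method 2 sketch: split a parent into m equal-time children without
    replacement via cascading binomial sampling.

    Child j draws Binomial(remaining_i, 1/(m - j)) in each channel and
    depletes the parent; the cascade is equivalent to a multinomial split,
    so every parent count is allocated exactly once (losslessness).
    """
    remaining = np.asarray(parent_counts, dtype=np.int64).copy()
    children = []
    for j in range(m - 1):
        child = rng.binomial(remaining, 1.0 / (m - j))
        remaining -= child
        children.append(child)
    children.append(remaining)  # the last child takes all remaining counts
    return children

parent = np.array([12000, 300, 40, 5, 0])
kids = binomial_split(parent, m=12)
```

Because each child is drawn from (and subtracted out of) the counts still remaining in the parent, the children sum channel-by-channel to exactly the parent spectrum.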
2.5. Method 3A: Inverse Transform Sampling
Method 3A treats the parent spectrum as an empirical estimate of the underlying energy distribution and then draws individual counts for the child spectrum by inverse transform sampling. Let $Y$ be a non-negative integer-valued random variable with probability mass function

$$p_Y(k) = P(Y = k)$$

and cumulative distribution function

$$F_Y(k) = P(Y \le k) = \sum_{j=0}^{k} p_Y(j).$$

In our setting, the parent spectrum provides an empirical distribution over channels. Let $x_i$ be the total parent counts in channel $i$ and define

$$\hat{p}_i = \frac{x_i}{\sum_k x_k}$$

as the empirical probability that a count resides in channel $i$. The cumulative distribution function $F(i) = \sum_{k \le i} \hat{p}_k$ determines the channel assignment for each synthetic count. For a desired child live time $t$, we compute the target number of synthetic counts as $n = \mathrm{round}\!\left(\tfrac{t}{T}\sum_k x_k\right)$. Method 3A then proceeds as follows to generate a single child spectrum:
- (i) Sample $n$ independent random variables $u_1, \dots, u_n \sim \mathrm{Uniform}(0, 1)$.
- (ii) For each $u_j$, find the channel $k$ such that $F(k-1) < u_j \le F(k)$ and increment channel $k$ in the child spectrum.
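The two steps above can be sketched with NumPy as follows (an illustrative implementation, not RASE's; the function name and toy spectrum are ours):

```python
import numpy as np

rng = np.random.default_rng(1)

def inverse_transform_child(parent_counts, T_parent, t_child):
    """Method 3A sketch: one child drawn by inverse transform sampling."""
    parent = np.asarray(parent_counts, dtype=float)
    N = parent.sum()
    n = int(round(N * t_child / T_parent))  # target counts in the child
    cdf = np.cumsum(parent) / N             # empirical CDF over channels
    u = rng.random(n)                       # n Uniform(0, 1) variates
    # Channel k satisfying F(k-1) < u <= F(k):
    idx = np.searchsorted(cdf, u, side='left')
    idx = np.minimum(idx, parent.size - 1)  # guard against float round-off
    return np.bincount(idx, minlength=parent.size)

parent = np.array([12000, 300, 40, 5, 0])
child = inverse_transform_child(parent, T_parent=3600.0, t_child=300.0)
```

Every draw consults the same fixed CDF, so this is sampling with replacement: repeated children reuse the same parent counts.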
The method is visualized in Figure 1. On the left is a gamma-ray spectrum collected using a high-purity germanium (HPGe) detector and a Pu calibration standard; on the right, Method 3A is visualized.
Figure 1.
Gamma-ray spectrum acquired using a high-purity germanium (HPGe) detector and a plutonium calibration source (left) and the corresponding cumulative distribution function (CDF) used for inverse-transform subsampling (right). The red ‘X’ marks the point where a sampled uniform variate intersects the CDF, indicating that channel 3231 would be incremented for this realization.
Conditional on the child total $n$, the child counts follow a multinomial distribution with parameters $n$, the total counts in the synthetic child, and $\hat{p}_i$, the proportion of those counts expected in channel $i$. Under this model, $E[y_i] = n\hat{p}_i$ and $\mathrm{Var}(y_i) = n\hat{p}_i(1-\hat{p}_i)$, so the variance is slightly smaller than that of a Poisson random variable with mean $n\hat{p}_i$. Also, the rounding step used to define $n$ can bias the total number of counts assigned to each child. Apart from these effects, inverse transform sampling is distribution-agnostic and works directly with the empirical parent histogram.
This method is implemented as a non-default method by Chavez et al. [11] in the RASE software package. It is susceptible to information degradation or amplification in the progeny due to the nature of sampling with replacement.
2.6. Method 3B: Inverse Transform Sampling with Partial Replacement
Method 3B uses the same inverse-transform machinery as Method 3A but modifies the parent spectrum after each synthetic child is generated in order to reduce repeated use of the same parent counts. The goal is to improve losslessness by limiting parent count re-use. To illustrate, let $x_i^{(0)}$ denote the initial parent counts in channel $i$ and $N^{(0)} = \sum_i x_i^{(0)}$ the corresponding total counts. To generate the first synthetic child, we define empirical probabilities

$$\hat{p}_i^{(0)} = \frac{x_i^{(0)}}{N^{(0)}}$$

and choose

$$n = \mathrm{round}\!\left(\frac{t}{T}\,N^{(0)}\right).$$
The algorithm for a single child spectrum is:
- (i) Sample $n$ independent random variables $u_1, \dots, u_n \sim \mathrm{Uniform}(0, 1)$.
- (ii) For each $u_j$, find the channel $k$ such that $F(k-1) < u_j \le F(k)$, then
  - (a) increment channel $k$ in the child spectrum;
  - (b) decrement channel $k$ in the parent spectrum, i.e., set $x_k \leftarrow x_k - 1$.
- (iii) After completing all $n$ allocations, recompute the empirical probabilities and CDF from the updated parent spectrum.
In principle, one could recompute the CDF after each allocation to perfectly enforce sampling without replacement; however, for spectra (like ours) containing a very large number of gamma-ray incidences, this would be computationally infeasible. By updating in batches, we produce an approximation intended to retain most of the losslessness benefit without the computational burden.
Because the parent spectrum is depleted as children are generated, the sum of all child spectra remains close to the original parent spectrum, and Method 3B is substantially more lossless than Method 3A. On the other hand, recomputing the CDF in batches introduces a small bias since it is still possible to allocate counts to channels that have recently been depleted in the parent. Nonetheless, this method preserves total counts much more effectively than methods that sample repeatedly from a fixed parent histogram without any adjustment.
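A sketch of the batch-updated scheme, assuming NumPy; the clipping of over-allocated channels below is a simplification corresponding to the small bias discussed above (the function name and toy data are ours):

```python
import numpy as np

rng = np.random.default_rng(2)

def its_partial_replacement(parent_counts, m):
    """Sketch of Method 3B: inverse transform sampling with batch depletion.

    Each child of n counts is drawn from the current CDF; the allocated
    counts are then removed from the parent and the CDF is recomputed
    before the next child is drawn (updating per batch, not per count).
    """
    remaining = np.asarray(parent_counts, dtype=np.int64).copy()
    n = int(round(remaining.sum() / m))
    children = []
    for _ in range(m):
        total = remaining.sum()
        if total == 0:
            children.append(np.zeros_like(remaining))
            continue
        cdf = np.cumsum(remaining) / total
        idx = np.searchsorted(cdf, rng.random(n), side='left')
        idx = np.minimum(idx, remaining.size - 1)  # guard against round-off
        child = np.bincount(idx, minlength=remaining.size)
        # The batch is drawn with replacement, so a channel can be
        # over-allocated relative to its remaining counts; clipping here
        # is the small bias discussed in the text.
        remaining = np.maximum(remaining - child, 0)
        children.append(child)
    return children

parent = np.array([9000, 600, 300, 90, 10])
kids = its_partial_replacement(parent, m=4)
```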
2.7. Other Methods
Two other methods have been observed in applied practice that are not considered here: Gaussian subsampling and simple rescaling. Gaussian subsampling involves both rounding and truncation and is clearly unsuitable in a sparse Poisson data regime. Simple rescaling likewise requires rounding to produce integer counts in each channel and would yield a child spectrum that is radically underdispersed as well as an entire generation of identical children.
3. Results and Discussion
The dataset used for testing consists of 156 genuine child spectra, each collected for 300 s (with live time slightly less). By summing the counts at each channel, a longer-run spectrum was produced with duration 46,800 s. This spectrum will serve as the parent for synthetic data. By comparing genuine to synthetic replicates, we can deduce whether the synthetic replicates reproduce the desired statistical behavior of the genuine replicates.
As acknowledged above, the total counts may drift over time due to radionuclide decay and ingrowth, leading to small increases or decreases in total spectral counts per measurement, with potentially larger effects on individual peaks. In the present dataset, the change in predicted total counts between the first and last run, due mostly to ingrowth of Am-241, was negligible relative to our testing criteria. This drift is ignored in the following investigations.
In the remainder of this section, we will test each subsampling method for three test cases: $m = 2$, $m = 12$, and $m = 156$ (equivalently, $t/T = 1/2$, $1/12$, and $1/156$). This range probes method performance across both extreme and moderate subsampling scenarios. Synthetic spectra have been produced for each of these test cases, with channel index $i$ ranging from 1 to the total number of channels and run index $j$ ranging from 1 to 2, 1 to 12, or 1 to 156.
3.1. Run-to-Run Variance
To evaluate run-to-run variance, we compute the average normalized variance across all channels for both the synthetic replicates and the genuine replicates, and then compare these values. If the synthetic and genuine averages do not differ significantly, the subsampling method is deemed to have passed the run-to-run variance test.
To formalize this test, we first compute the variance at channel $i$ across all children. Specifically:

$$s_i^2 = \frac{1}{m - 1}\sum_{j=1}^{m}\left(y_{ij} - \bar{y}_i\right)^2, \qquad \bar{y}_i = \frac{1}{m}\sum_{j=1}^{m} y_{ij}.$$

We then normalize each $s_i^2$ by dividing by the average number of counts: $v_i = s_i^2/\bar{y}_i$. This value should always be close to unity, which allows visualization across a broad range of channel intensities. Figure 2 shows $v_i$ for the set of 156 genuine replicates. Note that the metric is centered around unity. Also, note the distinctively non-Gaussian behavior in the sparse region where, on average, each child spectrum has one or fewer counts. Hence, we do not expect $v_i$ to be symmetric about unity when averaged over all $i$.
Figure 2.
Normalized run-to-run variance for 156 replicate spectra. The (left) panel shows all channels with ample counts, where the normalized variance is centered around unity. The (right) panel shows the sparse region, where mean counts per child are at or below one and the distribution of normalized variances is explicitly non-Gaussian. The observed banding is a consequence of low-count Poisson statistics.
Lastly, we calculate the mean of the $v_i$ values across all channels, yielding the average normalized variance. This can be expressed as:

$$\bar{v} = \frac{1}{n_{\mathrm{ch}}}\sum_{i=1}^{n_{\mathrm{ch}}} v_i,$$

where $n_{\mathrm{ch}}$ is the total number of channels.
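As an illustrative computation of this statistic (assuming NumPy; zero-count channels are simply skipped here, which is one possible convention, and the toy data are ours):

```python
import numpy as np

def average_normalized_variance(children):
    """Run-to-run statistic: per-channel variance across children divided
    by the per-channel mean, averaged over channels (expected ~1 for
    Poisson-distributed counts)."""
    y = np.asarray(children, dtype=float)  # shape (m runs, channels)
    mean_i = y.mean(axis=0)
    var_i = y.var(axis=0, ddof=1)          # unbiased variance across runs
    v_i = np.divide(var_i, mean_i,
                    out=np.full_like(mean_i, np.nan), where=mean_i > 0)
    return np.nanmean(v_i)                 # average over non-empty channels

# Properly dispersed Poisson children give a value near unity.
rng = np.random.default_rng(3)
toy_children = rng.poisson(50.0, size=(156, 1000))
v_bar = average_normalized_variance(toy_children)
```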
Table 1 presents the average normalized variance $\bar{v}$ and its associated uncertainty (twice the standard error of the mean) for the three cases. Table 2 lists p-values from Wilcoxon signed-rank tests comparing $\bar{v}$ for synthetic replicates to that of the genuine replicates. Because the distributions of the normalized variance metrics are skewed and strongly non-Gaussian in low-count regions (Figure 2), we use the Wilcoxon signed-rank test instead of a parametric paired t-test. The Wilcoxon test does not assume normally distributed differences and is more robust than the t-test when the data contain outliers or heavy tails. Other nonparametric alternatives exist, such as the sign test, but the sign test uses only the sign of the paired differences and discards their magnitude, whereas here we care about both the direction and the size of the effect. A significance level of $\alpha = 0.05$ was chosen, and the p-values were adjusted using the Benjamini–Hochberg procedure to control the false discovery rate. While Poisson (naïve), binomial (no replacement), and inverse transform (with replacement) all produce synthetic child spectra where $\bar{v}$ is not significantly different from that of the genuine spectra, Poisson (variance-corrected) and inverse transform (partial replacement) produce underdispersed and overdispersed children, respectively. These effects are most profound when $m = 2$.
Table 1.
$\bar{v}$ computed for all selected scenarios. Uncertainty is presented as twice the standard error of the mean.
Table 2.
Summary of multiple hypothesis tests comparing $\bar{v}$ for genuine replicates to $\bar{v}$ for synthetic replicates. The reported statistic is the adjusted p-value associated with the Wilcoxon signed-rank test. A double asterisk emphasizes test cases where the difference was significant at the $\alpha = 0.05$ level.
To further assess the relationship between progeny size and run-to-run variance, the test set was maximally expanded. Sets of synthetic child spectra were produced for all factors of 156, that is, $m = 2, 3, 4, 6, 12, 13, 26, 39, 52, 78$, and $156$. A visualization of the relationship is given in Figure 3. Binomial sampling without replacement, naïve Poisson sampling, and inverse transform sampling with replacement all preserve run-to-run variance with comparable success across tested progeny sizes.
Figure 3.
Difference in mean normalized variance between synthetic and genuine spectra as a function of the number of child spectra. Point estimates are associated with 95% confidence intervals, though the intervals are narrow. Binomial sampling without replacement, inverse-transform sampling with replacement, and naïve Poisson sampling preserve run-to-run variance across a broad range of progeny sizes.
The profound failure of inverse transform sampling with partial replacement is surprising. Because the standard inverse transform method succeeds, and because the disparity is significantly greater for smaller progeny sizes, we suspect the nature of partial replacement drives this failure. Consider for a moment the $m = 2$ case. The inverse transform algorithm samples the first child spectrum from the parent with replacement. This means that by random chance, some channels in the child will populate with an observed count proportion different than that of the parent. Then, we subtract from each parent channel the counts allocated to the first child, which results in a fundamentally different CDF for the second child when compared to the first. As we increase the number of children, the algorithm has an increasing number of opportunities for recovery, such that the algorithm performs fine for large progeny sizes but fails catastrophically when the child set is small. Hence, our efforts to design a computationally efficient workaround for the problem of sampling with replacement made us the unwitting progenitors of a heavily biased sampling scheme.
3.2. Channel-to-Channel Variance
Channel-to-channel variance was studied by focusing on a flat section of the spectrum where no peaks are visible and the relationship between slope and energy is negligible. Additionally, we will select a sparse region with few counts per bin. Sparse spectral regions are of particular interest to researchers in the field of gamma-ray spectroscopy. Accurately representing sparse regions is required for algorithmic spectrum evaluation [20], to understand the naturally present gamma-ray background [21], and to characterize properties of the detector itself. Hence, a good subsampling algorithm must produce child spectra that maintain appropriate variance properties even in the sparse regions. Sparse regions amplify even small statistical artifacts, making them ideal for discriminating among subsampling methods. The chosen region is highlighted in Figure 4 (left) and spans channels 13,800 through 14,500.
Figure 4.
Parent HPGe gamma-ray spectrum with the sparse, approximately flat region used for channel-to-channel variance analysis highlighted (left) and counts in a chosen section of this region with the mean shown as a black dashed line and bounds shown as blue dashed lines (right). Sparse regions provide high sensitivity for detecting statistical artifacts introduced by subsampling.
Channel-to-channel variance will be calculated as the between-channel variance for the selected region normalized to the mean counts. Let $l$ be the index of the region lower bound and $u$ the index of its upper bound, such that $w = u - l + 1$ is the number of channels in the chosen region. Then the normalized channel-to-channel variance is calculated as

$$c = \frac{1}{\bar{y}}\cdot\frac{1}{w - 1}\sum_{i=l}^{u}\left(y_i - \bar{y}\right)^2, \qquad \bar{y} = \frac{1}{w}\sum_{i=l}^{u} y_i.$$
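This metric is straightforward to compute; a sketch assuming NumPy (the function name and toy spectrum are ours, with region bounds matching those quoted in the text):

```python
import numpy as np

def normalized_channel_variance(spectrum, l, u):
    """Between-channel variance over the flat region [l, u], normalized by
    the region's mean counts (expected ~1 for Poisson-distributed data)."""
    region = np.asarray(spectrum[l:u + 1], dtype=float)
    return region.var(ddof=1) / region.mean()

# A flat, sparse toy spectrum: the metric should sit near unity.
rng = np.random.default_rng(4)
toy_spectrum = rng.poisson(3.0, size=16384)
c_hat = normalized_channel_variance(toy_spectrum, 13800, 14500)
```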
While Wilcoxon signed-rank tests were a natural way to study run-to-run variance, conducting significance tests on the difference in normalized channel-to-channel variance between synthetic and genuine child spectra is difficult at small progeny sizes because the small sample size reduces statistical power. To accommodate these scenarios, we implemented a bootstrapping algorithm. At each iteration, the synthetic and genuine child spectra are resampled with replacement, the normalized channel-to-channel variance is computed for each group, and the values are stored in separate vectors. A difference vector is then computed, and bootstrap confidence intervals are obtained from its percentiles. A summary of these results is given in Figure 5, which compares the distributions of mean differences between resampled synthetic and genuine data; the median is denoted by a diamond, with the confidence interval shown above and below. The variance-correction method appears to preserve channel-to-channel variance at the largest tested progeny size but performs worse as the progeny size shrinks. All methods perform reasonably well at larger progeny sizes, but only binomial sampling without replacement performs well at the smallest. Across all scenarios, only binomial sampling without replacement consistently produces child spectra with channel-to-channel variance indistinguishable from that of the genuine spectra.
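The percentile bootstrap described above can be sketched as follows. This is a hypothetical implementation: the resample count, interval level, and seed are placeholders, not the paper's exact settings.

```python
import numpy as np

def bootstrap_diff_ci(synthetic, genuine, n_boot=10_000, alpha=0.05, seed=0):
    """Percentile-bootstrap CI for the difference in mean normalized
    channel-to-channel variance between synthetic and genuine children."""
    rng = np.random.default_rng(seed)
    synthetic = np.asarray(synthetic, dtype=float)
    genuine = np.asarray(genuine, dtype=float)
    diffs = np.empty(n_boot)
    for b in range(n_boot):
        # Resample each group with replacement and store the mean difference.
        s = rng.choice(synthetic, size=synthetic.size, replace=True)
        g = rng.choice(genuine, size=genuine.size, replace=True)
        diffs[b] = s.mean() - g.mean()
    lo, hi = np.quantile(diffs, [alpha / 2.0, 1.0 - alpha / 2.0])
    return float(np.median(diffs)), (float(lo), float(hi))

# Identical groups give a degenerate interval centered on zero:
med, ci = bootstrap_diff_ci(np.ones(30), np.ones(30), n_boot=500)
```

An interval that straddles zero indicates no detectable difference in channel-to-channel variance between the synthetic and genuine groups, which is the criterion visualized in Figure 5.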
Figure 5.
Bootstrapped distributions of the mean difference in channel-to-channel variance between synthetic and genuine spectra at the three tested progeny sizes. Diamonds denote the median mean difference, with error bars representing confidence intervals. Confidence intervals overlapping zero indicate no statistically significant difference in channel-to-channel variance between the groups. Only binomial sampling without replacement preserves channel-to-channel variance across all tested progeny sizes.
As with run-to-run variance, the bootstrapping procedure was extended to progeny sizes ranging up to 156. The trend for each method is visualized in Figure 6. All methods approach the correct level of dispersion, but binomial sampling without replacement once again consistently outperforms the other methods across all progeny sizes.
Figure 6.
Difference in mean channel-to-channel variance between synthetic and genuine spectra across progeny sizes. Error bars denote confidence intervals. Binomial sampling without replacement and variance-corrected Poisson sampling with replacement both appear to preserve channel-to-channel variance in the sparse spectral regions.
While both variance-corrected Poisson and binomial sampling seem to pass the channel-to-channel variance test, variance-corrected Poisson sampling requires a rounding step that introduces bias. In Figure 7, we present count frequency plots to visualize this bias. Smooth orange curves fit a generalized Poisson distribution to the count frequency data through the method of Maximum Likelihood Estimation (MLE). While variance-corrected Poisson sampling yields measures of center and spread that agree with the genuine data, an oscillatory artifact can be observed as a consequence of rounding. This oscillation is not present in the genuine data, making variance-corrected Poisson sampling a covertly problematic subsampling routine. Binomial sampling without replacement, on the other hand, produces child spectra with count frequencies that closely match the genuine data.
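The rounding artifact is easy to reproduce in isolation. The sketch below is illustrative only (it is not the paper's Method 1B): it scales Poisson counts by an arbitrary non-integer factor and rounds, so alternately two or three consecutive source integers collapse onto each output integer, producing oscillating frequencies of the kind seen in Figure 7.

```python
import numpy as np

rng = np.random.default_rng(2)
x = rng.poisson(10.0, size=100_000)   # well-dispersed integer counts
y = np.rint(0.37 * x).astype(int)     # non-integer scaling followed by rounding

freq = np.bincount(y)
# Rounding maps alternately two or three consecutive source integers onto
# each output integer, so neighboring frequencies oscillate instead of
# following a smooth Poisson-like curve.
sources_per_target = np.bincount(np.rint(0.37 * np.arange(28)).astype(int))
```

The `sources_per_target` vector alternates between 2 and 3, which is exactly why adjacent bins of `freq` are alternately inflated and starved even though the overall center and spread remain plausible.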
Figure 7.
Frequency distributions of counts in the sparse spectral region. Error bars represent the expected variance of a Poisson variate, and dashed curves show the generalized-Poisson maximum-likelihood fit to the genuine replicates. (Left) The distribution from variance-corrected Poisson sampling, which exhibits unphysical oscillatory artifacts due to rounding even though the global distribution remains accurate. (Right) The distribution from binomial sampling without replacement, which matches the desired distribution in all respects.
3.3. Losslessness
Losslessness will be evaluated using two distinct methods: first, by comparing raw count totals, and second, by calculating the Wasserstein distance between the genuine and synthetic data. The first test assesses the preservation of total counts while the second assesses the extent of distributional disagreement. For the first test, we compute the total counts in each child spectrum by summing across channels to get C, which can be expressed mathematically as

$$C = \sum_{i=1}^{N} c_i,$$

where $c_i$ is the count in channel $i$ and $N$ is the number of channels.
The normalized count total is then computed by dividing C by the total counts in the parent spectrum. Values less than unity indicate a loss of information due to sampling, while values greater than unity indicate that information has been added; any deviation from unity indicates that the method is not lossless. We report this ratio as the percentage of counts preserved by the method. Figure 8 compares this percentage by method across progeny sizes. Binomial sampling without replacement is perfectly lossless, and the two inverse transform methods are both nearly lossless. The Poisson sampling methods, however, produce more counts than appropriate at some progeny sizes and fewer than desired at others.
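For reference, a lossless binomial partition can be sketched as follows (an illustrative implementation, not the authors' code): each channel's counts are split among the children without replacement via sequential binomial draws, which is equivalent to a multinomial allocation, so the children sum exactly to the parent.

```python
import numpy as np

def recovered_fraction(children, parent):
    """Normalized count total: summed child counts over parent counts.
    Exactly 1.0 for a lossless method."""
    return sum(int(c.sum()) for c in children) / int(np.sum(parent))

rng = np.random.default_rng(3)
parent = rng.poisson(100.0, size=1_000)

n_children = 4
children, remaining = [], parent.copy()
for j in range(n_children):
    # Allocate each remaining count to child j with prob 1/(n_children - j);
    # on the final pass the probability is 1, so nothing is left behind.
    draw = rng.binomial(remaining, 1.0 / (n_children - j))
    children.append(draw)
    remaining = remaining - draw
```

Because every count is assigned to exactly one child, the recovered fraction is identically 1.0, matching the perfect losslessness reported for binomial sampling without replacement.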
Figure 8.
Percentage of total counts recovered by each method across progeny sizes, as computed for the full spectrum. All methods are reasonably lossless, but only binomial sampling without replacement is perfectly lossless.
As previously discussed, we are particularly interested in preserving sparse spectral regions. By considering only the portion of the spectrum between channels 13,800 and 14,500, we can test losslessness in one such region. In Figure 9 we visualize the percentage of counts recovered for this subset. As expected, binomial sampling without replacement remains perfectly lossless. Variance-corrected Poisson sampling, on the other hand, produces synthetic children lacking a substantial fraction of the original sparse-region counts. This dramatic decrease in performance likely results from the rounding required to sample whole-number counts in each bin.
Figure 9.
Percentage of counts recovered in the sparse spectral region (channels 13,800–14,500). Binomial sampling without replacement is again the only method that is perfectly lossless, and all other methods show defects. Variance-corrected Poisson is especially problematic.
The second test of losslessness compares the parent spectrum to the sum of the synthetic children for each method, evaluating the fidelity with which each method reconstructs the original data. We use the Wasserstein distance for this comparison. The Wasserstein distance is a metric on probability distributions and can be thought of as the minimum cost of transporting probability mass to transform one distribution into the other. Formally, for probability measures P and Q represented by samples $x_1, \ldots, x_n$ and $y_1, \ldots, y_n$, respectively, the p-Wasserstein distance is

$$W_p(P, Q) = \left( \inf_{\sigma} \frac{1}{n} \sum_{i=1}^{n} \left\lVert x_i - y_{\sigma(i)} \right\rVert^{p} \right)^{1/p},$$

where the infimum is computed over all permutations $\sigma$ of the n samples realized from Q [22].
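On a shared, unit-spaced channel grid, the 1-Wasserstein distance between two histograms reduces to the area between their normalized CDFs, which gives a compact way to compute it. This is an illustrative sketch with our own function and variable names, not the paper's implementation.

```python
import numpy as np

def spectrum_wasserstein(p, q):
    """1-Wasserstein distance between two histograms defined on the same
    unit-spaced channel grid, via the area between normalized CDFs."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    cdf_p = np.cumsum(p) / p.sum()
    cdf_q = np.cumsum(q) / q.sum()
    return float(np.abs(cdf_p - cdf_q).sum())

parent = np.array([0, 5, 10, 5, 0], dtype=float)
d_perfect = spectrum_wasserstein(parent, parent)              # identical spectra
d_shifted = spectrum_wasserstein(parent, np.roll(parent, 1))  # one-channel shift
```

A perfect reconstruction gives distance zero, while shifting all counts by one channel costs exactly one unit of transport, which makes the metric easy to interpret on spectral data.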
Figure 10 presents a barplot of Wasserstein distance across methods for all progeny sizes, while Figure 11 presents the same comparison for the sparse region. Binomial sampling without replacement perfectly reconstructs the original data. Inverse transform sampling with partial replacement consistently has the second-smallest Wasserstein distance (less than 0.05 for all progeny sizes) and is barely visible on the plot. The remaining three methods were not significantly different from one another. Comparing Wasserstein distance across sampling scenarios for the sparse subset region yields similar results; consistent with the count-recovery test summarized in Figure 9, variance-corrected Poisson sampling performed much worse than the other methods in this region. As expected, binomial sampling without replacement consistently outperforms all other methods on the Wasserstein metric, with inverse transform sampling with partial replacement a close second.
Figure 10.
Wasserstein distance between the parent spectrum and the summed synthetic child spectra for each subsampling method across progeny sizes as computed for the full spectrum. Binomial sampling without replacement achieves a distance of zero, indicating perfect reconstruction. Inverse-transform sampling with partial replacement is the next closest method, with differences too small to be visible at this scale.
Figure 11.
Wasserstein distance between genuine and reconstructed spectra in the sparse region (channels 13,800–14,500). Binomial sampling without replacement again achieves perfect reconstruction, and inverse-transform sampling with partial replacement is again the next closest method.
3.4. Summary of Evaluation
In this section, we compared the run-to-run variance of synthetic child spectra to that of genuine replicates and found three methods that adequately capture this important source of information: naïve Poisson sampling, binomial sampling without replacement, and inverse transform sampling with replacement. A visual inspection of the relationship between progeny size and the difference in average normalized variance exposed the inadequacy of the variance-corrected Poisson method and the inverse transform with partial replacement at small progeny sizes. Channel-to-channel variance was then measured and significance assessed using the bootstrap. Binomial sampling without replacement was the only sampling technique that successfully preserved channel-to-channel variance in sparse regions. Though variance-corrected Poisson sampling appeared promising at first, a plot of channel count frequency revealed major distributional disagreement between the synthetic and genuine replicates. Lastly, we tested losslessness to determine how well each method preserved the total information present in the parent spectrum. Due largely to sampling without replacement, binomial sampling without replacement and inverse transform sampling with partial replacement outperformed the others in terms of both total counts and summed-spectrum Wasserstein distance.
4. Conclusions
This study evaluated five spectral subsampling methods using three criteria: run-to-run variance, channel-to-channel variance, and losslessness. Our tests show that binomial random sampling without replacement is uniquely capable of preserving key statistical properties of genuine replicate spectra. It avoids artifacts introduced by rounding, maintains proper dispersion even in sparse spectral regions, and perfectly preserves total counts across child spectra. While other methods—such as inverse transform sampling or variance-corrected Poisson sampling—show partial strengths, they fail under certain conditions, particularly at low progeny sizes or in sparse data regions. Rounding, in particular, introduces bias that may not be readily apparent, but which significantly affects spectrum fidelity.
It is worth noting that the selected method, binomial sampling without replacement, does not produce completely independent children. By design, the allocation of a count to one child requires that count to be absent from all other children, inducing a negative correlation across children for a particular channel, given the parent. Thus, child spectra derived from a single parent are dependent on that parent, whereas genuine replicates would be completely independent. For the purposes of this study, the induced dependence is not critical since our central goal is to determine how well different subsampling algorithms reproduce the marginal behavior of individual child spectra. As shown in Appendix A.2, under the binomial sampling regime the marginal distribution of each child matches an independent Poisson measurement with the correct rate parameter, and this result is corroborated by our analysis. Finally, dependence on the parent should be expected from any partitioning algorithm, and it does not alter the conclusions we draw.
Numerous research domains stand to benefit from the binomial sampling approach, including our primary applications of RIID algorithm verification, machine learning for gamma-ray spectroscopy, and limit-of-detection (LOD) studies aimed at strengthening nuclear triage capabilities. Beyond our immediate focus, this method may also offer value in other scientific fields where spectral data are available only in histogram form and list-mode event data are absent. In such contexts, our technique provides a statistically principled means of approximating the behavior of list-mode data without requiring timestamps.
We recommend the adoption of binomial sampling without replacement as a default method for generating synthetic child spectra when only a single parent histogram is available. The approach is mathematically defensible, straightforward to implement, and robust across various sampling scenarios. Software developers relying on alternative methods may benefit from revisiting their implementations in light of these findings.
Future work may examine the robustness of this recommendation for spectra affected by temporal drift or more complex detector behaviors. Nonetheless, this analysis provides a strong foundation for improving the fidelity of synthetic gamma-ray spectra in nuclear safeguards, emergency response, and machine learning applications.
Author Contributions
H.J.H.: methodology; software; writing—original draft; writing—review and editing. T.L.B.: conceptualization; methodology; writing—review and editing. S.C.: conceptualization; writing—review and editing. J.K.: conceptualization. D.J.M.: conceptualization; data curation; methodology; software; writing—original draft; writing—review and editing. A.A.S.: software. T.J.S.III: software; writing—review and editing. E.N.S.: software; writing—review and editing. All authors have read and agreed to the published version of the manuscript.
Funding
This work was funded by the National Nuclear Security Administration (NNSA), Office of Nuclear Incident Response (NA-84).
Data Availability Statement
The original 156 child spectra will be made available to the public.
Conflicts of Interest
The authors declare no conflicts of interest.
Abbreviations
The following abbreviations are used in this manuscript:
| LOD | Limit of Detection |
| RIID | Radiation Isotope Identification Devices |
| BAT | Burst Alert Telescope |
| DPH | Detector Plane Histogram |
| XRF | X-ray Fluorescence |
| AIP | Algorithm Improvement Project |
| SNM | Special Nuclear Material |
| GADRAS | Gamma Detector Response and Analysis Software |
| MLP | Multi-Layer Perceptron |
| CNN | Convolutional Neural Network |
| LSTM | Long Short-Term Memory |
| RASE | Replicative Assessment of Spectroscopic Equipment |
| FRAM | Fixed-Energy Response-function Analysis with Multiple efficiencies |
| CDF | Cumulative Distribution Function |
| PMF | Probability Mass Function |
| MLE | Maximum Likelihood Estimation |
Appendix A
Appendix A.1. Variance Derivation for Method 1B
In Method 1B, the variance-corrected child count in each channel is constructed as the sum of two correlated random variables. We seek an expression for the correction parameter in terms of $t_\ell$ and $t_s$ (total live time and total subset time, respectively) that forces the variance of the child count to equal the expected variance of a properly dispersed spectrum. Because the child count is, by construction, the sum of two correlated random variables, its variance can be composed term by term, with Equation (A5) using the law of total variance to propagate the variance of the parent count. Setting the resulting expression (A7) equal to the target variance and solving yields the corrected parameter.
While the parent count is guaranteed to be an integer, the corrected child count is not. Since we require whole-number counts in each channel (we do not detect some fraction of a photon; a photon is either detected or it is not), the corrected count is rounded to the nearest integer.
Appendix A.2. Marginal Distribution Under Binomial Subsampling
Choose

$$X_i' \mid X_i \sim \mathrm{Binomial}\!\left(X_i,\, p\right), \qquad p = \frac{t_s}{t_\ell},$$

where $X_i$ is the count in channel $i$ of the parent spectrum, $X_i'$ is the corresponding count in the child spectrum, and $t_\ell$ and $t_s$ are the total live time and total subset time. To show this technique is mathematically sound, we use the law of total probability to derive the marginal distribution of $X_i'$. We know that $X_i$, the counts in channel $i$ of the parent spectrum, is a Poisson random variable, so let $X_i \sim \mathrm{Poisson}(\lambda_i)$. Then $X_i' \mid X_i \sim \mathrm{Binomial}(X_i, p)$. Hence,

$$P(X_i' = y) = \sum_{x = y}^{\infty} \binom{x}{y} p^{y} (1-p)^{x-y}\, \frac{e^{-\lambda_i} \lambda_i^{x}}{x!} = \frac{e^{-p\lambda_i} \left(p\lambda_i\right)^{y}}{y!},$$

which is the probability mass function (PMF) of a Poisson random variable with rate parameter $p\lambda_i$, where $p = t_s / t_\ell$ and $\lambda_i$ is the parent rate in channel $i$. Thus, $X_i'$ chosen in this manner should have the correct channel-to-channel variance, and display no bias due to rounding. The expected variance of $X_i'$, conditional on $\lambda_i$, for child spectra chosen through this method is $p\lambda_i$.
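The thinning result above can be checked empirically. The following Monte Carlo sketch is illustrative only; the rate and thinning probability are arbitrary values of our choosing.

```python
import numpy as np

rng = np.random.default_rng(4)
lam = 40.0   # parent Poisson rate (arbitrary illustrative value)
p = 0.25     # thinning probability, playing the role of t_s / t_l

parents = rng.poisson(lam, size=200_000)
children = rng.binomial(parents, p)   # binomial thinning of each parent count

# If the Poisson(p * lam) marginal holds, the mean and variance of the
# children should both be near p * lam = 10.
m = children.mean()
v = children.var(ddof=1)
```

The agreement of the empirical mean and variance with $p\lambda$ is exactly the "correct channel-to-channel variance" property used in the main text.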
References
- NASA/GSFC HEASARC. Batsurvey—Perform BAT Survey Imaging Analysis. HEASoft. 2008. Available online: https://heasarc.gsfc.nasa.gov/docs/software/lheasoft/help/batsurvey.html (accessed on 21 December 2025).
- XRD & XRF Raw Data. Available online: https://doi.org/10.17632/nkpmdtdkfw.1 (accessed on 21 December 2025).
- Figueroa-Rosales, E.X.; Martínez-Juárez, J.; García-Díaz, E.; Hernández-Cruz, D.; Sabinas-Hernández, S.A.; Robles-Águila, M.J. Photoluminescent properties of hydroxyapatite and hydroxyapatite/multi-walled carbon nanotube composites. Crystals 2021, 11, 832.
- Enghauser, M. Algorithm Improvement Program Nuclide Identification Algorithm Scoring Criteria and Scoring Application; Technical Report; Sandia National Laboratories (SNL-NM): Albuquerque, NM, USA, 2016.
- Mitchell, D.J.; Harding, L.; Thoreson, G.G.; Horne, S.M. GADRAS Detector Response Function; Technical Report; Sandia National Laboratories (SNL-NM): Albuquerque, NM, USA, 2014.
- Fournier, S.D.; Enghauser, M.; Leonard, E.J.; Thoreson, G.G. GADRAS Batch Inject Tool User Guide; Technical Report; Sandia National Laboratories (SNL-NM): Albuquerque, NM, USA, 2020.
- Lalor, P.; Adams, H.; Hagen, A. Sim-to-real supervised domain adaptation for radioisotope identification. Nucl. Instrum. Methods Phys. Res. Sect. A Accel. Spectrometers Detect. Assoc. Equip. 2026, 1083, 171159.
- PyRIID v.2.0.0. Available online: https://doi.org/10.11578/dc.20221017.2 (accessed on 21 December 2025).
- Kwon, J.; Kim, J.; Kim, H.; Kim, S.; Jang, S.; Lee, J.; Kim, Y.S. Development of gamma-spectrum data generation method by Monte Carlo simulation. J. Korean Phys. Soc. 2023, 82, 658–670.
- Agostinelli, S.; Allison, J.; Amako, K.; Apostolakis, J.; Araujo, H.; Arce, P.; Asai, M.; Axen, D.; Banerjee, S.; Barrand, G.; et al. Geant4—A simulation toolkit. Nucl. Instrum. Methods Phys. Res. Sect. A Accel. Spectrometers Detect. Assoc. Equip. 2003, 506, 250–303.
- Chavez, J.R.; Czyz, S.A.; Sangiorgio, S.; Brodsky, J.P.; Kosinovsky, G.A. Replicative Assessment of Spectroscopic Equipment; Lawrence Livermore National Laboratory (LLNL): Livermore, CA, USA, 2020.
- Arlt, R.; Baird, K.; Blackadar, J.; Blessenger, C.; Blumenthal, D.; Chiaro, P.; Frame, K.; Mark, E.; Mayorov, M.; Milovidov, M.; et al. Semi-empirical approach for performance evaluation of radionuclide identifiers. In Proceedings of the 2009 IEEE Nuclear Science Symposium Conference Record (NSS/MIC), Orlando, FL, USA, 24 October–1 November 2009; pp. 990–994.
- Flynn, A.; Boardman, D.; Reinhard, M.I. The validation of synthetic spectra used in the performance evaluation of radionuclide identifiers. Appl. Radiat. Isot. 2013, 77, 145–152.
- Vo, D.T.; Sampson, T.E. FRAM, Version 6.1 User Manual; Technical Report; Los Alamos National Laboratory (LANL): Los Alamos, NM, USA, 2020.
- Burr, T.; Hamada, M.S.; Graves, T.L.; Myers, S. Augmenting real data with synthetic data: An application in assessing radio-isotope identification algorithms. Qual. Reliab. Eng. Int. 2009, 25, 899–911.
- Burr, T.; Hamada, M. Radio-isotope identification algorithms for NaI γ spectra. Algorithms 2009, 2, 339–360.
- Raikov, D. On the decomposition of Gauss and Poisson laws. Izv. Math. 1938, 2, 91–124.
- Steutel, F.W.; van Harn, K. Discrete analogues of self-decomposability and stability. Ann. Probab. 1979, 7, 893–899.
- Weiss, C.H. Thinning operations for modeling time series of counts—A survey. Adv. Stat. Anal. 2008, 92, 319–341.
- Bossew, P. A very long-term HPGe-background gamma spectrum. Appl. Radiat. Isot. 2005, 62, 635–644.
- Wang, Y.; Liu, Y.; Wu, B.; Meng, X.; Wang, J.; Cheng, J. Reconstruction of indoor gamma-ray background spectrum for HPGe detectors. Radiat. Meas. 2024, 174, 107139.
- Panaretos, V.M.; Zemel, Y. Statistical aspects of Wasserstein distances. Annu. Rev. Stat. Its Appl. 2019, 6, 405–431.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.