1. Introduction
Cancer arises from alterations in the inheritable molecular characteristics of normal cells [
1]. The majority of these inheritable characteristics are contained within the DNA of the cells. Changes in the structure and organization of the DNA within the nucleus can result in unregulated growth of these cells and hence cancer. Quantitative DNA analysis measures the DNA content and its spatial organization within individual cell nuclei, revealing various cellular characteristics such as ploidy status and cell cycle phase, which are identified by their fractional DNA content [
2]. As cells transform from a normal to a malignant state, several alterations occur in the DNA and evaluating the total DNA content in a cell population can inform us about the extent of genetic damage the cells have undergone. Flow cytometry and image cytometry are widely used methods that allow us to quantify the DNA content within individual cells in large cell populations accurately, making it an increasingly important tool in clinical tumor analysis [
3]. These methods rely on the use of dyes or stains which bind stoichiometrically to DNA, providing an accurate measurement of the DNA content within the cell [
4].
The Feulgen reaction remains one of the most commonly used DNA stains in image cytometry to specifically detect and quantify DNA in a consistent, reproducible, and standardized manner [
5,
6]. It enables the specific staining of DNA by utilizing Schiff reaction or Schiff-like reagents such that the amount of dye found in the cell is proportional to DNA concentration [
7]. While Thionin-stained absorption image cytometry is a valuable tool for DNA quantitation, it is often subject to methodological limitations such as staining variability and optical artifacts including refraction and glare effects.
Biological tissue samples exhibit non-uniform absorption and are inherently heterogeneous in the refractive index due to structural protein variations between nuclei, cytoplasm, connective tissue, and the embedding or mounting medium [
8]. As light passes through biological tissue, it undergoes multiple interactions, including refraction, partial reflection, and scattering, causing local variations in the transmitted intensity that are unrelated to chromogen absorption [
9]. The Beer–Lambert law describes the exponential attenuation of light as it passes through a homogeneous, non-scattering medium, assuming that all losses in transmitted intensity are due solely to absorption [
10]. Under these ideal conditions, the optical density (OD) is linearly proportional to the concentration of the absorbing species and the optical path length. However, in biological tissues and cytological samples, glare, scatter, and refraction introduce additional optical contributions which violate the assumptions of the Beer–Lambert law [
10]. In return, these effects results in an underestimation or overestimation of the optical density at each measurement point (pixel) and, consequently, the apparent DNA content. Although attenuation in biological tissue can be represented using a modified Beer–Lambert law to account for both absorption and scattering, our approach targets refraction-related artifacts, which represent the primary source of error in these images.
Given the importance of generating accurate measurements for precise DNA quantitation, correction methods that focus on reducing optical limitations or systematic errors are of great interest [
11]. Previous studies have shown that glare and refraction within the microscope optics can significantly distort optical density measurements in stained nuclei, leading to an altered measurement of DNA content and increased measurement variability. Haroske et al., for example, demonstrated that subtracting the glare contribution from transmitted light intensity improves both the precision and accuracy of DNA cytometry measurements [
12].
Spectral unmixing refers to the process of computationally unmixing stains to determine the concentration of the stain at every pixel in a selected area using the absorption spectra of each stain in the image. Thionin exhibits a distinctive and sharply peaked (600 nm) absorption spectra when compared to other non-specific absorbers or refraction effects observed within pathology slides (
Figure 1). Refraction effects generally do not cause light to be absorbed but cause light to not take a straight path through the sectioned tissue or cells. It causes more light to be detected at some pixels and less light at other pixels, so it has the effect of an absorber with both positive and negative concentrations. In this context negative concentration indicates more light appears to come from a pixel than it was illuminated with as light from neighboring pixels has been diverted by refraction into the measured pixel. By mathematically decomposing the measured spectrally overlapping signals in an image into their constituent absorptive and refractive components, spectral unmixing enhances the accuracy of stain quantification and enables more precise estimation of nuclear DNA content in cytology and histology slides. In this paper, we describe the process of utilizing spectral unmixing to improve the measurement of DNA content and quantification of the distribution of DNA within the nuclei of cells by correcting for refraction.
2. Materials and Methods
2.1. Cell Culture
Cells were grown on cytology slides to optimize staining protocols. HL-60 acute promyelocytic leukemia cells were obtained from the American Type Culture Collection (Manassas, VA, USA) and maintained in Iscove’s Modified Dulbecco’s Medium supplemented with 10% fetal bovine serum and 1% penicillin/streptomycin. Cultures were incubated at 37 °C in an atmosphere of 95% air and 5% CO2.
For slide preparation, autoclaved, uncharged, pre-cleaned glass slides were placed into square culture dishes (three slides per dish) and covered with 15 mL of cell suspension at 5 × 105 cells/mL in growth medium. Phorbol 12-myristate 13-acetate (PMA; 15 µL of a 1 mg/mL solution in ethanol; Sigma-Aldrich, Oakville, ON, Canada) was added to each dish to promote cell adherence. After 48 h, slides were rinsed, fixed in Sed-Fix® (Surgipath, Richmond, IL, USA) for 40 min, and air-dried overnight. Prior to staining, dried fixative was removed by immersing slides in ethanol for 20 min at room temperature followed by air-drying.
HL-60 slides were used because they provide a convenient and reproducible material and were routinely employed in our laboratory to monitor batch-to-batch variation in Thionin staining.
2.2. Thionin Staining
Thionin acetate (Sigma-Aldrich) was used to formulate the staining reagents. All solutions were prepared one day in advance. For the preparation of approximately 250 mL of Thionin stain, 0.125 g of Thionin powder was dissolved in 110 mL of deionized water and heated to boiling for 5 min. After cooling to room temperature, 32.5 mL of 1 N hydrochloric acid, 110 mL of tert-butanol, and 2.175 g of sodium bisulphite were added. The mixture was stirred for one hour, kept overnight, and filtered immediately before staining.
All staining procedures were carried out at 23–24 °C using a temperature-controlled water bath, with extensive rinses in deionized water between steps. Slides were post-fixed for one hour in Böhm-Sprenger fixative (methanol, formalin, and acetic acid at a 16:3:1 ratio), hydrolyzed in 5 N hydrochloric acid for one hour, and then stained in the Thionin solution for one hour. This was followed by three rinses in bisulphite solution (0.5% sodium bisulphite in 0.05 N hydrochloric acid), each separated by water washes. After a final wash, slides were dehydrated through three 30-s ethanol baths, cleared in xylene, and cover slipped for imaging.
2.3. Hyperspectral Imaging and Analysis
Hyperspectral imaging was performed using our in-house developed automated hyperspectral whole-slide imaging system. The hyperspectral scanner consists of a brightfield microscope (Zeiss Axioscope 2 Mot plus microscope and Zeiss 20X (NA 0.75) (Carl Zeiss Microscopy, Jena, Germany) plan APROCHROMAT objective lens), a CRI Varispec tunable liquid crystal filter (tunable between 420 nm and 720 nm in 20 nm increments), and an Andor Neo camera with a computer-controlled x-y stage (Marzhauser Wetzlar SCAN series ((Märzhäuser Wetzlar GmbH & Co. KG, Wetzlar, Germany)). Images were acquired with a pixel sampling spacing of 0.233 μm (20× magnification) in both x and y coordinates. The Region of Interest (ROI) to be imaged is selected by the user prior to imaging. The system automatically collected 16 different spectral images from 420 to 720 nm per camera field and covered the defined area to be imaged with sequentially acquired overlapping camera fields. Once the ROI was imaged, the system software aligned and stitched all overlapping camera fields together to make a seamless hyperspectral image. A spectral image stack from an empty area of each of the slides was also collected for flatfield calibration for each wavelength specific image (images with the light source blocked).
As the Thionin stain is stoichiometric for DNA, DNA content is proportional to the integrated optical density (IOD) of the cell. Using the spectral flatfield calibration images, we calculated the optical density for every pixel for each wavelength in the images covering the selected ROI. These optical density images were the inputs to the spectral unmixing process.
2.4. Spectral Unmixing
Spectral unmixing is performed to separate objects in the sample based on their spectra using the formula:
D is the Optical density Image data (M × N),
C is the concentration matrix (M × K),
S is the spectra Matrix (K × N) and
E is the Noise (M × N) (K components, N wavelengths and M image size in pixels) [
13,
14].
The optical density image data is reconstructed from the concentration data and spectral matrix; in our case, we used a spectral matrix with two components (k = 2).
Using the observed absorption spectra of Thionin (measured by averaging over multiple pixel areas occupied by nuclei) and the observed loss of light intensity spectra from areas of unstained cell cytoplasm (which we have assumed is due to refraction caused by the difference in index of refraction of cell cytoplasm and the surrounding paramount used to affix the coverslip) the stains are computationally unmixed to determine the concentration of stain for every pixel in the selected area and the amount of refraction occurring at each pixel location. Although Feulgen staining employs a single chromogen/dye (Thionin, absorption peaked at 600 nm), the measured spectrum is not composed of a single component. Biological tissues and cytological samples exhibit close to wavelength-independent refraction that add optical contributions to the apparent peaked absorbance signal of Thionin. As a result, measuring the spectrum at each pixel represents a combination of the true Thionin absorption spectra and an additional refraction-related component. Spectral unmixing is therefore capable of separating these components, with the Thionin absorbance spectrum treated as one spectrally distinct component and the refraction contribution treated as a second spectrally dispersed component.
The spectral unmixing assumes that every pixel in the flatfield- and dark field-corrected recorded images (16 wavelengths) was a linear combination of the concentration of the individual stains occurring at that pixel weighted by the absorption characteristics of each of the stains occurring at that pixel. When mathematically separating the components, it adds an additional error term to each pixel to compensate for the electronic and photonic noise in the images. The concentration of the Thionin was required to be non-negative while the concentration of the refraction effects was allowed to be both positive and negative.
The Multivariate Curve Resolution–Alternating Least Squares algorithm (MATLAB R2014a) was used to separate the linear combinations of absorption stains, each varying in concentration at individual pixels. The analysis involved transforming the collected hyperspectral image data using a logarithm base 10 scale to make the result linear with amount of absorption occurring at each pixel (Beer–Lambert law is commonly applied to chemical analysis measurements to determine the concentration of chemical species that absorb light) [
15,
16]. This data was then modeled to represent concentration images for each stain, each multiplied by the corresponding stain spectra plus an error term [
13,
14].
The spectral component used to model refraction in the unmixing procedure was derived empirically from regions of cytoplasm that should be devoid of Thionin. These areas exhibited a weak wavelength-dependent signal, which we attribute to light redistribution caused by local refractive index variations within and between the cytoplasm and the surrounding medium. This component is not intended to represent a universal physical spectrum of refraction, but rather a residual term that captures the dominant optical contributions present in the image. The purpose of including this term is to remove refraction-induced distortions from the Thionin channel and thereby improve nuclear contrast (potentially making segmentation easier) and measurement consistency.
4. Discussion
Here, we describe the utility of spectral unmixing in reducing refraction-related artifacts and improving the accuracy of DNA content measurements and potentially texture feature discrimination ability. Data derived from spectrally unmixed images provided more consistent and reliable estimates of DNA content, offering a representation that more closely reflects the true spatial distribution of DNA within individual cell nuclei. Although Thionin-stained absorption image cytometry remains a well-established and valuable tool for DNA quantitation, it is often limited by procedural limitation such as staining variability, illumination non-uniformity, and calibration inconsistencies. A common highlighted issue is that the measurements are affected by glare and refraction effects in the microscope system. By explicitly modeling and computationally reducing the contributions of refraction effects through spectral unmixing, we achieved a marked improvement in measurement accuracy and image interpretability.
From a DNA ploidy analysis perspective, substantial effort has been devoted to mitigating the effects of glare and refraction artifacts that compromise the accuracy of quantitative cytometry [
11,
12,
18]. Glare, caused by unwanted light scattering and reflection within the optical system, can distort the intensity measurements of DNA-stained cells, leading to inaccuracies in ploidy assessment [
19]. Refraction, caused by variations in the index of refraction of the cell cytoplasm and nuclei DNA and surrounding medium, can redirect or focus light within the sample, thereby degrading measurement accuracy. Previous methods have proposed correction methods for optical glare which have been applied and tested for DNA cytometry. These procedures rely on subtraction of the mean glare transmittance from each object to eliminate errors due to different nuclear Thionin-staining intensities [
18]. Other correction methods have focused on reducing glare and refraction effects depending on size and mean optical density [
12].
Spectral unmixing leverages the distinct spectral profiles of refraction which appear to exhibit a more uniform spectra appearance when compared to Thionin, which displays a characteristic absorption peak near 600 nm. This approach assumes a linear relationship between pixel intensities and the spectral contributions of each stain while compensating for noise introduced by factors such as refraction and effects usually attributed to glare. By separating the pure spectra of each component in the image, spectral unmixing reduces the optical contributions and distortions caused by refraction. This enables selective isolation of the pure Thionin absorption spectrum, thereby reducing optical distortions and improving the fidelity of DNA content measurements. Beyond ploidy analysis, texture features describing the spatial distribution of chromatin within nuclei are widely used in cytopathology to provide diagnostic and prognostic information for human cancers [
20]. In addition to improving ploidy analysis, by removing the background noise and refraction effects in the image, it appears to effectively enhance texture features discriminating ability. While the results here are based on hyperspectral images taken over 16 wavelengths as a proof of concept, the same process appears to work with only three spectral bands (RGB) as seen in
Figure 2. So, this approach could have broad applicability as it works with conventional color cameras for Feulgen-stained samples or any other stain with a relatively peaked absorption as a function of wavelength.
Overall, spectral unmixing has emerged as a powerful computational approach in many applications of histology and cytology research. By successfully isolating overlapping signals and removing optical artifacts introduced in the image by refraction, it significantly enhances the precision of DNA quantitation, reduces measurement variability, and appears to improve the utility of texture feature measurements. These findings suggest that incorporating spectral unmixing into standard image cytometry workflows may provide a more accurate framework for quantitative pathology. From a computational perspective, spectral unmixing is scalable and can be easily applied to larger datasets and whole-slide images. The primary concerns are image size and processing time, which can be addressed through increased memory or GPU acceleration. As such, the approach is compatible with high-throughput workflows and can be extended to larger studies without methodological limitations.
In summary, this study introduces a novel application of spectral unmixing for quantitative image cytometry, demonstrating that modeling refraction as a separate spectral component effectively corrects optical distortions inherent to brightfield imaging. By doing so, it enhances the precision of DNA ploidy quantitation and strengthens the discriminative power of nuclear texture features, both of which are critical for diagnostic and prognostic cytopathology. As spectral unmixing methods can be easily integrated and applied, incorporating this correction framework may help standardize DNA content measurements and refine image-derived metrics for cancer diagnostics and digital pathology.