1. Introduction
Numerous studies have demonstrated that metabolic abnormalities may be a key pathogenic factor in cancer [1]. For instance, impaired macrophage phagocytosis contributes to tumor susceptibility induced by psychological stress [2], and hormonal dysregulation caused by anxiety and depression increases the incidence of breast cancer [3]. Moreover, even in the extremely early stages of breast tumor development, before any morphological changes are visible, rapid cell division leads to localized metabolic abnormalities, including hyperperfusion, hypoxia, and angiogenesis [4].
Currently, multiple clinical modalities are available for blood flow detection, each with inherent limitations. For instance, Doppler ultrasound is restricted to large-vessel hemodynamic assessment [5]; laser Doppler flowmetry (LDF) and laser speckle contrast imaging (LSCI) are limited to superficial tissue monitoring [6]; and perfusion magnetic resonance imaging (pMRI) suffers from high operational costs and a lack of intraoperative compatibility [7]. Moreover, both Doppler ultrasound and laser Doppler measure blood flow velocity along a specific major vessel in which a dominant flow direction exists. At the microvascular level, however, red blood cells move diffusely in various directions, and Doppler techniques struggle to measure velocity because no dominant direction exists in the capillary network. By contrast, near-infrared diffuse correlation spectroscopy/tomography (DCS/DCT) quantifies the temporal autocorrelation function of the light electric field, which is sensitive to the Brownian motion of red blood cells in arbitrary directions. The quantity characterizing this motion is referred to as the blood flow index (BFI), with units of cm²/s; the unit represents the spatial rate of blood flow dispersion per unit time and is specifically suited to evaluating microcirculatory perfusion efficiency. Although the flow units of DCS/DCT and Doppler techniques are not directly comparable, blood flow measurements by DCS/DCT have been widely validated in previous studies, as evidenced by high correlations with routine flow modalities such as Doppler ultrasound [8], laser Doppler [9], arterial spin labeling magnetic resonance imaging (ASL-MRI) [10], and ¹⁵O-water PET [11]. Moreover, previous reports [11] have demonstrated that the BFI can be calibrated and converted to conventional absolute cerebral blood flow (CBF) units (mL/100 g/min). Another microscopic structural imaging technique is optical sectioning, which eliminates out-of-focus interference by optical means to achieve high-resolution three-dimensional, yet superficial, structural visualization [12].
12]. Recently, diffuse correlation spectroscopy/tomography (DCS/DCT) has been widely used to assess various diseases characterized by localized perfusion abnormalities, including neurological deficits (e.g., acute stroke, intracerebral hemorrhage, traumatic brain injury, and subarachnoid hemorrhage [
13,
14]), skeletal muscle disorders (e.g., myasthenia gravis, progressive muscular dystrophy, periodic paralysis, and burn injuries [
15,
16,
17]), and breast pathologies (e.g., tumors and hyperplasia [
18,
19]).
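As context for the autocorrelation-based measurement principle described above, the following short sketch (not from the paper) illustrates how a normalized intensity autocorrelation g2(τ) can be estimated from a detected intensity trace; the trace, its length, and the helper name g2_from_intensity are illustrative assumptions rather than part of the instrumentation discussed here.

```python
# Minimal sketch: estimate g2(tau) = <I(t) I(t+tau)> / <I(t)>^2 from an
# intensity trace. The trace `I` and its sampling step are hypothetical.
import numpy as np

def g2_from_intensity(I: np.ndarray, max_lag: int) -> np.ndarray:
    """Estimate g2 at integer lags 0 .. max_lag-1 (in units of the sampling step)."""
    I = np.asarray(I, dtype=float)
    mean_sq = I.mean() ** 2
    g2 = np.empty(max_lag)
    for k in range(max_lag):
        # Average of the product of the trace with its k-sample shifted copy.
        g2[k] = np.mean(I[: len(I) - k] * I[k:]) / mean_sq
    return g2

rng = np.random.default_rng(0)
I = 1.0 + 0.3 * rng.standard_normal(100_000)   # toy intensity fluctuations
print(g2_from_intensity(I, max_lag=5))         # ~[1.09, 1.0, 1.0, ...] for uncorrelated noise
```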
Compared with DCS, DCT enables spatial reconstruction of BFI distributions, providing critical advantages for tumor localization and morphological characterization (including size, shape, and position). Like other tomographic techniques, diffuse correlation tomography requires multiple source–detector pairs to acquire sufficient measurement data. However, owing to practical constraints on instrumentation cost, the acquired data volume remains substantially smaller than the number of reconstruction voxels, resulting in severely ill-posed inverse problems in diffuse correlation tomography [20,21,22,23]. Traditional diffuse correlation tomography employs two primary algorithmic approaches: analytical methods and numerical analysis methods (NAMs). To address the inherent ill-posedness of the inverse problem, these algorithms incorporate various regularization techniques to improve BFI reconstruction quality. In analytical approaches, the inversion of the correlation diffusion equation is typically achieved through singular value decomposition (SVD), and the Tikhonov regularization parameter λ is optimally determined via L-curve analysis [22,23]. However, analytical methods require the assumption of a semi-infinite tissue geometry, which restricts their applicability to imaging highly curved brain surfaces or irregular tumor morphologies. As a representative numerical approach, the finite element method (FEM) has been adapted from diffuse optical tomography (DOT) [20] to diffuse correlation tomography (DCT) [24,25] to overcome these geometric and heterogeneity constraints. In this method, a modified Tikhonov minimization, equivalent to applying a Laplacian-type filter [26], is used to minimize variation within each region. Nevertheless, current FEM implementations still have limitations, because only a single data point from the normalized electric field autocorrelation function g1(τ) curve is utilized for BFI reconstruction, potentially introducing imaging errors.
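To make the analytical pipeline described above concrete, the following minimal sketch shows an SVD-based Tikhonov inversion of a linearized forward model y = Ax; the matrix sizes, the helper name tikhonov_svd, and the fixed λ value are assumptions made for illustration, and in practice λ would be selected from the L-curve rather than fixed by hand.

```python
# Minimal sketch, assuming a linearized forward model y = A x, where x is the
# voxel-wise BFI perturbation and y the measured projection data.
import numpy as np

def tikhonov_svd(A: np.ndarray, y: np.ndarray, lam: float) -> np.ndarray:
    """Solve min_x ||A x - y||^2 + lam^2 ||x||^2 through the SVD of A."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    filt = s / (s ** 2 + lam ** 2)       # damps components with small singular values
    return Vt.T @ (filt * (U.T @ y))

rng = np.random.default_rng(1)
A = rng.standard_normal((48, 1024))      # 48 measurements, 1024 unknown voxels
x_true = np.zeros(1024)
x_true[100:110] = 1.0                    # a small "anomaly" in the voxel vector
y = A @ x_true
x_rec = tikhonov_svd(A, y, lam=0.1)      # lam would be picked via the L-curve in practice
```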
To overcome the above-mentioned drawbacks, we previously developed an Nth-order linearization (NL) algorithm [21] that transforms the nonlinear correlation diffusion equation into a linear form through a Taylor series expansion of g1(τ). This approach utilizes the slope of g1(τ) as projection data for BFI reconstruction, enabling adaptive selection of multiple data points for slope fitting to improve the signal-to-noise ratio (SNR) according to the noise characteristics. However, this method introduces three cumulative error sources: (1) a primary bias from the linear regression used to estimate the slope of g1(τ); (2) an additional reconstruction error from iterative solutions of the ill-posed inverse problem using the slope data; and (3) a systematic error in the Siegert relation when converting experimental g2(τ) measurements to g1(τ), where the coherence factor β (typically ≈ 0.5 for single-mode fibers) introduces quantification inaccuracies. This error propagation originates from the fundamental measurement principle: diffuse correlation spectroscopy/tomography reconstructs BFI values from the intensity fluctuations of scattered light caused by moving red blood cells, which requires converting the experimentally measured g2(τ) to g1(τ) via the Siegert relation and thus establishes a cascade of error accumulation throughout the reconstruction pipeline.
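The two conversion steps discussed above, the Siegert relation and the slope fit over the earliest delay times, can be sketched as follows; the synthetic decay rate and the chosen fitting window are toy assumptions, not the NL algorithm's actual implementation.

```python
# Small sketch (not the authors' code): (1) Siegert relation g1 = sqrt((g2 - 1)/beta),
# (2) a linear fit to the earliest part of g1(tau), whose slope serves as
# projection data in the NL approach.
import numpy as np

beta = 0.5                                    # coherence factor, single-mode fiber
tau = np.logspace(-7, -3, 128)                # correlation delay times [s]
gamma = 2.0e4                                 # toy decay rate of g1(tau)
g2 = 1.0 + beta * np.exp(-2.0 * gamma * tau)  # simulated g2(tau) measurement

# Step 1: Siegert relation (clipping guards against noise pushing g2 below 1).
g1 = np.sqrt(np.clip(g2 - 1.0, 0.0, None) / beta)

# Step 2: slope over the earliest delays, where the Taylor expansion holds (tau -> 0).
early = tau < 2.7e-6
slope, intercept = np.polyfit(tau[early], g1[early], 1)
print(slope)  # each step above (Siegert conversion, fit, inversion) can add error
```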
Deep learning, particularly deep neural networks, offers universal approximation capabilities that enable direct nonlinear mapping from raw data to reconstructed images, especially in dynamic scattering media [27]. By establishing an end-to-end mapping from g2(τ) measurements to BFI images, deep learning approaches could potentially bypass the three-stage error accumulation of conventional methods while significantly reducing computational time. However, two critical barriers currently impede this application: (1) the lack of standardized DCT datasets pairing g2(τ) data with corresponding BFI ground truth (GT); and (2) the absence of specialized neural network architectures optimized for DCT-based blood flow reconstruction. Recent advances in deep learning for DCS provide relevant methodological insights: Poon et al. [28] implemented MobileNetV2 for BFI quantification in DCS; Feng et al. [29] developed a ConvGRU network for direct temporal BFI prediction from g2(τ); and Zhang et al. [30] employed support vector machines (SVMs) for g1(τ) denoising prior to BFI calculation. Deep learning has also been applied to DCT blood flow reconstruction. Notably, Liu et al. [31] developed a hybrid architecture combining LSTM networks for g1(τ) curve denoising with convolutional neural networks (CNNs) for tomographic BFI image reconstruction, and Li et al. [32] proposed DCT-Unet, a U-Net variant with six-stage encoding, to leverage prior knowledge in establishing the g1(τ)-to-BFI mapping. While both approaches utilize g1(τ) data, they differ fundamentally in data utilization: Liu's method processes only intermediate g1(τ) segments, whereas Li's network incorporates nearly complete g1(τ) profiles. Both are distinct from the traditional NL algorithms, which are restricted to the initial g1(τ) segments where the Taylor series approximation remains valid (τ→0). Nevertheless, three critical limitations persist: (1) computational constraints prevent utilization of the full g1(τ) waveform because of memory-intensive training requirements; (2) current implementations output either normalized or binarized blood flow values, deviating from absolute flow quantification; and (3) the inherent information loss during the g2(τ)→g1(τ) conversion remains unaddressed.
Therefore, the core contributions of this work have three aspects: (1) Dataset construction: development of a comprehensive paired dataset containing g2(τ) measurements with corresponding BFI GT, incorporating fundamental characteristics of diffuse correlation spectroscopy; (2) Network architecture: implementation of Conv-TransNet, a novel hybrid CNN–Transformer model capable of direct end-to-end mapping from initial g2(τ) segments to BFI tomographic images; and (3) Experimental validation: computational simulations using tissue models with varying anomaly configurations (morphology, spatial distribution, dimensions), phantom studies under controlled conditions, and comparative performance analysis against conventional CNN architectures and NL algorithms.
3. Conv-TransNet Model
In this study, we adapt a hybrid Conv-TransNet architecture (integrating CNN and Transformer modules) for diffuse correlation tomography applications. The complete network architecture is illustrated in Figure 5.
As previously described, the truncated and reshaped g2(τ) matrix serves as the input to the convolutional block (conv, Figure 5a). This block comprises three key components: a 2D convolutional (conv2d) layer, a ReLU activation function, and a normalization layer (norm layer), which collectively extract shallow features from the processed g2(τ) curves. The extracted features are then passed through a fully connected layer, where the 32 × 32 g2(τ) matrix undergoes a linear transformation into a 64 × 256 matrix. This expanded matrix is subsequently processed by the Transformer encoder (Figure 5b), which contains two core components: multi-head attention (MHA) and a multi-layer perceptron (MLP). The encoder employs MHA to capture correlations among sequence elements, while the MLP performs nonlinear transformations; together they enable the network to learn higher-level feature representations. To enhance generalization and mitigate gradient vanishing/exploding issues, residual connections are incorporated throughout the encoder. Finally, the decoder reconstructs a 32 × 32 blood flow image through three convolutional blocks. The key design elements include (1) the norm layer, which accelerates convergence and helps prevent overfitting; (2) the ReLU activation functions, which enhance nonlinear modeling capacity; and (3) uniform convolutional parameters (stride = 1, kernel_size = 3, with padding) maintained across all Conv2D layers.
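A minimal PyTorch sketch of the two building blocks described above (the convolutional block and a Transformer encoder layer with residual connections) is given below; the channel widths, normalization choices, and number of attention heads are assumptions, not the exact configuration used in this work.

```python
# Sketch of the building blocks: Conv2d + ReLU + norm, and an MHA + MLP encoder
# layer with residual connections. Widths and norm types are illustrative.
import torch
import torch.nn as nn

class ConvBlock(nn.Module):
    def __init__(self, in_ch: int, out_ch: int):
        super().__init__()
        # Uniform convolutional parameters from the text: kernel 3, stride 1, padding 1.
        self.conv = nn.Conv2d(in_ch, out_ch, kernel_size=3, stride=1, padding=1)
        self.act = nn.ReLU(inplace=True)
        self.norm = nn.BatchNorm2d(out_ch)

    def forward(self, x):
        return self.norm(self.act(self.conv(x)))

class EncoderLayer(nn.Module):
    def __init__(self, dim: int = 256, heads: int = 8, mlp_ratio: int = 4):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(
            nn.Linear(dim, dim * mlp_ratio), nn.ReLU(inplace=True),
            nn.Linear(dim * mlp_ratio, dim),
        )

    def forward(self, x):                       # x: (batch, 64 tokens, 256 dims)
        h, _ = self.attn(self.norm1(x), self.norm1(x), self.norm1(x))
        x = x + h                               # residual connection around MHA
        return x + self.mlp(self.norm2(x))      # residual connection around MLP
```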
As shown in Figure 5a, the input truncated and reshaped g2(τ) data are first compressed into a low-dimensional space and then mapped to the image domain through feature extraction and fusion. This process involves five convolutional blocks, one linear layer, and nine encoder layers. In essence, the model establishes a direct end-to-end mapping between the input g2(τ) data and the output four-slice spliced blood flow images. The proposed method significantly accelerates blood flow reconstruction while making the results more intuitive.
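Continuing the sketch above, the blocks can be assembled into a rough end-to-end model mapping the 32 × 32 g2(τ) matrix to the 32 × 32 spliced BFI image; all layer widths here are placeholders rather than the trained network's parameters, and ConvBlock/EncoderLayer refer to the illustrative definitions above.

```python
# Rough end-to-end assembly of the pipeline in Figure 5: conv feature extraction,
# linear projection to a 64 x 256 token sequence, nine encoder layers, and a
# small convolutional decoder. Channel counts are placeholders.
class ConvTransNetSketch(nn.Module):
    def __init__(self):
        super().__init__()
        self.stem = nn.Sequential(                     # five conv blocks, per the text
            ConvBlock(1, 16), ConvBlock(16, 32), ConvBlock(32, 32),
            ConvBlock(32, 16), ConvBlock(16, 1),
        )
        self.to_tokens = nn.Linear(32 * 32, 64 * 256)  # 32x32 -> 64x256
        self.encoder = nn.Sequential(*[EncoderLayer() for _ in range(9)])
        self.decoder = nn.Sequential(                  # three conv blocks + output conv
            ConvBlock(16, 16), ConvBlock(16, 16), ConvBlock(16, 8),
            nn.Conv2d(8, 1, kernel_size=3, stride=1, padding=1),
        )

    def forward(self, g2_matrix):                      # (batch, 1, 32, 32)
        x = self.stem(g2_matrix).flatten(1)            # (batch, 1024)
        x = self.to_tokens(x).view(-1, 64, 256)        # (batch, 64, 256)
        x = self.encoder(x)                            # (batch, 64, 256)
        x = x.view(-1, 16, 32, 32)                     # fold tokens back to a feature map
        return self.decoder(x)                         # (batch, 1, 32, 32) BFI image
```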
During training, we adopt the Huber loss as the loss function, with a learning rate of 7 × 10⁻⁵. The model is trained for 300 epochs with a batch size of 4 and optimized using Adam. The implementation is based on Python (version 3.12) and PyTorch (version 2.6.0), running on an Intel Core i7-6700MQ CPU and an NVIDIA RTX 3080 Ti GPU.
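For completeness, a compact sketch of this training configuration (Huber loss, Adam, learning rate 7 × 10⁻⁵, batch size 4, 300 epochs) is shown below; the dataset tensors are placeholders, and ConvTransNetSketch refers to the illustrative model above.

```python
# Sketch of the stated training setup with placeholder data.
import torch
from torch.utils.data import DataLoader, TensorDataset

g2_inputs = torch.randn(64, 1, 32, 32)        # placeholder g2 input matrices
bfi_targets = torch.randn(64, 1, 32, 32)      # placeholder BFI ground-truth images
loader = DataLoader(TensorDataset(g2_inputs, bfi_targets), batch_size=4, shuffle=True)

model = ConvTransNetSketch()
optimizer = torch.optim.Adam(model.parameters(), lr=7e-5)
criterion = torch.nn.HuberLoss()

for epoch in range(300):
    for g2_batch, bfi_batch in loader:
        optimizer.zero_grad()
        loss = criterion(model(g2_batch), bfi_batch)
        loss.backward()
        optimizer.step()
```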
For the CNN architecture, we design seven convolutional blocks, three residual layers, one concatenation layer, and six convolutional layers for feature extraction and fusion. The implementation also employs the Huber loss function and Adam optimizer, with the remaining hyperparameters consistent with the original Conv-TransNet configuration.
4. Evaluation Criteria and Experimental Results
In this study, comprehensive simulations and phantom experiments were designed to rigorously evaluate the performance of the proposed Conv-TransNet model for diffuse correlation tomography. For the simulations, five heterogeneous tissue models with distinct anomaly geometries were tested: a two-dot anomaly (case A), an asymmetric cross-shaped anomaly (case B), a rectangular anomaly (case C), a Z-shaped anomaly (case D), and a two-bar anomaly (case E). For the phantom experiments, two physiologically relevant scenarios were investigated: a quasi-solid cross-shaped anomaly (case F) and speed-varied liquid tubular anomalies (cases G, H, and I). The results of the Conv-TransNet model, the CNN, and the NL algorithm are presented in this section, along with the evaluation criteria.
4.1. Evaluation Criteria
To comprehensively evaluate the performance of Conv-TransNet, we conducted comparative experiments with the CNN and NL algorithm, assessing both visual reconstruction quality and quantitative metrics.
Root mean squared error (RMSE) quantifies the difference between the reconstructed and the true images, and it is defined as follows:

\mathrm{RMSE}=\sqrt{\frac{1}{N}\sum_{i=1}^{N}\left(\frac{\alpha D_{B,i}^{\mathrm{rec}}-\alpha D_{B,i}^{\mathrm{true}}}{\alpha D_{B,i}^{\mathrm{true}}}\right)^{2}}\times 100\%

Here, αD_{B,i}^{rec} and αD_{B,i}^{true} are the reconstructed and true blood flow indices of the ith tissue voxel, respectively, and N is the total number of voxels. A smaller RMSE value indicates that the reconstructed blood flow index image is closer to the true one.
CONTRAST quantifies the difference between the anomalies and the background, and it is defined as follows:

\mathrm{CONTRAST}=\frac{\alpha D_{B\text{-}\mathrm{anomaly}}}{\alpha D_{B\text{-}\mathrm{background}}}

Here, αDB-anomaly is the average reconstructed BFI of the anomalous voxels, and αDB-background is the average BFI of the background. Under the simulated conditions (anomaly blood flow index = 5 × 10⁻⁸ cm²/s, background = 1 × 10⁻⁸ cm²/s), the theoretical maximum CONTRAST value of 5 indicates perfect reconstruction fidelity.
Furthermore, we defined the misjudgment ratio (MJR) to evaluate the performance of the three methods:

\mathrm{MJR}=\mathrm{FPR}+\mathrm{FNR}=\frac{N_{FP}+N_{FN}}{N}\times 100\%

Here, the false-positive rate (FPR) is the fraction of normal voxels misjudged as anomalies, the false-negative rate (FNR) is the fraction of anomalous voxels incorrectly labeled as normal, and N_FP, N_FN, and N denote the numbers of false-positive voxels, false-negative voxels, and total voxels, respectively. For instance, in scenarios with anomalies of 5 × 10⁻⁸ cm²/s against a background of 1 × 10⁻⁸ cm²/s, a normal voxel is flagged as anomalous (false positive) when its reconstructed BFI exceeds the volume-mean BFI by 20% (i.e., 1.2 × 10⁻⁸ cm²/s), and a true anomaly is counted as a false negative when its reconstructed BFI fails to exceed that threshold.
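A minimal sketch of the three metrics, under the definitions above, is provided below; the array names and the exact form of the thresholding are assumptions made for illustration.

```python
# Sketch of RMSE, CONTRAST, and MJR; `rec` and `true` are voxel-wise BFI arrays,
# `anomaly_mask` marks the true anomalous voxels.
import numpy as np

def rmse_percent(rec: np.ndarray, true: np.ndarray) -> float:
    return 100.0 * np.sqrt(np.mean(((rec - true) / true) ** 2))

def contrast(rec: np.ndarray, anomaly_mask: np.ndarray) -> float:
    return rec[anomaly_mask].mean() / rec[~anomaly_mask].mean()

def mjr_percent(rec: np.ndarray, anomaly_mask: np.ndarray) -> float:
    threshold = 1.2 * rec.mean()               # 20% above the volume-mean BFI
    flagged = rec > threshold                  # voxels flagged as anomalous
    n_fp = np.sum(flagged & ~anomaly_mask)     # normal voxels flagged as anomalies
    n_fn = np.sum(~flagged & anomaly_mask)     # anomalies labeled as normal
    return 100.0 * (n_fp + n_fn) / rec.size
```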
4.2. Computer Simulation with Noisy Data
Figure 6 displays the reconstructed spliced blood flow index images derived from noisy g2(τ) data. The first column (A1, B1, C1, D1, and E1) shows the ground truth (GT) tissue models for the five types of anomalies: the two-dot anomaly (first three slices, 2 voxels per slice), asymmetric cross-shaped anomaly (first three slices, 8 voxels per slice), rectangular anomaly (first three slices, 8 voxels per slice), Z-shaped anomaly (first three slices, 8 voxels per slice), and two-bar anomaly (first three slices, 6 voxels per slice). The background BFI is 1 × 10⁻⁸ cm²/s, while the anomalies have a BFI of 5 × 10⁻⁸ cm²/s. The second column (A2, B2, C2, D2, and E2) shows the blood flow index images reconstructed by the Conv-TransNet model for the five types of anomalies, the third column (A3, B3, C3, D3, and E3) displays the images reconstructed by the CNN, and the fourth column (A4, B4, C4, D4, and E4) depicts the images reconstructed by the NL algorithm. From the figures, we observe that all three methods accurately reconstruct the spatial locations of the anomalies. However, in terms of shape and size fidelity, the Conv-TransNet model and the CNN yield more regular and GT-like reconstructions than the NL algorithm. Specifically, the Conv-TransNet model achieves more accurate reconstruction of both the rectangular (Figure 6(C2)) and Z-shaped (Figure 6(D2)) anomalies, precisely matching their true morphology and dimensions without introducing peripheral artifacts. By comparison, while the CNN similarly reconstructs the correct shape and size of these anomalies, it generates a single false-positive voxel in each of the asymmetric cross-shaped and rectangular anomalous regions and three false-positive voxels in the Z-shaped and two-bar anomalies. An additional advantage of the Conv-TransNet model and the CNN is their consistent reconstruction quality regardless of anomaly depth. By contrast, the performance of the NL algorithm deteriorates progressively with depth, with an almost complete failure to recover the anomalous blood flow index in the third layer.
For quantitative evaluation, the RMSE, CONTRAST, and MJR values are listed in Table 1. As shown there, the Conv-TransNet model achieves significantly lower RMSE values for all five types of anomalies than both the CNN and the NL algorithm. Specifically, for the rectangular anomaly, the RMSE of the NL algorithm (13.62%) and of the CNN (9.32%) are approximately 6.33 and 4.33 times that of the Conv-TransNet model (2.15%), respectively. This demonstrates that the Conv-TransNet reconstruction achieves superior fidelity to the GT.
CONTRAST and MJR metrics were calculated based on the first two slices, as the performance of the NL algorithm degrades significantly in the third slice. The CONTRAST values of the five different anomalies reconstructed by the Conv-TransNet model are 5.13, 4.39, 4.62, 4.13, and 4.20, respectively. Although the CONTRAST values for the cross-shaped and rectangular anomalies are slightly lower than those obtained by the CNN (4.67 and 4.92, respectively), they remain close to the theoretical optimum of 5 (background: 1 × 10⁻⁸ cm²/s; anomaly: 5 × 10⁻⁸ cm²/s) and are significantly superior to the results of the NL algorithm.
For the MJR metric, the Conv-TransNet model achieved nearly 0% values across all five types of anomalies. Both the CNN and NL methods showed 0% MJR for the two-dot anomaly. However, for the cross-shaped anomaly, the CNN showed a 0.39% MJR, primarily due to FPR (2 false-positive voxels out of 512 voxels in the first two slices), and the NL algorithm exhibited a 2.34% MJR, also FPR-dominated. Similarly, for the rectangular anomaly, the CNN maintained a 0.39% MJR and the NL algorithm a 4.69% MJR, both predominantly due to FPR. For the Z-shaped and two-bar anomalies, the Conv-TransNet model produced only one false-positive voxel, corresponding to a 0.19% MJR; these rates are substantially lower than those achieved by the CNN and NL methods.
To investigate whether truncating and reshaping the g2(τ) curves causes information loss, and whether retaining more delay-time points improves the results, we analyzed case C (rectangular anomaly) and case E (two-bar anomaly). Specifically, (1) we truncated each of the 48 g2(τ) curves to the first 30 data points and reshaped them into a 38 × 38 matrix, and (2) we alternatively used the first 48 data points from all 48 curves to directly construct a 48 × 48 matrix without reshaping. The reconstructed images for the three different delay-time lengths are shown in Figure 7. The first row of Figure 7 presents the reconstructed images of case C (rectangular anomaly), with (a) the GT and (b)–(d) the reconstructions using 22, 30, and 48 delay-time points, respectively. The second row shows case E (two-bar anomaly), where (e) presents the GT and (f)–(h) the reconstructions using 22, 30, and 48 delay-time points, respectively. The results clearly demonstrate that reconstruction quality deteriorates as the delay time τ increases; notably, the reconstruction with 48 delay-time points shows severe distortion.
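The truncate-and-reshape step examined here can be sketched as follows; the padding of the flattened values into a square matrix is an assumption made only for this illustration, since the exact packing used in our preprocessing is not repeated here.

```python
# Sketch: truncate each g2(tau) curve and tile the values into a near-square matrix.
import numpy as np

def truncate_and_reshape(g2_curves: np.ndarray, n_points: int, side: int) -> np.ndarray:
    """g2_curves: (48, full_length) array; returns a (side, side) input matrix."""
    flat = g2_curves[:, :n_points].ravel()       # 48 * n_points values
    out = np.full(side * side, flat.mean())      # pad (or truncate) to side*side values
    k = min(flat.size, out.size)
    out[:k] = flat[:k]
    return out.reshape(side, side)

g2_curves = 1.0 + 0.5 * np.random.rand(48, 64)   # placeholder measurements
m22 = truncate_and_reshape(g2_curves, 22, 32)    # 48 x 22 points -> 32 x 32 input
m30 = truncate_and_reshape(g2_curves, 30, 38)    # 48 x 30 points -> 38 x 38 input
m48 = g2_curves[:, :48]                          # 48 x 48 input used directly
```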
Quantitative analysis in Table 2 further confirms this trend, showing that reconstruction quality worsens as the number of delay-time points increases. For the two-bar anomaly, the RMSE increases from 4.39 to 15.93, while the CONTRAST decreases from 4.24 to 3.87. Potential reasons for these observations are discussed in the Discussion section.
4.3. Phantom Experiment
4.3.1. The Quasi-Solid Cross-Shaped Anomalous BFI Phantom Experiment
Figure 8 presents the reconstruction results of the quasi-solid cross-shaped anomaly (case F) phantom experiment, with panels (F1), (F2), and (F3) showing the outputs of the Conv-TransNet model, the CNN, and the NL algorithm, respectively. Analysis of the figure reveals that all three methods clearly resolve both the spatial location and the morphological characteristics of the quasi-solid cross-shaped anomaly. Comparative analysis demonstrates that both Conv-TransNet and the CNN yield more regular morphological features and significantly fewer artifacts than the NL algorithm. Furthermore, closer inspection reveals that the Conv-TransNet reconstruction contains marginally fewer artifacts than the CNN result.
For quantitative evaluation in the phantom experiments, where ground truth (GT) data were unavailable, CONTRAST and MJR served as the primary metrics for comparing the performance of the three methods. As shown in Figure 8(F4), the CONTRAST values over the first three slices are 0.59, 0.60, and 0.78 for the Conv-TransNet model, the CNN, and the NL algorithm, respectively. Although both deep learning methods achieved nearly identical CONTRAST values, the Conv-TransNet model demonstrated a significantly lower MJR (2.21% vs. CNN: 3.13%; Figure 8(F5)) under our revised FPR criterion. Here, the FPR was redefined to classify normal voxels as false positives when their BFI falls 10% or more below the mean of the entire solution, a criterion accounting for the inherently lower BFI of the quasi-solid anomaly. Error analysis confirmed that the misjudgments are predominantly false positives for both deep learning methods. The NL algorithm yielded a substantially higher MJR of 4.56%, and error decomposition showed that it was predominantly FNR, a clinically significant drawback that may lead to missed diagnoses in practical applications.
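For clarity, the revised low-BFI criterion can be expressed as a one-line thresholding rule, sketched below with a placeholder reconstruction volume; the exact comparison used in our analysis may differ in detail.

```python
# Sketch of the 10%-below-mean flagging rule for the low-BFI quasi-solid phantom.
import numpy as np

rec = np.random.rand(2, 16, 16) * 1e-8          # placeholder: two slices of 16 x 16 voxels
flagged_low = rec <= 0.9 * rec.mean()           # voxels flagged under the revised criterion
```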
4.3.2. The Speed-Varied Liquid Tubular Anomaly Phantom Experiment
All three methods were evaluated on tubular anomalies at varied flow velocities (100–600 mL/h in 100 mL/h increments). Representative reconstructions for 200 mL/h (case G), 400 mL/h (case H), and 600 mL/h (case I) are displayed in Figure 9, together with the quantitative metrics (CONTRAST and MJR) computed from the first two slices.
Figure 9 demonstrates the superior performance of the deep learning methods, with Conv-TransNet (G1, H1, I1) and the CNN (G2, H2, I2) producing more uniform backgrounds and sharper tubular anomaly contours than the NL algorithm (G3, H3, I3). Notably, both deep learning methods maintain consistent anomaly detection even at low flow rates (200 mL/h) and across varying depths, while avoiding the performance degradation observed in the NL algorithm results.
Figure 9 further demonstrates that while deep learning methods enhance reconstruction quality for high-flow-rate tubular anomalies (e.g., 600 mL/h), the Conv-TransNet model particularly excels in low-flow-rate conditions (200 mL/h), yielding more discernible anomaly delineation compared to conventional approaches.
Figure 9(I4) demonstrates a flow-rate-dependent increase in CONTRAST for all methods, with particularly significant improvements at lower flow velocities. At 100 mL/h, the Conv-TransNet model achieves a 46.3% higher CONTRAST than the NL algorithm (2.15 vs. 1.47). This improvement gradually stabilizes at higher flow rates (200–600 mL/h), with increments of 0.56, 0.53, 0.43, 0.50, and 0.53, respectively, and a mean improvement of 0.55 ± 0.06 across all tested flow rates. Regarding MJR, as shown in Figure 9(I5), the NL algorithm maintains an MJR of approximately 15%, with only marginal improvement as the flow velocity increases. By contrast, the deep learning methods achieve much lower MJR values, approaching zero at higher flow rates. These results provide further evidence of the superior performance of deep learning over the conventional NL algorithm.
Quantitative comparison reveals that while the Conv-TransNet model achieves marginally higher CONTRAST values than the CNN, it demonstrates substantially superior MJR performance. As evidenced in Figure 9(I5), (1) at the 100 mL/h flow rate, Conv-TransNet attains a 6.84% MJR versus the CNN's 8.01% (a 14.6% relative improvement); and (2) at 600 mL/h, Conv-TransNet reaches a perfect 0% MJR compared to the CNN's residual 0.78% error rate.
5. Discussion
This work proposes a CNN–Transformer model for direct mapping of g2(τ) data to BFI images, outperforming conventional approaches. However, the novel insights and reconstruction challenges identified warrant further in-depth analysis.
The primary advantage of the Conv-TransNet model is its ability to directly process the g2(τ) data matrix as the input without complex preprocessing or manual feature extraction, achieved through a self-attention mechanism that models inter-element dependencies via pairwise correlation calculations [38]. Furthermore, to optimize computational efficiency and reduce storage demands, the raw g2(τ) curves are truncated and resampled. As illustrated in Figure 2h and Figure 5a, the truncated g2(τ) is transformed into a discrete representation, denoted as g2(n); here, the x- and y-axes represent discrete data points only, decoupled from the original correlation time τ.
Certainly, the proposed Conv-TransNet model also has limitations. For instance, the CONTRAST values for the cross-shaped (4.62) and rectangular (4.13) anomalies are slightly lower than those achieved by the CNN (4.67 and 4.62, respectively). This performance gap may stem from the reliance of Transformer architectures on large-scale datasets. Although we constructed 18,000 noise-free and noisy data pairs, these were essentially generated through positional variations of merely ~100 distinct anomaly prototypes and thus lack true sample independence. In such small-sample, strongly ill-posed, and noise-corrupted inverse tasks, CNNs often demonstrate superior practical value owing to their local inductive bias (spatial priors) and parameter efficiency. Moreover, when training data are insufficient or the architecture is not fully tailored to the physical model, hybrid designs may inadvertently compromise sensitivity to subtle hemodynamic variations.
In the computer simulations, anomalies with varying BFI values were randomly selected for testing, yet all happen to exhibit higher BFI than the background (Figure 6); the speed-varied liquid tubular anomalies likewise have higher BFI. Such high-BFI anomalies are used to mimic hyperperfusion states induced by malignant tumors, neural activation, or high-intensity exercise. By contrast, the quasi-solid cross-shaped anomaly, with lower blood flow than the surrounding medium (Figure 8), is designed to simulate clinical scenarios of hypoperfusion caused by calcified tissue or ischemia. It is important to note that DCS/DCT detects the Brownian motion of particles rather than flow velocity directly [34]. The cross-shaped anomaly is quasi-solid, so its Brownian motion is attenuated yet persists, albeit weaker than in the background liquid solution; its expected CONTRAST should therefore be less than 1. The Conv-TransNet model produces a CONTRAST of 0.59, significantly lower than that of the NL algorithm (0.77), indicating its higher accuracy in capturing the physical characteristics of the anomaly, in which the quasi-solid state attenuates Brownian motion.
Another challenge lies in the complexity of the anomalies. As shown in Table 1, the RMSE increases with anomaly complexity. The two-dot anomaly represents the simplest case, consistently yielding the lowest RMSE across all three methods. The asymmetric cross-shaped, rectangular, and Z-shaped anomalies all occupy 8 voxels per slice in the first three slices; nevertheless, the asymmetric cross-shaped and Z-shaped anomalies exhibit higher RMSE values (4.43 and 4.05) than the rectangular anomaly (2.15). This discrepancy stems from the higher boundary curvature of these more complex shapes.
In Figure 7 and Table 2, we examine the influence of the correlation delay time on the quality of the reconstructed images, revealing superior anomaly reconstruction with shorter delays. The underlying reason can be explained by the principle of DCT: at short correlation delay times τ, the correlation is strong and even a slight increase in delay causes a rapid decrease in g2(τ), whereas at longer τ the correlation is weak and g2(τ) decays only gradually. Consequently, data acquired at large delay times are less conducive to the model's extraction of informative features, which in turn impairs reconstruction quality. This principle also underlies both the NL algorithm [21] and the FEM [26], in which shorter delay times (τ ∈ [10⁻⁷, 2.7 × 10⁻⁶] s) are selected to optimize reconstruction accuracy.
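As a brief illustration, selecting this delay-time window from a measured g2(τ) curve amounts to the following masking step; the lag grid and decay rate are toy assumptions.

```python
# Keep only the early, strongly correlated lags tau in [1e-7, 2.7e-6] s.
import numpy as np

tau = np.logspace(-7, -3, 128)                      # correlator lag times [s]
g2 = 1.0 + 0.5 * np.exp(-2e4 * tau)                 # toy g2(tau) measurement
keep = (tau >= 1e-7) & (tau <= 2.7e-6)
tau_early, g2_early = tau[keep], g2[keep]           # portion retained for reconstruction
```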
In the phantom experiments, the FNR and FPR thresholds differ because of the different BFI levels of the two kinds of anomalies, but the assessment logic is unified. For the Conv-TransNet model and the CNN, the MJR is dominated by FPR, whereas for the NL algorithm it is dominated by FNR. False negatives, however, pose significant clinical risks for tumor detection and treatment. This limitation stems from the ill-posed linear equations and the back-projection initial values used by the NL algorithm, which inevitably lead to homogenized reconstructed images [36].
Furthermore, as shown in Figure 9, while the deep learning methods enhance reconstruction quality for tubular anomalies at both lower and higher flow velocities, they show more limited improvement at intermediate flow rates. For instance, the CONTRAST obtained by the Conv-TransNet model increases by 0.68, 0.56, 0.53, 0.43, 0.50, and 0.53 at flow rates of 100–600 mL/h (in 100 mL/h increments), respectively. This phenomenon may be attributed to the imbalanced distribution of the training data, which is predominantly concentrated at two extremes: 1 × 10⁻⁸ cm²/s and 5 × 10⁻⁸ cm²/s. Such a data distribution enhances contrast sensitivity at low flow velocities and accommodates high-flow-rate conditions well, but it appears to compromise performance in the intermediate range.