3.1. Theoretical Performance of the DDKF
In this section, we prove that when
, the DDKF estimation of state
x is unbiased and the variance converges. The proof is classical and can be referred to in the article [
16].
The temporary state estimation error of sensor
i is defined as
, and the final state estimation error of sensor
i is defined as
. According to Equations (
15) and (
16), we have:
Then, the step 8 in Algorithm 2 can be rewritten as follows:
where
.
Because for any . Thus, , which means that the DDKF estimation of the state variables is unbiased.
Let
be the covariance of
,
; then:
Assuming that the matrices
do not change with
t, which means that they are constant value matrices, the evolution matrix
F is stable, which means that all eigenvalues of
F have a modulus lengths of less than one, and in
, the following limitations exist:
Then, there exists a constant
such that
, where
is the unique solution to the following equation:
where
3.2. DDKF in the Toy Model
Before applying the DDKF method to assimilate large-scale neural networks, we first validated the effectiveness of the DDKF method through a small network toy model. Here, we use a simplified dynamic mean field model (DMF) [
20] to calculate the global dynamics of the entire cerebral cortex. DMF consistently expresses the temporal evolution of collective activities of different excitatory and inhibitory neural populations. In the DMF model, the firing rate of each population depends on the input current entering the population. On the other hand, the input current depends on the firing rate of the respective populations. Therefore, the population firing rate can be determined automatically and consistently using a simplified system of coupled nonlinear differential equations that represent the population firing rate and input currents, respectively. The topology of the large-scale mean field model is generated according to the neuroanatomical connection DWI between these brain regions, which is connected by the structural connection matrix (SC).
The global brain dynamics are described by the following set of coupled nonlinear stochastic differential equations:
For the brain region
i,
represents the average firing rate of the excitatory neuron population,
represents the average synaptic gating variable of the excitatory neuron population,
represents the excitatory input current, and
represents the connecting structure between brain area
i and brain area
j, which is generally derived from neural anatomical data.
represents the activation function of the excitatory neuronal population, which maps the input current of the neuronal population to the firing rate, and
are the constant parameters of this function. Correspondingly, the superscripts marked with
I represent the relevant variables of the inhibitory neuron population. The parameters of the synaptic dynamics equation are set as
.
is the background input current, and
are the parameters that regulate the background current of excitatory and inhibitory neuronal populations, respectively.
is the synaptic current conductivity coefficient of NMDA,
is the synaptic current conductivity coefficient of GABA, and
G is the global coupling coefficient, which are used to regulate the neuronal population to maintain a lower level of spontaneous activity.
represents the weight coefficient from the excitatory to the inhibitory neuronal population when connected to a neural network. Similarly,
are defined as the weight coefficient.
is the Gaussian noise with a variance of
. We set up the model such that
can be modified, and other parameter settings can be found in the article [
21].
Then, based on the above model, we set the parameters in advance and then use DDKF to estimate . We compare the estimated values of model parameters and state values with the preset values to verify the effectiveness of our proposed method. To better compare the differences between the DDKF, DKF, and diffKF and to facilitate their application in brain network data assimilation, we set the number of brain regions in the DMF network to . At this point, the network connection matrix is , where elements are randomly sampled from according to a uniform distribution. The global coupling coefficient is , . The DMF model is nonlinear. We use the idea of ensemble Kalman filtering to calculate the covariance matrix, which generates simulated samples (particles) to simulate the system. By calculating the variance of the samples, we obtain the state covariance matrix of the system.
Assuming that our observation of the model is the average excitation rate of each brain region, the average excitation rate of the brain region
i is:
Then, we estimate the parameters of each brain region in the model
and the state variables based on the observation sequence. As mentioned in
Section 2.1.1,
is regarded following the equation
. Thus, the state variables of DMF are
.
In this paper, the sensor network required for the deep diffusion algorithm is set as follows: the sensor network is fully connected, and sensor
i can only observe the average excitation rate of the brain region
i. Finally, we use the average estimation of all sensors as the final estimate of the network state. When estimating the state of brain region
r, we believe that the estimation performance of sensor
r is better than that of other sensors. When information diffusion occurs, the proportion of sensor
r should be higher than that of other sensors. Therefore, we design the following diffusion matrix
:
Easy to verify, the diffusion matrix
satisfies Equation (
9), and we call
the isolation coefficient of the deep diffusion algorithm. When
, the weight of the sensor itself during the diffusion process is 0; that is, it completely adopts the system state estimated by other sensors. When
, the weight of the sensor itself is 1; this means that the system state is only estimated by the sensor itself, and the algorithm is equivalent to the DKF method without neighbors. When
, the weight of the sensor itself is the same as that of other sensors; that is, all sensors estimate the same system state after the diffusion process. At this time, the algorithm is equivalent to the diffKF method where each element of the diffusion matrix is 1. To gain a more intuitive understanding of the Algorithm 2, we have provided a detailed algorithm process under the diffusion matrix
in
Appendix B.
We note that the estimation error of the model parameters is:
and the assimilation error of the observation data is:
As shown in
Figure 2, we demonstrate the impact of the value of
on the data assimilation effect. It can be seen that under different
values, the parameters and observation can ultimately be accurately estimated. More details can be found in
Appendix B.
Notice that the definition of observation error represents the cumulative error, which results in the error seeming to be not too small. We define the real-time error as follows and calculate the errors under the different isolation coefficient
, as shown in
Table 2.
However, according to
Figure 2a, a much larger
leads to a much slower convergence speed. Thus, there is a balance between convergence speed and accuracy. We need to choose the proper isolation coefficient using prior knowledge to ensure system assimilation effectiveness.
3.3. Resting-State Twin Brain Assimilation
We use the deep diffusion Kalman filter method to estimate the state variables of the large-scale cortex model according to real resting-state brain BOLD signals. In this section, we assume that the AMPA synaptic conductance can be modified, and we add to the state variables. The real resting-state brain BOLD signals are a series of observation data from a total of time points, which were generated by the fMRI scanner with an observation frequency of 1.25 Hz. At any observation time, the dimension of the BOLD signals is the same as the number of brain regions in the cortex. The calculation iteration step of the neuronal network is still 1 ms, but to mimic the real BOLD signals series measured by fMRI, we set the observation function as a down-sampling process, recording every 800 ms interval.
Similarly, we compare the DKF, diffKF, and DDKF methods, which were evaluated by the assimilation error of the observation data and real-time error at the final time
, shown in
Table 3. The BOLD signals can be estimated well with a lower isolation coefficient and within a proper range; the larger the coefficient, the more accurate the estimation is. Excessive coefficients are not conducive to information exchange in sensor networks, making this method ineffective.
Furthermore, we demonstrate the assimilation error of observation data during DDKF data assimilation, shown in
Figure 3. Combined with
Table 3, although smaller coefficients converge faster at the beginning, their final accuracy is not as good as larger coefficients.
Multi-scale brain connectivity refers to the physical and functional connectivity patterns between a group of brain units on multiple spatial scales. For large and complex systems such as the human brain, brain units can be defined as individual neurons, voxels, or macroscopic brain regions. The connectivity between these units can be further divided into the anatomical structural connectivity (SC) [
22], functional connectivity (FC) [
23] between their functional states, or effective connectivity of their inferred causal interactions (EC) [
24]. Brain connectivity is the foundation of signal propagation in neural networks, making it crucial for understanding cognition and the flow of information and activity in the brain during health and disease.
The BOLD signal reflects the changes in activity and metabolism of different brain regions during fMRI brain imaging acquisition. It is believed that if the synchronization of BOLD signal changes in two brain regions is high, that is, if the synchronization of brain activity metabolism is high, then there should be a strong functional connection between these two brain locations. Usually, the FC matrix can be obtained by calculating the Pearson correlation coefficients of the BOLD signals of different brain regions [
25]. Let denote the BOLD signal in brain region
i as
, then the
ith row and
jth column elements of the FC matrix are:
Based on the BOLD signal that we collected, we can draw an empirical functional connection matrix as shown in
Figure 4. However, after data assimilation, we obtain a neural network system that can generate the BOLD signal, and we can observe the system by sampling neurons. According to the BOLD signals generated by the assimilated computational network, we can also calculate the assimilated functional connected matrix as shown in
Figure 5. The correlation coefficient between the empirical and assimilated FC matrix is
up to 98.42%. At the same time, some methods that adjust the global coupling parameter by exhaustive search can only achieve a correlation of around 50% [
26,
27], and this fully demonstrates that DA, as a data-simulating method, can effectively estimate BOLD signals and the FC matrix.
On the other hand, based on the activation of neurons in different brain regions per millisecond, we can also draw a connection matrix, as shown in
Figure 6. The calculation method for the firing rate of different brain regions is to calculate the average firing rate of neurons in that brain region within 800 ms. It can be found that there is a certain similarity with the empirical FC matrix where the correlation coefficient is
, and we believe that this statistical method can become a new tool for calculating functional connectivity matrices. Because we have achieved neuron-level modeling, we can study the process of synaptic transmission between brain regions or voxels or even between neurons, which is further helpful for us to understand the resting-state brain. Moreover, we can also study the mechanisms and treatment methods of brain diseases by conducting experiments on computational brain models without any moral or ethical issues.