Next Article in Journal
Hippocampus Parcellation via Discriminative Embedded Clustering of fMRI Functional Connectivity
Previous Article in Journal
Persistent Non-Suicidal Self-Injury and Suicidality in Referred Adolescents: A Longitudinal Study Exploring the Role of Cyclothymic Temperament
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

How Sample Size Impacts Probabilistic Stimulation Maps in Deep Brain Stimulation

1
Department of Biomedical Engineering, Linköping University, 58185 Linköping, Sweden
2
Department of Clinical Science, Neuroscience, Umeå University, 90185 Umeå, Sweden
3
Institute for Medical Engineering and Medical Informatics, School of Life Sciences, University of Applied Sciences and Arts Northwestern Switzerland, 4132 Muttenz, Switzerland
*
Author to whom correspondence should be addressed.
Brain Sci. 2023, 13(5), 756; https://doi.org/10.3390/brainsci13050756
Submission received: 24 March 2023 / Revised: 25 April 2023 / Accepted: 28 April 2023 / Published: 3 May 2023
(This article belongs to the Section Neurotechnology and Neuroimaging)

Abstract

:
Probabilistic stimulation maps of deep brain stimulation (DBS) effect based on voxel-wise statistics (p-maps) have increased in literature over the last decade. These p-maps require correction for Type-1 errors due to multiple testing based on the same data. Some analyses do not reach overall significance, and this study aims to evaluate the impact of sample size on p-map computation. A dataset of 61 essential tremor patients treated with DBS was used for the investigation. Each patient contributed with four stimulation settings, one for each contact. From the dataset, 5 to 61 patients were randomly sampled with replacement for computation of p-maps and extraction of high- and low-improvement volumes. For each sample size, the process was iterated 20 times with new samples generating in total 1140 maps. The overall p-value corrected for multiple comparisons, significance volumes, and dice coefficients (DC) of the volumes within each sample size were evaluated. With less than 30 patients (120 simulations) in the sample, the variation in overall significance was larger and the median significance volumes increased with sample size. Above 120 simulations, the trends stabilize but present some variations in cluster location, with a highest median DC of 0.73 for n = 57. The variation in location was mainly related to the region between the high- and low-improvement clusters. In conclusion, p-maps created with small sample sizes should be evaluated with caution, and above 120 simulations in single-center studies are probably required for stable results.

1. Introduction

Deep brain stimulation (DBS) provides electrical pulses to alleviate symptoms of different movement disorders, psychiatric disorders, pain conditions, etc. Depending on the disorder and pathophysiology, different target structures have been suggested, some more established than others. However, the disorders are often complex and multifactorial, and the optimal location of stimulation within the different targets is still debated [1].
Probabilistic stimulation maps (PSM) were introduced to objectively investigate the optimal stimulation location. PSMs combine an estimate of the volume of tissue activated (VTA) by the stimulation and the clinical outcome to create a statistical mapping of the DBS effect for individual patients [2], or patient groups [3,4,5]. A common procedure for VTA estimation is electric field simulation based on the location of the DBS lead in the anatomy and the stimulation settings [6].
When PSMs are created by applying statistical tests in each voxel, a multitude of comparisons are conducted on the same data. The resulting probability map (p-map) will therefore require correction for multiple comparisons (Type 1 error). There are several methods for this, e.g., permutation procedures [7,8,9] or false-discovery rate (FDR) [10,11] where permutation test is more common. Some analyses fail to survive the correction for multiple comparisons, and the limited number of patients (e.g., approximately 10–30 patients) in several DBS-PSM studies can be questioned [12,13,14].
A recent publication reviewing data from several studies on essential tremor (ET) also showed a notable variation in the optimal stimulation spot [13]. A future possibility is to predict the effect in new patients based on PSMs, but that requires reliable results of the mappings. Several publications have included predictive algorithms based on PSMs [11,13,15,16], but they explain the interpatient variability poorly. The influence of the amount of data on the mapping results has not yet been explored.
This paper aims to investigate the variation in p-maps from DBS with regard to sample size. The evaluated parameters are the overall p-map significance together with the volume and location of significant voxels.

2. Materials and Methods

Thorough descriptions of the clinical procedure [17] and PSM method [3] have been published previously. Section 2.1, Section 2.2, Section 2.3 and Section 2.4 will contain a brief summary.

2.1. Clinical Data

Data from 61 patients with ET operated on in the posterior subthalamic area (caudal zona incerta) between 2003–2016 (lead 3387 or 3389, Medtronic, Minneapolis, MN, USA) were retrieved. The leads in the left hemisphere only were used for analysis. Written informed consent was obtained from the patients and the study was approved by the local Ethics Committee at Umeå University (Dnr. 01-123M, 2017/122-31). The patient data included pre-operative MRI (T1- and T2-weighted), postoperative CT for lead localization, stimulation parameters, and clinical improvement according to the essential tremor rating scale (ETRS) [18]. The analysis was based on data from a monopolar review 1 year after surgery, i.e., each contact of the DBS quadripolar lead was tested for clinical improvement using standardized pulse width (60 μs) and repetition frequency (145 Hz). During this test, the amplitude was increased in increments of 0.2–0.3 V up to 5 V, and the lowest amplitude generating the best outcome according to ETRS items 5/6 and 11–14 was noted. This gave a total of 244 stimulations.

2.2. Electric Field Simulations

To estimate the tissue impacted by the stimulation, the electric field was simulated with the finite element method (FEM) (COMSOL Multiphysics v. 5.5, COMSOL AB, Stockholm, Sweden) [19]. This method uses the volume conductor model with a heterogeneous isotropic tissue conductivity based on segmentation of the pre-operative T2-weighted MRI. The DBS lead was modeled according to manufacturer specification and placed in the tissue volume according to the post-operative CT. A 250 µm thick peri-electrode space surrounding the lead was modeled as white brain matter [20]. The simulations were computed in a box of size 100 × 100 × 100 mm with a physics-controlled mesh (the finest element size closest to the lead).

2.3. Image Normalization

For group analysis, all patients were transformed to the same space, a group-specific template [3]. As a first step, the preoperative MRI was aligned to the MNI nLin-aSym 2009b [21] using an affine transform. Subsequently, 8 iterations of non-linear registration and template update were performed to refine the template. The resulting template is an average representation of the patient group. The electric field norm from the simulations was then transformed to the template space using the warp fields computed during the normalization. As a last step, the electric fields were sampled on the template voxel grid to facilitate spatial group analysis.

2.4. Probabilistic Stimulation Maps

The resampled electric fields were used to compute probabilistic maps (p-maps) with a voxel-wise statistical test [3]. Before analysis, the electric field was truncated at 0.2 V/mm since lower magnitudes are assumed to not invoke any clinical response. A t-statistics based on the electric field magnitude and the clinical improvement in tremor reduction (TR) was computed in each voxel according to
t i = T R ^ i 70 s i 2 N i
where N is the number of simulations above 0.2 V/mm in voxel i, s is the standard deviation, and T R ^ (%) is the weighted mean tremor reduction. Voxels with Ni less than 10% of all the sampled simulations or 5, whichever was greatest, were excluded from the analysis, i.e., the threshold was dependent on the sample size. To evaluate the effect of the exclusion threshold, complete analyses were also run with fixed thresholds of Ni ≤ 5 and Ni ≤ 12 corresponding to the minimum threshold and the 10% threshold for 30 patients. The T R ^ , weighted with the electric field norm E (V/m) over the stimulation amplitude A (V), was calculated in each voxel i according to
T R ^ i = k E i k A k · T R k k E i k A k
where k denotes the stimulation. A voxel p-value < 0.05 was regarded as significantly different from a 70% improvement, which corresponds to the mean improvement of the stimulations in the investigated population. Voxels with positive t-statistics were grouped into a high-improvement cluster, and voxels with negative t-statistics were grouped into a low-improvement cluster.
To correct for type 1 error of multiple comparisons, a permutation test was applied where the improvement values were permuted 200 times, first by permuting all leads and then by permuting the simulations within each lead. The permuted distributions were used to recompute p-maps and compared with the true distribution using an overall statistic. Significant p-value (p < 0.05) indicates an overall significance of the true p-map. The same test with the 10% exclusion threshold was repeated with 1000 permutations to separate the effect of sample size and the number of permutations.

2.5. Evaluation of Sample Size

To evaluate the effect of sample size, the p-maps were computed by sampling 5–61 (n) patients, i.e., 20–244 simulations, using random sampling with replacement. The sampling was done on a patient level since there might be a dependency within stimulations from the same patient and to have an equal frequency of the stimulating contacts. For each sample size, 20 different samplings were made resulting in 1140 p-maps.

2.6. Data Analysis

The resulting p-maps were evaluated by investigating the overall p-value from the permutation test in the different sample sizes. Furthermore, cluster volumes (V) from the p-maps were extracted representing the high-improvement voxels (voxels > 70% improvement) and low-improvement voxels (voxels < 70% improvement). For reference, the total analyzed volume and mean number of simulations per voxel were computed. The dice coefficient (DC) (Equation (3)) was used for evaluating the spatial variation between the 20 clusters resulting from the same sample size, for high and low improvement separately.
D C = 2 V X V Y V X + V Y X = 1 , , 19 ;   Y = X + 1 , , 20
The result for each sample size is presented as the median and interquartile range (IQR). For evaluations over all the sample sizes, the results are presented as mean ± standard deviation.

3. Results

The result for the overall p-values can be seen in Figure 1. With 200 permutations, the variation in overall p-value was higher for small sample sizes (max IQR = 0.333, n = 8) but reached low p-values around 20–30 patients, i.e., 80 to 120 simulations, with only a few outliers (IQR = 0, n > 35). A similar result was seen with 1000 permutations (Figure 1b) (max IQR = 0.495, n = 11; IQR = 0, n > 35) and for the fixed exclusion thresholds (Figure 1c,d) (max IQR = 0.530/0.413, n = 6/5; IQR = 0, n > 35/35 for threshold of 5 and 12, respectively) but with slightly larger variation for low sample sizes.
The variation of the cluster volumes for both high and low improvement was similar over the different sample sizes, but a low exclusion threshold generates increasing variability (Table 1) (Figure 2). The absolute volumes increased with sample size and reached stabilization around n = 30 for both the high- and low-improvement clusters but this trend was less pronounced for the fixed exclusion threshold analyses (Figure 2b,c,e,f). For reference, Figure 3 shows the total volume included in the statistical analysis and its mean occurrence of simulations. While the mean number of simulations per voxel increased in a nearly linear way, the analyzed volume was stable after reaching the point where the threshold increases with sample size, i.e., at 12 patients. When using a fixed threshold analysis, the analyzed volume increased through the entire analysis, but more pronounced for low sample sizes.
The DC increased over the entire span of sample sizes but more rapidly in the beginning (n < 40) (Figure 4). The median values of the DC reached a maximum of 0.68–0.72 for the different analyses, and the variation was similar for all sample sizes (Table 2). Similar trends and effect sizes were seen for all analyses independent of the threshold selection.
Figure 5 shows an example of a p-map and visualizes the localization of all clusters for sampling 5, 30, and 60 patients for the 10% threshold analysis. The clusters were in the vicinity of each other with a clear trend that the cluster centers converge to one region with increasing samples. The discrepancy in the location shown by the DC was mainly due to variations in the outer regions in the inferior-posterior direction for high-improvement volumes and the superior-anterior direction for low-improvement volumes.

4. Discussion

The present study has investigated the influence of the number of included DBS patients in p-map computations. A total of 244 simulations from 61 ET patients were included based on stimulation information obtained during a monopolar review 1 year after surgery.
This study shows that a small sample size generates an uncertain result in the creation of p-maps. When increasing the sample size to above 30 patients (120 simulations), the overall significance was zero for most analyses, but there were still variations in the location of the resulting significance volumes. These variations were in the region between the high- and low-improvement clusters. This is an expected result since voxels in the region between the clusters can be assumed to include a mixture of high- and low-improvement stimulations. The mixture will generate voxel-wise improvement close to the average of 70% in tremor reduction, and thereby not reach significance in the statistical test. Where this average region is located will be a result of the variation in lead localization between samples of patients. The expansion in the superior direction for the high-improvement clusters seen in Figure 5 is a result of more simulations included in each voxel, and thereby more voxels included in the analysis (Figure 3). For analyses with few patients, many of the voxels at the overall outer border will have too few simulations included and they will thereby be excluded.
The significance analysis is impacted by the sample size, but possibly also by the number of permutations. Increasing the number of permutations generates a higher power to the overall p-value due to a denser sampling of the null distribution. The statistics are based on the work of Eisenstein et al., which used 200 permutations [7]. However, for statistics on functional imaging, it has been suggested to use at least 1000 permutations [24], and some DBS studies uses this number [4]. As seen in the present study, with few patients the statistics varied more between the samplings even when increasing the number of permutations (Figure 1a–b). Therefore, the variation seen is due to the different sampled individuals in each iteration.
The increasing number of patients in the test will reduce the overall p-value and the variation of cluster location and will increase the cluster volume. The difference in volume is partly due to the threshold for excluding voxels with too few samples (Figure 3). With few patients, fewer voxels will be included in the statistics and the overall result will be more uncertain due to less volume covered by the analysis. Interestingly, while the cluster volumes were impacted by the selected thresholding method neither the overall p-value nor the DC were. With the 10% threshold, the overall analyzed volume was constant for the region where the threshold increased with sample size (>12 patients) while the cluster volume continued to increase up to about 30 patients. However, even for the region where the total volume was stable, the DC continued to increase generating a more stable location with 60 patients compared to 30 patients (Figure 4 and Figure 5). For the fixed thresholds, the cluster volumes increased for all sample sizes, but with decreasing rate above 25–30 patients, and similar stabilization of location as for the percentage threshold. The selected thresholding method will impact the outer border of the cluster volume. As analyses based on PSMs are often relative, e.g., correlation between the overlap of a patient VTA with the PSM, the selected threshold will have minor effect. Nevertheless, to reduce the effect of outliers and increase the PSM stability, a threshold less than 12 simulations is probably not suitable.
This study concerns monopolar review data, the result of which cannot directly be generalized to other datasets like intra-operative data, or data based on the clinical DBS settings used in the patient’s daily life. For other symptoms, a monopolar review is not as simple since the response to therapy can take much longer times, e.g., up to months for dystonia [25]. Therefore, the clinical DBS settings may be the only data available. One problem with the clinical settings is more pronounced skewness towards high improvement. This skewness can result in a necessity to modify the statistics and the selected improvement value to test against, but also that more samples may be required to achieve similar significance levels. On the other hand, the DBS outcomes for ET patients have relatively low variance and thus for disorders with higher variance in the outcome, a smaller sample may be sufficient. It is important to recognize that the result is a combined effect of the number of patients and simulations. While hundreds of simulations in a single patient generate much information, it cannot be generalized to a patient group. In this study, the number of stimulation settings increased the dataset moderately by including one setting per contact while not causing a large problem of dependency between simulations. However, it can be assumed that PSMs computed from clinical settings could require four times as many patients.
One limitation in this study is the homogeneity of the data. All patients were implanted in the same clinic aiming for the same target, which reduces the variation and results in a bias in spatial analysis. Furthermore, since the dataset is only composed of 61 patients, the dependency between the analysis within a cohort size is higher for larger n. This was addressed by using sampling with replacement, and the similar variation over the dataset confirms that this indeed reduces this dependency.

5. Conclusions

In conclusion, the resulting p-maps in DBS vary to a large extent for small sample sizes both in the size of significance clusters and the overall p-value. This variation could not be compensated for by increasing the number of permutations. At sample sizes of around 30 patients (120 simulations), the result stabilizes in significance and volume for this monopolar review dataset. This implies that data analysis based on small samples should be interpreted with caution since small variations in contact position will impact the result.

Author Contributions

Conceptualization, T.N., S.H. and K.W.; methodology, T.N.; software, T.N.; validation, T.N.; formal analysis, T.N.; investigation, P.B.; resources, P.B., K.W.; data curation, P.B.; writing—original draft preparation, T.N.; writing—review and editing, P.B., S.H. and K.W.; visualization, T.N.; supervision, K.W. and S.H.; project administration, T.N. and K.W.; funding acquisition, K.W. All authors have read and agreed to the published version of the manuscript.

Funding

This study was financially supported by the Swedish Foundation for Strategic Research (SSF BD15-0032) and by the Swedish Research Council (VR 2016-03564).

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Local Ethics Committee at Umeå University (Dnr. 01-123M, 2017/122-31).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data are not available due to ethical considerations.

Acknowledgments

The authors want to acknowledge the Swedish national supercomputer centre (NSC) for the computation resources required for brain template creation.

Conflicts of Interest

Patric Blomstedt is consultant for Abbott, Boston Scientific, Koh Young and shareholder in Mithridaticum AB. Karin Wårdell is shareholder in FluoLink AB.

References

  1. Lozano, A.M.; Lipsman, N.; Bergman, H.; Brown, P.; Chabardes, S.; Chang, J.W.; Matthews, K.; McIntyre, C.C.; Schlaepfer, T.E.; Schulder, M.; et al. Deep brain stimulation: Current challenges and future directions. Nat. Rev. Neurol. 2019, 15, 148–160. [Google Scholar] [CrossRef] [PubMed]
  2. Shah, A.; Vogel, D.; Alonso, F.; Lemaire, J.-J.; Pison, D.; Coste, J.; Wårdell, K.; Schkommodau, E.; Hemm, S. Stimulation maps: Visualization of results of quantitative intraoperative testing for deep brain stimulation surgery. Med. Biol. Eng. Comput. 2020, 58, 771–784. [Google Scholar] [CrossRef] [PubMed]
  3. Nordin, T.; Vogel, D.; Osterlund, E.; Johansson, J.; Blomstedt, P.; Fytagoridis, A.; Hemm, S.; Wardell, K. Probabilistic maps for deep brain stimulation—Impact of methodological differences. Brain Stimul. 2022, 15, 1139–1152. [Google Scholar] [CrossRef] [PubMed]
  4. Dembek, T.A.; Barbe, M.T.; Åström, M.; Hoevels, M.; Visser-Vandewalle, V.; Fink, G.R.; Timmermann, L. Probabilistic map-ping of deep brain stimulation effects in essential tremor. NeuroImage Clin. 2017, 13, 164–173. [Google Scholar] [CrossRef] [PubMed]
  5. Elias, G.J.B.; Boutet, A.; Joel, S.E.; Germann, J.; Gwun, D.; Neudorfer, C.; Gramer, R.M.; Algarni, M.; Paramanandam, V.; Prasad, S.; et al. Probabilistic Mapping of Deep Brain Stimulation: Insights from 15 Years of Therapy. Ann. Neurol. 2021, 89, 426–443. [Google Scholar] [CrossRef]
  6. Wårdell, K.; Nordin, T.; Vogel, D.; Zsigmond, P.; Westin, C.-F.; Hariz, M.; Hemm, S. Deep Brain Stimulation: Emerging Tools for Simulation, Data Analysis, and Visualization. Front. Neurosci. 2022, 16, 834026. [Google Scholar] [CrossRef]
  7. Eisenstein, S.A.; Koller, J.M.; Black, K.D.; Campbell, M.C.; Lugar, H.M.; Ushe, M.; Tabbal, S.D.; Karimi, M.; Hershey, T.; Perlmutter, J.S.; et al. Functional anatomy of subthalamic nucleus stimulation in Parkinson disease. Ann. Neurol. 2014, 76, 279–295. [Google Scholar] [CrossRef]
  8. Dembek, T.A.; Roediger, J.; Horn, A.; Reker, P.; Oehrn, C.; Dafsari, H.S.; Li, N.; Kuhn, A.A.; Fink, G.R.; Visser-Vandewalle, V.; et al. Probabilistic sweet spots predict motor outcome for deep brain stimulation in Parkinson disease. Ann. Neurol. 2019, 86, 527–538. [Google Scholar] [CrossRef]
  9. Akram, H.; Sotiropoulos, S.N.; Jbabdi, S.; Georgiev, D.; Mahlknecht, P.; Hyam, J.; Foltynie, T.; Limousin, P.; De Vita, E.; Jahanshahi, M.; et al. Subthalamic deep brain stimulation sweet spots and hyperdirect cortical connectivity in Parkinson’s disease. NeuroImage 2017, 158, 332–345. [Google Scholar] [CrossRef]
  10. Nowacki, A.; Schober, M.; Nader, L.; Saryyeva, A.; Nguyen, T.-A.K.; Green, A.L.; Pollo, C.; Krauss, J.K.; Fontaine, D.; Aziz, T.Z. Deep Brain Stimulation for Chronic Cluster Headache: Meta-Analysis of Individual Patient Data. Ann. Neurol. 2020, 88, 956–969. [Google Scholar] [CrossRef]
  11. Nguyen, T.A.K.; Nowacki, A.; Debove, I.; Petermann, K.; Tinkhauser, G.; Wiest, R.; Schüpbach, M.; Krack, P.; Pollo, C. Direc-tional stimulation of subthalamic nucleus sweet spot predicts clinical efficacy: Proof of concept. Brain Stimul. 2019, 12, 1127–1134. [Google Scholar] [CrossRef]
  12. Persson, R.S.; Nordin, T.; Hariz, G.-M.; Wårdell, K.; Forsgren, L.; Hariz, M.; Blomstedt, P. Deep Brain Stimulation of Caudal Zona Incerta for Parkinson’s Disease: One-Year Follow-Up and Electric Field Simulations. Neuromodulation Technol. Neural Interface 2021, 25, 935–944. [Google Scholar] [CrossRef]
  13. Kübler, D.; Kroneberg, D.; Al-Fatly, B.; Schneider, G.-H.; Ewert, S.; van Riesen, C.; Gruber, D.; Ebersbach, G.; Kühn, A.A. Determining an efficient deep brain stimulation target in essential tremor—Cohort study and review of the literature. Park. Relat. Disord. 2021, 89, 54–62. [Google Scholar] [CrossRef]
  14. Akbarian-Tefaghi, L.; Akram, H.; Johansson, J.; Zrinzo, L.; Kefalopoulou, Z.; Limousin, P.; Joyce, E.; Hariz, M.; Wårdell, K.; Foltynie, T. Refining the Deep Brain Stimulation Target within the Limbic Globus Pallidus Internus for Tourette Syndrome. Stereotact. Funct. Neurosurg. 2017, 95, 251–258. [Google Scholar] [CrossRef]
  15. Reich, M.M.; Horn, A.; Lange, F.; Roothans, J.; Paschen, S.; Runge, J.; Wodarg, F.; Pozzi, N.G.; Witt, K.; Nickl, R.C.; et al. Probabilistic mapping of the antidystonic effect of pallidal neurostimulation: A multicentre imaging study. Brain 2019, 142, 1386–1398. [Google Scholar] [CrossRef]
  16. Nowacki, A.; Barlatey, S.; Al-Fatly, B.; Dembek, T.; Bot, M.; Green, A.L.; Kubler, D.; Lachenmayer, M.L.; Debove, I.; Se-gura-Amil, A.; et al. Probabilistic Mapping Reveals Optimal Stimulation Site in Essential Tremor. Ann. Neurol. 2022, 91, 602–612. [Google Scholar] [CrossRef]
  17. Blomstedt, P.; Sandvik, U.; Tisch, S. Deep brain stimulation in the posterior subthalamic area in the treatment of essential tremor. Mov. Disord. 2010, 25, 1350–1356. [Google Scholar] [CrossRef]
  18. Fahn, S.; Tolosa, E.; Marín, C. Clinical rating scale for tremor. In Parkinson’s Disease and Movement Disorders; Jankovic, J., Tolo-sa, E., Eds.; Urban and Schwarzenberg Baltimore: Münich, Germany, 1988; Volume 2, pp. 225–234. [Google Scholar]
  19. Åström, M.; Zrinzo, L.U.; Tisch, S.; Tripoliti, E.; Hariz, M.I.; Wårdell, K. Method for patient-specific finite element modeling and simulation of deep brain stimulation. Med. Biol. Eng. Comput. 2009, 47, 21–28. [Google Scholar] [CrossRef]
  20. Alonso, F.; Hemm-Ode, S.; Wårdell, K. Influence on Deep Brain Stimulation from Lead Design, Operating Mode and Tissue Impedance Changes—A Simulation Study. Brain Disord. Ther. 2015, 4, 169. [Google Scholar] [CrossRef]
  21. Fonov, V.; Evans, A.C.; Botteron, K.; Almli, C.R.; McKinstry, R.C.; Collins, D.L. Unbiased average age-appropriate atlases for pediatric studies. NeuroImage 2011, 54, 313–327. [Google Scholar] [CrossRef]
  22. Morel, A. Stereotactic Atlas of the Human Thalamus and Basal Ganglia; CRC Press: Boca Raton, FL, USA, 2007. [Google Scholar]
  23. Jakab, A.; Blanc, R.; Berényi, E.L.; Székely, G. Generation of Individualized Thalamus Target Maps by Using Statistical Shape Models and Thalamocortical Tractography. Am. J. Neuroradiol. 2012, 33, 2110. [Google Scholar] [CrossRef] [PubMed]
  24. Nichols, T.E.; Holmes, A.P. Nonparametric permutation tests for functional neuroimaging: A primer with examples. Hum. Brain Mapp. 2002, 15, 1–25. [Google Scholar] [CrossRef] [PubMed]
  25. Wichmann, T.; DeLong, M.R. Deep Brain Stimulation for Movement Disorders of Basal Ganglia Origin: Restoring Function or Functionality? Neurotherapeutics 2016, 13, 264–283. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The overall p-value for the computed p-maps with (a) 200 and (b) 1000 permutations with 10% exclusion threshold and with 200 permutations for fixed exclusion thresholds of (c) 5 simulations and (d) 12 simulations. All data are shown with the median (straight line) and the variation is displayed with the quartiles (dashed line) and max/min (dotted line).
Figure 1. The overall p-value for the computed p-maps with (a) 200 and (b) 1000 permutations with 10% exclusion threshold and with 200 permutations for fixed exclusion thresholds of (c) 5 simulations and (d) 12 simulations. All data are shown with the median (straight line) and the variation is displayed with the quartiles (dashed line) and max/min (dotted line).
Brainsci 13 00756 g001
Figure 2. Computed cluster volumes for (ac) high improvement and (df) low improvement for the 10% exclusion threshold (top row), and fixed threshold at 5 simulations (second row) and 12 simulations (bottom row).
Figure 2. Computed cluster volumes for (ac) high improvement and (df) low improvement for the 10% exclusion threshold (top row), and fixed threshold at 5 simulations (second row) and 12 simulations (bottom row).
Brainsci 13 00756 g002
Figure 3. The total volume included in the statistical analysis (ac), and the mean number of simulations in the analyzed voxels (Ni) (df) for the 10% exclusion threshold (top row), and fixed threshold at 5 simulations (second row) and 12 simulations (bottom row).
Figure 3. The total volume included in the statistical analysis (ac), and the mean number of simulations in the analyzed voxels (Ni) (df) for the 10% exclusion threshold (top row), and fixed threshold at 5 simulations (second row) and 12 simulations (bottom row).
Brainsci 13 00756 g003
Figure 4. Dice coefficient for the clusters within each sample size for the (ac) high-improvement clusters and (df) low-improvement clusters with the 10% exclusion threshold (top row), and fixed threshold at 5 simulations (second row) and 12 simulations (bottom row).
Figure 4. Dice coefficient for the clusters within each sample size for the (ac) high-improvement clusters and (df) low-improvement clusters with the 10% exclusion threshold (top row), and fixed threshold at 5 simulations (second row) and 12 simulations (bottom row).
Brainsci 13 00756 g004
Figure 5. The top row shows on the left an example (n = 60) of a p-map and on the right the same p-map as high- and low-improvement clusters in sagittal views together with outlines from a digital version of Morel’s atlas [22,23]. The bottom row shows sagittal views of the outlines from all high- and low-improvement clusters computed with 5, 30, and 60 patients. The clusters converge with increasing sample size and the location is expanded in the superior direction. All images are from the analysis with the 10% exclusion threshold. CL = central lateral nucleus, CM = centromedian nucleus, MDpc = mediodorsal nucleus (parvocellular division), PuM = medial pulvinar, STN = subthalamic nucleus, VApc = ventral anterior nucleus (parvocellular division), VLpd = ventral lateral posterior nucleus (dorsal division), VLpv = ventral lateral posterior nucleus (ventral division), VM = ventral medial nucleus, VPM = ventral posterior medial nucleus, A = anterior, P = posterior, n = number of included patients.
Figure 5. The top row shows on the left an example (n = 60) of a p-map and on the right the same p-map as high- and low-improvement clusters in sagittal views together with outlines from a digital version of Morel’s atlas [22,23]. The bottom row shows sagittal views of the outlines from all high- and low-improvement clusters computed with 5, 30, and 60 patients. The clusters converge with increasing sample size and the location is expanded in the superior direction. All images are from the analysis with the 10% exclusion threshold. CL = central lateral nucleus, CM = centromedian nucleus, MDpc = mediodorsal nucleus (parvocellular division), PuM = medial pulvinar, STN = subthalamic nucleus, VApc = ventral anterior nucleus (parvocellular division), VLpd = ventral lateral posterior nucleus (dorsal division), VLpv = ventral lateral posterior nucleus (ventral division), VM = ventral medial nucleus, VPM = ventral posterior medial nucleus, A = anterior, P = posterior, n = number of included patients.
Brainsci 13 00756 g005
Table 1. Median cluster volumes for 5, 30, and 60 patients and overall variability (mean ± standard deviation) of the IQR over the total range of sample sizes.
Table 1. Median cluster volumes for 5, 30, and 60 patients and overall variability (mean ± standard deviation) of the IQR over the total range of sample sizes.
10% ThresholdFixed Threshold of 5Fixed Threshold of 12
High improvement
Median volume [mm3]
n = 554440.3
n = 30277290227
n = 60229490373
IQR [mm3]96 ± 24187 ± 6494 ± 43
Low improvement
Median volume [mm3]
n = 53.94.10
n = 3089154107
n = 60121251177
IQR [mm3]51 ± 1475 ± 2250 ± 26
Table 2. The number of patients (n) where the largest median DC values were reached and the overall variability (mean ± standard deviation) of the IQR over the total range of sample sizes.
Table 2. The number of patients (n) where the largest median DC values were reached and the overall variability (mean ± standard deviation) of the IQR over the total range of sample sizes.
10% ThresholdFixed Threshold of 5Fixed Threshold of 12
High improvement
DC max0.73 (n = 57)0.68 (n = 61)0.70 (n = 47)
IQR0.16 ± 0.060.17 ± 0.060.15 ± 0.06
Low improvement
DC max0.69 (n = 58) 0.72 (n = 61)0.72 (n = 60)
IQR0.15 ± 0.050.15 ± 0.060.13 ± 0.06
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Nordin, T.; Blomstedt, P.; Hemm, S.; Wårdell, K. How Sample Size Impacts Probabilistic Stimulation Maps in Deep Brain Stimulation. Brain Sci. 2023, 13, 756. https://doi.org/10.3390/brainsci13050756

AMA Style

Nordin T, Blomstedt P, Hemm S, Wårdell K. How Sample Size Impacts Probabilistic Stimulation Maps in Deep Brain Stimulation. Brain Sciences. 2023; 13(5):756. https://doi.org/10.3390/brainsci13050756

Chicago/Turabian Style

Nordin, Teresa, Patric Blomstedt, Simone Hemm, and Karin Wårdell. 2023. "How Sample Size Impacts Probabilistic Stimulation Maps in Deep Brain Stimulation" Brain Sciences 13, no. 5: 756. https://doi.org/10.3390/brainsci13050756

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop