# The Sample Size Matters: To What Extent the Participant Reduction Affects the Outcomes of a Neuroscientific Research. A Case-Study in Neuromarketing Field

## Abstract

## 1. Introduction

## 2. Materials and Methods

#### 2.1. Experimental Sample and Design

#### 2.2. Neurophysiological Data Recording

- Index 3: a descriptor of the autonomic response, namely the emotional index (EI), computed by combining the SCL and HR measures, as described by Vecchiato and colleagues [52].

#### 2.3. Data Analysis and Statistics

- Group-mean values of the index as a function of time (t being the task duration) for each of the 630 combinations. This yielded 630 vectors ‘v630’ (i.e., a 630 × t matrix);
- Pearson correlation between each ‘v630’ (630 × t) and the vector ‘v’ (1 × t), which contains the mean values of the index computed over the entire population (36 subjects);
- Mean squared error (MSE), describing the error made by considering each ‘v630’ rather than ‘v’ along each task (within-task variability): $$MSE=\frac{\sum_{i=1}^{t}{\left(v630_{i}-v_{i}\right)}^{2}}{t}$$
- Standard deviation (STD) of the 630 values taken by the vectors ‘v630’ at each second of the task (between-groups variability): $$STD=\sqrt{\frac{\sum_{n=1}^{630}{\left(v630_{n}-\overline{v630}\right)}^{2}}{630}}$$

^{−5} [53]. The Shapiro–Wilk normality test [54] showed that the data (the rho correlation coefficients, MSE, and STD values) were not Gaussian; hence, the non-parametric Friedman test [55] was used to assess the differences between groups. Specifically, the subgroups of subjects (32, 28, 24, 20, 16) were treated as the within factor in the analyses of the rho values from the correlation analysis, the MSE values, and the STD values. The analysis was performed for each of the three indices and each of the four selected spots. The Nemenyi post-hoc test [56], designed for the non-parametric repeated-measures ANOVA (i.e., the Friedman test), was applied to further analyse the significant effects.
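To make the Friedman statistic concrete, the following is a minimal sketch of how it could be computed from repeated measures. It assumes that each block is one of the 630 combinations (or one time point, for STD) and that each block holds one value per subgroup size; the function names are hypothetical, the toy numbers are not study data, and no tie correction or p-value is computed.

```python
def rank_row(values):
    """Rank values within one block (1 = smallest), averaging tied ranks."""
    order = sorted(range(len(values)), key=lambda j: values[j])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the run while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        avg_rank = (pos + end) / 2 + 1
        for j in order[pos:end + 1]:
            ranks[j] = avg_rank
        pos = end + 1
    return ranks


def friedman_statistic(blocks):
    """Friedman chi-squared (no tie correction).

    blocks: n rows, each with k repeated measures (one per subgroup size).
    """
    n, k = len(blocks), len(blocks[0])
    rank_sums = [0.0] * k
    for row in blocks:
        for j, r in enumerate(rank_row(row)):
            rank_sums[j] += r
    return 12.0 / (n * k * (k + 1)) * sum(r * r for r in rank_sums) - 3.0 * n * (k + 1)


# Toy example: 4 blocks, 3 conditions (illustrative numbers only).
toy = [[1, 2, 3], [1, 2, 3], [1, 3, 2], [2, 1, 3]]
q_toy = friedman_statistic(toy)
print(q_toy)
```

In practice a library routine such as `scipy.stats.friedmanchisquare` would normally be used instead, since it also returns the p-value; the hand-written version is shown only to expose the rank-sum logic.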

## 3. Results

#### 3.1. The Effect of the Index

^{−5}). The Friedman test showed that the mean rho values decreased significantly across the subgroups, Figure 2a (Friedman chi-squared = 1868, p-value < 2.2 × 10^{−16}). The post-hoc analysis showed that rho decreased significantly from 32 to 16 subjects, taking a different value in each subgroup (p < 0.05).

^{−16}): it was significantly lower for 32 subjects and increased as the sample size was reduced. The highest error was made when considering 16 subjects instead of 36. All the groups were significantly different, as shown by the Nemenyi test.

^{−16}). It was lower for 32 subjects, increasing with the reduction of the sample size. The post-hoc analysis demonstrated that STD was significantly different for each subgroup.

^{−5}). Concerning the subgroups of 24, 20, and 16 subjects, 13, 92, and 237 out of 630 correlations, respectively, were not significant. The Friedman analysis on rho, MSE, and STD showed an effect of the groups, with a significant decrease of rho (Friedman chi-squared = 1868, p-value < 2.2 × 10^{−16}), a significant increase of MSE (Friedman chi-squared = 1932.3, p-value < 2.2 × 10^{−16}), and a significant increase of STD from 32 to 16 subjects (Friedman chi-squared = 124, p-value < 2.2 × 10^{−16}). The post-hoc Nemenyi test showed that all the subgroups took different values of rho, MSE, and STD.

^{−5}). The Friedman analysis on rho, MSE, and STD showed an effect of the groups, with a significant decrease of rho (Friedman chi-squared = 2235.5, p-value < 2.2 × 10^{−16}), a significant increase of MSE (Friedman chi-squared = 2232.1, p-value < 2.2 × 10^{−16}), and a significant increase of STD from 32 to 16 subjects (Friedman chi-squared = 124, p-value < 2.2 × 10^{−16}). The post-hoc Nemenyi test showed that all the subgroups took different values of rho, MSE, and STD.

#### 3.2. The Effect of the Task

^{−5}), while for the subgroups 32 and 28 for Index2 and Index3. With the sample size reduced to 24 subjects, 10 and 49 out of 630 correlations were not significant for Index2 and Index3, respectively. For a sample size of 20 subjects, 8, 90, and 138 out of 630 correlations were not significant for the three indices, respectively. For 16 subjects, 49, 215, and 290 out of 630 were not significant for the three indices, respectively. The Friedman test on rho, MSE, and STD revealed a significant effect of the groups for the three indices, with rho decreasing (Index1: Friedman chi-squared = 1879.4, p-value < 2.2 × 10^{−16}; Index2: Friedman chi-squared = 1800.9, p-value < 2.2 × 10^{−16}; Index3: Friedman chi-squared = 1888.1, p-value < 2.2 × 10^{−16}), MSE increasing (Index1: Friedman chi-squared = 2124.8, p-value < 2.2 × 10^{−16}; Index2: Friedman chi-squared = 2151.2, p-value < 2.2 × 10^{−16}; Index3: Friedman chi-squared = 2182.7, p-value < 2.2 × 10^{−16}), and STD increasing (Index1: Friedman chi-squared = 124, p-value < 2.2 × 10^{−16}; Index2: Friedman chi-squared = 124, p-value < 2.2 × 10^{−16}; Index3: Friedman chi-squared = 124, p-value < 2.2 × 10^{−16}). The post-hoc test showed that all the comparisons between groups were significant.
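The recurring value of 124 for the STD analyses has a simple possible reading, offered here only as an inference and not as something the text states. For n blocks and k conditions with no ties, the Friedman statistic is

$$\chi^{2}_{F}=\frac{12}{nk(k+1)}\sum_{j=1}^{k}R_{j}^{2}-3n(k+1)$$

where $R_{j}$ is the rank sum of condition j. If every block ranks the conditions in the same order, then $R_{j}=nj$ and the statistic reaches its maximum:

$$\chi^{2}_{F,\max}=\frac{12}{nk(k+1)}\cdot n^{2}\,\frac{k(k+1)(2k+1)}{6}-3n(k+1)=n(k-1)$$

With k = 5 subgroups, 124 = 4 × 31, which would be consistent with n = 31 time points in a 30 s spot and with STD being ordered identically across the five subgroups at every time point. Both the number of time points and the absence of ties are assumptions.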

#### 3.3. The Effect of the Time

^{−5}). Concerning Index1, 12, 53, and 136 correlations out of 630 were not significant in subgroups 24, 20, and 16, respectively, while for Index3, 2, 63, and 216 correlations were not significant in those subgroups. For Index2, non-significant correlations appeared already in the subgroup of 28 subjects, in which 23 correlations were not significant, while 115, 290, and 408 out of 630 correlations were not significant for the subgroups of 24, 20, and 16 subjects, respectively. The Friedman test on rho values revealed a significant effect of the groups for the three indices, with rho decreasing as the sample size was reduced (Index1: Friedman chi-squared = 1712.1, p-value < 2.2 × 10^{−16}; Index2: Friedman chi-squared = 1586.3, p-value < 2.2 × 10^{−16}; Index3: Friedman chi-squared = 2017.2, p-value < 2.2 × 10^{−16}). The post-hoc test showed that all the comparisons between groups were significant.

The analysis of SPOT4 showed that all the correlations were significant for the subgroup of 32 subjects, for both Index1 and Index3 (p < 8 × 10^{−5}). Concerning Index1, 63, 214, 335, and 464 correlations out of 630 were not significant in subgroups 28, 24, 20, and 16, respectively, while for Index3, 19, 122, 296, and 423 correlations were not significant in those subgroups. For Index2, non-significant correlations appeared already in the subgroup of 32 subjects, in which 7 correlations were not significant, while 82, 203, 342, and 450 out of 630 correlations were not significant for the subgroups of 28, 24, 20, and 16 subjects, respectively. The Friedman test on rho values revealed a significant effect of the groups for the three indices, with rho decreasing as the sample size was reduced (Index1: Friedman chi-squared = 1510.9, p-value < 2.2 × 10^{−16}; Index2: Friedman chi-squared = 1263, p-value < 2.2 × 10^{−16}; Index3: Friedman chi-squared = 1617.6, p-value < 2.2 × 10^{−16}). The post-hoc test showed that all the comparisons between groups were significant.
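The sharper loss of significant correlations in the shorter spots, even at a similar rho, is what the standard significance test for a Pearson coefficient would predict, because its test statistic grows with the number of time points. The sketch below uses the usual t-transform of r. It is an assumption that significance was tested this way and that the number of samples n is roughly the spot duration in seconds; the function name `pearson_t` is hypothetical. The value r = 0.74 is the median rho reported for Index1 with 16 subjects in both SPOT1 and SPOT4.

```python
import math


def pearson_t(r, n):
    """t statistic for testing rho = 0, given Pearson r over n paired samples."""
    if n < 3 or not -1.0 < r < 1.0:
        raise ValueError("need n >= 3 and -1 < r < 1")
    return r * math.sqrt((n - 2) / (1.0 - r * r))


# Same correlation strength, different task lengths
# (ASSUMPTION: one sample per second, so n ~ duration in seconds).
r = 0.74
for n in (30, 20, 15):
    print(n, round(pearson_t(r, n), 2))
```

Under these assumptions the statistic falls from about 5.8 at n = 30 to about 4.0 at n = 15, so the same rho is less likely to pass a strict threshold such as p < 8 × 10^{−5} in a 15 s spot.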

#### 3.4. Rho-Sample Size Relationship

## 4. Discussion

## 5. Conclusions

## Author Contributions

## Funding

## Institutional Review Board Statement

## Informed Consent Statement

## Data Availability Statement

## Conflicts of Interest

## References

**Figure 1.** The graph represents the trend of ‘v’, the mean value of Index1 computed over the entire population (36 subjects) during SPOT1 (ALL). The dashed lines show the trend of the maximum and minimum values (Max, Min) of the index among the ‘v630’ for each second, in each subgroup of subjects (32, 28, 24, 20, 16).

**Figure 2.** The box plots represent the effect of the subgroup of subjects on (**a**) Rho values, (**b**) MSE values, and (**c**) STD values, computed for Index 1 during SPOT 1.

**Figure 3.** Graphical representation of the trend of the rho correlation coefficient as the sample size (subgroups) is reduced, with respect to the ‘full population index’ (36 participants). Tasks (**a**) and (**b**) were 30 s long, Task (**c**) was 20 s long, and Task (**d**) was 15 s long.

**Table 1.** Number of significant correlations (#s.c., p < 8 × 10^{−5}), with any decrease expressed as a percentage of the total number of combinations (630), and median ± std of rho, MSE, and STD for the three indices (I1, I2, I3) and the five subgroups of subjects (32, 28, 24, 20, 16) during SPOT1 (30 s).

Subgroup | #s.c. I1 | #s.c. I2 | #s.c. I3 | Rho I1 | Rho I2 | Rho I3 | MSE I1 | MSE I2 | MSE I3 | STD I1 | STD I2 | STD I3 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
32 | 630 * | 630 * | 630 * | 0.96 ± 0.02 | 0.94 ± 0.02 | 0.97 ± 0.01 | 0.012 ± 0.01 | 0.014 ± 0.01 | 0.001 ± 0.0005 | 0.12 ± 0.02 | 0.13 ± 0.01 | 0.03 ± 0.003 |
28 | 630 * | 630 * | 630 * | 0.91 ± 0.04 | 0.89 ± 0.05 | 0.94 ± 0.02 | 0.029 ± 0.02 | 0.032 ± 0.02 | 0.002 ± 0.001 | 0.18 ± 0.03 | 0.19 ± 0.02 | 0.05 ± 0.004 |
24 | 628 (−0.3%) | 617 (−2.1%) | 630 * | 0.87 ± 0.06 | 0.84 ± 0.07 | 0.90 ± 0.03 | 0.05 ± 0.03 | 0.053 ± 0.03 | 0.004 ± 0.002 | 0.23 ± 0.04 | 0.23 ± 0.03 | 0.07 ± 0.007 |
20 | 597 (−5.2%) | 538 (−14.6%) | 630 * | 0.81 ± 0.08 | 0.76 ± 0.10 | 0.86 ± 0.04 | 0.079 ± 0.06 | 0.089 ± 0.05 | 0.007 ± 0.003 | 0.30 ± 0.05 | 0.31 ± 0.04 | 0.09 ± 0.007 |
16 | 479 (−23.9%) | 393 (−37.6%) | 618 (−1.9%) | 0.74 ± 0.12 | 0.69 ± 0.12 | 0.79 ± 0.06 | 0.128 ± 0.08 | 0.136 ± 0.08 | 0.011 ± 0.005 | 0.37 ± 0.06 | 0.38 ± 0.04 | 0.11 ± 0.01 |

**Table 2.** Number of significant correlations (#s.c., p < 8 × 10^{−5}), with any decrease expressed as a percentage of the total number of combinations (630), and median ± std of rho, MSE, and STD for the three indices (I1, I2, I3) and the five subgroups of subjects (32, 28, 24, 20, 16) during SPOT2 (30 s).

Subgroup | #s.c. I1 | #s.c. I2 | #s.c. I3 | Rho I1 | Rho I2 | Rho I3 | MSE I1 | MSE I2 | MSE I3 | STD I1 | STD I2 | STD I3 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
32 | 630 * | 630 * | 630 * | 0.98 ± 0.007 | 0.95 ± 0.02 | 0.94 ± 0.02 | 0.02 ± 0.01 | 0.02 ± 0.01 | 0.001 ± 0.00 | 0.14 ± 0.02 | 0.13 ± 0.02 | 0.03 ± 0.004 |
28 | 630 * | 630 * | 630 * | 0.96 ± 0.01 | 0.9 ± 0.04 | 0.88 ± 0.04 | 0.04 ± 0.01 | 0.04 ± 0.02 | 0.002 ± 0.00 | 0.21 ± 0.02 | 0.19 ± 0.03 | 0.05 ± 0.006 |
24 | 630 * | 620 (−1.6%) | 598 (−5.1%) | 0.94 ± 0.03 | 0.84 ± 0.07 | 0.81 ± 0.07 | 0.07 ± 0.03 | 0.07 ± 0.03 | 0.004 ± 0.001 | 0.27 ± 0.04 | 0.26 ± 0.05 | 0.07 ± 0.008 |
20 | 622 (−1.2%) | 540 (−14.3%) | 492 (−21.9%) | 0.91 ± 0.06 | 0.78 ± 0.10 | 0.74 ± 0.11 | 0.12 ± 0.05 | 0.11 ± 0.05 | 0.007 ± 0.003 | 0.34 ± 0.04 | 0.32 ± 0.06 | 0.08 ± 0.01 |
16 | 581 (−7.8%) | 415 (−34.1%) | 340 (−46%) | 0.86 ± 0.08 | 0.70 ± 0.13 | 0.67 ± 0.14 | 0.18 ± 0.07 | 0.17 ± 0.07 | 0.01 ± 0.004 | 0.42 ± 0.06 | 0.40 ± 0.07 | 0.1 ± 0.02 |

**Table 3.** Number of significant correlations (#s.c., p < 8 × 10^{−5}), with any decrease expressed as a percentage of the total number of combinations (630), and median ± std of rho values for the three indices (I1, I2, I3) and the five subgroups of subjects (32, 28, 24, 20, 16) during SPOT3 (20 s) and SPOT4 (15 s).

Subgroup | SPOT3 #s.c. I1 | SPOT3 #s.c. I2 | SPOT3 #s.c. I3 | SPOT3 Rho I1 | SPOT3 Rho I2 | SPOT3 Rho I3 | SPOT4 #s.c. I1 | SPOT4 #s.c. I2 | SPOT4 #s.c. I3 | SPOT4 Rho I1 | SPOT4 Rho I2 | SPOT4 Rho I3 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
32 | 630 * | 630 * | 630 * | 0.98 ± 0.01 | 0.95 ± 0.03 | 0.98 ± 0.01 | 630 * | 623 (−1.1%) | 630 * | 0.96 ± 0.026 | 0.96 ± 0.04 | 0.97 ± 0.01 |
28 | 630 * | 607 (−3.6%) | 630 * | 0.95 ± 0.03 | 0.88 ± 0.06 | 0.95 ± 0.02 | 567 (−10%) | 548 (−13%) | 611 (−3%) | 0.92 ± 0.04 | 0.93 ± 0.05 | 0.94 ± 0.02 |
24 | 618 (−1.9%) | 515 (−18.2%) | 628 (−0.3%) | 0.92 ± 0.05 | 0.83 ± 0.08 | 0.92 ± 0.04 | 416 (−33.9%) | 427 (−32.2%) | 508 (−19.3%) | 0.87 ± 0.09 | 0.87 ± 0.13 | 0.89 ± 0.04 |
20 | 577 (−8.4%) | 340 (−46%) | 567 (−10%) | 0.88 ± 0.07 | 0.76 ± 0.13 | 0.87 ± 0.06 | 295 (−53.2%) | 288 (−54.3%) | 334 (−46.9%) | 0.80 ± 0.12 | 0.81 ± 0.19 | 0.84 ± 0.06 |
16 | 494 (−21.6%) | 222 (−64.7%) | 414 (−34.3%) | 0.83 ± 0.10 | 0.70 ± 0.16 | 0.81 ± 0.07 | 166 (−73.7%) | 180 (−71.4%) | 207 (−67.1%) | 0.74 ± 0.17 | 0.77 ± 0.24 | 0.78 ± 0.08 |

**Table 4.** Number of significant correlations (#s.c., p < 8 × 10^{−5}), with any decrease expressed as a percentage of the total number of combinations (630), for the three indices (I1, I2, I3) and the five subgroups of subjects (32, 28, 24, 20, 16) during SPOT1 (30 s), SPOT2 (30 s), SPOT3 (20 s), and SPOT4 (15 s).

Subgroup | I1 SPOT1 (30 s) | I1 SPOT2 (30 s) | I1 SPOT3 (20 s) | I1 SPOT4 (15 s) | I2 SPOT1 (30 s) | I2 SPOT2 (30 s) | I2 SPOT3 (20 s) | I2 SPOT4 (15 s) | I3 SPOT1 (30 s) | I3 SPOT2 (30 s) | I3 SPOT3 (20 s) | I3 SPOT4 (15 s) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
32 | 630 * | 630 * | 630 * | 630 * | 630 * | 630 * | 630 * | 623 (−1.1%) | 630 * | 630 * | 630 * | 630 * |
28 | 630 * | 630 * | 630 * | 567 (−10%) | 630 * | 630 * | 607 (−3.6%) | 548 (−13%) | 630 * | 630 * | 630 * | 611 (−3%) |
24 | 628 (−0.3%) | 630 * | 618 (−1.9%) | 416 (−33.9%) | 617 (−2.1%) | 620 (−1.6%) | 515 (−18.2%) | 427 (−32.2%) | 630 * | 598 (−5.1%) | 628 (−0.3%) | 508 (−19.3%) |
20 | 597 (−5.2%) | 622 (−1.2%) | 577 (−8.4%) | 295 (−53.2%) | 538 (−14.6%) | 540 (−14.3%) | 340 (−46%) | 288 (−54.3%) | 630 * | 492 (−21.9%) | 567 (−10%) | 334 (−46.9%) |
16 | 479 (−23.9%) | 581 (−7.8%) | 494 (−21.6%) | 166 (−73.7%) | 393 (−37.6%) | 415 (−34.1%) | 222 (−64.7%) | 180 (−71.4%) | 618 (−1.9%) | 340 (−46%) | 414 (−34.3%) | 207 (−67.1%) |

