Response Times and Tax Compliance

: Inspired by the work of Rubinstein, this study revisits data from a previous lab experiment to explore the relation between response times and tax compliance and understand the potential non-linearity between them by classifying decisions and individuals into compliance types. We ﬁnd that individuals’ decision response time is related to their compliance decisions. Full-non compliant individuals (those who did not declare any earned income) have shorter response times than those who fully or partially complied. Full-compliant individuals also tend to declare income faster than partially compliant subjects. Such results are robust throughout time and when controlling for contextual characteristics of experimental design. We ﬁnd non-linearity via an inverted U -shape function that reaches its maximum declaration time around a compliance rate of 60%, even after controlling for contextual experimental design factors. In addition, we observe a non-linear relation between cognitive skills, response time, and tax compliance. Participants with relatively high cognitive skills and very low or very high tax compliance level have low response times, while subjects with relatively lower cognitive skills tend to report higher decision times for higher compliance levels.


Introduction
Following the pioneering laboratory work by Wilhelm Wundt, the topic of response or reaction times has occupied psychology research for 150 years: since the beginning of modern psychology. Wundt created an experimental lab in Leipzig that became "the place" for researchers and students from around the world to learn how to do experimental research in psychology [1]. Influenced by the psychophysical and physiological methods of scholars, such as Fenchner, Helmholtz, and Donders (who measured the speed of nerve and mental events), Wundt applied the reaction time method in his own laboratory work to measure the speed of mental processes. Benjamin [1] refers to Sokal's compilation of previously unpublished documents that include of James Cattell 1 's journal and letters during the time he was working at Wundt's lab in Germany. Cattell's letters to his parents in January 1884 are indicative of the excitement and challenges that are involved with studying reaction times: "We work in a new field, where others will follow us, who must use or correct our results. We are trying to measure the time that it takes to perform the simplest mental act-as for example to distinguish whether a color is blue or red. As this time seems to be not more than one-hundredth of a second, you can imagine this is no easy task" [1] (p. 42, based on [2]). Such early pioneering endeavors Games 2019, 10, 45 3 of 13 roles of response times via an interpretation of the meaning of choice in single games, predicting the behavior of players in a game, or defining a typology of players 4 .
Inspired by Rubinstein's work, we decided to revisit data from a previous tax compliance experiment [13]. Examining the response times of tax compliance decisions is interesting, as people are faced with the dilemma of whether and how much to cheat. Cheating has received relatively less attention in the response time literature as compared with other dilemmas related to the dictator, ultimatum, or public good games. Jiang [14], for example, studied cheating behaviors in a die-rolling task (participants were able to lie about the side chosen to get higher earnings as the casts were self-reported). The experiments used decision times as a way of measuring the subjects' struggle to be honest between different treatments. Interestingly, as far as we know, tax compliance researchers have not explored response times in detail beyond controlling for it their analysis (see, e.g., [15]). This is surprising given that, 30 years ago, John Carroll [16] already stressed the usefulness of applying process-tracing techniques, such as response times, to obtain insights regarding mental mechanisms when investigating taxpayer compliance.
More recently, it has been criticized that the proposition of distinguishing between intuitive and deliberate choices via an examination of response times has led to backward reasoning [17]. Inferring from fast response times that decisions are intuitive risks ignoring "there is a key distinction between the prediction that an automatic process will occur faster than more deliberative computations, and the classification of a choice as intuitive or automatic because it happens more quickly" [17] (p. 2). Labeling fallacies are common in science, and dual processing research is no exception [18]. Thus, in this paper, we are not going to classify what the response time actually means. We are more interested in understanding the potential of non-linearity between response times and tax compliance. For example, Piovesan and Wengström [19] find, in a modified dictator game, that egoistic decision making via choosing the highest payoff for themselves is faster than social behaviour. Public good games 5 have revealed that fast decision-makers are greater or more generous contributors (for a discussion and criticism of those insights, see [20]). This could indicate that total non-compliance or full compliance are the results of clear-cut heuristics that are faster (low-conflict decisions), while partial compliance might be more likely to reveal higher levels or feelings of conflict [13,21]. As Rubinstein [5] has explored the typology of players, and previous tax compliance research has suggested different types of taxpayers [13,22,23], we will explore the response time differences between different types of taxpayers (classified in our case by difference compliance behaviors).

Decision Process
Details of the experimental design and setting can be found in Dulleck et al. [13]. The Queensland University of Technology Faculty Research Ethics Advisory Board reviewed the experiment, confirming that it met the requirements of the National Statement on Ethical Conduct in Human Research. The experiment was conducted with a design structure that closely followed other laboratory studies on tax compliance. We will only focus on two main aspects of the Dulleck et al. experiment [13], namely the voluntary income tax reporting decision based on the percentage of earned income declared and the response time during the income declaration, to explore response times in this study. As the emphasis here is on the response time, we are not interested in any particular features of the experiment, such as treatments variations or parameters used in the experiment. For example, the experiment included a public good structure, where the total tax paid by individuals in a group was increased by a factor 4 For a detailed discussion on the benefits, challenges, and possibilities of exploring response time in cognitive psychology and experimental or behavioural economics see Spiliopoulos and Ortmann [12]. 5 The public goods game models a collective contribution problem in which individuals can decide how much to contribute towards the group. The collective contribution is multiplied by a certain factor and then equally shared among the individuals of the group. In the experiment all the taxes paid by individuals in a group were multiplied with three different factors (0 = no public good, 1, and 2) being equally redistributed to the group members. before being equally redistributed among the group (see, [24][25][26][27]). We will use a "brute force" approach in the regression analysis using group fixed effects to deal with potential variations in response time due to such heterogeneity (i.e., intercept for each of the 45 experimental groups of four participants). We are also not interested in exploring how the response times are linked to any individual experiences throughout the experiment such as being audited or fined 6 . The dynamics of the experiment are controlled for with time fixed effects (i.e., intercept for each of the 16 experimental rounds). Figure 1 shows income tax reporting. Participants received income, declared their income to an experimental "tax authority", and then paid taxes on the declared income received in each of the 16 rounds. Tax compliance is measured as the proportion of income declared (ratio of declared to actual income). The response time (in seconds) is measured from the moment participants saw the declaration screen until they pressed the okay button. Therefore, we interchangeably use the word response time with declaration time. Participants were not informed that response time was recorded, nor was response time mentioned in the experiment design or instruction. It should be noted that the subjects did not face any time constraints when making the income declaration (endogenously arising response time). This could mean that participants at the extreme level of compliance or non-compliance respond quickly, not because they decide more intuitively, but rather because the decision is an easier one for them to make. Fast responses are also fast mistakes due to not paying attention or lacking motivation [19]. However, Recalde et al. [19] point out that the concern regarding errors is greater in one-shot interactions and, in addition, we will show that we obtain the same results throughout all the rounds in the experiment.

Results
In general, we perform two groupings of tax compliance decisions: (1) We look at each individual single declaration decision (2880 decisions, by the amount of earned income declared); and, (2) we look at individual averages out of all the 16 rounds (180 individuals, by the average amount of earned income declared over the entire experiment). We impose six levels of grouping, i.e., full non-compliant (declaring no earned income), full compliant (declared all earned income), and the remaining four partially compliant levels in equally spaced intervals. Table 1 reports the classification of various types of tax compliance and distributions. Not surprisingly, the share of full  [13], p. 17). 6 Nevertheless, we examined the effect of being audited or fined in the previous round and find that it does reduce respond time (on average by~2 s), after controlling for subject fixed effects).
It should also be noted that we chose to use the data from Dulleck et al. [13] due to the reliable quality of the measurement of the response time. The problem is that computer-based experiments are prone to variation that is caused by network traffic issues that can affect the time recording of events, such as when a screen with the information is visible to the participants. Dulleck et al. [13] isolated the laboratory network from the university and assigned its own subnet to reduce such a problem. The recording discrepancy was reduced to less than six milliseconds.

Results
In general, we perform two groupings of tax compliance decisions: (1) We look at each individual single declaration decision (2880 decisions, by the amount of earned income declared); and, (2) we look at individual averages out of all the 16 rounds (180 individuals, by the average amount of earned income declared over the entire experiment). We impose six levels of grouping, i.e., full non-compliant (declaring no earned income), full compliant (declared all earned income), and the remaining four partially compliant levels in equally spaced intervals. Table 1 reports the classification of various types of tax compliance and distributions. Not surprisingly, the share of full compliance or no compliance is lower when looking at individual averages, but the results show that there are individuals who are always or never compliant. Table 1 reports that the overall median declaration time is non-linearly related to the level of tax compliance, in both individual-decisions and individual-average grouping, with the highest and lowest compliant levels having the shortest declaration time. We also see that the median declaration time is shorter in the second half of the experiment as compared to the first half, for all tax compliance levels. We report median declaration time, as its distribution is (right) skewed (based on individual-decisions, mean = 14.49, median = 10.19, s.d. = 14.1, min = 1.08, max = 152.36, N = 2880).  Figure 2 shows the declaration time over the 16 experimental rounds. The declaration time decreases throughout the 16 rounds, which is likely due to the fact that participants were more familiarized with the task. The spike in period 9 can be explained due to the experimental structure. Participants experienced a rule change after round 8 (and before round 9) related to the public good redistribution structure (see [13]) 7 . This explains why participants took more time in their decision process in round 9. In general, the results report a consistent and robust picture throughout all panels. Full non-compliance actions or individuals who never declare any income have shorter declaration times when compared with all the other tax compliance levels. In Table 2, we report the results of paired t-tests (paired by round) on the average/median decision time (in each round) to assess whether the differences in declaration times between levels of tax compliance are statistically significant. Each t-test compares the sample means of individual between two levels of tax compliance (e.g., Full compliant vs. 7 In the first eight rounds, the redistributed amount for each participant in the public good component of the experiment is based on the total contribution all four members to the group. In the remaining rounds, the calculation of redistribution amount only considered the total contribution of the other three members of the group. We also noted a larger impact on decision time for groups all groups except when full non-compliance was observed (see red line Full non-compliant) over rounds, which means that each test has 32 observations (16 rounds for each comparison pair). The response times are significantly lower for full non-compliance when compared to all other groupings (p < 0.001 for all comparisons). Full compliance actions or being always fully compliant are also connected with a tendency for lower response times, but the differences to other compliance groups are not as strong (statistically significant lower response times as compared with most of the partial compliant cases except, for the very low compliance grouping (0-25% of compliance)). Higher levels of partial non-compliance seemed to trigger faster responses than higher levels of partial compliance. Overall, the results are more robust for individual decisions (see panel a and c) due to the larger number of observations. redistribution structure (see [13]) 7 . This explains why participants took more time in their decision process in round 9. In general, the results report a consistent and robust picture throughout all panels. Full non-compliance actions or individuals who never declare any income have shorter declaration times when compared with all the other tax compliance levels. In Table 2, we report the results of paired t-tests (paired by round) on the average/median decision time (in each round) to assess whether the differences in declaration times between levels of tax compliance are statistically significant. Each t-test compares the sample means of individual between two levels of tax compliance (e.g., Full compliant vs. Full non-compliant) over rounds, which means that each test has 32 observations (16 rounds for each comparison pair). The response times are significantly lower for full non-compliance when compared to all other groupings (p < 0.001 for all comparisons). Full compliance actions or being always fully compliant are also connected with a tendency for lower response times, but the differences to other compliance groups are not as strong (statistically significant lower response times as compared with most of the partial compliant cases except, for the very low compliance grouping (0-25% of compliance)). Higher levels of partial non-compliance seemed to trigger faster responses than higher levels of partial compliance. Overall, the results are more robust for individual decisions (see panel a and c) due to the larger number of observations.  In Figure 3, we further explore the distribution of declaration time by different levels of tax compliance, showing the cumulative percentage frequency. The empirical CDF is shown in base-10 log scale to reduce the skewness. We also differentiate between the first half of the experiment (first 8 rounds, see panel c) and the rounds in the second half (period 9 to 16, see d) due to the spike in round 9. For the declaration time of individual decisions, we use the Two-sample Kolmogorov-Smirnov test to test the equality of distributions between different levels of tax compliance (see Table 3). The significance levels are adjusted for multiple comparisons using the Bonferroni method. Previous results are confirmed, with full non-compliance always returning the fastest declaration times. Full compliance shows shorter response times when compared to partial compliance, except for very low compliance levels (the difference is not statistically significant). round 9. For the declaration time of individual decisions, we use the Two-sample Kolmogorov-Smirnov test to test the equality of distributions between different levels of tax compliance (see Table  3). The significance levels are adjusted for multiple comparisons using the Bonferroni method. Previous results are confirmed, with full non-compliance always returning the fastest declaration times. Full compliance shows shorter response times when compared to partial compliance, except for very low compliance levels (the difference is not statistically significant). We next conduct Ordinary Linear Regression (OLS) regressions to show that declaration time is non-linearly related to tax compliance (inverted U-shape). When using individual declaration time as a dependent variable, we control for group fixed effects to take any design choices, such as treatment variations, within-experiment, and time fixed effects, into account to control for dynamic changes over the process of the experiment (Figure 4a). We also assess within-individual differences by controlling for subject fixed effects to remove individual heterogeneity (Figure 4c). To capture non-linearity, we include tax compliance and its squared term as predictors in the regression models. Figure 4 reports the predictions of the models, which nicely demonstrate the non-linear relationship between response time and level of tax compliance. The declaration time increases as the level of tax compliance increases until around 60% of tax compliance level (turning points of the predicted model are 0.589, 0.611, and 0.659 for panel a, b, and c, respectively), where declaration time starts to decline.   Test of distribution equality (K-S test) is performed on logged values of decision time (based 10). D = Kolmogorov-Smirnov statistic; p unadj = unadjusted p-value (two-sided); p Bon = p-value with Bonferroni correction for multiple comparison. sig. = significance. † p < 0.10; * p < 0.05; ** p < 0.01; *** p < 0.001 based on Bonferroni adjusted p-value.
We next conduct Ordinary Linear Regression (OLS) regressions to show that declaration time is non-linearly related to tax compliance (inverted U-shape). When using individual declaration time as a dependent variable, we control for group fixed effects to take any design choices, such as treatment variations, within-experiment, and time fixed effects, into account to control for dynamic changes over the process of the experiment (Figure 4a). We also assess within-individual differences by controlling for subject fixed effects to remove individual heterogeneity (Figure 4c). To capture non-linearity, we include tax compliance and its squared term as predictors in the regression models. Figure 4 reports the predictions of the models, which nicely demonstrate the non-linear relationship between response time and level of tax compliance. The declaration time increases as the level of tax compliance increases until around 60% of tax compliance level (turning points of the predicted model are 0.589, 0.611, and 0.659 for panel a, b, and c, respectively), where declaration time starts to decline.  Ordinary Linear Regression (OLS) while using declaration time as dependent variable with tax compliance and tax compliance squared as predictors, controlling for experiment group and period fixed effects; (b) Prediction obtained from OLS using individual-average declaration time as dependent variable with mean tax compliance and mean tax compliance squared as predictors, controlling for experiment group fixed effects; and, (c) Prediction obtained from OLS using declaration time as dependent variable with tax compliance and tax compliance squared as predictors, controlling for subject and period fixed effects.
The Dulleck et al. [13] experiment also included a cognitive skills task comprising 50 questions to be solved within 12 minutes related to numerical, verbal, and spatial reasoning (see examples in Table A1). We take advantage of the existence of this data to check whether there is a connection between cognitive skill, response time, and tax compliance. The contour plot analysis ( Figure 5) is based on the predicted response time that was obtained from the OLS regression model with full interaction between cognitive skill and tax compliance (and its squared term). It shows that participants with higher cognitive skill, in general, have shorter decision times when compared to those with lower cognitive skills. In addition, participants with relatively high cognitive skill with very low or very high tax compliance level exhibit low response times. Decision time increases with higher levels of tax compliance for subjects with lower cognitive skill, although also here we observe some non-linearity. The Dulleck et al. [13] experiment also included a cognitive skills task comprising 50 questions to be solved within 12 min related to numerical, verbal, and spatial reasoning (see examples in Table A1). We take advantage of the existence of this data to check whether there is a connection between cognitive skill, response time, and tax compliance. The contour plot analysis ( Figure 5) is based on the predicted response time that was obtained from the OLS regression model with full interaction between cognitive skill and tax compliance (and its squared term). It shows that participants with higher cognitive skill, in general, have shorter decision times when compared to those with lower cognitive skills. In addition, participants with relatively high cognitive skill with very low or very high tax compliance level exhibit low response times. Decision time increases with higher levels of tax compliance for subjects with lower cognitive skill, although also here we observe some non-linearity. Games 2019, 10, x FOR PEER REVIEW 3 of 13

Discussion
We find that different levels of tax compliance are connected to different levels of response times based on data from a previous tax compliance experiment [13] well-suited to exploring response times in this context. Extreme behaviors (full non-compliance or full compliance) seemed to facilitate the decision process, possibly due to lower levels of conflicts when making decisions although full evasion is connected with much faster response times than full compliance. For partial tax compliance, the decision time is highest in the range of 60% compliance, which indicates an inverted U-shape function in the relationship between response time and tax compliance. Looking at the participants' cognitive skill levels also indicates substantial non-linearity. Participants with relatively high cognitive skill who report very low or very high tax compliance levels have low response times, while those with higher levels of tax compliance but lower cognitive skill show the highest declaration times. In general, we therefore believe that we contribute to the response time literature by taking a much closer look at potential non-linearity. Our results also confirm the usefulness of working with a typology of subjects, although, here, we classify them by degrees of compliance. Nevertheless, results indicate that one could also classify them in line with Rubinstein [5], based on response time information.

Conclusions
In general, our goal was to show what we can be learned on response times in a tax compliance setting. Future studies could check the robustness of our results. In particular, it would be beneficial to report data from various tax compliance experiments to provide more robust insights as the

Discussion
We find that different levels of tax compliance are connected to different levels of response times based on data from a previous tax compliance experiment [13] well-suited to exploring response times in this context. Extreme behaviors (full non-compliance or full compliance) seemed to facilitate the decision process, possibly due to lower levels of conflicts when making decisions although full evasion is connected with much faster response times than full compliance. For partial tax compliance, the decision time is highest in the range of 60% compliance, which indicates an inverted U-shape function in the relationship between response time and tax compliance. Looking at the participants' cognitive skill levels also indicates substantial non-linearity. Participants with relatively high cognitive skill who report very low or very high tax compliance levels have low response times, while those with higher levels of tax compliance but lower cognitive skill show the highest declaration times. In general, we therefore believe that we contribute to the response time literature by taking a much closer look at potential non-linearity. Our results also confirm the usefulness of working with a typology of subjects, although, here, we classify them by degrees of compliance. Nevertheless, results indicate that one could also classify them in line with Rubinstein [5], based on response time information.

Conclusions
In general, our goal was to show what we can be learned on response times in a tax compliance setting. Future studies could check the robustness of our results. In particular, it would be beneficial to report data from various tax compliance experiments to provide more robust insights as the response time tends to be a noisy variable [11]. Future studies could also test tax declaration decisions that require more decision time (by reporting, e.g., not only income but also deductions), so as to increase the average response time.
Funding: This research received no external funding.