Vis Inertiae and Statistical Inference: A Review of Difference-in-Differences Methods Employed in Economics and Other Subjects
Abstract
1. Introduction to DiD
2. Fundamentals of the Two-Group and Two-Period Homogenous DiD
- C is the expected value of y for the treated group conditional upon the application of the treatment on that group;
- D is the expected value of y for the untreated group conditional upon the absence of the treatment for that group;
- A is the expected value of y for the treated group conditional upon the absence of the treatment;
- B is the expected value of y for the untreated group conditional upon the absence of the treatment.
- is the value of the response variable for a unit in the population under study. Its value is measured in each group and each t, i.e., before and after the introduction of the treatment. It will correspond either to the i-th or to j-th observation at time t depending on the group (treated or untreated) of the unit;
- β0 is the intercept of the regression model, common to treated and untreated units;
- D1 is the Time Period Dummy, that is, a dummy variable that takes the value 0 or 1 depending on whether the h-th observation of the response variable refers to the pre- (D1 = 0) or post-treatment period (D1 = 1) independently of the group (treated or control) the observation belongs to. It simply indicates whether the treatment existed or not in that period t (independently of which unit was treated);
- D2 is the Treatment Indicator Dummy, that is, a dummy variable that takes the value 0 or 1 depending on whether the h-th observation refers to an individual in the control group (untreated) or in the treatment group, respectively, independently of the specific time period t. Therefore, D2 = 0 when the observation belongs to an untreated unit and D2 = 1 when the observation belongs to a treated unit (independently of when the treatment was introduced). Clearly, in the simplified example of this section with only two periods, D2 = 0 means that the unit is never treated. More complex settings are discussed in the subsequent sections;
- D1 × D2 is the interaction term between the time dummy and the treatment dummy. It is the most important coefficient to estimate as it measures the average effect of treatment on treated units.
- (i)
- corresponds to point B, as above, and must be interpreted as the model baseline average (constant);
- (ii)
- , which corresponds to segment AB, is the constant difference between the two groups before the treatment.
- Parallel trend (the response variable for treated and untreated units follows the same time path);
- No anticipation effects (treated units cannot adjust their behaviour on the basis of information on an incoming treatment);
- No spillover impacts of the treatment (the treatment does not have an impact outside the treated units).
2.1. Violations of the Parallel Trend Assumption
2.2. The Stable Unit Treatment Value Assumption (SUTVA) (Rubin, 1978, 1980, 1990a)
2.3. Exogeneity and Identification DiD and Traditional Econometrics
- is the post-intervention average response of the treated group.
- is the pre-intervention average response of the treated group.
- is the post-intervention average response of the control group.
- is the pre-intervention average response of the control group.
3. The OLS Version of the Two-Way Fixed Effects Regression (TWFE)
- is the response variable;
- is a time effect;
- are unit (not group) fixed effects;
- is the dummy (indicator) for whether or not unit h is affected by the treatment in period t (the term D1 × D2 of the last column of Table A1);
- are idiosyncratic, time-varying unobservable factors.
- and are the two periods of interest that, for simplicity, correspond to two years;
3.1. Testing for the Parallel Trends and Anticipation Effects Assumptions in the TWFE Model
3.2. More on the Parallel Trend Assumption
- Selection bias relates to the fixed characteristics of the units ;
- Time trend is the same for treated and untreated units.
3.3. OLS and the Efficiency of the Estimation of the Treatment
4. Simple Worked Examples
4.1. Example n.1: Equivalence Between the OLS Estimation and the Calculation Based on Mean Differences
- The mean Consumption in the Control group before the treatment is as follows:
- The mean Consumption in the Treated group before treatment is as follows:
- The mean Consumption in the Control group after the treatment is as follows:
- The mean Consumption in the Treated group after the treatment is as follows:
- The estimated Constant = 14.2 (with a p-value smaller than 0.05) is the mean value of the Consumption in the control group in 2010 (i.e., before the treatment). We can compare it with the result obtained from the numerical calculation reported above. The two figures coincide.
- If we sum the coefficient Constant and the d2 coefficient, i.e., if we calculate 14.2 + (– 1.6), we obtain 12.6. This is the expected Consumption of the control group in 2011, i.e., during the year of treatment.
- If we sum up the coefficient Constant and the d1 coefficient, i.e., if we calculate 14.2 + 0.28 = 14.48, we obtain the mean value of the Consumption in the treatment group in 2010, i.e., before the treatment.
- The estimated TRET = 5.32 is the (statistically significant) treatment effect. Treated units increase their average consumption by EUR 5.32 with respect to untreated individuals.
4.2. Example n.2: Illustration of Parallel Trends
5. ATET vs. ATE
6. The Confounding Factors
- Assumption of unconditional independence between response and treatment:
- Assumption of conditional (on covariates X) independence between response and treatment:
- (1)
- The covariate is associated with treatment;
- (2)
- There is a time-varying relationship between the covariate and outcomes;
- (3)
- There is differential time evolution in covariate distributions between the treatment and control populations (the covariate must have an effect on the outcome).
7. More than Two Periods with Homogeneity
8. More Than Two Periods with Heterogeneity
- Irreversibility of the treatment or staggered treatment: This assumption posits that once units receive treatment, they remain treated throughout the observation period.
- Parallel Trends Assumption with respect to Never-Treated Units: When we examine groups and periods where treatment is not applied (C = 1), we assume the average potential outcomes for the group initially treated at time g. The group that never received treatment would have followed similar trends in all post-treatment periods t ≥ g. Then, we have T = (1, …, S) and g = (2, …, S) with t ≥ g. However, this assumption relies on two important conditions:
- There must be a sufficiently large group of units that have never received treatment in our data.
- These never-treated units must be similar enough to the units that eventually receive treatment that we can validly compare their outcomes.
In situations where these conditions are not met, we can use an alternative parallel trends assumption that involves the not-yet-treated units as valid comparison groups. - Parallel Trends Assumption with respect to Not-Yet-Treated Units: When we are studying groups treated first at time g, we assume that we can use the units that are not yet treated by time s (where s ≥ t) as valid comparison groups for the group initially treated at time g.
8.1. The Extended TWFE Method (Wooldridge, 2025)
8.2. The Regression Adjusted Method (Callaway & Sant’Anna, 2021)
8.3. The Inverse Probability Weighting Method, IPW, (Callaway & Sant’Anna, 2021) and the Augmented IPW (Callaway & Sant’Anna, 2021)
9. DiD with Complex Data Structure: Clustering and Spatial-Temporal Dependence
- Data showing a grouping or clustering structure;
- Data exhibiting complex dependence generated by spatial and temporal relationships.
9.1. Clustering
9.2. Serial Correlation
- is the status of the response variable of individual h in group g in time t;
- is the time-invariant group effect;
- is the group-invariant time effect;
- is the interaction dummy representing the treatment state in post-treatment period;
- reflects the idiosyncratic variation in the response variable across individuals, groups and time.
9.3. Spatial Dependence
10. The Most Relevant Issues Discussed in This Review and Some Further Research Directions
- As in many causal inference procedures, DiD relies on strong assumptions that may be difficult to test. The key assumption (parallel trends) is that the outcomes of the treated and comparison groups would have evolved similarly in the absence of treatment (under a vis inertiae as alluded in the title). Yet, even in simple 2 units and 2 periods case statistical tests have low power, and the issue becomes more complicated in the multi-unit and multi-period cases. The search for the existence of parallel trends might became a search for the Arabian Phoenix since it requires elaborated statistical tests. The simple graphical appearance of a commune time path of mean realizations in the pre-treatment period might be a misleading suggestion of the perpetuation of a similar potential parallel trend path in the post treatment periods (when counterfactuals cannot be observed);
- Therefore, without a true randomized experiment, tools like DiD do not broaden the range of “natural experiments” we can use to identify causal effects.
- Even in the case of true randomization, SUTVA problems (i.e., the so-called spill-over effects across treated and untreated unites) might plague estimations and make it difficult to identify a DiD model that consistently estimate ATET (which requires unique potential outcome for each individual under each exposure condition).
- Often the interpretation of the role of covariates in DiD estimates is difficult and, sometimes, even what a covariate is might be controversial. In fact, DiD does not require the treated and comparison groups to be balanced on covariates, unlike in cross-sectional OLS studies. Thus, a covariate that differs by treatment group (i.e. that is group specific) and is associated with the response is not necessarily a confounder in DiD. Only covariates that differ by treatment group and are associated with outcome trends are confounders in DiD as these can be the ones that violate the identification assumptions because they are correlated with both the response and the treatment. Since confounders can bias the estimated treatment effect by violating the parallel trends assumption, practitioners should pay extra care in evaluating whether a factor can be considered as a confounder.
- In this review we have discussed linear basic OLS version of DiD application, without discussing any issue related to the functional form. Yet, there may be cases where it can matter whether the “correct” model is a linear probability model, probit or logit, since they may assume different counterfactuals. Determining that two groups would have experienced parallel trends requires, first of all, a justification of the chosen functional forms for the adopted model.
11. Some Examples of DiD Applications
11.1. The Elasticity of Taxable Income (Feldstein, 1995)
- The structural approach (closer to the “old” theoretical analysis of labour responses to income taxation), which separately accounts for each of the potential responses to taxation (intensive and extensive) and then aggregate.
- TI = taxable income (defined as an aggregate measure of income from various sources);
- τ = proportional income tax rate.
- Post is the dummy variable for the reform period (1 if after-reform and 0 pre-reform);
- Treatment is the dummy identifying the treated income group (Treatment = 1) and the untreated group (Treatment = 0);
- Post × Treatment is the DiD variable given by the interaction between the above two;
- δ is the coefficient of interest which measures ATET;
- and ε is the classical error term.
The use of tax return data rather than of a household survey permits analysing the response of taxable income as a whole and not just of labour force participation and working hours. A panel, in which each individual is observed both before and after the change in tax rates, permits a “differences-in-differences” form of estimator that identifies the tax effect in a way that is not available with a single year’s cross section.
- The income growth rate is the same for all income earners (medium, high and highest tax brackets) in the absence of the treatment (“parallel trend assumption”).
- The taxpayers cannot adjust their income in 1985 (last year before reform) to “choose” their change in tax rate through TRA1986 (“no selection into treatment” and no anticipation effect).
- The comparison of taxpayers that vary in the intensity of treatment (instead of comparing taxed to untaxed taxpayers) is legitimate. Implicitly, he needs to assume that the elasticity of taxable income is constant in income, i.e., the same across all income groups. This last assumption will reappear in other papers.
- Estimates of the elasticities are high estimates, ranging from 1 to 3.
- The so-called Laffer rate, i.e., the rate that maximises the tax revenue, changes with the elasticity and corresponds to 1/(1 + ϵ).
- The USA are on the wrong side of the Laffer curve (excessive levels of income tax rates).
- No proper untreated control group is present in the study. Treatment and control groups differ in the intensity of treatment.
- An equal elasticity of taxable income across income distribution is assumed. The elasticity of taxable income is likely higher for high-income taxpayers (with more adjustment opportunities).
- Small and unstratified sample: very few high-income taxpayers are included.
- The presence of increasing earning inequality in the US determined for non-tax reasons should be considered.
- The results may be affected by a regression-to-the-mean bias due to classification of treatment groups by pre-treatment income: rich people in year t may tend to revert to the mean in year t + 1.
- Panel analysis introduces a downward bias in the estimated elasticity if marginal tax rate for rich people decreases.
- It is unclear whether the common trend assumption really holds. Not even the simplest tests are conducted (parallel trends, anticipation effects, etc.).
- Estimated elasticity overestimates welfare loss if the behavioural response involves transfers between individuals.
- The study provides some unclear indications about the effects of changes in MTR on the aggregate income tax yield, but it is silent about taxpayers’ behavioural reactions to income taxation in spite of the claim that “The Tax Reform Act of 1986 is a particularly useful natural experiment for studying the responsiveness of taxpayers to changes in marginal tax rates” (Feldstein, 1995, p. 552). The potential role that confounders (likely affected by the treatment) may play in this estimation is completely ignored.
11.2. Top Income Taxation and the Migration Decisions of Rich Taxpayers (Kleven et al., 2013)
- = total of domestic players in country n;
- = total of foreign players in country n.
- In the graphical analysis, the elasticities of the average tax rate are not presented for the pre-Bosman period and the Danish case studies because of a lack of individual earnings data before 1996. Similarly, the average tax rate elasticity for Spain is based on the 1996–2003 versus 2004–2008 comparison. It is therefore difficult to conduct a complete comparison study (not even graphical).
- The sample used is limited to a very special category of privileged migrants (the well-paid football players whose behaviour is affected by several treatment-related confounding factors). Out-of-sample projections seem problematic.
- The Bosman ruling could have had differential impacts on low-tax and high-tax countries for non-tax reasons. Tax rates may correlate with country size and thus league quality. Better leagues may have benefited more from Bosman ruling.
- Football player contracts are generally signed in advance with respect to the year of the actual transfer and then anticipation effects of the Borman ruling might be present.
- Other factors could have changed from the pre-Bosman to the post-Bosman era that impacted low-tax and high-tax countries differentially.
11.3. Toxic Emissions and the Environment (Zhou et al., 2019; Dong et al., 2022)
11.4. Regulation, Privatisation, Management (Galiani et al., 2005)
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A. An Example with Easy Visualisation of the Dataset
Year | y Consumption Expenditure of an Individual Belonging to a Group Recorded in a Year | D1 = Time Period Treatment Dummy 0 If It Is a Year with No Treatment 1 If It Is a Year When Treatment Existed | D2 = Treatment Application 0 If the Individual Is Never Treated 1 If the Individual Is Treated (Sooner or Later) | TREATMENT = D1 × D2 0 Will Indicate the Individual Is Not Affected by the Tax Policy 1 Will Indicate That in a Certain Year the Individual Is Affected by the Tax Policy |
---|---|---|---|---|
2000 | y1A2000 | 0 | 1 | 0 |
2001 | . | 0 | 1 | 0 |
2002 | . | 0 | 1 | 0 |
2003 | y1A2003 | 1 | 1 | 1 |
2004 | . | 1 | 1 | 1 |
2005 | . | 1 | 1 | 1 |
2006 | y1A2006 | 1 | 1 | 1 |
2000 | y2A2000 | 0 | 1 | 0 |
2001 | . | 0 | 1 | 0 |
2002 | . | 0 | 1 | 0 |
2003 | y2A2003 | 1 | 1 | 1 |
2004 | . | 1 | 1 | 1 |
2005 | . | 1 | 1 | 1 |
2006 | y2A2006 | 1 | 1 | 1 |
……… | ||||
2000 | y1C2000 | 0 | 0 | 0 |
2001 | . | 0 | 0 | 0 |
2002 | . | 0 | 0 | 0 |
2003 | y1C2003 | 1 | 0 | 0 |
2004 | . | 1 | 0 | 0 |
2005 | . | 1 | 0 | 0 |
2006 | y1C2006 | 1 | 0 | 0 |
……… | ||||
2000 | y5C2000 | 0 | 0 | 0 |
2001 | . | 0 | 0 | 0 |
2002 | . | 0 | 0 | 0 |
2003 | y5C2003 | 1 | 0 | 0 |
2004 | . | 1 | 0 | 0 |
2005 | . | 1 | 0 | 0 |
2006 | y5C2006 | 1 | 0 | 0 |
Units ID | TIME | TRET = (D1 × D2) | Response Variable |
---|---|---|---|
1 | 1 | 0 | 0.5 |
1 | 2 | 0 | 0.5 |
1 | 3 | 0 | 0.5 |
1 | 4 | 0 | 0.5 |
1 | 5 | 0 | 0.5 |
1 | 6 | 0 | 0.5 |
1 | 7 | 0 | 0.5 |
1 | 8 | 0 | 0.5 |
1 | 9 | 0 | 0.5 |
1 | 10 | 0 | 0.5 |
2 | 1 | 0 | 1 |
2 | 2 | 0 | 1 |
2 | 3 | 0 | 1 |
2 | 4 | 0 | 1 |
2 | 5 | 1 | 2 |
2 | 6 | 1 | 2 |
2 | 7 | 1 | 2 |
2 | 8 | 1 | 2 |
2 | 9 | 1 | 2 |
2 | 10 | 1 | 2 |
3 | 1 | 0 | 2 |
3 | 2 | 0 | 2 |
3 | 3 | 0 | 2 |
3 | 4 | 0 | 2 |
3 | 5 | 1 | 4 |
3 | 6 | 1 | 4 |
3 | 7 | 1 | 4 |
3 | 8 | 1 | 4 |
3 | 9 | 1 | 4 |
3 | 10 | 1 | 4 |
Consumers’ Id | Time | Consumption EUR | D1 | D2 |
---|---|---|---|---|
1 | 2010 | 12 | 0 | 1 |
2 | 2010 | 9 | 0 | 1 |
3 | 2010 | 13 | 0 | 1 |
4 | 2010 | 14 | 0 | 1 |
5 | 2010 | 15 | 0 | 1 |
6 | 2010 | 13 | 0 | 0 |
7 | 2010 | 14 | 0 | 0 |
8 | 2010 | 13 | 0 | 0 |
9 | 2010 | 16 | 0 | 0 |
10 | 2010 | 15 | 0 | 0 |
1 | 2011 | 15 | 1 | 1 |
2 | 2011 | 17 | 1 | 1 |
3 | 2011 | 19 | 1 | 1 |
4 | 2011 | 18 | 1 | 1 |
5 | 2011 | 22 | 1 | 1 |
6 | 2011 | 13.5 | 1 | 0 |
7 | 2011 | 14 | 1 | 0 |
8 | 2011 | 15 | 1 | 0 |
9 | 2011 | 15.5 | 1 | 0 |
10 | 2011 | 14.4 | 1 | 0 |
Consumers’ Id | Time | Consumption EUR | D1 | D2 |
---|---|---|---|---|
1 | 2009 | 11 | 0 | 1 |
1 | 2010 | 12 | 0 | 1 |
1 | 2011 | 15 | 1 | 1 |
2 | 2009 | 8.6 | 0 | 1 |
2 | 2010 | 9 | 0 | 1 |
2 | 2011 | 17 | 1 | 1 |
3 | 2009 | 12.5 | 0 | 1 |
3 | 2010 | 13 | 0 | 1 |
3 | 2011 | 19 | 1 | 1 |
4 | 2009 | 13 | 0 | 1 |
4 | 2010 | 14 | 0 | 1 |
4 | 2011 | 18 | 1 | 1 |
5 | 2009 | 14 | 0 | 1 |
5 | 2010 | 15 | 0 | 1 |
5 | 2011 | 22 | 1 | 1 |
6 | 2009 | 12 | 0 | 1 |
6 | 2010 | 13 | 0 | 0 |
6 | 2011 | 13.5 | 1 | 0 |
7 | 2009 | 13.7 | 0 | 0 |
7 | 2010 | 14 | 0 | 0 |
7 | 2011 | 14 | 1 | 0 |
8 | 2009 | 12.7 | 0 | 0 |
8 | 2010 | 13 | 0 | 0 |
8 | 2011 | 15 | 1 | 0 |
9 | 2009 | 14.9 | 0 | 0 |
9 | 2010 | 16 | 0 | 0 |
9 | 2011 | 15.5 | 1 | 0 |
10 | 2009 | 14.7 | 0 | 0 |
10 | 2010 | 15 | 0 | 0 |
10 | 2011 | 14.4 | 1 | 0 |
Appendix B. Dataset for Non-Homogeneous DiD Estimation
ID | Year | Consumption | D1 | D2 | TRET | First Year of Treatment |
---|---|---|---|---|---|---|
1 | 2009 | 11 | 1 | 0 | 0 | 2011 |
1 | 2010 | 12 | 1 | 0 | 0 | 2011 |
1 | 2011 | 15 | 1 | 1 | 1 | 2011 |
1 | 2012 | 14.8 | 1 | 1 | 1 | 2011 |
1 | 2013 | 15.8 | 1 | 1 | 1 | 2011 |
1 | 2014 | 17 | 1 | 1 | 1 | 2011 |
2 | 2009 | 8.6 | 1 | 0 | 0 | 2011 |
2 | 2010 | 9 | 1 | 0 | 0 | 2011 |
2 | 2011 | 17 | 1 | 1 | 1 | 2011 |
2 | 2012 | 18 | 1 | 1 | 1 | 2011 |
2 | 2013 | 18.8 | 1 | 1 | 1 | 2011 |
2 | 2014 | 19.1 | 1 | 1 | 1 | 2011 |
3 | 2009 | 12.5 | 1 | 0 | 0 | 2011 |
3 | 2010 | 13 | 1 | 0 | 0 | 2011 |
3 | 2011 | 19 | 1 | 1 | 1 | 2011 |
3 | 2012 | 19.8 | 1 | 1 | 1 | 2011 |
3 | 2013 | 21 | 1 | 1 | 1 | 2011 |
3 | 2014 | 22 | 1 | 1 | 1 | 2011 |
4 | 2009 | 13 | 1 | 0 | 0 | 2011 |
4 | 2010 | 14 | 1 | 0 | 0 | 2011 |
4 | 2011 | 18 | 1 | 1 | 1 | 2011 |
4 | 2012 | 19.1 | 1 | 1 | 1 | 2011 |
4 | 2013 | 22 | 1 | 1 | 1 | 2011 |
4 | 2014 | 21.8 | 1 | 1 | 1 | 2011 |
5 | 2009 | 14 | 1 | 0 | 0 | 2011 |
5 | 2010 | 15 | 1 | 0 | 0 | 2011 |
5 | 2011 | 22 | 1 | 1 | 1 | 2011 |
5 | 2012 | 21.9 | 1 | 1 | 1 | 2011 |
5 | 2013 | 22.2 | 1 | 1 | 1 | 2011 |
5 | 2014 | 22 | 1 | 1 | 1 | 2011 |
6 | 2009 | 12 | 0 | 0 | 0 | Never treated |
6 | 2010 | 13 | 0 | 0 | 0 | Never treated |
6 | 2011 | 13.5 | 0 | 1 | 0 | Never treated |
6 | 2012 | 13.9 | 0 | 1 | 0 | Never treated |
6 | 2013 | 14.2 | 0 | 1 | 0 | Never treated |
6 | 2014 | 15.1 | 0 | 1 | 0 | Never treated |
7 | 2009 | 13.7 | 0 | 0 | 0 | Never treated |
7 | 2010 | 14 | 0 | 0 | 0 | Never treated |
7 | 2011 | 14 | 0 | 1 | 0 | Never treated |
7 | 2012 | 14.9 | 0 | 1 | 0 | Never treated |
7 | 2013 | 15.1 | 0 | 1 | 0 | Never treated |
7 | 2014 | 14.9 | 0 | 1 | 0 | Never treated |
8 | 2009 | 12.7 | 0 | 0 | 0 | Never treated |
8 | 2010 | 13 | 0 | 0 | 0 | Never treated |
8 | 2011 | 15 | 0 | 1 | 0 | Never treated |
8 | 2012 | 15.5 | 0 | 1 | 0 | Never treated |
8 | 2013 | 16.1 | 0 | 1 | 0 | Never treated |
8 | 2014 | 17.2 | 0 | 1 | 0 | Never treated |
9 | 2009 | 14.9 | 0 | 0 | 0 | Never treated |
9 | 2010 | 16 | 0 | 0 | 0 | Never treated |
9 | 2011 | 15.5 | 0 | 1 | 0 | Never treated |
9 | 2012 | 16 | 0 | 1 | 0 | Never treated |
9 | 2013 | 16.7 | 0 | 1 | 0 | Never treated |
9 | 2014 | 17 | 0 | 1 | 0 | Never treated |
10 | 2009 | 14.7 | 0 | 0 | 0 | Never treated |
10 | 2010 | 15 | 0 | 0 | 0 | Never treated |
10 | 2011 | 14.4 | 0 | 1 | 0 | Never treated |
10 | 2012 | 15 | 0 | 1 | 0 | Never treated |
10 | 2013 | 15.7 | 0 | 1 | 0 | Never treated |
10 | 2014 | 16.1 | 0 | 1 | 0 | Never treated |
11 | 2009 | 13.1 | 1 | 0 | 0 | 2012 |
11 | 2010 | 14 | 1 | 0 | 0 | 2012 |
11 | 2011 | 14.8 | 1 | 0 | 0 | 2012 |
11 | 2012 | 16 | 1 | 1 | 1 | 2012 |
11 | 2013 | 16.2 | 1 | 1 | 1 | 2012 |
11 | 2014 | 15.5 | 1 | 1 | 1 | 2012 |
12 | 2009 | 12.9 | 1 | 0 | 0 | 2012 |
12 | 2010 | 13.3 | 1 | 0 | 0 | 2012 |
12 | 2011 | 14.7 | 1 | 0 | 0 | 2012 |
12 | 2012 | 16.1 | 1 | 1 | 1 | 2012 |
12 | 2013 | 16.7 | 1 | 1 | 1 | 2012 |
12 | 2014 | 18 | 1 | 1 | 1 | 2012 |
13 | 2009 | 12 | 1 | 0 | 0 | 2013 |
13 | 2010 | 12.8 | 1 | 0 | 0 | 2013 |
13 | 2011 | 13 | 1 | 0 | 0 | 2013 |
13 | 2012 | 13.9 | 1 | 0 | 0 | 2013 |
13 | 2013 | 15.4 | 1 | 1 | 1 | 2013 |
13 | 2014 | 16 | 1 | 1 | 1 | 2013 |
- DEPENDENT VARIABLE: CONSUMPTION
- COACTOR: INCOME
- HETEROGENOUS TREATMENT: A Consumption Credit (for instance a policy measure that supports consumption (for instance a consumption local credit card with public warrant.
DiD DUMMIES D1 = 0 if the consumer was never treated D1 = 1 if the consumer was treated, sooner or later D2 = 0 if the treatment did not exist in that year for that consumer D2 = 1 if the treatment exists in that year for that consumer | |||
ID: CONSUMERS 1 to 5 are Treated from 2011 6 to 10 are Never Treated 11 to 12 are Treated from 2012 13 is Treated from 2013 to 2014 | |||
TREATMENT TIMING From 2009 to 2010, no treatment existed. From 2011 to 2012, there was a treatment on individuals 1, 2, 3, 4, and 5. In 2012, a treatment was extended to individuals 11 and 12. In 2013, a treatment further extended to individuals of unit 13. | |||
SUMMARY OF UNITS AND COHORTS | |||
Cohorts | Units and Observations | ||
Never Treated Units | 5 units | 30 Observations | |
First Cohort | Units Treated from 2011 | 5 units | 30 Observations |
Second Cohort | Units Treated from 2012 | 2 units | 12 Observations |
Third Cohort | Units Treated from 2013 | 1 unit | 6 Observations |
The units treated since 2011 form the first cohort, and so on. Once the treatment is introduced, each unit in the treated cohort remains treated until the end of the sample period. TWFE, RA and IPW estimations of ATET can be obtained by applying the methods presented in Section 8 and following. |
ATET (SE in Parenthesis) | |||||
---|---|---|---|---|---|
Cohorts | YEARS | TWFE | RA | IPW | AIPW |
2010 | // | 0.18 (0.20) | 0.18 (0.20) | 0.18 (0.2) | |
2011 | 5.20 *** (0.99) | 5.32 *** (0.93) | 5.32 *** (0.93) | 5.32 *** (0.93) | |
2011 | 2012 | 5.22 *** (1.1) | 5.26 *** (1.01) | 5.26 *** (1.01) | 5.26 *** (1.01) |
2013 | 6.0 *** (1.06) | 6 *** (0.97) | 6 *** (0.97) | 6 *** (0.97) | |
2014 | 5.92 *** (1.03) | 5.92 *** (0.96) | 5.92 *** (0.96) | 5.92 *** (0.96) | |
2010 | // | 0.05 (0.24) | 0.05 (0.24) | 0.05 (0.24) | |
2011 | // | 0.82 (0.47) | 0.82 (0.47) | 0.82 (0.47) | |
2012 | 2012 | 1.23 ** (0.31) | 0.72 *** (0.10) | 0.72 *** (0.10) | 0.72 *** (0.10) |
2013 | 1.17 ** (0.44) | 0.62 * (0.23) | 0.62 * (0.23) | 0.62 * (0.23) | |
2014 | 0.97 (1.27) | 0.42 (0.94) | 0.42 (0.94) | 0.42 (0.94) | |
2010 | // | 0.2 (0.16) | 0.2 (0.16) | 0.2 (0.16) | |
2011 | // | −0.08 (0.42) | −0.08 (0.42) | −0.08 (0.42) | |
2013 | 2012 | // | 0.32 *** (0.07) | 0.32 ** (0.08) | 0.32 ** (0.08) |
2013 | 1.5 *** (0.22) | 1 *** (0.09) | 1 *** (0.09) | 1 *** (0.09) | |
2014 | 1.35 ** (0.43) | 1.1 ** (0.24) | 1.1 (0.25) | 1.1 (0.25) | |
Overall ATET | 4.32 ** (1.11) | 4.22 *** (1.003) | 4.22 *** (1.00) | 4.22 *** (1.00) | |
Average ATET by years | |||||
2011 | 5.2 *** (0.99) | 5.32 *** (0.93) | 5.32 *** (0.93) | 5.32 *** (0.93) | |
2012 | 4.08 ** (1.16) | 3.96 ** (1.05) | 3.96 ** (1.05) | 3.96 ** (1.05) | |
2013 | 4.2 ** (1.21) | 4.03 ** (1.08) | 4.03 ** (1.08) | 4.03 ** (1.08) | |
2014 | 4.11 ** (1.28) | 3.94 ** (1.13) | 3.94 ** (1.13) | 3.94 ** (1.13) |
1 | We lastly recall that software packages useful to implement basic and more advanced DiD methods can be found in the following websites (listed in alphabetic order): R®: https://asjadnaqvi.github.io/DiD/docs/02_R/; Stata®: https://asjadnaqvi.github.io/DiD/docs/01_stata/. We did not access the websites during the editing of this paper. |
2 | The present review does not address the domain of healthcare, as readers can access numerous DiD studies that have been utilised to evaluate novel policies and healthcare programs. For instance, in the United States, numerous studies have estimated the effects of expanded Medicaid eligibility through the Affordable Care Act (ACA). In the aftermath of the Supreme Court’s decision regarding the ACA, each state within the United States is empowered to determine its own Medicaid eligibility criteria, with the option to expand its threshold. This methodological framework enabled the establishment of groups comprising treated states and comparison (untreated) states, thereby facilitating the implementation of DiD. These studies have contributed to ongoing policy debates in the US regarding the future of the ACA, and readers are encouraged to consult the relevant literature (see Zeldow & Hatfield, 2021 for an introduction). |
3 | Consequences of non-random assignment are discussed, among others, by Cerulli (2015, p. 17). |
4 | Researchers can then also find the breakdown point—how much of a deviation from the pre-existing difference in trends is needed before we can no longer reject the null of no parallel trend. |
5 | The authors acknowledge that previous researchers have stressed that p-values are not meant to express the strength of evidence in favour of the null. |
6 | According to the authors, this procedure does not simply control for this covariate but rather allows for its use in a 2SLS or GMM estimator. |
7 | Callaway (2022) discusses an ampler set of estimation strategies. According to Callaway (2022, p. 4), all of them explicitly make, in a first step, the same good comparisons that show up in the TWFE regression (i.e., the comparisons that use units that become treated relative to units that are not-yet-treated) while explicitly avoiding the “bad comparisons” that show up in the TWFE regression (i.e., the comparisons that use already-treated units as the comparison group). Then, in a second step, they combine these underlying treatment effect parameters into target parameters of interest such as an overall average treatment effect on the treated. See Section Alternative Approaches in Callaway (2022, p. 20). |
8 | Several different ICC statistics have been proposed, not all of which estimate the same population parameter. There has been considerable debate about which ICC statistics are appropriate for a given use, since they may produce markedly different results for the same data. |
9 | A list of bias correction procedures is provided by Angrist and Pischke (2009, pp. 320–322). |
10 | In a later paper (Feldstein, 1999) he also argues that traditional analyses of the income tax greatly underestimate deadweight losses by ignoring its effect on forms of compensation and patterns of consumption. He calculated the full deadweight loss using the compensated elasticity of taxable income to changes in tax rates because leisure, excludable income, and deductible consumption are assumed (by Feldstein) to be a Hicksian composite good. According to his estimations a deadweight loss of as much as 30% of revenue or more than ten times Harbergers classic 1964 estimate. The relative deadweight loss caused by increasing existing tax rates is substantially greater and, according to Feldstein’s results, may exceed USD 2 per USD 1 of revenue. Some enormous measure, one should say! |
11 | The treatment incorporated in the Feldstein’s analysis was the 1986 US tax reform that lowered marginal tax rates, and simultaneously broadened tax bases. The two elements were designed to net out. Approximately no revenue and distributional effects absent behavioural responses means that approximately there are no income effects. Important as the aim is to estimate the compensated elasticity of taxable income. |
12 | This review does not discuss synthetic controls. One should see Abadie et al. (2015). A synthetic control can be constructed as a weighted average of several units combined to recreate the trajectory that the response variable of a treated unit would have followed in the absence of the treatment. Recent advances in this field can be found in Y. Sun et al. (2025). |
13 | This is in contrast with the view that the appropriate tax rate for decisions on the intensive margin is the marginal tax rate (MTR = tax rate on the last euro earned). In the paper ATR is not exact but approximated (for a subsample of football players). Since these taxpayers earn very high salaries, authors approximate the ATR by the top marginal tax rate (MTR). An alternative, and possible more reliable procedure is followed by Moretti and Wilson (2017). By focusing on the locational outcomes of star scientists, defined as scientists with patent counts in the top 5 percent of the distribution, their paper quantifies how sensitive is migration by these stars to changes in personal and business tax differentials across states in the USA. The study uncovers large, stable, and precisely estimated effects of personal and corporate taxes on star scientists, migration patterns. The long-run elasticity of mobility relative to taxes is 1.8 for personal income taxes, 1.9 for state corporate income tax, and −1.7 for the investment tax credit. |
References
- Abadie, A., Diamond, A., & Hainmueller, J. (2015). Comparative politics and the synthetic control method. American Journal of Political Science, 59(2), 495–510. [Google Scholar] [CrossRef]
- Ahrens, A., Aitken, C., & Schaffer, M. E. (2021). Using machine learning methods to support causal inference in econometrics. In S. Sriboonchitta, V. Kreinovich, & W. Yamaka (Eds.), Behavioral predictive modeling in economics (pp. 23–52). Springer International Publishing. [Google Scholar] [CrossRef]
- Ahrens, A., Chernozhukov, V., Hansen, C., Kozbur, D., Schaffer, M., & Wiemann, T. (2025). An introduction to double/debiased machine learning. arXiv, arXiv:2504.08324. [Google Scholar] [CrossRef]
- Alvarez, L., & Ferman, B. (2020). Inference in difference-in-differences with few treated units and spatial correlation. arXiv, arXiv:2006.16997. [Google Scholar]
- Alvarez, L., Ferman, B., & Wüthrich, K. (2025). Inference with few treated units. arXiv, arXiv:2504.19841. [Google Scholar] [CrossRef]
- Angrist, J. D., GImbens, W., & Rubin, D. B. (1996). Identification of causal effects using instrumental variables. Journal of the American Statistical Association, 91(434), 444–455. [Google Scholar] [CrossRef]
- Angrist, J. D., & Pischke, J.-S. (2009). Mostly harmless econometrics: An empiricist’s companion. Princeton University Press. [Google Scholar]
- Angrist, J. D., & Pischke, J.-S. (2010). The Credibility revolution in empirical economics: How better research design is taking the con out of econometrics. Journal of Economic Perspectives, 24(2), 3–30. [Google Scholar] [CrossRef]
- Ashenfelter, O., & Card, D. (1985). Using the longitudinal structure of earnings to estimate the effect of training programs. The Review of Economics and Statistics, 67(4), 648–660. [Google Scholar] [CrossRef]
- Baker, A., Callaway, B., Cunningham, S., Goodman-Bacon, A., & Sant’Anna, P. (2025). Difference-in-differences designs: A practitioner’s guide. arXiv, arXiv:2503.13323. [Google Scholar] [CrossRef]
- Bell, R. M., & McCaffrey, D. F. (2002). Bias reduction in standard errors for linear regression with multi-stage samples. Survey Methodology, 28(2), 169–181. [Google Scholar]
- Bertrand, M., Duflo, E., & Mullainathan, S. (2004). How much should we trust difference-in-differences estimates? Quarterly Journal of Economics, 119, 249–275. [Google Scholar] [CrossRef]
- Bester, C. A., Conley, T. G., & Hansen, C. B. (2011). Inference with dependent data using cluster covariance estimators. Journal of Econometrics, 165, 137–151. [Google Scholar] [CrossRef]
- Bilinski, A., & Hatfield, L. (2020, August 5). Nothing to see here? Non-inferiority approaches to parallel trends and other model assumptions. JSM 2020 Virtual Conference, in Virtual. Available online: https://ww2.amstat.org/meetings/jsm/2020/onlineprogram/AbstractDetails.cfm?abstractid=312323 (accessed on 10 August 2025).
- Borusyak, K., & Jaravel, X. (2018). Revisiting event study designs. Social science research network. SSRN Scholarly Paper ID 2826228. Available online: https://scholar.harvard.edu/files/borusyak/files/borusyak_jaravel_event_studies.pdf (accessed on 10 August 2025).
- Bosco, B. P., Bosco, C. F., & Maranzano, P. (2025). Labour responsiveness to income tax changes: Empirical evidence from a DID analysis of an income tax treatment in Italy. Empirical Economics, 69(2), 787–828. [Google Scholar] [CrossRef]
- Callaway, B. (2022). Difference-in-differences for policy evaluation. In K. F. Zimmermann (Ed.), Handbook of labor, human resources and population (pp. 1–61). Springer. [Google Scholar] [CrossRef]
- Callaway, B., & Sant’Anna, P. (2021). Difference-in-differences with multiple time periods. Journal of Econometrics, 225, 200–230. [Google Scholar] [CrossRef]
- Cameron, A. C., Gelbach, J. B., & Miller, D. L. (2008). Bootstrap-based improvements for inference with clustered errors. The Review of Economics and Statistics, 90(3), 414–427. [Google Scholar] [CrossRef]
- Cameron, A. C., & Miller, D. L. (2015). A practitioner’s guide to cluster-robust inference. Journal of Human Resources, 50(2), 317–372. [Google Scholar] [CrossRef]
- Cameron, A. C., & Trivedi, P. K. (2005). Microeconometrics: Methods and applications. Cambridge University Press. [Google Scholar]
- Card, D., & Krueger, A. (1994). Minimum wages and employment: A case study of the fast-food industry in new jersey and Pennsylvania. American Economic Review, 84(4), 772–793. [Google Scholar]
- Cerqua, A., Letta, M., & Menchetti, F. (2022). Losing control (group)? The machine learning control method for counterfactual forecasting. SSRN. [Google Scholar]
- Cerqua, A., Letta, M., & Menchetti, F. (2023). The machine learning control method for counterfactual forecasting. arXiv, arXiv:2312.05858. [Google Scholar] [CrossRef]
- Cerqua, A., Letta, M., & Menchetti, F. (2024). Causal inference and policy evaluation without a control group. arXiv, arXiv:2312.05858. [Google Scholar]
- Cerulli, G. (2015). Econometric evaluation of socio-economic programs. In Theory and applications. Springer-Verlag GmbH. [Google Scholar]
- Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., & Newey, W. (2017). Double/debiased/neyman machine learning of treatment effects. American Economic Review, 107(5), 261–265. [Google Scholar] [CrossRef]
- Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W., & Robins, J. (2016). Double/debiased machine learning for treatment and causal parameters. arXiv, arXiv:1608.00060. [Google Scholar]
- Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W., & Robins, J. (2018). Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal, 21(1), C1–C68. [Google Scholar] [CrossRef]
- Chernozhukov, V., Newey, W. K., & Singh, R. (2022). Debiased machine learning of global and local parameters using regularized Riesz representers. The Econometrics Journal, 25(3), 576–601. [Google Scholar] [CrossRef]
- Chesnaye, N., Stel, S., Tripepi, G., Dekker, F. W., Fu, E. L., Zoccali, G., & Jager, K. J. (2022). An introduction to inverse probability of treatment weighting in observational research. Clinical Kidney Journal, 15(1), 14–20. [Google Scholar] [CrossRef]
- Cole, S. R., & Frangakis, C. E. (2009). The consistency statement in causal inference: A definition or an assumption? Epidemiology, 20(1), 3–5. [Google Scholar] [CrossRef]
- Courtemanche, C. J., & Zapata, D. (2014). Does universal coverage improve health? The Massachusetts experience. Journal of Policy Analysis and Management, 33, 36–69. [Google Scholar] [CrossRef]
- Cox, D. R. (1958). Planning of experiments, Wiley series in probability and statistics—Applied probability and statistics section. John Wiley & Sons Inc. [Google Scholar]
- de Chaisemartin, C., & D’Haultfoeuille, X. (2020). Two-way fixed effects estimators with heterogeneous treatment effects. American Economic Review, 110(9), 2964–2996. [Google Scholar] [CrossRef]
- de Chaisemartin, C., & D’Haultfoeuille, X. (2023). Two-way fixed effects estimators with heterogeneous treatment effects: A survey. Econometrics Journal, 26, C1–C30. [Google Scholar] [CrossRef]
- Delgado, M. S., & Florax, R. J. G. M. (2015). Difference-in-differences techniques for spatial data: Local autocorrelation and spatial interaction. Economics Letters, 137, 123–126. [Google Scholar] [CrossRef]
- Donald, S. G., & Lang, K. (2007). Inference with difference-in-differences and other panel data. The Review of Economics and Statistics, 89(2), 221–233. [Google Scholar] [CrossRef]
- Dong, F., Li, Y., Qin, C., Zhang, X., Chen, Y., Zhao, X., & Wang, C. (2022). Information infrastructure and greenhouse gas emission performance in urban China: A difference-in-differences analysis. Journal of Environmental Management, 316, 115252. [Google Scholar] [CrossRef]
- Elhorst, J. P. (2010). Applied spatial econometrics: Raising the bar. Spatial Economic Analysis, 5(1), 9–28. [Google Scholar] [CrossRef]
- Farbmacher, H., Huber, M., Lafférs, L., Langen, H., & Spindler, M. (2022). Causal mediation analysis with double machine learning. The Econometrics Journal, 25(2), 277–300. [Google Scholar] [CrossRef]
- Feldstein, M. (1995). The effect of marginal tax rates on taxable income: A panel study of the 1986 tax reform act. Journal of Political Economy, 103, 551–572. [Google Scholar] [CrossRef]
- Feldstein, M. (1999). Tax avoidance and the deadweight loss of the income tax. The Review of Economics and Statistics, 81(4), 674–680. [Google Scholar] [CrossRef]
- Fredriksson, A., & de Oliveira, G. M. (2019). Impact evaluation using difference-in-differences. RAUSP Management Journal, 54(4), 519–532. [Google Scholar] [CrossRef]
- Freyaldenhoven, S., Hansen, C., & Shapiro, J. (2019). Pre-event trends in the panel event-study design. American Economic Review, 109, 3307–3338. [Google Scholar] [CrossRef]
- Galiani, S., Gertler, P., & Schargrodsky, E. (2005). Water for life: The impact of the privatization of water services on child mortality. Journal of Political Economy, 113, 83–120. [Google Scholar] [CrossRef]
- Gertler, P. J., Martinez, S., Premand, P., Rawlings, L. B., & Vermeersch, C. M. (2016). Impact evaluation in practice. The World Bank. [Google Scholar]
- Goodman-Bacon, A. (2021). Difference-in-differences with variation in treatment timing. Journal of Econometrics, 225(2), 254–277. [Google Scholar] [CrossRef]
- Huber, M., & Steinmayr, A. (2021). A framework for separating individual-level treatment effects from spillover effects. Journal of Business and Economic Statistics, 39(2), 422–436. [Google Scholar] [CrossRef]
- Imbens, G. W., & Kolesar, M. (2016). Robust standard errors in small samples: Some practical advice. Review of Economics and Statistics, 98(4), 701–712. [Google Scholar] [CrossRef]
- Imbens, G. W., & Rubin, D. B. (2015). Causal inference for statistics, social, and biomedical sciences: An introduction. Cambridge University Press. [Google Scholar]
- Kahn-Lang, A., & Lang, K. (2020). The promise and pitfalls of differences-in-differences: Reflections on 16 and pregnant and other applications. Journal of Business and Economic Statistics, 38, 613–620. [Google Scholar] [CrossRef]
- Kleven, K. J., Landais, C., & Saez, E. (2013). Taxation and international migration of superstars: Evidence from the European football market. American Economic Review, 103(5), 1892–1924. [Google Scholar] [CrossRef]
- Knaus, M. C. (2022). Double machine learning-based programme evaluation under unconfoundedness. The Econometrics Journal, 25(3), 602–627. [Google Scholar] [CrossRef]
- Laffers, L., & Mellace, G. (2020). Identification of the average treatment effect when SUTVA is violated (Vol. 3). Discussion Papers on Business and Economics, University of Southern Denmark. [Google Scholar]
- MacKinnon. (2019). How cluster-robust inference is changing applied econometrics. Canadian Journal of Economics/Revue canadienne d’économique, 52(3), 851–881. [Google Scholar] [CrossRef]
- MacKinnon, J. G., Ørregaard Nielsen, M., & Webb, M. D. (2023). Cluster-robust inference: A guide to empirical practice. Journal of Econometrics, 232(2), 272–299. [Google Scholar] [CrossRef]
- MacKinnon, J. G., & Webb, M. D. (2018). The wild bootstrap for few (treated) clusters. Econometrics Journal, 21, 114–135. [Google Scholar] [CrossRef]
- Manski, F. C., & Pepper, J. V. (2018). How Do Right-to-Carry Laws Affect Crime Rates? Coping with Ambiguity Using Bounded-Variation Assumptions. The Review of Economics and Statistics, 100(2), 232–244. [Google Scholar] [CrossRef]
- Moretti, E., & Wilson, D. J. (2017). The effect of state taxes on the geographical location of top earners: Evidence from star scientists. American Economic Review, 107(7), 1858–1903. [Google Scholar] [CrossRef]
- Myint, L. (2024). Controlling time-varying confounding in difference-in-differences studies using the time-varying treatments framework. Health Services and Outcomes Research Methodology, 24, 95–111. [Google Scholar] [CrossRef]
- Ogburn, E. L., Shpitser, I., & Lee, Y. (2020). Causal inference, social networks and chain graphs. Journal of the Royal Statistical Society Series A: Statistics in Society, 183(4), 1659–1676. [Google Scholar] [CrossRef] [PubMed]
- Ogburn, E. L., Sofrygin, O., Díaz, I., & van der Laan, M. J. (2024). Causal inference for social network data. Journal of the American Statistical Association, 119(545), 597–611. [Google Scholar] [CrossRef]
- Pernagallo, G., & Vitali, G. (2025). Housing market investment in peripheral areas: Evidence from Italy. Papers in Regional Science, 104(5), 100113. [Google Scholar] [CrossRef]
- Qiu, F., & Tong, Q. (2021). A spatial difference-in-differences approach to evaluate the impact of light rail transit on property values. Economic Modelling, 99, 105496. [Google Scholar] [CrossRef]
- Rambachan, A., & Roth, J. (2023). A more credible approach to parallel trends. Review of Economic Studies, 90, 2555–2591. [Google Scholar] [CrossRef]
- Rosenbaum, P. R. (1984). The consequences of adjustment for a concomitant variable that has been affected by the treatment. Journal of the Royal Statistical Society. Series A (General), 147(5), 656–666. [Google Scholar] [CrossRef]
- Roth, J. (2022). Pretest with caution: Event-study estimates after testing for parallel trends. American Economic Review: Insights, 4(3), 305–322. [Google Scholar]
- Roth, J., Sant’Anna, P., Bilinski, A., & Poe, J. (2023). What’s trending in difference-in-differences? A synthesis of the recent econometrics literature. Journal of Econometrics, 235, 2218–2244. [Google Scholar] [CrossRef]
- Rubin, D. B. (1978). Bayesian inference for causal effects: The role of randomization. Annals of Statistics, 6, 34–58. [Google Scholar] [CrossRef]
- Rubin, D. B. (1980). Randomization analysis of experimental data: The fisher randomization test comment. Journal of the American Statistical Association, 75(371), 591–593. [Google Scholar] [CrossRef]
- Rubin, D. B. (1990a). Formal modes of statistical inference for causal effects. Journal of Statistical Planning and Inference, 25, 279–292. [Google Scholar] [CrossRef]
- Rubin, D. B. (1990b). On the application of probability theory to agricultural experiments. Essay on principles. Section 9.] Comment: Neyman (1923) and causal inference in experiments and observational studies. Statistical Science, 5(4), 472–480. [Google Scholar] [CrossRef]
- Schnabl, P. (2012). The international transmission of bank liquidity shocks: Evidence from an emerging market. The Journal of Finance, 67, 897–932. [Google Scholar] [CrossRef]
- Schwartz, S., Gatto, N. M., & Campbell, U. B. (2012). Extending the sufficient component cause model to describe the Stable Unit Treatment Value Assumption (SUTVA). Epidemiologic Perspectives and Innovations, 9(1), 3. [Google Scholar] [CrossRef]
- Shortreed, S. M., & Ertefaie, A. (2017). Outcome-adaptive lasso: Variable selection for causal inference. Biometrics, 73(4), 1111–1122. [Google Scholar] [CrossRef]
- Sobel, M. E. (2006). What do randomized studies of housing mobility demonstrate? Journal of the American Statistical Association, 101(476), 1398–1407. [Google Scholar] [CrossRef]
- Souto, H. G., & Neto, F. L. (2025). Forests for differences: Robust causal inference beyond parametric DiD. arXiv, arXiv:2505.09706. [Google Scholar] [CrossRef]
- Sun, S., & Delgado, M. S. (2024). Local spatial difference-in-differences models: Treatment correlations, response interactions, and expanded local models. Empirical Economics, 67(5), 2077–2107. [Google Scholar] [CrossRef]
- Sun, Y., Xie, H., & Zhang, Y. (2025). Difference-in-differences meets synthetic control: Doubly robust identification and estimation. arXiv, arXiv:2503.11375. [Google Scholar]
- VanderWeele, T. J. (2009). Concerning the consistency assumption in causal inference. Epidemiology, 20(6), 880–883. [Google Scholar] [CrossRef]
- VanderWeele, T. J. (2010). Direct and indirect effects for neighborhood-based clustered and longitudinal data. Sociological Methods and Research, 38(4), 515–544. [Google Scholar] [CrossRef] [PubMed]
- VanderWeele, T. J., Tchetgen, E. J. T., & Halloran, M. E. (2015). Interference and sensitivity analysis. Statistical Science: A Review Journal of the Institute of Mathematical Statistics, 29(4), 687. [Google Scholar] [CrossRef]
- Wang, Y. (2021). Causal inference with panel data under temporal and spatial interference. arXiv, arXiv:2106.15074. [Google Scholar]
- Wang, Y., Samii, C., Chang, H., & Aronow, P. (2024). Design-based inference for spatial experiments under unknown interference. arXiv, arXiv:2010.13599. [Google Scholar] [CrossRef]
- Wing, C., Simon, K., & Bello-Gomez, R. A. (2018). Designing Difference in Difference Studies: Best Practices for Public Health Policy Research. Annual Review of Public Health, 39, 453–469. [Google Scholar] [CrossRef]
- Wooldridge, J. M. (2010). Econometric analysis of cross section and panel data (2nd ed.). The MIT Press. [Google Scholar]
- Wooldridge, J. M. (2023). Simple approaches to nonlinear difference-in-differences with panel data. Econometrics Journal, 26, C31–C66. [Google Scholar] [CrossRef]
- Wooldridge, J. M. (2025). Two-way fixed effects, the two-way mundlak regression, and difference-in-differences estimators. Empir Econ. [Google Scholar] [CrossRef]
- Xu, Y. (2024). Causal inference with time-series cross-sectional data: A reflection. In J. M. Box-Steffensmeier, D. P. Christenson, & V. Sinclair-Chapman (Eds.), Oxford handbook of engaged methodological pluralism in political science. Oxford University Press. [Google Scholar] [CrossRef]
- Yao, L., Chu, Z., Li, S., Li, Y., Gao, J., & Zhang, A. (2021). A survey on causal inference. ACM Transactions on Knowledge Discovery from Data, 15(5), 74. [Google Scholar] [CrossRef]
- Zeldow, B., & Hatfield, L. A. (2021). Confounding and regression adjustment in difference-in-differences studies. Health Services Research, 56, 932–941. [Google Scholar] [CrossRef]
- Zhou, B., Zhang, C., Song, H., & Wang, Q. (2019). How does emission trading reduce China’s carbon intensity? An exploration using a decomposition and difference-indifferences approach. Science of the Total Environment, 676, 514–523. [Google Scholar] [CrossRef]
D1 = 0 | D1 = 1 | |
---|---|---|
D2 = 0 | ||
D2 = 1 |
Control | Treated | |
---|---|---|
Pre-Treatment | 14.2 | 12.6 |
Post-Treatment | 14.48 | 18.2 |
Linear regression | Number of obs | - | 20 | |||
F(3, 16) | - | 4.63 | ||||
Prob > F | - | 0.0163 | ||||
R-squared | - | 0.5951 | ||||
Root MSE | - | 1.8926 | ||||
consumption | coefficient | Roust std. err. | t | p > |t| | [95% conf. interval] | |
d1 | 0.28 | 0.6822023 | 0.41 | 0.687 | −1.166204 | 1.726204 |
d2 | −1.6 | 1.183216 | −1.35 | 0.195 | −4.108306 | 0.9083058 |
TRET | 5.32 | 1.692749 | 3.14 | 0.006 | 1.731532 | 8.908468 |
_cons | 14.2 | 0.583095 | 24.35 | 0.000 | 12.96389 | 15.43611 |
YEARS | 2000 | 2001 | 2002 | 2003 | 2004 | 2005 | 2006 | 2007 |
---|---|---|---|---|---|---|---|---|
UNITS | Period 1 (no treatment Dit = 0) | Period 2 (treatment Dit = 0 ˄ Djt = 1) | ||||||
1 2 | Never treated | |||||||
3 | Not yet treated | Treated since 2003 until 2007 | ||||||
4 | Not yet treated | Treated since 2003 until 2007 | ||||||
5 | Not yet treated | Treated since 2003 until 2007 | ||||||
6 | Not yet treated | Treated since 2003 until 2007 |
YEARS UNITS | 2000 | 2001 | 2002 | 2003 | 2004 | 2005 | 2006 | 2007 |
---|---|---|---|---|---|---|---|---|
1 2 | Never treated | |||||||
3 | Not yet treated | Treated since 2002 | ||||||
4 | Not yet treated | Treated since 2003 | ||||||
5 | Not yet treated | Treated since 2004 | ||||||
6 | Not yet treated | Treated since 2005 |
Variables | Matched (1) | Matched (2) | Non-Matched (3) | Non-Matched (4) |
---|---|---|---|---|
Y | −0.061 *** (−2.807) | −0.029 ** (−2.073) | ||
R | −0.036 *** (−2.869) | −0.036 *** (−2.649) | ||
Y × R | −0.026 *** (−2.986) | −0.026 *** (−3.156) | 0.038 (0.704) | −0.026 (−1.186) |
_cons | −0.062 *** (−2.850) | −0.096 *** (−4.734) | −0.072 *** (−3.088) | −0.111 *** (−5.370) |
Province/Year fix effects | Yes | Yes | Yes | Yes |
A-R² | 0.196 | 0.008 | 0.170 | 0.001 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Bosco, B.P.; Maranzano, P. Vis Inertiae and Statistical Inference: A Review of Difference-in-Differences Methods Employed in Economics and Other Subjects. Econometrics 2025, 13, 38. https://doi.org/10.3390/econometrics13040038
Bosco BP, Maranzano P. Vis Inertiae and Statistical Inference: A Review of Difference-in-Differences Methods Employed in Economics and Other Subjects. Econometrics. 2025; 13(4):38. https://doi.org/10.3390/econometrics13040038
Chicago/Turabian StyleBosco, Bruno Paolo, and Paolo Maranzano. 2025. "Vis Inertiae and Statistical Inference: A Review of Difference-in-Differences Methods Employed in Economics and Other Subjects" Econometrics 13, no. 4: 38. https://doi.org/10.3390/econometrics13040038
APA StyleBosco, B. P., & Maranzano, P. (2025). Vis Inertiae and Statistical Inference: A Review of Difference-in-Differences Methods Employed in Economics and Other Subjects. Econometrics, 13(4), 38. https://doi.org/10.3390/econometrics13040038