Policy Shocks and Public Attention to Digital Tax in Greece: Event-Study and Nowcasting with Google Trends Time Series

Balaskas, Stefanos

doi:10.3390/accountaudit2020006

Open AccessArticle

Policy Shocks and Public Attention to Digital Tax in Greece: Event-Study and Nowcasting with Google Trends Time Series

by

Stefanos Balaskas

eGovernment & eCommerce Lab (Innovation & Entrepreneurship), Department of Business Administration, University of Patras, 26504 Patras, Greece

Account. Audit. 2026, 2(2), 6; https://doi.org/10.3390/accountaudit2020006

Submission received: 1 December 2025 / Revised: 13 March 2026 / Accepted: 26 March 2026 / Published: 2 April 2026

Download

Browse Figures

Versions Notes

Abstract

Digital tax reforms are implemented through staged, publicly announced milestones, yet policymakers rarely have timely indicators of whether these signals mobilize information-seeking and whether such demand can be anticipated for operational planning. We analyze monthly Google Trends series for Greece’s myDATA/e-invoicing rollout (2016–present) using preregistered event study models that separate step changes from post-event trend shifts with HAC-robust inference, and we evaluate 1–3-month predictive performance via rolling-origin cross-validation against a seasonal-naïve benchmark. Search-based attention shifts appeared most clearly in application-related queries: invoicing app terms spike around visible rollout phases (≈+34 to +38 index points over six months) and decline around VAT–myDATA alignment (≈−34 to −43). Ecosystem attention (the “Electronic invoicing” topic) exhibits large, opposite-signed movements (≈−53 around public-sector expansion; ≈+46 around VAT alignment), whereas platform terms show smaller and less regular responses; a back-office milestone produces no detectable change. In out-of-sample tests, event-aware regressions improve short-horizon accuracy for platform terms (≈40–50% MAE reduction at one month; ≈18–32% at two to three months), with series- and horizon-dependent results elsewhere. Overall, the evidence supports using search activity as an intermediate planning signal—informative about when and where guidance demand concentrates but not evidence of compliance.

Keywords:

Google Trends; e-invoicing; myDATA; AADE; digital government; policy communication; event study; nowcasting; public attention; agenda setting; time series forecasting; Greece

1. Introduction

Digitalization of tax administration promises efficiency and transparency, but success depends on widespread user take-up and sustained use [1,2,3]. Governments are therefore expanding e-invoicing and electronic book-keeping; in Greece, the Independent Authority for Public Revenue (AADE) rolled out myDATA as national digital tax-reporting infrastructure. For accounting practice, such systems do not merely digitize communication with the tax authority: they reconfigure routine processes of invoice issuance, transaction recording, and periodic reporting, with downstream implications for reporting timeliness, error rates, and the evidentiary trail that supports audit and enforcement. Implementation is often constrained by information gaps, uneven readiness across firms, and operational strain around deadlines. Evidence from the UK’s Making Tax Digital suggests that early awareness can be limited—surveys found that many small firms initially did not know about new digital record-keeping obligations [4,5,6,7]. When knowledge and training lag behind rollout schedules, agencies face late rush-to-compliance dynamics and spikes in support demand, conditions under which misconfiguration and reporting errors become more likely [4,5,6,7,8].

Public attention is therefore not just a by-product of implementation but a mechanism shaping adoption and compliance outcomes. Behavioral public administration and applied economics show that salience and timely communications can alter citizen responses and uptake, including via low-cost informational interventions [9,10,11,12]. Field evidence suggests that reminders and peer-use cues can substantially increase adoption among initially reluctant users [13,14,15,16,17,18]. However, showing that “announcements matter” is not sufficient: for practice and theory, the key questions are which rollout milestones shift attention, whether changes are abrupt versus gradual, and whether attention differs across parts of a digital compliance ecosystem [18,19,20].

Digital trace data provides a practical way to observe these dynamics at scale. Google Trends has been widely used as a proxy for public awareness and issue salience [20,21], and search activity often responds strongly to pre-announced deadlines and policy shocks [21,22,23,24]. In the myDATA context, the goal is not merely to document spikes but to interpret attention as a measurable intermediate signal in the accounting workflow: increased searching plausibly reflects information acquisition and task initiation (e.g., registration, software choice, configuration, issuance rules, submission procedures) that precede—though do not guarantee—changes in actual reporting behavior and compliance quality. This perspective matters for accounting and auditing because attention peaks can coincide with periods of heightened process change and learning, when the risk of late reporting, corrections, or inconsistent record-keeping may increase.

A second substantive question is what users attend to when interest rises. myDATA is an ecosystem comprising the AADE portal, the “Timologio” application, third-party solutions, and broader e-invoicing compliance concepts. We therefore expect distinct “families” of search terms—platform/authority, application/tool, and ecosystem/compliance—to respond differently to the same milestone, revealing where information needs and friction concentrate [25,26].

This paper targets a gap at the intersection of digital tax infrastructure, accounting process change, and forecasting with digital traces. Prior e-government work largely studies adoption drivers and interventions, while time series research shows that Google Trends can track—and sometimes help predict—collective information demand. These strands are rarely integrated in a staged national rollout where bookkeeping routines, invoicing workflows, and compliance documentation are being reconfigured in real time.

We contribute a preregistered, fully reproducible event study of Greece’s myDATA/e-invoicing rollout that (i) estimates attention shifts around prespecified milestones using an interpretable step/ramp coding with HAC-robust inference and false-discovery control; (ii) compares responses across query families that map to distinct interfaces of the ecosystem (platform, app, and broader e-invoicing); and (iii) evaluates whether policy calendar features improve 1–3-month nowcasts relative to seasonality-only benchmarks, with bounded operational relevance for timing guidance and sizing support capacity. Crucially, search attention is treated as an intermediate signal of information frictions, not a policy success outcome: we do not observe filing behavior, platform usage, or audit findings. Instead, we use attention dynamics to indicate when and where implementation pressures are most likely to surface, motivating linkage to administrative metrics in future work.

2. Literature Review and Related Work

The most critical query and problem is: why strike search attention? When governments introduce technical compliance requirements, affected stakeholders often seek guidance immediately—via accountants, vendors, professional networks, and the web—making search activity a timely proxy for information demand [14,15]. Although searches are not compliance, they can function as an operational leading signal: spikes in queries about deadlines or procedures often precede surges in helpdesk contacts, onboarding activity, and “last-minute” scrambling that appear later in administrative data. Because attention is typically brief and volatile, monitoring searches offers a practical way to assess whether milestone communications are reaching the public during the narrow window when guidance is most likely to be absorbed [23].

This attention perspective aligns with agenda-setting and issue-attention theories. Political communication research emphasizes that media and institutional cues shape what people attend to—even if they do not determine what people believe [27,28,29,30]. Yet salience rarely persists: Downs’ “issue-attention cycle” proposes that public interest rises sharply and then fades as novelty dissipates and costs of sustained engagement emerge [31]. In digital government contexts, announcements, deadlines, and mandate phases can play a similar role to “news events,” briefly redirecting limited attention toward action-proximal questions. Our study adopts this lens by treating prespecified myDATA milestones (e.g., go-live, phased mandates, harmonization steps) as salient cues and testing whether they generate short-run bursts and medium-run shifts in information-seeking [32].

A large cross-disciplinary literature supports the use of search data as an economic and behavioral indicator, particularly for short-horizon monitoring and prediction [33,34,35]. Incorporating Google Trends has improved nowcasts of diverse outcomes, and classic work shows that informative search frequencies can enhance prediction of contemporaneous indicators beyond models using only lagged official data [21]. Search activity can also lead behavior: web queries have been shown to forecast consumer demand in several settings, consistent with “revealed interest” preceding action [24,25,26,27,28]. At the same time, well-known failures such as Google Flu Trends highlight the risks of over-interpretation: media amplification, shifting search behavior, and model instability can generate spurious signals or exaggerated effects [13]. For this reason, our design emphasizes preregistration, prespecified event timing and functional forms, and triangulation across multiple query families to reduce sensitivity to idiosyncratic spikes [36].

Our substantive context overlaps with the growing literature on VAT digitization and mandatory e-invoicing systems (e.g., Italy’s SdI, SAF-T implementations, the UK’s Making Tax Digital), which primarily evaluates compliance, revenue, and productivity effects [18]. This work often documents positive fiscal impacts and improved record-keeping, alongside transition costs. However, it typically provides limited evidence on the pre-compliance stage—public awareness, information gaps, and communication dynamics—despite their importance for successful adoption. We contribute by focusing on this earlier mechanism, i.e., whether and when target populations exhibit measurable information demand around rollout milestones, as a complement to downstream outcome evaluations.

Methodologically, we draw on interrupted time series and event study approaches, where interventions are modeled as level shifts and/or changes in slope [26]. This distinction parallels “pulse” versus “carryover” reasoning in applied settings: some events plausibly induce abrupt jumps in attention (e.g., go-live announcements), whereas others alter trajectories more gradually (e.g., harmonization steps or phased expansion). Our contribution is to apply this logic in a preregistered manner—coding the timing and shape of interventions ex ante rather than searching for breaks post hoc—thereby limiting researcher degrees of freedom and supporting interpretable estimates of medium-run impacts [37].

A practical motivation is whether these event-conditioned signals add value for short-horizon planning. Forecasting research emphasizes that simple seasonal baselines are difficult to beat at 1–3-month horizons and that improvements are often modest unless structure is well specified [7,23]. We therefore prioritize interpretability: we compare a seasonal-naïve benchmark to transparent regression-based models that incorporate prespecified policy timing as step/ramp indicators, yielding directly communicable predictions in Google Trends units [4]. Auxiliary automated methods are treated as robustness checks and are separated from the main analysis.

Overall, this study contributes to research at the intersection of public attention and digital tax infrastructure in four ways. First, it provides a preregistered event study of search attention around a national e-invoicing rollout (Greece’s myDATA) using a prespecified set of milestones. Second, it quantifies effects in interpretable units (SVI points) and contrasts patterns across platform, app, and ecosystem query families. Third, it evaluates whether incorporating policy calendars improves short-horizon nowcasts relative to a seasonal baseline, addressing an operational question relevant to support planning. Fourth, it emphasizes transparency and reproducibility by releasing code, queries, and analysis outputs to facilitate replication and extension.

Background: myDATA and E Invoicing in Greece

Greece’s myDATA (“my Digital Accounting & Tax Application”) is AADE’s digital tax-reporting infrastructure that operationalizes electronic bookkeeping (“e-books”) and supports e-invoicing. The reform aims to standardize and automate reporting, increase transparency, and reduce evasion by providing businesses and intermediaries (accountants, software providers) with a unified reporting interface. Because adoption unfolded through staged mandates and technical harmonization, information demand is expected to arrive in waves—rising around visible deadlines and mandate expansions and receding as workflows routinize.

We therefore preregister six rollout milestones that plausibly shift information demand and/or use:

(1): myDATA production go-live (2021-10) [8];
(2): B2G Phase 1 (2023-09), initiating compulsory e-invoicing for central government bodies [15];
(3): Central administration full coverage (2024-01), modeled as a step-change [26];
(4): VAT–myDATA alignment (2024-01), modeled as a slope-change reflecting gradual workflow convergence [25];
(5): B2G extension to the rest of the public sector (2024-06) [25];
(6): EU authorization for a domestic B2B mandate (2025-03) [16]. These are the public “beats” of implementation (press releases, circulars, deadlines) that should generate detectable shifts in information-seeking if they are salient.

To track attention, we use Google search interest and group queries into three families that map onto distinct points of interaction with the reform: platform terms (A; AADE/myDATA access points, e.g., aade, ααδε), application terms (C; invoicing app queries, e.g., timologio/τιμολόγιο), and ecosystem terms (D; broader e-invoicing topics/standards, e.g., the “Electronic invoicing” topic). This structure allows us to test not only whether attention moves at milestones but where it concentrates—on the official platform, the front-end invoicing tool, or the wider compliance ecosystem. Guided by this context, we ask the following:

RQ1:

Do pre-specified myDATA- and e-invoicing-related events (such as announcements, deadlines, or system updates) cause discernible shifts in public attention as measured by Google search trends?

RQ2:

Can an event-aware structural model (one that includes features for these communications or policy events) improve short-horizon nowcasts of public interest compared to baseline models that capture only regular seasonal patterns?

RQ3:

Which families of search terms show the most significant movements in response to the events, for example, are people searching more about the official platform and its use, the companion app, or the broader ecosystem and compliance requirements?

Figure 1 summarizes the implementation timeline from 2016 to the latest month and marks the six preregistered milestones; the two January 2024 events are shown as distinct step versus slope interventions to avoid conflation and to provide a consistent reference for the results figures.

3. Data and Variable Construction

3.1. Data Source, Scope, Search Terms and Families

We use the monthly Google Trends (GT) search volume index (SVI) for Greece (geo = GR), which reports relative search interest on a 0–100 scale. The main analysis window is 2016-01 through the latest available month, ensuring a long pre-period before the first milestone and a balanced post-period across subsequent events; the full 2004+ history is used only for prespecified robustness checks (e.g., alternative seasonality and placebo tests). Following GT conventions, “<1” values are recoded to 0.5 to retain low-level variation and avoid undefined log transforms in robustness analyses. We retain the native 0–100 scale (rather than standardizing) because effects are directly interpretable in GT points and the bounded scale limits extreme influence [4,9]. For long-horizon downloads, we apply a standard stitching procedure: overlapping monthly windows are rescaled using median overlap ratios and then concatenated to preserve within-country relative levels. All code and query specifications are preregistered and released with the replication package. We include “Electronic invoicing (topic)” (D1) because it captures broad ecosystem-level attention beyond any single spelling/term. Since topic series can behave differently than term series, we report a sensitivity re-estimation using the closest term-based alternatives (e.g., “e-invoicing”) and show whether D-family conclusions are stable.

A (platform): brand/institutional terms linked to AADE/myDATA (incl. aade, ααδε, mydata).
C (app): application/”timologio” terms reflecting invoicing app searches (incl. τιμολόγιο, timologio, ηλεκτρονικό τιμολόγιο).
D (ecosystem): broad/e-invoicing topics and standards (incl. Electronic invoicing (topic), e-invoicing, peppol).

Greek diacritics and script variants are kept constant by term; near-duplicates (e.g., monotonic vs. polytonic variants) are addressed by choosing a single canonical query. Five salience-in-mind priority results for reporting and forecasting are aade (A2), ααδε (A3), τιμολόγιο (C1), timologio (C2), and Electronic invoicing (topic) (D1). The other terms power robustness analyses and a composite index.

3.2. Construction of Outcomes, Design Matrix and Event Indicators

All series are indexed at a monthly frequency (MS) on a shared calendar [33]. GT partial months are truncated to avoid look-ahead until finalized. Missing values are rare; when they occur, we treat them as the reported SVI (with the 0.5 convention for “<1”). The baseline specification does not log-transform outcomes, so effects remain interpretable in GT points; log-scale models (percent effects) are reported as robustness. For secular drift, (ii) month-of-year fixed effects (February–December; January omitted) for deterministic seasonality are used; and (iii) two COVID pulses (2020-03 and 2020-04) to absorb the abrupt pandemic shock without imposing a permanent break are used. We exclude a quarter_end indicator from the baseline because it is redundant with month fixed effects and adds little incremental explanatory power in preliminary diagnostics; it is retained in robustness. This parsimonious specification is preregistered and applied uniformly across series [4,8,22].

Each policy milestone is encoded ex ante as a step and/or slope component. For event e at date

T_{e}

, the level shift is

S_{t}^{(e)} = 1 {t \geq T_{e}}

, and the post-event ramp is

P O S T_{t}^{(e)} = m a x (0, t - T_{e})

in months.

We prespecify six milestones: myDATA go-live (2021-10; step + slope), B2G Phase 1 (2023-09; step + slope; robustness allows lag +1), central administration full (2024-01; step-only), VAT–myDATA alignment (2024-01; slope-only), B2G rest-of-public (2024-06; step + slope), and EU B2B authorization (2025-03; step + slope). The January 2024 split (central administration as step-only; VAT–myDATA alignment as slope-only) reflects distinct implementation logic, i.e., coverage completion versus phased process harmonization.

3.3. Identification and Preregistration

3.3.1. Identification Strategy

We exploit publicly announced myDATA/e-invoicing milestones and encode them ex ante as step and/or post-event slope terms [4,9]. Identification is the within-series, event-timed association around those dates, conditional on a centered trend, deterministic seasonality, and short COVID pulses [4,9]. Milestone timing is largely predetermined by administrative/regulatory sequencing (go-live, phased mandates, harmonization steps), which motivates treating the calendar as fixed from the analyst’s perspective; however, the design cannot exclude time-varying confounds that may co-move with these milestones (e.g., media cycles, vendor campaigns, enforcement signals, or concurrent guidance). We therefore interpret coefficients as shifts in search attention coincident with milestones, not causal effects on compliance or adoption outcomes.

For each outcome series

y_{t}

(monthly SVI), we estimate:

y_{t} = α + γ t_{c} + \sum_{m = 2}^{12} δ_{m} 1 {month = m} + \sum_{e} (β_{S, e} S_{t}^{(e)} + β_{P, e} P O S T_{t}^{(e)}) + \sum_{p \in {2020 - 03, 2020 - 04}} π_{p} 1 {t = p} + u_{t}

Primary estimation uses OLS with Newey–West HAC (6) standard errors. We summarize each event’s medium-run association at horizon

T

months as

Δ_{T} = β_{S, e} + T β_{P, e}

and treat

T = 6

as the preregistered main estimand. To avoid anchoring interpretation on a single horizon, we report

Δ_{3}

,

Δ_{6}

, and

Δ_{12}

side-by-side. p-values are adjusted using Benjamini–Hochberg’s FDR within each series across events; comparisons across series/families are presented as descriptive unless additionally adjusted [4,9].

3.3.2. Robustness and Preregistration

The study was preregistered to document the event calendar, coding rules (including the January 2024 split), estimands, and planned sensitivity checks. Preregistration reduces researcher discretion and supports reproducibility, but it is not a substitute for model evaluation; we therefore treat robustness and diagnostic checks as central to interpretation.

We implement placebo tests by shifting the full event calendar backward by −12, −18, −24, and −30 months and re-estimating

Δ_{6}

under the identical specification. Placebo performance is summarized by (i) the share of placebo

Δ_{6}

estimates significant after BH–FDR within series and (ii) the empirical distribution of placebo

Δ_{6}

relative to the true-timing

Δ_{6}

for each event and series. Non-trivial placebo significance is treated as a threat—consistent with residual structured variation not captured by baseline controls—rather than as confirmatory evidence [32,33]. Where placebo false positives persist, we add a sensitivity specification that strengthens seasonal structure using harmonic (Fourier) terms and report whether the main-timing estimates remain separated from placebo distributions (Appendix A and Table 1).

4. Methods

4.1. RQ1: Event-Study OLS

We estimate event-driven changes in monthly Google Trends attention using a transparent linear specification with calendar controls and preregistered policy dummies. For each outcome series yt (SVI on the native 0–100 scale) indexed by month t, we fit:

\begin{matrix} y_{t} = α + γ t_{c} + \sum_{m = 2}^{12} δ_{m} 1 {m o n t h = m} + \sum_{e} (β_{S, e} S_{t}^{(e)} + β_{P, e} P O S T_{t}^{(e)}) + \\ \sum_{p \in {2020 - 03, 2020 - 04}} π_{p} 1 {t = p} + u_{t} \end{matrix}

The centered linear trend

t_{c}

captures secular drift; month fixed effects (February–December; January omitted) capture seasonality, and two one-month pulses absorb the abrupt COVID shock (4 March 2020). Policy milestones are encoded ex ante as step indicators

S_{t}^{(e)} = 1 {t \geq T_{e}}

and/or post-event ramps

P O S T_{t}^{(e)} = m a x (0, t - T_{e})

in months. The baseline excludes quarter_end to avoid redundancy with month fixed effects.

Primary estimation uses OLS with Newey–West HAC (6) standard errors (preregistered). For interpretation, we summarize each event’s impact at horizon

T \in {3, 6, 12}

months via the linear combination:

Δ_{T} \equiv β_{S, e} + T β_{P, e}

and report HAC-robust 95% confidence intervals computed from the robust covariance of

(β_{S, e}, β_{P, e})

. To address multiplicity within each outcome series, we apply Benjamini–Hochberg’s FDR adjustment to event-level p-values. Our primary estimand is

Δ_{6}

because it provides an operationally meaningful medium-run window while remaining close to the policy period.

Serial dependence is assessed using Ljung–Box and ACF/PACF diagnostics; when residual autocorrelation is substantial, we report an AR (1) error variant (SARIMAX with identical exogenous regressors) as a robustness check in the Appendix A. All top-line inferences are based on the preregistered OLS + HAC specification for transparency and comparability across series. Our preregistered confirmatory claims center on the five priority outcomes × six events. For these “main-claims” tests, we report both (i) BH–FDR within series across events and (ii) a pooled BH–FDR across the full priority family (5 × 6) to align inference with cross-series narrative comparisons. Results outside the priority family are treated as secondary/exploratory and are described without strong inferential language. January 2024 is encoded as two distinct interventions because they represent conceptually different mechanisms: a back-office coverage completion (step-only) versus a workflow harmonization process expected to accumulate gradually (slope-only). This split was preregistered to avoid post hoc tailoring.

4.2. RQ2: Nowcasting Design

We evaluate short-horizon predictive value using blocked rolling-origin cross-validation that mimics real-time deployment. Origins start in 2018-01 and advance in 6-month increments; at each origin, models are trained on up to the previous 48 months and forecast

h \in {1, 2, 3}

months ahead [4,9,16]. All features are deterministic functions of calendar time and prespecified policy dates, ensuring strict no-leakage: design matrices for

t + 1, \dots, t + h

are constructed using only trend continuation, month indicators, COVID pulses, and event step/ramp rules fixed ex ante. We compare three transparent forecasters:

SNAIVE (12): ${\hat{y}}_{t + h} = y_{t + h - 12}$ .
OLS + events: the structural regression with trend, month fixed effects, COVID pulses, and prespecified event indicators.
OLS + events + AR (1): the same exogenous specification estimated with AR (1) errors (SARIMAX) to capture residual autocorrelation.

Forecast accuracy is summarized by MAE and RMSE (primary), with sMAPE/MASE reported for completeness; we also report percentage MAE improvement relative to SNAIVE (12). Statistical comparisons against SNAIVE (12) are conducted using Diebold–Mariano tests with absolute error loss (series × horizon), reported concisely.

Prediction intervals (80% and 95%) are constructed from out-of-sample residual dispersion (Gaussian bands) with a seasonal block bootstrap used as a robustness option to respect monthly dependence. Additional baselines, blends, and extended diagnostic plots are reported in the Appendix A to preserve readability.

Auxiliary machine learning (ML) models are treated as exploratory robustness and are reported in the Appendix A. Specifically, we test low-capacity global residual learners that predict

r_{t + h} = y_{t + h} - y_{t + h - 12}

using deterministic calendar/event features (trend, month/quarter harmonics, event step/ramp indicators, COVID pulses) and then add predicted residuals back to SNAIVE (12). These models are intentionally regularized (shallow trees/small MLP with early stopping) to limit overfitting in short panels; we only highlight ML results when they match or exceed the best structural model consistently across rolling splits for a given (series, horizon). The structural models remain the default due to interpretability and replicability. Forecast value is evaluated series-by-series and horizon-by-horizon; we report instances where the event model underperforms SNAIVE (12) as failures, not exceptions. We quantify uncertainty in MAE differences using a paired bootstrap across forecast origins, and we report empirical PI coverage for nominal 80%/95% intervals.

4.3. RQ3: Which Families Move?

RQ3 summarizes heterogeneous attention responses across three query families—platform (A), app (C), and ecosystem (D)—using the RQ1 event study estimand. Family anchors are the five priority outcomes: platform

= {A 2, A 3}

, app

= {C 1, C 2}

, ecosystem

= {D 1}

. We classify “movement” for each (event, series) using the sign and BH-FDR significance of the primary medium-run estimand

Δ_{6}

:

▲ if $Δ_{6} > 0$ and BH-FDR $p \leq 0.05$ ;
▼ if $Δ_{6} < 0$ and BH-FDR $p \leq 0.05$ ;
○ otherwise.

To distinguish abrupt versus gradual responses, we append “(S)” when either the step component

β_{S, e}

or slope component

β_{P, e}

is significant for BH-FDR even if

Δ_{6}

is marginal, indicating the dominant driver (level vs. ramp). Family-level summaries aggregate these classifications across the relevant anchors (A2–A3, C1–C2, and D1).

As a family-agnostic summary, we construct a composite attention index as the mean of z-scored priority series (A2, A3, C1, C2, D1); PCA-1 is used as a robustness alternative. We re-estimate the same event study model on the composite and apply the same

Δ_{6}

and BH-FDR decision rules. Planned sensitivity checks mirror the preregistered robustness set: HAC (12), event lags (+1/+2), log-scale outcomes, STL-deseasoned outcomes, and placebo events shifted −24 months; where AR dependence is strong, the AR (1) error variant is reported as a stability check in the Appendix A.

5. Data Analysis and Results

5.1. RQ1—Event Impacts

Several preregistered milestones coincide with substantial medium-run shifts in search attention, with multiple effects surviving BH–FDR adjustment (Table 2). We focus on

Δ_{6}

in SVI points (0–100) as the primary effect scale; percent-of-baseline values (Table 3) are provided only to contextualize magnitude across series with different baselines and can be mechanically large when baseline SVI is low.

Two milestones show the clearest and most systematic signatures. First, the B2G rest-of-public expansion is associated with a sharp decline in ecosystem attention and a simultaneous rise in an app query: Electronic invoicing (D1) decreases by

Δ_{6} = - 52.7

SVI (q < 0.001), while τιμολόγιο (C1) increases by

Δ_{6} = + 29.7

(q < 0.001). Second, VAT–myDATA alignment shows a strong reallocation across families: D1 increases by

Δ_{6} = + 46.4

(q < 0.01), while app queries decline (C2

Δ_{6} = - 34.3

, q < 0.01; C1

Δ_{6} = - 43.1

, q < 0.05).

Earlier rollout milestones primarily affect app terms. myDATA go-live is linked to a large increase in timologio (C2) (

Δ_{6} = + 38.4

, q < 0.001), with a smaller increase in

α α δ ϵ

(A3) (

Δ_{6} = + 15.1

, q < 0.05). B2G Phase 1 is associated with increases in both app series (C1

Δ_{6} = + 33.6

, q < 0.01; C2

Δ_{6} = + 23.8

, q < 0.01). In contrast, the back-office “central administration full” milestone shows no detectable shifts across series, consistent with low public salience. EU B2B authorization yields small and mostly non-significant changes, with a modest positive effect for C1 (

Δ_{6} = + 8.2

, q < 0.05) (Table 2).

Taken together, the pattern is heterogeneous by query family and rollout phase. The ecosystem topic (D1) exhibits the largest opposite-signed movements (down after B2G rest-of-public; up at VAT–myDATA alignment), while app queries show the clearest “rollout spike” signature at launch-type milestones (go-live; Phase 1) followed by declines at harmonization (VAT–myDATA alignment). Platform terms (A2/A3) shift more modestly and less consistently, suggesting broader and more diffuse search intent.

Figure 2 and Figure 3 visualize these results with HAC-robust 95% confidence intervals; estimates whose intervals exclude zero correspond to entries significant for BH–FDR in Table 2. Supplementary Tables S1–S10 report the underlying step and slope components and confirm that the headline

Δ_{6}

findings are driven by persistent post-event dynamics (not isolated one-month spikes) for the main ecosystem effects.

Robustness and Sensitivity Checks

The headline RQ1 conclusions are stable across preregistered robustness dimensions. Using HAC (12) instead of HAC (6) leaves signs and key inferences unchanged: D1 remains strongly negative after B2G rest-of-public and strongly positive at VAT–myDATA alignment; app terms remain positive at go-live/Phase 1 and negative at VAT alignment; platform responses remain smaller and less regular (Table 4). Shifting event timing forward by +1/+2 months strengthens fit for some milestones (notably ecosystem responses around VAT alignment and B2G expansion), consistent with short implementation/awareness lags. Log-scale estimates preserve the same directional patterns but can imply very large percentage changes for low-baseline periods; we therefore treat percentages as contextual only (Table 5). STL deseasoning produces the same marquee signs, and placebo events shifted −24 months yield no consistent family-by-event pattern, supporting interpretation as event-timed shifts rather than generic seasonality or drift (Table 6).

5.2. RQ2—Nowcasting Skill

We assess out-of-sample nowcasting with blocked rolling-origin cross-validation (origins every 6 months from 2018; max training window 48 months; horizons

h = 1, 2, 3

). Performance is summarized by MAE (Figure 4) and percentage MAE improvement in OLS + events over a seasonal-naïve benchmark, SNAIVE (12) (Figure 5).

Two patterns emerge. First, platform queries (A2/A3) benefit most from adding trend, seasonality, and preregistered event indicators. OLS + events reduces MAE by about 40–50% at

h = 1

and 18–32% at h = 2–3 relative to SNAIVE (12), and it is the lowest-MAE model for both platform series across horizons (Table 7). Diebold–Mariano tests indicate a clear advantage for A3 at

h = 1

(t = −2.41, p = 0.034) and a marginal advantage for A2 at

h = 1

(t = −2.09, p = 0.061), with remaining contrasts not statistically distinguishable given the small number of origins (Table 8).

Second, gains for app and ecosystem queries are horizon-dependent. For C2 (timologio), OLS + events yields a small improvement at

h = 1

(~1–2%) but larger gains at h = 2–3 (~14–15%), consistent with event structure becoming informative once beyond month-to-month noise. For C1 (τιμολόγιο), OLS + events is slightly better at

h = 1

(~4–5%), while SNAIVE (12) wins at h = 2–3, indicating stronger annual recurrence in that spelling variant. For the ecosystem topic D1, SNAIVE (12) dominates at

h = 1

, whereas OLS + events overtakes at h = 2–3 (~4% and ~13% improvement), consistent with D1’s larger, slower-moving event effects and high short-run volatility. Overall, event-aware structure improves short-horizon forecasts where attention shifts are relatively persistent and policy-timed (especially platform terms), while pure seasonality remains difficult to beat for series with strong annual recurrence (C1) and at the very shortest horizon for a volatile ecosystem series (D1 at

h = 1

). Adding short-memory dynamics (AR terms) and simple blends narrows some

h = 1

gaps in auxiliary comparisons, but the central result remains: preregistered event coding carries forecasting value that is both series- and horizon-specific. Figure 6 provides an illustrative nowcast example (A3) with 95% prediction intervals; interval width increases with horizon, as expected.

5.2.1. Rolling-Origin CV

We evaluate out-of-sample accuracy using the preregistered blocked rolling-origin design (origins from 2018-01 every 6 months; max training window 48 months; horizons

h = 1, 2, 3

). Across series, OLS + events is the top-performing structural model for the platform queries (A2, A3) and for C2, while SNAIVE (12) remains competitive for C1 at longer horizons and for D1 at

h = 1

. SARIMAX + events underperforms across series and horizons (Appendix A Table A5, Table A6 and Table A7).

Platform gains are large and consistent. For A2, OLS + events reduces MAE by ~40% at

h = 1

(3.83 vs. 6.42), ~18% at

h = 2

(6.38 vs. 7.81), and ~17% at

h = 3

(6.36 vs. 7.63). For A3, improvements are larger: ~50% at

h = 1

(6.04 vs. 11.96), ~29% at

h = 2

(8.59 vs. 12.10), and ~32% at

h = 3

(8.72 vs. 12.76).

For app and ecosystem terms, gains depend on horizon. C2 improves modestly at

h = 1

(~1.5%; 21.35 vs. 21.67) but more at

h = 2 - 3

(~14–15%; 16.21 vs. 18.83; 14.35 vs. 16.94). C1 is strongly seasonal: OLS + events is slightly better at

h = 1

(~4.5%; 7.88 vs. 8.25), while SNAIVE (12) wins at

h = 2 - 3

(6.83 and 5.44). For D1, SNAIVE (12) is best at

h = 1

(6.33 vs. 9.30), whereas OLS + events becomes better at

h = 2 - 3

(~4% and ~13%; 7.79 vs. 8.13; 9.08 vs. 10.42). RMSE mirrors MAE (Figure 4 and Figure 5).

Figure 7 plots one-step-ahead backtest paths and shows that errors concentrate around turning points and event windows—precisely where event indicators add most predictive value.

5.2.2. Forecast Comparison

Extending beyond the two-model comparison, simple hybrids that combine seasonal persistence with event structure (and, where helpful, a light AR term) are typically the most robust across series–horizon cells. In practice, blends reduce variance relative to a single model while preserving the main gains identified above: large improvements for platform queries, moderate improvements for C2, and limited scope for improvement where annual recurrence dominates (C1 at

h = 3

) or where short-run volatility is high (D1 at

h = 1

). SARIMAX-style specifications remain dominated in this setting and are therefore not emphasized.

Prediction intervals derived from rolling-origin residual dispersion show reasonable near-term calibration for operational nowcasting (coverage typically moderate in short samples), with wider uncertainty around volatile series and near event windows. Figure 8 and Figure 9 illustrate representative nowcasts and prediction cones for the platform series.

5.3. RQ3—Which Families Move?

BH–FDR-screened

Δ_{6}

classifications show that systematic movement concentrates in the application (“app”) queries (family C). Across 12 app-family event × series cells, 8 are significant (67%). τιμολόγιο (C1) moves at five out of six milestones—up at myDATA go-live and B2G Phase 1, up again at B2G rest of public and EU B2B authorization, and down at VAT–myDATA alignment. timologio (C2) moves at three out of six milestones—up at go-live and Phase 1 and down at VAT alignment. The direction is event-coherent: launch-like milestones are followed by positive app attention, whereas harmonization (VAT alignment) is followed by negative ramps, consistent with declining “how-to app” search once workflows stabilize.

Ecosystem attention (D1) is selective but large. Only two out of six cells (33%) are BH-significant, but both correspond to the largest directional ecosystem shifts: up at VAT–myDATA alignment and down at B2G rest-of-public. Platform terms (family A) are weakest and least systematic: 2/12 cells (17%) are significant (A3 rises at go-live; A2 falls at B2G rest-of-public). No series moves at “central administration full,” consistent with a back-office milestone with low public salience. Mechanistically, rollouts appear more “level/step-like” for app searches, while harmonization/coverage changes manifest as slopes/ramps (e.g., D1 up at alignment; C1/C2 down thereafter). These family patterns mirror RQ1’s

Δ_{6}

ordering and help explain RQ2: event-augmented models add most value where movement is systematic (platform and C2), while season-only baselines remain competitive where dynamics are dominated by recurrence (notably C1 at longer horizons). Table 9 summarizes the BH–FDR movement grid.

5.3.1. Rank by |Δ6| for Each Series

Ranking events by

∣ Δ_{6} ∣

within each series reinforces the family profile. App queries dominate the upper tail: C1’s largest movements occur at VAT alignment (−43.1) and B2G Phase 1 (+33.6), with a further large rise at B2G rest-of-public (+29.7); C2 peaks at go-live (+38.4) and falls sharply at VAT alignment (−34.3). Ecosystem D1 shows the single largest swings overall—−52.7 at B2G rest-of-public and +46.4 at VAT alignment—highlighting a strong but event-selective response. Platform effects are smaller (A3 notable at go-live; A2 notable at B2G rest-of-public), and central administration full remains near-neutral across series. Table 10 reports these ranks.

5.3.2. Composite Index Confirmation (Z-Mean Across A2/A3/C1/C2/D1)

A family-agnostic composite attention index (z-mean of A2/A3/C1/C2/D1) corroborates broad increases at myDATA go-live (

Δ_{6} \approx 0.52

,

p < 0.001

) and B2G Phase 1 (

Δ_{6} \approx 1.49

,

p \approx 0.002

), no effect at central administration full, and a moderate decline at VAT–myDATA alignment (

p \approx 0.035

). This aggregate pattern is consistent with the grid and ranks: rollouts lift app-centric attention, while subsequent harmonization diffuses or reverses attention in the composite. Table 11 reports the composite estimates.

The “top mover” character explains two earlier findings: (a) in RQ1, app terms have highest Δ₆ magnitudes at rollouts and roll-backs at harmonization; and (b) in RQ2, event-based models realize the most clear-cut gains on A/C families at near horizons—exactly where those families record most frequent and largest Δ₆ movements.

Based on the RQ3 grid, platform interest (A3) shifts at myDATA go-live (level + slope); the backtest is OLS + events to capture that step change and early ramp (e.g., 2022-02: actual = 65, OLS ≈ 63, SNAIVE (12) = 40). Subsequently, when attention returns to normal or surges above pre-set policy milestones, OLS + events will occasionally over- or under-shoot (e.g., 2022-08 and 2024-08), while SNAIVE (12) might pick up purely seasonally rebounding. Collectively, the path perspective addresses why event-enhanced models produce meaningful short-horizon returns on platform requests (RQ2) but diminishing edges once the shock of initial rollout has faded—echoing RQ3’s finding that platform movement is most pronounced at go-live with progressively less systematic movement at subsequent checkpoints.

In Figure 10, rolling-origin cross-validation (h = 1 month) between an event-augmented regression (“OLS + events”) and a seasonal-naïve baseline (“SNAIVE (12)”) based on the t–12 value is shown. Origins roll each half-year with a 48-month maximum training window; forecasts are graphed at issuance date. The panel indicates that OLS + events follows the post-go-live increasing trend more closely than SNAIVE (12) and more accurately forecasts the 2021–2022 platform interest ramp-up. Note, evaluation uses Newey–West HAC (6); for ααδε (A3) at h = 1, the average MAE for OLS + events is 6.04, compared to 11.96 for SNAIVE (12), and the Diebold–Mariano test favors OLS + events (t ≈ −2.41, p ≈ 0.034).

Across these 12 representative checkpoints, event-augmented regression is closer to the true SVI in 7/12 months (e.g., 2020-02: 22 vs. 20; 2022-02: 63.3 vs. 65; 2024-02: 68.7 vs. 78), whereas SNAIVE (12) is closer in 5/12 (particularly during GFC peak periods like 2021-08, 2022-08, 2024-08, and early 2025). This is in line with the RQ3 and overall RQ2 findings: near policy rollouts and ramps, OLS + events more closely approximates the level shifts and medium-run trend in platform interest, and distant from milestone shocks, seasonal reversion is useful for SNAIVE (12). In other words, platform searches react highly at go-live and early harmonization (picked up by the event model), but the rest are increasingly seasonal, where a seasonal-naïve base is a contender (Table 12).

6. Discussion

This study examined whether prespecified milestones in Greece’s myDATA/e-invoicing rollout were associated with changes in public search attention (RQ1), whether encoding those milestones improves short-horizon nowcasts of attention (RQ2), and which query families respond most (RQ3). The results are best interpreted as evidence about salience and information-seeking—an intermediate signal of user attention and potential onboarding friction—rather than as evidence of compliance, reporting accuracy, or policy success. Consistent with issue-attention perspectives, “front-stage” milestones tend to coincide with abrupt, short-lived shifts in task-oriented searches, whereas harmonization and coverage changes are more often reflected in gradual ramps or drawdowns.

6.1. Do Events Move Attention?

Across the preregistered milestones, the strongest and most consistent responses appear in the application (“app”) family. In particular, timologio (C2) rises at go-live (Δ₆ ≈ +38.4 SVI) and C1 rises at B2G Phase 1 (Δ₆ ≈ +33.6), while both fall at VAT–myDATA alignment (C1 Δ₆ ≈ −43.1; C2 Δ₆ ≈ −34.3). The ecosystem topic Electronic invoicing (D1) shows large, opposite-signed medium-run shifts—an increase around VAT–myDATA alignment (Δ₆ ≈ +46.4) and a decline after the B2G rest-of-public expansion (Δ₆ ≈ −52.7). Platform terms are more heterogeneous and often weaker: ααδε (A3) increases at go-live (Δ₆ ≈ +15.1) whereas aade (A2) declines after B2G rest-of-public (Δ₆ ≈ −22.2), with other effects small or indistinguishable from zero after FDR adjustment. Central administration full (Jan 2024) shows no measurable shift, consistent with a back-office milestone lacking a user-facing call to action.

Two interpretation boundaries are important. First, we prioritize Δ₆ in SVI points on the native 0–100 scale; this avoids the mechanical inflation that can arise when percentage changes are computed from low baselines. Percentage effects from log-scale robustness checks can appear extreme (e.g., ±300–500%) when pre-event SVI levels are near zero, even if the absolute change remains modest. Accordingly, we treat percentage changes as descriptive robustness and interpret them only alongside absolute SVI point shifts and baseline levels. Second, increased searching plausibly reflects information acquisition and task initiation (e.g., registration, software selection, configuration, issuance rules), but it does not establish uptake, compliance quality, or reporting accuracy.

Within these limits, the pattern of step-like jumps for app queries around launch-type milestones and slower ramps around harmonization is consistent with standard intervention logic (level versus slope responses) and with attention allocation arguments: salient announcements trigger short-lived, action-proximal “how-to” demand, while harmonization reshapes workflows and shifts attention more gradually toward ecosystem-level concerns [9,16,23]. For operations, the practical implication is not that the policy “succeeded” but that certain milestones predictably coincide with concentrated information demand in specific parts of the ecosystem.

6.2. Do Event-Aware Models Forecast Better?

Encoding the prespecified policy calendar improves short-horizon forecasts relative to a seasonality-only benchmark in several series, with the clearest gains for platform terms at

h = 1

and for app/ecosystem series at longer horizons. In rolling-origin validation, OLS + events reduces MAE versus SNAIVE (12) by roughly 40–50% at

h = 1

for platform series (A2/A3), with more variable gains at

h = 2 - 3

and horizon-dependent benefits for app and ecosystem outcomes. This pattern is consistent with a simple point: when variance is partly driven by interpretable, dated shocks (launches, phased mandates, harmonization), deterministic step/ramp indicators capture structure that seasonal repetition alone misses [12,20,23,28].

Horizon heterogeneity is informative rather than speculative. For example, timologio (C2) shows limited incremental value at

h = 1

but clearer gains at

h = 2 - 3

, consistent with the idea that medium-run ramps become predictable once month-to-month idiosyncrasy averages out. For D1, event terms add little at

h = 1

but become more useful at

h = 2 - 3

as slope components accumulate [9,16,30]. Importantly, these are forecasting improvements in attention, not proof of behavioral change; their value is operational: planning the timing of guidance, helpdesk capacity, and vendor coordination around known milestones. The results also align with the cautionary lesson from search-based forecasting critiques: structured, theory-consistent features can help, but idiosyncratic spikes and evolving search behavior limit one-size-fits-all gains.

6.3. Which Families Move?

The family ordering—app (C) strongest, ecosystem (D) second, platform (A) weakest—provides a compact summary of where attention concentrates during staged rollouts. App terms are closest to immediate tasks (issuing invoices, onboarding), so they exhibit sharper step-type responses around salient milestones; ecosystem terms tend to reflect rule changes, standards, and workflow reconfiguration, so they are more often expressed as ramps; platform/brand terms aggregate heterogeneous intents and are less diagnostic except at headline moments. The composite index corroborates that these dynamics are not driven by a single series [9,16,23].

Operationally, the implication is a sequencing logic rather than a success claim. Agencies can anticipate app-focused information demand around launch-type milestones (staffing and onboarding materials), while harmonization windows call for sustained ecosystem-oriented guidance (standards, procedures, vendor alignment). Conversely, purely administrative completions may remain low-salience and should not be expected to shift public attention without complementary communications.

Overall, the contribution is not that search attention equals compliance or performance but that a preregistered, transparent event-study design can (i) characterize heterogeneous attention responses across milestones and query families and (ii) yield modest but actionable improvements in short-horizon nowcasts of attention that support communication and support planning in digital tax rollouts [12,20,23,28]. Table 13 summarizes which conclusions are robust to the prespecified checks and where interpretation remains sensitive.

7. Practical Implications

This study treats Google Trends as an intermediate signal of salience and information-seeking around myDATA/e-invoicing milestones. The implications are therefore operational for communication timing, support capacity, and coordination rather than evidence of compliance or policy success.

7.1. For Policymakers and Tax Administrators

Launch-type milestones (e.g., go-live, early mandate phases) coincide with short-lived spikes in app-focused searches. Communications should therefore be concentrated immediately before and during these dates, using highly practical content such as checklists, step-by-step instructions, and common error fixes. Helpdesk capacity and overflow procedures should also be aligned with these windows. By contrast, harmonization milestones are associated with slower shifts, especially in ecosystem queries and are better supported through staggered guidance over several months, including rule clarification, edge cases, workflow updates, and vendor alignment [12,20,23,28]. Event-aware nowcasts and prediction intervals can support staffing calendars and service-level planning around visible milestones, whereas null effects for back-office milestones suggest limited need for front-line surge capacity unless additional communications are introduced. More broadly, a simple step-versus-slope lens can guide sequencing: pair rollout phases with app-centered calls to action and treat harmonization as a longer support window. Finally, Δ₆ (in SVI points) and the composite attention index can serve as lightweight monitoring signals alongside slower operational indicators, such as tickets or onboarding volumes, to detect when attention is lower than expected and outreach may need adjustment [12,20,23,28].

7.2. For Business Managers and Software Vendors

Vendors should prepare short, time-bounded increases in support and in-product guidance around launch-type milestones, especially for first-use tasks such as invoice issuance, corrections, and submission. This may include temporary increases in chat/support staffing, date-specific prompts, and short workflow-based micro-guides. Because ecosystem attention shifts more gradually during harmonization, vendors can schedule deeper materials such as integration guides, compliance tutorials, and API examples over the following months and align releases accordingly. Shared event-aware forecasts can also help vendors and public authorities coordinate webinars, sandbox windows, and change freezes during high-uncertainty periods, reducing conflicting signals to users. Since platform/brand searches are broader and less event-sensitive, clear signposting and plain-language navigation remain important, particularly for less digitally literate users who may not search using app-specific terms.

8. Conclusions, Limitations, and Future Directions

This study examined whether prespecified milestones in Greece’s myDATA/e-invoicing rollout were associated with shifts in public search attention (RQ1), whether encoding those dates improves short-horizon forecasts of attention (RQ2), and which query families respond most strongly (RQ3). Using preregistered step/ramp indicators on monthly Google Trends data (2016–present) with HAC-robust inference and BH–FDR adjustment, we find a consistent ordering: app queries respond most clearly around launch-type milestones, ecosystem attention shifts more gradually, and platform terms are smaller and less regular; the back-office “central administration full” milestone is near-neutral. Event-aware models also improve out-of-sample nowcasts relative to a seasonal-naïve benchmark for some series and horizons, with the clearest gains in selected short-horizon cases [12,20,23,28]. Overall, policy timing appears to structure information-seeking in measurable ways, while search attention remains an intermediate signal rather than a compliance outcome.

These findings should be interpreted with caution. Google Trends captures attention, not adoption, compliance, or audit outcomes. Large percentage changes can partly reflect low baselines, so we prioritize SVI point effects in interpretation. The design is observational, and time-varying confounds may still coincide with milestones despite controls for trend, seasonality, and COVID pulses. Placebo results are therefore treated as stress tests, and non-trivial placebo significance reinforces cautious, non-causal reading. In addition, GT scaling, stitching, and topic-versus-term differences may affect comparability, although we address these issues through validation and sensitivity checks.

Future research should link attention signals to administrative and behavioral outcomes such as helpdesk tickets, onboarding completion, active user counts, or e-invoice submissions to test whether search attention has measurable operational lead value. Richer designs could incorporate communication intensity and ecosystem activity, including media coverage, vendor releases, and professional association notices, in order to separate policy timing from concurrent narrative shocks. Higher-frequency data could also be used to examine anticipatory spikes and post-event decay, while subgroup analyses by region, industry, or user type could clarify who responds to which milestones [12,22,23]. Finally, applying the same preregistered framework to other digital tax reforms such as Making Tax Digital, SAF-T, or national e-invoicing mandates would test portability and help build comparative evidence across common rollout archetypes.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/accountaudit2020006/s1.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the author on request.

Conflicts of Interest

The author declares no conflicts of interest.

Appendix A

Table A1. Event timing sensitivity: +2-month lag (Δ6, HAC = 6). Δ6 re-estimated after shifting each event forward by +2 months.

Event	aade (A2)	ααδε (A3)	τιμολόγιο (C1)	timologio (C2)	Electronic Invoicing (D1)
b2g_phase1	39.0 ***	58.1 ***	9.2	33.5 **	28.9 **
b2g_rest_public	−6.3	−15.4	−11.6 ***	−3.7	−99.3 ***
central_admin_full	−10.2 ***	−20.6 ***	−11.3	−13.0 **	7.7 **
eu_b2b_authorisation	−46.0 ***	−60.9 ***	12.9	−39.2 *	−8.7
mydata_go_live	2.7	15.3 ***	1.8	39.0 ***	−9.9
vat_mydata_alignment	−37.8 ***	−38.9 ***	10.6	−29.3 *	56.8 ***

Note. Estimates in SVI points with NW-HAC L = 6 SEs; BH-adjusted stars as above. Significance markers are reported as follows: *

p < 0.05

, **

p < 0.01

, ***

p < 0.001

.

Table A2. Log-scale sensitivity: Δ6 as percentage change (HAC = 6). Six-month effects expressed on the log scale (approximate percentage change) for comparability across series.

Series	Event	aade (A2) (log % Δ6)	ααδε (A3) (log % Δ6)	τιμολόγιο (C1) (log % Δ6)	timologio (C2) (log % Δ6)	Electronic Invoicing (D1) (log % Δ6)
0	b2g_phase1	−10.9%	−4.2%	346.4% **	83.2%	306.5% *
1	b2g_rest_public	−38.8%	−10.6%	243.8% **	−41.4%	−90.1% ***
2	central_admin_full	52.2%	34.5%	−30.0%	−28.0%	−29.7%
3	eu_b2b_authorisation	−19.5%	2.2%	87.5%	26.9%	137.9%
4	mydata_go_live	−51.0%	−53.4% *	22.7%	524.4% ***	−34.8%
5	vat_mydata_alignment	12.8%	−18.6%	−84.1% **	−53.7%	111.9%

Note. Entries are percentage changes relative to pre-policy baseline. NW-HAC L = 6 SEs; BH-adjusted stars as above. Interpret as semi-elasticities at six months. Significance markers are reported as follows: *

p < 0.05

, **

p < 0.01

, ***

p < 0.001

.

Table A3. STL-deseasoned outcomes: Δ6 in SVI points (HAC = 6). Δ6 re-estimated after removing seasonal components using STL.

Series	Event	aade (A2) (STL Δ6)	ααδε (A3) (STL Δ6)	τιμολόγιο (C1) (STL Δ6)	timologio (C2) (STL Δ6)	Electronic Invoicing (D1) (STL Δ6)
0	b2g_phase1	8.1 ***	5.8	50.0 ***	4.5	38.5 *
1	b2g_rest_public	−6.2	42.2 ***	25.1 ***	−0.3	−51.6 ***
2	central_admin_full	7.2 ***	16.3 ***	−17.6 ***	7.9	1.8
3	eu_b2b_authorisation	−4.6	−10.2 *	5.7	19.2 ***	−24.0 *
4	mydata_go_live	3.7	15.4 ***	3. 3 ***	38.1 ***	4.6
5	vat_mydata_alignment	−4.7	−25.8 ***	−71.3 ***	−16.0	33.6 *

Note. Outcome is seasonally adjusted SVI; estimates in points. NW-HAC L = 6 SEs; BH-adjusted stars as above. Results mirror baseline signs and significance. Significance markers are reported as follows: *

p < 0.05

, **

p < 0.01

, ***

p < 0.001

.

Table A4. Placebo events (−24 months): Δ6 in SVI points (HAC = 6). Falsification test shifting each event 24 months earlier.

Series	Event	aade (A2)	ααδε (A3)	τιμολόγιο (C1)	timologio (C2)	Electronic Invoicing (D1)
0	b2g_phase1	−35.0 *	−4.0	−1.7	58.6 ***	22.9
1	b2g_rest_public	−42.3 ***	−98.8 ***	−8.8	21.4 **	49.9 ***
2	central_admin_full	15.5	7.5	3.0	2.1	−13.3
3	eu_b2b_authorisation	23.4 ***	10.2	14.8 *	2.3	6.9
4	mydata_go_live	−2.1	4.9	−6.1 *	−13.0 **	10.7 ***
5	vat_mydata_alignment	90.0 ***	71.3 ***	12.9	−54.4 ***	−55.5 **

Note. Estimates in SVI points with NW-HAC L = 6 SEs; BH-adjusted stars as above. Lack of coherent event family patterns and frequent sign reversals support the validity of the main timing results. Significance markers are reported as follows: *

p < 0.05

, **

p < 0.01

, ***

p < 0.001

.

Table A5. Rolling-origin cross-validation estimate (MAE, RMSE) by horizon, model, and series.

Series	Series	Model	h	MAE	RMSE	n_splits
6	Electronic invoicing (D1)	SNAIVE (12)	1	6.333333	6.333333	12
0	Electronic invoicing (D1)	OLS + events	1	9.301091	9.301091	12
3	Electronic invoicing (D1)	SARIMAX + events	1	17.507054	17.507054	12
1	Electronic invoicing (D1)	OLS + events	2	7.788950	8.543542	12
7	Electronic invoicing (D1)	SNAIVE (12)	2	8.125000	9.506404	12
4	Electronic invoicing (D1)	SARIMAX + events	2	18.832143	19.840365	12
2	Electronic invoicing (D1)	OLS + events	3	9.084294	10.258194	12
8	Electronic invoicing (D1)	SNAIVE (12)	3	10.416667	12.540671	12
5	Electronic invoicing (D1)	SARIMAX + events	3	20.041784	21.423044	12
9	aade (A2)	OLS + events	1	3.834758	3.834758	12
15	aade (A2)	SNAIVE (12)	1	6.416667	6.416667	12
12	aade (A2)	SARIMAX + events	1	19.416505	19.416505	12
10	aade (A2)	OLS + events	2	6.379389	7.441202	12
16	aade (A2)	SNAIVE (12)	2	7.812500	8.403695	12
13	aade (A2)	SARIMAX + events	2	19.977884	20.918017	12
11	aade (A2)	OLS + events	3	6.361970	7.528689	12
17	aade (A2)	SNAIVE (12)	3	7.625000	8.806179	12
14	aade (A2)	SARIMAX + events	3	20.225524	21.439155	12
18	timologio (C2)	OLS + events	1	21.352019	21.352019	12
24	timologio (C2)	SNAIVE (12)	1	21.666667	21.666667	12
21	timologio (C2)	SARIMAX + events	1	28.928036	28.928036	12
19	timologio (C2)	OLS + events	2	16.211093	17.569670	12
25	timologio (C2)	SNAIVE (12)	2	18.833333	20.247061	12
22	timologio (C2)	SARIMAX + events	2	30.351804	32.357839	12
20	timologio (C2)	OLS + events	3	14.347414	16.478155	12
26	timologio (C2)	SNAIVE (12)	3	16.944444	18.993470	12
23	timologio (C2)	SARIMAX + events	3	29.067110	31.414501	12
27	ααδε (A3)	OLS + events	1	6.035068	6.035068	12
33	ααδε (A3)	SNAIVE (12)	1	11.958333	11.958333	12
30	ααδε (A3)	SARIMAX + events	1	32.576393	32.576393	12
28	ααδε (A3)	OLS + events	2	8.585137	9.413745	12
34	ααδε (A3)	SNAIVE (12)	2	12.104167	12.662202	12
31	ααδε (A3)	SARIMAX + events	2	33.114341	33.752481	12
29	ααδε (A3)	OLS + events	3	8.719237	9.830565	12
35	ααδε (A3)	SNAIVE (12)	3	12.763889	13.671554	12
32	ααδε (A3)	SARIMAX + events	3	34.515184	35.282245	12
36	τιμολόγιο (C1)	OLS + events	1	7.880146	7.880146	12
42	τιμολόγιο (C1)	SNAIVE (12)	1	8.250000	8.250000	12
39	τιμολόγιο (C1)	SARIMAX + events	1	14.249626	14.249626	12
43	τιμολόγιο (C1)	SNAIVE (12)	2	6.833333	7.656650	12
37	τιμολόγιο (C1)	OLS + events	2	7.134803	7.560648	12
40	τιμολόγιο (C1)	SARIMAX + events	2	14.626164	15.539176	12
44	τιμολόγιο (C1)	SNAIVE (12)	3	5.444444	6.618365	12
38	τιμολόγιο (C1)	OLS + events	3	6.072613	6.723279	12
41	τιμολόγιο (C1)	SARIMAX + events	3	13.129236	14.150966	12

Note. Errors are averages across splits; lower is better. SVI is on Google’s 0–100 scale.

Table A6. Series × horizon accuracy winners with percent MAE improvement over SNAIVE (12).

Series	Events	Model	h	MAE	RMSE	n_splits
0	Electronic invoicing (D1)	SNAIVE (12)	1	6.333333	6.333333	12
1	Electronic invoicing (D1)	OLS + events	2	7.788950	8.543542	12
2	Electronic invoicing (D1)	OLS + events	3	9.084294	10.258194	12
3	aade (A2)	OLS + events	1	3.834758	3.834758	12
4	aade (A2)	OLS + events	2	6.379389	7.441202	12
5	aade (A2)	OLS + events	3	6.361970	7.528689	12
6	timologio (C2)	OLS + events	1	21.352019	21.352019	12
7	timologio (C2)	OLS + events	2	16.211093	17.569670	12
8	timologio (C2)	OLS + events	3	14.347414	16.478155	12
9	ααδε (A3)	OLS + events	1	6.035068	6.035068	12
10	ααδε (A3)	OLS + events	2	8.585137	9.413745	12
11	ααδε (A3)	OLS + events	3	8.719237	9.830565	12
12	τιμολόγιο (C1)	OLS + events	1	7.880146	7.880146	12
13	τιμολόγιο (C1)	SNAIVE (12)	2	6.833333	7.656650	12
14	τιμολόγιο (C1)	SNAIVE (12)	3	5.444444	6.618365	12

Note. Positive percentages indicate lower MAE than SNAIVE (12).

Table A7. Cross-validation summary relative to the SNAIVE (12) baseline (MAE_base and % improvement).

Series	Events	Model	h	MAE	RMSE	n_splits	MAE_base	MAE_improv_%
0	Electronic invoicing (D1)	SNAIVE (12)	1	6.333333	6.333333	12	6.333333	0.000000
1	Electronic invoicing (D1)	OLS + events	1	9.301091	9.301091	12	6.333333	−46.859334
2	Electronic invoicing (D1)	SARIMAX + events	1	17.507054	17.507054	12	6.333333	−176.427175
3	Electronic invoicing (D1)	OLS + events	2	7.788950	8.543542	12	8.125000	4.135998
4	Electronic invoicing (D1)	SNAIVE (12)	2	8.125000	9.506404	12	8.125000	0.000000
5	Electronic invoicing (D1)	SARIMAX + events	2	18.832143	19.840365	12	8.125000	−131.780225
6	Electronic invoicing (D1)	OLS + events	3	9.084294	10.258194	12	10.416667	12.790781
7	Electronic invoicing (D1)	SNAIVE (12)	3	10.416667	12.540671	12	10.416667	0.000000
8	Electronic invoicing (D1)	SARIMAX + events	3	20.041784	21.423044	12	10.416667	−92.401126
9	aade (A2)	OLS + events	1	3.834758	3.834758	12	6.416667	40.237541
10	aade (A2)	SNAIVE (12)	1	6.416667	6.416667	12	6.416667	0.000000
11	aade (A2)	SARIMAX + events	1	19.416505	19.416505	12	6.416667	−202.594890
12	aade (A2)	OLS + events	2	6.379389	7.441202	12	7.812500	18.343818
13	aade (A2)	SNAIVE (12)	2	7.812500	8.403695	12	7.812500	0.000000
14	aade (A2)	SARIMAX + events	2	19.977884	20.918017	12	7.812500	−155.716914
15	aade (A2)	OLS + events	3	6.361970	7.528689	12	7.625000	16.564331
16	aade (A2)	SNAIVE (12)	3	7.625000	8.806179	12	7.625000	0.000000
17	aade (A2)	SARIMAX + events	3	20.225524	21.439155	12	7.625000	−165.252777
18	timologio (C2)	OLS + events	1	21.352019	21.352019	12	21.666667	1.452219
19	timologio (C2)	SNAIVE (12)	1	21.666667	21.666667	12	21.666667	0.000000
20	timologio (C2)	SARIMAX + events	1	28.928036	28.928036	12	21.666667	−33.514011
21	timologio (C2)	OLS + events	2	16.211093	17.569670	12	18.833333	13.923400
22	timologio (C2)	SNAIVE (12)	2	18.833333	20.247061	12	18.833333	0.000000
23	timologio (C2)	SARIMAX + events	2	30.351804	32.357839	12	18.833333	−61.160021
24	timologio (C2)	OLS + events	3	14.347414	16.478155	12	16.944444	15.326736
25	timologio (C2)	SNAIVE (12)	3	16.944444	18.993470	12	16.944444	0.000000
26	timologio (C2)	SARIMAX + events	3	29.067110	31.414501	12	16.944444	−71.543601
27	ααδε (A3)	OLS + events	1	6.035068	6.035068	12	11.958333	49.532535
28	ααδε (A3)	SNAIVE (12)	1	11.958333	11.958333	12	11.958333	0.000000
29	ααδε (A3)	SARIMAX + events	1	32.576393	32.576393	12	11.958333	−172.415826
30	ααδε (A3)	OLS + events	2	8.585137	9.413745	12	12.104167	29.072875
31	ααδε (A3)	SNAIVE (12)	2	12.104167	12.662202	12	12.104167	0.000000
32	ααδε (A3)	SARIMAX + events	2	33.114341	33.752481	12	12.104167	−173.578033
33	ααδε (A3)	OLS + events	3	8.719237	9.830565	12	12.763889	31.688238
34	ααδε (A3)	SNAIVE (12)	3	12.763889	13.671554	12	12.763889	0.000000
35	ααδε (A3)	SARIMAX + events	3	34.515184	35.282245	12	12.763889	−170.412762
36	τιμολόγιο (C1)	OLS + events	1	7.880146	7.880146	12	8.250000	4.483073
37	τιμολόγιο (C1)	SNAIVE (12)	1	8.250000	8.250000	12	8.250000	0.000000
38	τιμολόγιο (C1)	SARIMAX + events	1	14.249626	14.249626	12	8.250000	−72.722739
39	τιμολόγιο (C1)	SNAIVE (12)	2	6.833333	7.656650	12	6.833333	0.000000
40	τιμολόγιο (C1)	OLS + events	2	7.134803	7.560648	12	6.833333	−4.411757
41	τιμολόγιο (C1)	SARIMAX + events	2	14.626164	15.539176	12	6.833333	−114.041425
42	τιμολόγιο (C1)	SNAIVE (12)	3	5.444444	6.618365	12	5.444444	0.000000
43	τιμολόγιο (C1)	OLS + events	3	6.072613	6.723279	12	5.444444	−11.537782
44	τιμολόγιο (C1)	SARIMAX + events	3	13.129236	14.150966	12	5.444444	−141.149230

Note. n_splits = 12 for all cells; metrics computed on held-out folds from the rolling-origin procedure.

Table A8. Rolling-origin cross-validation results.

Series	Events	Model	h	MAE	RMSE	sMAPE	MASE	PI95_cov	n_splits
0	Electronic invoicing (D1)	BLEND (OLS, SNAIVE)	1	7.789796	7.789796	88.699912	1.092261	0.733333	15
1	Electronic invoicing (D1)	BLEND (OLS, SNAIVE)	2	7.951671	8.619382	64.402244	1.118063	0.733333	15
2	Electronic invoicing (D1)	BLEND (OLS, SNAIVE)	3	9.984889	11.625200	70.919535	1.378135	0.688889	15
3	aade (A2)	BLEND (AR1, SNAIVE)	1	4.463388	4.463388	21.301319	0.462340	0.933333	15
4	aade (A2)	BLEND (AR1, SNAIVE)	2	6.371356	7.334841	24.445857	0.661681	0.833333	15
5	aade (A2)	BLEND (AR1, SNAIVE)	3	6.228173	7.393890	24.254338	0.647433	0.822222	15
6	timologio (C2)	BLEND (OLS, SNAIVE)	1	15.662447	15.662447	67.273160	4.132944	0.600000	15
7	timologio (C2)	BLEND (OLS, SNAIVE)	2	12.511016	14.178881	58.088125	3.035360	0.666667	15
8	timologio (C2)	BLEND (OLS, SNAIVE)	3	10.883420	12.965520	55.545402	2.502564	0.666667	15
9	ααδε (A3)	OLS + events + AR1	1	7.157816	7.157816	19.057649	0.502178	0.933333	15
10	ααδε (A3)	BLEND (AR1, SNAIVE)	2	8.715877	9.359305	22.931140	0.625436	0.866667	15
11	ααδε (A3)	BLEND (AR1, SNAIVE)	3	8.835019	9.886419	22.774981	0.648328	0.844444	15
12	τιμολόγιο (C1)	BLEND (OLS, SNAIVE)	1	7.502383	7.502383	52.296183	2.200247	0.600000	15
13	τιμολόγιο (C1)	BLEND (OLS, SNAIVE)	2	6.702038	7.409867	45.758427	1.855730	0.566667	15
14	τιμολόγιο (C1)	SNAIVE (12)	3	5.488889	6.632907	41.888791	1.542399	0.666667	15

Note. For each series and horizon, the table reports the winning specification and its accuracy/uncertainty metrics: MAE, RMSE, sMAPE, MASE, and empirical 95% predictive-interval coverage (PI95_cov). Blends dominate most cells (e.g., BLEND (AR1, SNAIVE) for A2 and A3; BLEND (OLS, SNAIVE) for C2 and C1 at short horizons), while SNAIVE (12) remains the winner only for C1 at h = 3. These outcomes mirror the improvement heatmap and the MAE bar patterns.

References

Alexopoulos, T.A.; Thompson, H. A macroeconomic simulation for Greece in the wake of its government debt crisis. Econ. Change Restruct. 2021, 54, 699–716. [Google Scholar] [CrossRef]
Boikos, S.; Makantasi, E.; Panagiotidis, T. Macroeconomic Uncertainty Indices for European Countries. Notas Econ. 2023, 2023, 7–56. [Google Scholar] [CrossRef] [PubMed]
Box, G.E.P.; Tiao, G.C. Intervention analysis with applications to economic and environmental problems. J. Am. Stat. Assoc. 1975, 70, 70–79. [Google Scholar] [CrossRef]
Choi, H.; Varian, H. Predicting the Present with Google Trends. Econ. Rec. 2012, 88, 2–9. [Google Scholar] [CrossRef]
Cohen, B.C. Press and Foreign Policy; Princeton University Press: Princeton, NJ, USA, 2015. [Google Scholar] [CrossRef]
Da, Z.; Engelberg, J.; Gao, P. In Search of Attention. J. Financ. 2011, 66, 1461–1499. [Google Scholar] [CrossRef]
Dokas, I.; Oikonomou, G.; Panagiotidis, M.; Spyromitros, E. Macroeconomic and Uncertainty Shocks’ Effects on Energy Prices: A Comprehensive Literature Review. Energies 2023, 16, 1491. [Google Scholar] [CrossRef]
E-Invoicing Compliance in Greece|Pagero. Available online: https://www.pagero.com/compliance/regulatory-updates/greece (accessed on 27 August 2025).
Erokhin, D.; Komendantova, N. Analyzing Public Interest in Geohazards Using Google Trends Data. Geosciences 2024, 14, 266. [Google Scholar] [CrossRef]
Ferguson, D.; Meyer, F.G. Probability density estimation for sets of large graphs with respect to spectral information using stochastic block models. arXiv 2022, arXiv:2207.02168v1. [Google Scholar] [CrossRef]
Gelman, A. Scaling regression inputs by dividing by two standard deviations. Stat. Med. 2008, 27, 2865–2873. [Google Scholar] [CrossRef]
Ghosh, A.; E-Roub, F.; Krishnan, N.C.; Choudhury, S.; Basu, A. Can google trends search inform us about the population response and public health impact of abrupt change in alcohol policy?—A case study from India during the COVID-19 pandemic. Int. J. Drug Policy 2021, 87, 102984. [Google Scholar] [CrossRef] [PubMed]
Ginsberg, J.; Mohebbi, M.H.; Patel, R.S.; Brammer, L.; Smolinski, M.S.; Brilliant, L. Detecting influenza epidemics using search engine query data. Nature 2009, 457, 1012–1014. [Google Scholar] [CrossRef] [PubMed]
Goel, S.; Hofman, J.M.; Lahaie, S.; Pennock, D.M.; Watts, D.J. Predicting consumer behavior with web search. Proc. Natl. Acad. Sci. USA 2010, 107, 17486–17490. [Google Scholar] [CrossRef] [PubMed]
Greece: B2B and B2G Electronic Invoicing via MyData|EDICOM Global. Available online: https://edicomgroup.com/blog/greece-mandatory-electronic-invoice?utm_source=chatgpt.com (accessed on 27 August 2025).
Greece: Formal EU Approval for B2B E-Invoicing Mandate Published|Sovos. Available online: https://sovos.com/regulatory-updates/vat/greece-formal-eu-approval-for-b2b-e-invoicing-mandate-published/?utm_source=chatgpt.com (accessed on 27 August 2025).
Harvey, A.C. Forecasting, Structural Time Series Models and the Kalman Filter; Cambridge University Press: Cambridge, UK, 1990. [Google Scholar] [CrossRef]
Heinemann, M.; Stiller, W. Digitalization and cross-border tax fraud: Evidence from e-invoicing in Italy. Int. Tax Public Financ. 2025, 32, 195–237. [Google Scholar] [CrossRef]
Hölzl, J.; Keusch, F.; Sajons, C. The (mis)use of Google Trends data in the social sciences—A systematic review, critique, and recommendations. Soc. Sci. Res. 2025, 126, 103099. [Google Scholar] [CrossRef]
Hyytinen, A.; Tuimala, J.; Hammar, M. Enhancing the adoption of digital public services: Evidence from a large-scale field experiment. Gov. Inf. Q. 2022, 39, 101687. [Google Scholar] [CrossRef]
Implementing Decision—EU—2025/502—EN—EUR-Lex. Available online: https://eur-lex.europa.eu/eli/dec_impl/2025/502/oj/eng?utm_source=chatgpt.com (accessed on 27 August 2025).
Linnell, K.; Fudolig, M.; Schwartz, A.; Ricketts, T.H.; O’Neil-Dunne, J.P.M.; Dodds, P.S.; Danforth, C.M. Spatial changes in park visitation at the onset of the pandemic. PLoS Glob. Public Health 2022, 2, e0000766. [Google Scholar] [CrossRef]
Makridakis, S.; Spiliotis, E.; Assimakopoulos, V. The M4 Competition: Results, findings, conclusion and way forward. Int. J. Forecast. 2018, 34, 802–808. [Google Scholar] [CrossRef]
Mccombs, M.E.; Shaw, D.L. The Agenda-Setting function of mass media. Public Opin. Q. 1972, 36, 176–187. [Google Scholar] [CrossRef]
New Reporting System for Greek VAT—Taxand. Available online: https://www.taxand.com/our-thinking/insights/new-reporting-system-for-greek-vat/?utm_source=chatgpt.com (accessed on 27 August 2025).
Fiscal Solutions. New Rules for Data Transmission to the myDATA Platform in Greece Have Recently Been Published. Available online: https://www.fiscal-requirements.com/news/2702 (accessed on 27 August 2025).
Ozier, D.; Rafiq, T.; de Souza, R.J.; Singh, S.M. Use of Sacubitril/Valsartan Prior to Primary Prevention Implantable Cardioverter Defibrillator Implantation. CJC Open 2023, 5, 93–98. [Google Scholar] [CrossRef]
Papagianni, E.; Evgenidis, A.; Tsagkanos, A.; Megalooikonomou, V. Tourism Demand in the Face of Geopolitical Risk: Insights From a Cross-Country Analysis. J. Travel Res. 2024, 63, 2094–2119. [Google Scholar] [CrossRef]
Reigl, N. Noise shocks and business cycle fluctuations in three major European Economies. Empir. Econ. 2023, 64, 603–657. [Google Scholar] [CrossRef]
Safitri, K. Tax Policy Innovations for Enhancing MSMEs Compliance and Economic Resilience. Int. J. Bus. Appl. Econ. 2025, 4, 769–784. [Google Scholar] [CrossRef]
Shen, L.; Sun, M.; Song, S.; Hu, Q.; Wang, N.; Ou, G.; Guo, Z.; Du, J.; Shao, Z.; Bai, Y.; et al. The impact of anti-COVID-19 nonpharmaceutical interventions on hand, foot, and mouth disease—A spatiotemporal perspective in Xi’an, northwestern China. J. Med. Virol. 2022, 94, 3121–3132. [Google Scholar] [CrossRef]
Simionescu, M.; Schneider, N. Monetary shocks and production network in the G7 countries. J. Econ. Struct. 2023, 12, 20. [Google Scholar] [CrossRef]
Tsamis, G.; Evangelos, G.; Papakostas, A.; Vassiliou, G.; Grafanakis, M.; Garefalakis, A.; Vassalos, M.; Mylona, A.; Papadakis, N. Cost-Effective Design, Content Management System Implementation and Artificial Intelligence Support of Greek Government AADE, myDATA Web Service for Generic Government Infrastructure, a Complete Analysis. Algorithms 2025, 18, 339. [Google Scholar] [CrossRef]
Tsitouras, A.; Papapanagos, H. Factors Influencing Income Inequality and Inclusive Growth in Greece: A Long-Run and Short-Run Analysis. J. Knowl. Econ. 2025, 17, 2889–2919. [Google Scholar] [CrossRef]
Tu, T.; Chhatralia, K.; Maguire, K.; Tipping, S. HM Revenue and Customs Research Report 480 Making Tax Digital for Business: Survey of Small Businesses and Landlords; Research Report for HMRC; HM Revenue & Customs: London, UK, 2017.
Downs, A. Up and Down with Ecology: The “Issue-Attention Cycle”. In Agenda Setting; Routledge: Oxfordshire, UK, 2016; pp. 27–33. Available online: https://www.taylorfrancis.com/chapters/edit/10.4324/9781315538389-4/ecology-issue-attention-cycle-anthony-downs (accessed on 27 August 2025).
Yu, C.; Li, Y. Digitalization of tax collection and enterprises’ social security compliance. Int. Tax Public Financ. 2025, 32, 1213–1252. [Google Scholar] [CrossRef]

Figure 1. myData policy timeline.

Figure 2. Event impacts at six months (Δ6) with HAC-robust 95% confidence intervals. Dumbbell plot of the estimated relative change in Google Trends search volume index (SVI; 0–100 scale) six months following each policy event (vertical line at 0 = no change). Points represent each estimate per query series (color-coded), and horizontal bars represent HAC-robust 95% CIs. Positive values represent greater search interest than the counterfactual; negative values represent decreased interest.

Figure 3. Event impacts at six months (Δ6) by family (95% HAC CIs). Panel plots repeating the Δ6 estimates from Figure 1 but grouped by construct: A (platform queries), C (application queries), and D (ecosystem query). Within each panel, points and 95% HAC CIs are shown for the relevant series only. The vertical line marks zero impact.

Figure 4. Rolling-origin CV MAE by model and horizon (faceted by series). Mean absolute error (MAE; Google Trends 0–100 scale) for SNAIVE (12) and OLS + events at horizons h = 1, 2, 3 months, computed from blocked rolling-origin cross-validation. Lower bars indicate better accuracy.

Figure 5. Heatmap of relative MAE change by series and horizon, computed as

100 \times (1 - {MAE}_{OLS + events} / {MAE}_{SNAIVE (12)})

. Green (positive) values show OLS + events outperforming the seasonal-naïve baseline; red (negative) values show the reverse.

Figure 5. Heatmap of relative MAE change by series and horizon, computed as

100 \times (1 - {MAE}_{OLS + events} / {MAE}_{SNAIVE (12)})

. Green (positive) values show OLS + events outperforming the seasonal-naïve baseline; red (negative) values show the reverse.

Figure 6. ααδε (A3): Example of three-step nowcast with 95% prediction intervals (OLS + events). SVI is on Google’s 0–100 scale. PIs are based on out-of-sample residual variance from the rolling-origin setup used in RQ2.

Figure 7. Rolling-origin backtest paths (h = 1) by series. Out-of-sample one-step-ahead predictions for the blocked rolling-origin design (origins from 2018-01 in 6-month increments; 48-month maximum training window) are plotted for each target series.

Figure 8. Best model vs. SNAIVE (12)—percentage MAE improvement (rolling CV). Heatmap shows the percentage reduction in MAE of the best model at each series–horizon cell relative to SNAIVE (12) under rolling-origin CV (origins from 2018-01, step = 6 months, max train = 48 months; h = 1–3). Green shades indicate improvements; values are labeled as

100 \times (1 - \frac{{MAE}_{best}}{{MAE}_{SNAIVE (12)}})

. “Best” is chosen among OLS + events, AR (1) variants, SARIMAX + events, and equal-weight blends (e.g., BLEND (OLS, SNAIVE), BLEND (AR1, SNAIVE)).

Figure 8. Best model vs. SNAIVE (12)—percentage MAE improvement (rolling CV). Heatmap shows the percentage reduction in MAE of the best model at each series–horizon cell relative to SNAIVE (12) under rolling-origin CV (origins from 2018-01, step = 6 months, max train = 48 months; h = 1–3). Green shades indicate improvements; values are labeled as

100 \times (1 - \frac{{MAE}_{best}}{{MAE}_{SNAIVE (12)}})

. “Best” is chosen among OLS + events, AR (1) variants, SARIMAX + events, and equal-weight blends (e.g., BLEND (OLS, SNAIVE), BLEND (AR1, SNAIVE)).

Figure 9. Example for aade (A2): h = 1–3-month forecast—BLEND (OLS + AR1, SNAIVE) with prediction cones. The orange line shows the 1–3-month point forecast from the equal-weight blend of OLS + events + AR (1) and SNAIVE (12); shaded bands indicate 50%, 80%, and 90% prediction intervals. The dashed vertical line marks the forecast start, and the dotted green line shows the SNAIVE (12) benchmark.

Figure 10. Example of backtest path (h = 1): actual vs. OLS + events vs. SNAIVE (12) for ααδε (A3).

Table 1. Variables and definitions.

Variable	Series/Definition
mydata (A1)	GT SVI for “mydata” (platform term)
aade (A2)	GT SVI for “aade” (platform term; priority outcome)
ααδε (A3)	GT SVI for “ααδε” (Greek script; platform; priority outcome)
ηλεκτρονικα βιβλια (B1)	GT SVI for “ηλεκτρονικα βιβλια” (books/e-books; ancillary)
ηλεκτρονικά βιβλία (B2)	GT SVI for “ηλεκτρονικά βιβλία” (diacritics variant; ancillary)
ηλεκτρονικα βιβλια ααδε (B3)	GT SVI for “ηλεκτρονικα βιβλια ααδε” (ancillary)
τιμολόγιο (C1)	GT SVI for “τιμολόγιο” (invoicing; app family; priority)
timologio (C2)	GT SVI for “timologio” (Latin script variant; app; priority)
ηλεκτρονικό τιμολόγιο (C3)	GT SVI for “ηλεκτρονικό τιμολόγιο” (app/feature; ancillary)
Electronic invoicing (topic) (D1)	GT SVI topic for “Electronic invoicing” (ecosystem; priority)
e-invoicing (D2)	GT SVI for “e-invoicing” (ecosystem; ancillary)
peppol (D3)	GT SVI for “peppol” (standard; ecosystem; ancillary)
Composite attention index	z-score mean of the five priority series (A2, A3, C1, C2, D1); PCA (1) used in robustness
Policy events—step dummies	$S_{t}^{(e)} = 1 {t \geq T_{e}}$ for each preregistered milestone
Policy events—slope ramps	$P O S T_{t}^{(e)} = m a x (0, t - T_{e})$ in months for each milestone (as preregistered)
Calendar dummies (m₂, …, m₁₂)	Month fixed effects with January omitted
quarter_end	Indicator for calendar quarter end; excluded in main (collinearity), retained in robustness

Notes: All GT series are monthly SVI values in 0–100 units. Values of “<1” are recoded to 0.5. The main analysis window is 2016–latest; 2004+ is used for specified robustness checks.

Table 2. Six-month post-event change (Δ6) in search volume index (SVI) points by series and event (BH-adjusted significance).

Series	Event	Electronic Invoicing (D1)	aade (A2)	timologio (C2)	ααδε (A3)	τιμολόγιο (C1)
0	b2g_phase1	28.8	3.2	23.8 **	18.1	33.6 **
1	b2g_rest_public	−52.7 ***	−22.2	8.3	5.2	29.7 ***
2	central_admin_full	1.9	10.4	−3.4	6.4	−10.4
3	eu_b2b_authorisation	14.2	−6.7	−1.0	−4.4	8.2 *
4	mydata_go_live	4.6	3.2	38.4 ***	15.1 *	3.4 *
5	vat_mydata_alignment	46.4 **	0.9	−34.3 **	−34.2	−43.1 *

Note: Δ6 is the average change in SVI over months t = +1, …, +6 relative to the pre-event path. Stars denote BH-adjusted significance: p < 0.05 (*), p < 0.01 (**), p < 0.001 (***). Positive values indicate increases in search interest.

Table 3. Six-month post-event change expressed as percentage of each series’ 2019–2020 mean (context for magnitude).

Series	Event	Electronic Invoicing (D1)	aade (A2)	timologio (C2)	ααδε (A3)	τιμολόγιο (C1)
0	b2g_phase1	198.2%	19.5%	312.5%	66.6%	363.0%
1	b2g_rest_public	−362.3%	−134.3%	108.3%	19.1%	321.3%
2	central_admin_full	13.0%	63.0%	−45.0%	23.5%	−112.7%
3	eu_b2b_authorisation	97.4%	−40.6%	−13.2%	−16.0%	88.6%
4	mydata_go_live	31.6%	19.3%	504.2%	55.7%	36.7%
5	vat_mydata_alignment	318.9%	5.2%	−449.4%	−126.0%	−466.2%

Note. Percentage values scale Δ6 by the series’ mean SVI during 2019–2020 to enable cross-series comparisons. Signs retain the direction of the Δ6 effect (e.g., −466% reflects a large decline relative to baseline).

Table 4. Baseline Δ6 (SVI points) by event and construct (HAC = 6). Six-month post-event changes in search interest across platform queries (A2/A3), application terms (C1/C2), and the ecosystem term (D1).

Event	aade (A2)	ααδε (A3)	τιμολόγιο (C1)	timologio (C2)	Electronic Invoicing (D1)
mydata_go_live	3.2	15.1	3.4	38.4 ***	4.6
b2g_phase1	3.2	18.1	33.6 *	23.8 *	28.8
central_admin_full	10.4	6.4	−10.4	−3.4	1.9
vat_mydata_alignment	0.9	−34.2	−43.1	−34.3 *	46.4 *
b2g_rest_public	−22.2	5.2	29.7 **	8.3	−52.7 ***
eu_b2b_authorisation	−6.7	−4.4	8.2	−1.0	14.2

Note. Entries are Δ6 in SVI points (0–100). Positive = increase; negative = decrease. Newey–West HAC SEs with L = 6 months. Benjamini–Hochberg (BH)-adjusted results within construct: * p < 0.05, ** p < 0.01, *** p < 0.001.

Table 5. HAC bandwidth comparison for Δ6: L = 12 vs. L = 6. Sensitivity of Δ6 estimates to the HAC bandwidth choice.

Event	aade (A2)_HAC12	ααδε (A3)_HAC12	τιμολόγιο (C1)_HAC12	timologio (C2)_HAC12	Electronic Invoicing (D1)_HAC12	aade (A2)_HAC6	ααδε (A3)_HAC6	τιμολόγιο (C1)_HAC6	timologio (C2)_HAC6	Electronic Invoicing (D1)_HAC6
mydata_go_live	3.2	15.1 **	3.4	38.4 ***	4.6	3.2	15.1	3.4	38.4 ***	4.6
b2g_phase1	3.2	18.1	33.6 ***	23.8 *	28.8	3.2	18.1	33.6 *	23.8 *	28.8
central_admin_full	10.4	6.4	−10.4	−3.4	1.9	10.4	6.4	−10.4	−3.4	1.9
vat_mydata_alignment	0.9	−34.2	−43.1 **	−34.3 **	46.4 *	0.9	−34.2	−43.1	−34.3 *	46.4 *
b2g_rest_public	−22.2	5.2	29.7 ***	8.3	−52.7 ***	−22.2	5.2	29.7 **	8.3	−52.7 ***
eu_b2b_authorisation	−6.7	−4.4	8.2	−1.0	14.2	−6.7	−4.4	8.2	−1.0	14.2

Note. Left block uses NW-HAC L = 12; right block shows L = 6 (baseline). BH-adjusted significance markers are reported as follows: *

p < 0.05

, **

p < 0.01

, ***

p < 0.001

. Substantive conclusions are unchanged across bandwidth choices.

Table 6. Event timing sensitivity: +1-month lag (Δ6, HAC = 6). Δ6 re-estimated after shifting each event forward by +1 month to allow for implementation/awareness delays.

Event	aade (A2)	ααδε (A3)	τιμολόγιο (C1)	timologio (C2)	Electronic Invoicing (D1)
b2g_phase1	11.9	24.2 ***	44.8 ***	32.2 ***	36.9 **
b2g_rest_public	8.2	26.8 **	−8.1	16.1	−65.5 ***
central_admin_full	11.3	9.8 *	−29.4 ***	−5.9	−0.9
eu_b2b_authorisation	−28.6 **	−32.0 ***	10.4 *	−10.1	26.7
mydata_go_live	3.1	15.8 ***	2.5	40.1 ***	−3.5
vat_mydata_alignment	−19.5	−37.9 ***	−39.2 ***	−47.9 ***	33.2 *

Note. Estimates in SVI points with NW-HAC L = 6 SEs; BH-adjusted stars as above. Significance markers are reported as follows: *

p < 0.05

, **

p < 0.01

, ***

p < 0.001

.

Table 7. Cross-validated winners by series and horizon with percentage MAE improvement vs. SNAIVE (12).

h	h = 1	h = 2	h = 3
Electronic invoicing (D1)	SNAIVE (12)—MAE 6.33 (+0.0% vs. SNAIVE (12))	OLS + events—MAE 7.79 (+4.1% vs. SNAIVE (12))	OLS + events—MAE 9.08 (+12.8% vs. SNAIVE (12))
aade (A2)	OLS + events—MAE 3.83 (+40.2% vs. SNAIVE (12))	OLS + events—MAE 6.38 (+18.3% vs. SNAIVE (12))	OLS + events—MAE 6.36 (+16.6% vs. SNAIVE (12))
timologio (C2)	OLS + events—MAE 21.35 (+1.5% vs. SNAIVE (12))	OLS + events—MAE 16.21 (+13.9% vs. SNAIVE (12))	OLS + events—MAE 14.35 (+15.3% vs. SNAIVE (12))
ααδε (A3)	OLS + events—MAE 6.04 (+49.5% vs. SNAIVE (12))	OLS + events—MAE 8.59 (+29.1% vs. SNAIVE (12))	OLS + events—MAE 8.72 (+31.7% vs. SNAIVE (12))
τιμολόγιο (C1)	OLS + events—MAE 7.88 (+4.5% vs. SNAIVE (12))	SNAIVE (12)—MAE 6.83 (+0.0% vs. SNAIVE (12))	SNAIVE (12)—MAE 5.44 (+0.0% vs. SNAIVE (12))

Note. For each series and forecast horizon (h = 1, 2, 3 months), the table reports the winner (lowest mean absolute error, MAE) under blocked rolling-origin cross-validation, the winner’s MAE, and the winner’s percentage improvement relative to SNAIVE (12). Improvements are computed as

100 \times (1 - {MAE}_{w i n n e r} / {MAE}_{SNAIVE (12)})

.

Table 8. Diebold–Mariano tests: OLS + events vs. SNAIVE (12) by series and horizon (MAE loss).

Series	Event	h	Loss	n	DM t (p)	Winner
12	Electronic invoicing (D1)	1	MAE	12	1.13 (p = 0.282)	SNAIVE (12)
13	Electronic invoicing (D1)	2	MAE	12	−0.16 (p = 0.878)	OLS + events
14	Electronic invoicing (D1)	3	MAE	12	−0.57 (p = 0.582)	OLS + events
0	aade (A2)	1	MAE	12	−2.09 (p = 0.061) *	OLS + events
1	aade (A2)	2	MAE	12	−1.04 (p = 0.322)	OLS + events
2	aade (A2)	3	MAE	12	−1.12 (p = 0.285)	OLS + events
9	timologio (C2)	1	MAE	12	−0.07 (p = 0.949)	OLS + events
10	timologio (C2)	2	MAE	12	−0.52 (p = 0.611)	OLS + events
11	timologio (C2)	3	MAE	12	−0.58 (p = 0.571)	OLS + events
3	ααδε (A3)	1	MAE	12	−2.41 (p = 0.034) **	OLS + events
4	ααδε (A3)	2	MAE	12	−1.33 (p = 0.212)	OLS + events
5	ααδε (A3)	3	MAE	12	−1.38 (p = 0.196)	OLS + events
6	τιμολόγιο (C1)	1	MAE	12	−0.21 (p = 0.841)	OLS + events
7	τιμολόγιο (C1)	2	MAE	12	0.25 (p = 0.805)	SNAIVE (12)
8	τιμολόγιο (C1)	3	MAE	12	0.68 (p = 0.511)	SNAIVE (12)

Note. Tests use n = 12 forecast origins per horizon. Significance markers: p < 0.10, * p < 0.05, ** p < 0.01 (unadjusted). With this sample size, only A3 at h = 1 shows a clear advantage for OLS + events (p = 0.034), and A2 at h = 1 is marginal (p = 0.061). All other contrasts are not statistically significant.

Table 9. Event-by-series movement grid (Δ6 classification at 6 months).

Series	Event	aade (A2)	ααδε (A3)	τιμολόγιο (C1)	timologio (C2)	Electronic Invoicing (D1)
0	mydata_go_live	○	▲ (L, S)	▲ (L, S)	▲ (L, S)	○ (S)
1	b2g_phase1	○	○ (L, S)	▲ (L)	▲ (S)	○
2	central_admin_full	○	○	○	○	○
3	vat_mydata_alignment	○	○	▼ (S)	▼ (S)	▲ (S)
4	b2g_rest_public	▼ (L)	○	▲ (L, S)	○	▼ (S)
5	eu_b2b_authorisation	○ (L, S)	○ (L, S)	▲ (S)	○	○

Note. Categorization of medium-term effects (Δ6) per event and search series. ▲ = positive Δ6 (BH-FDR p ≤ 0.05); ▼ = negative Δ6 (BH-FDR p ≤ 0.05); ○ = not significant at BH-FDR 5%. Parentheses show significant component(s) despite marginal Δ6: (L) level shift β_S; (S) post-event slope β_P. Estimates on the pre-registered event study: month OLS with month fixed effects, centered trend, COVID pulses, and Newey–West HAC (6) standard errors; Δ6 = β_S + 6β_P. Event dates are deterministic (myDATA go-live, B2G phase 1, central administration full, VAT/myDATA alignment, B2G rest-of-public, EU B2B authorization). Family-wise inferences pool across A2–A3 (platform), C1–C2 (apps), and D1 (ecosystem).

Table 10. Top events by |Δ₆| within each outcome series (BH-FDR on Δ₆).

Series	Rank	Event	Δ₆ (SVI Points)
aade (A2)	1	b2g_rest_public	−22.2 **
	2	central_admin_full	10.4
	3	eu_b2b_authorisation	−6.7
	4	b2g_phase1	3.2
	5	mydata_go_live	3.2
	6	vat_mydata_alignment	0.9
ααδε (A3)	1	vat_mydata_alignment	−34.2
	2	b2g_phase1	18.1
	3	mydata_go_live	15.1 ***
	4	central_admin_full	6.4
	5	b2g_rest_public	5.2
	6	eu_b2b_authorisation	−4.4
τιμολόγιο (C1)	1	vat_mydata_alignment	−43.1 **
	2	b2g_phase1	33.6 ***
	3	b2g_rest_public	29.7 ***
	4	central_admin_full	−10.4
	5	eu_b2b_authorisation	8.2 **
	6	mydata_go_live	3.4 **
timologio (C2)	1	mydata_go_live	38.4 ***
	2	vat_mydata_alignment	−34.3 ***
	3	b2g_phase1	23.8 ***
	4	b2g_rest_public	8.3
	5	central_admin_full	−3.4
	6	eu_b2b_authorisation	−1.0
Electronic invoicing (D1)	1	b2g_rest_public	−52.7 ***
	2	vat_mydata_alignment	46.4 ***
	3	b2g_phase1	28.8
	4	eu_b2b_authorisation	14.2
	5	mydata_go_live	4.6
	6	central_admin_full	1.9

Notes. Δ₆ is the six-month post-event effect (SVI points, 0–100 scale). Ranks are by absolute |Δ₆| within series (1 = largest magnitude). Stars reflect BH-FDR-adjusted significance within series across events: * p < 0.05, ** p < 0.01, *** p < 0.001. Negative values indicate decreases in search interest relative to the counterfactual path.

Table 11. Composite attention index (z-mean across A2/A3/C1/C2/D1): Δ₆ tests.

	Event	delta_6	p_delta_6
0	mydata_go_live	0.520800	3.924432 × 10⁻¹⁰
1	b2g_phase1	1.488648	1.967194 × 10⁻³
2	central_admin_full	−0.131013	5.716091 × 10⁻¹
3	vat_mydata_alignment	−1.318392	3.499174 × 10⁻²
4	b2g_rest_public	0.263230	3.607021 × 10⁻¹
5	eu_b2b_authorisation	0.222121	4.848606 × 10⁻¹

Note. Composite constructed as the mean of z-scored series (each standardized over the analysis window). Entries report Δ₆ and HAC (6) p-values; BH-FDR applied across events. The composite corroborates broad movement at myDATA go-live and B2G phase 1, no effect at central administration full, and a modest negative effect at VAT–myDATA alignment.

Table 12. Selected backtest checkpoints for ααδε (A3), h = 1 month—actual vs. OLS + events vs. SNAIVE (12).

	Month	Actual	OLS + Events	SNAIVE (12)
2	2020-02-01	20.0	22.000000	11.0
3	2020-08-01	29.0	27.564815	15.0
4	2021-02-01	40.0	31.863426	20.0
5	2021-08-01	27.0	33.012640	29.0
6	2022-02-01	65.0	63.339789	40.0
7	2022-08-01	51.0	77.588432	27.0
8	2023-02-01	56.0	61.758016	65.0
9	2023-08-01	42.0	46.107868	51.0
10	2024-02-01	78.0	68.696850	56.0
11	2024-08-01	48.0	80.533516	42.0
12	2025-02-01	63.0	82.554996	78.0
13	2025-08-01	44.0	53.782484	48.0

Table 13. Robustness map for headline conclusions.

Headline Conclusion (What We Claim)	Robust Across Checks?	Sensitivity/How We Phrase It
Family ordering: app (C) most event-reactive; platform (A) muted; ecosystem (D1) selective but large	Yes	Report as qualitative ordering + consistent signs; avoid over-weighting any single cell’s p-value
Back-office milestone (“central administration full”) has no measurable public salience	Yes	Treat as null/near-zero and consistent with low public visibility
Rollout milestones coincide with app-focused surges (go-live, B2G Phase 1)	Mostly	Timing strength can vary with ±1–2 month lag; present as event-timed association, not causal
Harmonization (VAT–myDATA alignment) coincides with app declines and ecosystem rebalancing	Mostly	Magnitudes depend on transformation; emphasize SVI point Δ and direction, not %
Two “anchor” ecosystem shifts dominate D1 (down at B2G rest-of-public; up at VAT alignment)	Yes	Keep as anchors; acknowledge D1 volatility and that other D1 events are weaker/mixed
Nowcasting gains are strongest for platform series (A2/A3)	Yes	Frame as series- and horizon-conditional; strongest at h = 1 and still favorable at h = 2–3
D1 nowcasting at h = 1 does not improve vs. seasonal naïve	Yes	Explicitly report as a failure case; gains (if any) appear mainly at h = 2–3
Very large percentage changes (±300–500%) reflect meaningful behavioral shifts	No (not claimed)	Recast % as context-only; stress low baselines → mechanical inflation, and avoid “policy success” language

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Balaskas, S. Policy Shocks and Public Attention to Digital Tax in Greece: Event-Study and Nowcasting with Google Trends Time Series. Account. Audit. 2026, 2, 6. https://doi.org/10.3390/accountaudit2020006

AMA Style

Balaskas S. Policy Shocks and Public Attention to Digital Tax in Greece: Event-Study and Nowcasting with Google Trends Time Series. Accounting and Auditing. 2026; 2(2):6. https://doi.org/10.3390/accountaudit2020006

Chicago/Turabian Style

Balaskas, Stefanos. 2026. "Policy Shocks and Public Attention to Digital Tax in Greece: Event-Study and Nowcasting with Google Trends Time Series" Accounting and Auditing 2, no. 2: 6. https://doi.org/10.3390/accountaudit2020006

APA Style

Balaskas, S. (2026). Policy Shocks and Public Attention to Digital Tax in Greece: Event-Study and Nowcasting with Google Trends Time Series. Accounting and Auditing, 2(2), 6. https://doi.org/10.3390/accountaudit2020006

Article Menu

Policy Shocks and Public Attention to Digital Tax in Greece: Event-Study and Nowcasting with Google Trends Time Series

Abstract

1. Introduction

2. Literature Review and Related Work

Background: myDATA and E Invoicing in Greece

3. Data and Variable Construction

3.1. Data Source, Scope, Search Terms and Families

3.2. Construction of Outcomes, Design Matrix and Event Indicators

3.3. Identification and Preregistration

3.3.1. Identification Strategy

3.3.2. Robustness and Preregistration

4. Methods

4.1. RQ1: Event-Study OLS

4.2. RQ2: Nowcasting Design

4.3. RQ3: Which Families Move?

5. Data Analysis and Results

5.1. RQ1—Event Impacts

Robustness and Sensitivity Checks

5.2. RQ2—Nowcasting Skill

5.2.1. Rolling-Origin CV

5.2.2. Forecast Comparison

5.3. RQ3—Which Families Move?

5.3.1. Rank by |Δ6| for Each Series

5.3.2. Composite Index Confirmation (Z-Mean Across A2/A3/C1/C2/D1)

6. Discussion

6.1. Do Events Move Attention?

6.2. Do Event-Aware Models Forecast Better?

6.3. Which Families Move?

7. Practical Implications

7.1. For Policymakers and Tax Administrators

7.2. For Business Managers and Software Vendors

8. Conclusions, Limitations, and Future Directions

Supplementary Materials

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI