Next Article in Journal
Power Efficiency Characterization with Various Gate Oxide Thicknesses in Class DE Amplifiers for HIFU Applications
Next Article in Special Issue
Transformer-Based Multimodal Infusion Dialogue Systems
Previous Article in Journal
Adaptive BIST for Concurrent On-Line Testing on Combinational Circuits
Previous Article in Special Issue
SCA-MMA: Spatial and Channel-Aware Multi-Modal Adaptation for Robust RGB-T Object Tracking
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Integrated Framework to Assess the Extent of the Pandemic Impact on the Size and Structure of the E-Commerce Retail Sales Sector and Forecast Retail Trade E-Commerce

International Business and Economics Department, Bucharest University of Economic Studies, 010374 Bucharest, Romania
Electronics 2022, 11(19), 3194; https://doi.org/10.3390/electronics11193194
Submission received: 13 September 2022 / Revised: 29 September 2022 / Accepted: 30 September 2022 / Published: 5 October 2022

Abstract

:
With customers’ increasing reliance on e-commerce and multimedia content after the outbreak of COVID-19, it has become crucial for companies to digitize their business methods and models. Consequently, COVID-19 has highlighted the prominence of e-commerce and new business models while disrupting conventional business activities. Hence, assessing and forecasting e-commerce growth is currently paramount for e-market planners, market players, and policymakers alike. This study sources data for the global e-commerce market leader, the US, and proposes an integrated framework that encompasses automated algorithms able to estimate six statistical and machine-learning univariate methods in order to accomplish two main tasks: (i) to produce accurate forecasts for e-commerce retail sales (e-sale) and the share of e-commerce in total retail sales (e-share); and (ii) to assess in quantitative terms the pandemic impact on the size and structure of the e-commerce retail sales sector. The results confirm that COVID-19 has significantly impacted the trend and structure of the US retail sales sector, producing cumulative excess (or abnormal) retail e-sales of $227.820 billion and a cumulative additional e-share of 10.61 percent. Additionally, estimations indicate a continuation of the increasing trend, with point estimates of $378.691 billion for US e-commerce retail sales that are projected to account for 16.72 percent of total US retail sales by the end of 2025. Nonetheless, the current findings also document that the growth of e-commerce is not a consequence of the COVID-19 crisis, but that the pandemic has accelerated the evolution of the e-commerce sector by at least five years. Overall, the study concludes that the shift towards e-commerce is permanent and, thus, governments (especially in developing countries) should prioritize policies aimed at harnessing e-commerce for sustainable development. Furthermore, in light of the research findings, digital transformation should constitute a top management priority for retail businesses.

1. Introduction

The COVID-19 pandemic has been labeled a black swan event [1] with far-reaching socio-economic consequences [2,3,4]. At the world level, 200 countries and territories have reported COVID-19 cases [5]. Consequently, closing substantial segments of the physical economy has promoted digitization by enhancing the use of digital channels, particularly e-commerce [6,7,8,9]. As such, unlike any other time in history, the e-commerce sector played a key role in society, allowing consumers to safely acquire goods through strict regulations aimed at safeguarding public health [10]. Consequently, in the context of a global economic downturn, e-commerce has surged while digital transformations have accelerated [11,12]. Several nations have experienced shifts towards e-commerce, particularly along the food supply chain. Examples include farmers who began using digital technologies to sell their produce directly to consumers or restaurants that began offering food or grocery delivery services [13]. Moreover, the pandemic has produced an excess consumer interest and subsequent growth in the global web and videoconferencing Software as a Service (SaaS) market that would not have occurred in the absence of the pandemic [14]. Concomitantly, the increase in digitization caused a disruption in traditional business methods such as supply chain management and distribution [15]. Companies that formerly focused on selling through offline channels expedited their digital transformation in order to respond to the fast-evolving commercial reality ([10,16,17]). Hence, the resulting shift from brick-and-mortar retail to e-commerce has spurred total online retail sales and increased the share of e-commerce in total retail sales ([13,18,19]), while total retail sales have decreased. Global e-commerce sales, including both business-to-business (B2B) and business-to-consumer (B2C) sales, increased to $26.7 trillion in 2019, up 4% from 2018, making for 30% of the 2020′s global gross domestic product (GDP) [20]. Additionally, global e-retail sales, comprising B2C sales, equaled $4.2 trillion in 2021 [21] and are expected to reach $5.42 trillion in 2022 and $6.38 trillion by 2024 [22]. The United States (US) leads the global e-commerce market, followed by Japan and China, whereas China, the US, and the United Kingdom are the top three countries in terms of B2C e-commerce sales [23].
Concurrently, accurate sales forecasting is of paramount importance in retailing, as retail businesses rely on sales predictions for various operational decisions [24], including decisions regarding planning production, purchasing, transportation, and workforce [25]. Accurate projections are especially critical for retail and consumer-oriented industries, namely the electronic market and the fashion business [26]. Furthermore, retail sales have long been recognized as a leading or coincident economic indicator [27,28,29,30], whereas e-commerce retail sales have been shown to have an even stronger impact on regional development [31]. Moreover, e-commerce has a documented mitigating effect on pollution, with estimates indicating that offline shopping produces between 1.5 and 2.9 times more greenhouse gas emissions than online shopping [32]. Thus, unsurprisingly, the e-commerce sector is regarded as a key driver for the circular economy [10] and robust estimates of its future evolution are key for effective and efficient policymaking.
Additionally, time series prediction is acknowledged as an important technology to promote industrial intelligence [33]. Consequently, e-market planners and other planning agencies in the information technology sector are increasingly recognizing the importance of assessing and forecasting e-commerce growth [34].
In light of the above considerations, a quantitative assessment of the impact of the pandemic on the e-commerce sector (and in particular its persistence) has emerged as timely research topic, with important economic and policy implications that have in turn motivated the current study. Considering previous findings that confirm the role of retail sales as an economic leading indicator, the univariate modeling approach, where data speak for themselves [35,36], stands as the most appropriate forecasting approach.
Three aspects define this paper’s primary contributions: (1) it develops an integrated framework for forecasting total e-commerce retail sales and the share of e-commerce in total retail sales that embeds an array of relevant statistical and machine-learning forecasting methods; (2) it assesses empirically the magnitude of the COVID-19 pandemic’s impact on the size and structure of the e-commerce retail sales sector; and (3) it investigates the short and long-run effects of the pandemic on this sector and thus explores whether the change in consumer behavior induced by the COVID-19 pandemic towards e-commerce is persistent.
To answer these research goals, long-term forecasts for total US e-commerce retail sales and the share of e-commerce in total US retail sales are generated by the model that outperforms in terms of out-of-sample predicting performance embedded in the framework. Results show that TBATS (exponential smoothing state space model with Box-Cox transformation, ARMA errors, trend, and seasonal components) can better capture the specificities of the e-sales time series, whereas both TBATS and STS (structural time series Model) show a similar superior ability for explaining and predicting e-share trends. Furthermore, new indicators reflecting excess (or abnormal) e-commerce retail sales (i.e., excess e-sales) and excess (or abnormal) share of e-commerce in total retail sales (i.e., excess e-share) are proposed and estimated through the integrated framework by the over-performing models. The findings indicate that the pandemic has accelerated the growth of the e-commerce sector in size and has expanded, above expectations, its weight in total retail sales. Consequently, the pandemic induced a total of $227.820 billion in excess e-sales and has increased the weight of e-commerce in total retail by 10.61 percent, thus accelerating its structural change. In the absence of the pandemic, these evolutions would not have been observed before 2026. Our estimations thus show that the unseen black swan event accelerated the evolution of the e-commerce sector by at least five years. Furthermore, research findings confirm that the COVID-19 pandemic will modify consumer choices in the long run toward e-commerce, as estimations indicate overall US e-commerce sales to increase through the end of 2025, reaching $378.691 billion in absolute terms, and corresponding to a share in the US retail sales sector of 16.72 percent by December 2025.
The remainder of the study is organized as follows. Section 2 provides a short overview of the related literature. Section 3 firstly covers the data and discusses its historical trends, then explains the method and presents the integrated framework proposed in this study, along with its main building blocks. Next, Section 4 presents the empirical results and checks their robustness. Section 5 proceeds with a discussion of findings, while Section 6 concludes the study and extracts some policy implications.

2. Literature Review

Producing accurate forecasting systems for real-world time series modeling is a challenging task [37]. Previous studies that attempt to forecast retail sales are based on forecasting methods that belong to one (or both) of the two main categories (i.e., “cultures”) delineated by [38], each with its strengths and weaknesses [39]: econometrics (i.e., statistical methodologies) and machine learning (self-learning systems that learn from past data to enhance their predictive performance) [40]. They are united by the common goals of providing, on the one hand, information about the data-generating process and, on the other hand, predictability for its future evolution [41]. Machine-learning techniques have the important advantage of being data-driven and needing minimal a priori assertions about the data-generating process [42,43,44] while being capable of capturing non-linearity that characterizes many empirical time series. This important characteristic contributed to making machine-learning techniques such as artificial neural networks (ANNs) very useful in many circumstances where no theoretical direction for an acceptable data generation process is available, and thus the default method of choice for modeling real-world phenomena [45,46].
However, both in practice and in research, forecasting models are usually chosen manually to account for various data-specific elements, such as autocorrelation, trend, seasonality, or exogenous variables [47]. In particular, retail sales series have been shown to typically contain trend and seasonal patterns, and thus pose increased difficulties in the development of reliable forecasting systems [25]. Given the particularities of retail sale data, previous studies tend to disagree in identifying the best-performing forecasting model, which further complicates the task. Some authors have found that statistical models are able to provide accurate forecasts of retail sales (among others, [25,48,49]) whereas others have identified machine-learning methods to out-perform ([50,51,52,53,54,55]). Thus, previous research endeavors have been characterized by mixed results in terms of the effectiveness of statistical methods compared with machine-learning methods to model and predict retail sales.
This study contributes to extending this strand of literature with new and robust findings based on the most recent available data. Additionally, it goes further than previous research by quantitatively assessing the impact of the COVID-19 pandemic on the e-commerce trend and structure and by providing reliable estimates for the future evolution of the sector, thus answering current open questions regarding the magnitude and persistence of this significant impact. As such, this study brings new evidence on the extent to which the pandemic has impacted the US e-commerce retail sales sector and explores whether these disruptions are reversible or persistent. In the process, it develops a forecasting instrument (i.e., the integrated framework), which is capable of improving the forecasting accuracy of e-commerce series.

3. Materials and Methods

3.1. Data

The research conducted in this paper employs quarterly data for the two main variables of interest (i.e., e-commerce retail sales [ECOMSA] and the share of e-commerce in total retail sales [ECOMPCTSA]) sourced from the Federal Reserve Bank of St. Louis’s (FRED) database, which gathers data from the U.S. Census Bureau. The sample period extends from October 1999 (i.e., 1999 Q3) to October 2021 (i.e., 2021 Q3) and therefore includes 88 quarterly observations for each e-commerce time series. The timeframe provides a sufficing sample that includes both tranquil periods, as well as the recent black swan event (i.e., the COVID-19 pandemic), through which the forecasting ability of a pool of predictive models (statistical and machine-learning) encompassed into an integrated forecasting framework can be tested and indicators of excess e-commerce retail sales can be estimated.
For a quick overview of historical evolutions in the retail sales sector, which also includes a sectorial perspective, we also make use of relevant data retrieved from the US Census Bureau database. As such, sector-level time series for Annual U.S. Retail Trade Sales (Total and E-commerce) covering the 1998–2015 period are sourced from Census for providing a deeper insight into historical trends and structure of the retail sector in the United States. This in turn contributes to identifying the most appropriate forecasting method for estimating excess e-sales/e-share that will be discussed further. To our knowledge, our analysis makes use of the most recent and complete available e-commerce retail sector data set for the leading global e-commerce market (i.e., the US market).
It is worth noticing that the US e-commerce retail sector had registered significant growth over the last decade, showing a more accelerated increase than total retail sales in the pre-pandemic era (Figure 1, panels a and b).
The driving engines behind retail sales growth have also been different. Thus, a sectorial perspective on the US market provided in Figure 1 reveals that the growth in total retail sales has been mainly driven by the motor vehicles and parts sector, whereas non-store retailers have been the driving force behind e-commerce retail sales.
However, visualization of the updated e-commerce series confirms that historical evolutions have been significantly disrupted by the COVID-19 pandemic [56]. Figure 2 and Figure 3 show that e-commerce retail sales and the share of e-commerce in total retail sales reached an initial peak in the second quarter of 2020 and, after a mild reversal, accelerated again from January 2021 and continued through the year.
The share of e-commerce in total retail sales followed a similar trend after the pandemic outbreak, registering a significant increasing trend over the first half of 2020. However, the two time-series diverge during 2021, when the share of e-commerce continued its downward trend that started in June 2020, whereas e-commerce retail sales reversed it. Hence, the share of e-commerce in total U.S. retail sales equals 13 percent in the third quarter of 2021, down from its peak level of 15.7 percent in the second quarter of 2020. On the other hand, retail e-commerce sales surpassed $214 billion in October 2021, whereas its highest historical level was registered in the previous quarter, reaching an all-time high value of $221.9 billion in the second quarter of 2021.
Overall, as of October 2021, e-commerce accounts for 13 percent of total retail sales in the United States, up from 0.6 percent in December 1999, whereas physical retail still accounts for 87 percent at the end of the analysis period. From 1999 Q4 to 2021 Q3, e-commerce has increased by $210.11 billion to a total of $214.586 billion, which translates into an impressive 4694% growth. Over the pandemic period, the growth of e-commerce has significantly accelerated. Relative to its 2019 Q4 level of $135.629 billion, e-commerce grew by 58.21%, and its penetration increased by 2%.
These historical trends, and especially the increased volatility induced by the COVID-19 pandemic, indicate that a single model may be unable of capturing efficiently all characteristics, evolutions, or available information hidden in the series and consequently can produce unreliable forecasts [57]. This in turn leads us to use a variety of statistical and machine-learning methods, both linear and nonlinear and analyze their performance from a comparative perspective. The best-performing model will then be used to produce forecasts of the variables of interest that will serve as proxies for the “business as usual” scenario for the e-commerce sector (i.e., without the black swan event that has disturbed its evolution).

3.2. Method

3.2.1. Integrated Framework

The integrated framework proposed in this research is reflected in Figure 4.
As depicted in Figure 4, the algorithm that is run within the integrated framework consecutively follows two distinct paths, which are delineated by different data splitting rules. Its main building blocks are presented in more detail in the following sub-sections.
The implementation of the integrated framework and all estimations are carried out in R software.

3.2.2. Data Splitting Rules and the Forecasting Technique

For forecasting purposes, each e-commerce time series of length N = 88 (from October (or Q3) 1999 to October (Q3) 2021) are separated into two subsets denoting a training period and a test period.
Firstly, to develop the first path (path A) followed by the algorithm on which the integrated framework depicted in Figure 4 is run, the data up to March (i.e., Q1) 2018 (representing 74 quarterly observations) are used in-sample for modeling and validation, and the data from 2018 Q2 to 2019 Q4 (i.e., 7 observations) are used to gauge how well the predictive models perform when forecasting beyond the sample. As the COVID-19 pandemic has significantly altered retail sales series trends and structure, we are especially interested in finding the best forecasting model over the test period or lead-time and subsequently using it for providing h steps ahead forecasts so as to cover the pandemic period ranging from January 2020 to October 2021 (i.e., 2020 Q1 to 2021 Q3).
Thus, in the first investigation (path A), the forecasting horizon h is set to 7 (so as to cover the COVID-19 pandemic forecasting horizon that spreads over seven quarters), and point-forecasts for the seven quarters of the forecasting horizon are produced within the integrated framework by the over-performing model. As a consequence, we issue the most precise forecasts obtainable for the expected e-commerce retail sales and for the share of e-commerce in total retail sales (excess e-share) from the period spanning from January 2020 to October 2021 (over the COVID-19 pandemic period) based on quarterly historical data from January 1999 to December 2019, that is, in the absence of the COVID-19 pandemic. Real e-commerce sector data is subsequently compared to point forecasts, giving estimates of abnormal (excess) e-commerce retail sales and corresponding abnormal share of e-commerce in total retail sales e-sales.
In a subsequent investigation (path B followed by the algorithm), we take a different data splitting approach. Consequently, the training window spreads over the first 81 observations in the sample (i.e., 91 percent of observations, covering the period 1999 Q4–2019 Q4), and a testing period spanning from 2020 Q1- 2021 Q3 (thus a length of 7). For each e-commerce time series, point forecasts are produced after the six predictive models have been fit on the newly specified training window and their respective out-of-sample predictive ability has been analyzed on the new test set data. This approach ensures the robustness of results while also providing estimates for the future evolution of the two e-commerce time series based on the most recent available data. Hence, point forecasts for the e-commerce retail sales and for the share of e-commerce in total retail sales issued through the integrated framework and produced by the over-performing model for each time series until Q4 2025 bring further evidence on whether the impact of the COVID-19 pandemic on the e-commerce sector is long-lasting (otherwise, a revolving trend should be expected).
Hence, to automatically implement the method, the Algorithm 1 performs the following tasks running through Path A:
Algorithm 1. Sequential steps for excess e-sales/e-share estimation
   INPUT: Call Quandl() to source FRED data (ECOMSA, ECOMPCTSA series)
   Step 1. Split each series: 74-7-7 rule (Train/Test/Forecast windows)
   Step 2. Model fiting on ECOMSA and ECOMPCTSA training windows
   2a.call functions HoltWInters(),StructTS(), auto.arima(), ets(),tbats(),nnetar()
   2b.establish corresponding fitness function
   2c. determine best-fit model specifications for ECOMSA and ECOMPCTSA (2 x 6 best-fit models, in-sample setting)
   2d. check the residuals of the 2 x 6 best-fit models from step 2c (ACF plots, PACF Plots, the Box-Ljung test);
   IF (white noise), go to
   Step 3. use step 2c models to produce out-of-sample forecasts over the test window for ECOMSA and ECOMPCTSA
   ELSE,
   repeat steps 2 a–d
   Step 4. Compute performance metrics (MAE, MAPE, MASE, RMSE, Theil’s U, ACF1) for the 2 x 6 best-fit models
   Step 5. Identify best out-of-sample predictive model for ECOMSA and ECOMPCTSA (2 x 1 best forecasting model)
   Step 6. Confirm superior forecasting ability for Step 5 models (call “DM.test”)
   Step 7. Re-estimate the Step 5 models on extended Train + Test window (i.e., pre-COVID-19 data, or 81 quarterly observations); retune model parameters; check residuals
   Step 8. Use updated model specifications from Step 7 to produce forecasts for ECOMSA and ECOMPCTSA up to Q3 2021
   Step 8. Estimate and print excess e-sales/e-share
   FINISH
Specifically, the following script snippet (Box 1) clarifies the extraction of FRED data (i.e., total ecommerce retail sales) via Quandl and the implementation of the data splitting rule for path A:
Box 1. Script for data sourcing and splitting strategy for path A
library(Quandl)
Quandl.api_key("…") #personal Quandl key here
esales <- Quandl("FRED/ECOMSA",
          type = "xts",
          collapse = "quarterly",
          start_date = "1999-10-01",
          end_date = "2022-09-01")
x <- ts(esales)
test_x <- window(x, start=c(75,1),end=c(81,1))#testing window
x <- window(x, end=c(74,1)) #training window
Alternatively, for path B, the algorithm follows a similar, albeit not identical, sequence for each of the two e-commerce series. Thus, in step one it applies a different splitting rule (i.e., 71-17-17) and consequently, during Step 7, it re-estimates the over-performing forecasting model identified in Step 5 on an extended and updated window of data covering the train and test sets and thus comprising the entire time-series data of 88 quarterly observations spanning from 1999 Q4 to 2021 Q3. Finally, it produces predicted values for a forecasting horizon going up to the end of 2025 and hence it does not estimate excess statistics in this path.

3.3. Excess (or Abnormal) E-Commerce Retail Sales (Excess E-Sales) and Excess Share of E-Commerce in Total Retail Sales (Excess E-Share)

In this article, similar to the approach of the Corporate Finance Institute (CFI) [58] and of the European Commission ([59]), excess or abnormal e-sales refers to the reported total value of retail trade e-commerce sales (or e-sales), above what could be anticipated under ‘normal’ conditions. We thus argue that this approach gives a general measure of the impact of the crisis on the e-commerce retail trade sales. The abnormal e-commerce sales statistic not only highlights the impact of the pandemic but also allows for further comparison of additional retail e-commerce sales induced by the pandemic among the world countries.
Excess (abnormal) e-sales statistic is thus expressed as the level of unanticipated e-sales in a month, compared with a “baseline” or benchmark level that represents expected e-sales over the time frame associated with the COVID-19 epidemic, in the absence of the pandemic, as in Equation (1):
y h = E S h E ^ S h
where E S h is the observed level of e-sales at the forecasting horizon h and E ^ S h is the expected or baseline level of e-sales at h under normal conditions. Although an alternative approach would be to use averages of last observations as a baseline level (see for example how the United Kingdom Office for National Statistics estimates excess deaths due to the pandemic), this method is less appropriate for data with trend or seasonality that characterizes retail sales series. Consequently, in this research the predicted level of e-sales at step h issued with the optimal forecasting model is employed as the benchmark. We thus comparatively explore the performance of six commonly employed univariate time series predictive models and generate reliable point-estimates for expected e-sales levels from January 2020 to October 2021 based on quarterly historical data from December 1999 to December 2019, that is, in the absence of the COVID-19 pandemic. As such, these estimates that proxy the evolution of the e-sales series during “normal” conditions are compared to the real evolution of the series over the seven quarters that correspond to the calendar pandemic period, and abnormal (or excess) statistics are computed.
In a similar manner, excess e-share is estimated as the percentage rate of unexpected additional share of e-commerce in total retail sales:
y s h a r e h = E S _ s h a r e h E ^ S _ s h a r e h
where E S _ s h a r e h is the observed share of e-commerce in total retail sales (e-share) at the forecasting horizon h and E ^ S _ s h a r e h is the expected/benchmark level of e-share at h under normal conditions.
The next subsection provides a background on the methods that are encompassed in the integrated framework for forecasting purposes.

3.4. Time Series Forecasting Models

We employ six univariate time series models that are embedded into the integrated framework to automatically model and predict two relevant e-commerce series: e-commerce retail sales and the share of e-commerce in total retail sales.
The HW (Holt-Winters) model developed by [60,61] is known as double exponential smoothing and is especially fit for trended and seasonal time series. As such, HW proved reliable for forecasting business data with seasonality, shifting trends, and seasonal correlation [62] and has thus been found to produce accurate short-term forecasts for sales or demand time-series data [63]. HW incorporates an exponential component ( E t ) as well as a trend component ( T t ), such that:
E t = w Y t + ( 1 w ) ( E t 1 + T t 1 ) , 0 < w < 1
and
T t = v ( E t E t 1 ) + ( 1 v ) T t 1 , 0 < v < 1
After the exponentially smoothed and trend components, E t and T t , are calculated for each observed value, the k-step-ahead forecast is produced by:
F t + k | t = E t + k T t
The forecasting framework developed in this study automatically estimates the HW model by using the function “HoltWinters” in the “stats” package [64] where an additive” (the default) model is specified and nine start periods are used in the autodetection of start values. The function performs HW filtering for each time series that is fed to the framework (i.e., e-sales and e-share); then, by minimizing the squared prediction error, the algorithm automatically discovers the parameters for the best in-sample HW model.
The exponential smoothing state space model (ETS) has received extensive attention in the recent years since [65] proposed the state-space formulation as an extension of the exponential smoothing (ES) classical method. Exponential smoothing considers each time series as a combination of three components, namely, the trend (T), seasonal (S) and error (E) components, whereas the trend component is at its turn a combination of a level term (l) and a growth term (b). None (N), additive (A), additive damped (Ad), multiplicative (M), or multiplicative damped (Md) trend and seasonal components are all possible [66]. The final model is a three-word string (Z,Z,Z), where the first letter indicates the state space model’s error assumption, the second the trend type, and the third the season type [67]. In this study, the integrated framework runs the “ets” function in R’s “forecast” package [68,69] to automatically fit ETS models and thus to automatically select the error, type and season. Then, the corrected Akaike information criterion (AICc) is applied within the framework to find optimal ETS parameters for each time series.
Box and Jenkins [70] pioneered autoregressive integrated moving average (ARIMA) models [71], which are now one of the most widely used statistical methods for linear time series forecasting [72], prominently employed in economic time series forecasting [73]. ARIMA models are capable of capturing both nonseasonal and seasonal patterns of data, which might make them particularly useful for retail sales forecasting purposes [25]. Following the terminology in [74], a seasonal model is written as ARIMA (p,q,d)(P,Q,D)s, where s denotes the seasonal period, while the lowercase and the capital letters represent respectively the number of nonseasonal and seasonal parameters for each of its components. In equation form, an ARIMA (p,d,q)(P,D,Q)s model is written as:
( 1 φ 1 B φ p B p ) ( 1 Φ 1 B s Φ P B s P ) ( 1 B ) d ( 1 B s ) D Y t = ( 1 θ 1 B θ q B q ) ( 1 Θ 1 B s Θ P B s Q ) ε t
where ε t is a random variable with mean zero and the standard deviation σ .
In this study, the integrated forecasting framework automatically trains ARIMA models with the “auto.arima function” (R “forecast” package). The function calculates unit root tests, along with reducing the AICc and MLE to identify parameters through a step-wise process, and determines whether the data used to train the model requires a seasonal differencing. Hence, the algorithm minimizes the corrected Akaike information criterion (AICc) as per Equation (7) to determine the values of p and q, and then identifies the optimal ARIMA model that represents the smallest AICc within the estimated specifications. Thus, it provides a significantly increased efficacy as it chooses to navigate the model space using a step-wise search rather than taking into account every conceivable pairing of p and q [75].
A I C c = A I C + 2 ( p + q + k + 1 ) ( p + q + k + 2 ) T p q k 2
where k = 1 if c ̸= 0 and 0 otherwise, L is the maximum likelihood of the data and the last item in parentheses is the number of parameters in the model (including the variance of the residuals σ2).
TBATS (the exponential smoothing state space model with Box-Cox transformation, ARMA errors, trend, and seasonal components) has been developed by [76]. Its main advantage lies in its capability to deal with complex seasonal behaviors of data [77]. The model representation is expressed as TBATS(ω, p, q, φ, {m1, k1}, {m2, k2}, …,{mT, kT}), where ω denotes the Box-Cox transformation, k is the quantity of harmonics used to represent the seasonal attribute, and φ denotes the dampening parameter. The framework uses the “tbats” function from the “forecast” package to automatically fit TBATS models and choose the best parameters based on AIC.
The structural time series (STS) method [78] asserts a time-series is made up of a range of unobserved components, such as (deterministic and stochastic) trend, seasonal, regression elements, and disturbance terms, all of which can be independently modeled, allowing for the study of their evolution through time and a clear understanding of their contribution to the final predictions [79].
A generalized expression of the decomposition of a time series is given by Equation (8), such that:
y t = μ t + ψ t + γ t + ε t , t = 1 , , T
where μ t is the trend, ψ is the cycle, γ t is the seasonal, and ε t is the irregular white noise component. All of the four components are stochastic and are driven by mutually uncorrelated disturbances [80]. The framework developed in this study to predict e-commerce retail sales embeds the “StructTS” function in R’s “stats” package that uses maximum likelihood to determine the model’s parameters.
Artificial neural networks (ANNs) are inspired by biological systems, particularly by the way biological nervous systems process information [42]). ANNs have been proven capable of modeling complex real-world systems by fully taking nonlinearities into account [81]. The basic elements that describe an ANN are the nodes or the basic processing elements of the network, the network architecture reflecting the connections between nodes; and the training algorithm used to find values of the network parameters for performing a particular task [82]. The type of ANN where each layer of nodes receives inputs from the previous layer is called multi-layered feed-forward network or multilayer perceptron (MLP). The models are referred to as feed-forward since there are no feedback connections so that predictions issued by the model (i.e., outputs) are not fed back into itself [83]. Many different ANN models have been proposed in the literature, depending on the nature of the task assigned to the network, as there are also numerous variations in modeling the nodes [84]. Lagged values from time series data are frequently utilized as inputs in an ANN structure, which is thus defined as neural network autoregression (NNAR) when applied to time series data [85]. NNAR models can be defined as NNAR (p,P,k)m for seasonal data, where m is the seasonal period, p denotes non-seasonally lagged inputs for the linear AR process, P are seasonal lags for the AR process, and k denotes the number of hidden layer nodes.
The neural network autoregression model can be written as:
Y = f ( H ) = f ( W X + B ) , X = [ y ( t 1 ) , y ( t 2 ) , , y ( t p ) ]
where Y is the output vector of predicted values, f is the activation function, H is the vector of n nodes in the hidden layer, W represents the weight matrix between the input and the hidden layers, X is the vector of inputs containing the lagged values of the observed data and B is a bias vector [86]. Hence, the first layer receives the lagged values of the time series representing first e-commerce retail sales and then the share of retail as inputs, then, the weighted inputs are combined linearly and fed-forward to the hidden layer where the activation function is applied. The result is finally sent to the output layer of the network. The integrated framework in this study automatically fits multi-layer feed-forward neural networks with a single hidden layer, k nodes, and p-lagged inputs using the “nnetar” function in the R “forecast” package. To automatically determine the parameters p and P, the method performs 25 iterations while minimizing AIC. Additionally, the hidden layer’s node count is estimated is k = (p + P + 1)/2.
It should be mentioned that several criteria must be met for robust time series forecasting, including employing appropriate test and training sets, performing ex-ante validations, comparing projections to established models, and cross-validation ([74,87,88]). First, this study accounts for cross-validation by analyzing two training samples, in path A and path B, respectively. Second, the training and testing windows have adequate lengths, whereas the use of out-of-sample testing windows meets the ex-ante validation criterion. Finally, the comparative perspective is adopted by evaluating the forecasting ability of six statistical and machine-learning models, including well-established time-series models, among which NNAR and TBATS can handle nonlinear features in time series.

3.5. Forecast Error Measures

A model’s predictive accuracy is determined by estimating its forecast errors by comparing the actual and predicted values. Thus, the forecast error equals:
e T + h = y T + h y ^ T + h | T
with {y1,…,yT} denoting the training set data and {yT + 1,yT + 2,…} the test set data.
For the purpose of analyzing the prediction accuracy of the model, in this study we estimate the following forecasting accuracy metrics: mean absolute error (MAE), root mean squared error (RMSE), mean absolute percentage error (MAPE), mean absolute scaled error (MASE), Theil’s U statistic, and Lag 1 autocorrelation of error (ACF1). Estimation equations for all measures are given in Appendix A.2, whereas further details on these accuracy measures and their interpretation are provided by [74,89].

3.6. Further Robustness Checks: Tests for Superior Forecasting Performance

The Diebold and Mariano test (DM) is estimated to compare model performance at each forecasting horizon employed in the empirical investigations. In order to assess the null hypothesis of equal prediction performance between two competing models, the test developed by [90] and its extended version by [91] has been broadly applied in the literature [92,93]. The DM test statistic is applicable to forecast errors that are contemporaneously correlated, serially correlated, and nonnormal [94]. It is assumed that the loss connected to forecast i is a function of the forecast error, eit, indicated by g(eit), which is typically the square or the absolute value of eit. The loss difference between the two forecasts is further assessed to be:
d t = g ( e 1 t ) g ( e 2 t )
and the following is the null hypothesis to be evaluated:
H O : E ( d t ) = 0 , t
Overall, despite contesters, DM test remains useful for informing on the comparative historical predictive performance [95].
In this research, the integrated framework estimates the DM test via the “dm.test” function included in the “forecast” package of the R environment.

4. Results

The accuracy metrics for the six statistical and machine-learning models’ out-of-sample predicting performance embedded into the integrated framework and employed to predict the evolution of the two e-sales variables are reported in Table 1 and Table 2. Results show that the exponential smoothing state space model with Box-Cox transformation, ARMA errors, trend, and seasonal components (TBATS) model can provide the best forecast for e-sales at a horizon of seven quarters, while both the TBATS and STS models out-perform when forecasting e-share over the April 2017–December 2019 period. Note that all models exhibit sub-unitary Theil’s U statistics. in path A except for NNAR, meaning that all given predictions outperform the naive technique in path A of the algorithm. This is explained by the constantly increasing trend in the two variables over the testing period that is missed by the naïve forecast, which always predicts a flat line. For path B, accuracy measures are significantly higher than for path A, indicating that none of the models has captured the turbulent evolution of the e-commerce series. However, the TBATS and the STS model register similar performances outside the sample for both series in the second scenario (path B).
Of note, the DM test of equal forecast accuracy between two competing models (i.e., the best and second-best performers on the alternate test data) confirms that TBATS is over-performing in predicting e-commerce retail sales in both paths A and B within the array of six predictive models. For the e-share series, the test of superior forecasting ability lacks statistical significance in path A and thus indicates that there is no clear distinction between the predictive ability of the two best performers (i.e., STS and TBATS), whereas in path B the superior forecasting ability of TBATS is again confirmed. Figure 5 and Figure 6 contain scatterplots showing predicted vs. actual values for the two e-commerce series given by the over-performing models (i.e., TBATS) and, for comparative purposes, by the neural network models. All charts indicate that TBATS has been able to forecast the evolution of both series fairly well (adjusted R2 = 97.62% and RMSE = 1317.1 for e-sales and adjusted R2 = 98.34% and RMSE = 0.0634 for e-share), whereas NNAR often underestimates its projections (adjusted R2 = 94.63% and RMSE = 1990.8 for e-sales and adjusted R2 = 96.85% and RMSE = 0.0883 for e-share). In all plots, the regression line between predicted vs. actual values, as well as the perfect-fit line are also drawn, attesting to the superior forecasting ability of TBATS. Moreover, Appendix A.1 also reflects the excellent out-of-sample forecasting ability of TBATS over the pre-pandemic testing period (path A) for the two e-commerce time series.
Results indicate that the difference between all accuracy metrics for the TBATS models and the neural network auto-regression model (NNAR) model is very high for both series and all test-sets, with the TBATS model significantly outperforming the neural network in every case.
TBATS therefore emerges as the overall best option when choosing to rely on a single predictive model for e-sales and e-share at the chosen horizon. However, NNAR is unable to offer competitive predictions for the evolution of the two e-commerce series (i.e., e-sales and e-share) across different test periods.
We next produce the expected e-sales and e-share values for the next 7 quarters corresponding to the pandemic period (from January 2020 to October 2021) in the absence of the COVID-19 pandemic (or during normal conditions). For this task, the best-performing models over the test period are selected. Consequently, point estimates for e-sales and e-share over the seven quarters that correspond to the pandemic period are issued through the TBATS model. Finally, abnormal values for the two variables are estimated by Equations (1) and (2) and reported. Table 3 centralizes all results for the e-sales variable, whereas the corresponding results for the e-share series are reported in Table 4.
Of note, estimations corresponding to the pandemic period indicate a continuation of the increasing trends both for e-commerce retail sales as well as for its penetration rate (i.e., the share of e-commerce in total retail sales). Hence, while the COVID-19 has significantly accelerated the rhythm of growth of the e-commerce sector, it does not necessarily mean that this increase was entirely caused by the pandemic.
We report an excess of e-commerce retail sales of $43.9 billion in 2020 Q2, and an excess of 4.2 percent for the share of e-commerce in total retail sales for the same time frame. As can be seen, in this first wave the pandemic spurred e-commerce and induced high levels of excess e-sales and e-share, while in the second year of the pandemic e-commerce retail sales registered almost similar excess levels, while the share of e-commerce saw much more modest increases as compared to the first wave of the pandemic. This further implies that the structural transformation of the retail sales sector caused by the Black-Swan event came to a halt after the first half of 2020.
In the last stage of this investigation, the best-performing predictive model for each e-commerce series in terms of out-of-sample forecasting ability is fitted to the entire series and further employed to produce point forecasts for the 2021 Q4–2025 Q4 period (i.e., with a forecast horizon h = 17).
Table 5 contains the estimated values for the two series. Results indicate that the current increasing trend will continue through the end of 2025.

5. Discussion

The integrated framework developed in this study is employed for two alternative empirical investigations. As such, the algorithm consecutively runs on two distinct paths. In path A, the algorithm is instructed to train all models on the pre-pandemic window and produce forecasts over the test data corresponding to the seven quarters of the pandemic period, spanning from 2020 Q1 to 2021 Q3. Predicted values are used as proxies for the normal sector evolution in the absence of the pandemic. Then, excess indicators are estimated to offer a quantitative assessment of the COVID-19 impact on the US e-commerce sector. The results indicate abnormal e-commerce retail sales totaling $227.820 billion and a cumulative excess increase of 10.61 percent in the share of e-commerce in retail sales over the pandemic period spanning from January 2020 to September 2021 Q3. Another significant finding is that the growing tendency would have continued even if the black swan event had not occurred, albeit the pandemic has significantly accelerated its evolution.
Subsequently, in path B, the algorithm uses the most recent test data to issue forecasts of the two series for the next five years. Estimations imply that the upward trend will continue both in size and weight of the e-commerce sector. Total e-commerce retail sales are thus predicted to grow by 76.47% over the November 2021-December 2025 period and to thus reach $378.691 billion, accounting for an estimated 16.72 percent of US retail sales by the end of 2025. This in turn corresponds to a further 3.72% increase in the share of e-commerce in total retail sales over the forecasting horizon, which confirms the definitive change in consumer preferences.

5.1. E-Commerce Trends and the COVID-19 Impact

Overall, research findings reinforce previous results in the literature, showing that e-commerce, enabled by computer and internet technologies, has registered an increasing trend over the past decades [96] and has surged following the outbreak of the COVID-19 pandemic ([13,18,19,56]). However, it also confirms that the pandemic did not cause, albeit it has strongly accelerated, a steadily increasing trend for both e-commerce variables [97]. Furthermore, this study brings further evidence that the structure of the e-commerce sector in the United States has been constantly evolving in the past decade, as [98] acknowledges, with non-store retailers driving the growth of e-commerce retail sales and the increase in e-commerce penetration rate.
In quantitative terms, research findings assess the impact of the COVID-19 pandemic as $227.820 billion excess sales over the pandemic period spanning from 2020 Q1 to 2021 Q3 and a cumulative excess increase of 10.61 percent in the share of e-commerce in retail sales as compared to expected values in the absence of the black swan event. While a continuation of the increasing trend was expected, none of the models could accurately capture the strong trends induced by the pandemic and most of the excess gains in the e-commerce variables are concentrated in the second quarter of 2020. These estimates thus support prior findings that the pandemic-induced health and economic crisis has accelerated the usage of digital channels, particularly e-commerce [6,7,8,9].
Results also show that the increasing trend is expected to continue by the end of the forecasting horizon (i.e., 2025 Q4). Thus, we forecast US retail e-commerce sales to grow 76.47% over the November 2021-December 2025 period, reaching $378.691 billion by the end of 2025. E-commerce share in total retail sales is expected to increase by 3.72%, as e-commerce will account for an estimated 16.72 percent of US retail sales by the end of the forecasting horizon.
This brings further proof to confirm an aforementioned finding that the growth of e-commerce is not a result of the COVID-19 crisis (although it is the case that the pandemic has accelerated it), resulting in a shift in consumer behavior and spurring e-commerce retail sales. Official sector-level statistics on the level of the e-sales during the pandemic are not yet available, but we expect previous distinct trends across sectors to have continued, and even to have deepened. Our findings also agree with [13,22], showing that after B2C e-commerce surged during the pandemic, this trend is expected to last in the aftermath of the pandemic owing to the permanent shift of retail sales online in the vast majority of cases. Our projections however surpass Statista’s predictions cited by [99], which expect the North American e-commerce sector to grow by 35% over the next five years.

5.2. Characteristics of E-Commerce Time Series

Moreover, this work adds to the evidence presented by [25], while also demonstrating that e-commerce retail sales time series are characterized by trend and seasonal components that should be accounted for in producing accurate forecasts. The study demonstrates that a TBATS model, when properly calibrated, can accurately forecast the evolution of the two e-commerce series. The ability of the TBATS model to adapt to the historical data results in better forecasting performance on alternative test data windows (i.e., paths A and B of the algorithm that is run within the integrated platform) compared to other competing models (both linear and non-linear) employed in this research (i.e., ETS, HW, ARIMA, STS, and NNAR). Estimations find that TBATS is over-performing in forecasting both e-commerce time-series and across different testing set data, confirming its superior ability to handle typical nonlinear data features, while considering any autocorrelation in the residuals and multiple seasonal components [76]. In general, we thus conclude that TBATS models can forecast the trend movement and seasonal fluctuations fairly well, and back [100]) and [101] by showing that individual nonlinear models perform better than linear models in forecasting aggregate retail sales. This is also in line with the results of [67] that arrives at the same conclusion in estimating the impact of the pandemic on a time series that has registered a similar evolution after January 2020 (i.e., excess hospitalizations caused by the COVID-19 pandemic). However, these findings deviate from [51] that identify the neural network model as best for aggregate retail sales forecasting, whereas trigonometric models are not found to be useful for the task in this prior study. Our divergent results that rank NNAR as one of the worst-performing models for forecasting e-commerce series and thus unable to capture seasonal or trend variations effectively could be explained by the fact that we did not pre-process the data through de-trending or de-seasonalization, which has been shown to dramatically reduce forecasting errors of neural network models [102].

5.3. Policy Implications

In light of these projections and given previous studies that acknowledge e-commerce as a pollution-mitigating factor [32] and driver of the circular economy [10], we argue that policies that encourage digital transformation and innovation, particularly for small and medium-sized businesses, should be prioritized. We thus also support the conclusion of [103] and agree that digital transformation should constitute a top management priority.
Hence, future predictions for the full path of the series of e-commerce retail sales and for the share of e-commerce in retail sales up to the end of 2025 that are issued through the integrated platform are paramount not only in decision-making processes at the company level ([24,25,33,104,105]) but also for policymakers that require forecasts at different horizons to make informed decisions [106].

6. Conclusions

Accurate retail sales forecasting is paramount for retail business management, influencing a range of operational decisions, as businesses must continually adapt to changing circumstances in today’s competitive business environment. Consequently, producing accurate forecasts of retail sales has become equivalent to acquiring or maintaining business success.
The COVID-19 pandemic has significantly impacted the retail sales sector, expanding the scope of e-commerce and accelerating its expansion. Consequently, as retail sales have plummeted following the pandemic outbreak, e-commerce has spurred. The increasing trend of the e-commerce sector is expected to continue, as some of its changing circumstances will most likely be long-term, given the potential for new pandemic waves, the efficiency of the new consumer behavior, learning expenses, and the urge for entrepreneurs to rip the benefits of investing in digital retail channels [13].
Furthermore, it has long been recognized that retail sales are an established leading indicator for important macro-economic variables, whereas e-commerce is also a mitigating factor for polluting emissions and a driver for the circular economy. Consequently, retail sales and in particular e-commerce are significantly influencing policymaking processes.
Hence, owing to the aforementioned factors, interest has widely shifted from forecasting retail sales toward forecasting e-commerce sector indicators.
However, retail sales time series (including e-commerce) often exhibit strong trend and seasonal variations presenting challenges in developing effective forecasting models. Consequently, previous studies disagree on the best model for this task, and accurately modeling and forecasting retail sales series remains an outstanding research topic. Considering the stronger trends and fluctuations encountered in e-commerce series as compared to retail sales, the task of accurately capturing the specificities of e-commerce variables and producing accurate forecasts is that much more challenging.
This paper contributes to e-commerce research by suggesting an integrated approach to model and predict its evolution. The main goal of this study is thus to develop an integrated framework to forecast two e-commerce time series: e-commerce retail sales and the share of e-commerce in total retail sales. The framework embeds an algorithm capable of efficiently estimating six statistical and machine-learning methods (i.e., ETS, HW, ARIMA, STS, TBATS, and NNAR). The algorithm automatically searches for the best in-sample and out-of-sample model parameters. As such, it applies various fitness functions (AIC, AICc, ML) to all model specifications that pass the diagnostic checking on the training window to identify the best in-sample parameters for each of the six models. Then, the optimal six model specifications selected in-sample through the specific fitness function are employed for forecasting on the out-of-sample window. Multiple model accuracy measures, both scaled and scale-free (i.e., MAE, MAPE, MASE, RMSE, Theil’s U, ACF1) are subsequently estimated to assess the predictive ability of the six best (in-sample) model specifications. Finally, the optimal (out-of-sample) forecasting model for each series and each testing window automatically emerges as the one that minimizes forecasts errors. Results indicate that the overall out-of-sample forecasting performance of the TBATS model is best for all circumstances.
As far as we know, this is the first time that this approach is proposed for retail sales forecasting (including e-commerce), whereas a quantitative assessment of the COVID-19 impact on the e-commerce sector through the estimation of its excess (or abnormal) evolution constitutes another novelty. Overall, we argue that the framework proposed in this paper can be a useful managerial tool in decision making, owing to its efficiency, flexibility, and prediction accuracy compared to the state-of-the-art approaches. Results of this research further indicate that on one hand governments should prioritize policies that encourage digital transformation and innovation, and support business adaptation, particularly for small and medium-sized enterprises whereas, on the other hand, digital transformation should constitute a top management priority. The integrated framework capable to produce forecasts of retail sales time series, and assessing abnormal evolutions, or the magnitude of the impact of black swan events, is thus applicable both for retailers and policymakers.

Funding

This research received no external funding.

Data Availability Statement

Data employed in the empirical investigations are publicly available on the Federal Reserve Bank of St. Louis (FRED) website.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1

Figure A1 reflects the forecasted trend produced by the over-performing predictive model (i.e., TBATS) for e-commerce retail sales (panel a) and e-commerce retail share (panel b) over the testing window for path A of the algorithm.
Figure A1. (a) Forecasted trend for e-commerce retail sales over April 2018–January 2020 issued with TBATS. Source: estimation results. (b) Forecasted trend for e-commerce share in total retail sales over April 2018–January 2020 issued with TBATS. Source: estimation results. Source: estimation results.
Figure A1. (a) Forecasted trend for e-commerce retail sales over April 2018–January 2020 issued with TBATS. Source: estimation results. (b) Forecasted trend for e-commerce share in total retail sales over April 2018–January 2020 issued with TBATS. Source: estimation results. Source: estimation results.
Electronics 11 03194 g0a1aElectronics 11 03194 g0a1b

Appendix A.2. Accuracy Measures

Mean absolute error:
M A E = 1 N i = 1 N | y i y ^ l |
Root mean squared error:
R M S E = 1 N i = 1 N ( y i y ^ l ) 2
Mean absolute percentage error:
M A P E = m e a n ( | p t | )
where: p t = 100 e t y t .
Mean absolute scaled error:
MASE = mean ( | q j | )
where qj is independent of the scale of the data and is defined as: q j = e t 1 N 1 i = 2 N | y t y t 1 | for non-seasonal series and as: q j = e t 1 N m i = m + 1 N | y t y t m | for seasonal time series.
Theil’s U statistic:
T h e i l s U = R M S E 1 R M S E 2
where RMSE2 is the RMSE of the naive technique, which always produces a flat line, and RMSE1 is the RMSE metric obtained for the main model; Theil’s U will take values between o and 1, with zero indicating a perfect prediction [107].
Lag 1 Autocorrelation of Error:
A C F 1 = γ ^ ( 1 ) γ ^ ( 0 )
where γ is the sample autocovariance function.

References

  1. Deloitte. Resilient Leadership Responding to COVID-19, Insights. 2020. Available online: https://www2.deloitte.com/global/en/insights/economy/covid-19/heart-of-resilient-leadership-responding-to-covid-19.html (accessed on 26 January 2022).
  2. Maria, N.; Zaid, A.; Catrin, S.; Ahmed, K.; Ahmed, A.J.; Christos, I.; Maliha, A.; Riaz, A. The socio-economic implications of the coronavirus pandemic (COVID-19): A review. Int. J. Surg. 2020, 78, 185–193. [Google Scholar]
  3. Erokhin, V.; Gao, T. Impacts of COVID-19 on trade and economic aspects of food security: Evidence from 45 developing countries. Int. J. Environ. Res. Public Health 2020, 17, 5775. [Google Scholar] [CrossRef] [PubMed]
  4. Tudor, C.; Sova, R. Infodemiological study on the impact of the COVID-19 pandemic on increased headache incidences at the world level. Sci. Rep. 2022, 12, 10253. [Google Scholar] [CrossRef] [PubMed]
  5. Raza, S.; Schwartz, B.; Rosella, L.C. CoQUAD: A COVID-19 question answering dataset system, facilitating research, benchmarking, and practice. BMC Bioinform. 2022, 23, 1–28. [Google Scholar] [CrossRef] [PubMed]
  6. Fletcher, G.; Griffiths, M. Digital transformation during a lockdown. Int. J. Inf. Manag. 2020, 55, 102185. [Google Scholar] [CrossRef] [PubMed]
  7. Nagel, L. The influence of the COVID-19 pandemic on the digital transformation of work. Int. J. Soc. Soc. Pol. 2020, 40, 861–875. [Google Scholar] [CrossRef]
  8. Priyono, A.; Moin, A.; Oktaviani Putri VNA. Identifying digital transformation paths in the business model of SMEs during the COVID-19 pandemic. J. Open Innov. Technol. Mark. Complex. 2020, 6, 104. [Google Scholar] [CrossRef]
  9. Döhring, B.; Hristov, A.; Maier, C.; Roeger, W.; Thum-Thysen, A. COVID-19 acceleration in digitalisation, aggregate productivity growth and the functional income distribution. Int. Econ. Econ. Policy 2021, 18, 571–604. [Google Scholar] [CrossRef]
  10. Ecommerce Europe, 2021, 2021 European E-Commerce Report. Available online: https://ecommerce-europe.eu/wp-content/uploads/2021/09/2021-European-E-commerce-Report-LIGHT-VERSION.pdf (accessed on 26 February 2022).
  11. UNCTAD. How COVID-19 Triggered the Digital and E-Commerce Turning Point. 2021. Available online: https://unctad.org/news/how-covid-19-triggered-digital-and-e-commerce-turning-point (accessed on 26 January 2022).
  12. Insider Intelligence. 2022. Available online: https://www.insiderintelligence.com/insights/ecommerce-industry-statistics/ (accessed on 7 February 2022).
  13. OECD. E-Commerce in the Time of COVID-19. 2020. Available online: https://www.oecd.org/coronavirus/policy-responses/e-commerce-in-the-time-of-covid-19-3a2b78e8/ (accessed on 7 September 2022).
  14. Tudor, C. The Impact of the COVID-19 Pandemic on the Global Web and Video Conferencing SaaS Market. Electronics 2022, 11, 2633. [Google Scholar] [CrossRef]
  15. Rapaccini M, Saccani N, Kowalkowski C, Paiola M, Adrodegari F Navigating disruptive crises through service-led growth: The impact of COVID-19 on Italian manufacturing firms. Ind. Mark. Manag. 2020, 88, 225–237. [CrossRef]
  16. McKinsey. The Next Normal, The Recovery Will Be Digital. 2020. Available online: https://www.mckinsey.com/~/media/mckinsey/business%20functions/mckinsey%20digital/our%20insights/how%20six%20companies%20are%20using%20technology%20and%20data%20to%20transform%20themselves/the-next-normal-the-recovery-will-be-digital.pdf (accessed on 6 September 2022).
  17. Vieira, M.; Malaquias, T.; Ketsmur, V.; Ketsmur, V.; Au-Yong-Oliveira, M. Towards a More Sustainable E-commerce in the Post-COVID-19 Recovery of Portuguese Companies. In International Conference on Advanced Research in Technologies, Information, Innovation and Sustainability; Springer: Cham, Switzerland, 2021; pp. 600–613. [Google Scholar]
  18. Piccinini, D.; Giunchi, C.; Olivieri, M.; Frattini, F.; Di Giovanni, M.; Prodi, G.; Chiarabba, C. COVID-19 lockdown and its latency in Northern Italy: Seismic evidence and socio-economic interpretation. Sci. Rep. 2020, 10, 16487. [Google Scholar] [CrossRef]
  19. European Commission. Online Sales Continue to Grow Among EU Enterprises. 2021. Available online: https://ec.europa.eu/eurostat/web/products-eurostat-news/-/ddn-20211228-1 (accessed on 26 January 2022).
  20. UNCTAD. Global E-Commerce Jumps to $26.7 Trillion, COVID-19 Boosts Online Sales. 2021. Available online: https://unctad.org/news/global-e-commerce-jumps-267-trillion-covid-19-boosts-online-sales (accessed on 26 January 2022).
  21. Forbes. Global E-Commerce Sales To Hit $4.2 Trillion As Online Surge Continues, Adobe Reports. 2021. Available online: https://www.forbes.com/sites/joanverdon/2021/04/27/global-ecommerce-sales-to-hit-42-trillion-as-online-surge-continues-adobe-reports/?sh=268030f350fd (accessed on 26 January 2022).
  22. Statista. Retail E-Commerce Sales Worldwide from 2014 to 2024. 2022. Available online: https://www.statista.com/statistics/379046/worldwide-retail-e-commerce-sales/ (accessed on 26 January 2022).
  23. British International Freight Association (BIFA). Global E-Commerce Jumps to $26.7 Trillion. 2021. Available online: https://www.bifa.org/news/articles/2021/may/global-e-commerce-jumps-to-267-trillion (accessed on 26 January 2022).
  24. Huber, J.; Stuckenschmidt, H. Daily retail demand forecasting using machine learning with emphasis on calendric special days. Int. J. Forecast. 2020, 36, 1420–1438. [Google Scholar] [CrossRef]
  25. Ramos, P.; Santos, N.; Rebelo, R. Performance of state space and ARIMA models for consumer retail sales forecasting. Robot. Comput.-Integr. Manuf. 2015, 34, 151–163. [Google Scholar] [CrossRef] [Green Version]
  26. Beheshti-Kashi, S.; Karimi, H.R.; Thoben, K.D.; Lütjen, M.; Teucke, M. A survey on retail sales forecasting and prediction in fashion markets. Syst. Sci. Control. Eng. 2015, 3, 154–161. [Google Scholar] [CrossRef]
  27. Diron, M. Short-term forecasts of euro area real GDP growth: An assessment of real-time performance based on vintage data. J. Forecast. 2008, 27, 371–390. [Google Scholar] [CrossRef] [Green Version]
  28. Turner, L.W.; Witt, S.F. Factors influencing demand for international tourism: Tourism demand analysis using structural equation modelling, revisited. Tour. Econ. 2001, 7, 21–38. [Google Scholar] [CrossRef]
  29. Joseph, A.; Larrain, M. Housing Starts Forecast of Retail Sales through the 2007-2009 Recession. Procedia Comput. Sci. 2012, 12, 271–275. [Google Scholar] [CrossRef] [Green Version]
  30. Deloitte. Retail Sales: Trends in Revenue and Employment. 2018. Available online: https://www2.deloitte.com/us/en/insights/economy/behind-the-numbers/changes-in-retail-sales-economic-impact.html (accessed on 27 January 2022).
  31. Markhaichuk, M. Impact of online retail on economic development in Russian regions. In Innovation Management, Entrepreneurship and Sustainability (IMES 2018); Vysoká škola ekonomická v Praze: Prague, Czech Republic, 2018; pp. 635–644. [Google Scholar]
  32. Wyman, O. Is E-Commerce Good For Europe? 2021. Available online: https://www.oliverwyman.com/content/dam/oliver-wyman/v2/publications/2021/apr/is-ecommerce-good-for-europe.pdf (accessed on 7 February 2022).
  33. Yang, Y.; Fan, C.; Xiong, H. A novel general-purpose hybrid model for time series forecasting. Appl. Intell. 2021, 52, 2212–2223. [Google Scholar] [CrossRef] [PubMed]
  34. Mukhopadhyay, S.; Samaddar, S.; Nargundkar, S. Predicting Electronic Commerce Growth: An Integration Of Diffusion and Neural Network Models. J. Electron. Commer. Res. 2008, 9. [Google Scholar]
  35. Gujarati, D.N. Basic Econometrics, 4th ed.; McGraw-Hill: Singapore, 2003. [Google Scholar]
  36. Gujarati, D.N.; Porter, D.C.; Gunasekar, S. Basic Econometrics; Tata McGraw-Hill Education: Noida, India, 2012. [Google Scholar]
  37. Silva, E.G.; Neto, P.S.D.M.; Cavalcanti, G.D. A dynamic predictor selection method based on recent temporal windows for time series forecasting. IEEE Access 2021, 9, 108466–108479. [Google Scholar] [CrossRef]
  38. Breiman, L. Statistical modeling: The two cultures (with comments and a rejoinder by the author). Stat. Sci. 2001, 16, 199–231. [Google Scholar] [CrossRef]
  39. Miller, A.C.; Foti, N.J.; Fox, E.B. Breiman’s Two Cultures: You Don’t Have to Choose Sides. Obs. Stud. 2021, 7, 161–169. [Google Scholar] [CrossRef]
  40. Charpentier, A.; Flachaire, E.; Ly, A. Econometrics and machine learning. Econ. Et Stat. 2018, 505, 147–169. [Google Scholar] [CrossRef]
  41. Koehrsen, W. Thoughts on the Two Cultures of Statistical Modeling. 2019. Available online: https://towardsdatascience.com/thoughts-on-the-two-cultures-of-statistical-modeling-72d75a9e06c2 (accessed on 1 February 2022).
  42. Zhang, G.; Patuwo, B.E.; Hu, M.Y. Forecasting with artificial neural networks: The state of the art. Int. J. Forecast. 1998, 14, 35–62. [Google Scholar] [CrossRef]
  43. Kamruzzaman, J.; Sarker, R.A. Artificial neural networks: Applications in finance and manufacturing. In Artificial Neural Networks in Finance and Manufacturing; IGI Global: Hershey, PA, USA, 2006; pp. 1–27. [Google Scholar]
  44. IJ, H. Statistics versus machine learning. Nat. Methods 2018, 15, 233. [Google Scholar]
  45. Zao, Q. Statistical Modeling: Returning to Its Roots. 2022. Available online: http://www.statslab.cam.ac.uk/~qz280/post/breiman-reprint-2021/ (accessed on 1 February 2022).
  46. Adhikari, R.; Agrawal, R.K. An introductory study on time series modeling and forecasting. arXiv 2013, arXiv:1302.6613. [Google Scholar]
  47. Salinas, D.; Flunkert, V.; Gasthaus, J.; Januschowski, T. DeepAR: Probabilistic forecasting with autoregressive recurrent networks. Int. J. Forecast. 2020, 36, 1181–1191. [Google Scholar] [CrossRef]
  48. Kuvulmaz, J.; Usanmaz, S.; Engin, S.N. Time-series forecasting by means of linear and nonlinear models. In Mexican International Conference on Artificial Intelligence; Springer: Berlin/Heidelberg, German, 2005; pp. 504–513. [Google Scholar]
  49. Frank, C.; Garg, A.; Sztandera, L.; Raheja, A. Forecasting women’s apparel sales using mathematical modeling. Int. J. Cloth. Sci. Technol. 2013, 15, 107–125. [Google Scholar] [CrossRef] [Green Version]
  50. Alon, I.; Qi, M.; Sadowski, R.J. Forecasting aggregate retail sales: A comparison of artificial neural networks and traditional methods. J. Retail. Consum. Serv. 2001, 8, 147–156. [Google Scholar]
  51. Chu, C.W.; Zhang, G.P. A comparative study of linear and nonlinear models for aggregate retail sales forecasting. Int. J. Prod. Econ. 2003, 86, 217–231. [Google Scholar] [CrossRef]
  52. Au, K.F.; Choi, T.M.; Yu, Y. Fashion retail forecasting by evolutionary neural networks. Int. J. Prod. Econ. 2008, 114, 615–630. [Google Scholar] [CrossRef]
  53. Sun, Z.-L.; Choi, T.M.; Au, K.-F.; Yu, Y. Sales forecasting using extreme learning machine with applications in fashion retailing. Decis. Support Syst. 2008, 46, 411–419. [Google Scholar] [CrossRef]
  54. Thomassey, S. Sales forecasts in clothing industry: The key success factor of the supply chain management. Int. J. Prod. Econ. 2010, 128, 470–483. [Google Scholar] [CrossRef]
  55. Wong, W.K.; Guo, Z.X. A hybrid intelligent model for medium-term sales forecasting in fashion retail supply chains using extreme learning machine and harmony search algorithm. Int. J. Prod. Econ. 2010, 128, 614–624. [Google Scholar] [CrossRef]
  56. Office for National Statistics. Impact of the Coronavirus (COVID-19) Pandemic on Retail Sales in 2020. 2021. Available online: https://www.ons.gov.uk/economy/grossdomesticproductgdp/articles/impactofthecoronaviruscovid19pandemiconretailsalesin2020/2021-01-28 (accessed on 27 January 2022).
  57. Sardar, T.; Nadim, S.S.; Rana, S.; Chattopadhyay, J. Assessment of lockdown effect in some states and overall India: A predictive mathematical study on COVID-19 outbreak. Chaos Solitons Fractals 2020, 139, 110078. [Google Scholar] [CrossRef] [PubMed]
  58. Corporate Finance Institute (CFI), Abnormal Return. Available online: https://corporatefinanceinstitute.com/resources/knowledge/trading-investing/abnormal-return/ (accessed on 10 January 2022).
  59. European Commission. Excess Mortality—Statistics. 2022. Available online: https://ec.europa.eu/eurostat/statistics-explained/index.php?title=Excess_mortality_-_statistics&oldid=550516 (accessed on 4 February 2022).
  60. Holt, C.C. Forecasting seasonals and trends by exponentially weighted moving averages. ONR Res. Memo. 1959, 52, 5–10. [Google Scholar]
  61. Winters, P.R. Forecasting sales by exponentially weighted moving averages. Manag. Sci. 1960, 6, 324–342. [Google Scholar] [CrossRef]
  62. Gelper, S.; Fried, R.; Croux, C. Robust forecasting with exponential and Holt–Winters smoothing. J. Forecast. 2010, 29, 285–300. [Google Scholar]
  63. Chatfield, C.; Yar, M. Holt-Winters forecasting: Some practical issues. J. R. Stat. Soc. Ser. D 1988, 37, 129–140. [Google Scholar] [CrossRef]
  64. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2012; ISBN 3-900051-07-0. Available online: http://www.R.-project.org/ (accessed on 20 August 2022).
  65. Hyndman, R.J.; Koehler, A.B.; Snyder, R.D.; Grose, S. A state space framework for automatic forecasting using exponential smoothing methods. Int. J. Forecast. 2002, 18, 439–454. [Google Scholar] [CrossRef] [Green Version]
  66. Yang, D.; Sharma, V.; Ye, Z.; Lim, L.I.; Zhao, L.; Aryaputera, A.W. Forecasting of global horizontal irradiance by exponential smoothing, using decompositions. Energy 2015, 81, 111–119. [Google Scholar] [CrossRef] [Green Version]
  67. Perone, G. Comparison of ARIMA, ETS, NNAR, TBATS and hybrid models to forecast the second wave of COVID-19 hospitalizations in Italy. Eur. J. Health Econ. 2021, 23, 917–940. [Google Scholar] [CrossRef] [PubMed]
  68. Hyndman, R.J.; Khandakar, Y. Automatic time series forecasting: The forecast package for R. J. Stat. Softw. 2008, 26, 1–22. [Google Scholar] [CrossRef] [Green Version]
  69. Hyndman, R.; Athanasopoulos, G.; Bergmeir, C.; Caceres, G.; Chhay, L.; O’Hara-Wild, M.; Petropoulos, F.; Razbash, S.; Wang, E.; Yasmeen, F. Forecast: Forecasting Functions for Time Series and Linear Models. R package version 8.16. 2022. Available online: https://pkg.robjhyndman.com/forecast/ (accessed on 10 August 2022).
  70. Box, G.; Jenkins, G. Time Series Analysis: Forecasting and Control; Holden-Day: San Francisco, CA, USA, 1970. [Google Scholar]
  71. Box, G.E.P.; Jenkins, G.M.; Reinsel, G.C.; Ljung, G.M. Time Series Analysis: Forecasting and Control, 5th ed.; John Wiley & Sons: New York, NY, USA, 2015. [Google Scholar]
  72. Zhang, G.P. Time series forecasting using a hybrid ARIMA and neural network model. Neurocomputing 2003, 50, 159–175. [Google Scholar] [CrossRef]
  73. Wang, L.; Zou, H.; Su, J.; Li, L.; Chaudhry, S. An ARIMA-ANN hybrid model for time series forecasting. Syst. Res. Behav. Sci. 2013, 30, 244–259. [Google Scholar] [CrossRef]
  74. Hyndman, R.J.; Athanasopoulos, G. Forecasting: Principles and Practice, OTexts, Melbourne. 2018. Available online: OTexts.com/fpp2 (accessed on 10 August 2022).
  75. Silva, E.S.; Hassani, H.; Heravi, S.; Huang, X. Forecasting tourism demand with denoised neural networks. Ann. Tour. Res. 2019, 74, 134–154. [Google Scholar] [CrossRef]
  76. De Livera, A.M.; Hyndman, R.J.; Snyder, R.D. Forecasting time series with complex seasonal patterns using exponential smoothing. J. Am. Stat. Assoc. 2011, 106, 1513–1527. [Google Scholar] [CrossRef]
  77. Yu, C.; Xu, C.; Li, Y.; Yao, S.; Bai, Y.; Li, J.; Wang, L.; Wu, W.; Wang, Y. Time series analysis and forecasting of the hand-foot-mouth disease morbidity in China using an advanced exponential smoothing state space TBATS model. Infect. Drug Resist. 2021, 14, 2809. [Google Scholar] [CrossRef]
  78. Harvey, A.C. Forecasting Analysis of Time Series Models and the Kalman Filter; Cambridge University Press: Cambridge, UK, 1989. [Google Scholar]
  79. Lawson, A.R.; Ghosh, B.; Broderick, B. Prediction of traffic-related nitrogen oxides concentrations using Structural Time-Series models. Atmos. Environ. 2011, 45, 4719–4727. [Google Scholar] [CrossRef]
  80. Kapombe, C.M.; Colyer, D. A structural time series analysis of US broiler exports. Agric. Econ. 1999, 21, 295–307. [Google Scholar] [CrossRef]
  81. Pasini, A. Artificial neural networks for small dataset analysis. J. Thorac. Dis. 2015, 7, 953. [Google Scholar]
  82. Allende, H.; Moraga, C.; Salas, R. Artificial neural networks in time series forecasting: A comparative analysis. Kybernetika 2002, 38, 685–707. [Google Scholar]
  83. Gupta, T. Deep Learning: Feedforward Neural Network. 2017. Available online: https://towardsdatascience.com/deep-learning-feedforward-neural-network-26a6705dbdc7 (accessed on 1 February 2022).
  84. Hill, T.; Marquez, L.; O’Connor, M.; Remus, W. Artificial neural network models for forecasting and decision making. Int. J. Forecast. 1994, 10, 5–15. [Google Scholar] [CrossRef] [Green Version]
  85. Munim, Z.H.; Shakil, M.H.; Alon, I. Next-day bitcoin price forecast. J. Risk Financ. Manag. 2019, 12, 103. [Google Scholar] [CrossRef] [Green Version]
  86. Tudor, C.; Sova, R. Benchmarking GHG Emissions Forecasting Models for Global Climate Policy. Electronics 2021, 10, 3149. [Google Scholar] [CrossRef]
  87. Adya, M.; Collopy, F. How effective are neural networks at forecasting and prediction? A review and evaluation. J. Forecast. 1998, 17, 481–495. [Google Scholar] [CrossRef]
  88. Munim, Z.H. State-space TBATS model for container freight rate forecasting with improved accuracy. Marit. Transp. Res. 2022, 3, 100057. [Google Scholar] [CrossRef]
  89. Hyndman, R.J.; Koehler, A.B. Another look at measures of forecast accuracy. Int. J. Forecast. 2006, 22, 679–688. [Google Scholar] [CrossRef] [Green Version]
  90. Diebold, F.X.; Mariano, R.S. Comparing Predictive Accuracy. J. Bus. Econ. Stat. 1995, 13, 253–263. [Google Scholar]
  91. Harvey, D.; Leybourne, S.; Newbold, P. Testing the equality of prediction mean squared errors. Int. J. Forecast. 1997, 13, 281–291. [Google Scholar] [CrossRef]
  92. Hering, A.S.; Genton, M.G. Comparing spatial predictions. Technometrics 2011, 53, 414–425. [Google Scholar] [CrossRef]
  93. Gilleland, E.; Roux, G. A new approach to testing forecast predictive accuracy. Meteorol. Appl. 2015, 22, 534–543. [Google Scholar] [CrossRef]
  94. Coroneo, L.; Iacone, F. Comparing predictive accuracy in small samples using fixed-smoothing asymptotics. J. Appl. Econom. 2020, 35, 391–409. [Google Scholar] [CrossRef]
  95. Diebold, F.X. Comparing predictive accuracy, twenty years later: A personal perspective on the use and abuse of Diebold–Mariano tests. J. Bus. Econ. Stat. 2015, 33, 1. [Google Scholar] [CrossRef] [Green Version]
  96. Huang, Y.; Chai, Y.; Liu, Y.; Shen, J. Architecture of next-generation e-commerce platform. Tsinghua Sci. Technol. 2018, 24, 18–29. [Google Scholar] [CrossRef]
  97. COFACE. Has the Global Retail Sector Returned to Normal? 2021. Available online: https://www.coface.com/News-Publications/News/Has-the-global-retail-sector-returned-to-normal (accessed on 27 January 2022).
  98. Statista. E-Commerce in the United States—Statistics & Facts. 2022. Available online: https://www.statista.com/topics/2443/us-ecommerce/ (accessed on 4 February 2022).
  99. World Economic Forum. These Are the World’s Biggest E-Commerce Markets. 2021. Available online: https://www.weforum.org/agenda/2021/09/e-commerce-fast-growth-online-sales-china-europe-america/ (accessed on 4 February 2022).
  100. Aye, G.C.; Balcilar, M.; Gupta, R.; Majumdar, A. Forecasting aggregate retail sales: The case of South Africa. Int. J. Prod. Econ. 2015, 160, 66–79. [Google Scholar] [CrossRef] [Green Version]
  101. Di, G.; Latorre, V.; Lucidi, S.; Procacci, E. An application of support vector machines to sales forecasting under promotions. 4OR 2016, 14, 309–325. [Google Scholar]
  102. Zhang, G.P.; Qi, M. Neural network forecasting for seasonal and trend time series. Eur. J. Oper. Res. 2005, 160, 501–514. [Google Scholar] [CrossRef]
  103. Saarikko, T.; Westergren, U.H.; Blomquist, T. Digital transformation: Five recommendations for the digitally conscious firm. Bus. Horiz. 2020, 63, 825–839. [Google Scholar] [CrossRef]
  104. Aras, S.; Deveci Kocakoç, İ.; Polat, C. Comparative study on retail sales forecasting between single and combination methods. J. Bus. Econ. Manag. 2017, 18, 803–832. [Google Scholar] [CrossRef] [Green Version]
  105. Loureiro, A.L.; Miguéis, V.L.; da Silva, L.F. Exploring the use of deep neural networks for sales forecasting in fashion retail. Decis. Support Syst. 2018, 114, 81–93. [Google Scholar] [CrossRef]
  106. Quaedvlieg, R. Multi-horizon forecast comparison. J. Bus. Econ. Stat. 2021, 39, 40–53. [Google Scholar] [CrossRef] [Green Version]
  107. Leuthold, R.M. On the use of Theil’s inequality coefficients. Am. J. Agric. Econ. 1975, 57, 344–346. [Google Scholar] [CrossRef]
Figure 1. Historical trends and structure of US e-commerce retail sales per sector (panel a); historical trends and structure of US total retail sales per sector (panel b).
Figure 1. Historical trends and structure of US e-commerce retail sales per sector (panel a); historical trends and structure of US total retail sales per sector (panel b).
Electronics 11 03194 g001aElectronics 11 03194 g001b
Figure 2. Quarterly evolution of E-Commerce Retail Sales [ECOMSA] over 1999–2021; Author’s representation with data retrieved from the Federal Reserve Bank of St. Louis (FRED), which sources data from U.S. Census Bureau; https://fred.stlouisfed.org/series/ECOMSA; accessed on 8 September 2022.
Figure 2. Quarterly evolution of E-Commerce Retail Sales [ECOMSA] over 1999–2021; Author’s representation with data retrieved from the Federal Reserve Bank of St. Louis (FRED), which sources data from U.S. Census Bureau; https://fred.stlouisfed.org/series/ECOMSA; accessed on 8 September 2022.
Electronics 11 03194 g002
Figure 3. Quarterly evolution of the share of E-Commerce in Total Retail Sales [ECOMPCTSA] over 1999–2021, Author’s representation with data retrieved from the Federal Reserve Bank of St. Louis (FRED), which sources data from U.S. Census Bureau; https://fred.stlouisfed.org/series/ECOMPCTSA, accessed on 8 September 2022.
Figure 3. Quarterly evolution of the share of E-Commerce in Total Retail Sales [ECOMPCTSA] over 1999–2021, Author’s representation with data retrieved from the Federal Reserve Bank of St. Louis (FRED), which sources data from U.S. Census Bureau; https://fred.stlouisfed.org/series/ECOMPCTSA, accessed on 8 September 2022.
Electronics 11 03194 g003
Figure 4. Integrated framework for forecasting e-commerce sector variables (i.e., e-commerce retail sales ECOMSA) and share of e-commerce in retail sales (ECOMPCTSA) where the algorithm follows subsequently path A (red color) and path B (blue color).
Figure 4. Integrated framework for forecasting e-commerce sector variables (i.e., e-commerce retail sales ECOMSA) and share of e-commerce in retail sales (ECOMPCTSA) where the algorithm follows subsequently path A (red color) and path B (blue color).
Electronics 11 03194 g004
Figure 5. Model performance over testing window (e-commerce retail sales, path A): TBATS (panel a), and NNAR (panel b). Source: estimation results.
Figure 5. Model performance over testing window (e-commerce retail sales, path A): TBATS (panel a), and NNAR (panel b). Source: estimation results.
Electronics 11 03194 g005
Figure 6. Model performance over testing window (e-commerce retail share, path A): TBATS (panel a), and NNAR (panel b). Source: estimation results.
Figure 6. Model performance over testing window (e-commerce retail share, path A): TBATS (panel a), and NNAR (panel b). Source: estimation results.
Electronics 11 03194 g006
Table 1. Performance metrics for the six models on ECOMSA (path A and path B).
Table 1. Performance metrics for the six models on ECOMSA (path A and path B).
Path A: Training Set (October 1999 to March 2017) and Test Set (April 2017 to December 2019)
Predictive ModelMAEMAPEMASERMSETheil’s UACF1
NNAR4662.793.192.746592.651.480.64
ETS2131.341.591.252548.140.630.36
ARIMA1923.241.441.132353.490.580.41
STS1978.931.481.162421.460.600.39
HW2102.571.571.242526.420.630.37
TBATS1885.011.411.112301.640.570.40
DM (TBATS/ARIMA) (p-value)0.0357 **
Path B: Training Set (October 1999 to June 2017) and Test Set (July 2017 to September 2021)
Predictive ModelMAEMAPEMASERMSETheil’s UACF1
NNAR32,950.7026.6924.6833,666.451.740.50
ETS15,516.7312.6011.9915,846.251.470.41
ARIMA15,530.9412.1111.5615,860.501.520.41
STS15,540.4012.5911.5815,870.031.470.41
HW15,530.9513.6711.6715,860.511.480.41
TBATS14,702.2911.5911.0115,000.501.470.39
DM (TBATS/STS) (p-value)0.0729 **
** significant at 5%.
Table 2. Performance metrics for the six models on ECOMPCTSA (path A and path B).
Table 2. Performance metrics for the six models on ECOMPCTSA (path A and path B).
Path A: Training Set (October 1999 to March 2017) and Test Set (April 2017 to December 2019)
Predictive ModelMAEMAPEMASERMSETheil’s UACF1
NNAR0.282.602.310.381.450.61
ETS0.111.050.890.120.460.29
ARIMA0.111.080.910.120.470.31
STS0.101.010.850.120.450.28
HW0.121.150.960.130.490.34
TBATS0.101.020.860.120.450.26
DM (STS/TBATS) (p-value)0.653
Path B: Training Set (October 1999 to December 2019) and Test Set (January 2020 to October 2021)
Predictive ModelMAEMAPEMASERMSETheil’s UACF1
NNAR1.7213.6514.812.261.210.72
ETS0.916.737.801.531.080.56
ARIMA0.916.617.081.521.090.55
STS0.876.347.611.520.870.55
HW0.906.567.681.501.080.55
TBATS0.765.516.601.380.870.52
DM (STS/TBATS) (p-value)0.0217 **
** significant at 5%.
Table 3. Excess e-commerce retail sales over the pandemic window (in millions of dollars).
Table 3. Excess e-commerce retail sales over the pandemic window (in millions of dollars).
Real Value (mil. USD)Estimated Value (mil. USD)Excess E-Sales (mil. USD)
2020 Q1154,575155,453.7−878.7
2020 Q2203,847159,944.543,902.5
2020 Q3201,385164,464.136,920.9
2020 Q4199,665169,012.330,652.7
2021 Q1215,290173,588.441,701.6
2021 Q2221,951178,192.243,758.8
2021 Q3214,586182,823.231,762.8
Sum 227,820.6
Table 4. Excess e-commerce retail sales as a percent of total sales over the pandemic window (%).
Table 4. Excess e-commerce retail sales as a percent of total sales over the pandemic window (%).
Real Value(%)Estimated Value (%)Excess E-Share (%)
2020 Q111.411.250.15
2020 Q215.711.494.21
2020 Q313.811.732.07
2020 Q413.611.971.63
2021 Q113.612.211.39
2021 Q213.312.450.85
2021 Q313.012.690.31
Sum 10.61
Table 5. Trend estimation from 2021Q4 to 2025Q4 for e-sales (ECOMSA) and e-share (ECOMPTCSA).
Table 5. Trend estimation from 2021Q4 to 2025Q4 for e-sales (ECOMSA) and e-share (ECOMPTCSA).
Point Forecast E-Sales (mil. USD)Point Forecast E-Share (%)
2021 Q4225,330.113.47
2022 Q1233,242.813.67
2022 Q2241,361.613.87
2022 Q3249,690.014.08
2022 Q4258,231.614.28
2023 Q1266,989.914.49
2023 Q2275,968.514.65
2023 Q3285,171.214.89
2023 Q4294,601.515.09
2024 Q1304,263.215.30
2024 Q2314,160.015.56
2024 Q3324,295.715.71
2024 Q4334,674.115.93
2025 Q1345,298.916.11
2025 Q2356,174.216.32
2025 Q3367,303.616.54
2025 Q4378,691.216.72
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Tudor, C. Integrated Framework to Assess the Extent of the Pandemic Impact on the Size and Structure of the E-Commerce Retail Sales Sector and Forecast Retail Trade E-Commerce. Electronics 2022, 11, 3194. https://doi.org/10.3390/electronics11193194

AMA Style

Tudor C. Integrated Framework to Assess the Extent of the Pandemic Impact on the Size and Structure of the E-Commerce Retail Sales Sector and Forecast Retail Trade E-Commerce. Electronics. 2022; 11(19):3194. https://doi.org/10.3390/electronics11193194

Chicago/Turabian Style

Tudor, Cristiana. 2022. "Integrated Framework to Assess the Extent of the Pandemic Impact on the Size and Structure of the E-Commerce Retail Sales Sector and Forecast Retail Trade E-Commerce" Electronics 11, no. 19: 3194. https://doi.org/10.3390/electronics11193194

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop