1. Introduction
Previous studies have explored the performance of clean energy assets compared to conventional energy assets and the significance of environmental, social, and governance (ESG) investments during financial crises. In this context,
Kuang (
2021) discovered that green bonds and clean energy stocks effectively mitigate the downside risks associated with conventional energy stocks, albeit with varying risk-reduction capabilities.
Miralles-Quirós and Miralles-Quirós (
2019) utilized a Vector Auto-Regressive–Asymmetric Dynamic Conditional Correlation (VAR-ADCC) approach to demonstrate that alternative energy Exchange-Traded Funds (ETFs) can outperform energy ETFs. Similarly,
Pavlova and Boyrie (
2022) employed a five-factor model and reported that ESG ETFs can outperform conventional ones, aligning with the findings of
Folger-Laronde et al. (
2020), who utilized analysis of variance (ANOVA) and multivariate regression models and concluded that higher levels of sustainability performance in ETFs do not safeguard investments during market downturns. However, during the Coronavirus Disease (COVID-19) period, they performed no better than conventional ETFs. These latter findings contradict the conclusions of
Broadstock et al. (
2021), who investigated the short-term cumulative returns of the China Securities Index 300 (CSI300) and concluded that the ESG role had incremental importance during times of crisis.
When examining past financial crises,
Alexopoulos (
2018) employed return and risk measures and found that the 2008 financial crisis affected clean energy ETFs more significantly than the 2014 oil crisis affected conventional energy ETFs, as the former were more sensitive to exogenous factors than the latter.
Dawar et al. (
2021) provided evidence for the decreasing dependence of clean energy stock returns on crude oil returns, reinforcing the theory that these two assets react differently to market turmoil.
While many studies have assessed the performance of energy funds, they mostly focus on indicators based on risk and return factors such as Jensen’s alpha (
Jensen 1968), value at risk (VaR), or the Sharpe ratio (
Sharpe 1966). Other models, such as the Carhart model (
Carhart 1997) and the Fama–French five-factor model (
Fama and French 2015), incorporate additional factors like momentum, investment, size, and value. In terms of evaluating the efficiency of a specific fund or portfolio, numerous authors have employed various Data Envelopment Analysis (DEA) approaches to reach their findings. For example,
Murthi et al. (
1997) used the original DEA model proposed by
Charnes et al. (
1978) to assess the efficiency of mutual funds, while
Basso and Funari (
2016) applied a DEA model with variable returns to scale (VRS) to compare the results obtained from this model with traditional financial indicators.
In this given context, the DEA approach emerges as particularly advantageous. DEA, a mathematical programming methodology, proves adept at gauging the efficiency of a group of entities termed Decision-Making Units (DMUs). In our specific case, these DMUs are the ETFs undergoing evaluation, with their performance intricately defined by multiple inputs (indicators being minimized) and outputs (indicators being maximized). Employing this methodology results in the classification of DMUs as either efficient or inefficient based on a singular efficiency score.
Moreover, DEA facilitates the identification of benchmarks for inefficient DMUs, supplying managers with invaluable insights into best practices. Additionally, the DEA methodology has gained widespread acceptance and application in measuring ETF performance due to its ability to surmount limitations associated with conventional performance measures, a point underscored by
Murthi et al. (
1997) and
Choi and Min (
2017).
Notably,
Murthi et al. (
1997) underscored the advantages of this tool in portfolio performance assessment. First and foremost, the methodology dispenses with the need for a theoretical reference model (e.g., the Capital Asset Pricing Model or the Arbitrage Pricing Theory). Instead, the efficiency of each ETF (DMU) is measured against a set of efficient ETFs within the same category. Secondly, it enables the simultaneous consideration of both risk (inputs) and profit measures (outputs), culminating in a comprehensive performance assessment score.
The outputs most used in DEA models are related to return metrics, such as average return, Sharpe ratio, and portfolio return. On the other hand, the inputs in DEA models can vary more widely and include factors such as expense ratio, value at risk (VaR), turnover, standard deviation, and several others that will be discussed in further detail. In this paper, we employ a Slack-Based Measure (SBM) DEA model combined with cluster analysis, considering two inputs (expense ratio and beta) and three outputs (annualized average return; environmental social responsibility and corporate governance; and net asset value), to evaluate the efficiency of 14 alternative energy equity ETFs and 55 conventional energy equity ETFs. This model will allow us to evaluate and compare the efficiency of both types of ETFs across two distinct time periods: a one-year period (2020) and a three-year period (2018, 2019, and 2020). Additionally, it will help us identify the inputs and outputs that have a greater impact on their efficiency.
From an investor’s standpoint, this study holds significant value, as it identifies the most relevant indicators for each category of ETFs across various time frames, encompassing both normal and crisis periods, including the economic downturn triggered by COVID-19. Furthermore, it evaluates the importance of environmental factors over time and compares them with other established financial factors. Hence, instead of assessing hypotheses derived from an underlying asset-pricing model or other theoretical finance framework, the methodology and metric employed in this exercise has the ability to clarify empirical patterns within the sample data.
Finally, unlike the DEA models employed in the literature (see
Section 2) that rely on the assumption of homogeneous DMUs—the ETFs in our case–which can limit their accuracy in analyzing heterogeneous datasets, we employ clustering benchmarking, a technique that breaks down a group of ETFs into clusters based on their distinct characteristics (conventional vs. alternative energy). This approach allows for the identification of similarities within clusters and differences between them, thereby enhancing the accuracy of DEA analysis. Furthermore, comparing DEA models without clustering benchmarking to those that employ it enables the calculation of the technology gap ratio (TGR), a metric that assesses the relative efficiency of different ETF clusters against the whole set of ETFs (i.e., the meta-frontier). Additionally, a robustness analysis has been conducted, which enables the evaluation of potential impacts on efficiency stemming from variations in factors, which may transpire during periods of market turbulence.
The remainder of the paper is organized as follows:
Section 2 provides a review of previous works that have applied DEA to evaluate efficiency or have utilized other models to analyze ESG or energy funds.
Section 3 outlines the DEA methodology employed in this study.
Section 4 presents details about the data used and the rationale behind selecting specific inputs and outputs.
Section 5 presents and examines the results obtained. Finally,
Section 6 presents the main conclusions and offers potential insights for investors based on the findings.
2. Literature Review
Table 1 highlights the literature that exists on the performance evaluation of mutual funds, ETFs, and portfolios using DEA models, which incorporate various inputs and outputs. However, it is evident that a limited number of studies have specifically applied the DEA approach to assess the efficiency of energy ETFs or incorporated environmental factors in the assessment.
For instance,
Allevi et al. (
2019) utilized a DEA model that incorporated environmental consumption and saving indicators to evaluate the performance of European green mutual funds. Their findings indicated a positive relationship between environmental factors and the funds’ efficiency. Similarly,
Tsolas and Charles (
2015) employed SBM models (
Tone 2001) to evaluate the performance of green ETFs, although their study did not consider environmental-based factors as indicators.
Given the rapid growth of investment in clean energy and the increasing popularity of the ETF industry due to its accessibility to a wide range of investors, it becomes crucial to incorporate environmental and socially responsible factors when evaluating any type of open-end fund, especially those focusing on energy equities.
In subsequent studies,
Choi and Min (
2017) applied the Range Direction Model (RDM) to assess 312 mutual funds, and
Tsolas (
2022) used the same methodology for utility ETFs. Although the RDM is non-radial (i.e., it allows to account for non-proportional changes in the inputs and outputs), it is not units-invariant, limiting its applicability for comparing DMUs with different units of measurement for inputs and outputs.
Like
Tsolas and Charles (
2015), we employ the SBM DEA model, which is non-radial and units-invariant. This makes the SBM DEA model more flexible and robust than the Charnes, Cooper, and Rhodes (CCR) model (
Charnes et al. 1978) used by
Murthi et al. (
1997) and the Banker, Charnes, and Cooper (BCC) models (
Banker et al. 1984) applied by
Premachandra et al. (
2012),
Basso and Funari (
2014), and
Basso and Funari (
2016). The CCR DEA model assumes Constant Returns to Scale (CRS), which may not always hold. In contrast, despite assuming VRS, the BCC model, like the CCR model, assumes a proportional change in either inputs or outputs (depending on the model’s orientation) for efficiency, a somewhat unrealistic assumption.
Zhang and Chen (
2018) utilized window DEA analysis and a DEA directional distance to assess energy portfolios based on daily fossil-fuel futures prices. However, akin to the BCC and CCR models, the DEA directional distance model is radial. Finally,
Tsolas (
2019) employed Grey Relational Analysis, deemed suitable for problems considering multiple factors and their inter-relationships, while
Henriques et al. (
2022) used DEA as a filtering procedure to compute ETF portfolios in the energy sector.
One common limitation of all these studies, thus far, is the treatment of ETFs as homogeneous DMUs. Therefore, we contribute to the literature by proposing cluster analysis to group ETFs based on their characteristics and performance. This allows us to compare the efficiency of ETFs within each cluster, rather than just comparing the efficiency of all ETFs together. This is a more granular approach that provides a more nuanced understanding of the efficiency of ETFs.
Finally, the paper analyzes the performance of ETFs across two distinct time periods (2020 and 2018–2020) to capture the impact of market conditions and the COVID-19 pandemic. This is important because the relative performance of ETFs can vary depending on market conditions.
3. Methodology
DEA, initially introduced by
Charnes et al. (
1978), is a model used to estimate the efficiency of DMUs with CRS. Later,
Banker et al. (
1984) extended the CCR model and introduced the BCC model, which allows for VRS.
In our study, we used the SBM model proposed by
Tone (
2001). This model offers a more suitable approach for assessing efficiency as it is non-radial and units-invariant, unlike the CCR and BCC models. Furthermore, the SBM model can be input-, output-, or non-oriented, providing flexibility in evaluating efficiency. This latter model not only measures efficiency but also provides valuable information on the specific increase or decrease required for each output or input of the inefficient DMUs, respectively.
Additionally, we incorporated cluster analysis into the SBM model to evaluate the efficiency of DMUs based on the cluster frontier. The combination of the SBM model and cluster analysis enabled us to account for the unique characteristics and performance of each DMU (
Tone and Tsutsui 2015).
3.1. The SBM Model
Considering a set of
DMUs
, where
X = [
xij,
i = 1, 2, …,
m,
j = 1, 2, …,
n] is the (
m ×
n) matrix of inputs,
Y = [
yrj, r = 1, 2, …,
s,
j = 1, 2, …,
n] is the vector of
outputs (
s ×
n) and the rows of these matrices corresponding to the inputs and outputs of DMU
k are, respectively, given by
and
, with
T indicating the transpose of a vector. The SBM model is given as (
Tone 2001):
where
ρ is the efficiency score;
for all
i = 1, …,
m, and
, for all
r = 1, …,
s, represent the input excesses and output shortfalls for each input and output, respectively; and
represents an intensity factor that refers to the importance of the DMU
j (
j = 1, …,
n) that is viewed as the benchmark.
From problem (1), we can perceive that an increase in
or
,
ceteris paribus, will decrease its objective function value. Hence, it can be stated that 0 <
ρ < 1.
ρ in (1) can also be formulated as:
The ratio evaluates the rate of reduction in input i and is the average reduction in inputs, in percentage. Likewise, the ratio measures the rate of increase in output r and is the average increase in outputs, in percentage. Hence, ρ is considered the ratio of average inefficiencies of inputs and outputs.
Model (1) is converted into model (3), using a positive scalar variable
t:
If
=
,
=
and
Λ =
, problem (3) becomes:
The optimal solution is formulated as:
ρ* = τ*, * = Λ*/t*, = /t*, = /t*.
Definition 1. A DMUk is SBM-efficient if . The previous condition can be equally defined as = 0 and = 0.
Definition 2. It is possible to obtain a set of efficient reference units for the SBM-inefficient DMUk by considering the indices of the DMUs related with .
Consider the reference set of the SBM-inefficient DMU
k as: E
k = {
j:
,
j = 1, …,
n}. The point of the efficient frontier that can be considered as a reference DMU for the SBM-inefficient DMU
k is:
Model (1) uses the CRS assumption. To consider VRS, it is required to add the constraint to model (1).
Definition 3. A DMUk is SBM-output-efficient if , which is equivalent to = 0. Nevertheless, it is possible that ≠ 0.
3.2. SBM Model with Cluster Benchmarking
Traditional DEA models incorporate data assumptions that can impact the accuracy of the results.
Corton and Berg (
2009) assert that one crucial factor influencing DEA’s accuracy is the assumption of homogeneity among the DMUs. To address this concern, clustering benchmarking proves valuable when dealing with heterogeneous data (
Jiang et al. 2020). Clusters can be identified by applying a statistical clustering method tailored to the specific issue at hand or externally sourced, drawing upon the insights of experts in the field (
Tone and Tsutsui 2015).
This technique involves breaking down a group of DMUs into clusters based on distinct characteristics, enabling the identification of similarities within clusters and differences between them. By leveraging the homogeneity within clusters and the heterogeneity across different clusters, clustering benchmarking offers advantages. A comparison between DEA models without clustering benchmarking (i.e., the meta-frontier with an overall set of DMUs) and those that employ this feature allows the calculation of the technology gap ratio (TGR) (
Battese et al. 2004). The TGR
k of DMU
k is computed as follows:
where
is the SBM-output-efficiency value of DMU
k that corresponds to the meta-frontier and
is the SBM-output-efficiency value of DMU
k that corresponds to the cluster-frontier. The lower the TGR value, the higher the grouping requirement, and vice versa. The value of TGR
k varies between 0 and 1, and a TGR
k closer to 1 suggests that there is a small gap between the meta-frontier and the cluster-frontier. Expression (8) reveals that the efficiency of DMU
k measured for the meta-frontier can be obtained by the product of the efficiency of DMU
k measured for the cluster-frontier and the TGR:
3.3. Robustness Analysis
In practical applications, precise measurements of input and output factors can be challenging, and this becomes crucial when employing DEA since the efficiency of DMUs is sensitive to potential data errors (
Zerafat Angiz et al. 2012). To address this, various approaches have been proposed to incorporate fuzziness into DEA models by introducing tolerance levels. One prominent approach is the α-level approach, which transforms the fuzzy DEA model into a set of parametric programs to determine the lower and upper bounds of efficiency scores (
Hatami-Marbini et al. 2011). The MaxDEA software enables the inclusion of fuzzy inputs and outputs in the model, computing the lower and upper bounds of efficiency scores. This facilitates the evaluation of the impact that changes in inputs and outputs can have on efficiency scores. The lower and upper bounds of inputs and outputs are defined as follows:
where [
,
] and [
,
] are the α-level form of the fuzzy inputs and fuzzy outputs, and
L and
U designate the lower and upper bounds, respectively. By applying a tolerance
(i.e., a percentage perturbation, representing a deviation in terms of percentage change) to these inputs and outputs it is possible to obtain the new following intervals:
5. Results
This section presents the results obtained from the application of the proposed DEA model, utilizing the MaxDEA 8 Ultra software for computation.
In our analysis, we used statistical tests to evaluate the differences in the TGR between the two clusters. Specifically, we applied the
t test for differences in means (
Table 5) and the Kolmogorov–Smirnov test (
Table 6). Upon thorough examination, we identified significant differences between the clusters, allowing us to affirm the validity of the chosen approach.
Detailed insights are provided through
Table 7 and
Table 8, offering comprehensive descriptive statistics for the efficiency scores (cluster- and meta-frontier) and the corresponding TGR values. The analysis encompasses EEs and AEEs across multiple time horizons, providing a holistic understanding of their performance.
The key findings of the analysis are succinctly summarized in the preceding tables, revealing a higher potential for improvement among EEs in both the 1-year and 3-year time periods.
Specifically, the EEs exhibit room for improvement of 72% in the 1-year period and 81% in the 3-year period, whereas the AEEs demonstrate room for improvement of only 50% in the 1-year period and 45% in the 3-year period.
Figure 1 and
Figure 2 visually depict the efficiency scores for the meta-frontier and cluster-frontier. The left side showcases the performance of AEEs, while the right side displays the performance of EEs.
The analysis of
Figure 3 and
Figure 4 reveals a noticeable disparity between the meta-frontier and cluster-frontier for AEEs compared to EEs in both time periods. AEEs exhibit a more significant difference between the two frontiers. This observation is supported by the considerably lower average TGRs for AEEs, with a minimum of 0.26 in the 1-year period, compared to EEs, which have a minimum of 0.96 in the 3-year period. These values indicate a larger gap between the frontiers for AEEs.
Additionally, it is noteworthy that the average TGR values for AEEs slightly decreased over the 3-year period, indicating a growing disparity between the two frontiers, while EEs experienced a slight decrease. This trend suggests that the gap between the meta-frontier and cluster-frontier slightly increased for AEEs, while it slightly narrowed for EEs over the extended time duration.
During the 1-year period, there was a decrease in the number of efficient alternative energy ETFs (AEEs) from the cluster-frontier (4 ETFs) to the meta-frontier (1 ETF), whereas the number of efficient conventional energy ETFs (EEs) remained consistent for both frontiers (12 ETFs)—as depicted in
Figure 3.
In the 3-year period, there was a decline in the number of efficient AEEs and EEs from the cluster-frontier to the meta-frontier, with a more pronounced decrease observed in the former. Specifically, the number of efficient AEEs decreased from seven to three ETFs. Furthermore, upon conducting a combined analysis of the two periods, it becomes evident that the number of efficient AEEs is higher in the 3-year period, while the number of efficient EEs is higher in the 1-year period—as depicted in
Figure 4.
These findings align with the research of
Pavlova and Boyrie (
2022), who observed that ESG ETFs outperformed the market prior to the crisis but did not exhibit superior performance during the COVID-19 crash. Similarly,
Demers et al. (
2021) reported that companies with higher ESG scores did not experience superior returns during the COVID-19 crisis. However, contrasting results were reported by
Nofsinger and Varma (
2014) for the 2000–2011 period, where socially responsible investment (SRI) outperformed conventional funds during the crisis period of 2007–2009.
Table 9 presents the percentage of efficient ETFs categorized by type. It is evident that, for the cluster-frontier, the percentage of efficient AEEs surpasses that of efficient EEs in both periods. This indicates that a larger proportion of AEEs are capable of maximizing their outputs.
Figure 5 and
Figure 6 illustrate that the average values of outputs (NAV, ESG, and return) are significantly higher for efficient ETFs compared to inefficient ones, particularly in the 3-year period. Specifically, efficient funds exhibited higher NAV, ESG, and return values by 341%, 78%, and 66%, respectively. On the other hand, the average values of inputs (beta and ER) are lower for efficient ETFs compared to inefficient ones in both periods, with this difference being more pronounced in the 3-year period. In the 3-year period, efficient funds had lower beta and ER values by 58% and 22%, respectively.
5.1. Potential Improvements
Figure 7 and
Figure 8 showcase the potential improvement for each input and output that inefficient ETFs can achieve to become efficient, based on the analysis of the cluster-frontier results. The data presented pertain to a 1-year period and are organized by indicator and ETF category.
Among the indicators, the ER demonstrates the highest potential for improvement, with a remarkable −137%. This implies a reduction in the average ER from 0.0069 to 0.0029. Notably, the potential for improvement in ER is more pronounced for EEs at −182% compared to AEEs at −32%. The NAV also displays a significant improvement potential, averaging 97%. EEs exhibit a higher improvement potential of 97% compared to AEEs at 81%. ESG demonstrates a reasonable average improvement potential of 36%, with a more substantial relevance for EEs at 43% compared to AEEs at 13%. This highlights the importance of environmental, social, and governance factors for EEs compared to AEEs. Return and beta indicate the lowest potential for improvement, with averages of 14% and −5%, respectively. It is noteworthy that the improvement potential for return in AEEs is minimal at 0.4%, while beta shows a more significant value of 10%. Conversely, EEs have a 15% improvement potential for return and a lower value of −4% for beta. Overall, to enhance the efficiency of both EEs and AEEs over the 1-year period, a reduction in the ER and an increase in the NAV are key areas for improvement.
The analysis conducted in the previous section is also applied to a 3-year period, as illustrated in
Figure 9 and
Figure 10, to examine the impact of time on the results. Similarly to the 1-year period, the ER exhibits the highest potential for improvement, with a remarkable −212%. This indicates a reduction in the average ER from 0.0068 to 0.0022. Consistent with the findings from the 1-year period, the potential for improvement in ER is more pronounced for EEs at −267% compared to AEEs at −52%. The NAV demonstrates the same level of improvement potential as observed in the 1-year period, with nearly identical values for both EEs (97%) and AEEs (96%). ESG values show a similar improvement potential to the 1-year period, with an average of 33%, but with a smaller gap between EEs (37%) and AEEs (17%). Return and beta indicate significantly more room for improvement compared to the 1-year period, with average values of 23% and −22%, respectively. When comparing the 3-year period to the 1-year period, it is noteworthy that AEEs show a higher improvement potential for return (35% versus 20%) while EEs display a 20% improvement potential for return. Therefore, it can be concluded that the average absolute values for improvement potential increase over a longer period (except for ESG). However, the most critical areas for achieving higher efficiency remain the same for both ETF categories—the ER and NAV.
5.2. Correlation Analysis
Table 10 presents the correlation values between the efficiency scores and the indicators for both ETF categories across the two periods. Upon analyzing this table, it becomes evident that there exists a positive correlation between the outputs (return, ESG, and NAV) and the efficiency scores, while a negative (or null) correlation is observed between the inputs (ER and beta) and the efficiency scores. It is also noteworthy that the absolute values of the correlations tend to be stronger over a longer period, with the exception of ESG, which exhibits more relevance over the shorter period, particularly for AEEs. In the 1-year period, the indicator with the strongest correlation to efficiency scores for AEEs is beta (−0.572), while for EEs it is ESG (0.330). Conversely, in the 3-year period, the indicator with the strongest correlation to efficiency scores for AEEs is return (0.628), and for EEs, it is NAV (0.403).
5.3. Robustness Analysis
For a 1-year period, when incorporating a tolerance (i.e., a percentage perturbation, representing a deviation in terms of percentage change) of δ = 5% to the current inputs and outputs based on the cluster-frontier, a total of seven ETFs (two AEEs and five EEs) are identified as robustly efficient, while thirty-seven remain robustly inefficient, and twenty-five are potentially efficient. When expanding the tolerance to δ = 10%, the number of robustly efficient ETFs remains the same at seven (two EEs and five AEEs), while twenty-four remain robustly inefficient, and thirty-eight become potentially efficient (see
Figure 11). Considering a 5% to 10% increase in the factors, AEEs exhibit a potential improvement ranging from 58% to 77%, whereas EEs show a greater potential improvement ranging from 64% to 122% (see
Figure 12). Conversely, if all factors experience a 5% to 10% decrease, AEEs are expected to face a potential worsening of −32% to −37%, while EEs show a higher potential worsening ranging from −47% to −50% (see
Figure 13). Overall, the efficiency of AEEs demonstrates greater resilience to potential variations in inputs and outputs compared to EEs, which display a more pronounced sensitivity in terms of efficiency.
Over a 3-year period, when considering a tolerance of δ = 5% applied to the inputs and outputs based on the cluster-frontier, eight ETFs (three AEEs and five EEs) are identified as robustly efficient, while fifty-two remain robustly inefficient, and nine are potentially efficient. Expanding the tolerance to δ = 10%, the number of robustly efficient ETFs reduces to six (two EEs and four AEEs), with fifty remaining robustly inefficient and thirteen becoming potentially efficient (refer to
Figure 12). Upon increasing the factors by 5% and 10%, AEEs exhibit a potential improvement of 23% to 47%, respectively, whereas EEs show a lower potential improvement ranging from 7% to 14% (see
Figure 13). Conversely, when decreasing the factors by 5% and 10%, AEEs experience a potential worsening of −23% to −41%, respectively, while EEs demonstrate a slightly lower potential worsening ranging from −15% to −32% (see
Figure 14). In contrast to the observations from the one-year period, EEs demonstrate greater immunity to efficiency variations over the longer term when applying tolerances to the inputs and outputs, compared to AEEs.
Thus, when perturbing inputs and outputs, AEEs exhibit a greater improvement potential than EEs over the longer term, while EEs demonstrate significantly higher room for improvement than AEEs in the short term. Furthermore, it can be observed that the absolute values associated with potential worsening are smaller than those of potential improvement for AEEs in both periods. However, for EEs, the potential worsening becomes more pronounced over the longer term.
6. Conclusions
The rapid growth of the clean energy sector presents an opportunity for investors and fund managers to diversify their portfolios by including firms focused on sustainable energy sources. This study investigates the efficiency of alternative energy ETFs in comparison to conventional energy ETFs, considering both financial and environmental factors. The performance of 14 alternative energy ETFs and 55 conventional energy ETFs was evaluated using an SBM-DEA model over a one-year period (2020) and a three-year period (2018–2020). This non-radial modeling approach provides insights into the necessary adjustments that inefficient ETFs should make to achieve efficiency. By combining the SBM model with cluster analysis, ETFs can be grouped according to their energy type, distinguishing between AEEs and EEs. Furthermore, robustness analysis allows for an assessment of the potential influence on efficiency resulting from changes in factors, which can occur during periods of market turbulence.
When the data are divided into groups based on ETF type, the percentage of efficient AEEs increases significantly, while EEs do not exhibit a strong reaction. TGR values for AEEs are substantially lower, indicating that these ETFs, on average, only achieve 26% and 38% of their potential output over the 1-year and 3-year periods, respectively, when compared to the meta-frontier. Hence, there is a stronger connection between the inputs and outputs used in the assessment for conventional energy ETFs. This outcome could be attributed to the absence of environmental and socially responsible factors, which would likely exhibit a stronger association with alternative energy funds, rather than performance indicators that are more applicable to established and highly valued ETFs.
Introducing 5% and 10% perturbations in the evaluation factors has minimal impact on efficiency, particularly over the longer term, as more than 50 ETFs remain robustly inefficient during this period. However, in the short term, variations in indicators could lead to expanded potential efficiency for more ETFs when increasing the tolerance level. Our results further highlight that the factors that most significantly influence the efficiency of both ETF categories are the ER and NAV. Therefore, reducing operational expenses and improving the assets/liabilities ratio are crucial actions for fund managers to enhance ETF efficiency. Additionally, it is noteworthy that an increase in ESG factors would positively impact conventional energy ETFs to a greater extent than alternative energy ETFs, with even more pronounced effects compared to return or beta.
From an investor’s perspective, it is important to consider how alternative and conventional energy ETFs react to changes in the same indicators, especially during periods of market crises, as investigated in our study. The rapid growth of the clean energy industry and the diversity offered by AEEs present compelling reasons for introducing these securities into investor portfolios. In this study, we have endeavored to provide a set of inputs and outputs that better assess the efficiency of ETF categories from both financial and environmental perspectives. However, we acknowledge that the reported data lack a significant sample of alternative energy ETFs, which could lead to more relevant and comprehensive results.
Future research should incorporate additional indicators to evaluate the efficiency of AEEs and EEs, particularly focusing on more specific indicators based on environmental and socially responsible attributes, alongside substantial data for alternative energy ETFs.
Additional research avenues may explore alternative methodologies, such as investigating the influence of ESG factors on the efficiency of AEEs and EEs through a multi-objective metaheuristic algorithm (e.g.,
Tian et al. 2023). Alternatively, researchers could delve into the development of an energy-focused stochastic disassembly line balancing problem for AEEs and EEs (e.g.,
Zhang et al. 2023).