1. Introduction
Policymakers and other decision-makers in agri-food supply chains (AFSCs) play a key role in advancing sustainable strategies, particularly in regions facing significant social, economic, and environmental constraints [
1]. Sustainability assessment must shift from traditional economic indicators to a multidimensional perspective that integrates environmental, social, and governance dimensions, as climate change, resource depletion, and socioeconomic volatility increase AFSC vulnerability [
2]. Developing empirically based frameworks to assess the actual sustainability performance of policy interventions across diverse capitals, while accounting for institutional capacity and data limitations, remains challenging [
3]. Despite numerous studies on TFP at macro and farm levels, few have explicitly addressed sustainability assessment at the institutional level in Tunisia, considering data limitations and recent policy reforms.
In this regard, total factor productivity (TFP), which calculates the ratio of total outputs to a bundle of inputs, is widely recognized as an index of resource-use efficiency [
4,
5]. Although TFP has traditionally been employed to assess technical or economic performance, recent studies are increasingly exploring it as a composite sustainability metric that takes into account input substitution, technological advancement, and structural changes influenced by public policies [
6,
7]. TFP provides a comprehensive depiction of system efficiency, which is suitable for evaluating long-term effects across environmental, social, and economic dimensions, in contrast to single-factor productivity measures.
A thorough methodological analysis is necessary to evaluate AFSC policymakers’ sustainability performance. Although TFP is commonly used in agricultural economics, its potential for assessing sustainability, especially at the level of institutions, remains largely unexplored. Several approaches exist to measure TFP, each with its own strengths and limitations, and they are generally grouped into growth accounting, frontier-based, and index-based methods. Most research to date has focused on macro levels, such as national, regional, or farm-level analyses. For example, Afzal et al. [
8] applied a growth accounting framework to study long-term productivity at the national level in Pakistan, while [
9] examined technical change and efficiency across regional crop systems. Liu et al. [
10] and Wang et al. [
11] used frontier-based techniques, including data envelopment analysis (DEA), slack-based measures (SBMs), and Malmquist decomposition, to capture scale efficiency and innovation in Chinese agricultural regions. While these methods provide valuable insights for broad benchmarking, they are less suited for assessing the specific contributions of institutional actors responsible for designing and implementing sustainability-oriented policies. While previous research has largely focused on Asian contexts, the applicability of these methods to Tunisia’s agricultural policies and institutional frameworks remains largely unexplored.
Frontier methods such as DEA, SBM, and Malmquist indices are widely employed to evaluate green or sustainable productivity [
11,
12]. They are robust tools for measuring relative efficiency and managing multiple inputs and outputs [
13], particularly when price data are scarce or unreliable. However, their effectiveness diminishes for strategic policy evaluations that require dynamic decomposition of productivity changes over time, especially when actor-specific or price-based metrics are needed [
14], and they face limitations in capturing institutional-level impacts and data-scarce environments. Although global studies [
15,
16] track productivity growth using market-based price weights, these approaches remain rarely applied in sustainability assessments at the institutional level. Some sectoral studies [
17] employed Törnqvist–Theil indices to examine input–output dynamics, and a few regional studies [
18] incorporated undesirable outputs to account for environmental concerns. These methods, while effective for benchmarking and cross-sectional comparisons, often lack direct linkage with the institutional constraints, data limitations, and policy priorities that shape real-world agricultural decision-making, particularly in developing regions. Despite these efforts, comprehensive evaluations integrating economic, environmental, and social dimensions with price-based productivity indices are still lacking.
This study seeks to answer the following research question: How can TFP methods be adapted and applied to assess the sustainability performance of institutional policymaking in Tunisia’s agri-food sector, considering data limitations and policy priorities?
The Tunisian context is characterized by ongoing agricultural policy reforms, limited availability of high-quality data, and institutional constraints, highlighting the need for regionally grounded sustainability assessment frameworks.
To fill this gap, this paper proposes an adapted framework for institutional policymaking in Tunisia, which applies carefully selected TFP methods to evaluate the sustainability performance of a policymaking institution within AFSC. Rather than introducing a completely new methodology, the study contextualizes and operationalizes existing approaches to enhance their relevance and applicability for decision-makers in data-constrained agricultural institutions. Given the diversity of TFP approaches and the complexity of sustainability issues, decision-support methods such as multi-criteria decision-making (MCDM), with particular attention to the Analytic Hierarchy Process (AHP) [
19,
20], provide a structured means of ranking alternatives by combining expert insights with quantitative evaluation [
21].
Unlike previous studies that have mainly focused on farm-level or macroeconomic TFP estimation, this research extends the use of productivity indices to an institutional policymaking context, where decisions directly influence resource allocation and sustainability outcomes. The contribution of this study lies not in proposing a fully new method but in demonstrating how an integrated AHP–TFP framework can be tailored to institutional and policy needs, providing a practical and regionally grounded application of existing productivity theories. The integration of the AHP method allows for the systematic selection of the most suitable TFP approach based on expert criteria, while the incorporation of multi-capital key performance indicators (economic, environmental, and social) into a unified, monetized productivity framework strengthens its operational value for policy design. This hybrid AHP–TFP structure not only bridges methodological and contextual gaps in the existing literature but also enhances the transparency and reproducibility of sustainability assessment for decision-makers in agri-food governance.
This paper aims to design a decision-support framework for selecting and applying appropriate TFP methods to assess the sustainability-related performance of a policymaking institution in Tunisia’s agri-food sector. The study pursues two main objectives:
- (i)
Identify and prioritize the most suitable TFP methods using AHP;
- (ii)
Empirically apply the top-ranked TFP methods to a real-world policymaking case in Tunisia, providing insights into performance gaps and areas for improvement.
The paper is structured as follows:
Section 2 outlines the TFP methods, selection criteria, and the implementation of AHP.
Section 3 describes the case study and the data employed.
Section 4 presents the empirical results, including the AHP ranking, the sustainability assessment, and the sensitivity analysis.
Section 5 discusses the main findings, emphasizing their implications, validity, and limitations. Finally,
Section 6 concludes the study and highlights avenues for future research.
4. Results
This section presents and interprets the findings from the AHP and TFP analyses, implemented using Python. The relative importance of the criteria and sub-criteria was derived from policymakers’ judgments, while the performance scores of the TFP alternatives were calculated accordingly. Based on these results, the TFP methods were subsequently ranked. The detailed findings are reported in
Section 4.1 and
Section 4.2.
4.1. AHP Results and Elasticity Analysis
The weights of the criteria and sub-criteria derived from the AHP evaluation are presented in
Figure 5. The “Data Availability and Quality (C2)” criterion emerges as the top priority, with a weight of 71.3%, highlighting the central role of data-related aspects in the evaluation process. In contrast, the “Specificities and Objectives (C1)” criterion accounts for 28.7% of the total weight. At the sub-criterion level, within C2, the “Number of Units” sub-criterion is the most influential, followed by the “Nature of the Data” and “Available Variables.” For C1, the “Communication” sub-criterion carries more weight than “Multi-output Considerations.” Overall, these results emphasize the importance of robust and well-structured data, as well as effective communication, in guiding the selection of an appropriate TFP method.
Figure 6 presents the aggregate scores of the three TFP methods, along with their individual performances for each sub-criterion. The Solow Residual shows a marked advantage in the “number of units” sub-criterion, indicating its suitability for small datasets, although its performance is weaker in terms of data diversity and communication. The Divisia Index exhibits a balanced profile, excelling in the “nature of data” and “type of available variables” sub-criteria, which reflects its adaptability to diverse datasets when reliable time-series data are available. The Törnqvist–Theil Index achieves the highest score in “multi-output considerations” and performs strongly in “communication,” highlighting its suitability for aggregating multiple outputs while maintaining interpretability.
Overall, the results reveal complementary strengths: the Solow Residual for limited datasets, the Divisia Index for versatile data handling, and the Törnqvist–Theil Index for robust multi-output analysis with clear communication.
Table 5 summarizes these findings, presenting the aggregate scores and rankings of the three methods. The Divisia Index ranks first, confirming its theoretical robustness and flexibility. The Törnqvist–Theil Index ranks second, emphasizing its strength in multi-output contexts. The Solow Residual ranks third, reflecting its more limited applicability in complex productivity assessments. Based on these results, the Divisia Index and Törnqvist–Theil Index are retained for subsequent analyses, as they provide the best balance between methodological rigor and practical relevance according to the AHP evaluation.
The consistency of expert judgments was assessed for all sub-criterion matrices using
and
, as previously defined.
Table 6 summarizes the results for each set of sub-criteria.
All
values are below the recommended threshold of 0.10, indicating that the expert judgments were coherent and reliable. These results confirm the robustness of the weighting process applied in the AHP evaluation. To further support the reliability of the analysis, the consistency ratios for all pairwise comparison matrices were examined, and all values remained below 0.10 (see
Appendix A,
Table A2). Additionally, a sample pairwise comparison matrix illustrating the structure applied to all criteria and sub-criteria is provided in
Appendix A,
Table A1. Together, these elements demonstrate the internal coherence of the AHP evaluation and the soundness of the derived weights.
In summary, the AHP analysis results confirm that each TFP method offers specific strengths, yet the Divisia Index ranks highest overall for its balanced combination of adaptability, robust data processing, and interpretability. The Törnqvist–Theil Index follows, particularly suited to multi-output contexts, while the Solow Residual remains more appropriate for small or simple datasets. These results provide a clear, evidence-based foundation for selecting the most suitable TFP calculation methods in subsequent analyses.
To assess the robustness of the AHP-based ranking of TFP methods, an elasticity analysis was conducted by perturbing the weights of the main criteria. Variations of ±10% were selected following standard practice in AHP-based studies to capture moderate uncertainty in expert judgments, while ±20% perturbations were also applied to explore more extreme scenarios. Previous studies have shown that weight perturbations can significantly affect ranking outcomes in AHP and highlighted the importance of analyzing their impact on uncertainty and dispersion in results. The exact numerical variations are presented in
Table 7, while
Figure 7 visualizes the relative impact of each criterion, allowing for a quick and intuitive comparison of sensitivity across the TFP methods.
Applying these perturbations, the Divisia Index remained highly stable, the Törnqvist–Theil Index exhibited moderate sensitivity, and the Solow Residual was the most sensitive to changes in criteria weights, confirming the robustness of the ranking obtained from the base-case AHP scores.
4.2. TFP Computation Results and Sensitivity Analysis
The results are summarized in
Table 8 and
Table 9, which present the monetized input indicators (
) in TND and their associated weights (
) for 2015 and 2020, as well as the monetized output indicators (
) with their respective weights (
). The TFP calculation is based on data for 2015 and 2020 only, as no intermediate annual data were available. The year 2015 was used as the reference year for computing productivity changes, and the analysis evaluates the single-period variation between 2015 and 2020 using ratio-based, Divisia, and Törnqvist–Theil indices. These values are expressed in TND, as detailed in the previously described shadow price-based valuation process, ensuring consistency and comparability across indicators. Together, these tables constitute the final dataset for TFP calculation, with all components harmonized for integration into the defined formulas.
As a preliminary step, before applying the selected TFP index methods, a traditional ratio-based TFP was computed using aggregated, monetized inputs and outputs for 2015 and 2020. The resulting TFP declined sharply from 0.232 in 2015 to 0.094 in 2020, corresponding to a −59.54% change. This significant drop indicates a decrease in productivity, reflecting inefficiencies in converting inputs into outputs over the study period. Although some physical productivity indicators improved, rising input costs, particularly for energy and labor, were the primary drivers of this decline. This ratio-based TFP thus serves as a baseline, emphasizing the need for more flexible, index-based methods that can better capture structural changes in the input–output relationship.
To address this, TFP was further calculated using the Törnqvist–Theil and Divisia indexes, which were identified as the most suitable methods through the AHP analysis. Both approaches incorporate the economic value and relative weight of each KPI, allowing for a more nuanced and accurate assessment of productivity changes.
Using the Divisia Index, the logarithmic variation in TFP between 2015 and 2020 is:
This corresponds to a productivity of −54.26%, closely matching the ratio-based result and thus confirming the observed efficiency decline.
In contrast, the Törnqvist–Theil Index yields a less pronounced reduction:
This represents a relative decrease of −23.70%, suggesting that although productivity declined, the magnitude of the reduction varies depending on the index method used. Nevertheless, all three approaches consistently indicate a downward trend in TFP over the five-year period.
To further assess the impact of data uncertainty on TFP outcomes, a sensitivity analysis was performed by varying shadow prices by ±10% and ±20% for all input and output KPIs. The recalculated TFP scores for each method are illustrated in
Figure 8, which presents the results using a radar chart. This visualization highlights the variations in absolute scores and confirms the consistency of the relative rankings across different shadow price assumptions.
The results indicate that the relative ranking of the TFP computation methods remained stable under both ±10% and ±20% variations in shadow prices. This stability demonstrates the robustness of the findings despite moderate uncertainties in data valuation. The analysis provides a critical examination of how shadow pricing uncertainty influences TFP outcomes, thereby enhancing the transparency and credibility of the methodological results.
6. Conclusions and Perspectives
This paper proposed an integrated methodology based on the TFP index to evaluate the sustainability of AFSC, with a particular focus on the Tunisian context. The approach followed a sequential process, including the identification and understanding of strategic KPIs, the formulation of criteria and sub-criteria for selecting the most suitable TFP method, and the application of AHP to structure expert judgments and derive relative weights. This decision-making framework enabled the prioritization of TFP methods based on multiple dimensions, including interpretability, data availability, and analytical objectives. The empirical results show that the Divisia Index emerged as the most appropriate method for TFP assessment in this context. This choice is primarily justified by its theoretical consistency and its capacity to decompose productivity growth over time using continuous data. The application of the selected TFP method provided valuable insights into the sustainability performance of the regional agricultural system between 2015 and 2020, highlighting productivity trends as well as the influence of policy decisions. Beyond these empirical findings, the study contributes methodologically by offering a replicable framework that integrates expert-based judgments and multi-criteria decision-making tools. This approach is adaptable to other regions or sectors where sustainability evaluation requires balancing data constraints, interpretability, and policy relevance.
Future research can take several directions. First, policymakers can leverage the Divisia and Törnqvist–Theil indices to guide budget allocation, prioritize interventions, and monitor efficiency improvements. Second, targeted measures, such as investments in energy-efficient irrigation, labor productivity enhancement programs, and strategic subsidy reallocation, can mitigate structural inefficiencies. Third, applying multi-capital KPIs allows comprehensive sustainability monitoring, while regular TFP assessments provide evidence-based guidance for adaptive policy design. Finally, the framework can be extended to other regions or sectors, facilitating benchmarking, evaluation of alternative interventions, and support for data-driven decision-making.
Overall, this study contributes to a structured and evidence-based assessment of sustainability within agricultural systems, emphasizing the significance of context-sensitive, data-driven, and integrative methodological approaches. Implementing targeted policy measures, such as promoting energy-efficient irrigation technologies, strengthening labor capacity through training programs, and optimizing subsidy allocation, can mitigate existing structural inefficiencies in Tunisia’s agricultural sector. The results highlight the relevance of expert-informed and context-aware decision-making frameworks to effectively support sustainable agricultural development.