Digital Twins in Agriculture: From Technological Promise to Epistemological Tension in Complex Agroecosystems

El Jarroudi, Moussa; Tychon, Bernard; Lahlali, Rachid

doi:10.3390/agriculture16121286

Open AccessReview

Digital Twins in Agriculture: From Technological Promise to Epistemological Tension in Complex Agroecosystems

by

Moussa El Jarroudi

^1,*

,

Bernard Tychon

¹

and

Rachid Lahlali

²

¹

SPHERES Research Unit, Department of Environmental Sciences and Management, University of Liège, B-6700 Arlon, Belgium

²

Phytopathology Unit, Department of Plant Protection, École National d’Agriculture de Meknès, Km 10, Rte Haj Kaddour, BP S/40, Meknes 50001, Morocco

^*

Author to whom correspondence should be addressed.

Agriculture 2026, 16(12), 1286; https://doi.org/10.3390/agriculture16121286

Submission received: 6 May 2026 / Revised: 4 June 2026 / Accepted: 8 June 2026 / Published: 10 June 2026

(This article belongs to the Special Issue Integrated Precision Technologies for Sustainable Agro-Environmental Management in Climate-Smart Agriculture)

Download

Browse Figures

Review Reports Versions Notes

Abstract

Digital twin (DT) technology is attracting increasing interest as a potentially valuable tool for the future of agriculture. By offering a dynamic virtual representation of real agricultural systems, it opens up new possibilities for real-time monitoring, simulation, and decision support. In principle, such approaches could improve predictive capacity, optimize resource use, and support more responsive management strategies. However, agriculture cannot be treated as an engineered system, and this is where important challenges emerge. Agroecosystems are living, context-dependent, and inherently variable, shaped by diverse processes that remain only partly observable and often difficult to model. This makes their representation and prediction considerably more complex than in many industrial applications. In this review, we critically examine the conceptual foundations, architectural frameworks, and current applications of agricultural digital twins (ADTs), while also identifying key scientific and practical constraints that continue to limit their development. Particular attention is given to two recurring issues: the assumption that increasing data availability necessarily improves prediction, and the persistent gap between observable variables and the underlying biological and ecological processes that govern system behaviour. Drawing on conceptual figures and comparative analyses, we highlight important research gaps and argue for a shift in perspective. Rather than pursuing increasingly precise predictions, there is a need to develop digital twins that explicitly account for uncertainty and support more resilient forms of decision-making. In this context, the value of ADTs may lie less in predictive accuracy alone, and more in their ability to help decision-makers navigate complexity, variability, and change.

Keywords:

agricultural digital twins; agroecosystems; complex adaptive systems; predictive uncertainty; partial observability; data–model paradox; decision support systems; artificial intelligence in agriculture; systems modeling; adaptive management; climate change adaptation; resilient agriculture

1. Introduction

In recent years, digital technologies have started to reshape agriculture in ways that are both gradual and far-reaching. Across the sector, decision-making is increasingly informed by data, supported by interconnected tools, and guided by models that attempt to anticipate change and enhance management. This transition is complex, and in many contexts it is still unfolding, but its influence is already noticeable in a wide range of agricultural practices, irrespective of agrifood value chain levels [1,2].

Within this evolving landscape, digital twins have emerged as one of the more ambitious developments. Their principal objective is to create evolving virtual counterparts of real agricultural systems, continuously updated through incoming data, and capable of supporting both predictive analysis and day-to-day decision-making. According to [3], the concept of digital twins originates from industrial engineering, where they were initially developed to replicate and optimize complex mechanical systems. Its adaptation to agriculture, inter alia, reflects growing expectations that similar approaches could help enhance productivity and resource-use efficiency, as well as strengthen the resilience of agrifood systems, particularly under increasing climate variability.

However, the transfer of this concept to agriculture is far from straightforward. Unlike engineered systems, agroecosystems are shaped by complex interactions of biotic and abiotic factors living, and strong environmental variability. These interactions are often non-linear, context-dependent, and influenced by processes that remain only partially understood or difficult to represent within modelling frameworks [4,5,6,7,8,9,10,11]. As a result, applying digital twins to agriculture introduces a fundamental tension between the capacity to predict and control system dynamics and the reality of biological systems that are only partly observable and inherently uncertain.

Despite the rapid expansion of the interest in agricultural digital twins in recent years [2,12,13,14,15,16,17,18], much of the discussion continues to be driven by technological enthusiasm. The dominant narrative tends to emphasise innovation, data integration, and computational advances, while giving comparatively less attention to the conceptual and operational limits that shape real-world agricultural systems. A commonly held assumption is that increasing the volume of data and the sophistication of models will necessarily lead to better understanding and improved predictions. Yet, this assumption is not as straightforward as it may seem. Research on complex systems has repeatedly shown that more data do not automatically translate into greater insight, more reliable forecasts, or better-informed decisions [2,12,13,14,15,16,17,18].

Against this backdrop, the present review adopts a deliberately critical and integrative perspective. Rather than simply listing existing applications, it seeks to examine the assumptions that underpin agricultural digital twins, to assess how they perform across different contexts in agrifood system, and to highlight the structural, epistemological, and practical constraints that continue to shape their development. By taking this step back, the aim is to reposition digital twins within a broader understanding of agriculture as a complex adaptive system, and to proffer directions for future research that are both more realistic and more firmly grounded in scientific reasoning.

2. Bibliometric Overview of the Field

2.1. Scope and Procedure of the Bibliometric Screening

The bibliometric overview presented in this section was designed as a targeted screening of the international peer-reviewed literature in order to identify the main temporal and thematic trends associated with digital twins in agriculture. It was not intended as an exhaustive bibliometric census of all global publications on the topic, but rather as a structured overview supporting the critical analysis developed in this review. The search was conducted in Web of Science Core Collection and Scopus, using combinations of the term “digital twin” with agriculture-related descriptors such as “agriculture”, “farming”, “crop”, “livestock”, “greenhouse”, and “agroecosystem”, applied to titles, abstracts, and keywords. The screening focused on the period 2016–2025, chosen to capture both the emergence of agricultural digital twins as a distinct topic and its subsequent consolidation.

The bibliometric search was conducted in March 2025. The initial screening identified 412 records across Web of Science and Scopus. After duplicate removal (n = 87), 325 records remained for title and abstract screening. Following eligibility assessment and exclusion of non-relevant or non-agricultural studies, a final corpus of 178 publications was retained for qualitative synthesis and bibliometric interpretation.

The retained corpus was used to support the bibliometric screening and thematic analysis presented in this review. Not all publications included in the screening corpus were necessarily cited individually in the manuscript, as references were selected according to their relevance to the specific conceptual, methodological, and critical discussions developed throughout the review. A PRISMA-inspired screening workflow summarizing this procedure has been added as Figure 1.

Only English-language peer-reviewed journal articles and review papers were considered. Conference papers, editorials, notes, book chapters, and documents not directly addressing agricultural applications, architectures, conceptualisations, or limitations of digital twins were excluded.

However, we acknowledge that this selection strategy introduces limitations. Because agricultural digital twins remain an emerging and highly application-oriented field, relevant advances may also appear in conference proceedings, technical reports, industrial project documents, and non-English literature. The present review should therefore be interpreted as a critically curated and structured synthesis rather than a fully exhaustive inventory of all existing ADT-related publications.

The resulting corpus was used to identify the publication trajectory shown in Figure 2 and to support the thematic structuring presented in Figure 3. In this sense, the bibliometric analysis should be interpreted as a structured overview of the reviewed corpus rather than as a claim to full exhaustiveness.

2.2. Bibliometric Evolution of Research on Digital Twins in Agriculture

A bibliometric analysis of the literature reveals that research on Agricultural Digital Twins (ADTs) has undergone a profound transformation over the past decade. What initially emerged as a relatively niche and largely conceptual topic has progressively evolved into a rapidly expanding interdisciplinary research domain situated at the intersection of precision agriculture, artificial intelligence, cyber–physical systems, remote sensing, systems modelling, and decision-support sciences. As illustrated in Figure 2, the cumulative number of publications increased exponentially between 2016 and 2025, reflecting the growing scientific, technological, and societal interest in the application of digital twin concepts to agricultural systems.

However, Figure 2 suggests that publication growth alone provides only a partial representation of the field’s evolution. While scientific production has accelerated markedly, the development of conceptual maturity appears to have progressed at a substantially slower pace. The figure conceptualizes this divergence through an Innovation–Maturity Gap, highlighting the possibility that technological innovation is advancing faster than the conceptual, epistemological, and methodological foundations required to support its long-term development. In other words, the rapid expansion of the field has not necessarily been accompanied by an equivalent consolidation of theoretical frameworks, validation procedures, uncertainty quantification approaches, governance structures, or explanatory models.

This observation is particularly significant because it suggests that the evolution of ADTs is not simply a story of technological progress. Rather, it reflects the convergence of multiple scientific traditions that have historically evolved in relative isolation, including precision agriculture, smart farming, crop and soil modelling, environmental monitoring, artificial intelligence, cyber–physical systems, and digital infrastructure research. As these domains increasingly converge, new challenges emerge regarding interoperability, transferability, explainability, governance, biological realism, and the representation of complex agroecosystem processes.

Figure 2 further indicates that the field may be entering a new developmental phase. Whereas the period between 2016 and 2020 was primarily characterized by technological emergence and acceleration, recent years have witnessed growing attention to issues extending beyond implementation and operational performance. Increasing numbers of studies now address uncertainty quantification, model validation, explainability, governance, interoperability, and the limitations of digital representations when applied to living, adaptive, and context-dependent agroecosystems. These dimensions collectively contribute to what is represented in the figure as conceptual maturity.

Importantly, the figure should not be interpreted as a formal measurement of scientific maturity, nor as a statistically calibrated indicator. The conceptual maturity index represents a heuristic synthesis derived from recurring themes identified throughout the reviewed literature, including validation, uncertainty management, interoperability, governance, explainability, transferability, and biological realism. Its purpose is therefore not to quantify maturity precisely, but to illustrate a broader pattern: although ADTs are experiencing rapid technological expansion, significant challenges remain in establishing the conceptual and epistemological foundations required for their reliable deployment in complex agroecosystems.

From this perspective, the future development of ADTs may depend less on further increases in data volume, computational power, or sensing capacity than on narrowing the Innovation–Maturity Gap. Achieving this objective will require stronger integration between technological innovation and conceptual advancement, ensuring that future ADTs are not only more powerful computationally, but also more interpretable, transferable, biologically realistic, uncertainty-aware, and scientifically robust.

General note on conceptual figures. All conceptual figures presented in this review should be interpreted as heuristic syntheses intended to clarify the main conceptual relationships, trade-offs, paradoxes and systemic tensions emerging from the reviewed literature. Curves, thresholds, normalized scales, relative proportions, directional arrows and graphical relationships are illustrative devices rather than empirical measurements, statistically calibrated outputs or universally validated relationships. They are used to formalize recurring patterns, support comparative interpretation and stimulate critical discussion. Accordingly, these figures should not be read as quantitative models, but as conceptual frameworks designed to make explicit the assumptions, limitations and future research directions associated with Agricultural Digital Twins in complex agroecosystems.

2.3. Thematic Structuring and Conceptual Reorientation of the Field

Alongside this quantitative expansion, the Agricultural Digital Twin (ADT) literature has also undergone a process of conceptual maturation and thematic diversification. As illustrated in Figure 3, the field can be organized into five interconnected domains: (i) modelling and artificial intelligence for prediction and decision support, (ii) technologies and data infrastructures enabling observation and data integration, (iii) applications across agri-food systems, (iv) limits, governance, and epistemological considerations, and (v) foundational and conceptual frameworks that define how digital twins are understood and implemented in agricultural contexts. Rather than representing isolated research streams, these domains form an increasingly interconnected system-of-systems in which technological innovation, modelling capacity, practical applications, and critical reflection co-evolve.

This thematic structure reveals an important shift in the trajectory of the field. Early research was largely driven by technological opportunities associated with sensors, remote sensing, IoT networks, simulation platforms, and artificial intelligence [1,2]. More recent studies, however, increasingly recognize that the effectiveness of ADTs cannot be explained solely by advances in data acquisition or computational sophistication. As a result, growing attention has been directed toward conceptual foundations, interoperability, uncertainty modelling, explainability, governance, and the broader epistemological assumptions underlying digital representations of agroecosystems.

Figure 3 further suggests that the future development of ADTs depends on the integration of these dimensions rather than on progress within any single domain. Advances in sensing technologies may expand observability, while modelling and artificial intelligence improve predictive capabilities; however, their practical value ultimately depends on how these technologies are embedded within coherent conceptual frameworks, interpreted through transparent governance mechanisms, and translated into meaningful agricultural applications. In this sense, the figure highlights that the scientific value of ADTs emerges from the interaction among technological, methodological, institutional, and conceptual dimensions rather than from technological innovation alone.

A growing body of literature is therefore moving beyond questions of implementation and operational performance toward more fundamental questions concerning representation, realism, interpretability, transferability, and uncertainty [1,2]. Researchers are increasingly asking what constitutes a meaningful digital representation of a living agroecosystem, how model assumptions influence decision-making, and whether digital twins can adequately capture the biological heterogeneity, context dependency, and adaptive dynamics that characterize agricultural systems. This evolution signals a transition from a predominantly technology-driven phase toward a more reflexive and theoretically informed stage of development.

Viewed collectively, the five domains represented in Figure 3 illustrate the emergence of a new research agenda in which ADTs are no longer framed merely as predictive technologies, but as integrative socio-technical systems situated at the intersection of data, models, ecological processes, governance structures, and human decision-making. The long-term success of ADTs may therefore depend less on technological sophistication alone and more on the capacity to connect these dimensions within coherent, transparent, and scientifically robust frameworks capable of supporting resilient agri-food systems.

Figure 4 synthesizes one of the central arguments emerging from this review: the evolution of Agricultural Digital Twins (ADTs) is characterized not only by increasing technological capability, but also by a growing divergence between technical performance and system understanding. The figure conceptualizes this trajectory through four successive phases—data scarcity, knowledge gain, data saturation, and epistemic overload—highlighting how the relationship between data accumulation and knowledge generation may become increasingly non-linear in complex agroecosystems.

During the early stages of digitalization, additional data, improved sensing infrastructures, and stronger data–model coupling generally contribute to substantial gains in both predictive performance and system understanding. This phase corresponds to a period of rapid knowledge acquisition in which previously unobserved processes become measurable and model representations improve accordingly. Such expectations have strongly influenced the development of ADTs, which were initially promoted as tools for optimization, automation, predictive control, and real-time synchronization, largely inspired by engineering and cyber–physical systems paradigms [2,3,19,20]. However, Figure 4 suggests that this positive relationship may not persist indefinitely. Beyond a critical data saturation threshold, the marginal contribution of additional data to system understanding progressively decreases. While technical performance may continue to improve through enhanced computational capacity, more sophisticated algorithms, and larger data streams, explanatory power and mechanistic understanding may increase at a much slower rate. This divergence reflects the fact that many agroecosystem processes are governed by biological variability, context dependency, nonlinear interactions, emergent properties, and partially observable mechanisms that cannot be fully resolved through data accumulation alone [7,21,22,23]. The figure further introduces the concept of epistemic overload, a stage in which increasing digitalization may generate expanding layers of hidden uncertainty despite continued improvements in technical performance. In highly connected systems, larger volumes of data may reveal new correlations, interactions, and sources of variability faster than they reduce uncertainty. Consequently, the growth of information can be accompanied by the emergence of new blind spots associated with model misspecification, latent variables, structural uncertainty, and unknown unknowns. Under such conditions, additional data do not necessarily translate into proportional gains in understanding and may even increase the complexity of interpretation.

Importantly, Figure 4 should not be interpreted as an empirical quantification or universally calibrated relationship. The saturation threshold, curve trajectories, and performance levels shown are conceptual representations intended to synthesize recurring patterns identified across the reviewed literature. Rather than describing statistically validated universal responses, the figure illustrates a broader epistemological argument: technological progress and data accumulation may continue to improve operational performance, while the deeper challenge of understanding complex agroecosystem behaviour remains only partially resolved.

In this perspective, the future development of ADTs should not be evaluated solely according to predictive accuracy, computational efficiency, or data volume. Equally important is their capacity to improve system understanding, make uncertainty explicit, identify epistemic blind spots, and support robust decision-making under conditions of incomplete knowledge. Figure 4 therefore highlights a fundamental shift in perspective: the challenge is no longer simply to collect more data, but to determine how much meaningful understanding can be generated from increasingly complex information environments.

More recent contributions increasingly acknowledge that these limitations are not marginal. They arise from the intrinsic properties of agroecosystems themselves. Many key processes remain only partially observable, biological responses vary across contexts, and models are often difficult to transfer across environments and scales. In addition, relationships identified in data may become unstable under changing conditions, while structural uncertainty persists even in situations where large volumes of data are available.

From this perspective, the bibliometric trajectory of the field offers more than a simple indication of publication growth, as it also reflects a deeper conceptual shift. The literature is gradually moving away from an exclusive emphasis on technological capability toward a more critical understanding of digital twins as partial, context-dependent, and model-based representations of agricultural reality. Taken together, Figure 1, Figure 2, Figure 3 and Figure 4 suggest that the future development of agricultural digital twins will depend not only on increasing data availability or computational power, but also on the capacity to design frameworks that explicitly account for uncertainty, remain sensitive to context, and are grounded in a more realistic understanding of biological systems. In this perspective, the value of agricultural digital twins may lie less in producing fixed and fully reliable predictions than in supporting informed and adaptive decision-making under conditions of uncertainty, variability, and partial observability.

3. Conceptual Foundations: Engineering Paradigms vs. Agroecological Reality

The concept of the digital twin has its roots in an engineering perspective, where systems are generally treated as observable, decomposable, and, to a certain extent, controllable [19,24,25,26,27,28]. Within such contexts, digital replicas can be highly effective for simulating system behaviour and supporting optimisation. However, once this perspective is brought into agriculture, its underlying assumptions become less straightforward. Agroecosystems do not generally behave like engineered machines [2,29]. They are better described as complex adaptive systems, shaped by feedback loops, emergent properties, and interactions that unfold across multiple spatial and temporal scales [1,2]. These characteristics make agricultural systems inherently difficult to represent through simplified or reductionist modelling frameworks.

The rapid expansion of digital agriculture has also generated increasing conceptual overlap between Agricultural Digital Twins (ADTs), precision agriculture, smart farming systems, cyber–physical platforms, and data-driven decision-support systems. Although these concepts share several technological foundations, they are not fully equivalent and should not be used interchangeably.

In this review, an Agricultural Digital Twin (ADT) is defined as a dynamically evolving virtual representation of an agricultural system that is continuously updated through bidirectional interactions between physical and digital components, integrates heterogeneous data streams with process-based or hybrid modelling frameworks, and supports adaptive decision-making under changing environmental conditions. This definition differs from conventional precision agriculture approaches, which primarily focus on spatially targeted management, and from smart farming systems that often emphasize automation, connectivity, and sensor integration without maintaining continuously synchronized virtual representations of agroecosystem states.

Similarly, ADTs differ from traditional decision-support systems because they aim to establish continuous synchronization between observed system states, predictive simulations, and adaptive updating mechanisms. However, as discussed throughout this review, such synchronization remains inherently constrained by partial observability, structural uncertainty, and context dependency in agroecosystems. This distinction is particularly important because not all data-intensive agricultural systems can legitimately be considered digital twins. Systems lacking continuous updating, bidirectional interaction, or adaptive assimilation mechanisms may function as advanced monitoring or predictive platforms without fully meeting the operational criteria of ADTs.

This tension becomes particularly apparent in what can be referred to as the data–model paradox. Digital twins are often developed with the implicit expectation that increasing the amount of available data will improve system understanding and enhance predictive performance [19,24,25,26,27,28]. In practice, however, the relationship is not that simple, as more data do not necessarily make a model more robust.

Importantly, this limitation should not be interpreted as implying that increasing data availability is inherently detrimental. High-quality, process-relevant, and biologically meaningful datasets can substantially improve predictive robustness when integrated into appropriate model structures. The paradox emerges primarily when additional data compensate for incomplete or weakly specified models rather than improving mechanistic understanding. In such cases, noisy, redundant, poorly harmonized, or weakly related datasets may amplify uncertainty, increase overfitting, and reduce interpretability.

When the underlying model is structurally deficient, that is, poorly specified, weakly calibrated, or insufficiently aligned with the actual functioning of the system, adding more data does not necessarily improve understanding or prediction; instead, it may amplify noise, complicate interpretation, and in some cases increase uncertainty rather than reduce it [5,21,27,28,30,31,32,33,34,35]. In this sense, the link between data volume and predictive reliability should be understood as conditional rather than cumulative.

Examples of this phenomenon have been reported in crop yield forecasting systems integrating heterogeneous satellite datasets, disease detection frameworks relying on unstable spectral signatures, and soil moisture assimilation systems where increasing sensor density introduced inconsistencies between observational scales and model structure. These examples suggest that predictive instability may emerge not only from data scarcity, but also from mismatches between data complexity and the capacity of models to represent underlying biological processes.

As illustrated in Figure 5, the fundamental limitation of Agricultural Digital Twins (ADTs) may not arise from insufficient data availability alone, but from the existence of an inherent observability frontier that constrains how much of an agroecosystem can be directly measured, represented, and understood. Agroecosystems are complex socio-ecological systems shaped by interacting atmospheric, biological, hydrological, agronomic, and socio-economic processes operating across multiple spatial and temporal scales. While advances in remote sensing, sensor networks, UAVs, artificial intelligence, and data integration continue to expand the observable domain, a substantial fraction of system behaviour remains only partially observable or entirely hidden. Ecological interactions, microbiome dynamics, stress memory, adaptive human behaviour, emergent ecosystem properties, and cross-scale feedback mechanisms often remain beyond direct observation and are therefore only indirectly represented within digital twin architectures.

Figure 5 conceptualizes this challenge through the notion of an observability frontier, representing the epistemic boundary separating observable signals from latent system drivers. In this perspective, a digital twin should not be interpreted as a complete digital replica of reality, but rather as a dynamic approximation constructed from partial observations, model assumptions, and inferred relationships. Consequently, increasing data acquisition does not necessarily eliminate uncertainty, because many sources of uncertainty originate not from data scarcity alone but from structural limitations in system representation, incomplete process understanding, context dependency, and emergent dynamics.

Importantly, Figure 5 should not be interpreted as an empirical quantification or a universally calibrated representation of digital twin performance. Instead, it provides a heuristic conceptual synthesis of recurring patterns identified throughout the reviewed literature. The observability frontier, hidden drivers, and modelled representations shown in the figure are not intended to represent measurable thresholds or statistically validated relationships. Rather, they illustrate a fundamental epistemological constraint: regardless of technological progress, a portion of agroecosystem complexity is likely to remain only partially observable and therefore only partially representable within any modelling framework.

This interpretation carries important implications for the future development of ADTs. Rather than pursuing the unattainable objective of perfect prediction through ever-increasing data accumulation, future systems may benefit more from explicitly acknowledging uncertainty, identifying blind spots, and exploring alternative futures. In this sense, the figure supports a broader conceptual transition from prediction-centred modelling toward resilient decision intelligence, where the primary objective is not to forecast reality perfectly but to support robust, transparent, and adaptive decisions under conditions of incomplete knowledge. The value of ADTs may therefore depend less on their ability to eliminate uncertainty than on their capacity to make uncertainty visible, understandable, and actionable for decision-makers.

At the same time, agricultural digital twins operate within a fundamentally constrained observational space. What can be measured captures only a fraction of the processes that actually drive agroecosystem dynamics. Unlike engineered systems, where internal states can often be monitored directly or inferred with reasonable confidence, agroecosystems are influenced by many latent processes that remain difficult, and sometimes impossible, to observe directly. These include root–soil interactions, microbial dynamics, plant physiological responses to abiotic and abiotic stresses, and a range of ecological interactions that occur across different trophic levels.

Figure 5 highlights the consequences of this limitation by illustrating the existence of an epistemological gap between the system as it is observed and the system as it actually functions. Observable variables—such as canopy reflectance, soil moisture, or temperature—provide useful information, but they remain indirect proxies for deeper biological and ecological processes. From this perspective, digital twins cannot be considered complete representations of agroecosystem reality. Rather, they should be understood as partial approximations, built from a necessarily reduced and filtered view of the system.

This partial observability introduces a form of structural uncertainty that cannot simply be eliminated through increasing data availability because the missing information relates to processes that are inherently difficult to observe at operational scales [23,36,37,39,41,42]. Consequently, predictive outputs may appear quantitatively precise while remaining biologically incomplete. This creates what can be described as an epistemological asymmetry between observable system states and the hidden causal processes driving agroecosystem behaviour [23,36,37,39,41,42].

This limitation has several important implications. First, it weakens confidence in predictive outputs, particularly when models rely heavily on correlations derived from observable variables without adequately capturing the causal mechanisms governing system behaviour. Second, it challenges the assumption that digital twins can provide fully reliable representations of biological systems. More broadly, it reveals a fundamental epistemological tension at the core of agricultural digital twins: these systems are designed to simulate realities that can only ever be partially observed.

As illustrated in Figure 6, the gap between measurable variables and hidden agroecosystem processes should therefore not be considered a temporary technological limitation that will disappear automatically with increasing sensor density or computational power. Instead, it reflects deeper scientific constraints linked to the intrinsic complexity, heterogeneity, and multi-scale organization of living systems.

While digital twins can undoubtedly provide valuable insights and improve operational management under certain conditions, their outputs must be interpreted cautiously, particularly in situations where important processes remain hidden or only indirectly inferred. In this sense, the challenge is not purely technical but fundamentally conceptual. It concerns the limits of what can realistically be known, measured, represented, and predicted in complex agroecosystems.

Taken together, Figure 5 and Figure 6 show that the central challenge facing Agricultural Digital Twins (ADTs) is not merely the acquisition of larger datasets or the deployment of more computationally sophisticated models. The deeper challenge is epistemological: how to construct useful digital representations of systems whose functioning is only partially observable, context-dependent, and shaped by hidden biological, ecological and socio-economic processes. These figures therefore challenge the engineering assumption that digital replication can progressively converge toward a complete representation of reality. In agroecosystems, uncertainty is not only a temporary technical limitation caused by insufficient data; it is also an intrinsic property of living systems characterized by emergent behaviour, cross-scale interactions, biological variability and adaptive responses. Consequently, future ADTs should not be designed as attempts to mirror agroecosystems exhaustively, but as adaptive approximation systems capable of identifying what is observable, clarifying what remains uncertain, and supporting robust decisions under incomplete knowledge.

4. Digital Twin Architectures: Integration Challenges and Fragmentation

Agricultural digital twins are often described as layered systems that integrate data acquisition, modelling components, data harmonisation, and decision-support modules. On paper, this type of architecture appears both coherent and compelling. It suggests a seamless integration of scales, data sources, and analytical processes, with the potential to support real-time monitoring and predictive management of agroecosystems [17,19]. However, this apparent coherence becomes more difficult to sustain when confronted with the realities of agricultural systems. In practice, most digital twin implementations fall short of functioning as fully integrated systems. Instead, they tend to remain fragmented, both technically and institutionally, reflecting the complexity of agricultural environments and the difficulty of coordinating heterogeneous data, models, actors, and infrastructures.

A central source of this fragmentation lies in the inherently heterogeneous nature of agricultural data. Observations of agroecosystems are generated from a wide range of sources, such as remote sensing platforms, in situ sensors, farm machinery, and farmer-reported information. These data streams differ not only in format and resolution, but also in spatial and temporal scale, as well as in their associated uncertainties and error structures [18,21]. This diversity is not merely a technical issue that can be resolved through improved data management. It reflects the underlying nature of agricultural systems themselves, where processes operating at the microbial or root level can interact with, and influence, dynamics at the field or landscape scale in highly non-linear ways [7]. Under such conditions, integrating these heterogeneous data streams into a single digital twin architecture often requires substantial simplifications, which may obscure important dimensions of system behaviour.

These integration challenges are further compounded by the lack of widely adopted data standards, shared ontologies, and effective interoperability frameworks. Initiatives such as the FAIR data principles [43] and the development of agricultural data platforms have contributed to progress in this field. However, their implementation remains uneven and, in many cases, insufficient to support fully operational and scalable digital twins. In practice, data pipelines are often assembled in a project-specific manner, which limits their reproducibility and makes it difficult to transfer solutions across regions or production systems [2,44]. As a result, many digital twin architectures remain context-dependent, fragile, and difficult to generalise. This lack of interoperability is not only a technical limitation; it also raises broader concerns about reproducibility and scientific robustness.

Beyond technical considerations, digital twin architectures are increasingly embedded within proprietary digital ecosystems, often controlled by a relatively small number of private actors. These platforms frequently operate as closed systems, where access to data, algorithms, and even model structures is restricted [18,45]. In such contexts, asymmetries in data ownership and control can emerge. Farmers and local stakeholders contribute large volumes of data, yet they do not always retain control over how these data are used, shared, or translated into valuable outputs. Nor do they necessarily benefit proportionally from the insights generated. This concentration of data and computational capacity raises important questions related to data sovereignty, transparency, and power dynamics within digital agriculture [46,47].

These issues cannot be reduced to a simple lack of technological maturity. They point instead to deeper structural tensions at the intersection of biological complexity, data science, and socio-economic organisation. Digital twins are often built on the assumption that systems can be standardised, observed, and, to some extent, controlled. Yet agroecosystems do not easily conform to this logic. They are variable, context-specific, and only partially observable, which creates a fundamental mismatch between the architectural ambitions of digital twins and the realities they aim to represent. The result is not only technical fragmentation, but also a more profound epistemological constraint, as these systems attempt to impose coherence on inherently heterogeneous and dynamic realities.

This fragmentation has direct implications for decision-making. When models are constructed on the basis of incomplete, weakly integrated, or biased data streams, their outputs may fail to reduce uncertainty and, in some cases, may even reinforce it. The apparent precision of digital twin outputs can therefore be misleading, creating a sense of reliability that is not always supported by the underlying data or model structures [48]. In agricultural contexts, this is particularly critical, as decisions based on such outputs can have lasting consequences for ecosystems, resource use, and farm livelihoods.

Taken together, these observations suggest that the current development of digital twins in agriculture is characterised less by full integration than by multiple layers of fragmentation, technical, institutional, and epistemological. Addressing these challenges will certainly require advances in interoperability, data infrastructures, and modelling approaches. However, technical improvements alone are unlikely to be sufficient. There is also a need for a more critical reassessment of the assumptions underlying digital twin architectures, particularly with regard to data governance, system representation, and the limits of predictive control in complex living systems.

5. Applications of Digital Twins: Performance, Context-Dependency, and Structural Limitations

Digital twins have rapidly gained attention as one of the most ambitious and potentially integrative concepts in agricultural research, with applications ranging from yield prediction and disease surveillance to resource optimization and farm management. Their growing popularity stems from the expectation that continuous integration of observations, modelling frameworks, and decision-support systems can improve agricultural management under increasingly variable environmental conditions.

Concrete examples of Agricultural Digital Twin (ADT) implementation now include greenhouse tomato production systems integrating climate-control digital twins, maize yield forecasting frameworks combining Sentinel-2 assimilation with crop growth models, wheat disease early-warning systems integrating UAV imagery with epidemiological simulations, livestock behavioural monitoring systems based on wearable sensors, and irrigation scheduling platforms combining soil-moisture sensing with weather-driven decision-support systems. These examples illustrate the diversity of ADT applications currently emerging across crop and livestock production systems.

In the area of yield prediction, several studies report improvements in predictive accuracy under controlled conditions [44,49]. At the same time, some recent studies demonstrate that hybrid modelling approaches combining mechanistic crop models with adaptive machine-learning frameworks can improve robustness, particularly when dynamic updating and multi-season calibration are incorporated. Several studies have also reported encouraging results regarding regional transferability when models are deployed within relatively homogeneous agroecological regions and supported by continuous calibration procedures. However, these systems typically remain strongly dependent on site-specific calibration, cultivar-dependent parameters, and local climatic conditions [22].

Moreover, the apparent success of data assimilation approaches can sometimes mask deeper structural limitations. Repeatedly correcting model outputs with observational data may reduce errors in the short term, but this does not necessarily indicate that the underlying representation of system processes has improved. In this sense, a form of “performance illusion” may emerge, where models appear accurate while remaining structurally incomplete [21]. Under such conditions, digital twins may function less as mechanistic representations of agroecosystem dynamics and more as adaptive interpolation tools.

A similar tension can be observed in plant disease monitoring. In this domain, digital twins are increasingly combined with proximal and remote sensing technologies designed to detect early signals of plant stress. Indicators such as spectral signatures, thermal anomalies, and fluorescence measurements are used to infer plant health and anticipate disease development [23,50]. While these approaches are promising, their interpretation remains complex. Spectral signals are rarely specific to a single cause and can be influenced by a range of abiotic and biotic factors, including water stress, nutrient deficiencies, canopy structure, and variations in illumination [51]. As a result, distinguishing disease-related signals from background variability is not always straightforward, increasing the risk of false positives or misclassification, particularly in operational settings where decisions depend on reliable and interpretable outputs.

The temporal dynamics of plant diseases add another layer of complexity. Many pathogens follow non-linear infection pathways, involve latency periods, and respond to threshold effects that cannot easily be captured through observable proxies alone [52]. Consequently, the early-warning potential often attributed to digital twins may be constrained by the very nature of the signals on which they rely. This reinforces the gap between observable indicators and the biological processes that actually drive disease dynamics.

Digital twins have also been widely promoted as tools for optimising resource use, particularly in irrigation and nutrient management. By combining soil moisture data, weather information, and crop models, these systems are expected to support more precise and adaptive management decisions [2,44]. Reported benefits include reduced water use, improved nutrient efficiency, and, in some cases, higher economic returns. Several pilot studies have reported water savings exceeding 15–30% while maintaining comparable yield levels, suggesting substantial potential for resource-efficient management when digital twins are properly integrated into farm decision processes.

However, many of these results are derived from experimental or pilot-scale studies and may not fully reflect the variability and constraints of real farming conditions.

Importantly, the long-term stability of these reported benefits remains insufficiently documented. Most published studies evaluate ADT performance over relatively short periods, often covering only one or a few growing seasons. Consequently, evidence regarding their robustness under prolonged climatic variability, extreme weather events, and changing management conditions remains limited. Longitudinal validation studies are therefore urgently needed to assess whether reported gains persist through time.

Future evaluation frameworks for agricultural digital twins should therefore move beyond predictive accuracy alone and incorporate rigorous multi-dimensional validation strategies. These should include independent validation across years, sites, cultivars, management systems, and climatic conditions; explicit separation between calibration and validation datasets; systematic reporting of uncertainty intervals; comparison with non-digital-twin baseline models; and assessment of real operational outcomes such as water-use efficiency, pesticide reduction, nutrient-use efficiency, yield stability, labour requirements, economic benefit, and farmer decision quality. Without such validation procedures, the apparent precision of ADT outputs may overstate their actual robustness, transferability, and practical value under real farming conditions.

More broadly, the effectiveness of these systems cannot be assessed solely in terms of technical performance. Their practical value also depends on user behaviour, institutional contexts, and economic conditions. Farmers may face barriers related to cost, access to data, technological literacy, and trust, all of which influence the adoption and impact of digital twin technologies [46]. In addition, successful implementation increasingly depends on co-design approaches involving farmers, advisers, agronomists, and technology developers. Emerging evidence suggests that user participation during system development improves transparency, trust, interpretability, and operational relevance, thereby increasing the likelihood of long-term adoption. This highlights that the effectiveness of ADTs is not purely a function of algorithmic performance but also of their integration within broader socio-technical systems.

The growing incorporation of explainable artificial intelligence (XAI) techniques may further contribute to addressing this challenge by improving model transparency and helping users understand the rationale behind digital twin recommendations. Such developments could help bridge the gap between predictive capability and user trust, which remains a critical limitation in many current implementations.

Taken together, these observations suggest that while digital twins hold considerable promise across a range of agricultural applications, their current performance remains shaped by a persistent tension between local success and limited generalisability. Rather than representing a universal solution, they are better understood as context-dependent tools whose effectiveness is constrained by data quality, model structure, system complexity, and socio-economic conditions.

The evidence reviewed here therefore suggests that future progress will depend not only on improving sensing technologies, computational capacity, or model sophistication, but also on strengthening long-term validation, improving interoperability, enhancing explainability, and developing participatory implementation frameworks capable of integrating technical innovation with the realities of agricultural decision-making. Table 1 provides a synthesis of these applications, highlighting both reported benefits and their underlying limitations.

As agricultural digital twins increasingly incorporate machine-learning and AI-driven predictive frameworks, an important tension emerges between predictive performance and interpretability. While black-box models may improve short-term forecasting capacity, they can simultaneously obscure the causal mechanisms underlying agroecosystem dynamics. This prediction–interpretability trade-off becomes particularly critical in biological systems characterized by non-linearity, context dependency, and partial observability, where understanding causal relationships may be as important as improving predictive accuracy itself.

6. Critical Analysis: Structural, Epistemological, and Operational Limits of Agricultural Digital Twins

One of the main limitations of digital twins in agriculture stems from the need to simplify inherently complex agroecosystems into a set of variables that can be measured and modelled. This simplification is, in many ways, unavoidable. Yet it also comes at a cost. In making these systems computationally manageable, certain dimensions that are central to their functioning tend to be reduced, or sometimes left aside altogether. This is particularly evident for belowground processes, microbial dynamics, trophic interactions, and biodiversity-driven regulation, which operate across multiple scales and are often only partially observable [7,55,56,57,58]. As a result, digital twins tend to emphasise what can be measured relatively easily, rather than what necessarily matters most, introducing a form of structural bias in how agroecosystems are represented.

This limitation becomes particularly important when considering the increasing interest in nature-based and biodiversity-driven agricultural systems, where ecosystem services often emerge from interactions that remain difficult to monitor directly. Consequently, the capacity of digital twins to represent ecological resilience mechanisms remains intrinsically constrained by the observability of the processes on which these mechanisms depend.

This tendency reflects a broader epistemological challenge that extends beyond agriculture itself. Digital twins implicitly assume that system behaviour can be adequately captured through measurable variables and computational representations. However, agroecosystems are not merely collections of measurable components. They are living systems characterized by emergence, self-organization, adaptation, evolutionary dynamics, and context-dependent interactions. Many of these properties arise from interactions among system components rather than from the components themselves, making them inherently difficult to quantify and model explicitly.

Particularly problematic is the representation of biological interactions occurring below the sensor horizon. Root–microbiome associations, rhizosphere signalling processes, microbial functional redundancy, ecological resilience mechanisms, and biodiversity-mediated buffering effects remain largely invisible to current monitoring infrastructures. Yet these processes often play a central role in determining crop productivity, nutrient cycling, disease suppression, and system resilience. Consequently, even highly sophisticated digital twins may omit processes that fundamentally shape agroecosystem functioning.

This bias has important implications when it comes to capturing emergent behaviour. Agroecosystems rarely respond in simple or predictable ways. Their dynamics are shaped by feedback loops, threshold effects, and, at times, abrupt shifts, where relatively small changes can trigger disproportionately large consequences [59,60,61,62]. When key processes are missing or overly simplified, digital twins may struggle to anticipate such transitions, particularly when system behaviour depends on interactions that are not explicitly represented. In that sense, the simplifications required for modelling are not just technical compromises, they also act as structural constraints that limit the ability of digital twins to reflect the deeper dynamics of living systems.

Recent advances in hybrid modelling approaches that combine mechanistic knowledge with machine-learning algorithms have attempted to address some of these limitations. While these approaches offer promising opportunities to better integrate biological understanding into digital twin architectures, their ability to adequately capture emergent system properties and cross-scale ecological interactions remains largely untested under diverse agroecological conditions.

From an epistemological standpoint, digital twins also raise a more fundamental question: what kind of knowledge is actually being produced in data-intensive agriculture? The rapid growth of data availability and computational capacity has encouraged the idea that more data will naturally lead to better understanding and improved prediction. However, this assumption is not always borne out in practice. As emphasised by Saltelli et al. (2020), models should not be evaluated solely on their predictive performance [63]. Their value also depends on transparency, robustness, and their capacity to capture the causal mechanisms underlying system behaviour. This issue becomes particularly relevant when digital twins incorporate machine learning approaches. While such models can enhance predictive performance, they often do so at the expense of interpretability, making it more difficult to understand what is actually being captured [21,48].

This challenge has stimulated growing interest in explainable artificial intelligence (XAI) approaches, which seek to improve transparency and provide greater insight into the relationships identified by data-driven models. Although these approaches represent an important step toward improving model interpretability, they do not fully resolve the broader challenge of linking predictive outputs to causal biological mechanisms.

In this context, there is a risk of what might be described as an “illusion of precision.” Highly detailed simulations can give the impression of accuracy and control, even when the underlying model structure remains uncertain or incomplete. In many cases, digital twins rely on correlations derived from observable variables without fully accounting for the causal processes that drive system dynamics. In agriculture, this distinction is particularly important. Decisions based on unstable or poorly understood relationships can lead to unintended consequences, especially when models are applied outside the contexts in which they were originally developed [2,21,47,64,65]. These limitations become even more critical under conditions of global change, where systems are evolving and historical data may no longer provide a reliable guide.

Importantly, this potential “illusion of precision” should not be interpreted as an inherent property of all Agricultural Digital Twins. Rather, it may emerge under specific circumstances, particularly when predictive performance is emphasized without equivalent attention to model assumptions, uncertainty characterization, biological realism, or independent validation. Several studies have demonstrated substantial operational benefits of data-driven approaches within clearly defined experimental contexts. Nevertheless, the extent to which such performance can be generalized across contrasting agroecological conditions remains insufficiently documented.

Moreover, the transferability of digital twin models across regions, production systems, and climatic conditions remains insufficiently documented. Models calibrated under specific environmental conditions may exhibit substantially reduced performance when confronted with novel combinations of climatic, biological, or management factors. This raises important questions regarding the long-term robustness and generalizability of current digital twin implementations.

This qualification is particularly important because many of the performance gains reported in the literature—including improvements in prediction accuracy, disease detection, irrigation scheduling, and resource-use efficiency, have frequently been obtained under relatively controlled conditions, pilot-scale deployments, experimental farms, greenhouse environments, or limited temporal observation windows. While these studies provide valuable proof-of-concept evidence, they do not necessarily demonstrate equivalent levels of performance under the full diversity of real-world farming conditions.

From a more practical perspective, digital twins are also demanding in terms of implementation. Their deployment requires not only access to data, but also the infrastructure, computational resources, and expertise needed to manage and update these systems over time. In many regions, such conditions are not easily met [17,18]. This creates a clear barrier to adoption, particularly for smallholder farmers or for agricultural systems operating with limited digital resources. In such contexts, digital twins may inadvertently reinforce existing inequalities in access to innovation, rather than helping to reduce them [46,66].

This issue is particularly relevant given that smallholder farming systems contribute substantially to global food production. If digital twin technologies remain dependent on costly infrastructures and specialized expertise, their benefits may remain concentrated within highly capitalized agricultural systems, thereby exacerbating existing technological and economic disparities.

Beyond these technical constraints, the effective use of digital twins also depends on human and institutional factors that are sometimes overlooked. Their usefulness is closely tied to user capacity, trust, and the availability of appropriate support systems. Farmers need to be able to interpret model outputs and translate them into practical decisions, even when those outputs are uncertain or difficult to interpret. This raises important questions about usability, transparency, and the extent to which these systems are designed with end-users in mind [47].

In addition, questions related to data governance, ownership, privacy, and algorithmic transparency are becoming increasingly important as agricultural digital twins rely on expanding networks of sensors, cloud infrastructures, and proprietary analytical platforms. The future success of ADTs may therefore depend as much on the development of transparent governance frameworks as on further technological innovation.

Without a stronger integration of social and institutional dimensions, digital twins risk remaining technologically sophisticated tools that are only partially aligned with the realities of agricultural practice.

Furthermore, the literature continues to suffer from a lack of harmonized validation procedures. Many studies rely on short-term experiments, single-site evaluations, or narrowly defined performance metrics.

Although several studies report encouraging improvements in predictive accuracy, water-use efficiency, disease detection performance, and operational decision support, these outcomes should be interpreted within the scope of the experimental conditions under which they were obtained. The current evidence base remains dominated by relatively short-term and context-specific evaluations, making it difficult to determine whether similar levels of performance can be consistently maintained across different crops, climatic regions, management systems, and socio-economic contexts.

Future research should therefore prioritize multi-year, multi-site, and multi-crop validation frameworks, together with explicit uncertainty quantification and comparisons against alternative modelling approaches.

Such validation efforts should move beyond conventional technical metrics alone and also assess robustness, transferability, economic viability, user adoption, environmental trade-offs, and performance under unexpected or extreme conditions. This broader evaluation perspective would provide a more realistic assessment of the capacity of Agricultural Digital Twins to support resilient decision-making in complex agroecosystems.

Such efforts are essential for establishing the robustness, credibility, and operational relevance of digital twins under real-world agricultural conditions.

Taken together, these structural, epistemological, and operational limitations suggest that the current promise of agricultural digital twins remains constrained by more than just technical challenges. They point to deeper issues related to how agroecosystems are represented, how knowledge is produced, and how these tools are implemented in practice. Table 2 summarises these constraints, highlighting how they interact across different dimensions and contribute to shaping both the potential and the limits of digital twins in agriculture.

Collectively, these limitations suggest that future progress will depend not only on improvements in sensing technologies, computational power, and modelling sophistication, but also on the ability to develop transparent, biologically realistic, uncertainty-aware, and socially legitimate digital twin frameworks capable of supporting decision-making under the inherent complexity of agroecosystems.

7. Research Gaps, Scientific Controversies, and Future Directions

The literature on digital twins in agriculture is evolving rapidly, but it also reveals a number of unresolved tensions, both conceptual and methodological. Digital twins are often presented as integrative and predictive tools with the potential to transform agricultural systems. However, a closer examination suggests that there is still a noticeable gap between these ambitions and what is currently achieved in practice. The main scientific controversies underlying this gap are summarised in Table 3, while the key research gaps and future priorities are outlined in Table 4.

One of the most persistent assumptions in this field is that increasing data availability will automatically improve predictive performance. While this idea is intuitively appealing, the evidence suggests a more complex reality. When models are poorly constrained, structurally limited, or only loosely connected to the actual dynamics of agroecosystems, adding more data can introduce noise, increase redundancy, and even destabilise predictions rather than improve them [21,44]. In complex systems, where many processes remain only partially observable, the relationship between data volume and model performance is therefore not linear. As reflected in Table 3, this issue highlights a broader tension between data abundance and model adequacy, suggesting that beyond a certain point, additional data may reduce rather than enhance predictive reliability. Importantly, this observation should not be interpreted as evidence that increasing data availability is inherently detrimental. Rather, it suggests that predictive improvements depend critically on the capacity of model structures to transform data into biologically meaningful representations of agroecosystem processes. High-quality and process-relevant observations can substantially improve predictive robustness when integrated within appropriate modelling frameworks. The controversy therefore lies less in the quantity of available data than in the relationship between data complexity and model adequacy [21,44].

A second major limitation concerns the way models are validated. Many studies reporting strong performance still rely on relatively narrow datasets, short observation periods, or controlled experimental conditions that do not fully capture the variability of real agricultural systems [22]. In some cases, models are even evaluated using the same data that were used for calibration, raising concerns about overfitting and inflated performance estimates. More importantly, validation across different environments, cropping systems, and time periods remains limited. This makes it difficult to assess how robust and transferable these models actually are. As highlighted in Table 3 and Table 4, improving validation practices remains a key priority for strengthening the scientific credibility of digital twins in agriculture.

Beyond conventional calibration procedures, future validation strategies should incorporate independent evaluations across years, sites, climatic conditions, management systems, and crop types. Particular attention should also be given to uncertainty quantification, robustness under extreme events, and comparisons with non-digital-twin baseline approaches. Such validation frameworks are essential for determining whether reported improvements genuinely reflect model generalization or merely local optimisation effects [22].

The growing integration of artificial intelligence further complicates this picture. Machine learning and deep learning approaches can enhance predictive performance by identifying complex and non-linear patterns that are difficult to capture with traditional models [21,48]. However, these gains often come with a trade-off. As models become more powerful, they may also become less transparent. In many cases, they function as black-box systems, where predictive accuracy increases while interpretability declines.

This prediction–interpretability trade-off has emerged as one of the most important scientific controversies associated with agricultural digital twins. While highly flexible machine-learning algorithms can identify complex non-linear relationships, they often provide limited insight into the causal mechanisms underlying system behaviour. Recent developments in explainable artificial intelligence (XAI), hybrid mechanistic–AI approaches, and causal modelling frameworks represent promising avenues for addressing this limitation. However, their implementation within operational agricultural digital twins remains relatively limited and requires further evaluation under diverse agroecological conditions [21,48]. The challenge is particularly important in biological systems, where understanding why a prediction is generated may be as important as the prediction itself. In agroecosystems characterized by partial observability, emergent behaviour, and strong context dependency, predictive performance alone may be insufficient to ensure scientific credibility or operational reliability. Consequently, future ADTs will likely need to balance predictive power with explainability and biological realism [21,48].

These controversies are summarized in Table 3, while key research gaps and priorities are outlined in Table 4. Together, Table 3 and Table 4 illustrate that many of the current limitations of ADTs originate not only from technological constraints but also from unresolved conceptual questions concerning uncertainty, causality, system representation, and model validity.

A major future direction for Agricultural Digital Twins (ADTs) lies in the transition from deterministic prediction engines to adaptive decision navigators, as conceptualized in Figure 7. This transition reflects a profound shift in the underlying philosophy of digital twin development. Conventional ADTs typically aim to identify a single most likely future based on historical observations, model calibration, and best-estimate predictions. While such approaches can provide valuable insights under relatively stable conditions, their effectiveness may decline when agroecosystems are exposed to multiple interacting sources of uncertainty, including climatic variability, biological dynamics, socio-economic disruptions, policy changes, and adaptive human responses.

Figure 7 illustrates an alternative paradigm in which uncertainty is no longer treated as a residual error to be minimized, but as an intrinsic and unavoidable feature of agroecosystem functioning. In this framework, adaptive ADTs explicitly integrate uncertainty quantification, data assimilation, machine learning, feedback mechanisms, and process-based modelling to generate ensembles of plausible futures rather than a single deterministic forecast. The objective therefore shifts from predicting one future accurately to exploring multiple possible trajectories and identifying management strategies that remain effective across a range of uncertain conditions.

This evolution aligns ADTs with key principles of resilience theory, adaptive management, robust decision-making, and scenario-based planning. Rather than optimizing decisions for a single anticipated future, adaptive digital twins support the identification of robust strategies capable of maintaining system performance under diverse climatic, biological, economic, and institutional scenarios. In this sense, the future value of ADTs may depend less on their ability to forecast perfectly and more on their capacity to help decision-makers navigate uncertainty, anticipate alternative futures, and enhance the adaptive capacity of agroecosystems.

More fundamentally, Figure 7 suggests that the next generation of ADTs should be evaluated not solely according to predictive accuracy, but also according to their ability to support resilient, flexible, and context-aware decision-making. This represents a transition from a paradigm of prediction and control toward a paradigm of learning, adaptation, and uncertainty-informed governance. The ultimate ambition is therefore not the production of a single optimal recommendation, but the construction of decision-support systems capable of remaining informative, transparent, and operational across multiple plausible futures.

At the same time, the increasing complexity of agroecosystems introduces a fundamental challenge for Agricultural Digital Twins (ADTs), conceptualized in Figure 8 as the Digital Twin Relevance Paradox. As agroecosystem complexity increases, the need for digital twins becomes progressively greater because decision-making must account for nonlinear interactions, environmental variability, biological heterogeneity, and multiple sources of uncertainty operating across scales. However, the reliability of current ADTs may decline beyond a critical complexity threshold because many of the processes governing system behaviour remain only partially observable, weakly represented, or structurally inaccessible to existing modelling frameworks. These include emergent ecological properties, microbiome dynamics, cross-scale feedback mechanisms, socio-economic responses, governance influences, and adaptive human decision-making. Consequently, ADTs tend to perform most reliably in relatively simple and controlled systems, whereas their greatest potential value lies in highly complex systems characterized by strong uncertainty and limited predictability. This paradox highlights a central limitation of current digital twin architectures and suggests that future progress will depend not only on increasing data availability or computational power, but also on improving the representation of hidden processes, uncertainty propagation, adaptive learning mechanisms, and cross-scale interactions that shape agroecosystem dynamics [60].

Finally, Figure 8 further exposes a critical mismatch between where Agricultural Digital Twins (ADTs) currently perform most reliably and where they are most urgently needed. In relatively simple or controlled systems, model structures are easier to parameterize, uncertainty is more constrained, and observed variables are more closely linked to system behaviour. Under these conditions, ADTs can reach comparatively high reliability, but their added value for decision support may remain limited because system dynamics are already more predictable. Conversely, as agroecosystem complexity increases, the need for ADTs becomes greater because farmers and decision-makers must operate under stronger uncertainty, nonlinear dynamics, interacting stressors, and socio-ecological feedback. Yet these are precisely the conditions under which current ADTs become less reliable, because hidden processes, emergent behaviour, microbiome dynamics, climate extremes, governance constraints, and human decision-making are difficult to observe, formalize, and validate. This creates a relevance paradox: ADTs may be most reliable where they are least needed, and most needed where they are least reliable. The implication is not that ADTs are unsuitable for complex agroecosystems, but that their future development must move beyond technological maturity alone. Progress will require uncertainty-aware modelling, cross-scale validation, adaptive learning, transparent governance, and stronger representation of latent biological and socio-economic processes. Figure 8 therefore calls for more realistic expectations: the scientific value of ADTs should not be judged only by predictive performance under controlled conditions, but by their ability to remain informative, robust, and decision-relevant when agroecosystem complexity increases. Beyond purely scientific and technical challenges, future developments will also need to address broader governance and implementation issues. Questions related to data ownership, algorithmic transparency, digital sovereignty, accessibility, farmer participation, and equitable access to technological innovation are likely to become increasingly important as digital twin deployment expands. Without appropriate governance frameworks and participatory implementation strategies, the benefits of ADTs may remain concentrated within highly capitalized agricultural systems, potentially reinforcing existing technological and socio-economic inequalities.

Taken together, these research gaps and ongoing controversies point to a broader conclusion: the future of digital twins in agriculture will depend on more than technological progress alone. It will require improvements in validation practices, greater attention to model interpretability, and a more explicit integration of uncertainty into both modelling and decision-making processes. It will also require stronger integration of biological realism, multi-scale system representation, transparent governance frameworks, participatory design approaches, and uncertainty-aware decision-support architectures capable of operating under the intrinsic complexity of agroecosystems. The issues summarized in Table 3, the priorities outlined in Table 4, and the conceptual shifts illustrated in Figure 7, Figure 8 and Figure 9 all converge toward the same direction. For digital twins to mature as a scientific and operational approach, their development will need to become more critical, more transparent, more biologically grounded, and more firmly anchored in the realities of complex agricultural systems.

8. Conceptual Framework Development and Figure Construction

The conceptual figures presented in this review were not designed simply as visual illustrations of the text. They form an integral part of the analytical framework underlying the manuscript. Their development is based on a careful synthesis of the literature, combined with a systems-oriented perspective and an explicit effort to reflect on the epistemological dimensions of digital twin research. Rather than reproducing empirical outputs, these figures are intended as conceptual tools that help make visible the recurring patterns, inconsistencies, and tensions identified across studies on digital twins, agroecosystem complexity, and data-driven modelling [17,21,44].

The construction of these figures followed a structured, yet iterative process. It began with a close reading of the literature in order to identify the main conceptual tensions shaping the field. Among the most prominent were the non-linear relationship between data availability and model performance (Figure 5), the gap between observable variables and underlying biological processes (Figure 6), and the mismatch between technological maturity and system complexity (Figure 9). Rather than treating these issues in isolation, they were considered as interconnected expressions of deeper structural constraints that emerge when digital twins are applied to complex living systems [7,63].

Particular attention was given to identifying concepts that appeared consistently across multiple studies despite differences in agricultural systems, modelling approaches, and technological infrastructures. Only recurring themes supported by convergent evidence from the reviewed literature were incorporated into the conceptual framework. This approach was adopted to minimise arbitrary interpretation and to ensure that the figures reflected broad scientific patterns rather than isolated observations.

The figures were then developed through a process of abstraction aimed at translating these recurring patterns into simplified visual representations. The objective was not to quantify relationships precisely, but rather to provide a coherent framework for understanding how different limitations and opportunities interact within agricultural digital twin systems.

Importantly, the figures were constructed using a systems-thinking perspective in which agroecosystems are viewed as complex adaptive systems characterised by feedback loops, non-linearity, partial observability, emergent behaviour, and context dependency [63,67]. Consequently, the graphical structures were intentionally designed to emphasize interactions, constraints, trade-offs, and sources of uncertainty rather than purely technological functionalities.

In this respect, Figure 5, Figure 6, Figure 7, Figure 8 and Figure 9 should be interpreted as analytical syntheses rather than predictive models. Their purpose is to facilitate discussion about relationships that are frequently acknowledged in the literature but rarely formalised explicitly [21,68]. Similarly, Figure 5 was developed to illustrate the gap between observable variables and latent processes, a challenge that is widely recognised in fields such as plant phenotyping and ecological modelling [23].

This distinction is particularly important because several of the mechanisms illustrated in the figures remain difficult to evaluate through conventional experimental approaches. By translating fragmented evidence into structured conceptual representations, the figures provide a means of integrating knowledge originating from different disciplines, including agronomy, systems ecology, artificial intelligence, remote sensing, modelling science, and digital agriculture [7,22].

This approach is, of course, open to discussion. Conceptual figures inevitably involve a degree of interpretation, which means they may be perceived as subjective or insufficiently grounded in empirical validation. However, in a field where empirical evidence remains fragmented and highly context-dependent, developing such conceptual frameworks is not necessarily a weakness. On the contrary, it can help structure scientific debate, clarify what remains uncertain, and identify priorities for future research [22,65].

Furthermore, conceptual synthesis has historically played a pivotal role in the advancement of scientific paradigms, particularly in disciplines characterized by high system complexity, strong context dependency, and limited experimental controllability. In such domains, conceptual frameworks frequently act as intellectual bridges linking empirical observations, theoretical developments, methodological innovation, and the formulation of new research hypotheses. Rather than merely summarizing existing knowledge, they often contribute to shaping emerging scientific agendas by identifying hidden assumptions, unresolved tensions, and overlooked research frontiers.

Within this perspective, the conceptual figures proposed throughout this review should be interpreted as hypothesis-generating and theory-building frameworks rather than definitive representations of agroecosystem reality. Their primary objective is not to provide empirical quantification, but to formalize recurring patterns, conceptual contradictions, and systemic challenges repeatedly identified across the reviewed literature. As such, they are intended to stimulate future empirical validation, comparative analyses, and methodological refinement.

Importantly, these figures should not be regarded as simple visual complements to the text. Instead, they constitute an integral component of the critical argument developed throughout this review. Collectively, they challenge several techno-optimistic assumptions that continue to dominate discussions surrounding Agricultural Digital Twins (ADTs), particularly the implicit belief that increasing data volumes, sensing density, computational power, and model complexity will inevitably lead to improved understanding, predictability, and control of agroecosystems.

By making structural constraints more explicit, the figures reveal a series of interconnected scientific challenges that remain only partially acknowledged in the current literature. These include the non-linear relationship between data accumulation and knowledge generation, illustrated through the data saturation and epistemic overload dynamics represented in Figure 4; the fundamental observability constraints imposed by partially observable biological systems and hidden ecological processes highlighted in Figure 5; the transition from deterministic prediction toward adaptive, uncertainty-aware decision support conceptualized in Figure 7; the cross-scale interactions linking molecular, physiological, agronomic, landscape, and socio-economic processes illustrated in Figure 8; the maturity–complexity paradox showing that digital twins tend to be least mature precisely where system complexity is highest (Figure 9); and the governance challenges associated with transparency, accountability, explainability, interoperability, and stakeholder legitimacy synthesized in Figure 10.

Taken together, these conceptual representations suggest that the future development of ADTs may be constrained less by technological limitations than by deeper epistemological boundaries associated with understanding, representing, and governing complex living systems. In this sense, the figures collectively support a broader scientific argument: the central challenge facing agricultural digital twins is not merely technological implementation, but the reconciliation of digital representation with the intrinsic uncertainty, heterogeneity, emergence, adaptability, and context dependency that characterize agroecosystems.

Consequently, the figures serve not only as explanatory devices but also as critical analytical instruments designed to stimulate scientific reflection and encourage a more balanced assessment of both the opportunities and limitations of digital twin technologies. Their purpose is to confront technological expectations with biological, ecological, epistemological, and socio-technical realities that cannot be fully eliminated through data acquisition or computational advances alone.

For this reason, the figures should not be interpreted as neutral summaries of the literature. Rather, they function as conceptual lenses through which prevailing assumptions can be critically examined and alternative research trajectories explored. They are not predictive models in themselves, but heuristic frameworks intended to organize thinking around the limits, trade-offs, paradoxes, and future directions of ADT development. By explicitly linking data, models, observability, uncertainty, governance, and system complexity, they contribute to a more transparent, reflexive, and scientifically grounded understanding of digital twin implementation in agriculture.

Ultimately, the conceptual framework developed in this review seeks to move beyond a purely technological interpretation of Agricultural Digital Twins. By explicitly incorporating uncertainty, partial observability, epistemic limits, governance mechanisms, adaptive decision-making, and multi-scale system complexity, it proposes a broader scientific perspective through which future ADT research can be evaluated, refined, and critically assessed. From this viewpoint, the long-term success of agricultural digital twins will likely depend not only on advances in sensing technologies, artificial intelligence, or computational performance, but also on their capacity to integrate ecological realism, scientific transparency, institutional legitimacy, and responsible governance within coherent socio-ecological decision-support systems capable of operating under conditions of irreducible uncertainty.

9. Limitations of the Conceptual Framework

Although the conceptual framework and associated figures developed in this review provide a structured synthesis of recurring patterns emerging from a rapidly evolving and highly interdisciplinary body of literature, their interpretive power remains subject to several important limitations that warrant explicit consideration.

First, the proposed framework is grounded in a qualitative, comparative, and integrative interpretation of the literature rather than in a formal quantitative meta-analysis. Consequently, the relationships represented in Figure 4, Figure 5, Figure 6, Figure 7, Figure 8, Figure 9 and Figure 10 should not be interpreted as empirically calibrated laws, predictive equations, or universally transferable system properties. Rather, they represent conceptual abstractions derived from convergent observations, theoretical reasoning, and recurring themes identified across diverse studies. This distinction is particularly important because many of the dynamics illustrated in the framework, including the data–knowledge decoupling process, the emergence of epistemic blind spots, and the maturity–complexity paradox—are likely to vary substantially across agroecological conditions, climatic contexts, production systems, institutional environments, and levels of technological development.

More fundamentally, the framework reflects an attempt to identify general tendencies within a heterogeneous scientific landscape rather than to establish universal principles governing Agricultural Digital Twins (ADTs). As a result, some conceptual relationships may reflect dominant narratives and recurrent patterns within the current literature rather than intrinsic properties of agroecosystems themselves. The framework should therefore be interpreted as an analytical synthesis of contemporary scientific thinking rather than as a definitive representation of agricultural reality.

Second, the level of abstraction required to construct an integrative conceptual framework inevitably simplifies the complexity of living agroecosystems. Biological systems are characterized by emergence, adaptation, path dependency, non-linearity, and cross-scale interactions that cannot be fully represented through static conceptual structures. Processes such as plant–microbiome co-evolution, genotype-by-environment interactions, ecological memory, adaptive farmer behaviour, landscape connectivity, and ecosystem feedback mechanisms remain only partially represented within the present framework.

This limitation is particularly evident in Figure 6 and Figure 8, which respectively conceptualize partial observability and multi-scale organization. While these representations help clarify major structural tensions, they cannot fully capture the diversity, contingency, and dynamic nature of interactions occurring across biological, ecological, and socio-economic scales. Consequently, the framework should not be interpreted as an exhaustive description of agroecosystem functioning, but rather as a heuristic lens designed to reveal underlying tensions that frequently remain hidden in predominantly technology-oriented discussions.

Third, several influential drivers of digital twin performance remain only indirectly incorporated into the framework. These include behavioural adaptation by farmers, institutional learning processes, governance structures, market dynamics, policy interventions, economic incentives, geopolitical instability, data ownership regimes, and emerging technological disruptions. Increasing evidence suggests that these factors can influence the success or failure of digital twin implementation as strongly as technological performance itself. However, incorporating all of these dimensions explicitly would have transformed the framework into a substantially more complex socio-technical model, extending beyond the analytical objectives of the present review.

A related limitation concerns the representation of uncertainty. Several relationships illustrated throughout the framework, particularly those involving thresholds, transitions, saturation effects, and declining predictive reliability, are primarily derived from theoretical reasoning, systems thinking, and comparative interpretation rather than from harmonized empirical datasets. The curves and transitions represented in Figure 4, Figure 7 and Figure 9 should therefore be interpreted as heuristic and context-dependent trajectories. They identify plausible system behaviours and potential structural tendencies, but they do not constitute empirically validated universal responses.

Future research should therefore focus on testing these conceptual propositions through coordinated empirical investigations conducted across contrasting agroecological regions, climatic gradients, production systems, and technological environments. Such efforts would help determine whether the relationships proposed here reflect general characteristics of ADTs or remain contingent upon specific contexts and implementation conditions.

Fourth, the framework only partially captures the profound socio-technical heterogeneity that characterizes global agriculture. The pathways leading toward uncertainty-aware and adaptive digital twins are unlikely to be uniform across regions. Differences in infrastructure quality, digital connectivity, institutional capacity, governance systems, access to expertise, economic resources, and data availability may fundamentally shape both the trajectory and the maturity of ADT development.

This limitation is particularly relevant for low- and middle-income countries, where agricultural systems often operate under conditions of limited connectivity, fragmented datasets, institutional constraints, and resource scarcity. Under such circumstances, the assumptions underlying highly integrated digital twin architectures may not hold, and alternative development pathways may emerge. Consequently, the maturity–complexity relationships proposed in this review should not be assumed to follow identical trajectories across all geographical and socio-economic contexts.

Fifth, the framework remains constrained by the current state of the scientific literature itself. Publication bias may systematically favour studies reporting technological successes, predictive improvements, and positive implementation outcomes, while failures, negative results, uncertainty analyses, and unsuccessful deployments remain comparatively underreported. This asymmetry may contribute to an overly optimistic representation of ADT capabilities and may partially obscure the frequency and significance of implementation challenges. As a result, some of the limitations discussed throughout this review may in fact be more pervasive than currently documented in the published literature.

Finally, an inherent limitation of all conceptual frameworks lies in their interpretive nature. Conceptual synthesis is not a neutral process of aggregation but an analytical exercise involving theoretical choices, disciplinary perspectives, and epistemological assumptions. Alternative conceptualisations of the same literature are therefore entirely possible. Researchers approaching ADTs from engineering, ecological, agronomic, social-science, or governance perspectives may legitimately emphasize different mechanisms, relationships, and priorities.

Rather than representing a weakness, this plurality of interpretations reflects the interdisciplinary nature of the field itself. Indeed, one of the central conclusions emerging from this review is that Agricultural Digital Twins cannot be fully understood through a single disciplinary lens. Their development sits at the intersection of engineering, ecology, data science, systems theory, governance, and decision sciences. Consequently, future progress will likely depend not only on technological innovation but also on the capacity to integrate these diverse perspectives into coherent scientific frameworks capable of addressing the fundamental challenge underlying ADTs: how to represent, understand, and support decision-making within complex living systems that remain only partially observable, inherently adaptive, and irreducibly uncertain.

For this reason, the framework should not be viewed as a definitive representation of agricultural digital twin development, but rather as a working scientific hypothesis designed to stimulate debate, guide future empirical investigations, and support the progressive refinement of theoretical understanding. Its principal value lies not in providing final answers, but in making explicit the key assumptions, uncertainties, and structural tensions that continue to shape the evolution of agricultural digital twins [22,65].

Ultimately, the framework should be regarded as an evolving conceptual architecture that will require continuous revision as new empirical evidence, modelling approaches, validation studies, and real-world implementations become available. In this sense, its scientific contribution is less to establish fixed conclusions than to provide a transparent foundation upon which future research can build, challenge, refine, and empirically test the proposed relationships.

10. Beyond Technological Optimism: A Critical Reappraisal of Digital Twins in Agriculture

Although the literature on digital twins in agriculture is expanding rapidly and often presents them as transformative tools, the discussion remains, to a large extent, shaped by a strong techno-optimistic narrative that deserves closer examination. Digital twins are frequently portrayed as integrative, data-driven solutions capable of improving productivity, enhancing sustainability, and strengthening resilience. While these expectations are not without foundation, they can sometimes overlook important scientific uncertainties, as well as the epistemological and socio-technical constraints that become apparent when these systems are applied to real-world agricultural contexts.

A first point of concern relates to the widespread assumption that agricultural systems can be made fully observable, measurable, and ultimately controllable through digital infrastructures. This assumption, which is partly inherited from engineering paradigms [19,69], does not align easily with the nature of agroecosystems. These systems are not fully ordered or predictable environments; they are complex adaptive systems, shaped by non-linearity, emergent dynamics, and multiple sources of uncertainty [7,70,71,72,73].

Furthermore, the assumption that increasing digitalisation will necessarily reduce uncertainty deserves careful reconsideration. In biological systems, uncertainty is not merely a consequence of insufficient information but often reflects intrinsic properties of ecological processes themselves. Consequently, even highly sophisticated digital twins may remain unable to fully anticipate emergent system behaviours, regime shifts, or unexpected responses arising from interactions across scales.

A second issue concerns the growing tendency to equate technological sophistication with scientific understanding. The increasing integration of sensors, remote sensing platforms, machine learning algorithms, and high-performance computing infrastructures undoubtedly expands the capacity to collect and process information. However, the accumulation of data should not be confused with an equivalent increase in explanatory power. As discussed throughout this review, the relationship between information availability and system understanding remains conditional upon the ability of models to represent underlying causal mechanisms [21,48,74,75].

This distinction becomes particularly important in the context of artificial intelligence and deep learning approaches. While highly flexible predictive models may achieve impressive forecasting performance, they may simultaneously obscure causal relationships and reduce interpretability. As a result, predictive success does not necessarily imply improved understanding of agroecosystem functioning. In some cases, increasingly complex algorithms may generate highly accurate outputs while providing limited insight into the ecological processes responsible for observed patterns.

Another important consideration concerns the socio-technical dimension of agricultural digital twins. Digital technologies are often presented as neutral innovations whose benefits emerge automatically from their deployment [18,66]. In reality, their performance and impact are strongly influenced by institutional arrangements, governance structures, economic conditions, and user capacities [46,47].

These socio-technical constraints indicate that governance should not be treated as an external regulatory add-on or a secondary implementation concern, but as a constitutive layer of Agricultural Digital Twins. As illustrated in Figure 10, the transformation of heterogeneous agricultural data into actionable, trustworthy and socially legitimate decisions depends on governance mechanisms that make data flows traceable, model assumptions interpretable, responsibilities explicit, and uncertainties communicable. Transparency, explainability, interoperability, data ownership, accountability, fairness, trust, adaptive learning and stakeholder participation are therefore not peripheral conditions for adoption; they are core requirements for ensuring that ADTs remain credible, usable and decision-relevant under the uncertainty and complexity of real agroecosystems. This issue is especially relevant in smallholder farming systems, rainfed tropical agriculture, biodiversity-rich agroecological systems, and resource-constrained regions, where digital infrastructures, connectivity, technical support, and access to high-quality datasets may remain limited. Ironically, many of the agricultural systems that could potentially benefit most from improved decision-support tools are also those in which the implementation of sophisticated digital twins remains most challenging.

At the same time, a growing body of evidence suggests the existence of a fundamental mismatch between the technological aspirations associated with Agricultural Digital Twins (ADTs) and the ecological complexity of the systems they seek to represent. As conceptualized in Figure 8, the performance of digital twin architectures tends to be highest in environments characterized by relatively stable conditions, high observability, standardized management practices, and well-defined system boundaries. Under such circumstances, data streams can be efficiently integrated, model parameters can be calibrated with greater confidence, and system dynamics remain sufficiently constrained to enable reliable prediction and optimization.

The situation is markedly different in complex agroecosystems. Systems characterized by high biodiversity, strong environmental variability, heterogeneous management practices, and multiple interacting biological processes remain substantially more difficult to represent through digital replicas. This observation raises a critical scientific question: are digital twins currently being developed primarily for the agricultural systems where they are most needed to enhance resilience and sustainability, or for those where implementation is technically easier and economically more attractive?

This question becomes particularly relevant when considering diversified agroforestry systems, mixed crop–livestock systems, tropical smallholder mosaics, regenerative farming systems, and low-input agroecological landscapes. Such systems are governed by dense ecological networks involving soil microbiomes, plant–plant interactions, trophic relationships, landscape connectivity, ecosystem services, and adaptive management strategies that evolve through time. Their behaviour frequently emerges from interactions occurring across multiple organizational scales rather than from a limited set of measurable variables. Consequently, many of the assumptions underlying conventional digital twin architectures—including standardization, repeatability, parameter stability, and transferability—become increasingly difficult to sustain.

From this perspective, the challenge facing future generations of ADTs is not merely one of computational power or data acquisition. Rather, it concerns the capacity to develop representations capable of embracing ecological complexity without reducing it excessively. A digital twin that simplifies agroecosystem functioning to the point where essential ecological processes disappear may achieve technical efficiency while simultaneously losing ecological realism. This creates a fundamental trade-off between model tractability and biological fidelity that remains largely unresolved within the current literature.

More broadly, the complexity mismatch highlighted in Figure 8 reflects a deeper epistemological issue. Agricultural systems are not simply physical systems waiting to be digitized; they are living adaptive systems characterized by emergence, contingency, path dependency, and irreducible uncertainty. Unlike industrial assets, whose behaviour can often be described through relatively stable physical laws, agroecosystems continuously reorganize in response to environmental change, biological adaptation, and human intervention. As a consequence, complete representation may remain an unattainable objective regardless of future advances in sensing technologies or artificial intelligence.

It is equally important to recognize that digital twins are not neutral technological artefacts. Every digital twin embodies a particular way of observing, measuring, interpreting, and ultimately governing agricultural systems. The selection of variables, the design of indicators, the structure of models, and the definition of optimization objectives all reflect underlying epistemological assumptions regarding what is considered relevant, measurable, and actionable knowledge.

By privileging variables that can be quantified, monitored, and computationally processed, ADTs may unintentionally favour certain forms of knowledge while marginalizing others. Experiential knowledge accumulated by farmers, tacit management practices, local ecological understanding, indigenous knowledge systems, and context-specific observations often remain difficult to formalize within conventional modelling frameworks. Yet these forms of knowledge frequently play a critical role in managing uncertainty, responding to unexpected disturbances, and maintaining long-term agroecosystem resilience.

This observation has important implications for the future governance of digital agriculture. If digital twins become increasingly influential in guiding agricultural decisions, questions of knowledge representation, transparency, inclusiveness, and epistemic equity will become as important as questions of predictive accuracy. The challenge is therefore not simply to build more sophisticated digital twins, but to ensure that their development remains compatible with plural forms of knowledge and diverse agricultural realities.

Ultimately, the long-term scientific value of ADTs may depend less on their ability to create perfect virtual replicas of agricultural systems than on their capacity to function as transparent, uncertainty-aware, and participatory decision-support environments. In this perspective, the next generation of agricultural digital twins should not aim to replace ecological understanding with computation, but rather to create new forms of interaction between data, models, ecological processes, and human expertise. Such a transition would move the field beyond a purely technological paradigm toward a more mature vision of ADTs as socio-ecological learning systems capable of supporting resilient agriculture under conditions of complexity, uncertainty, and global change.

Figure 10 ultimately reframes Agricultural Digital Twins (ADTs) not as purely computational infrastructures, but as governed socio-ecological knowledge systems operating at the interface between observation, modelling, and irreducible uncertainty. Rather than depicting governance as an external regulatory layer applied after model development, the figure positions governance as an integral component of the digital twin itself, shaping how data are collected, interpreted, validated, communicated, and translated into decisions. In this perspective, the long-term value of ADTs depends not only on sensing technologies, artificial intelligence, and predictive capabilities, but also on the institutional and societal mechanisms through which uncertainty is acknowledged, managed, and incorporated into decision-making processes.

The figure further highlights a critical distinction between what can be measured, what can be modelled, and what remains fundamentally uncertain. Although advances in remote sensing, UAVs, IoT networks, and artificial intelligence continue to expand the observable and modelled domains of agroecosystems, important drivers of system behaviour remain only partially observable or structurally inaccessible. These include microbiome dynamics, emergent ecosystem properties, adaptive human behaviour, socio-economic disruptions, policy shifts, and cross-scale ecological feedbacks. As a result, uncertainty should not be viewed as a temporary deficiency that will disappear with additional data, but as an intrinsic characteristic of complex agroecosystems that must be explicitly incorporated into digital twin architectures.

Consequently, future ADT development should move beyond data-centric paradigms toward uncertainty-aware and governance-sensitive frameworks. Participatory modelling approaches become particularly important in this context, as farmers, advisors, agronomists, ecologists, data scientists, and policy actors possess complementary forms of knowledge that can help identify blind spots, contextualize model outputs, and improve system legitimacy. Such collaborative approaches may strengthen transparency, operational relevance, stakeholder trust, and social acceptance while reducing the risk of developing technically sophisticated systems that remain disconnected from real-world agricultural decision-making.

Beyond methodological innovation, the figure suggests that a broader governance architecture is required to ensure the responsible deployment of ADTs. This architecture should include transparent data-governance policies, explainable artificial intelligence, interoperable infrastructures, open validation protocols, uncertainty quantification procedures, adaptive learning mechanisms, and cross-scale verification frameworks capable of evaluating model performance across diverse agroecological and socio-economic contexts. These governance dimensions are not secondary complements to technological development; they are likely to become primary determinants of the credibility, legitimacy, and long-term societal value of digital twins.

Taken together, these considerations suggest a fundamental conceptual shift. The objective of future ADTs should not be the pursuit of complete digital replication or perfect prediction of agroecosystems, an ambition that may remain unattainable in highly complex systems. Instead, ADTs should evolve toward adaptive, uncertainty-aware, and governance-enabled systems capable of supporting robust decisions under imperfect knowledge. In this perspective, success will depend less on maximizing predictive accuracy alone and more on building systems that remain transparent, trustworthy, resilient, and decision-relevant when confronted with uncertainty, complexity, and change. The future of Agricultural Digital Twins may therefore depend not on eliminating uncertainty, but on governing it effectively.

11. Conclusions

Digital twins are increasingly presented as emblematic of the broader transition toward data-driven agriculture. They carry a compelling promise: to integrate data, models, and decision-support tools into dynamic representations of agroecosystems capable of informing action in a context of accelerating environmental and climatic change. This promise is not without substance. It reflects both genuine technological progress and a growing need for tools that can help navigate the complexity of modern agricultural systems. Yet, as this review has shown, a clear gap remains between what digital twins are expected to achieve and what they can currently deliver.

At the core of this gap lies a fundamental characteristic of agroecosystems: they are living, context-dependent systems shaped by interactions that are multi-scale, dynamic, and only partially observable. In such systems, uncertainty is not simply a temporary limitation that can be resolved with more data. It is a structural feature. This has important implications. Although increasing data availability and model sophistication can improve the apparent representation of agricultural systems, they do not automatically generate deeper understanding or more reliable prediction. In many cases, model sophistication depends on measurements, state variables, or process-level information that are only partially available, inconsistently observed, or too costly to acquire under real farming conditions. As a result, a methodological deadlock may emerge: the more refined the model becomes, the more it relies on data that agricultural systems cannot reliably provide. In such situations, the resulting outputs may create an illusion of precision that exceeds both the observational basis of the model and the actual predictability of the system.

Importantly, this observation should not be interpreted as a universal limitation of Agricultural Digital Twins. Under appropriate conditions, numerous studies have demonstrated meaningful improvements in prediction, monitoring, resource-use efficiency, and decision support. However, the magnitude, persistence, and transferability of these benefits remain highly dependent on system characteristics, data quality, model assumptions, environmental conditions, and validation procedures. Consequently, caution is warranted when extrapolating reported performance gains beyond the contexts in which they were originally demonstrated.

In this light, the role of digital twins in agriculture may need to be reconsidered. Rather than being viewed primarily as tools for predictive control, they may be better understood as evolving interfaces that bring together data, models, and decision-making under conditions of uncertainty. Their value lies not only in their capacity to simulate system behaviour, but also in their potential to support more informed, adaptive, and context-aware decisions. This is particularly important in agricultural systems, where management choices must often be made under conditions of uncertainty, incomplete observability, and strong environmental variability. In such contexts, the role of digital twins is not simply to provide a single optimal prediction, but to help decision-makers explore possible system responses, compare alternative scenarios, and better understand the limits and implications of different management options.

At the same time, the future relevance of agricultural digital twins will depend on their ability to move beyond highly controlled production environments and demonstrate their value across a broader diversity of agroecosystems. This challenge is particularly important for smallholder farming systems, rainfed tropical agriculture, mixed crop–livestock systems, and biodiversity-rich agroecological landscapes, where system complexity, environmental variability, and limited digital infrastructure continue to constrain implementation. The true test of ADTs will therefore not be their performance in simplified environments alone, but their capacity to remain robust, interpretable, and operational under real-world complexity.

The evidence reviewed throughout this study further suggests that future progress will depend not only on advances in sensing technologies, artificial intelligence, computational power, or model integration. Equally important will be the development of rigorous long-term validation strategies, cross-site transferability assessments, uncertainty-aware modelling frameworks, and explainable AI approaches capable of improving transparency and scientific credibility. In this perspective, predictive performance should no longer be considered the sole criterion of success. Robustness, interpretability, adaptability, and ecological realism are likely to become equally important benchmarks for evaluating next-generation agricultural digital twins.

Future evaluation efforts should therefore place greater emphasis on long-term, multi-site, and cross-context validation studies capable of assessing whether reported performance improvements remain consistent across contrasting agroecological regions, production systems, climatic conditions, and socio-economic settings. Such efforts are essential for distinguishing context-specific successes from more generalizable properties of Agricultural Digital Twins.

Beyond scientific and technical considerations, the development of responsible agricultural digital twins will require new governance frameworks capable of addressing questions of data ownership, transparency, interoperability, accountability, and equitable access to digital innovation. Open data governance principles, explainable artificial intelligence frameworks, participatory modelling approaches involving farmers and local stakeholders, and cross-scale validation mechanisms should become central components of future ADT development. These dimensions are not peripheral to technological innovation; they are fundamental conditions for ensuring legitimacy, trust, and long-term societal value.

Particular attention should also be given to avoiding the emergence of new forms of digital inequality. Without appropriate governance mechanisms, the benefits of digital twins may remain concentrated in highly digitised agricultural systems, while resource-constrained regions continue to face barriers related to infrastructure, expertise, and data accessibility. Future ADTs should therefore be designed not only to optimise agricultural performance, but also to enhance inclusiveness, accessibility, and resilience across diverse socio-economic contexts.

Rather than questioning the value of ADTs, these considerations highlight the importance of adopting realistic expectations regarding their current capabilities and future development trajectories. The central challenge is not whether digital twins can generate useful insights, but under which conditions, at which scales, and with what degree of confidence those insights remain reliable, transferable, and operationally relevant.

Ultimately, this review argues that the future of agricultural digital twins does not lie in the pursuit of perfect prediction or complete digital replication of agroecosystems. Rather, it lies in the development of adaptive, uncertainty-aware, and scientifically transparent systems capable of supporting decision-making in environments characterised by complexity, variability, and change. The most successful agricultural digital twins will likely be those that combine technological innovation with ecological understanding, scientific humility, participatory governance, and operational realism. In this emerging paradigm, digital twins should be viewed not as instruments of control, but as tools for navigating uncertainty and supporting resilient agricultural transitions in an increasingly unpredictable world.

Author Contributions

Conceptualization, M.E.J.; methodology, M.E.J.; software, M.E.J.; validation, M.E.J. and R.L.; formal analysis, M.E.J.; investigation, M.E.J.; resources, M.E.J.; data curation, M.E.J. and R.L.; writing—original draft preparation, M.E.J.; writing—review and editing, R.L. and B.T.; visualization, M.E.J.; supervision, R.L.; project administration, B.T.; funding acquisition, R.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by FINCOME—Séjours de longue durée, Centre National pour la Recherche Scientifique et Technique (CNRST), Service Soutien au Transfert de Technologie.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Acknowledgments

This work was supported by the CNRST (Rabat) through a six-month postdoctoral fellowship awarded to the first author under the FINCOME program. The authors would like to express their sincere gratitude to the National School of Agriculture of Meknès (Morocco) for hosting a research stay from 1 November 2025 to 30 April 2026, which significantly contributed to the development of this work. The authors particularly acknowledge the Director of the institution for his support, as well as the research team of Rachid Lahlali and Ghizlane Echchgadda for their scientific exchanges, collaboration, and valuable insights.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Zhang, R.; Zhu, H.; Chang, Q.; Mao, Q. A comprehensive review of digital twins technology in agriculture. Agriculture 2025, 15, 903. [Google Scholar] [CrossRef]
Pylianidis, C.; Osinga, S.; Athanasiadis, I.N. Introducing digital twins to agriculture. Comput. Electron. Agric. 2021, 184, 105942. [Google Scholar] [CrossRef]
Grieves, M.; Vickers, J. Digital Twin: Mitigating Unpredictable, Undesirable Emergent Behavior in Complex Systems. In Transdisciplinary Perspectives on Complex Systems: New Findings and Approaches; Kahlen, F.-J., Flumerfelt, S., Alves, A., Eds.; Springer International Publishing: Cham, Switzerland, 2017; pp. 85–113. [Google Scholar] [CrossRef]
Antle, J.M.; Stöckle, C.O. Climate Impacts on Agriculture: Insights from Agronomic-Economic Analysis. Rev. Environ. Econ. Policy 2017, 11, 299–318. [Google Scholar] [CrossRef]
Basso, B.; Dumont, B.; Maestrini, B.; Shcherbak, I.; Robertson, G.P.; Porter, J.R.; Smith, P.; Paustian, K.; Grace, P.R.; Asseng, S.; et al. Soil Organic Carbon and Nitrogen Feedbacks on Crop Yields under Climate Change. Agric. Environ. Lett. 2018, 3, 180026. [Google Scholar] [CrossRef]
Frieler, K.; Schauberger, B.; Arneth, A.; Balkovič, J.; Chryssanthacopoulos, J.; Deryng, D.; Elliott, J.; Folberth, C.; Khabarov, N.; Müller, C.; et al. Understanding the weather signal in national crop-yield variability. Earth’s Future 2017, 5, 605–616. [Google Scholar] [CrossRef]
Levin, S.A. Ecosystems and the Biosphere as Complex Adaptive Systems. Ecosystems 1998, 1, 431–436. [Google Scholar] [CrossRef]
Lobell, D.B.; Asseng, S. Comparing estimates of climate change impacts from process-based and statistical crop models. Environ. Res. Lett. 2017, 12, 015001. [Google Scholar] [CrossRef]
Rosenzweig, C.; Elliott, J.; Deryng, D.; Ruane, A.C.; Müller, C.; Arneth, A.; Boote, K.J.; Folberth, C.; Glotter, M.; Khabarov, N.; et al. Assessing agricultural risks of climate change in the 21st century in a global gridded crop model intercomparison. Proc. Natl. Acad. Sci. USA 2014, 111, 3268–3273. [Google Scholar] [CrossRef] [PubMed]
Ruane, A.C.; Rosenzweig, C.; Asseng, S.; Boote, K.J.; Elliott, J.; Ewert, F.; Jones, J.W.; Martre, P.; McDermid, S.P.; Müller, C.; et al. An AgMIP framework for improved agricultural representation in integrated assessment models. Environ. Res. Lett. 2017, 12, 125003. [Google Scholar] [CrossRef]
Ruane, A.; Rosenzweig, C. Climate Change Impacts on Agriculture: Global Trends, Challenges and Opportunities; World Scientific: Singapore, 2019; pp. 161–191. [Google Scholar] [CrossRef]
Ariesen-Verschuur, N.; Verdouw, C.; Tekinerdogan, B. Digital Twins in greenhouse horticulture: A review. Comput. Electron. Agric. 2022, 199, 107183. [Google Scholar] [CrossRef]
Barroso-Barroso, C.; Vega-Muñoz, A.; Maradiaga-López, J.; Salazar-Sepúlveda, G.; Carabantes-Silva, R. Smart Farming and the SDGs: Emerging Research Patterns and Sustainability Implications. Agriculture 2026, 16, 81. [Google Scholar] [CrossRef]
Das, P.; Woods, O.; Kong, L. Modelling nature? The digital twinning and untwinning of urban farms. Geoforum 2026, 170, 104539. [Google Scholar] [CrossRef]
Escribà-Gelonch, M.; Liang, S.; van Schalkwyk, P.; Fisk, I.; Long, N.V.D.; Hessel, V. Digital Twins in Agriculture: Orchestration and Applications. J. Agric. Food Chem. 2024, 72, 10737–10752. [Google Scholar] [CrossRef]
Litvinov, Y.; Mezhenkov, A.; Sudip, T.; Sushkova, S.; Samoylov, A.; Melnik, E.; Kozlovskiy, A.; Kuchmenko, E.; Minkina, T.; Shuvaev, E.; et al. Theoretical foundations for developing a digital soil twin for Southern Russia. Eurasian J. Soil Sci. 2026, 15, 141–148. [Google Scholar] [CrossRef]
Verdouw, C.; Tekinerdogan, B.; Beulens, A.; Wolfert, S. Digital twins in smart farming. Agric. Syst. 2021, 189, 103046. [Google Scholar] [CrossRef]
Wolfert, S.; Ge, L.; Verdouw, C.; Bogaardt, M.-J. Big Data in Smart Farming—A review. Agric. Syst. 2017, 153, 69–80. [Google Scholar] [CrossRef]
Qi, Q.; Tao, F. Digital Twin and Big Data Towards Smart Manufacturing and Industry 4.0: 360 Degree Comparison. IEEE Access 2018, 6, 3585–3593. [Google Scholar] [CrossRef]
Finger, R. Sustainable crop protection and the role of digital agriculture. Agric. Syst. 2026, 231, 104516. [Google Scholar] [CrossRef]
Reichstein, M.; Camps-Valls, G.; Stevens, B.; Jung, M.; Denzler, J.; Carvalhais, N.; Prabhat. Deep learning and process understanding for data-driven Earth system science. Nature 2019, 566, 195–204. [Google Scholar] [CrossRef] [PubMed]
Wallach, D.; Martre, P.; Liu, B.; Asseng, S.; Ewert, F.; Thorburn, P.J.; van Ittersum, M.; Aggarwal, P.K.; Ahmed, M.; Basso, B.; et al. Multimodel ensembles improve predictions of crop–environment–management interactions. Glob. Change Biol. 2018, 24, 5072–5083. [Google Scholar] [CrossRef]
Mahlein, A.-K. Plant Disease Detection by Imaging Sensors—Parallels and Specific Demands for Precision Agriculture and Plant Phenotyping. Plant Dis. 2015, 100, 241–251. [Google Scholar] [CrossRef]
Ding, K.; Chan, F.T.S.; Zhang, X.; Zhou, G.; Zhang, F. Defining a Digital Twin-based Cyber-Physical Production System for autonomous manufacturing in smart shop floors. Int. J. Prod. Res. 2019, 57, 6315–6334. [Google Scholar] [CrossRef]
Wang, J.; Zhang, L.; Lin, K.-Y.; Feng, L.; Zhang, K. A digital twin modeling approach for smart manufacturing combined with the UNISON framework. Comput. Ind. Eng. 2022, 169, 108262. [Google Scholar] [CrossRef]
Spiryagin, M.; Edelmann, J.; Klinger, F.; Cole, C. Vehicle system dynamics in digital twin studies in rail and road domains. Veh. Syst. Dyn. 2023, 61, 1737–1786. [Google Scholar] [CrossRef]
Fett, M.; Wilking, F.; Goetz, S.; Kirchner, E.; Wartzack, S. A Literature Review on the Development and Creation of Digital Twins, Cyber-Physical Systems, and Product-Service Systems. Sensors 2023, 23, 9786. [Google Scholar] [CrossRef]
Miao, R.; Liu, S.; Sun, Y.; Du, M.; Bao, J. Streamlining digital twin development and operation with DTOps. J. Intell. Manuf. 2026, 37, 1531–1550. [Google Scholar] [CrossRef]
Cesco, S.; Sambo, P.; Borin, M.; Basso, B.; Orzes, G.; Mazzetto, F. Smart agriculture and digital twins: Applications and challenges in a vision of sustainability. Eur. J. Agron. 2023, 146, 126809. [Google Scholar] [CrossRef]
Silva, J.V.; Giller, K.E. Grand challenges for the 21st century: What crop models can and can’t (yet) do. J. Agric. Sci. 2020, 158, 794–805. [Google Scholar] [CrossRef]
Keating, B.A. Crop, soil and farm systems models—Science, engineering or snake oil revisited. Agric. Syst. 2020, 184, 102903. [Google Scholar] [CrossRef]
Liu, L.; Basso, B. Linking field survey with crop modeling to forecast maize yield in smallholder farmers’ fields in Tanzania. Food Secur. 2020, 12, 537–548. [Google Scholar] [CrossRef]
Maestrini, B.; Basso, B. Subfield crop yields and temporal stability in thousands of US Midwest fields. Precis. Agric. 2021, 22, 1749–1767. [Google Scholar] [CrossRef]
Wen, J.; Gabrys, B.; Musial, K. Toward Digital Twin Oriented Modeling of Complex Networked Systems and Their Dynamics: A Comprehensive Survey. IEEE Access 2022, 10, 66886–66923. [Google Scholar] [CrossRef]
He, X.; Ai, Q.; Pan, B.; Tang, L.; Qiu, R. Spatial-temporal data analysis of digital twin. Digit. Twin 2025, 2, 2482378. [Google Scholar] [CrossRef]
Mitchell, J.; Johnston, I.G.; Bassel, G.W. Variability in seeds: Biological, ecological, and agricultural implications. J. Exp. Bot. 2017, 68, 809–817. [Google Scholar] [CrossRef]
Balduzzi, M.; Binder, B.M.; Bucksch, A.; Chang, C.; Hong, L.; Iyer-Pascuzzi, A.S.; Pradal, C.; Sparks, E.E. Reshaping Plant Biology: Qualitative and Quantitative Descriptors for Plant Morphology. Front. Plant Sci. 2017, 8, 00117. [Google Scholar] [CrossRef]
Lasky, J.R.; Hooten, M.B.; Adler, P.B. What processes must we understand to forecast regional-scale population dynamics? Proc. R. Soc. B Biol. Sci. 2020, 287, 20202219. [Google Scholar] [CrossRef]
Cohen, A.R.; Chen, G.; Berger, E.M.; Warrier, S.; Lan, G.; Grubert, E.; Dellaert, F.; Chen, Y. Dynamically Controlled Environment Agriculture: Integrating Machine Learning and Mechanistic and Physiological Models for Sustainable Food Cultivation. ACS EST Eng. 2022, 2, 3–19. [Google Scholar] [CrossRef]
Applestein, C.; Caughlin, T.T.; Germino, M.J. Post-fire seed dispersal of a wind-dispersed shrub declined with distance to seed source, yet had high levels of unexplained variation. AoB Plants 2022, 14, plac045. [Google Scholar] [CrossRef]
Garrett, K.A.; Bebber, D.P.; Etherton, B.A.; Gold, K.M.; Plex Sulá, A.I.; Selvaraj, M.G. Climate Change Effects on Pathogen Emergence: Artificial Intelligence to Translate Big Data for Mitigation. Annu. Rev. Phytopathol. 2022, 60, 357–378. [Google Scholar] [CrossRef] [PubMed]
Pereponova, A.; Grahmann, K.; Lischeid, G.; Bellingrath-Kimura, S.D.; Ewert, F.A. Sustainable transformation of agriculture requires landscape experiments. Heliyon 2023, 9, e21215. [Google Scholar] [CrossRef]
Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.-W.; da Silva Santos, L.B.; Bourne, P.E.; et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 2016, 3, 160018. [Google Scholar] [CrossRef]
Basso, B.; Antle, J. Digital agriculture to design sustainable agricultural systems. Nat. Sustain. 2020, 3, 254–256. [Google Scholar] [CrossRef]
Bronson, K.; Knezevic, I. Big Data in food and agriculture. Big Data Soc. 2016, 3, 2053951716648174. [Google Scholar] [CrossRef]
Birner, R.; Daum, T.; Pray, C. Who drives the digital revolution in agriculture? A review of supply-side trends, players and challenges. Appl. Econ. Perspect. Policy 2021, 43, 1260–1285. [Google Scholar] [CrossRef]
Klerkx, L.; Jakku, E.; Labarthe, P. A review of social science on digital agriculture, smart farming and agriculture 4.0: New contributions and a future research agenda. NJAS Wagening. J. Life Sci. 2019, 90–91, 1–16. [Google Scholar] [CrossRef]
Rudin, C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat. Mach. Intell. 2019, 1, 206–215. [Google Scholar] [CrossRef]
Jin, X.; Kumar, L.; Li, Z.; Feng, H.; Xu, X.; Yang, G.; Wang, J. A review of data assimilation of remote sensing and crop models. Eur. J. Agron. 2018, 92, 141–152. [Google Scholar] [CrossRef]
Mahlein, A.-K.; Alisaac, E.; Al Masri, A.; Behmann, J.; Dehne, H.-W.; Oerke, E.-C. Comparison and Combination of Thermal, Fluorescence, and Hyperspectral Imaging for Monitoring Fusarium Head Blight of Wheat on Spikelet Scale. Sensors 2019, 19, 281. [Google Scholar] [CrossRef] [PubMed]
Zarco-Tejada, P.J.; Camino, C.; Beck, P.S.A.; Calderon, R.; Hornero, A.; Hernández-Clemente, R.; Kattenborn, T.; Montes-Borrego, M.; Susca, L.; Morelli, M.; et al. Previsual symptoms of Xylella fastidiosa infection revealed in spectral plant-trait alterations. Nat. Plants 2018, 4, 432–439. [Google Scholar] [CrossRef]
Savary, S.; Willocquet, L.; Pethybridge, S.J.; Esker, P.; McRoberts, N.; Nelson, A. The global burden of pathogens and pests on major food crops. Nat. Ecol. Evol. 2019, 3, 430–439. [Google Scholar] [CrossRef] [PubMed]
Bouri, M.; Arslan, K.S.; Şahin, F. Climate-smart pest management in sustainable agriculture: Promises and challenges. Sustainability 2023, 15, 4592. [Google Scholar] [CrossRef]
Delfani, P.; Thuraga, V.; Banerjee, B.; Chawade, A. Integrative approaches in modern agriculture: IoT, ML and AI for disease forecasting amidst climate change. Precis. Agric. 2024, 25, 2589–2613. [Google Scholar] [CrossRef]
Bardgett, R.D.; van der Putten, W.H. Belowground biodiversity and ecosystem functioning. Nature 2014, 515, 505–511. [Google Scholar] [CrossRef]
Delgado-Baquerizo, M.; Oliverio, A.M.; Brewer, T.E.; Benavent-González, A.; Eldridge, D.J.; Bardgett, R.D.; Maestre, F.T.; Singh, B.K.; Fierer, N. A global atlas of the dominant bacteria found in soil. Science 2018, 359, 320–325. [Google Scholar] [CrossRef]
Giffard, B.; Winter, S.; Guidoni, S.; Nicolai, A.; Castaldini, M.; Cluzeau, D.; Coll, P.; Cortet, J.; Le Cadre, E.; d’Errico, G.; et al. Vineyard Management and Its Impacts on Soil Biodiversity, Functions, and Ecosystem Services. Front. Ecol. Evol. 2022, 1, 850272. [Google Scholar] [CrossRef]
Medina, N.; Vandermeer, J. Developing systems theory in soil agroecology: Incorporating heterogeneity and dynamic instability. Front. Environ. Sci. 2023, 1, 1171194. [Google Scholar] [CrossRef]
Scheffer, M.; Bascompte, J.; Brock, W.A.; Brovkin, V.; Carpenter, S.R.; Dakos, V.; Held, H.; van Nes, E.H.; Rietkerk, M.; Sugihara, G. Early-warning signals for critical transitions. Nature 2009, 461, 53–59. [Google Scholar] [CrossRef]
Dakos, V.; Carpenter, S.R.; Brock, W.A.; Ellison, A.M.; Guttal, V.; Ives, A.R.; Kéfi, S.; Livina, V.; Seekell, D.A.; van Nes, E.H.; et al. Methods for Detecting Early Warnings of Critical Transitions in Time Series Illustrated Using Simulated Ecological Data. PLoS ONE 2012, 7, e41010. [Google Scholar] [CrossRef]
Schlüter, M.; Müller, B.; Frank, K. The potential of models and modeling for social-ecological systems research the reference frame ModSES. Ecol. Soc. 2019, 24, 31. [Google Scholar] [CrossRef]
Ong, T.W.Y.; Liao, W. Agroecological Transitions: A Mathematical Perspective on a Transdisciplinary Problem. Front. Sustain. Food Syst. 2020, 4, 00091. [Google Scholar] [CrossRef]
Saltelli, A.; Bammer, G.; Bruno, I.; Charters, E.; Di Fiore, M.; Didier, E.; Nelson Espeland, W.; Kay, J.; Lo Piano, S.; Mayo, D.; et al. Five ways to ensure that models serve society: A manifesto. Nature 2020, 582, 482–484. [Google Scholar] [CrossRef]
Pearl, J. Causality, 2nd ed.; Cambridge University Press: Cambridge, UK, 2009. [Google Scholar] [CrossRef]
Peters, J.; Janzing, D.; Schölkopf, B. Elements of Causal Inference: Foundations and Learning Algorithms; MIT Press: Cambridge, MA, USA, 2017; ISBN 978-0-262-03731-0. [Google Scholar]
Rotz, S.; Gravely, E.; Mosby, I.; Duncan, E.; Finnis, E.; Horgan, M.; LeBlanc, J.; Martin, R.; Neufeld, H.T.; Nixon, A.; et al. Automated pastures and the digital divide: How agricultural technologies are shaping labour and rural communities. J. Rural Stud. 2019, 68, 112–122. [Google Scholar] [CrossRef]
Levins, R. The strategy of model building in population biology. Am. Sci. 1966, 54, 421–431. [Google Scholar]
Rolnick, D.; Donti, P.L.; Kaack, L.H.; Kochanski, K.; Lacoste, A.; Sankaran, K.; Ross, A.S.; Milojevic-Dupont, N.; Jaques, N.; Waldman-Brown, A.; et al. Tackling Climate Change with Machine Learning. ACM Comput. Surv. 2022, 55, 42. [Google Scholar] [CrossRef]
Tao, F.; Qi, Q.; Wang, L.; Nee, A.Y.C. Digital Twins and Cyber–Physical Systems toward Smart Manufacturing and Industry 4.0: Correlation and Comparison. Engineering 2019, 5, 653–661. [Google Scholar] [CrossRef]
Folke, C.; Carpenter, S.; Walker, B.; Scheffer, M.; Rockström, J. Resilience Thinking: Integrating Resilience, Adaptability and Transformability. Ecol. Soc. 2010, 15, 20. [Google Scholar] [CrossRef]
Duru, M.; Therond, O.; Fares, M.h. Designing agroecological transitions; A review. Agron. Sustain. Dev. 2015, 35, 1237–1257. [Google Scholar] [CrossRef]
Tittonell, P. Ecological intensification of agriculture—Sustainable by nature. Curr. Opin. Environ. Sustain. 2014, 8, 53–61. [Google Scholar] [CrossRef]
Rillig, M.C.; Ryo, M.; Lehmann, A.; Aguilar-Trigueros, C.A.; Buchert, S.; Wulf, A.; Iwasaki, A.; Roy, J.; Yang, G. The role of multiple global change factors in driving soil functions and microbial biodiversity. Science 2019, 366, 886–890. [Google Scholar] [CrossRef]
Bzdok, D.; Altman, N.; Krzywinski, M. Statistics versus machine learning. Nat. Methods 2018, 15, 233–234. [Google Scholar] [CrossRef]
Geirhos, R.; Jacobsen, J.-H.; Michaelis, C.; Zemel, R.; Brendel, W.; Bethge, M.; Wichmann, F.A. Shortcut learning in deep neural networks. Nat. Mach. Intell. 2020, 2, 665–673. [Google Scholar] [CrossRef]

Figure 1. Building the evidence base of Agricultural Digital Twins (ADTs) through systematic literature screening and bibliometric overlay analysis. Panel (A) presents the PRISMA-inspired workflow used to identify, screen, assess, and select publications included in the review. A total of 412 records were initially retrieved from the Web of Science Core Collection and Scopus databases in March 2025. After duplicate removal, title and abstract screening, and full-text eligibility assessment, 178 publications were retained for qualitative synthesis and bibliometric interpretation. Panel (B) shows the VOSviewer version 1.6.20 (Centre for Science and Technology Studies (CWTS), Leiden University, Leiden, The Netherlands; https://www.vosviewer.com), keyword co-occurrence network generated from the retained corpus. Node size reflects keyword occurrence frequency, link thickness indicates the strength of co-occurrence relationships, and node colour represents the average publication year associated with each keyword. Purple and blue nodes indicate earlier research themes, whereas green and yellow nodes indicate more recent or emerging topics within the field. Together, the two panels illustrate both the methodological transparency of the literature selection process and the temporal–intellectual structure of the contemporary Agricultural Digital Twin research landscape. The bibliometric map should be interpreted as a structured representation of dominant research themes, their interconnections, and their temporal evolution within the reviewed corpus, rather than as an exhaustive or statistically calibrated representation of the entire scientific domain.

Figure 2. Diverging trajectories of technological expansion and conceptual maturity in Agricultural Digital Twin (ADT) research. The blue curve illustrates the rapid growth of publications identified in the reviewed corpus between 2016 and 2025, reflecting the increasing adoption of ADTs across agricultural domains. The orange curve represents the conceptual maturation of the field, including advances in validation, uncertainty quantification, interoperability, governance, explainability, transferability, and biological realism. The widening gap between the two trajectories highlights an emerging innovation–maturity imbalance, whereby technological development progresses faster than the resolution of fundamental scientific and epistemological challenges. Rather than indicating a lack of progress, this divergence reflects the transition of ADTs from a predominantly technology-driven phase toward a more reflexive stage in which questions of uncertainty, system representation, governance, and adaptive decision-making become increasingly central. The figure should be interpreted as a conceptual synthesis derived from patterns identified across the reviewed literature rather than as a quantitative bibliometric model.

Figure 3. Beyond Technology: A Conceptual Framework for the Next Generation of Agricultural Digital Twins. This figure conceptualizes Agricultural Digital Twins (ADTs) as integrated system-of-systems architectures operating at the interface of data acquisition, artificial intelligence, agroecosystem applications, governance, and systems thinking. The framework identifies five interdependent thematic pillars emerging from the reviewed literature and demonstrates that the future evolution of ADTs is increasingly shaped by cross-cutting challenges related to uncertainty, transparency, interoperability, explainability, biological realism, and responsible innovation. Rather than representing a purely technological evolution, the figure highlights the ongoing transformation of ADTs into adaptive socio-ecological decision-support systems capable of operating under complexity and uncertainty. The framework should be interpreted as a conceptual synthesis derived from the literature and designed to illustrate the multidimensional and interdisciplinary nature of Agricultural Digital Twin research.

Figure 4. Data–model coupling and epistemic limits in Agricultural Digital Twins. This conceptual figure illustrates how increasing digitalization intensity may improve technical performance without necessarily producing proportional gains in system understanding. The blue trajectory represents technical performance, the green trajectory represents system understanding, and the red dashed trajectory represents potential epistemic blind spots linked to hidden uncertainty, model misspecification and unobserved constraints. Beyond a data-saturation zone, additional data may, under certain conditions, create diminishing explanatory returns or a perception of increased precision without equivalent gains in predictive reliability, mechanistic understanding or decision robustness. Curves, thresholds and normalized values are heuristic rather than empirical and should not be interpreted as universal or statistically calibrated relationships.

Figure 5. The observability frontier in agricultural digital twins: why more data do not necessarily resolve hidden agroecosystem complexity. The figure conceptualizes the transition from low to high data integration in Agricultural Digital Twins (ADTs) and its consequences for model representation. From left to right, increasing data integration improves the coverage of observable system states, but does not eliminate the hidden biological, ecological and structural constraints that remain only partially measurable or indirectly inferred. The central panel distinguishes directly observed variables from latent processes, while the model-system panel illustrates how measured data are transformed into model inputs and outputs despite the persistence of unrepresented or weakly represented constraints. The figure highlights two critical limitations. First, additional data do not necessarily generate proportional gains in predictive reliability when model structure remains incomplete, poorly coupled to system functioning or weakly grounded in causal processes [5,21]. Second, ADTs are necessarily constructed from observable proxies, whereas key agroecosystem processes, such as below-ground interactions, microbial dynamics, physiological regulation and biotic feedbacks, remain only partly accessible to measurement or inference [7,36,37,38,39,40]. The figure should therefore be interpreted as a heuristic conceptual synthesis rather than an empirical quantification: it shows that the performance of ADTs is shaped not only by the amount of integrated data, but by the persistent epistemic gap between measurable variables and hidden system dynamics.

Figure 6. Partial observability and epistemological gap in agricultural digital twins. This conceptual diagram illustrates how observable variables are used to infer underlying agroecosystem processes that remain only partially accessible to measurement. The figure is intended as a heuristic representation of the epistemological challenges associated with modelling complex biological systems. The relative positioning of components and relationships should therefore be interpreted conceptually rather than quantitatively, highlighting the persistent gap between observed system states and hidden ecological, physiological, and biological processes.

Figure 7. From prediction engines to decision navigators: multi-future reasoning in Agricultural Digital Twins. The figure conceptualizes a transition from conventional Agricultural Digital Twins (ADTs) that generate a single best-estimate prediction toward adaptive ADTs designed to explore multiple plausible futures. Conventional ADTs typically rely on observed data, static model structures and historical calibration to produce one dominant forecast and one recommended management action. Such approaches may perform under stable conditions but remain vulnerable when climate, biological, market, management or socio-environmental conditions deviate from expectations. Adaptive ADTs, by contrast, integrate data assimilation, machine learning, process-based models, feedbacks and uncertainty quantification to generate scenario ensembles across interacting sources of uncertainty. These scenario ensembles do not aim to identify a single optimal future, but to define a robust decision space in which irrigation, disease management, fertilization, economic choices, risk mitigation and governance options can be evaluated across alternative trajectories. The figure highlights a conceptual shift from prediction-centred modelling toward resilience-oriented decision navigation, where the value of ADTs lies less in forecasting one future accurately than in supporting flexible decisions that remain effective across many possible futures.

Figure 8. The Digital Twin Relevance Paradox in Complex Agroecosystems. This conceptual figure illustrates the non-linear relationship between agroecosystem complexity, the need for Agricultural Digital Twins (ADTs), and their current operational reliability. As agroecosystem complexity increases, the demand for digital twins rises because decision-making becomes increasingly challenged by nonlinear interactions, uncertainty, environmental variability, and multi-scale feedbacks. In contrast, the reliability of current ADTs tends to decline beyond a critical complexity threshold due to hidden processes, emergent behaviours, microbiome dynamics, governance influences, human decision-making, and climate extremes that remain only partially represented within existing modelling frameworks. The intersection between the two trajectories defines an “optimal digital twin domain,” where scientific value is maximized through a balance between system complexity and model reliability. Beyond this threshold, a “paradox zone” emerges in which digital twins are most needed for decision support but simultaneously become least reliable. The figure highlights a central challenge for next-generation ADTs: future progress will depend not only on increasing data availability or computational capacity, but also on improving the representation of hidden processes, uncertainty propagation, adaptive learning, and socio-ecological feedbacks within complex agroecosystems. The figure should be interpreted as a conceptual synthesis derived from recurring patterns identified across the reviewed literature rather than as an empirical or quantitatively calibrated relationship.

Figure 9. The expanding epistemic shadow of Agricultural Digital Twins under increasing agroecosystem observability. This conceptual figure illustrates how the progressive expansion of sensing technologies, remote and proximal observations, artificial intelligence, data fusion and system-of-systems integration has increased the observable fraction of agroecosystems. However, this expansion does not necessarily eliminate uncertainty. Instead, it may reveal deeper layers of hidden biological, ecological and socio-economic complexity that remain only partially measurable or structurally inaccessible to current modelling frameworks. The upper agroecosystem layer represents variables that digital twins can increasingly capture, including canopy structure, vegetation indices, biomass, soil moisture, yield, management operations and field-level environmental conditions. The lower layer represents the epistemic shadow: root architecture, microbiome dynamics, rhizosphere processes, nutrient transformations, genotype-by-environment interactions, epigenetic regulation, biotic interactions, ecosystem feedbacks, farmer behaviour and socio-economic responses. The figure therefore reframes the maturity–complexity relationship by showing that the challenge is not only that digital twins perform less reliably in complex systems, but that greater observability can expose additional hidden processes and new sources of structural uncertainty. Based on a conceptual synthesis of the literature [21,22,44], the figure should be interpreted as a heuristic framework rather than an empirical quantification. It highlights that future progress will depend not only on additional data or computational capacity, but on the ability to represent latent processes, uncertainty, emergent dynamics and socio-ecological feedbacks within Agricultural Digital Twin architectures.

Figure 10. Governing uncertainty in Agricultural Digital Twins. The figure conceptualizes Agricultural Digital Twins (ADTs) as systems operating between three domains: what can be measured, what can be modelled, and what remains uncertain. While data streams from satellites, UAVs, sensors, weather observations and farm records expand the observable domain, and models represent processes such as crop growth, disease dynamics, water balance and yield forecasting, many drivers of agroecosystem behaviour remain only partially observable, including microbiome dynamics, emergent properties, farmer behaviour, climate surprises, socio-economic shocks and cross-scale feedbacks. The outer governance layer highlights the need for transparency, explainability, accountability, stakeholder participation, adaptive learning, and data ownership to manage these residual uncertainties. The figure therefore shifts the focus from data governance to uncertainty governance, emphasizing that the future effectiveness of ADTs depends not only on better data and models, but on their ability to support robust decisions under irreducible uncertainty.

Table 1. Critical assessment of digital twin applications in agriculture: reported performance versus structural limitations.

Application Domain	Reported Benefits	Underlying Mechanisms	Key Limitations	Critical Interpretation	References
Yield Prediction	Improved accuracy through data assimilation	Integration of remote sensing with crop models; dynamic state updating	Strong dependence on site-specific calibration; limited transferability	Performance gains may reflect local tuning rather than true predictive generalization	[49]
Disease Monitoring	Early detection via spectral and thermal signals	Use of proxy indicators of plant stress; integration with epidemiological models	Non-specific signals; high sensitivity to environmental variability; risk of misclassification	Early-warning systems are constrained by proxy ambiguity and lack of causal specificity	[23]
Resource Optimization (Irrigation, Fertilization)	Increased efficiency in water and nutrient use	Coupling of sensor data with decision support models	Context-dependent results; influenced by management improvements rather than DT itself	Gains may reflect improved practices rather than intrinsic digital twin capability	[47]
Farm Management Systems	Enhanced decision-making and real-time monitoring	Integration of multi-source data into platforms	Data fragmentation; interoperability issues; user adoption barriers	Effectiveness depends as much on socio-economic context as on technical performance	[46]
Early Warning Systems	Detection of pre-symptomatic signals	Spectral and temporal anomaly detection	Weak link to underlying biological processes; epistemological gap	Signals may anticipate symptoms but not necessarily explain or predict system transitions	[53,54]

Table 2. Structural, epistemological, and operational constraints of agricultural digital twins.

Dimension	Core Limitation	Underlying Mechanism	Scientific Implication	Practical Consequence	References
Structural	Reduction in complex systems to measurable variables	Exclusion of latent processes (microbiome, roots, biodiversity	Inability to capture emergent dynamics and non-linear feedback	Limited reliability under dynamic or extreme conditions	[55]
Structural	Partial observability of agroecosystems	Dependence on proxy variables (e.g., spectral indices)	Epistemological gap between observed and real system	Model outputs reflect incomplete system representation	[53,54]
Epistemological	Illusion of precision	High-resolution simulations masking uncertainty	Overconfidence in model outputs	Risk of misinformed decision-making	[63]
	Lack of causal interpretability	Reliance on correlation-based models	Weak explanatory power	Limited transferability and robustness	[64]
	Poor generalization under novel conditions	Models trained on historical datasets	Failure under climate variability or new contexts	Unreliable predictions in future scenarios	[21]
Operational	High infrastructure and data requirements	Need for continuous data streams and computing capacity	Unequal accessibility of technology	Digital divide between farming systems	[17]
	Limited user adoption and trust	Complexity of outputs and lack of transparency	Misalignment between tool design and user needs	Underutilization or misuse of digital twins	[47]
	Data ownership and governance issues	Concentration of data in proprietary platforms	Power asymmetries in digital agriculture	Dependence on external actors	[18]

Table 3. Scientific controversies in agricultural digital twins.

Controversy	Dominant Assumption	Emerging Evidence	Scientific Implication
Data–model relationship	More data improves prediction	Noise amplification and instability [21]	Non-linear performance response
Model validation	Calibration = validation	Limited generalization across contexts [22]	Overestimated model reliability
AI integration	Higher accuracy = better models	Loss of interpretability [48]	Epistemological opacity
Early detection	Spectral signals reflect biological processes	Proxy ambiguity [23]	Weak causal inference
System representation	Models capture system dynamics	Partial observability [7]	Epistemological gap

Table 4. Key research gaps and future priorities.

Research Gap	Current Limitation	Priority Direction	Expected Impact
Model generalization	Site-specific calibration	Cross-scale and cross-site validation	Improved robustness
Data integration	Fragmented data streams	Standardized interoperability frameworks	Scalable digital twins
Interpretability	Black-box AI models	Hybrid mechanistic–AI approaches	Better decision support
Uncertainty handling	Deterministic predictions	Probabilistic and uncertainty-aware models	Reliable forecasts
Multi-scale modeling	Single-scale models	Integrated multi-level systems	Realistic system representation
Decision frameworks	Prediction-focused	Resilience and adaptation-oriented models	Sustainable mana

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

El Jarroudi, M.; Tychon, B.; Lahlali, R. Digital Twins in Agriculture: From Technological Promise to Epistemological Tension in Complex Agroecosystems. Agriculture 2026, 16, 1286. https://doi.org/10.3390/agriculture16121286

AMA Style

El Jarroudi M, Tychon B, Lahlali R. Digital Twins in Agriculture: From Technological Promise to Epistemological Tension in Complex Agroecosystems. Agriculture. 2026; 16(12):1286. https://doi.org/10.3390/agriculture16121286

Chicago/Turabian Style

El Jarroudi, Moussa, Bernard Tychon, and Rachid Lahlali. 2026. "Digital Twins in Agriculture: From Technological Promise to Epistemological Tension in Complex Agroecosystems" Agriculture 16, no. 12: 1286. https://doi.org/10.3390/agriculture16121286

APA Style

El Jarroudi, M., Tychon, B., & Lahlali, R. (2026). Digital Twins in Agriculture: From Technological Promise to Epistemological Tension in Complex Agroecosystems. Agriculture, 16(12), 1286. https://doi.org/10.3390/agriculture16121286

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Digital Twins in Agriculture: From Technological Promise to Epistemological Tension in Complex Agroecosystems

Abstract

1. Introduction

2. Bibliometric Overview of the Field

2.1. Scope and Procedure of the Bibliometric Screening

2.2. Bibliometric Evolution of Research on Digital Twins in Agriculture

2.3. Thematic Structuring and Conceptual Reorientation of the Field

3. Conceptual Foundations: Engineering Paradigms vs. Agroecological Reality

4. Digital Twin Architectures: Integration Challenges and Fragmentation

5. Applications of Digital Twins: Performance, Context-Dependency, and Structural Limitations

6. Critical Analysis: Structural, Epistemological, and Operational Limits of Agricultural Digital Twins

7. Research Gaps, Scientific Controversies, and Future Directions

8. Conceptual Framework Development and Figure Construction

9. Limitations of the Conceptual Framework

10. Beyond Technological Optimism: A Critical Reappraisal of Digital Twins in Agriculture

11. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI