Application and Evaluation of a Simple Crop Modelling Framework: A Case Study for Spring Barley, Winter Wheat and Winter Oilseed Rape over Ireland

: Globally, croplands represent a signiﬁcant contributor to climate change, through both greenhouse gas emissions and land use changes associated with cropland expansion. They also represent locations with signiﬁcant potential to contribute to mitigating climate change through alternative land use management practices that lead to increased soil carbon sequestration. In spite of their global importance, there is a relative paucity of tools available to support ﬁeld-or farm-level crop


Introduction
In Ireland, the dominant agricultural land use is grasslands (91%) with croplands representing less than 10% of the agriculture land [1].The predominance of grassland is largely due to the near year-round conditions favorable for grass growth, associated with the mild maritime climate experienced here, which results in lower input costs for dairy production over other agricultural systems.Consequently, crop production is typically confined to the lighter soils and warmer and drier conditions associated with the east and south-east of Ireland.The main tillage area is dominated by cereal crops Spring Barley (Hordeum vulgare L.; hereafter referred to as SB) and Winter wheat (Triticum aestivum L.; WW).SB provides a valuable source for animal feed and the malting industries with a land area of 128,720 ha and an average yield of 7.2 t•ha −1 [2] and WW is the most economically viable crop grown here with an area of 60,400 ha and an average yield of 9.5 t•ha −1 [3].Winter oilseed rape (Brassica napus L.; WOSR) is a more marginal crop, covering approximately 8500 hectares, with an average yield of 4.0 t•ha −1 , and is typically grown as a break crop for WW.While the dominant contribution from agriculture to Ireland's natural greenhouse gas emissions (GHG) comes from the dairy sector, reflecting its dominance in terms of area and economic activity, croplands represent a relatively small GHG source, mainly due to activities associated with cultivation and fertilizer use [4,5].In spite of the relatively small impact to the overall national emissions, this sector has potential to make an important contribution to reducing agricultural emissions.
While process-based crop models are potentially useful tools for quantifying both production and carbon budgets [6][7][8][9], and how these relate to weather, climate and environmental conditions, there are challenges related to the application of these models at the regional level.These challenges are further compounded by a lack of readily available in situ data in order calibrate and evaluate these models.A number of crop models have been developed, which vary in terms of their driving mechanisms and complexity [10].For example, GECROS [11] and SUCROS [12] are illustrative of carbon-focused models; radiation-based models include SAFY [13], SAFY-WB [14], STICS [15], CERES [16], and water focused models include AqYield [17] and Aquacrop [18].This list is not exhaustive but represents some of the main crop models currently available for use, depending on application.In addition, these models are generally employed as research tools and require a level of expertise in order to employ them.
Models can be further classified depending on whether they simulate a crop using a generic approach or those that are specific to certain crops.For example, Aquacrop is a generic crop growth model that utilizes crop specific parameters.In contrast, SuMoToRI [19] and LINTUL-BRASNAP [20] are specific to a single crop type, namely WOSR.While the more general crop models, such as STICS and Aquacrop, have found widespread use they are relatively complex models and require a number of parameters to be calibrated prior to application; therefore their suitability for use at the regional level or in potentially data sparse regions is challenging [21,22].As an alternative, simple semi-empirical models, such as the Simple Algorithm for Yield Estimate (SAFY) model [14], which require a reduced number of inputs and parameters, offer potential for more widespread use and application, can be readily applied, and are potentially suitable for scaling to the regional level.However, similar to all models, they require appropriate testing and evaluation.While a number of empirical and semi-empirical based models have been developed for specific crop types [23][24][25][26], the development of a 'common' model framework which is readily adaptable has been missing, therefore limiting their widespread use in new locations and/or for different crop types.
The SAFY crop model was originally developed by [13] and the model was initially evaluated on WW fields in dry land areas in Morocco.The basic version of the model simulated crop growth as green area index (GAI), dry above-ground matter (biomass) and dry grain matter (yield).Due to the simple model architecture, which is modular in design, the SAFY model has been widely adapted by the research community and applied in different environments and crops [24,[27][28][29][30][31].Refs.[14,24] added a soil-water balance module to simulate the effect of the water stress on the crop growth and production.Refs.[25,32] added a module to simulate the carbon budgets in addition to a soil-water balance module.Additional modifications have focussed on GAI, an important biophysical variable that can describe the crop growth and a state variable in the SAFY model.GAI can be measured on the ground by different sensors [33], collected by destructive sampling methods [33] or, can be retrieved from remote sensing data using empirical or processbased/hybrid methods [21,34,35].For example, Refs.[29,31] integrated remote sensing data into the SAFY model to update the crop-state variable GAI during the assimilation process, leading to improved model estimates.
For the research presented here, we sought to evaluate the use of the SAFY model [13], with modified soil-water balance module [24,31] and carbon fluxes module [25,32] to simulate crop growth, biomass, grain yield and components of the water and carbon fluxes for the main crop types grown in Ireland and for which we had access to published or available in situ data.The decision to employ the specific model was based around the model architecture, which can be readily modified and adapted, and the limited number of parameters required to run the model.Given the model's simplistic modelling approach, it has been successfully applied for estimation of crop growth, above ground biomass and grain yield using limited computational resources which is usually a challenge at spatial level application of complex crop models and makes it even more attractive for practical applications.In addition, the model offers the potential to use and deploy in an operational capacity, with limited user intervention.We also sought to contribute to the model development, through the addition of a new water stress function and modifications of components in the energy and carbon flux modules.The soil-water balance module, specifically the water stress function, is detailed in the methods section and latent heat is derived from the actual evapotranspiration in the context of the Irish soils that have been comparatively less studied.The modified model was then evaluated on three maincrop types in Ireland, namely WW, SB and WOSR.Thus, the objective of the work was to adapt a previously developed model, enhancing its capacity to be deployed in a broader range of applications, and evaluate the model for use in a maritime climate regime.

Materials and Methods
The input datasets, management data and model are described in the following sections.

Context
The island of Ireland experiences a mild maritime temperate climate associated with the westerly moisture-laden airflow off the North Atlantic [36].Based on the daily mean climatology (1981-2010), the maximum temperature across the region is between 18 • C and 20 • C in summer, and winters are generally mild, around 8 • C; only occasionally do temperatures drop below 0 • C. Mean annual rainfall is approximately 1200 mm, distributed throughout the year.The higher rainfall amounts of 1000-1400 mm mainly occur in the west, associated with proximity to the ocean, and may exceed 2000 mm in the upland areas due to influence of topography.In contrast, the east experiences lower rainfall amounts of between 750-1000 mm.More details on the background climate of Ireland are provided in [37].In the context of general soil information, the main tillage area in Ireland, located in the south-east, is characterized as mostly freely draining sandy soils, while the midlands and south are dominated by limestone rich soils, with peat soils dominate on the hills, mountains and the western fringes of the country [38].

Study Locations
In situ data, collated from previously published, and unpublished, studies, was available for five locations, representing SB, WW and WOSR.With the exception of the northern-most site located in Crossnacreevy, Co. Down, all sites are located in the main tillage growing areas along the east and south-east of Ireland (Figure 1).Field data for the SB site located in Duncormick County Wexford, included GAI and biomass for the period 2011-2013.For WW, yield data for the period 2013-2015 were available for three sites, located in Oakpark, County Carlow, Killeagh County Cork and Crossnacreevy, County Down (Northern Ireland), respectively.The WOSR experimental site, located in Gowran Co. Kilkenny, had measurements of GAI, biomass, and respiration data for the period 2015-2016.Table 1 outlines the main crop events for the selected study sites.
distributed throughout the year.The higher rainfall amounts of 1000-1400 mm mainly occur in the west, associated with proximity to the ocean, and may exceed 2000 mm in the upland areas due to influence of topography.In contrast, the east experiences lower rainfall amounts of between 750-1000 mm.More details on the background climate of Ireland are provided in [37].In the context of general soil information, the main tillage area in Ireland, located in the south-east, is characterized as mostly freely draining sandy soils, while the midlands and south are dominated by limestone rich soils, with peat soils dominate on the hills, mountains and the western fringes of the country [38].

Study Locations
In situ data, collated from previously published, and unpublished, studies, was available for five locations, representing SB, WW and WOSR.With the exception of the northern-most site located in Crossnacreevy, Co. Down, all sites are located in the main tillage growing areas along the east and south-east of Ireland (Figure 1).Field data for the SB site located in Duncormick County Wexford, included GAI and biomass for the period 2011-2013.For WW, yield data for the period 2013-2015 were available for three sites, located in Oakpark, County Carlow, Killeagh County Cork and Crossnacreevy, County Down (Northern Ireland), respectively.The WOSR experimental site, located in Gowran Co. Kilkenny, had measurements of GAI, biomass, and respiration data for the period 2015-2016.Table 1 outlines the main crop events for the selected study sites.

Evaluation Data: In Situ
The available in situ data, used to evaluate the model, were obtained from previous studies.For SB, measured GAI and biomass data for three years 2011-2013 was obtained for Duncormick, Co. Wexford, with an average yield of 8.5 t•ha −1 (850 g•m −2 ) [39].Observed WW yield data and sowing dates were taken from [23].These were used to evaluate the simulated biomass and yield data for WW for three sites (Table 2; Figure 1), WW growth stage GS61, which refers to stage of the start of flowering (first anthers visible) [3] in presented in Table 2. Evaluation data for WOSR at Gowran site were obtained from a previously published study [40].

Site Soil Information
Soils in Ireland are comparatively young and very heterogeneous, are less studied and lack detailed information on soil properties, including water content, and soil profile information.Consequently, model-derived soil information was obtained from the global SoilGrids (https://soilgrids.org)(accessed on 17 November 2022) database [41].SoilGrids produces global maps of soil properties using different machine learning approaches, trained on soil observation from 240,000 locations worldwide [42].It contains estimates of soil textural properties (sand, silt, clay and bulk density) for six different depth intervals (to 100 cm) in addition to information on soil-water content at field capacity (FC), saturation (SAT) and permanent wilting point (PWP) [43][44][45], derived using pedo-transfer functions and the data are available at 1 km and 250 m.We obtained the soil texture information and soil-water parameters at 250 m for each available depth.For the purposes of the subsequent modelling and in the absence of site-specific soil information, we only consider the bulk soil properties.Therefore, we aggregated the available layers, using a weighted average, to derive a mean depth profile value for FC, SAT and PWP.

Meteorlogical Data
Daily meteorological data was obtained from Met Éireann, the Irish national meteorological service, for Ireland and from the UK Met Office Integrated Data Archive System (MIDAS) for Aldergrove, located in Northern Ireland.Daily maximum and minimum temperature, global radiation and precipitation were obtained for the years 2011-2013 for JC, 2013-2016 OP and 2013-2015 for AG, RP (Table 1).Reference evapotranspiration was derived based on the FAO-Penman Monteith equation.

Model Overview
A schematic overview of the model is presented in Figure 2. All the simulated components and related equations of the flowchart are described in the following sections.The model framework used in this work was originally developed by [13] and was initially tested for WW.The original model is based on the Monteith light use efficiency (LUE) theory [46] and its relationship to biomass.Within the model, biomass is separated into leaf and non-leaf components using a partitioning function and specific leaf area is used to simulate crop growth.Grain yield is simulated using a rate of grain-filling parameter.The model simulates GAI, biomass, and yield on a daily time step, consistent with the driving meteorological inputs.The basic equations for simulating crop biomass and yield are shown in Equations ( 1)- (5).Equation (4a,c) shows the temperature stress function.
where, DAM is the daily accumulated biomass, APAR is the Absorbed photosynthetically active radiation computed as a product of the global radiation, climatic efficiency parameter and the fractional absorbed photosynthetic active radiation (FAPAR), calculated using Beer's law.Equations ( 2) and (3) show the calculations for APAR and the use of Beer's law.F T (T a ) is a temperature stress function (Equation (4a-4c), T a is the mean air temperature, and WST is the water stress coefficient (Equation ( 11)) computed from the soil-water balance module.WST is not in the basic version of the SAFY model, it is added here to take water stress into account when estimating crop biomass.
where, R g is the global radiation in Mj.m −2 and c is a climatic efficiency parameter.
where, k ext is the light extinction coefficient.
where, T opt is the optimal temperature for the crop, T min and T max are the minimum and maximum temperature thresholds beyond which crop growth ceases.T a is the mean air temperature and β is the degree of the polynomial.
where, ∆Yield is the daily change in yield and P y is a grain filling factor.The soil-water balance module (SAFY-SW), adapted from [24,29], utilizes a simple 'leaky bucket' model to simulate soil-water balance and includes inputs in the form of precipitation, and losses, in the form of runoff, evapotranspiration (ETR) and deep percolation or drainage.The water balance model assumes no unsaturated flow, and capillary or upward flow only occurs as a result of soil evaporation.A root-growth sub-module was adapted for SAFY to aid the simulation of root water uptake, which is used to compute the plant transpiration [24,31].Because of the availability of field specific soil information, Refs.[24,29,31] implemented the soil-water balance module for a 5-layer soil profile.We adapted the same architecture for estimating water movement in the soil and water stress during the growing season but utilizing the bulk soil information and a mean soil profile, due to the absence of site specific soil information.Equations ( 6)- (11) shows the equations in the soil-water balance module.
Potential evapotranspiration (PET) is computed as 1.1 times the equilibrium evaporation rate (EET), following [47] and EET is calculated using the Priestly-Taylor equilibrium equation, using albedo (α), global radiation (R g ) and daily mean temperature (T avg ).The Priestly-Taylor equation only requires a limited number of meteorological inputs.
PE is potential evaporation, PT is potential transpiration, and is computed based on the daily estimated GAI. Potential root water uptake (PRWU) is derived using Equation (10).
SW is the soil-water content, WP is water availability at wilting point and RLV is the root length density.The actual evapotranspiration (AET) is equal to PT, if PRWU is greater than or equal to PT, i.e., no water stress condition.Where PRWU is less then PT crop transpiration demand is not satisfied and the crop is considered to be experiencing water stress (WST), computed as the ratio of AET to PT.
An equation is also added to translate AET (in mm.day −1 ) to a latent heat (LH) flux in Watts/m 2 Equation (12).This should enable the model to be compared to equivalent measurements obtained from a flux tower or similar instrument, if available at a site and is an important component of the energy balance.LH = 28.94× AET ( A carbon flux module was previously implemented within the SAFY model.The approach estimates the Gross Primary Productivity (GPP) similar to Equation (1) with a modification to the ELUE parameter and inclusion of sR 10, to account for the fraction of the green tissues in the senescence stage.Here, we replaced ELUE by fELUE, a product of ELUE and radiation use efficiency (RUE).RUE takes into account the diffuse fraction of radiation that is not usually considered in crop models, but studies have shown that RUE can be significant in the estimation of productivity; ignoring the parameter may lead to an inaccurate estimation of the fluxes.Equations ( 13)-( 16) outline the equations used to derive GPP [25,32].
R df /R g is highly correlated with the R g /R a where R df is the diffuse fraction of the radiation, R a is the extra-terrestrial radiation and R g is the global radiation downwards.R df /R g is computed using the equations as suggested by [48].
Net ecosystem exchange (NEE) is computed as the difference between NPP and heterotrophic respiration (R h ), calculated as follows T s is the soil temperature.Reco = Ra + Rh (23) Ecosystem respiration (R eco ) is calculated as the sum of autotrophic (Ra) and heterotrophic (Rh) respiration.
The list of all parameters employed in the model are outlined in the Supplementary Material Tables S1 and S2.

Model Evaluation
In order to determine if the model was capable of reproducing the available observations at the study locations, we employed a range of statistical measures.We used the R 2 , root mean square error (RMSE) and relative RMSE (RRMSE%) metrics to quantify the accuracy of the simulated variables with respect to the observed variables.The statistical metrics were computed in Python 3.6.5 environment.Observed/measured quantities are typically plotted on the x-axis, except where time (DOY) appears on the x-axis; simulated/estimated variables are plotted on the y-axis for figures provided in the results.

Winter Wheat
For WW, the model generates reasonable estimates of biomass accumulation (R 2 = 0.71, RMSE = 1.81 t•ha −1 and RRMSE = 8.0%).In contrast, the model did not predict grain yield with any degree of accuracy (R 2 = 0.09, RMSE = 2.33 t•ha −1 and RRMSE = 18.6%), with estimated values showing wide variation between years (Figure 5a,b biomass and grain yield).If two anomalous observations from Crossnacreevy and Carlow are removed, the model fit for yield becomes statistically significant (R 2 = 0.85, p < 0.004).However, as we do not have access to the raw data it is not possible to exclude these values just to improve the model fit.The model estimated date for growth stage GS61, along with the observed date, is presented in Table 2.The model estimated date for growth stage GS61 ranges from between 0.5% to 12% difference, relative to the observed dates.

Winter Oilseed Rape
For WOSR, there was close agreement between the observed and simulated biomass (R 2 = 0.99 and RMSE = 0.52 t•ha −1 ) but the model performed less well for predicting GAI (Figure 6a) (R 2 = 0.30 and RMSE = 0.98 m 2 •m −2 ) (Figure 6b).As there were limited available measurements of respiration for this crop and site, we were able to undertake a preliminary evaluation of the model for these variables.The comparisons between observed and simulated ecosystem respiration (Reco) and soil respiration (Rh) are shown in Figure 7a,b, respectively.The model was found to systematically overestimate Reco and resulted in a spread in model estimated values, relative to the observations, which is reflected in the poor evaluation metric scores (R 2 = 0.18 and RMSE = 1.01 g•C•m −2 •day −1 ).However, good agreement was found between the measured and estimated Rh (R 2 = 0.52 and RMSE = 0.28 g•C•m −2 •day −1 ) over the six-month period in 2016 for which observations were available.The simulated Reco and observed Rh showed similar ranges from January to March, however, simulated Reco values exceeded Rh values by up to 3-fold from April to June.

Winter Wheat
For WW, the model generates reasonable estimates of biomass accumulation (R 2 = 0.71, RMSE = 1.81 t•ha −1 and RRMSE = 8.0%).In contrast, the model did not predict grain yield with any degree of accuracy (R 2 = 0.09, RMSE = 2.33 t•ha −1 and RRMSE = 18.6%), with estimated values showing wide variation between years (Figure 5a,b biomass and grain yield).If two anomalous observations from Crossnacreevy and Carlow are removed, the model fit for yield becomes statistically significant (R 2 = 0.85, p < 0.004).However, as we do not have access to the raw data it is not possible to exclude these values just to improve the model fit.The model estimated date for growth stage GS61, along with the observed date, is presented in Table 2.The model estimated date for growth stage GS61 ranges from between 0.5% to 12% difference, relative to the observed dates.

Winter Wheat
For WW, the model generates reasonable estimates of biomass accumulation (R 2 = 0.71, RMSE = 1.81 t•ha −1 and RRMSE = 8.0%).In contrast, the model did not predict grain yield with any degree of accuracy (R 2 = 0.09, RMSE = 2.33 t•ha −1 and RRMSE = 18.6%), with estimated values showing wide variation between years (Figure 5a,b biomass and grain yield).If two anomalous observations from Crossnacreevy and Carlow are removed, the model fit for yield becomes statistically significant (R 2 = 0.85, p < 0.004).However, as we do not have access to the raw data it is not possible to exclude these values just to improve the model fit.The model estimated date for growth stage GS61, along with the observed date, is presented in Table 2.The model estimated date for growth stage GS61 ranges from between 0.5% to 12% difference, relative to the observed dates.

Winter Oilseed Rape
For WOSR, there was close agreement between the observed and simulated biomass (R 2 = 0.99 and RMSE = 0.52 t•ha −1 ) but the model performed less well for predicting GAI (Figure 6a) (R 2 = 0.30 and RMSE = 0.98 m 2 •m −2 ) (Figure 6b).As there were limited available measurements of respiration for this crop and site, we were able to undertake a preliminary evaluation of the model for these variables.The comparisons between observed and simulated ecosystem respiration (Reco) and soil respiration (Rh) are shown in Figure 7a,b, respectively.The model was found to systematically overestimate Reco and resulted in a spread in model estimated values, relative to the observations, which is reflected in the poor evaluation metric scores (R 2 = 0.18 and RMSE = 1.01 g•C•m −2 •day −1 ).However, good agreement was found between the measured and estimated Rh (R 2 = 0.52 and RMSE = 0.28 g•C•m −2 •day −1 ) over the six-month period in 2016 for which observations were available.The simulated Reco and observed Rh showed similar ranges from January to March, however, simulated Reco values exceeded Rh values by up to 3-fold from April to June.

Discussion
Here, the original SAFY model framework was adapted to include a modified soilwater balance and carbon fluxes module to simulate crop growth, biomass and yield, and components of the water, energy and carbon budgets.The soil-water balance module adapted in this work was modified to a single layer soil profile to represent mean soil profile in the absence of the site specific information-while this represents a simplification, it is unlikely that many potential study locations would have access to detailed soil profile information and thus represents a pragmatic approach.The modified model was applied to selected locations in Ireland for which there was previously published crop data available, specifically for SB, WW and WOSR.The crop model was subsequently evaluated against the available measurements obtained from these sources and studies [23,39,40].

Spring Barley (SB)
The model simulates the crop growth, biomass, and yield from the day of emergence to the day of complete senescence (Figure 3a-3f).The performances for simulating the above ground biomass and GAI are within the range of those of other studies.[50] implemented the SAFY model to estimate barley yield in a semi-arid region in Central Tunisia and obtain a good correlation between the measured and simulated yield with R 2 = 0.77.The model fit was attributed to the integration of the in-season SPOT/HRV remote sensing data and the local calibration of the most sensitive model parameters.In the study [51] barley biomass was estimated with an R 2 ranging from 0.39 to 0.68 and RMSE ranging from 0.42 kg•m −2 -0.83 kg•m −2 (420 g•m −2 -830 g•m −2 ) using the crop surface models-generated by (a) mosaicking of collected UAV images (b) point cloud generation (c) digital surface model (DSM) export, the main limiting factors were attributed to the four lodging cultivars 10,11,12,14.Ref. [52] employed the Sirius wheat model to estimate SB biomass and yield; their model simulated biomass with an RMSE = 1.4 t•ha −1 (140 g•m −2 ) and yield with an RMSE = 1.3 t•ha −1 (130 g•m −2 ).The model was also able to simulate the differences in barley growth due to its mechanistic treatment of canopy development.
Ref. [53] assessed the performance of nine different process-based crop models during 44 growing seasons of barley in northern and central Europe.Large uncertainties were found in estimating barley yield across sites and years, while the mean predictions from the nine models agreed well with the observations, the uncertainties were attributed to limited calibration of the models.In terms of simulated total above-ground biomass, models followed a slightly different order compared to the simulation of grain yields.Ref. [54] estimated biomass with median R 2 = 0.62; RMSE = 1.63 t•ha −1 (163 g•m −2 ); nRMSE = 14.9% and LAI with median R 2 = 0.92; RMSE = 0.3 m 2 •m −2 ; nRMSE = 7.1% for barley crop at three sites in Germany using the remote sensing and random forest modeling approach, LAI could be predicted with high accuracy because of the strong correlation of reflectance in the visible spectrum and leaf pigments per area.
Our modeling approach was found to be statistically significant (p < 0.01) for simulating biomass and GAI (Figure 4a,b); there are a few parameters that are variable and need to be manually fixed before its application, the date of sowing is one of them.Date of sowing SB in 2011 was 25 March and 3 April in 2012 and 2013.As illustrated in Figure 3a-f the simulated date of emergence is DOY 100, which represents 10 April.The simulated date of emergence for 2012 and 2013 is DOY 125 or 25 April, calibrating this parameter is important as the model starts simulating fluxes from the day of emergence.Where the actual date of sowing can be specified within the model, the model is clearly capable of simulating GAI and biomass (e.g., Figure 3) for SB.Biomass was simulated with high accuracy for all three years of available data; however the accuracy was somewhat compromised in estimating GAI.GAI is estimated using the leaf partitioning coefficients (P la and P lb ) on biomass.This indicates that the sensitive parameters need to be calibrated locally or inferred from remote sensing data.Error may also have been introduced in obtaining the measurements of GAI on the ground, specifically for the year 2012, where the unusual measurement is clearly visible (Figure 3c).In comparison to studies [51][52][53][54][55] the model adapted in this work was found to perform reasonably in simulating biomass during all the crop growth stages (Figure 3a-f).Simulated grain yield was estimated to between 6-8 t•ha −1 (600-800 g•m −2 ); the average reported yield from the field experiments for the period for which data was available is 8.5 t•ha −1 (850 g•m −2 ).The model simulated grain yield is close to the observed grain yield.In our study we used a constant value of 0.6 to estimate the grain yield from the above ground biomass.Calibrated values of this parameter would likely result in improved model estimates.

Winter Wheat (WW)
The model was subsequently applied to WW crop for three sites over the period 2013-2015-including Crossnacreevy, Carlow and Kileagh.Grain yield is derived by multiplying the daily biomass with the grain filling factor which is assumed to be a constant 0.6 for this study.In our validation for WW crop there are two observations of the grain yield that cause high errors, and if these two observations are removed, then our model shows high correlation with the observed values and becomes statistically significant (R 2 = 0.85, p < 0.01), As we do not have access to the ground collected raw data, we could not assess the quality of these data points and so cannot simply remove the observation to improve the model accuracy.There are two other possibilities for the low accuracy achieved in estimating grain yield through this approach, first the grain filing factor constant value may not be representative and requires further investigation; the other possible reason may be the lack of available data to more robustly evaluate the model.[23] developed a winter wheat yield potential model (WWYPM) for different sites in Ireland and evaluated against WW crop yield.They obtained an R 2 = 0.37 for biomass and R 2 = 0.19 for yield.The model adapted in this work represents an improvement for biomass, but with a lower R 2 for yield, which was also poorly estimated in [23].It is clear that the model estimate for WW yield specifically needs to be investigated further and evaluated, subject to alternative in situ data becoming available.In spite of the poorer model performance for yield, the estimated timing of GS61 closely matched the available observations (Table 2).The % differential ranges from 0.5-12% with respect to the GS61 stage of WW (data not shown).[21] calibrated the Aquacrop model parameters for wheat using the canopy cover derived from the Venµs satellite images for the crop season 2017-2018 and found R 2 = 0.51; RMSE = 683 g•m −2 ; RRMSE = 68.45%for biomass and R 2 = 0.11; RMSE = 151 g•m −2 RRMSE = 26.9% for grain yield on validating Aquacrop model for 2018-2019 year on wheat fields in Maccarse, Italy.Ref. [31] estimated wheat yield with R 2 = 0.66 by integrating crop model with the Sentinel-2 remote sensing data.Ref. [13] applied the basic SAFY model to the WW crop in Haouz plain, Marrakech city and found that the model underestimated yield by almost 0.5-0.9t•ha −1 when compared to field measurements, which they attributed to differences in growing conditions between the plots used for calibration and evaluation.Ref. [30] integrated in-season remote sensing data into the SAFY model to improve the GAI, biomass and yield estimates.The model was applied to three different years 2013-2015 for WW crop in China, the study found 20%, 17%, and 15% RRMSE on validation with measured yield estimates, averaging to 18% over the three years studied, which is similar to what we found in our work (18.6%).

Winter Oil Seed Rape (WOSR)
In situ data were also available for WOSR at Gowran, Co. Kilkenny.The crop was sown in September 2015 and harvested the following year in August 2016 [40].A benefit of this site was the availability of both ecosystem and heterotrophic respiration.Ref. [56] assimilated optical and SAR data into the SAFY model for the estimation of GAI and biomass for rapeseed over three years in France, the model making use of the in-season remote sensing data derived GAI from Sentinel-1, Sentinel-2 and Landsat-8 simulated GAI with R 2 ranging from 0.84-0.92and RRMSE from 28-41%.Our modeling approach represents an improvement in simulating biomass but not GAI, the reason maybe as described above, local calibration of partitioning coefficients (Supplementary Materials, Table S2) that partitions biomass into leaf and non-leaf biomass.Daily GAI growth and senescence is derived by multiplying daily biomass with the leaf partitioning coefficients (PL a and PL b ) and specific leaf area parameter.In the assimilation approach these parameters and the other variable parameters (Supplementary Material) can be optimized with in-season remote sensing data.Ref. [57] estimated rapeseed LAI using the empirical equations on UAV multispectral images at different growth stages, the study concludes that LAI estimation at seedling and flowering stages have high errors in comparison to the elongation stage and it may also be a probable reason for the underestimation of GAI single point observation in our work (Figure 6a).The carbon fluxes module of the model simulated the components of the carbon budgets.Due to lack of observed data, we have compared the modelled simulated ecosystem respiration that includes aboveground autotrophic and soil respiration with the observed ecosystem soil respiration.The soil chambers used in the study were inserted into the interrow spacing between the plants but did not include plants, therefore CO 2 flux consisted predominantly of heterotrophic respiration with a minor contribution of autotrophic root respiration.The large difference in CO 2 emissions between the modelled respiration and measured soil respiration values thus can be attributed to the aboveground autotrophic respiration not captured by the soil chambers (Figure 7a,c).Soil respiration was also measured in bare soil without vegetation to quantify CO 2 emissions only of heterotrophic origin.In contrast, the model simulated heterotrophic respiration was compared with the observed heterotrophic respiration and a low error with improved correlation was found compared to the ecosystem respiration model (Figure 7b,d).Ref. [25] simulated the components of the carbon budget for sunflower in France, the study shows ecosystem respiration was simulated with R 2 = 0.83 and RMSE = 0.96 g•C•m 2 •day −1 , higher than that found in this study.Additional evaluation data are required to confirm if the model is capable of producing more reliable estimates of ecosystem respiration.In a previous study [58], the CERES model was evaluated at several European flux sites, the model estimated components of carbon budget with RMSE = 30 g•C•m 2 •day −1 .In the present study, heterotrophic respiration is somewhat underestimated in comparison to the observed heterotrophic respiration as it will be limited by temperature, moisture and C substrate (Figure 7b).
Overall, the model was found to estimate biomass across all crops and sites.Biomass is a function of APAR, ELUE and is constrained by both temperature (F T T a ) and water (WST) stress.However, there are some uncertainties in simulating GAI and yield.For the simulation of GAI, the best model estimates were associated with SB, likely due to the availability of an optimal suite of model parameters for this crop.However, for WW and WOSR the parameter information was insufficient or not available and consequently lead to some uncertainties in simulating GAI.For yield, the harvest index or grain filling factor needs to be calibrated based on available data.
In this work, we found that the model effectively simulates the crop growth, biomass, yield and components of the carbon budgets.However, there are some challenges related to the calibration of the model sensitive parameters.More validation data is required to test the model for different crops and years.In the case of the non-availability of the calibration data and upscaling the model, multi-source remote sensing data can be included into the model, such as [24,[29][30][31].There are available a few generic crop models that allows simulating growth, biomass and yield for multiple crops, example Aquacrop [18], STICS [15] etc., but are computationally expensive and requires calibration for a large number of parameters, which is usually not possible.In comparison, the model in our work has a limited number of required parameters, is not computationally expensive to run and can be readily adapted as a common tool and deployed in a larger application for simulating crop components, water and carbon fluxes for multiple crops and geographical locations.The model is modular in nature and existing modules can be modified, new modules can be added further enhancing the capabilities of the model, For example, the adapted model can be integrated with the biogeochemical DNDC model to estimate the carbon stocks or the model can be applied to grasslands and forestry applications with some modifications to simulate the canopy gas exchange in the forest ecosystems.

Conclusions
In this work, we applied the simple crop model SAFY to both spring and winter crops in Ireland.We modified the soil-water balance and CO 2 fluxes modules to estimate the water, energy, and carbon budgets along with the crop growth, biomass, and yield at selected study sites.The modified model was evaluated against available in situ data for spring barley, winter wheat and winter oilseed rape.Overall, we found that the model was capable of simulating crop growth (GAI) and biomass.While the model also simulates components of the water, energy, and carbon budgets, we could not validate these at the study sites due to lack of observations.We do intend to evaluate the energy, water, and CO 2 components at available flux tower sites as part of future work.
The model was found to perform poorly in simulating grain yield; there may be two main reasons for this, the lack of validation data and the constant value employed for the grain-filling factor-ideally this should be calibrated from the ground values.There are a limited number of model parameters that require calibration prior to application: date of emergence and effective light use efficiency are two of them.In the absence of measured values, the use of remote sensing data obtained during the crop season can be employed, which removes the need to manually set the parameters, as these can be optimized using data assimilation techniques.As part of the continued development of the model, we aim to integrate remote sensing data-derived information into the model for the spatial application of the model.

Figure 1 .
Figure 1.Location of meteorological stations (AWS) and crop sites (WOSR, WW, SB) for which in situ data was available.Soil map (FAO-UNESCO) and annual precipitation isohyetal are presented in the background).

Figure 1 .
Figure 1.Location of meteorological stations (AWS) and crop sites (WOSR, WW, SB) for which in situ data was available.Soil map (FAO-UNESCO) and annual precipitation isohyetal are presented in the background).

Figure 2 .
Figure 2. Model Overview-Simulated fluxes are highlighted in bold.

Figure 3 .
Figure 3. Measured (black dots) and simulated (colored lines) for GAI (a,c,e), biomass & yield (b,d,f) for SB for 2011 (a,b), 2012 (c,d) and 2013 (e,f).DOY indicates day of year.Grain yield is plotted for illustrative purposes, no in situ data was available for this variable.

Figure 3 .
Figure 3. Measured (black dots) and simulated (colored lines) for GAI (a,c,e), biomass & yield (b,d,f) for SB for 2011 (a,b), 2012 (c,d) and 2013 (e,f).DOY indicates day of year.Grain yield is plotted for illustrative purposes, no in situ data was available for this variable.

Figure 4 .
Figure 4. Simulated and observed (a) GAI and (b) biomass for SB from selected sites for all years.

Figure 5 .
Figure 5. (a) Simulated and measured biomass; and (b) simulated and observed yield for WW crop.Different sizes indicate different years smallest size indicate 2013 and the largest point represents 2015 year.Different counties are represented with different colors as outlined in the legend.

Figure 4 .
Figure 4. Simulated and observed (a) GAI and (b) biomass for SB from selected sites for all years.

Figure 4 .
Figure 4. Simulated and observed (a) GAI and (b) biomass for SB from selected sites for all years.

Figure 5 .
Figure 5. (a) Simulated and measured biomass; and (b) simulated and observed yield for WW crop.Different sizes indicate different years smallest size indicate 2013 and the largest point represents 2015 year.Different counties are represented with different colors as outlined in the legend.

Figure 5 .
Figure 5. (a) Simulated and measured biomass; and (b) simulated and observed yield for WW crop.Different sizes indicate different years smallest size indicate 2013 and the largest point represents 2015 year.Different counties are represented with different colors as outlined in the legend.

Figure 7 .
Figure 7. (a) Simulated and observed ecosystem respiration (Reco); and (b) simulated and observed heterotrophic respiration (Rh).In (c), the black line represents the dynamics of the ecosystem respiration (Reco) and red points observed values; in (d) the black line shows the simulated heterotrophic respiration (Rh) values and blue points represents observed heterotrophic respiration.

Figure 7 .
Figure 7. (a) Simulated and observed ecosystem respiration (Reco); and (b) simulated and observed heterotrophic respiration (Rh).In (c), the black line represents the dynamics of the ecosystem respiration (Reco) and red points observed values; in (d) the black line shows the simulated heterotrophic respiration (Rh) values and blue points represents observed heterotrophic respiration.

Table 1 .
Crop, year, experimental sites, dates on sowing, emergence, harvest and senescence, and weather stations.

Table 2 .
Observed yield (Y), biomass (B), sowing dates and date of growth stage 61 (GS61) for the selected WW study sites and years and model estimated yield, biomass and GS61.