2.2. Socioeconomic Assessment with an Emphasis on Uses of Regional Groundwater for Agriculture
A socioeconomic assessment with an emphasis on the uses of regional groundwater for agriculture in the Palomas-Guadalupe Victoria aquifer (0812) was based on data from the federal Mexican resource in the Public Register of Water Rights (REPDA) provided by CONAGUA up to the year 2021, where land tenure and equity in the allocation of water rights adjudication volumes to different groundwater users were used to calculate the Gini Index [
25]. To obtain the population of the study area, data from the Population and Housing Censuses [
23] were used. These datasets are comprehensive, periodic studies that collect demographic, socioeconomic, and housing information on Mexico’s population at a specific point in time. Groundwater well identifications and their allocated water volumes were obtained from the Public Register of Water Rights (REPDA) provided by Conagua [
26] for the Palomas-Guadalupe Victoria aquifer (0812). This data, updated as of 31 December 2021, provided details on the geographic locations of groundwater wells, the purposes of water use, the allocated pumping volumes per well, and the groundwater adjudication permits [
27]. Additionally, a Geographic Information System (GIS) running under ArcMap v.10.3 by ESRI
® was used as the primary tool for processing and analyzing the collected data. In the initial stage, the data were converted from XLS to SHP format to determine the distribution and concentration of groundwater wells and population, as well as their relationships with other geospatial elements [
27].
Using these data, the second stage of the process involved determining per capita groundwater availability in the Palomas-Guadalupe Victoria aquifer. Several key elements were considered for this calculation. First, the total amount of available groundwater equals the sum of all allocated groundwater volumes. To support our work, it is essential to have an accurate estimate of the amount of groundwater available based on hydrologic mass balance. Second, the total population represents the number of people living in the region comprising the study area, based on current, reliable demographic data. Third, the units for water and population were kept consistent so that the calculation was valid. Water volumes are typically expressed in cubic meters (m
3) or liters, and population refers to the total number of people living in the region under consideration. Finally, the basic calculation was based on the total amount of available groundwater and the total population, usage, and distribution factors. This included the amount of groundwater allocated to different uses, such as agriculture, industry, and domestic consumption. Once these data were obtained, the calculation was carried out following the formula in Equation (1) [
27].
This calculation yielded an average value indicating the amount of water available to each person within the aquifer area. However, it is essential to recognize that groundwater availability is not the only factor in assessing water security; other factors, such as water quality, the sustainability of water sources, and the effective management of water resources, must also be considered.
2.3. Commitment to Our Work
Within the study area, most withdrawals from aquifer 0812 are for irrigated agriculture, whereas the remainder is for domestic use [
22]. The absence of a Smart Farming Technologies (SFT) and Climate-Smart Agriculture (CSA) approach to improve water efficiency in irrigation practices is common in this transboundary watershed. Furthermore, many pecan orchards in this region rely on inefficient flood irrigation. At the same time, other farmers have lost their production potential due to drought and water scarcity, with groundwater depletion threatening their investments. This is relevant to a holistic, common strategy for groundwater sustainability in this binational region, as similar challenges are occurring on the US side at the Los Muertos Basin, where the Mimbres aquifer system is located. In this aquifer, the water table has declined by nearly 27 m over the past 40 years, resulting in changes to the transboundary water table surface [
12]. Furthermore, extensive research has documented the need for binational groundwater information, with substantial evidence supporting the transboundary characteristics of these aquifer systems, particularly with respect to groundwater flow connections, which have been recognized by the International Shared Aquifer Resources Management Agency (ISARM) [
28]. For example, the research region receives groundwater from the Las Palmas and Palomas-Guadalupe Victoria aquifers, highlighting the importance of ensuring sustainable groundwater use on both sides of the US-Mexico border [
22].
The need to implement water-saving irrigation technologies while managing real-time digital data (e.g., climatic temperatures and soil moisture) and applying a Smart Farming Technologies/Climate-Smart Agriculture (SFT/CSA) approach at the farm is critical to the future of agriculture in this binational region. SFT/CSA is a new approach to agricultural production that uses real-time monitoring, gathering digital data to address the need for in-farm information to support decision-making for small-scale farmers [
20]. Furthermore, the use of digital data in agriculture can play an important role when real-time monitoring practices are applied to evaluate soil moisture, soil temperature, and soil salinity, as well as changes in climatic temperatures to monitor climatic conditions for in situ decision-making at the farm, thereby estimating evapotranspiration (ETP) and other crop parameters, such as crop coefficients (Kc). These technical practices for collecting digital data can improve water efficiency and provide cloud-based Internet support for real-time information transfer to farmers and for research and academic purposes. These datasets can be used to model irrigation schedules, fill gaps in real-time information, complete the loop of necessary inputs for real-time irrigation scheduling, and support water conservation and decision-making at the farm. Furthermore, the online availability of digital data can be used to compare and evaluate binational regional climate changes and agricultural water demands, and to assess the implementation of such SFT/CSA approaches on both sides of the border.
This research addressed the lack of scientific research and strengthened different research lines aligned with these interests to promote water conservation, binational data sharing, and the use of real-time digital data in agriculture. To delineate the research site of this paper, we selected a 4-ha parcel, planted in January 2020 in a 10 m by 10 m tree distribution, with a micro-sprinkler irrigation system at a flow rate of 36 lph. This is a multiyear project (10 years) managed annually, starting its sixth year in 2026, during which planned tasks will be carried out once all computers and sensors are installed and connected to the cloud. We also expect multiannual results across the different stages of the project, including data collection and processing, nut production, and product quality. Installing digital piezometric dataloggers for groundwater monitoring, continuing to install soil moisture sensors to assess water availability at sites with changing soil textures, and expanding the use of automated micro-irrigation (drip) systems to improve water-use efficiency are part of the project’s ongoing agenda. Therefore, digital data can play an important role in real-time monitoring of soil degradation, transboundary groundwater depletion, and climatic conditions, thereby supporting in situ decision-making at the farm.
2.4. Data Collection and Processing Procedures
Pecan trees are one of the most important crops in northern Mexico, with annual acreage increments above other types of crops. To help reduce the challenges of producing more food with fewer resources, especially water, in situ monitoring is a key tool for evaluating potential solutions in these rural communities, which face significant criticism for their high-water consumption in pecan production. Therefore, IoT capacity at the farm is essential for implementing and enhancing technological solutions. In México, the approach to humanitarian technology (innovation for all) is considered an important factor in supporting the implementation of such an SFT/CSA approach. Therefore, addressing the technological burden of adopting real-time digital data in agricultural practices and training small farmers to adopt technology holistically are essential to the success of Mexico’s agriculture. Practical applications of capturing digital data at the research site of this study will significantly contribute to a cloud-based “Data Lake” for agricultural production, benefiting the surrounding farms. Such a vision requires computing capacity and real-time solutions to enhance the regional economy and, most importantly, to improve family income among local farmers, thereby reducing the gap between farm investments and productivity while implementing the required technological infrastructure logistics (
Figure 2).
Demand for irrigation water for pecan orchards has increased significantly over the past 20 years, whereas irrigation practices remain deficient due to limited or nonexistent irrigation technologies. Irrigation scheduling is a key tool for water-efficient agricultural practices; therefore, it is important to estimate real-time ETP using meteorological data and to install tensiometers to measure soil moisture and map soil physical parameters. In this project, these data were available via a cloud-based protocol for accessing modeling results after real-time regional ETP and Kc were estimated, and the preliminary results were shared with local farmers. Computer-based tools and cloud computing were used to develop the model for real-time ETP and Kc estimates, which was improved while integrating computer programming protocols, enhancing the data architecture while “cleaning data” through an ETL (Extraction-Transformation-Loading) a process that was applied to better manage our integrated data base from the “Data Lake” reservoir, using an SQL protocol running descriptive and predictive analysis to our process. Despite the advantages of implementing IoT-based monitoring systems for irrigation management, several sources of uncertainty may influence the reliability of the collected data and derived irrigation recommendations. Soil moisture measurements obtained from tensiometers and other in situ sensors are affected by soil heterogeneity, installation conditions, and calibration procedures. Tensiometers measure soil matric potential rather than volumetric water content; therefore, the conversion to plant-available water depends on soil water retention curves that vary with soil texture, structure, and organic matter content (
Figure 3).
Spatial variability within pecan orchards may also lead to localized differences in soil moisture, meaning that a limited number of sensors may not fully capture field-scale variability. Additional uncertainty arises in estimating reference evapotranspiration (ETo or ETP). Meteorological measurements used in evapotranspiration models, such as air temperature, relative humidity, solar radiation, and wind speed, contain inherent sensor errors that may propagate into evapotranspiration calculations [
29]. These measurement uncertainties may result in deviations of approximately 5–15% in daily evapotranspiration estimates. Furthermore, crop coefficients (Kc) used to estimate crop evapotranspiration may vary depending on orchard management practices, tree age, canopy density, and local microclimatic conditions, which may not be fully captured by standardized coefficients reported in the literature [
30]. The use of IoT communication networks and cloud-based data systems introduces additional operational limitations. Wireless data transmission may experience intermittent connectivity or packet loss, leading to temporary gaps in the monitoring dataset. Similarly, long-term sensor performance may be influenced by battery degradation, environmental exposure, and maintenance conditions. Finally, uncertainties may also arise during data processing, particularly during extraction, transformation, and loading (ETL) procedures, where missing data, time-synchronization issues, or sensor outliers may affect modeling results. Consequently, the irrigation scheduling framework developed in this study should be interpreted as a decision-support tool rather than an exact prediction system. Continuous calibration, periodic sensor validation, and the future integration of multiple monitoring points across the orchard are recommended to reduce uncertainties and improve the reliability of real-time irrigation management strategies [
29].
Additionally, agroclimatology enabled the prediction of the behavior of climatological variables and their effects on agricultural production, thereby facilitating decision-making. These practices, such as managing irrigation schedules and estimating crop water stress, help improve water-use practices while pursuing SFT/CSA. To estimate reference evapotranspiration (ETo), Hargreaves and Samani [
5] developed an equation (Equation (2)) using minimal climatological data, accounting for variations in temperature and solar radiation (R0). For this purpose, climatological “data lakes” generated at the Rancho El Regalo (RER) climatological station (
Figure 3) were used, along with a network of humidity and temperature sensors installed in the pecan orchard at the research site.
Subsequently, Equation (2) was used, with a coefficient of transpiration (KT) of 0.162 for inland regions, following the modification proposed by Allen [
31] and accepted by Samani [
4]. Values for solar radiation (R0) were taken from the extraterrestrial solar radiation table, using the average range value for latitudes between 30° and 32°, in the Northern Hemisphere [
4], obtaining the results of the ETo for the years 2021, 2022, and 2023 from Equation (2). In addition, other climatological variables, such as precipitation and relative humidity, were examined to identify trends in their relationships with ETo.
Reference evapotranspiration (ETo) is a key parameter for irrigation scheduling and water balance studies in arid and semi-arid regions, where among the most widely used methods for estimating ETo are the FAO Penman–Monteith and Hargreaves–Samani methods. The FAO Penman–Monteith equation [
31] is considered the standard approach because it incorporates multiple meteorological variables, including solar radiation, air temperature, humidity, and wind speed. In contrast, the Hargreaves–Samani method [
5] is a simplified empirical model that primarily relies on maximum and minimum air temperature and extraterrestrial radiation. The Penman–Monteith method generally provides more accurate estimates of reference evapotranspiration because it accounts for both aerodynamic and energy-balance processes that control evapotranspiration. However, its application requires a complete set of meteorological measurements, which were unavailable at the research site. Therefore, the Hargreaves–Samani method was applied as an alternative, since only temperature data were available, producing ETo estimates that were generally consistent with those obtained using the Penman–Monteith approach. However, because the region experiences periods of high wind speed or elevated vapor pressure deficit, the Hargreaves–Samani method tends to underestimate evapotranspiration because it does not explicitly account for wind speed and humidity. However, the simplified Hargreaves–Samani mode provided reliable estimates using the limited meteorological data available for this study. In arid regions like the study area, previous studies have reported mean absolute errors ranging from approximately 0.5 to 1.5 mm day
−1 when Hargreaves–Samani estimates are compared with Penman–Monteith results [
2,
3,
4]. In the present case, the error magnitude remained within this range, suggesting that the temperature-based method provided an acceptable approximation for operational irrigation management.
The RER meteorological station collected data at 5 min intervals, and maximum (Tmax) and minimum (Tmin) temperatures were recorded daily from January 2021 to December 2025, in contrast to the official temperature estimates for the region [
24]. Climatological data were obtained from a Davis Vantage Pro meteorological station (
Figure 3), which provided precipitation and temperature data to estimate potential evapotranspiration (ETo) using Equation (2). During this period, the temperature at the RER station where the research site is located averaged −2.7 °C in winter, with the highest temperatures recorded in July, peaking at 37 °C (
Figure 4). The meteorological dataset collected at the study site includes continuous monitoring of air temperature from 2021 to 2025, a primary variable for estimating reference evapotranspiration (ETo). Temperature data are essential in widely used evapotranspiration models where air temperature directly influences vapor pressure deficit and the atmospheric demand for water. The temperature monitoring graph for the 2021–2025 period (
Figure 4) illustrates the seasonal variability of maximum, minimum, and average monthly temperatures, showing a consistent climatic pattern typical of arid and semi-arid regions of northern Mexico. The dataset shows a clear seasonal cycle with maximum temperatures reaching approximately 37–40 °C during June–August, which corresponds to the peak evaporative demand period, minimum temperatures frequently ranging between 3 and 8 °C during December–January, reflecting winter dormancy conditions for pecan orchards and average temperatures during the growing season (April–September) generally ranging between 24 and 30 °C. This seasonal behavior strongly influences evapotranspiration dynamics because higher temperatures increase saturation vapor pressure and atmospheric water demand, thereby increasing ET rates. The dataset indicates moderate interannual variability in temperature patterns between 2021 and 2025. For example, the 2023 summer period shows some of the highest observed maximum temperatures, exceeding 40 °C, suggesting enhanced evaporative demand that year. Slightly lower maximum temperatures were observed during 2021 and 2022, although seasonal patterns remained consistent. Such variability is important when modeling irrigation requirements because small temperature differences can lead to significant differences in seasonal crop water demand. The temperature range observed in the dataset (often 12–18 °C between daily maxima and minima during summer) indicates strong diurnal variability, which typically leads to higher evapotranspiration rates in arid climates.
During the summer months (May–September), when pecan trees are in active growth and nut-filling stages, the combination of high temperatures and large diurnal temperature ranges suggests that daily ETo values may exceed 6–8 mm day−1, depending on radiation and wind conditions.
This dataset included 5 years of observations (2021–2025), with 12 months per year, for a total of 60 observations. The variables analyzed included the maximum temperature (Tmax), minimum temperature (Tmin), and average temperature (Tavg), with the statistical analyses shown in
Table 1. The probability distribution of the monthly average temperature for the period 2021–2025 was derived from the CeTraTecAI monitoring station dataset. Because evapotranspiration is strongly temperature dependent, the probability distribution helps estimate the likelihood of high atmospheric water demand. The standard deviation is approximately
σ ≈ 8.3 °C. The fitted normal distribution illustrates the seasonal variability in temperature conditions that influence evapotranspiration demand in pecan production systems in northern México. The interpretation of these data shows that the large temperature range confirms the strong seasonal climate variability typical of semi-arid northern Mexico, whereas the difference between Tmin and Tmax indicates a strong diurnal and seasonal thermal amplitude that strongly influences evapotranspiration processes. For the standard deviation (σ), which quantified temperature variability, the largest variability occurred in Tmin, reflecting strong winter cooling events; Tmax variability was slightly lower but still significant due to seasonal warming.
The statistical reliability of the monitoring dataset helped demonstrate consistent multi-year climatic patterns, statistically stable temperature averages, and significant seasonal variability that influence irrigation demand, whereas the 60-observation dataset provided a statistically robust basis for evapotranspiration modeling and irrigation scheduling. The potential uncertainty associated with energy balance closure in evapotranspiration estimation was addressed by combining meteorological-based evapotranspiration calculations with soil moisture monitoring and water balance validation. Although surface heat flux measurements in arid and semi-arid environments may introduce systematic deviations of approximately 10–20% in the surface energy balance [
16], the reference evapotranspiration used in this study was estimated using standardized methods based on meteorological observations rather than direct eddy-covariance measurements of energy fluxes. To minimize potential bias, evapotranspiration estimates were cross-validated against changes in soil water content within the root zone measured by the in situ sensors. This approach allowed the consistency of evapotranspiration estimates to be evaluated against observed soil moisture depletion during irrigation cycles [
32]. By integrating soil moisture measurements with evapotranspiration modeling, the potential impact of energy balance closure errors on irrigation scheduling was reduced, ensuring that calculated crop water demand remained consistent with observed soil water dynamics under the arid climatic conditions of the study area.
For this study, daily values and monthly average ETo were estimated from digital data transfer and administered under ETL and SQL protocols, which were developed to enhance the project’s “Data Lake” infrastructure. ETL (Extract, Transform, Load) is a data management process/workflow, while SQL (Structured Query Language) is the programming language used within that process to manage and manipulate the databases. The study covered a total area of 4 ha, with two wired tensiometers installed in sandy loam soil at depths of 25 cm and 45 cm (in this area, 375 two-inch-diameter pecan trees were established in 2021 with 10 m × 10 m spacing). ETP calculations using the Hargreaves-Samani Equation [
5] were incorporated into an algorithm to support real-time ET calculation. Additionally, digital piezometric sensors were installed at nearby wells within the watershed to monitor and evaluate groundwater depletion, thereby informing future transboundary management of the binational aquifers. Equipment with cutting-edge technology was installed in 2020 to enable real-time monitoring of soils irrigated under a TORO
® microsprinkler system (36 lph) (The TORO Co. Micro-Irrigation Business, El Cajon, CA, USA), while measurements of temperature to calculate evapotranspiration (ETP) and atmospheric conditions at the pecan orchard were taken using a Davis Weather Station
® and two LSE01 Dragino soil sensors (Dragino Technology, Co., Shenzhen, China), these being installed to measure soil relative humidity, soil temperature, and soil EC. The LSE01 LoRaWAN is a wireless environmental monitoring device designed primarily for precision agriculture and Internet of Things (IoT) applications, and it transmits data via LoRaWAN networks to remote servers or cloud platforms. These characteristics enabled the development of the two sensor installations to monitor soil relative humidity at different depths on the same site. The soil sensors use FDR (Frequency-Domain Reflectometry) to measure the soil’s dielectric constant. The sensors were calibrated according to Hrisko’s methodology [
29], which enabled calibration of the volumetric humidity of the field-collected sample at the specified depths against the sensor voltage using a linear equation (
Figure 5). The sensors were installed on 15 November 2023, and collected daily data until 30 December 2025. Because sandy loam soils exhibit spatial variability in texture and structure, sensor placement was carefully selected to represent typical soil conditions within the monitored orchard. Nevertheless, it is recognized that point-based soil moisture measurements may not fully capture field-scale heterogeneity.
To visualize real-time soil moisture, temperature, and salinity using the Internet of Things (IoT), Dragino sensors were placed at a strategic location within the pecan orchard (
Figure 3), which has greater canopy homogeneity. The sensor’s digital data was transmitted 100 m to the LG308 Dragino gateway, enabling connection to the server and integration with the Dragino Sensor platform, allowing real-time data viewing on the Things Board internet platform. Crop evapotranspiration (
ETc) was estimated through the variation in soil moisture, without the presence of rain. Since the sensor collects information every 5 min, it was decided to average over the period from 24:00 to 14:00. The formula for estimating
ETc is given in Equation (3) [
31].
ETc is the daily crop evapotranspiration (mm); Zri is the depth of the sensor (mm); ΔSWCi is the variation in the sensor humidity (%); and Δt is the validation time (one day).
Estimation of the crop coefficient and its relationship with remote sensors was calculated using Equation (4) [
31], where
Kc is the crop coefficient;
ETc is the crop evapotranspiration calculated with Equation (3) (mm d
−1), and
ETo is the potential reference evapotranspiration measured with climate data from the RER meteorological station and Equation (2).
Data for the
ETc calculation were collected with the RER weather station at the research site (
Figure 3), whereas
ETc (Crop Evapotranspiration) is crop-specific and represents the water needed by a specific crop and is used for scheduling irrigation, calculating crop water requirements, and water management by multiplying
ETo by a crop coefficient (
Kc) (Equation (5)) [
31].