2.1. Study Area and Sampling Design
The study area encompasses road corridors linking the city of Loja (−4.0055° S, −79.2052° W; 2100 m a.s.l.) with surrounding communities in Loja Province, southern Ecuador. The six toponymic designations used locally, Loja–Zamora, Loja–Catamayo, Loja–Malacatos, Loja–Saraguro, Loja–Villonaco, and Santa Ana–Los Encuentros, correspond to five physically distinct continuous routes: Loja–Villonaco and Loja–Catamayo denote sequential segments of the same road (Villonaco lies en route to Catamayo), and Loja–Malacatos and Loja–Yangana (the southward continuation of Loja–Malacatos) likewise form a single corridor. Segment coordinates catalogued in
Table S1 (Supplementary Material S1) allow unambiguous geospatial identification independent of naming convention. The region is characterised by steep Andean topography (elevational range 1800–3200 m a.s.l.), a bimodal precipitation regime (mean annual rainfall 850–1200 mm), and recurrent ENSO-driven extremes that have historically triggered slope failures and drainage infrastructure damage along these corridors [
1,
2]. The regional hydrological baseline, IDF curves calibrated to INAMHI stations [
26], isohyetal maps for Loja Province, and ENSO precipitation anomaly characterisation is fully documented in the companion publication [
1].
Segment selection followed a purposive-stratified spatial-coverage protocol designed to maximise geomorphological representativeness while ensuring road safety and supervisor accessibility within a 16-week academic semester. The corridor network was first stratified by dominant geomorphological regime: (i) high-elevation inter-Andean valleys (Loja–Villonaco/Catamayo, Loja–Malacatos/Yangana); (ii) mid-elevation transitional slopes (Loja–Zamora, Loja–Saraguro); and (iii) low-elevation foothill corridors (Santa Ana–Los Encuentros). Within each stratum, segments were selected to capture the full elevational gradient (1800–3200 m a.s.l.) and the range of slope instability classes observable from UAV-derived terrain reconnaissance. This protocol does not claim statistical randomness; rather, it operationalises maximum-diversity spatial coverage within logistical constraints [
27], yielding a purposive sample [
28] adequate for non-parametric descriptive inference and index validation, but not for population-level frequency estimation. The 42 segments represent approximately 12% of the ~340 km corridor network; their spatial distribution is catalogued in
Table S1 (Supplementary Material S1). Generalisation to the full network therefore requires cautious interpretation: the
DDI prevalence estimates reported here are lower-bound indicators of network-wide deficiency severity, not unbiased population parameters.
2.2. Diagnostic Instrument and Data Collection
A standardised diagnostic instrument was developed and deployed via ArcGIS Survey123, (Esri, Redlands, CA, USA. Available online:
https://survey123.arcgis.com, accessed on 8 May 2026), enabling georeferenced field data collection with integrated photographic documentation at each site. Variables recorded per segment comprised: (1) slope instability class (four–level ordinal scale: High, Medium, Low, None); (2) crown gutter presence and cross-sectional type; (3) road gutter presence, cross-sectional type, longitudinal drainage condition (failure code 0–3), and geometric dimensions (depth H, base width b, side-slope Z); (4) hydraulic chute cross-sectional type, structural condition (Good/Poor), failure code, and dimensions (H, b, Z); and (5) culvert cross-sectional type, structural condition, failure code, and dimensions (H or diameter D, base width b). GPS coordinates were automatically recorded at each survey point. Slope instability was assessed through geomorphological field indicators calibrated to established landslide hazard protocols [
2]. Classification criteria are summarized in
Table 1. Classification was performed by the supervising author at each segment during the field survey, with photographic documentation (
Supplementary Material S2). This is an observational geomorphological rating, not a geotechnical engineering assessment: no shear-strength testing, inclinometer data collection, or pore-pressure monitoring was conducted. The ordinal scale captures visible surficial evidence of instability risk relevant to drainage infrastructure planning, not quantitative safety factors.
Forty-two survey groups, each corresponding to a distinct road segment and comprising 3–4 members (mean 3.8 ± 0.4), completed the instrument under direct supervisor validation to ensure measurement consistency. Group size was not treated as an analytical covariate in the infrastructure analysis but was included as a control variable in Spearman partial correlations (
Section 2.4) to rule out group-composition artefacts.
2.3. STEM Digital Tool Integration
UAV-derived aerial imagery provided terrain context and supported candidate site identification prior to ground survey. ArcGIS Survey123 enforced consistent multi-field data entry with automatic geolocation and timestamping, eliminating transcription error and ensuring full reproducibility. Field geometric measurements were obtained with calibrated measuring tapes, with photogrammetric cross-checks applied where site conditions permitted. The complete diagnostic workflow is illustrated in
Figure 2.
All statistical analyses were performed in R (v. 4.5.2 [
29]), R software, version 4.5.2 (R Core Team, Vienna, Austria. Available online:
https://www.R-project.org, accessed on 8 May 2026), using the following packages (version in parentheses): readxl (1.4.3), dplyr (1.1.4), tidyr (1.3.1), ggplot2 (3.5.1), ggpubr (0.6.0), scales (1.3.0), vcd (1.4-13), corrplot (0.94), psych (2.4.6), rstatix (0.7.2), patchwork (1.2.0), forcats (1.0.0), stringr (1.5.1), knitr (1.48), flextable (0.9.6), officer (0.6.6), RColorBrewer (1.1-3), viridis (0.6.5), janitor (2.2.0), skimr (2.1.5), openxlsx (4.2.7), boot (1.3-30), DescTools (0.99.54), effectsize (0.8.9), factoextra (1.0.7), cluster (2.1.6), ggdendro (0.2.0), broom (1.0.6), car (3.1-2), and lmtest (0.9-40). The complete annotated R script has been deposited on Zenodo (
https://zenodo.org, accessed on 8 May 2026).
2.4. Statistical Analysis
Distributional assumptions were assessed using the Shapiro–Wilk test (Equation (1)).
where
denotes the order statistics,
the sample mean,
the expected normal-score coefficients, and
the sample size. Non-normality was confirmed for all eight continuous geometric variables (
W = 0.458–0.899;
p < 0.001 throughout), justifying the exclusive use of non-parametric methods. Descriptive statistics reported include the mean, standard deviation (SD), median, first and third quartiles (Q1, Q3), and the coefficient of variation (CV%).
Binomial proportions were accompanied by 95% Wilson confidence intervals (Equation (2)).
where
is the sample proportion,
is the count of segments meeting the criterion,
is the total number of surveyed segments, and
is the upper quantile of the standard normal distribution for a 95% confidence level
).
Bivariate associations between categorical variables were tested using Fisher’s exact test with Monte Carlo simulation (
B = 100,000), when any expected cell count fell below 5, or Pearson’s χ
2 otherwise. Effect size was quantified by Cramér’s rule using Equation (3).
where
is the Pearson Chi-squared test statistic;
is the total sample size;
is the number of rows in the contingency table;
is the number of columns; and
is the smaller of the degrees of freedom, ensuring
regardless of table dimensions. Interpretation thresholds follow Cohen (1988) [
30]: negligible < 0.10; weak 0.10–0.20; moderate 0.20–0.40; strong 0.40–0.60; very strong > 0.60. Bootstrap 95% CIs for
V were computed from B = 2000 resamples. Odds ratios for 2 × 2 tables were derived as
OR = (
a ·
d)/(
b ·
c).
Geometric dimension relationships were characterised using Spearman rank-order correlations (Equation (4)), computed pairwise on segments where both structures in each pair were present (pair-available approach), ensuring that correlations reflect observed geometric dimensions rather than absence-coded entries.
where
(rho) is the Spearman rank correlation coefficient
;
is the difference between the ranks of the i–th observation on variables
and
;
is the sum of squared rank differences across all
pairs; and
is the number of road segments. This formula assumes no tied ranks; ties were handled by averaging ranks prior to computation. Benjamini–Hochberg (BH) correction was applied across all 15 unique pairs to control the false discovery rate at α = 0.05.
Between-group dimensional comparisons across cross-section type categories were performed using the Kruskal–Wallis H-test (Equation (5)).
where
is the Kruskal–Wallis test statistic, approximately Chi-squared distributed with
degrees of freedom under the null hypothesis of equal distributions;
is the total number of observations across all groups (
);
is the number of groups (cross-section type categories;
for chute depth comparisons);
is the sum of ranks assigned to all observations in group
; and
is the number of observations in group
.
Effect sizes ε2 = (H − k + 1)/(n − k) and rank–η2 were reported, with thresholds as follows: small ≥ 0.01; medium ≥ 0.06; large ≥ 0.14. Statistically significant Kruskal–Wallis results were followed by BH-corrected Dunn post hoc tests restricted to present structures. represent the proportion of variance in ranks explained by group membership.
A Composite Drainage Deficiency Index (
DDI; range 0–100) was computed for each segment by aggregating five equally weighted binary indicators (Equation (6)).
where
is the deficiency score for segment
;
I(·) is the indicator function, returning 1 if the condition is met and 0 otherwise. Components corresponding to absent structures were assigned 0, providing a conservative lower bound for deficiency severity.
denotes the absence of a crown gutter;
denotes the absence of a road gutter;
denotes a High or Medium slope instability rating;
denotes poor structural condition of the hydraulic chute, assigned 0 if the structure is absent;
denotes poor structural condition of the culvert, assigned 0 if absent. Each component contributes equally (20 points), reflecting the absence of empirical data to justify differential weighting.
DDI classes: Very Low (0–20), Low (21–40), Moderate (41–60), High (61–80), and Critical (81–100).
The
DDI is formulated as an equal-weight composite index following the principle of insufficient reason (Laplace criterion) under structural uncertainty [
31,
32]. When no empirical failure-probability data exist to differentiate the hydraulic risk contribution of individual components, differential weighting would introduce unverifiable subjective priors that reduce inter-study reproducibility. Equal weighting therefore maximises procedural objectivity and transferability across corridor networks where local hydrological forcing differs. The five indicators operationalise the minimum necessary condition for hydraulically functional drainage along an Andean road segment [
18,
19]: crown gutters intercept upslope runoff; road gutters provide longitudinal platform drainage; slope instability captures geotechnical failure risk independent of drainage capacity; chutes govern controlled downslope conveyance; and culverts determine cross-drainage capacity. Each component is necessary but not sufficient for drainage functionality; no single component can compensate for another. This non-substitutability justifies additive aggregation over multiplicative or compensatory formulations [
33].
To test whether equal weighting is robust against admissible alternatives, a Monte Carlo weight-perturbation analysis (n = 10,000 iterations) was performed. Component weights were drawn from a Dirichlet(1,1,1,1,1) distribution, representing an uninformative (uniform) prior over the 4-simplex. This approach stress-tests the index across the full admissible weight space, including extreme but theoretically plausible allocations. A perturbed DDI was computed for each iteration. Rank-order stability between perturbed and equal-weight DDI was assessed via Spearman correlation, and the retention of Critical-band segments in the top decile was tracked across iterations. Results confirmed that rank-order prioritisation remained stable across the admissible weight space (Spearman ρ = 0.930; 95% CI: [0.753, 0.998]), providing empirical support for the equal-weight specification adopted in the main analysis. The stability of key proportions was additionally validated through jackknife leave-one-out resampling.
Missing data were interpreted under Rubin’s taxonomy [
34]: condition ratings for absent structures (chutes and culverts) constitute structural zeros by design—the quantity is undefined when the structure does not exist—and were encoded as such in the DDI. For the minority of segments where a structure was present, but its condition could not be rated during the survey window, the rating was treated as missing (not a structural zero) and excluded from condition-specific analyses; the DDI component defaulted to zero under conservative bounding. No imputation was performed.