1. Introduction
The construction sector is a critical driver of the economy, contributing significantly to the Gross Domestic Product (GDP) of both developed and developing countries [
1]. In some European nations, construction can account for up to 10% of the GDP, often even higher in developing countries [
2]. This significance underscores the necessity of evaluating the performance of these projects in terms of schedule and cost to ensure the efficient utilization of public resources [
3,
4,
5]. Despite the considerable attention and resources dedicated to the design and planning of construction projects, schedule and cost deviations still need to be improved [
6,
7]. The performance in these three criteria determines the overall project outcomes in both the short and medium terms [
8]. Despite the extensive studies and proposed solutions addressing schedule and cost deviations, these issues persist globally [
9,
10,
11,
12,
13]. Research conducted since 1985 across different countries revealed that these deviations are common in construction projects worldwide [
14]. In Colombia, road projects face persistent schedule and cost pressures driven by mountainous topography, intense rainfall, and geotechnical uncertainty that force scope adjustments and temporary work stoppages [
15]. Institutional frictions amplify these risks through permitting and utility relocation delays, fragmented oversight, and uneven regional capacities. Procurement dynamics also matter: high bidder counts can induce optimistic prices that later convert into change orders and overruns, while many tenders exhibit few valid bidders, weakening effective competition [
10]. At the project level, high project intensity, short planned durations, and frequent suspensions are associated with larger deviations [
16]. These conditions shape a recurrent pattern of delays and cost growth across secondary road programs.
Some of the most affected road infrastructure projects by schedule and cost deviations are located in developing countries [
17,
18,
19]. For instance, in 2020, the Comptroller General of Colombia identified 131 alerts in the construction sector due to delays, suspensions, and other project risks [
20]. These deviations in construction projects can have profound implications, including claims for additional funds, contractual disputes, and alterations to the original project objectives [
21,
22,
23]. Ultimately, these issues can lead to a compromise in the quality of the project deliverables. Given the significant impact of schedule and budget deviations on road infrastructure projects worldwide, numerous industry professionals and researchers have focused on identifying these deviations’ root causes [
24,
25]. The findings from these studies have demonstrated that schedule and budget deviations in road projects are highly complex phenomena, influenced by various factors arising at different stages of the project lifecycle and involving multiple stakeholders [
26]. Identified causes of these deviations include inadequate project planning, design deficiencies, lack of experience among designers and contractors, and financial limitations of the owner and contractor [
25,
27,
28,
29,
30].
Deviations in the schedule and cost of road infrastructure projects occur due to various factors [
15,
31]. In this context, one of the most critical factors is related to the characteristics of the contracting method used for project development [
32,
33]. Various contracting methods are employed, with public bidding being the most commonly used in road construction projects [
34,
35]. This method involves competition among several construction industry organizations, each vying for a contract to construct a specific road project [
36,
37]. Selection is based on evaluating a set of characteristics of the proposing entity, leading to a score according to the terms defined in the bidding process, ultimately awarding the contract to the highest-scoring contractor [
38,
39,
40]. Other methods that have gained traction include public–private partnerships (PPPs), where the private sector finances, constructs, and sometimes operates the project for a defined period [
41]. Additionally, there are methods such as restricted bidding, direct contracting, and management contracts. Due to their varying characteristics, some contracting methods may present a higher risk of budget and schedule deviations [
36,
42]. Therefore, it is essential to study how the characteristics of contracting methods can influence schedule and cost deviations in road infrastructure projects.
The widespread impact of schedule and cost deviations observed in projects in developed and developing countries has led various authors to identify the primary factors causing these deviations [
24,
29,
43,
44]. However, there is a significant research gap concerning studies that analyze the relationship between the characteristics of contracting processes in road projects and schedule and budget deviations. This gap is related to the fact that most research methods have relied primarily on consultations with construction industry experts [
9,
10], needing studies grounded in the statistical analysis of executed projects to pinpoint the precise causes of schedule and cost deviations. To address these gaps, this article aims to (1) identify factors in the contracting process that influence schedule and budget deviations in Colombian road projects and (2) analyze the impact of these factors on the magnitude of schedule and budget deviations. Unlike much of the previous literature, which relies on surveys and interviews with practitioners to identify the causes of delays and cost overruns, this study is based on evidence from completed road contracts in Colombia. Through a statistical approach, the study seeks to identify critical factors contributing to these deviations and provide evidence-based recommendations to improve the planning and management of future projects. Understanding these dynamics is essential for optimizing the use of public resources and ensuring the efficiency and effectiveness of road infrastructure project execution. This study contributes to the existing literature by providing a comprehensive quantitative analysis of the relationship between contracting process characteristics and schedule and budget deviations. It offers valuable insights for decision-makers and construction professionals to efficiently mitigate the risks associated with schedule and cost deviations in road projects. Another significant contribution of this study is the proposed method of data analysis, which can be replicated in other contexts to analyze phenomena based on large datasets.
This study focuses on secondary road infrastructure projects developed in Colombia, using completed and settled public contracts recorded in the SECOP platform between 2011 and 2019 as the empirical foundation. The main outcomes examined are schedule and cost deviations, while seventeen contracting and project variables related to the planning, procurement, execution, and closure stages act as explanatory factors. The analysis employs a data-driven approach that includes exploratory profiling, nonparametric bivariate tests such as Spearman and Kruskal–Wallis, as well as multivariate learning methods. These methods include Random Forest for assessing variable relevance and Bayesian networks for probabilistic inference. The research does not cover projects developed after 2020, pandemic-related factors, detailed methods at the project level, or macroeconomic shocks that are not included in the open data. Additionally, it does not examine long-term public–private partnership operations that extend beyond the delivery period covered by the dataset. In light of current engineering challenges (such as the need for budget discipline within fiscal constraints, ensuring certainty of delivery in environments subject to environmental and administrative risks, and maintaining transparency in public procurement), this research offers two key contributions. First, it identifies evidence-based factors, including competition intensity, award behavior, and project intensity, which can inform ex-ante planning and tender design. Second, it provides a replicable methodology based on open data that public owners and policymakers can utilize to reduce the risks of cost overruns and schedule delays in road programs under similar institutional conditions.
3. Research Method
This chapter presents the research method adopted (see
Figure 1). The research comprised three main stages based on previous studies that proposed enabling data-driven decision-making by acquiring, processing, and utilizing data in the construction industry [
94]. Initially, the definition of target projects and data collection was developed, followed by a literature review to identify factors considered in previous research. Then, the exploratory analysis was conducted to identify the nature of the factors. Finally, statistically significant factors contributing to deviations in schedule and cost in secondary road infrastructure projects were identified through bivariate analysis. The choice of analytical methods is based on the nature of the available data and the need to obtain robust and replicable results. Exploratory data analysis was conducted to clean, transform, and characterize the dataset, providing a foundational understanding of variable distributions and potential anomalies. This stage informed the selection of appropriate statistical tests. Bivariate analysis was employed to identify significant relationships between independent variables and schedule and cost deviations, using non-parametric tests (Spearman and Kruskal–Wallis) due to the lack of normality in the data. These tests are widely recognized for their robustness in handling non-normally distributed data and ordinal variables. The Kruskal–Wallis test, often considered the non-parametric equivalent of one-way ANOVA, is particularly suitable for comparing three or more independent groups when assumptions of normality and homogeneity of variance are violated [
95,
96]. Similarly, Spearman’s rank correlation is effective for assessing monotonic relationships between variables without assuming linearity or normal distribution [
97]. Both Kruskal–Wallis and Spearman’s rank correlation tests helped identify statistically significant relationships between independent variables and cost/schedule deviations. Subsequently, a Random Forest model was applied due to its ability to handle both categorical and numerical variables simultaneously, and to rank the importance of variables in predicting deviations. Random Forest is a robust ensemble learning method based on decision trees, used in construction project performance analysis due to its capacity to manage complex datasets and identify key predictors [
98,
99]. It has been successfully applied in infrastructure studies to assess cost overruns, schedule delays, and risk factors, offering high accuracy and interpretability through variable importance measures [
100,
101]. In our study, Random Forest was applied to simultaneously evaluate multiple variables and rank their importance in predicting deviations. Finally, Bayesian networks were selected for their ability to model complex probabilistic dependencies among discretized variables, enabling inference about the likelihood of deviation occurrence under uncertainty [
102,
103,
104]. This method is particularly suitable for analyzing construction projects, as it enables the integration of multiple interdependent factors and supports decision-making based on conditional probabilities [
105]. By discretizing continuous variables using unsupervised clustering (e.g., k-means), the model captures nonlinear relationships and facilitates the interpretation of deviation risks in a structured graphical format [
106]. Bayesian networks have been successfully applied in infrastructure studies to assess social sustainability, risk propagation, and performance prediction, demonstrating their robustness in handling incomplete or uncertain data and generating actionable insights [
107,
108,
109,
110]. In the present study, Bayesian Networks were implemented to model probabilistic dependencies among discretized variables. These methods were selected for their suitability in analyzing open data, their robustness against non-normal distributions, and their usefulness in generating practical recommendations based on empirical evidence. Each stage of analysis contributed to a more refined understanding of the data: exploratory analysis ensured data integrity, bivariate analysis identified significant individual relationships, Random Forest highlighted key predictors, and Bayesian Networks provided a probabilistic framework for interpreting the interactions among variables. This structured approach enhances the robustness and interpretability of the findings.
3.1. Definition of Target Projects and Data Collection
The data collection process began with a comprehensive web search on the SECOP, the Colombian state’s public data platform. This platform, established by the Colombian government in 2011, is a trusted source that shares public policies, plans, programs, and procurement standards. It also operates as a decentralized entity of the national executive branch with legal status, assets, and administrative and financial autonomy [
111]. The platform enables the filtering of construction and road projects, providing the capability to focus on completed and liquidated projects. It offers valuable information across various stages of the project lifecycle.
The main aim of the web search in the present study was to identify projects relevant to the research and complement the existing database. Projects were initially considered for inclusion if they involved road infrastructure works, either new construction or maintenance, were fully completed and reported as administratively closed (settled) in the SECOP platform, and had been awarded through public tender, abbreviated selection, or direct contracting modalities. This step served as a preliminary filter, followed by a subsequent phase in which additional projects were excluded during data cleaning and exploratory analysis to ensure completeness and consistency of the final dataset. The search was conducted on the SECOP platform, where the following search criteria were established:
Group ID: Land, buildings, structures, and roads.
Project Status: Settled
Contracting modality: Public tender, abbreviated selection, or direct contracting.
Category: Roads
Once the initial database was obtained, a systematic process was applied to remove projects that did not satisfy the inclusion criteria. Additionally, an Exploratory Data Analysis (EDA) was conducted to identify incomplete or inconsistent records, which were excluded from the final sample. This process resulted in a final data set suitable for statistical analysis. The downloadable database from the platform includes factors related to various stages of the infrastructure project lifecycle, such as the initiation phase, procurement phase, execution stage, and closure phase. Simultaneously with the data collection, a literature review was conducted in the ‘Scopus’ and ‘Web of Science’ databases. Search queries utilizing specific keywords were formulated to target relevant information regarding schedule and cost overruns in secondary road infrastructure projects. These keywords include ‘construction projects,’ ‘construction industry,’ ‘poor planning,’ ‘poor supervision,’ ‘cost overruns,’ ‘time overruns,’ ‘project delays,’ ‘project management,’ and ‘public projects.’ At this stage, the documents were reviewed to identify factors that could have been previously analyzed in the context of schedule and cost deviations in construction projects.
3.2. Exploratory Data Analysis
Once the database had been compiled, exploratory data analysis was conducted following the approach outlined by Larose and Larose [
112]. This approach involved two key steps:
Data preparation: In this stage, necessary adjustments were made to enhance the data’s suitability for analysis, involving processes such as data cleaning and variable transformation.
Sample characterization: This stage enabled the collection of data profiles by calculating descriptive statistics through univariate analysis. This entailed calculating the maximum, minimum, mean, median, and standard deviation for the numerical factors, as well as the frequency and percentage for the categorical factors, while looking for imbalances in the frequency distribution.
These analyses were conducted using the open-source R software version 4.5.1 [
113].
3.3. Bivariate Analysis
For the bivariate analysis, various statistical tools were considered based on the nature of the factors (numerical or categorical) and whether they are parametric or nonparametric. The bivariate analysis involved comparing schedule and cost deviations with each independent variable by applying statistical tools according to the variable’s nature. For numerical factors, the initial step was to assess their normality. In all cases, the Shapiro–Wilk test was applied. As the factors did not follow a normal distribution, non-parametric tests were chosen. Considering that the dependent factors were numerical, the Spearman correlation test was applied. Spearman’s rho is a non-parametric test that quantifies the correlation between two numerical dependent and independent factors. Spearman’s rho ranges from −1.00 to +1.00, where +1 indicates a perfect positive linear correlation, and −1 indicates a perfect negative linear relationship [
114].
Normality was checked for the independent variable in each group for the categorical factors. Since the data were not normally distributed, the Kruskal–Wallis test, a non-parametric method, was chosen. This test analyzes differences in the median values of groups [
115], with each categorical variable compared to the dependent cost and schedule deviation factors. The Kruskal–Wallis test determines whether the groups associated with the categorical factors exhibit different behaviors regarding the dependent factors being analyzed. However, it does not determine which specific groups are significantly different from each other. To address this, the Wilcoxon Mann–Whitney test is used to compare paired data and establish specific group differences [
116]. In hypothesis testing, the significance of the results is determined by the
p-value. A
p-value below 0.05 is typically accepted for scientific inference, indicating strong evidence to reject the null hypothesis [
117]. The analysis was conducted using the open-source R software [
113].
3.4. Multivariate Analysis
After the bivariate analysis, multivariate analysis was also considered. The multivariate analysis allows for establishing the effect of considering variables that interact simultaneously [
118]. It involves comparing cost and schedule deviations with all independent variables at once. For this approach, Random Forest was chosen, which is a machine learning method based on decision trees. One advantage of this method is that it can handle both numerical and categorical variables simultaneously, for dependent and independent variables. Additionally, it ranks variables in order of importance [
119]. Random Forest constructs decision trees using random subsets of samples and variables. It includes two control parameters: the number of trees and the number of variables per tree. After modeling, Random Forest ranks variables according to their importance [
120].
After implementing Random Forest, an additional technique was employed to analyze the data from a probabilistic perspective. Bayesian networks were employed to identify relationships between variables, examine their dependencies, and estimate the probabilities of events. The inference mechanism of Bayesian networks allows for calculating the likelihood of an effect on any variable based on the presence of a cause [
105]. Since this technique involves categorical variables and the dataset contained numerical variables, a discretization process was applied using k-means clustering to group similar data points [
112].
5. Discussion
This chapter presents the research results, evaluating their alignment with or discrepancy from previous studies and interpreting their implications for the construction management sector. While schedule and cost overruns have received considerable attention from researchers over time, resulting in a substantial body of literature, the methods traditionally used to identify such delays and cost overruns have been extensively replicated [
131] and predominantly rely on stakeholder opinions, thereby introducing the possibility of bias [
132]. This research addressed this gap by employing an unbiased method, analyzing empirical data using statistical learning techniques. This approach has the potential to contribute to the generation of new knowledge in the field of construction project management. This study replicated a data-driven method for acquiring, processing, and using data to identify significant factors causing schedule and cost deviations based on open data, replicating the method established by Gómez-Cabrera [
94], for rural road construction projects, whereas this study analyzes secondary roads. Additionally, this study emphasizes the importance of open public procurement data in uncovering trends and relationships between factors, enabling research based on empirical data. Such insights are invaluable for decision-makers in public procurement, particularly in projects of this nature.
In Colombia, this study indicates that the magnitude and frequency of delays and cost overruns in secondary road construction projects are frequent, aligning with global findings. Regarding cost escalation, a mean of 9% was reported in this study, although other studies have reported higher values. This difference is influenced by the law prohibiting deviations in public projects from exceeding 50% in Colombia. A previous study in Colombia for rural roads reported a mean cost deviation of 8% for rural roads [
16]. However, higher values were found in other studies abroad. For instance, a study conducted in 2014 analyzed construction projects across five continents and found that road projects reported a mean cost deviation of 20%. An early study in the United States, conducted between 1996 and 2002, analyzed various projects developed by the Indiana Department of Transportation (INDOT) and reported a mean cost overrun of 3.2% for different types of infrastructure projects. However, they also reported cost underruns [
50]. Researchers in Portugal found a mean of 17.8% of cost deviation, considering overruns and underruns [
133]. Regarding schedule deviation, a mean of 97% was reported in this study, representing higher variability than a previous study for rural roads, which found a mean of 19% [
16]. In 2012, researchers reported that approximately 75% of stakeholders indicated that the average delay in road construction projects is between 10% and 30% of the original project duration, with none indicating any schedule delay greater than 100% of the original contract duration [
134]. However, in the case of Colombian secondary roads, schedule deviations as high as 1273% were observed.
Regarding the frequency, in this research, 36.05% of the projects reported cost deviation (2.04% solely cost deviation and 34.01% both deviations). Although in this research, the percentage of projects reporting only cost deviation is very low, a previous study shows that for rural roads in Colombia, this percentage is higher, reporting 26.92% of projects with cost deviation, 23.18% of projects reporting schedule deviation, and 15.33% reporting both simultaneously [
16]. Early studies conducted abroad have reported that 9 out of 10 transport infrastructure projects worldwide experience cost deviations [
123]. Additionally, transportation projects in the United States are often underestimated, with approximately 50% of them exceeding their initial budgets [
135], while 57% of highway projects in Indiana have reported cost overruns [
126]. Recent evidence from the United States shows that the impact of change orders on both schedule and cost increases as projects progress, with significantly greater effects observed during the final construction phases. This supports the importance of early risk identification and proactive change management throughout the project lifecycle [
136]. Moreover, research conducted in Palestine indicates that 100% of transportation projects experience cost deviations: 76% of the projects are underestimated, while 24% are overestimated [
49]. Higher values regarding the frequency of schedule deviations were found in this research. 81.63% of the projects reported schedule deviation (47.62% solely schedule deviation and 34.01% cost and schedule deviation). Similarly, results have been reported in different regions. In the United States, specifically in Indiana, 91% of highway projects experienced schedule deviation [
126]. In Palestine, for municipal roads, it was reported that approximately 8% of projects suffered unjustified delays, and 30% suffered justified delays; in this case, with cost overruns and underruns [
137]. Another study in Thailand, which used stakeholders’ opinions as a source of information, reported that approximately 21% of projects were deemed unsuccessful due to time performance issues [
138]. More recent studies in Asia have shown that integrating predictive models with project control tools can significantly enhance decision-making, resulting in minimal cost variance (0.12%), favorable schedule performance (SPI = 1.04), and only slight cost overruns (CPI = 0.91). These findings also indicate that effective resource allocation and well-implemented cost-saving measures contribute to stronger financial performance and better project outcomes, underscoring the importance of data-driven approaches to improve planning accuracy, procurement strategies, and contract administration [
139].
This study revealed a correlation between cost and schedule deviations, consistent with prior research findings. Early studies, such as Kumaraswamy and Chan [
140], have also demonstrated this relationship. Researchers utilizing the three-stage least squares model highlighted that cost and schedule overruns are interconnected [
126]. Moreover, findings from the Indiana Department of Transportation indicated that higher-cost projects typically experienced longer delays on average [
50]. The significant relationships for both deviations include variables such as estimated cost, project intensity, the number of bidders and the number of valid bidders, process type, and contractor type. The results are consistent with the findings for transport infrastructure projects. In Asia [
141], it was suggested that greater contract values lead to higher cost deviations, and larger projects tend to have a higher percentage of cost overruns. Some studies have highlighted that technical documentation deficiencies, inaccurate cost estimates, and poor planning and scheduling are the most critical issues affecting project timelines and performance. Similarly, research in Portugal found that larger projects tend to have a higher percentage of cost overruns when analyzing the performance of construction projects [
133]. However, it is worth noting that tendencies may differ for larger-scale projects. Some studies have reported that smaller-scale road projects tend to be more prone to larger cost overruns [
124], and previous studies have reported that larger road projects do not have a higher risk of cost escalation [
142]. Regarding project intensity, the findings indicate that higher values in this variable might correlate with increased cost and schedule deviations. This could be attributed to factors such as optimism bias, a significant contributor to unrealistic estimates [
143], poor planning and scheduling, as reported previously in road projects [
144], or inadequate preparation of the project concerning planning and execution and inadequate planning for project costs as major causes of cost escalation [
142,
145]. These results have important implications from an engineering perspective, emphasizing the need to strengthen technical risk assessment and design-stage studies, particularly for projects with higher investment levels or accelerated execution schedules. High project intensity and large contract values are likely to increase the complexity of construction activities, the probability of encountering unforeseen site conditions, and the potential for scope changes, all of which contribute to greater schedule and cost variability.
The results included significant variables related to the contractor selection process, such as the number of bidders, the number of valid bidders, and the contractor type, with similar results found in previous research reporting errors in bidding and awarding, causing time delays and cost overruns [
134,
141]. The results regarding the number of bidders are particularly interesting. Having a higher number of bidders is a desirable outcome. Therefore, this result warrants special attention. Further analysis revealed a relationship between this variable and award growth. It was found that the lower the contract’s award amount, the higher the number of bidders and, subsequently, the higher the deviations. Previous researchers have reported similar results. Lower values in award growth represent higher schedule deviations [
50]. Additionally, the higher the level of competition, the more competitive the bidding, which leads to a higher number of change orders and, consequently, greater cost overruns and schedule delays [
126]. In Italy, researchers reported that the process type used to select contractors affects cost overruns, noting that cost overruns are lower under the average bid format but only when the entry of bidders is restricted [
128]. Recent studies have reported errors in bidding awards, highlighting that the current pre-qualification system implemented by various client authorities in India for selecting contractors for road construction projects falls short of choosing the most competent contractor for the job. This indicates the necessity of revising the existing prequalification process [
146].
Consistent with these findings, the results of this study highlight the need to strengthen contract administration and change management practices. The significant relationship between procurement methods, the number of bidders, and performance deviations suggests that procurement strategies should be tailored to project complexity and scale. Public agencies may benefit from revising prequalification systems to ensure the selection of technically capable contractors and from implementing more robust monitoring and control mechanisms throughout the project lifecycle. In practical terms, these results highlight the importance of structured risk workshops, phased design reviews, and the inclusion of contingency buffers in both the schedule and cost baselines. To further mitigate these risks, engineers and project teams should adopt methodologies that enhance cross-departmental communication and coordination, such as integrated project delivery meetings and digital collaboration platforms, along with more detailed construction sequencing to develop realistic timelines and contingency plans.
Variables significantly related to schedule deviation include the original duration and time suspended. A new insight identified in this research is that time suspended is significant for schedule deviation. Previous studies have also found time suspension to be a primary cause of overruns, due to suspensions caused by inclement weather [
147], which reported this variable as significant. Previous findings for highway projects have reported that the likelihood of delay increases for projects with shorter planned deadlines, consistent with this research [
126,
147]. Other authors have emphasized that one possible explanation for this phenomenon is the level of management invested in these projects compared to smaller ones. Based on the analysis results, recommendations can be provided to enhance project management and stakeholder engagement, thereby minimizing schedule and cost overruns. Public entities should adopt tailored procurement strategies that are specific to the project’s characteristics. For larger-scale projects, more rigorous planning is recommended. During the planning stage, it is essential to assess expectations regarding project intensity, defined as the amount of investment per unit of time, as high values in this variable are associated with greater deviations from the expected outcome. This approach enables the development of more realistic project schedules. Furthermore, strengthening bidder qualification criteria and considering the use of direct contracting in specific contexts where it has demonstrated better performance can help reduce risks. In more competitive processes, such as public bidding, it is advisable to establish a cap on the contract value that bidders propose. The analysis shows that when the awarded contract value significantly differs from the estimated cost, deviations tend to increase. Likewise, a higher number of bidders is associated with more aggressive bidding behavior, which correlates with greater deviations. These recommendations aim to guide policymakers and project managers in mitigating time and cost overruns, as well as improving the outcomes of infrastructure projects.
6. Conclusions
This study aimed to identify the variables of the contracting process available in public data platforms in Colombia to analyze their impact on schedule and cost deviations in secondary road projects. It includes a sample of 149 projects executed from 2011 to 2019, obtained from the Colombian Public Electronic Procurement System (SECOP) before the pandemic. This study makes four theoretical contributions to the field of road infrastructure projects. First, the analysis identifies and examines how the characteristics of the contracting process, such as process type, number of bidders, estimated cost, and project intensity, directly influence schedule and cost deviations. This finding confirms the relevance of these factors as determinants of such deviations, contributing to understanding how contract management can impact project performance. Second, the study reveals a correlation between schedule and cost deviations, a relationship that strengthens previous theories about the interdependence between delays and cost overruns. Thus, this result validates prior studies and provides new evidence within the context of secondary roads. Third, the study challenges the conventional notion that greater competition in bidding processes leads to improved efficiency. Surprisingly, the findings suggest that more bidders are associated with increased cost and schedule deviations, offering a new perspective on the theoretical debate concerning the effects of competition in public procurement. Finally, the results show that projects awarded through direct contracting have fewer deviations compared to those awarded through more competitive processes. This suggests that the straightforward nature of direct awards can reduce the risk of deviations. The analysis also indicates that overly optimistic planning, especially regarding the capital invested per unit of time (project intensity), tends to lead to deviations, highlighting the importance of realistic estimation during the project planning phase.
The practical contribution of this study provides developers of road projects with relevant information for planning future initiatives, thereby minimizing cost and schedule deviations. First, it is necessary to review the project delivery methods and explore ways to increase the number of bidders without reaching extreme values for award growth. This underscores the need to review the rules and requirements of competitive processes to prevent price from becoming a relevant factor in the decision-making process and to avoid errors in awarding projects at prices below the minimum acceptable level. Second, larger projects are associated with increased deviations in both schedule and cost. Larger-scale projects can be divided into components or planned for execution with greater detail. Additionally, project intensity can be analyzed historically by observing the behavior of previous projects. Thus, the historical behavior of this variable can be considered during the planning stage, as it should not vary significantly across similar projects. These are vital aspects that developers can consider reducing the risk of schedule and budget deviations. This study demonstrates the value that public data can provide to project developers and society, as it serves as a tool to understand public spending performance. While having public data is valuable, it does not, by itself, provide the information that society requires. This data must be transformed to ensure accessibility and enhance the essential transparency in public procurement processes. The data analysis using the tools proposed in this article enables the identification of relationships between factors that serve as inputs for decision-making. Public owners and road agencies can operationalize these findings through a staged control plan. Before tendering, they can screen the award growth, expected bidder counts, project intensity, and planned duration against historical thresholds and adjust the solicitation to avoid unrealistic prices and timelines. They can also strengthen bidder qualification on demonstrated delivery capacity and set caps on admissible discounts and minimum feasible durations. During execution, they can track suspended time, additional time, and change orders as leading indicators, with predefined corrective actions and adopt Earned Value Management with weekly reviews and recovery plans when performance indices fall below targets. They should start with pilots in secondary road programs, evaluate results, and scale through formal guidance and training.
This study has some limitations that highlight opportunities for future research. First, the findings are based on a relatively limited database, indicating a need to expand the dataset in future studies to validate and strengthen these results. Additionally, incorporating qualitative methods, such as key stakeholder interviews, could provide deeper insights into the factors influencing project outcomes and success. Expanding the research to include different types of infrastructure projects across various regions and countries would facilitate comparative analysis, thereby allowing for the development of best practices that can be adapted to diverse contexts. Second, the study relies exclusively on public contract data from the SECOP platform, which may not capture other critical factors affecting project performance, such as political, social, or environmental variables. Future research should expand to include these factors for a more comprehensive understanding of project deviations. Similarly, performance in this study was primarily evaluated based on avoiding schedule delays and cost overruns. Including additional performance indicators, such as work quality, sustainability, or socioeconomic impact, could provide a more holistic view of project success. Third, another promising area for future investigation involves the post-pandemic landscape, specifically analyzing how this unprecedented global event has affected secondary road infrastructure projects. Finally, while this research focuses on endogenous factors such as project size and procurement methods, future studies should also examine the influence of exogenous elements, including governance structures, economic conditions, and regional political climates.
Future work should move from association to causation and from diagnosis to prescription. Quasi-experimental designs that exploit procurement rule thresholds, scoring rule revisions, reserve price caps, or staged policy rollouts can identify the causal impact of contracting mechanisms on schedule and cost deviations. Structural models of bidding and award behavior can link competition intensity, award growth, and prequalification stringency to observed overruns, and then counterfactual policy simulations can be run to evaluate caps, incentive clauses, and penalties. High-frequency early-warning systems can be developed by fusing SECOP streams with time-stamped change orders, suspension logs, and permit milestones to forecast deviation risk and recommend targeted interventions. A public benchmarking suite aligned with the Open Contracting Data Standard can standardize variables, codebooks, and evaluation metrics, enabling reproducible cross-country comparisons. Methodologically, heterogeneity should be mapped through quantile and finite-mixture models that reveal tail risks across project scales and regions. In contrast, agent-based and system dynamics models can test how cash-flow rules, advance payments, and retention profiles propagate through contractor behavior and affect delivery performance. These lines of inquiry would turn the present evidence base into actionable design rules for procurement and planning in road programs.