Cost Overrun Risk Assessment and Prediction in Construction Projects: A Bayesian Network Classifier Approach
Abstract
1. Introduction
2. Literature Review
2.1. Previous Studies
2.2. Application of ML to Construction Project Risk Analysis
2.3. Problem Definition
3. Materials and Methods
3.1. ML Algorithms
3.2. ML Process
4. Case Study
5. Research Methodology
5.1. Data Collection
5.2. Data Reliability
5.3. Data Preparation and Preprocessing
5.4. Algorithm Selection: An Experimental Analysis
5.5. Decision Tree
5.6. Naïve Bayes
5.7. Bayesian Network Classifier
5.7.1. Learning Bayesian Networks
5.7.2. Network Type
5.7.3. Maximum Number of Parents
5.8. Training and Evaluation Method and Performance Metrics
- True positives (TPs): Number of predictions that were correctly assigned to a class (i.e., value in the matrix diagonal for the corresponding class).
- False positives (FPs): Number of predictions that were incorrectly assigned to a class (i.e., the sum of values in the corresponding class column excluding the TPs).
- False negatives (FNs): Number of instances of a class that were incorrectly assigned to another class (i.e., the sum of values in the corresponding class row excluding the TPs).
- True negatives (TNs): Number of predictions correctly recognized as not belonging to a class (i.e., the sum of values of all rows and columns excluding the row and column of that class).
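The four counts defined above can be read directly off a confusion matrix. The following is a minimal sketch, assuming rows hold the actual classes and columns hold the predicted classes (the orientation implied by the FP and FN definitions). The function names and the example matrix are hypothetical and are not taken from the study.

```python
from typing import Dict, List, Tuple


def per_class_counts(cm: List[List[int]], k: int) -> Dict[str, int]:
    """Return TP, FP, FN, and TN for class index k.

    Assumption: cm[i][j] counts instances of actual class i predicted as class j.
    """
    n = len(cm)
    tp = cm[k][k]                                   # diagonal entry of class k
    fp = sum(cm[i][k] for i in range(n)) - tp       # column k without the diagonal
    fn = sum(cm[k]) - tp                            # row k without the diagonal
    total = sum(sum(row) for row in cm)
    tn = total - tp - fp - fn                       # everything outside row k and column k
    return {"TP": tp, "FP": fp, "FN": fn, "TN": tn}


def precision_recall_f1(c: Dict[str, int]) -> Tuple[float, float, float]:
    """Derive precision, recall, and F1 from the four counts (0.0 when undefined)."""
    precision = c["TP"] / (c["TP"] + c["FP"]) if (c["TP"] + c["FP"]) else 0.0
    recall = c["TP"] / (c["TP"] + c["FN"]) if (c["TP"] + c["FN"]) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Invented two-class matrix, for illustration only (not study data).
example_cm = [[8, 2],
              [1, 9]]
counts_class0 = per_class_counts(example_cm, 0)
metrics_class0 = precision_recall_f1(counts_class0)
```

For a multi-class problem, the same function is called once per class index, and the per-class metrics can then be averaged (for example, weighted by class frequency).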
6. Results
6.1. Models Implementation
6.2. First Step: Cost Overrun Prediction
6.3. Second Step: Cost Overrun Risk Analysis
7. Discussion
- Utilizing the BN classifier model to predict cost overruns and assess cost overrun risks for the first time in construction management.
- Evaluating how accounting for possible relationships between cost overrun risks affects the accuracy of ML models in predicting cost overruns in construction projects.
- Determining the correlations between cost overrun risks and identifying the most critical cost overrun risks in terms of the number of relationships with other risks by interpreting the learned BN classifier model.
- Developing a proactive decision-making tool to assist stakeholders with risk management.
8. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Afzal, F.; Yunfei, S.; Nazir, M.; Bhatti, S.M. A review of artificial intelligence based risk assessment methods for capturing complexity-risk interdependencies: Cost overrun in construction projects. Int. J. Manag. Proj. Bus. 2019, 14, 300–328.
- Shane, J.S.; Molenaar, K.R.; Anderson, S.; Schexnayder, C. Construction project cost escalation factors. J. Manag. Eng. 2009, 25, 221–229.
- Hammad, A.; AbouRizk, S.; Mohamed, Y. Application of KDD techniques to extract useful knowledge from labor resources data in industrial construction projects. J. Manag. Eng. 2014, 30, 5014011.
- Liu, J.; Zhao, X.; Yan, P. Risk paths in international construction projects: Case study from Chinese contractors. J. Constr. Eng. Manag. 2016, 142, 5016002.
- Love, P.E.; Ahiaga-Dagbui, D.D.; Irani, Z. Cost overruns in transportation infrastructure projects: Sowing the seeds for a probabilistic theory of causation. Transp. Res. Part A Policy Pract. 2016, 92, 184–194.
- Darko, A.; Chan, A.P.; Adabre, M.A.; Edwards, D.J.; Hosseini, M.R.; Ameyaw, E.E. Artificial intelligence in the AEC industry: Scientometric analysis and visualization of research activities. Autom. Constr. 2020, 112, 103081.
- García de Soto, B.; Agustí-Juan, I.; Joss, S.; Hunhevicz, J. Implications of Construction 4.0 to the workforce and organizational structures. Int. J. Constr. Manag. 2022, 22, 205–217.
- Sanni-Anibire, M.O.; Zin, R.M.; Olatunji, S.O. Machine learning model for delay risk assessment in tall building projects. Int. J. Constr. Manag. 2020, 22, 2134–2143.
- Jin, R.; Zuo, J.; Hong, J. Scientometric review of articles published in ASCE’s journal of construction engineering and management from 2000 to 2018. J. Constr. Eng. Manag. 2019, 145, 06019001.
- Patrício, D.I.; Rieder, R. Computer vision and artificial intelligence in precision agriculture for grain crops: A systematic review. Comput. Electron. Agric. 2018, 153, 69–81.
- Salehi, H.; Burgueño, R. Emerging artificial intelligence methods in structural engineering. Eng. Struct. 2018, 171, 170–189.
- Islam, M.S.; Nepal, M.P.; Skitmore, M.; Attarzadeh, M. Current research trends and application areas of fuzzy and hybrid methods to the risk assessment of construction projects. Adv. Eng. Inform. 2017, 33, 112–131.
- Hegde, J.; Rokseth, B. Applications of machine learning methods for engineering risk assessment—A review. Saf. Sci. 2020, 122, 104492.
- Project Management Institute. A Guide to the Project Management Body of Knowledge (PMBOK® Guide); Project Management Institute: Newtown Square, PA, USA, 2021.
- Soibelman, L.; Kim, H. Generating construction knowledge with knowledge discovery in databases. Comput. Civ. Build. Eng. 2000, 2, 906–913.
- An, S.-H.; Park, U.-Y.; Kang, K.-I.; Cho, M.-Y.; Cho, H.-H. Application of support vector machines in assessing conceptual cost estimates. J. Comput. Civ. Eng. 2007, 21, 259–264.
- Lee, S.; Kim, C.; Park, Y.; Son, H.; Kim, C. Data Mining-Based Predictive Model to Determine Project Financial Success using Project Definition Parameters. In Proceedings of the 28th International Symposium on Automation and Robotics in Construction, ISARC, Seoul, Korea, 29 June–2 July 2011.
- Chaovalitwongse, W.A.; Wang, W.; Williams, T.; Chaovalitwongse, P. Data mining framework to optimize the bid selection policy for competitively bid highway construction projects. J. Constr. Eng. Manag. 2012, 138, 277–286.
- Asadi, A.; Alsubaey, M.; Makatsoris, C. A machine learning approach for predicting delays in construction logistics. Int. J. Adv. Logist. 2015, 4, 115–130.
- El-Kholy, A. Exploring the best ANN model based on four paradigms to predict delay and cost overrun percentages of highway projects. Int. J. Constr. Manag. 2021, 21, 694–712.
- Ghazal, M.M.; Hammad, A. Application of knowledge discovery in database (KDD) techniques in cost overrun of construction projects. Int. J. Constr. Manag. 2020, 22, 1632–1646.
- Gondia, A.; Siam, A.; El-Dakhakhni, W.; Nassar, A.H. Machine learning algorithms for construction projects delay risk prediction. J. Constr. Eng. Manag. 2020, 146, 4019085.
- Yaseen, Z.M.; Ali, Z.H.; Salih, S.Q.; Al-Ansari, N. Prediction of risk delay in construction projects using a hybrid artificial intelligence model. Sustainability 2020, 12, 1514.
- Egwim, C.N.; Alaka, H.; Toriola-Coker, L.O.; Balogun, H.; Sunmola, F. Applied artificial intelligence for predicting construction projects delay. Mach. Learn. Appl. 2021, 6, 100166.
- Shoar, S.; Chileshe, N.; Edwards, J.D. Machine learning-aided engineering services’ cost overruns prediction in high-rise residential building projects: Application of random forest regression. J. Build. Eng. 2022, 50, 104102.
- Dang-Trinh, N.; Duc-Thang, P.; Cuong, T.N.-N.; Duc-Hoc, T. Machine learning models for estimating preliminary factory construction cost: Case study in Southern Vietnam. Int. J. Constr. Manag. 2022, 1–9.
- Bu-Qammaz, A.S.; Dikmen, I.; Birgonul, M.T. Risk assessment of international construction projects using the analytic network process. Can. J. Civ. Eng. 2009, 36, 1170–1181.
- Taroun, A. Towards a better modelling and assessment of construction risk: Insights from a literature review. Int. J. Proj. Manag. 2014, 32, 101–115.
- Huang, C.-N.; Liou, J.J.; Chuang, Y.-C. A method for exploring the interdependencies and importance of critical infrastructures. Knowl.-Based Syst. 2014, 55, 66–74.
- Valipour, A.; Yahaya, N.; Noor, N.M.; Kildienė, S.; Sarvari, H.; Mardani, A. A fuzzy analytic network process method for risk prioritization in freeway PPP projects: An Iranian case study. J. Civ. Eng. Manag. 2015, 21, 933–947.
- Pehlivan, S.; Öztemir, A.E. Integrated risk of progress-based costs and schedule delays in construction projects. Eng. Manag. J. 2018, 30, 108–116.
- Gupta, V.K.; Thakkar, J.J. A quantitative risk assessment methodology for construction project. Sādhanā 2018, 43, 116.
- Chandra, H.P. Structural equation model for investigating risk factors affecting project success in Surabaya. Procedia Eng. 2015, 125, 53–59.
- Adeleke, A.Q.; Bahaudin, A.Y.; Kamaruddeen, A.M.; Bamgbade, J.A.; Salimon, M.G.; Khan, M.W.A.; Sorooshian, S. The influence of organizational external factors on construction risk management among Nigerian construction companies. Saf. Health Work. 2018, 9, 115–124.
- Hung, L. A risk assessment framework for construction project using artificial neural network. J. Sci. Technol. Civ. Eng. 2018, 12, 51–62.
- Carr, V.; Tah, J. A fuzzy approach to construction project risk assessment and analysis: Construction project risk management system. Adv. Eng. Softw. 2001, 32, 847–857.
- Taylan, O.; Bafail, A.O.; Abdulaal, R.M.; Kabli, M.R. Construction projects selection and risk assessment by fuzzy AHP and fuzzy TOPSIS methodologies. Appl. Soft Comput. 2014, 17, 105–116.
- Prascevic, N.; Prascevic, Z. Application of fuzzy AHP for ranking and selection of alternatives in construction project management. J. Civ. Eng. Manag. 2017, 23, 1123–1135.
- Shariat, R.; Roozbahani, A.; Ebrahimian, A. Risk analysis of urban stormwater infrastructure systems using fuzzy spatial multi-criteria decision making. Sci. Total Environ. 2019, 647, 1468–1477.
- Ebrahimnejad, S.; Mousavi, S.; Tavakkoli-Moghaddam, R.; Hashemi, H.; Vahdani, B. A novel two-phase group decision making approach for construction project selection in a fuzzy environment. Appl. Math. Model. 2012, 36, 4197–4217.
- Islam, M.S.; Nepal, M.; Skitmore, M. Modified fuzzy group decision-making approach to cost overrun risk assessment of power plant projects. J. Constr. Eng. Manag.-ASCE 2019, 145, 40181261-15.
- Velasquez, M.; Hester, P.T. An analysis of multi-criteria decision making methods. Int. J. Oper. Res. 2013, 10, 56–66.
- Aburrous, M.; Hossain, M.A.; Dahal, K.; Thabtah, F. Predicting Phishing Websites Using Classification Mining Techniques with Experimental Case Studies. In Proceedings of the 2010 Seventh International Conference on Information Technology: New Generations, Las Vegas, NV, USA, 12–14 April 2010; IEEE: Manhattan, NY, USA, 2010.
- Flath, C.; Nicolay, D.; Conte, T.; van Dinther, C.; Filipova-Neumann, L. Cluster analysis of smart metering data. Bus. Inf. Syst. Eng. 2012, 4, 31–39.
- Eybpoosh, M.; Dikmen, I.; Birgonul, M.T. Identification of risk paths in international construction projects using structural equation modeling. J. Constr. Eng. Manag. 2011, 137, 1164–1175.
- El-Sayegh, S.M. Risk assessment and allocation in the UAE construction industry. Int. J. Proj. Manag. 2008, 26, 431–438.
- Guan, L.; Liu, Q.; Abbasi, A.; Ryan, M.J. Developing a comprehensive risk assessment model based on fuzzy Bayesian belief network (FBBN). J. Civ. Eng. Manag. 2020, 26, 614–634.
- Yan, H.; Yang, N.; Peng, Y.; Ren, Y. Data mining in the construction industry: Present status, opportunities, and future trends. Autom. Constr. 2020, 119, 103331.
- Witten, I.H.; Frank, E.; Hall, M.A. Data Mining: Practical Machine Learning Tools and Techniques; Morgan Kaufmann: San Francisco, CA, USA, 2005.
- Hu, Y.; Wang, Y.; Zhao, T.; Phoon, K.-K. Bayesian supervised learning of site-specific geotechnical spatial variability from sparse measurements. ASCE-ASME J. Risk Uncertain. Eng. Syst. Part A Civ. Eng. 2020, 6, 4020019.
- Ayodele, T.O. Types of machine learning algorithms. New Adv. Mach. Learn. 2010, 3, 19–48.
- Fan, C.-L. Defect risk assessment using a hybrid machine learning method. J. Constr. Eng. Manag. 2020, 146, 04020102.
- Brownlee, J. Why Data Preparation is so Important in Machine Learning. 2020. Available online: https://machinelearningmastery.com/data-preparation-is-important/ (accessed on 31 July 2022).
- Brownlee, J. Framework for Data Preparation Techniques in Machine Learning. 2020. Available online: https://machinelearningmastery.com/framework-for-data-preparation-for-machine-learning/ (accessed on 18 July 2021).
- Langley, P. Machine learning as an experimental science. Mach. Learn. 1988, 3, 5–8.
- Mehrjoo, M. What to Consider before Selecting a Machine Learning Algorithm. 2017. Available online: https://www.linkedin.com/pulse/what-consider-before-selecting-machine-learning-marzieh-mehrjoo-phd (accessed on 18 July 2021).
- Ebrahimnejad, S.; Mousavi, S.; Mojtahedi, S. A Model for Risk Evaluation in Construction Projects Based on Fuzzy MADM. In Proceedings of the 2008 4th IEEE International Conference on Management of Innovation and Technology, Bangkok, Thailand, 21–24 September 2008; IEEE: Manhattan, NY, USA, 2008.
- Liu, J.; Xie, Q.; Xia, B.; Bridge, A.J. Impact of design risk on the performance of design-build projects. J. Constr. Eng. Manag.-ASCE 2017, 143, 40170101-10.
- Ke, Y.; Wang, S.; Chan, A.P.; Lam, P.T. Preferred risk allocation in China’s public–private partnership (PPP) projects. Int. J. Proj. Manag. 2010, 28, 482–492.
- Rebeiz, K.S. Public–private partnership risk factors in emerging countries: BOOT illustrative case study. J. Manag. Eng. 2012, 28, 421–428.
- Li, Y.; Wang, X. Risk assessment for public–private partnership projects: Using a fuzzy analytic hierarchical process method and expert opinion in China. J. Risk Res. 2018, 21, 952–973.
- Gliem, J.A.; Gliem, R.R. Calculating, Interpreting, and Reporting Cronbach’s Alpha Reliability Coefficient for Likert-Type Scales; Midwest Research-to-Practice Conference in Adult, Continuing, and Community: DeKalb, IL, USA, 2003.
- Hall, M.A. Correlation-Based Feature Selection for Machine Learning. Ph.D. Dissertation, The University of Waikato, Hamilton, New Zealand, 1999.
- Bielza, C.; Larranaga, P. Discrete Bayesian network classifiers: A survey. ACM Comput. Surv. (CSUR) 2014, 47, 1–43.
- Provost, F.; Fawcett, T. Data Science for Business: What you Need to Know about Data Mining and Data-Analytic Thinking; O’Reilly Media, Inc.: Sebastopol, CA, USA, 2013.
- Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning: Data Mining, Inference and Prediction; Springer: New York, NY, USA, 2009.
- Piryonesi, S.M.; El-Diraby, T.E. Data analytics in asset management: Cost-effective prediction of the pavement condition index. J. Infrastruct. Syst. 2020, 26, 4019036.
- Wu, X.; Kumar, V.; Ross Quinlan, J.; Ghosh, J.; Yang, Q.; Motoda, H.; McLachlan, G.J.; Ng, A.; Liu, B.; Yu, P.S. Top 10 algorithms in data mining. Knowl. Inf. Syst. 2008, 14, 1–37.
- Friedman, N.; Geiger, D.; Goldszmidt, M. Bayesian network classifiers. Mach. Learn. 1997, 29, 131–163.
- Bouckaert, R.R.; Eibe, F.; Hall, M.; Kirkby, R.; Reutemann, P.; Seewald, A.; Scuse, S. WEKA Manual for Version 3-9-1; University of Waikato: Hamilton, New Zealand, 2016.
- Bouckaert, R.R. Bayesian Network Classifiers in WEKA for Version 3-5-7; Artificial Intelligence Tools; University of Waikato: Hamilton, New Zealand, 2008; Volume 11, pp. 369–387.
Study | Scope | ML Model(s) | Risk Factors as Predictors | Number of Identified Features | Data Source | Output Variable | Feature Selection Method | Number of Features after Feature Selection | Training and Evaluation Method | Informative Learned Model |
---|---|---|---|---|---|---|---|---|---|---|
[15] | Causes of delays identification | DT and NN | - | 98 | RMS (Resident Management System) | Delays (day) | Wrapper | 9 | Hold-Out | - |
[16] | Conceptual cost estimates quality assessment | SVM | - | 20 | Reviewing past research and interviews | Conceptual cost estimation error range | - | - | 5-fold CV | - |
[17] | Cost performance prediction | SVM | - | 64 | PDRI (Project Definition Rating Index) | Project cost performance | Wrapper | 39 | 10-fold CV | - |
[18] | Cost overrun investigation | NN classification and regression | - | 1 | DOT (Department of Transportation) | Closest ratio to the actual cost of the project | - | - | 5-fold CV | - |
[19] | Causes of delays investigation | NB and DT | + | 9 | Surveys and project reports | Occurrence or non-occurrence of delay | - | - | Hold-Out | - |
[20] | Cost overrun and delay prediction | NNs classification | - | 15 | El-Maaty et al. [20] | Cost overrun and delay percentage | - | - | Hold-Out | - |
[21] | Cost overrun prediction | NB, DT, SVM, RF | + | 48 | Reviewing past research | Cost overrun | Correlation attribute eval, Info gain attribute eval, Wrapper | 1 | Hold-Out | + |
[22] | Delay prediction | NB and DT | + | 9 | Reviewing past research and holding meetings | Delay | - | - | 10-fold CV | - |
[8] | Delay prediction using delay risk analysis | ANN, SVM, K-NN | + | 36 | Reviewing past research | Delay | Correlation attribute eval, Wrapper | 4 | Hold-Out | - |
[23] | Delay prediction | RF | + | 37 | Reviewing past research and interviews | Delay | - | - | Hold-Out | - |
[24] | Delay prediction | Ensemble algorithms | - | 24 | Expert surveys | Delay | Chi-squared | 9 | Hold-Out | - |
[25] | Engineering services’ cost overruns prediction | RF regression | - | 12 | Project reports | Engineering services’ cost overruns | - | - | Hold-Out | - |
[26] | Predict construction cost | SVM, ANN, GENLIN (generalized linear regression), CART (classification and regression-based techniques), CHAID (chi-squared automatic interaction detection), and DLNN (deep learning neural network) | - | 10 | Project specification | Preliminary construction cost | - | - | 5-fold CV | - |
Present study | Step 1: Cost overrun prediction, Step 2: Cost overrun risks assessment | BN, NB, and DT | + | 43 | Expert judgments | Cost overrun | Step 1: CFS, Step 2: - | Step 1: 8, Step 2: - | 10-fold CV | + |
Category | Type | Number | Percent |
---|---|---|---|
Organization | Employer | 10 | 25.6 |
Organization | Consultant | 9 | 23.1 |
Organization | Contractor | 11 | 28.2 |
Organization | Consultant/Employer | 3 | 7.7 |
Organization | Consultant/Contractor | 4 | 10.3 |
Organization | Government supervisor | 2 | 5.1 |
Experience (years) | ≤20 | 13 | 33.3 |
Experience (years) | ≤15 | 12 | 30.8 |
Experience (years) | ≤10 | 11 | 28.2 |
Experience (years) | ≤5 | 3 | 7.7 |
Discipline | Civil Engineering | 33 | 84.6 |
Discipline | Architecture | 3 | 7.7 |
Discipline | Electrical Engineering | 2 | 5.1 |
Discipline | Industrial Engineering | 1 | 2.6 |
Education level | Bachelor’s | 20 | 51.3 |
Education level | Master’s | 17 | 43.6 |
Education level | Ph.D. | 2 | 5.1 |
No. | Source | Risk Factors | Reference | Code | Probability (Mean) | Probability (Stdv.) | Impact (Mean) | Impact (Stdv.) |
---|---|---|---|---|---|---|---|---|
1 | Managerial | Poor feasibility study | [57] | Mg1 | 3.28 | 1.21 | 3.95 | 1.02 |
2 | Managerial | Contractor managerial weakness | [45,57] | Mg2 | 3.05 | 1.05 | 4.28 | 0.76 |
3 | Managerial | Poor communication between the parties | [45,58] | Mg3 | 2.59 | 0.68 | 3.28 | 1.02 |
4 | Managerial | Conflict between the project parties | [45] | Mg4 | 2.67 | 1.03 | 3.33 | 0.98 |
5 | Managerial | Consultant managerial weakness | [58] | Mg5 | 2.90 | 0.97 | 3.82 | 0.82 |
6 | Managerial | Owner incapable of project manager | [58] | Mg6 | 2.97 | 1.29 | 3.77 | 1.06 |
7 | Materials and Equipment | Increased price of materials | [59] | Mt1 | 4.59 | 0.55 | 4.74 | 0.59 |
8 | Materials and Equipment | Shortage of equipment | [58] | Mt2 | 2.79 | 1.20 | 3.18 | 1.27 |
9 | Materials and Equipment | Delay by the suppliers in delivering equipment to the site | [45,58,60] | Mt3 | 2.74 | 1.04 | 3.23 | 1.13 |
10 | Materials and Equipment | Shortage of materials | [59] | Mt4 | 2.56 | 1.19 | 3.18 | 1.23 |
11 | Materials and Equipment | New equipment/technology issues | [61] | Mt5 | 2.23 | 1.20 | 2.69 | 1.19 |
12 | Workforce | Lack of knowledge and experience | [45] | Hu1 | 2.74 | 0.88 | 3.28 | 1.02 |
13 | Workforce | Labour shortage | [45,60] | Hu2 | 2.18 | 1.10 | 3.26 | 1.07 |
14 | Workforce | Lack of skilled personnel (technical staff) on site | [45,60] | Hu3 | 2.61 | 1.14 | 3.56 | 1.05 |
15 | Financial | Currency exchange rate | [59,60,61] | Fi1 | 4.20 | 1.13 | 4.49 | 0.91 |
16 | Financial | Inflation | [59,61] | Fi2 | 4.69 | 0.52 | 4.85 | 0.36 |
17 | Financial | Owner fund shortage and payment delays | [57,61] | Fi3 | 4.05 | 1.10 | 4.36 | 0.84 |
18 | Financial | Multiple sources of funds | [57] | Fi4 | 2.54 | 1.00 | 3.13 | 1.15 |
19 | Financial | Contractor fund shortage | [57] | Fi5 | 3.38 | 0.78 | 3.92 | 1.06 |
20 | Project | Adverse change in geological conditions | [45,60] | Pr1 | 2.10 | 1.14 | 3.08 | 1.26 |
21 | Project | Site constraints | [45] | Pr2 | 2.23 | 1.01 | 2.69 | 1.05 |
22 | Project | Project complexity | [45,60] | Pr3 | 2.51 | 1.05 | 3.20 | 1.15 |
23 | Owner | Site availability | [59] | Ow1 | 2.26 | 0.97 | 3.00 | 1.32 |
24 | Owner | Change orders during construction | [58,59,60,61] | Ow2 | 3.44 | 1.16 | 3.77 | 0.96 |
25 | Owner | Delays in decision making | [58] | Ow3 | 3.38 | 1.02 | 3.77 | 0.96 |
26 | Owner | Owner customs policy and complexity (procurement delay) | [45] | Ow4 | 2.95 | 1.34 | 3.48 | 1.33 |
27 | Owner | Delays in land acquisition | [58] | Ow5 | 2.54 | 1.21 | 3.28 | 1.39 |
28 | Owner | Utility supply | [59] | Ow6 | 4.08 | 0.84 | 2.13 | 1.00 |
29 | Owner | Lowest bidder selection | [41] | Ow7 | 3.67 | 1.11 | 3.74 | 1.12 |
30 | Contractor | Lack of knowledge and experience | [45,59,60] | Cn1 | 3.20 | 0.92 | 3.85 | 0.93 |
31 | Contractor | Procurement delays | [45] | Cn2 | 2.85 | 0.84 | 3.54 | 0.91 |
32 | Contractor | Sub-contractor delays from preceding work | [60] | Cn3 | 3.05 | 0.97 | 3.33 | 1.06 |
33 | Contractor | Improper finance management | [41] | Cn4 | 3.15 | 1.09 | 3.77 | 0.90 |
34 | Contractor | Site safety | [45,59] | Cn5 | 2.979 | 1.22 | 3.38 | 1.39 |
35 | Contractor | Construction (defect) quality | [45,59] | Cn6 | 3.10 | 1.12 | 3.74 | 1.19 |
36 | Contractor | Poor planning and scheduling | [58] | Cn7 | 3.28 | 1.19 | 3.97 | 0.90 |
37 | Consultant | Lack of knowledge and experience | [45,58] | Cs1 | 2.74 | 1.09 | 3.67 | 1.08 |
38 | Consultant | Improper design/design errors | [45,59,60] | Cs2 | 2.95 | 1.02 | 3.79 | 1.13 |
39 | Consultant | Delays in delivering design | [45,58] | Cs3 | 2.70 | 1.03 | 3.49 | 0.97 |
40 | Consultant | Change of equipment, or specification of equipment, during construction | [58] | Cs4 | 2.64 | 0.99 | 3.26 | 1.07 |
41 | Environment | Bad weather or emergency condition | [45,57,58,59,61] | Ev1 | 2.64 | 0.93 | 3.28 | 0.94 |
42 | Environment | Unexpected casualties/injuries | [59,60,61] | Ev2 | 1.77 | 1.01 | 2.36 | 1.33 |
43 | Environment | Environment preservation law | [41] | Ev3 | 1.46 | 0.82 | 1.92 | 1.18 |
Instance | Input Mg1 | Input Mg2 | Input Mg3 | ... | Input Ev3 | Class |
---|---|---|---|---|---|---|
1 | Low | Low | Low | ... | Very Low | Moderate |
2 | High | High | Moderate | ... | Very Low | Moderate |
3 | Moderate | High | High | ... | Very Low | Moderate |
... | ... | ... | ... | ... | ... | ... |
38 | Very High | Very High | Moderate | ... | Very Low | High |
39 | Very High | High | Moderate | ... | Very Low | High |
Label | Number | Percent |
---|---|---|
Low | 0 | 0 |
Moderate | 15 | 38.5 |
High | 24 | 61.5 |
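As a quick arithmetic check on the class distribution above, the sketch below recomputes the percentages from the reported counts and derives the accuracy of a trivial classifier that always predicts the most frequent label. The majority-class baseline is added here only as a reference point for reading accuracy figures; it is not part of the study's reported method.

```python
# Class label counts as reported in the table above.
counts = {"Low": 0, "Moderate": 15, "High": 24}

total = sum(counts.values())  # 39 instances in total

# Share of each label, in percent, rounded to one decimal place.
percent = {label: round(100 * n / total, 1) for label, n in counts.items()}

# Majority-class baseline: always predict the most frequent label.
majority_label = max(counts, key=counts.get)
baseline_accuracy = round(100 * counts[majority_label] / total, 1)

print(percent)
print(majority_label, baseline_accuracy)
```

Because the "Low" label has no instances, the prediction task is effectively a two-class problem on this dataset.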
Name | Learning Algorithm | Network Type | Max. Num. of Parents | Accuracy (%) | Area under ROC | Precision | Recall | F1-Score |
---|---|---|---|---|---|---|---|---|
K2-TAN | K2 | initAsNaive Bayes | 2 | 80.25 | 0.89 | 0.87 | 0.80 | 0.83 |
K2-BAN | K2 | initAsNaive Bayes | 3 | 79.75 | 0.89 | 0.86 | 0.80 | 0.83 |
K2-Un | K2 | initAsNaive Bayes | None | 79.75 | 0.89 | 0.86 | 0.80 | 0.83 |
K2GN-2 | K2 | General | 2 | 80.25 | 0.90 | 0.87 | 0.80 | 0.83 |
K2GN-3 | K2 | General | 3 | 79.75 | 0.89 | 0.86 | 0.80 | 0.83 |
K2GN-Un | K2 | General | None | 79.75 | 0.89 | 0.86 | 0.80 | 0.83 |
HC-TAN | hill-climbing | initAsNaive Bayes | 2 | 79.25 | 0.89 | 0.86 | 0.79 | 0.82 |
HC-BAN | hill-climbing | initAsNaive Bayes | 3 | 79.00 | 0.88 | 0.86 | 0.79 | 0.82 |
HC-Un | hill-climbing | initAsNaive Bayes | None | 79.00 | 0.88 | 0.86 | 0.79 | 0.82 |
HCGN-2 | hill-climbing | General | 2 | 77.75 | 0.88 | 0.84 | 0.78 | 0.81 |
HCGN-3 | hill-climbing | General | 3 | 77.50 | 0.88 | 0.84 | 0.78 | 0.81 |
HCGN-Un | hill-climbing | General | None | 77.50 | 0.88 | 0.84 | 0.78 | 0.81 |
TS-TAN | tabu search | initAsNaive Bayes | 2 | 79.25 | 0.88 | 0.86 | 0.79 | 0.82 |
TS-BAN | tabu search | initAsNaive Bayes | 3 | 78.75 | 0.88 | 0.85 | 0.79 | 0.82 |
TS-Un | tabu search | initAsNaive Bayes | None | 78.75 | 0.88 | 0.85 | 0.79 | 0.82 |
TSGN-2 | tabu search | General | 2 | 77.75 | 0.90 | 0.85 | 0.78 | 0.81 |
TSGN-3 | tabu search | General | 3 | 77.75 | 0.90 | 0.85 | 0.78 | 0.81 |
TSGN-Un | tabu search | General | None | 77.75 | 0.90 | 0.85 | 0.78 | 0.81 |
BN classifier models (average) | - | - | - | 78.86 | 0.89 | 0.85 | 0.79 | 0.82 |
NB | - | - | - | 77.92 | 0.89 | 0.85 | 0.78 | 0.81 |
DT | - | - | - | 65.25 | 0.68 | 0.76 | 0.65 | 0.70 |
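The averaged BN row can be reproduced from the eighteen individual accuracies in the table above. The sketch below recomputes that mean and the gap to the NB and DT rows. The variable names are illustrative, and the values are copied from the table rather than regenerated from any model.

```python
# Accuracy (%) of the 18 BN classifier configurations, in table order.
bn_accuracy = {
    "K2-TAN": 80.25, "K2-BAN": 79.75, "K2-Un": 79.75,
    "K2GN-2": 80.25, "K2GN-3": 79.75, "K2GN-Un": 79.75,
    "HC-TAN": 79.25, "HC-BAN": 79.00, "HC-Un": 79.00,
    "HCGN-2": 77.75, "HCGN-3": 77.50, "HCGN-Un": 77.50,
    "TS-TAN": 79.25, "TS-BAN": 78.75, "TS-Un": 78.75,
    "TSGN-2": 77.75, "TSGN-3": 77.75, "TSGN-Un": 77.75,
}

# Accuracy (%) of the two comparison models, as reported in the table.
nb_accuracy = 77.92
dt_accuracy = 65.25

# Mean over all BN configurations, rounded as in the table.
bn_mean = round(sum(bn_accuracy.values()) / len(bn_accuracy), 2)

# Percentage-point gaps between the BN average and the comparison models.
gap_vs_nb = round(bn_mean - nb_accuracy, 2)
gap_vs_dt = round(bn_mean - dt_accuracy, 2)

# Configurations that reach the highest accuracy among the 18.
best_accuracy = max(bn_accuracy.values())
best_models = [name for name, acc in bn_accuracy.items() if acc == best_accuracy]

print(bn_mean, gap_vs_nb, gap_vs_dt, best_models)
```

On these figures the BN average sits slightly above NB and well above DT, and two K2 configurations limited to two parents share the highest accuracy.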
Metrics | Performance |
---|---|
Accuracy | 81.67 |
Area Under ROC | 0.93 |
Precision | 0.89 |
Recall | 0.82 |
F1-score | 0.84 |
Rank | Risk | Number of Connections |
---|---|---|
1 | Increased price of materials | 10 |
2 | Lack of knowledge and experience | 10 |
3 | Inflation | 9 |
4 | Owner customs policy and complexity (procurement delay) | 6 |
5 | Shortage of equipment | 6 |
6 | Adverse change in geological conditions | 6 |
7 | Construction (defect) quality | 5 |
8 | Project complexity | 5 |
9 | Delays in land acquisition | 5 |
10 | Site availability | 5 |
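A ranking like the one above can be obtained by counting, for each risk node, the arcs that link it to other risk nodes in the learned network. The following is a minimal sketch under stated assumptions: the edge list, the class node name `CostOverrun`, and the choice to ignore arcs that touch the class node are all hypothetical, and the source does not describe how its counts were produced.

```python
from collections import Counter
from typing import Iterable, List, Tuple

Edge = Tuple[str, str]  # (parent, child) arc in the network


def rank_by_connections(edges: Iterable[Edge],
                        exclude: Tuple[str, ...] = ("CostOverrun",)) -> List[Tuple[str, int]]:
    """Rank nodes by the number of arcs linking them to other risk nodes.

    Assumption: arcs that touch a node listed in `exclude` (here the class
    node) are ignored, so only risk-to-risk relationships are counted.
    """
    degree: Counter = Counter()
    for parent, child in edges:
        if parent in exclude or child in exclude:
            continue
        degree[parent] += 1  # outgoing arc
        degree[child] += 1   # incoming arc
    # Sort by descending count, then by name for a stable order.
    return sorted(degree.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical arcs using risk codes from the risk factor table (not the learned model).
example_edges = [
    ("CostOverrun", "Mt1"), ("CostOverrun", "Fi2"),
    ("Fi2", "Mt1"), ("Mt1", "Mt2"), ("Mt1", "Cn6"), ("Fi2", "Fi1"),
]
ranking = rank_by_connections(example_edges)
```

Counting both incoming and outgoing arcs treats a connection as undirected, which matches the idea of "number of relationships with other risks" but discards the direction of influence.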
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Ashtari, M.A.; Ansari, R.; Hassannayebi, E.; Jeong, J. Cost Overrun Risk Assessment and Prediction in Construction Projects: A Bayesian Network Classifier Approach. Buildings 2022, 12, 1660. https://doi.org/10.3390/buildings12101660