An Integrated Artificial Intelligence Tool for Predicting and Managing Project Risks

Geamanu, Andreea; Dascalu, Maria-Iuliana; Neagu, Ana-Maria; Guica, Raluca Ioana

doi:10.3390/make8010001

Open AccessArticle

An Integrated Artificial Intelligence Tool for Predicting and Managing Project Risks

¹

Department of Engineering in Foreign Languages, Faculty of Engineering in Foreign Languages, National University of Science and Technology Politehnica Bucharest, 060042 Bucuresti, Romania

²

ICN Business School, 92800 Puteaux, France

^*

Author to whom correspondence should be addressed.

Mach. Learn. Knowl. Extr. 2026, 8(1), 1; https://doi.org/10.3390/make8010001

Submission received: 16 November 2025 / Revised: 8 December 2025 / Accepted: 9 December 2025 / Published: 20 December 2025

(This article belongs to the Topic AI and Computational Methods for Modelling, Simulations and Optimizing of Advanced Systems: Innovations in Complexity, 2nd Edition)

Download

Browse Figures

Versions Notes

Abstract

Artificial Intelligence (AI) is increasingly used to enhance project management practices, especially in risk analysis, where traditional tools often lack predictive capabilities. This study introduces an AI-based tool that supports project teams in identifying and interpreting risks through machine learning and integrated documentation features. A synthetic dataset of 5000 project instances was generated using deterministic rules across 27 input variables, enabling the training of multi-output Decision Tree and Random Forest models to predict risk type, impact, probability, and response strategy. Due to the rule-based structure of the dataset, both models achieved near-perfect classification performance, with Random Forest showing slightly better regression accuracy. These results validate the modelling pipeline but should not be interpreted as real-world predictive accuracy. The trained models were deployed within a web platform offering prediction visualization, automated PDF reporting, result storage, and access to a structured risk management plan template. Survey feedback highlights strong user interest in AI-assisted mitigation suggestions, dashboards, notifications, and mobile access. The findings demonstrate the potential of AI to improve proactive risk assessment and decision-making in project environments.

Keywords:

project management; risk management; Decision Tree; Random Forest; machine learning; classification; regression

Graphical Abstract

1. Introduction

Artificial Intelligence (AI) has increasingly become a strategic asset across industries, enabling automated decision-support, advanced analytics, and real-time insights. In project management (PM), AI adoption has accelerated significantly in recent years, supported by global market projections indicating a rise from USD 2.5 billion in 2023 to USD 5.7 billion by 2028 [1,2]. AI-enabled tools assist project teams by automating routine tasks, analyzing large volumes of project data, and providing predictive insights that enhance planning accuracy and execution efficiency. Generative AI and Machine Learning (ML) further extend these capabilities by producing intelligent summaries, scenario-based recommendations, and adaptive risk assessments that augment the project manager’s decision-making process [3,4,5,6].

Despite these advances, AI applications for risk management remain limited, particularly in predictive modelling and integrated decision support. Existing solutions tend to focus on task automation, document processing, or isolated risk registers, with few providing multi-output risk prediction (type, impact, probability, and response); interpretable ML models tailored to project features; user-friendly platforms linking prediction, documentation, and visualization, or empirical validation through practitioner feedback.

Furthermore, many organizations still rely on qualitative assessments, expert judgement, or static checklists, which are prone to inconsistency, bias, and limited scalability. These limitations highlight a clear gap in the availability of AI-driven tools that combine predictive analytics with practical usability for everyday project management.

To address this gap, this study proposes an AI-based risk management system that integrates supervised and unsupervised ML techniques to support project teams throughout the risk identification and mitigation process. A comprehensive synthetic dataset of 5000 project instances was generated to train a multi-output Random Forest model capable of predicting key risk variables. The system is integrated into a light-weight web application that enables interactive prediction, result storage, automatic reporting, and access to a structured risk management template. A user survey further evaluates practitioner needs and expectations, offering insights into desired functionalities for modern AI-driven PM tools.

The main contributions of this study are as follows: (1) Development of a multi-output ML framework for predicting risk type, probability, impact, and response strategy; (2) Construction of a large, rule-based synthetic dataset reflecting diverse project characteristics and risk conditions; (3) Empirical comparison of Decision Tree (DT) and Random Forest (RF) models for multi-output risk prediction; (4) Implementation of an integrated web-based platform that operationalizes predictive analytics in a practical Project Management (PM) context; (5) Validation of user needs through a dedicated survey targeting PM practitioners.

The remainder of this paper is structured as follows. Section 2 reviews AI applications in PM. Section 3 discusses existing approaches to AI-enabled risk management. Section 4 presents the proposed ML framework, dataset generation process, and integrated tool architecture. Section 5 reports the evaluation results and practitioner survey findings. Section 6 concludes the study and outlines directions for future research.

While the ML algorithms used in this study (Decision Tree and Random Forest) are well-established, the novelty of our work lies in the integration of a complete multi-output prediction pipeline, the transparent construction of a reproducible synthetic dataset, and the development of an end-to-end operational tool linking prediction, visualization, reporting, and practitioner feedback. Current PM risk tools typically treat risk factors independently, provide only qualitative assessments, or lack predictive components. By contrast, our approach models multiple interdependent risk outputs simultaneously and embeds the resulting predictive engine into an accessible practitioner-oriented platform. Nonetheless, we acknowledge that synthetic data cannot fully capture real-world risk complexity; therefore, as future work, we plan to incorporate an additional dataset-either anonymized industrial data or semi-synthetic data augmented with noise-to benchmark model generalizability under more realistic conditions.

2. Related Work

2.1. AI in Project Management

As AI adoption continues to expand across industries, its transformative influence on project management has become increasingly evident. Organizations seeking to improve efficiency, anticipate risks, and strengthen data-driven decision making now view AI not only as a supporting technology but as a strategic partner in project execution. This section examines how AI—particularly generative AI—is reshaping traditional project management processes through automation, predictive analytics, and intelligent decision support.

Recent peer-reviewed studies further strengthen this trend. El Khatib and Al Falasi show that AI-supported decision processes significantly improve collaboration and information flow in complex projects [7]. Similarly, Salimimoghadam demonstrates that AI-augmented decision making reshapes governance structures by enabling more adaptive and data-driven managerial responses [8]. New AI-enabled project dashboards and planning tools have also emerged which highlight the integration of predictive analytics, automated scheduling, and real-time reporting into organizational PM ecosystems [9,10].

Generative Artificial Intelligence (GenAI) leverages advanced models capable of learning patterns from vast amounts of data to generate new content such as text, code, images, and designs. These models function as conversational agents that produce coherent, human-like responses and adapt through continuous learning and feedback. Although GenAI is frequently associated with creative applications, its ability to summarize information, adapt content, and generate context-aware outputs makes it particularly valuable for project management tasks. Prior studies show that GenAI can automate report creation, update timelines, produce concise summaries of large datasets, and recommend corrective actions throughout the project lifecycle [11,12]. It may also advise on the presentation of ideas, identify drivers of potential project failure, and anticipate conditions under which a project should or should not be initiated. By automating repetitive tasks, GenAI frees project managers to concentrate on strategic analysis, stakeholder engagement, and higher-level decision making.

Practical examples further illustrate these benefits. As noted by Michael McCullough, Project Manager at Amtrak, AI-enabled transcription and summarization tools can automatically brief late participants during meetings, generate consolidated minutes, and assign action items—substantially reducing administrative workload. While such technologies provide substantial support, they do not inherently understand the deeper contextual nuances of language; human oversight therefore remains essential when interpreting AI-generated insights. Nonetheless, GenAI has the potential to substantially increase project efficiency, improve risk preparedness, enhance stakeholder satisfaction, and contribute to overall project success.

Beyond practical examples, recent academic studies confirm the value of generative AI in project environments. Naji et al. [13] explain how GenAI enhances scenario-based reasoning and reduces decision ambiguity in engineering projects. Recent reviews show that AI-enabled project management tools significantly expand decision support, enabling earlier detection of risks and performance deviations. For instance, Taboada et al. [9] conclude that predictive analytics and AI-driven decision support are increasingly used for project monitoring and control, while Felicetti et al. [14] documents empirical cases where AI dashboards and predictive models improved early identification of schedule, budget or resource deviations and Valentine [15] argues that generative models fundamentally accelerate knowledge extraction from large and unstructured data sources. Together, these contributions position GenAI as a catalyst for higher-quality, evidence-based project decision making.

The importance of leveraging past project knowledge has also been highlighted in recent PMI work. Brunet [16] demonstrates how generative AI can bridge the persistent gap between data collection and actionable insights by consolidating historical information—such as Lessons Learned—and transforming it into practical recommendations for future initiatives. This approach offers a promising direction for organizational learning and evidence-based project management [17].

A growing number of AI-enabled tools further contributes to this transformation. Platforms such as PMI Infinity, the ChatGPT 5.2 AI Assistant for Jira, and Microsoft Copilot provide automated reporting, predictive analytics, and intelligent navigation through large repositories of project management resources. PMI Infinity, built on GPT-based architecture, enables members to access curated best practices, validated sources, and recommended prompts for real-time problem solving. Similarly, tools integrated into Jira or Microsoft 365 enhance productivity by recommending tasks, analyzing project status, and assisting with scheduling and prioritization [18,19].

These industry developments are aligned with findings from recent research on AI-enabled PM tools. Nindartin et al. [20] demonstrate that ensemble learning significantly improves the prediction of project cost deviations, while Ashtari et al. [21] demonstrate that machine-learning models can assess and predict cost-overrun risks in construction projects, enhancing early risk detection and proactive mitigation.

As project managers adopt these advanced tools, they must also develop the ability to formulate precise and meaningful questions [22]. The quality of GenAI outputs depends heavily on the quality of user input, and project managers remain responsible for decisions made based on AI-assisted insights. Strong business acumen and domain expertise remain crucial, as GenAI can support—but not replace—professional judgement. When used effectively, GenAI can help project managers rapidly acquire domain insights, evaluate emerging trends, simulate potential scenarios, and refine project strategies based on organizational context.

In practice, project managers can use GenAI to articulate challenges, request analyses grounded in industry data, and obtain best practices or step-by-step recommendations. AI models may also simulate project outcomes under various assumptions, enabling teams to validate and adjust their strategies. Iterative use—combined with human verification—allows AI systems to refine outputs over time and better align with organizational needs. While GenAI enhances general project management capabilities, its potential is particularly strong in specialized domains such as risk management, where predictive analytics and automated assessments can significantly strengthen project resilience.

In parallel, emerging research highlights the importance of integrating AI within broader digital transformation strategies. For example, Gao et al. [23] identify both drivers and barriers for AI adoption in project-based organizations, emphasizing the need for interpretable, user-friendly tools. Qian et al. [24] and Goyal et al. [25] also discuss the challenges of obtaining real project data and propose synthetic or semi-synthetic datasets for benchmarking risk prediction models—approaches directly aligned with our methodological choices.

2.2. AI in Risk Management

As described in PMBOK Guide, the book of reference for PM, “Project Risk Management comprises the procedures of conducting risk management planning, identification, analysis, response planning, response implementation, and monitoring risk on a project” [26]. The aim of project risk management is to maximize the possibility of project success by lowering the chance and/or effect of negative risks and raising the likelihood and/or effect of positive risks. All projects involve inherent risks due to their unique nature, varying complexity, constraints, and stakeholder expectations. Successful project risk management seeks to identify and manage risks that could undermine project goals. Risks exist at both individual and overall project levels, with individual risks potentially impacting specific project objectives and overall project risk representing the cumulative effect of all uncertainties. Project risk management processes aim to mitigate negative risks (threats) while capitalizing on positive risks (opportunities). Managing overall project risk involves minimizing negative variations, maximizing positive outcomes, and ensuring the probability of achieving project objectives remains high.

Recent studies also stress the role of AI and machine learning in automating risk detection and forecasting. Yaseen et al. [27] used Random Forest models to predict schedule delays with high reliability, while Goyal et al. [25] showed that rule-based synthetic datasets can effectively validate ML risk pipelines when confidential industrial data cannot be shared. These contributions highlight the growing need for predictive, data-driven risk assessment solutions that complement conventional PMBOK processes.

To handle developing risks, project risk management procedures should be repeated and carried out all the way through the project lifecycle. Acceptable degrees of risk exposure are defined by risk thresholds, which also direct risk management activities. Good practices seek to guarantee that all kinds of hazards are taken into account and handled successfully, therefore raising the general effectiveness and worth of undertakings.

Regarding threats, five further tactics might be taken into consideration:

Escalate: Escalation is reasonable when a risk is outside the project’s purview or beyond the power of the project manager. Higher up in the company, the risk is controlled, and the pertinent party receives details for ownership acceptance. The project team stops monitoring escalating dangers after that.
Avoid: Risk avoidance entails eliminating the threat or protecting the project from its impact. It is suitable for high-priority threats with a high probability of occurrence and a significant negative impact. Actions may include changing project plans or objectives to eliminate the threat entirely or isolating project objectives from the threat’s impact.
Transfer: Risk transfer involves shifting ownership of the threat to a third party, who will manage the risk and bear its impact. This often involves payment of a risk premium and can be achieved through mechanisms like insurance, performance bonds, or warranties.
Mitigate: Risk mitigation entails reducing the probability of occurrence and/or impact of a threat. Early action is more effective, and examples include adopting simpler processes, conducting more tests, or incorporating redundancy into systems. Prototype development can potentially be a part of mitigation to lower risks.
Accept: Risk acceptance acknowledges the threat’s existence without proactive action. It is suitable for low-priority threats or when it is not feasible to address them otherwise. Active acceptance involves establishing contingency reserves, while passive acceptance involves periodic review to ensure threats do not change significantly.

According to SEI [28], once recognized, the hazards can be ranked and assessed according to likelihood and effect using qualitative risk analysis. This involves implementing preventive measures to minimize risks and creating detailed contingency plans for high-risk scenarios. Choosing mitigation plans specific to each risk, ranking risks according to likelihood and possible impact, setting aside enough time and money for mitigation measures, and delegating clear duties to the project team are all important first steps. Regular review and updates to the response plan are crucial, ensuring its effectiveness in addressing evolving risks and challenges. By fostering a proactive and adaptive approach to risk management, organizations can enhance project resilience and improve overall success.

To facilitate a structured understanding of typical sources of uncertainty encountered in projects, we adopt the risk categorization recommended in the PMBOK Guide [29]. Table 1 summarizes the main risk types commonly described in the project management literature, including technical, organizational, external, financial, schedule-related, and stakeholder-related risks. This classification provides the conceptual foundation for the risk prediction outputs later modelled by our ML system.

Project leaders embracing innovation are increasingly leveraging technology to identify and mitigate risks more effectively. By harnessing unbiased data to discern clear patterns, teams can proactively address potential problems and ensure they are adequately prepared for emerging threats. For example, companies like Aecom and Boeing utilize drones, waterborne aircraft, and advanced analytics to assess risks such as flooding and safety issues in various phases of aircraft development. Vishwajeet Uddanwadiker from Boeing emphasizes the importance of a holistic approach to data analysis, combining domain knowledge with advanced analytics to model safety risks accurately [30].

Automation tools, including AI and ML, play a significant role in generating insights and freeing up project leaders’ time for strategic decision-making. However, it is crucial for project leaders to balance AI-driven insights with human input and analysis, as projects are ultimately managed and executed by people. While AI and technology can enhance productivity and project outcomes, it is essential to address resistance to change and ensure that stakeholders’ feedback is considered in decision-making processes. Striking a balance between technology and human involvement is key to successful risk management and project execution.

Building on the advances of AI in risk management, we propose a dedicated risk tool that integrates predictive analytics and automated assessment capabilities to enhance organizational decision-making and mitigate project uncertainties.

3. Methodology

This study follows a structured methodology combining dataset construction, multi-output machine learning, model evaluation, and system integration. The overall research workflow is further described.

3.1. Research Flow

Define risk components based on PM standards (type, impact, probability, response).
Construct a 5000-instance synthetic dataset using rule-based logic.
Preprocess and encode the dataset (categorical encoding, scaling of ordinal values).
Train multi-output models (Decision Tree and Random Forest).
Evaluate using classification (Hamming Loss, Exact Match Ratio) and regression metrics (MAE).
Deploy models within an interactive web-based tool.
Collect practitioner feedback through a focused survey.

3.2. Dataset Generation and Structure

Because real project risk datasets are proprietary and confidential, we generated a reproducible synthetic dataset of 5000 project instances (see the Supplementary Material). Each instance contains 27 input variables reflecting financial, operational, technical, environmental, and stakeholder dimensions (Table 2).

Categorical features (e.g., industry, economic conditions) were sampled using balanced discrete distributions. Continuous and ordinal variables (e.g., budget, variance, complexity) were produced using uniform, normal, or bounded integer distributions to reflect realistic ranges.

The mapping between input attributes and output risk labels follows a structured conceptual rationale grounded in established project-risk taxonomies. Input variables were grouped according to causal relationships consistently documented in PM standards and empirical research. For example, requirements volatility and budget variance are primary drivers of cost-related risks, while supplier reliability and task interdependence strongly influence schedule-related risks. Likewise, high project complexity combined with low team experience increases coordination and stakeholder-related risks. These conceptual linkages define how multiple input factors are consolidated into broader output categories (risk type, impact, probability, and response plan), ensuring that the dataset reflects meaningful project-risk mechanisms rather than arbitrary groupings. This rationale forms the conceptual foundation for the deterministic rules applied in the subsequent step.

Risk outputs were assigned deterministically using rule-based logic anchored in PM principles. For example:

high technical complexity → technical risk;
large cost variance → financial risk;
high regulatory impact → compliance risk;
high probability + high impact → “mitigate” response plan.

This deterministic mapping ensures internal consistency and provides a controlled environment for model benchmarking.

Because the synthetic dataset was generated using predefined rule dependencies, the output variables (risk probability, impact level, risk category, and response plan) are inherently determined by combinations of input attributes. As a result, the ML model is expected to reproduce these deterministic mappings with very high accuracy. We emphasize that this behaviour should not be interpreted as evidence of superior model performance but rather as validation that the modelling pipeline correctly reconstructs the logical structure encoded in the data. The goal of the synthetic dataset is therefore not to demonstrate predictive generalization but to provide a controlled, fully transparent environment in which the feature—output relationships are traceable, explainable, and reproducible. This controlled setup allows us to validate the architecture, data workflows, and evaluation procedures before applying the framework to empirical project-risk datasets where relationships are non-deterministic and performance metrics carry substantive meaning.

3.3. Model Training and Multi-Output Formulation

The prediction task involves jointly estimating four dependent variables. We adopt a multi-output supervised learning framework, where classification and regression heads are trained simultaneously using Decision Tree and Random Forest algorithms. Categorical features were numerically encoded using Scikit-learn’s preprocessing modules.

Model training used an 80/20 train–validation split with a fixed random seed (42) to ensure reproducibility. Decision Tree and Random Forest models were trained using default Scikit-learn hyperparameters with max_depth = None and n_estimators = 100 for RF. Categorical features were encoded using Scikit-learn’s OrdinalEncoder, and the multi-output formulation was implemented as a parallel approach in which each target is predicted independently within a unified model.

3.4. Evaluation Metrics

Classification performance for risk type and response plan was assessed using Hamming Loss and Exact Match Ratio. Regression accuracy for probability and impact was measured using Mean Absolute Error (MAE). This combination allows a holistic assessment of model behaviour across multiple target types.

3.5. System Integration

The final trained models were embedded into a Flask-based web application featuring prediction dashboards, PDF reporting, result storage, and a structured risk management plan template. This integration demonstrates practical applicability and supports real-world adoption.

4. Machine Learning Tool to Predict Risks in Projects

The strategic importance of identifying risks as both threats and opportunities is undeniable, but every team can benefit from a tech upgrade. Investing in AI and emerging tech tools enables companies and project leaders to better understand, identify, and manage risks, ultimately limiting the probability and impact of unexpected events. According to a 2022 survey by PwC, executives are increasingly allocating resources to risk management technology, particularly focusing on data analytics, process automation, and threat detection [31].

Automation not only reduces cognitive biases but also allows for real-time analysis of data, revealing patterns and flagging noteworthy changes. With examples like Shell utilizing AI and ML to improve supply chain visibility [32] and Boeing employing digital twin models to achieve a 40% increase in first-time quality of components and systems [33], there is a big payout for deploying digitally driven risk management solutions. Amazon Web Services company emphasizes that AI and ML can identify high-level vulnerabilities and prevent risks from cascading through dependent projects, facilitating optimal decision-making [34]. By tapping into AI and ML to collect and analyze data from multiple sources and projects, leaders can efficiently compare, analyze, and optimize information, turning it into strategic insights for the entire project ecosystem. While teams cannot hand off all risk management to AI, strategically leveraging technology accelerates the risk management process, allowing project leaders to focus on analyzing results and improving predictive success.

We developed a tool based on an ML algorithm that takes into consideration project features (input) and predicts the risk type, level, impact, and probability (output): see Table 2. Because project features can differ substantially according to the industry, project scope, and specific operational contexts, we used a scoring system that will help project teams and stakeholders consistently evaluate the risks associated with each project.

To operationalize the ML-based prediction task, each project instance was represented using 27 carefully constructed input features. These variables cover financial, operational, organizational, environmental, and stakeholder-related aspects that influence project exposure to risk. Table 2 provides a detailed description of each feature, the measurement scale used, and the corresponding four output dimensions predicted by our system: risk type, impact, probability, and response strategy. This structured formulation ensures consistency in model training and provides a transparent mapping between project characteristics and the resulting risk profile.

One powerful Python ML library is Scikit-learn [35], which we also used. Several classification ML algorithms from Scikit-learn were tried (e.g., DT, RF classifier), in order to decide what is the best model for risk prediction. ML techniques were chosen for project risk prediction because they provide a robust and data-driven framework capable of capturing complex, non-linear relationships among project variables that traditional statistical or rule-based algorithms often fail to model. Unlike conventional approaches that depend on predefined formulas, fixed thresholds, or expert-assigned weights, ML algorithms can automatically learn from historical data, adapt to new contexts, and improve predictive accuracy over time. This adaptability is particularly valuable in dynamic project environments, where risk factors and interdependencies evolve continuously. Compared with fuzzy logic systems, which are effective in handling linguistic uncertainty but depend on manually defined membership functions and rule sets, ML models offer superior scalability and objectivity. Fuzzy logic approaches require extensive expert calibration and are less flexible when applied to heterogeneous or large-scale datasets. In contrast, ML algorithms infer decision boundaries directly from data, allowing them to discover hidden correlations and generalize to unseen projects without human intervention. Consequently, ML provides a more adaptive, accurate, and generalizable solution for quantitative risk prediction, while fuzzy logic remains complementary for qualitative reasoning when data availability is limited [36].

4.1. Dataset Construction

A synthetic dataset of 5000 project instances was generated to train and evaluate the proposed machine learning models. Because real-world project risk datasets are typically confidential and not publicly available, a rule-based generator was implemented to simulate realistic project behaviour across diverse industries and operational contexts. The dataset includes 27 input variables describing financial, operational, organizational, environmental, and stakeholder-related aspects of projects, as seen in Table 2. The generated data is available here: https://ctipub-my.sharepoint.com/:x:/g/personal/maria_dascalu_upb_ro/EWoIN5rPSuFBkkXs8AMKKsoBbwGiC8aOod-qVsV5ANveyA?e=3DdrnJ, accessed on 15 November 2025.

Categorical attributes (e.g., project industry, legal risk) were sampled using balanced discrete distributions, while continuous and ordinal variables (e.g., budget, schedule variance, technical complexity) were generated using uniform, normal, or bounded integer distributions.

A dedicated deterministic function assigned each project its corresponding risk outputs:

Risk Type (Technical, Operational, Financial, Compliance)
Risk Impact (1–10)
Risk Probability (20–100%)
Risk Response Plan (mitigate, avoid, transfer, accept)

Assignment rules were based on project management logic. For example, high technical complexity and elevated security threats triggered technical risks, while cost variance and budget instability increased the likelihood of financial risks. Because these decision rules encode direct relationships, the resulting dataset exhibits low ambiguity and strong internal consistency—an important factor explaining model performance in subsequent experiments.

4.2. Prediction Problem Formulation and Machine Learning Models

The goal of the proposed system is to predict project risks using a multi-output supervised learning framework. Each project instance is represented as a feature vector:

X = (x_{1}, x_{2}, \dots, x_{27})

(1)

The prediction task involves estimating a set of four dependent outputs:

Y = (y_{1}, y_{2}, y_{3}, y_{4})

(2)

where

y₁ represents the risk type (multiclass classification);
y₂ represents the risk impact (regression);
y₃ represents the risk probability (regression);
y₄ represents the risk response plan (multiclass classification).

Because the outputs are interdependent (e.g., high impact tends to align with certain response strategies), a multi-output learning setup is preferable to training independent models. The dataset was split into 80% training and 20% testing, with all categorical features encoded numerically to enable compatibility with tree-based algorithms.

Two ML approaches were evaluated. DT classifiers and regressors were selected as baseline models due to their interpretability and ability to model non-linear decision boundaries. They were applied using a MultiOutputClassifier and MultiOutputRegressor framework.

4.3. Evaluation Metrics and Experimental Results

Two evaluation categories were used: classification and regression. Regarding classification, we computed Hamming Loss (manual implementation due to multi-output multiclass nature) and Exact Match Ratio (percentage of samples where both risk type and response plan are predicted correctly). Regarding regression, we calculated Mean Absolute Error (MAE) for both impact and probability. This set of metrics provides a complete view of predictive performance across both discrete and continuous outputs.

Table 3 summarizes the performance of both models on the synthetic dataset.

4.4. Interpretation of Results

Given the deterministic nature of the dataset generation rules, DT performed exceedingly well, effectively learning the explicit relationships embedded in the data. RF, as an ensemble method based on bootstrap aggregation, was used for both classification and regression tasks. They typically generalize better than single trees, especially in noisy or high-dimensional datasets. Although the synthetic dataset is highly structured, RF still yielded slightly superior regression performance due to variance reduction across trees.

These results reflect the strong predictability inherent in a dataset where input–output relationships follow explicit rules. The models correctly recover the generative logic and replicate it with minimal error.

Accordingly, these findings should be interpreted as validation of the modelling pipeline, not evidence of real-world predictive superiority. Performance on actual project portfolios would naturally be lower due to noise, incomplete information, and overlapping risk patterns.

A key limitation of this study is the use of synthetic, rule-based data. While this approach ensures reproducibility and controllability, it also leads to deterministic relationships that inflate predictive performance. As a result, the high accuracy values reported in Section 3.3 reflect internal consistency rather than real-world generalizability. Future work will incorporate anonymized industrial datasets, introduce controlled noise, and explore probabilistic risk modelling to better simulate real project uncertainty.

4.5. Integrative Tool

The trained models are deployed inside a web-based application built using the Flask framework. In Figure 1, there is the component diagram of the developed tool, highlighting key parts such as the user interface, risk prediction module, authentication module and the database.

Figure 1 presents the system’s component architecture, illustrating the separation of concerns between the presentation layer, the application logic, the machine-learning services, and the persistence layer. The user interacts with the system through a standard web browser, which issues HTTP requests to the Flask application. Flask functions as the central controller, responsible for routing, request handling, session management, and orchestrating downstream service calls.

Within the web application boundary, HTML templates and CSS/JavaScript assets provide the client-side rendering and interface logic. These static resources are served directly by Flask and executed on the client side, enabling form submission, asynchronous updates, and communication with backend inference endpoints.

The ML subsystem is encapsulated as a separate component cluster consisting of three independent models:

a Classifier Model (risk type and response plan),
an Impact Regressor, and
a Probability Regressor.

Flask invokes these models via internal function calls, passing preprocessed input features and receiving predictions in a structured format. The models operate independently, enabling parallel execution and modular retraining without affecting the rest of the system. Their encapsulation also ensures clear version control and reproducibility of inference results.

The persistence layer is implemented using an SQLite database, which provides lightweight, file-based storage for user-submitted project data, prediction results, and generated reports. Flask manages database interactions through CRUD operations, supporting both synchronous writes (e.g., logging predictions) and asynchronous reads (e.g., loading historical entries).

Data flows between components follow a well-defined request/response pattern:

User input is collected in the browser.
Flask validates and preprocesses the input.
The backend invokes the appropriate ML models.
Predictions are stored in the SQLite database.
Results are assembled into an HTML response and delivered back to the user.

This architecture provides a modular, maintainable, and scalable structure, allowing the ML components, UI templates, and database layer to evolve independently while ensuring reliable end-to-end interaction across the system.

The system integrates: an ML inference module for predicting risk outputs, a user authentication component, a result storage engine via SQLAlchemy, an interactive prediction dashboard (see Figure 2), automatic PDF reporting tools, and a downloadable risk management plan template. This project tool enables practitioners to enter project data, obtain predictions instantly, and maintain a repository of previous analyses.

The download report functionality produces a PDF report including the latest prediction and input data. The report has a bar chart displaying the likelihood and impact values: see Figure 3. If the session lacks any prediction or input data, the user will be automatically returned to the home page. Users can obtain a downloadable and shareable report of their prediction results using this method.

In addition, the user has access to a Risk Management Plan template that they can use to brainstorm potential risks and their impact, offering also thorough mitigation strategies for every risk. PMs will find great use for this template since it provides an organized and understandable approach to managing risks, guaranteeing that we are ready for any problems that may arise during the project, having a structured brainstorming document.

A complete activity diagram of the tool is available in Figure 4.

The activity diagram outlines the end-to-end interaction flow within the risk-prediction web application. After accessing the system, the user either authenticates or registers, triggering the corresponding backend processes for credential validation and account creation. Once authenticated, the user submits a risk-prediction form, which the system processes through the application logic and connected machine-learning models. The generated prediction is presented to the user, who may optionally save the result, add comments, view historical records, or generate a downloadable PDF report. Conditional branches capture user decisions such as navigating back to the home page, accessing stored results, or downloading supplemental templates. The workflow concludes with the logout operation, during which the system terminates the session and redirects the user to the login interface. Overall, the diagram illustrates the orchestration of authentication, data processing, ML inference, storage operations, and session management within the application architecture.

4.6. Survey Feedback

We conducted a survey to gather opinions regarding the potential functionalities of the application: see Appendix A Most responders are between the ages of 22 and 44, and include both genders, with a slight preference for females. Many participants hold a Bachelor’s degree, while others have a Master’s degree or have completed high school. Engineering, information technology, project management, and medical engineering are among the many areas of competence available. The respondents’ project management experience ranges from less than a year to more than ten years, with a notable concentration in the 1–3- and 4–6-year groupings. This diversified background provides a detailed picture of how many experts approach AI in risk management. The vast majority of respondents utilize project management software on a regular basis, with many claiming daily or weekly usage. This emphasizes the importance of digital technologies for efficient project management. However, a few respondents stated occasional or monthly use, implying that not all professionals rely extensively on these technologies.

The survey collected a total of N = 32 responses (convenience sampling), obtained by distributing the questionnaire to project management practitioners and graduate engineering students through academic mailing lists and professional networks. Participation was voluntary and anonymous. All Likert-scale items used a 5-point response format (1 = very low/very dissatisfied, 5 = very high/very satisfied), enabling descriptive statistical analysis of central tendencies and response distributions. A brief assessment of potential sampling bias was also performed: the dataset is slightly skewed toward early-career professionals and users with moderate PM tool experience, which may influence the perceived usefulness of AI-based functionalities. Nonetheless, the diverse expertise represented in the sample (engineering, IT, PM, business) provides valuable exploratory insights into user expectations for AI-enabled risk management tools.

The majority of respondents believe risk management is important. Risk management is performed using a variety of tools, including spreadsheets, dedicated software, and manual approaches. Project management software with built-in risk management tools is also widely used, as presented in Figure 5. Satisfaction with present risk management tools varies, with many respondents reporting moderate to high satisfaction, although some see potential for improvement.

There is widespread agreement on the value of embedding AI predictions into risk management solutions. Most respondents feel that AI can considerably improve risk management by forecasting risk kinds, impacts, and probabilities. AI-based risk management apps should have capabilities that suggest mitigation methods, save previous results, issue notifications, and provide interactive dashboards. Respondents also stressed the importance of mobile accessibility, multilingual support, and collaborative tools. A few participants mentioned the possibility of AI chatbots to help with risk management: see Figure 6.

Respondents provided good response, demonstrating a broad interest in AI-powered project management systems. They saw AI’s promise for providing more accurate risk assessments and facilitating improved decision-making. Some respondents mentioned a desire for improved visualization capabilities and integration with existing project management software, indicating a preference for seamless and user-friendly interfaces. This analysis emphasizes the need of building AI-based risk management tools that meet the unique needs of project managers from various areas. The collected input will be essential in developing the application and ensuring it meets user expectations effectively.

5. Discussions

The results demonstrate the feasibility of integrating multi-output ML models into an operational PM risk management tool. Theoretically, this work contributes to bridging a recognized gap between qualitative PM standards and quantitative, data-driven approaches to risk prediction. Unlike traditional risk registers or checklist-based assessments, the system models interdependencies between risk type, impact, probability, and response strategy, offering a more holistic representation of project uncertainty.

Practically, the deployed tool offers project teams immediate value through automated prediction, structured documentation, and visual reporting. Survey findings indicate strong practitioner interest in AI-assisted mitigation suggestions, dashboards, and notification systems. These insights provide guidance for enhancing AI-enabled PM environments.

Despite the near-perfect classification performance, results must be interpreted cautiously. The determinism of the synthetic dataset results in high internal consistency and artificially elevated accuracy. Therefore, the current findings validate the modelling pipeline rather than providing a real-world benchmark.

Several limitations should be acknowledged:

Synthetic determinism: The dataset encodes direct rules, limiting the model’s exposure to ambiguity, noise, or contradictory patterns typical in real projects. Because risk labels were generated using rule-based logic, the learning task becomes highly structured and presents little ambiguity. As a consequence, the near-perfect accuracy observed—especially in classification—is not indicative of real-world performance but reflects the internal consistency between the input features and the deterministic mapping rules.
Generalizability: High accuracy cannot be extrapolated to real-world settings without validation on industrial datasets.
Scope of risk factors: The feature set, while comprehensive, may omit domain-specific risks relevant to particular industries.
Lack of temporal data: Real project risks evolve over time, but the dataset does not include time-series characteristics.
Survey scale: Practitioner feedback, while insightful, is based on a modest sample size.

The next phase of this project will therefore involve (i) incorporating real project data from industrial partners, (ii) extending the dataset with stochastic noise to simulate uncertainty, and (iii) evaluating model behaviour under incomplete, noisy, or conflicting project information.

6. Conclusions

This study introduced an AI-based risk prediction tool that integrates supervised machine learning models with an intuitive, practitioner-oriented web platform to support modern project teams in proactive risk assessment. Using a synthetic dataset of 5000 project instances—generated through deterministic rule-based logic over 27 input variables—the system is capable of predicting four essential dimensions of project risk: type, impact, probability, and recommended response plan. The Random Forest model consistently outperformed the Decision Tree baseline, particularly on regression tasks; however, the deterministic nature of the dataset resulted in near-perfect classification accuracy for both models, indicating that the reported results validate the modelling pipeline rather than serving as real-world predictive benchmarks.

Beyond prediction capabilities, the integrative platform includes a suite of practical features such as interactive result visualization, automatic PDF reporting, user accounts, historical storage of analyses, and access to a structured Risk Management Plan template. Survey responses from practitioners further confirmed the tool’s relevance, highlighting the need for AI-supported mitigation suggestions, dynamic dashboards, notification systems, and seamless integration with existing project management workflows. These findings align with previous observations in the literature emphasizing the importance of digital tools and intelligent decision support in risk-intensive technological environments [37,38,39].

Future work will focus on several key directions: (1) training and validating the models on anonymized real-world project datasets; (2) introducing controlled noise and probabilistic logic to better reflect uncertainty; (3) incorporating explainable AI (XAI) components to increase transparency of model recommendations; (4) integrating the system with enterprise project management ecosystems such as Jira and Microsoft Project; and (5) expanding the survey to a larger and more diverse professional population.

Overall, the results demonstrate that AI-enabled tools have strong potential to enhance project preparedness, strengthen decision-making processes, and support proactive risk management across a wide range of organizational contexts.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/make8010001/s1.

Author Contributions

Conceptualization, A.G. and M.-I.D.; methodology, A.G.; software, A.G.; validation, M.-I.D., A.-M.N. and R.I.G.; formal analysis, M.-I.D.; investigation, A.G.; resources, R.I.G.; data curation, A.G.; writing—original draft preparation, A.G.; writing—review and editing, M.-I.D. and A.-M.N.; visualization, A.-M.N.; supervision, M.-I.D.; project administration, M.-I.D.; funding acquisition, M.-I.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The synthetic dataset used in this study is provided as Supplementary Material in Excel format and also available at this link: https://ctipub-my.sharepoint.com/:x:/g/personal/maria_dascalu_upb_ro/EWoIN5rPSuFBkkXs8AMKKsoBbwGiC8aOod-qVsV5ANveyA?e=3DdrnJ, accessed on 15 November 2025. It contains all 5000 generated project instances and the corresponding input and output variables used for model training and evaluation.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AI	Artificial Intelligence
ML	Machine Learning
DT	Decision Tree
RF	Random Forest
PM	Project Management
GenAI	Generative Artificial Intelligence

Appendix A

The questionnaire applied to gather opinions regarding the potential functionalities of the application for risk prediction is available in Table A1.

Table A1. The questionnaire applied to gather options regarding the integrative tool for risk prediction.

Section	Question	Response Options
General Information	What is your age?	18–24; 25–34; 35–44; 45–54; over 55
	What is your gender?	Male; Female; Prefer not to say
	What is your highest level of education?	Highschool; Bachelor; PhD; Others: ___________
	What is your field of expertise?	Project Management; Information Technology; Engineering; Business Administration; Others: ___________
Experience in Project Management	How many years of experience do you have in project management?	Less than 1 year; 1–3 years; 4–6 years; 7–10 years; more than 10 years
	How often do you use project management tools/software?	Daily; Weekly; Monthly; Rarely; Never
Project Risk Management	How important is risk management in your projects?	Likert scale 1–5 (Not important → Very important)
	Which methods/tools do you currently use for risk management?	[ ] Spreadsheets; [ ] Dedicated RM software; [ ] PM software w/RM features; [ ] Manual methods; [ ] Others: ____
	How satisfied are you with your current RM tools/methods?	Likert scale 1–5 (Very dissatisfied → Very satisfied)
Evaluation of AI-based Application	Do you think AI predictions are useful in RM applications?	Yes; No; Maybe
	Desired AI-based features	[ ] Predict risk type; [ ] Predict impact; [ ] Predict probability; [ ] Mitigation plans; [ ] PDF reports; [ ] Templates; [ ] Comments; [ ] Save results; [ ] Notifications
Additional Features	Additional AI-based RM features	[ ] Integration w/ PM tools; [ ] Dashboards; [ ] Multi-user; [ ] Visualizations; [ ] Mobile access; [ ] Multi-language; [ ] Chatbot; [ ] Others: ____

References

World Economic Forum. The Future of Jobs Report 2023; World Economic Forum: Geneva, Switzerland, 2023; p. 5. Available online: https://www.weforum.org/publications/the-future-of-jobs-report-2023/digest (accessed on 9 November 2025).
Nilsson, M. PMI: Community-Led AI and Project Management Report; Project Management Institute: Newtown Square, PA, USA, 2024. Available online: https://www.pmi.org/-/media/pmi/documents/public/pdf/artificial-intelligence/community-led-ai-and-project-management-report.pdf?rev=bca2428c1bbf4f6792f521a95333b4df (accessed on 9 November 2025).
Gartner. Gartner Report; Gartner: Stamford, CT, USA, 2019. [Google Scholar]
IBM. A New Frontier for the Future of Work; IBM: Armonk, NY, USA, 2022. [Google Scholar]
Haan, K. Forbes Advisor Research with 600 American Business Owners; Forbes: Jersey City, NJ, USA, 2023. [Google Scholar]
Ahmed, M. The Rise of AI: How AI Is Revolutionizing Project Management; Project Management Institute: Newtown Square, PA, USA, 2023. [Google Scholar]
El Khatib, M.; Al Falasi, A. Effects of Artificial Intelligence on Decision Making in Project Management. Am. J. Ind. Bus. Manag. 2021, 11, 251–260. [Google Scholar] [CrossRef]
Salimimoghadam, S. The Rise of Artificial Intelligence in Project Management: A Systematic Literature Review. Buildings 2025, 15, 1130. [Google Scholar] [CrossRef]
Taboada, I. Artificial Intelligence Enabled Project Management: A Systematic Review. Appl. Sci. 2023, 13, 5014. [Google Scholar] [CrossRef]
Adebayo, Y.; Udoh, P.; Kamudyariwa, X.B.; Osobajo, O.A. Artificial Intelligence in Construction Project Management: A Structured Literature Review of Its Evolution in Application and Future Trends. Digital 2025, 5, 26. [Google Scholar] [CrossRef]
Müller, R.; Locatelli, G.; Holzmann, V.; Nilsson, M.; Sagay, T. Artificial intelligence and project management: Empirical overview, state of the art, and guidelines for future research. Proj. Manag. J. 2024, 55, 9–15. [Google Scholar] [CrossRef]
Project Management Institute (PMI). Generative AI Overview for Project Managers. Available online: https://www.pmi.org/shop/p-/elearning/generative-ai-overview-for-project-managers/el083 (accessed on 9 November 2025).
Naji, K.K.; Gunduz, M.; Mohamed, A.; Alomari, A. Generative AI for Sustainable Project Management in the Built Environment: Trends, Challenges, and Future Directions. Sustainability 2025, 17, 9063. [Google Scholar] [CrossRef]
Felicetti, A.M.; Cimino, A.; Mazzoleni, A.; Ammirato, S. Artificial intelligence and project management: An empirical investigation on the appropriation of generative Chatbots by project managers. J. Innov. Knowl. 2024, 9, 100545. [Google Scholar] [CrossRef]
Valentine, J.O. Exploring the Potential of Artificial Intelligence Tools in Educational Measurement and Assessment. Eurasia J. Math. Sci. Technol. Educ. 2023, 19, em2307. [Google Scholar] [CrossRef] [PubMed]
Brunet, P. Unlocking Collective Wisdom: Generative AI for Lessons Learned Analysis in Project Management; PMI: Newtown Square, PA, USA, 2023. [Google Scholar]
Bainey, K. PMP, CEO K-pic Systems. Interview by PMI. Available online: https://k-picsystems.com/ (accessed on 9 November 2025).
PMI. PMI Infinity. Available online: https://www.pmi.org/membership/infinity (accessed on 9 November 2025).
Atlassian. ChatGPT AI Assistant for Jira. Atlassian Marketplace. Available online: https://marketplace.atlassian.com/apps/1230962/chatgpt-ai-assistant-for-jira?tab=privacy-and-security (accessed on 9 November 2025).
Nindartin, A.; Park, S.-J.; Lee, K.-T.; Kim, J.-H.; Rostiyanti, S.F. Prediction of cost contingency in construction projects by introducing machine learning algorithms. J. Civ. Eng. Manag. 2025, 31, 860–880. [Google Scholar] [CrossRef]
Ashtari, M.A.; Ansari, R.; Hassannayebi, E.; Jeong, J. Cost Overrun Risk Assessment and Prediction in Construction Projects: A Machine-Learning Approach. Buildings 2022, 12, 1660. [Google Scholar] [CrossRef]
Gao, S.; Low, S.P.; Lim, X.Y.V. Prospects, drivers of and barriers to artificial intelligence adoption in project management. Built Environ. Proj. Asset Manag. 2023, 13, 629–645. [Google Scholar] [CrossRef]
Microsoft. Microsoft 365 Copilot. Available online: https://www.microsoft.com/en-us/microsoft-365/copilot (accessed on 9 November 2025).
Qian, Z.; Callender, T.; Cebere, B.; Janes, S.M.; Navani, N.; van der Schaar, M. Synthetic Data for Privacy-Preserving Clinical Risk Prediction. Sci. Rep. 2024, 14, 25676. [Google Scholar] [CrossRef] [PubMed]
Goyal, M.; Lathia, N.; Raub, D. A Systematic Review of Synthetic Data Generation Methods. Electronics 2024, 13, 3509. [Google Scholar] [CrossRef]
PMI. A Guide to the Project Management Body of Knowledge (PMBOK^® Guide), 6th ed.; PMI: Newtown Square, PA, USA, 2017. [Google Scholar]
Yaseen, Z.M.; Ali, Z.H.; Salih, S.Q.; Al-Ansari, N. Prediction of Risk Delay in Construction Projects Using a Hybrid Artificial Intelligence Model. Sustainability 2020, 12, 1514. [Google Scholar] [CrossRef]
SEI. The Power of Preparedness: Building the Risk Management Plan Your Business Needs; SEI: Stockholm, Sweden, 2023. [Google Scholar]
PMI. The Standard for Risk Management in Portfolios, Programs, and Projects; PMI: Newtown Square, PA, USA, 2019. [Google Scholar]
Uddanwadiker, V. Predict to Prevent: Aerospace Safety Analytics. Innov. Q. Available online: https://www.boeing.com/innovation (accessed on 21 November 2025).
PwC. 2022 Global Risk Survey, February–March 2022; PwC: London, UK, 2022; Available online: https://www.pwc.com/sk/en/current-press-releases/global-risk-survey.html (accessed on 9 November 2025).
Thomas, R. Shell Adopts AI Risk Management across Global Supply Chain. Supply Chain. Digit. 2021. Available online: https://supplychaindigital.com/supply-chain-risk-management/shell-adopts-ai-risk-management-across-global-supply-chain (accessed on 21 November 2025).
Bellamy, W., III. Boeing CEO Talks ‘Digital Twin’ Era of Aviation. AviationToday. 14 September 2018. Available online: https://www.aviationtoday.com/2018/09/14/boeing-ceo-talks-digital-twin-era-aviation/ (accessed on 21 November 2025).
AWS. AI/ML for Security—AWS Prescriptive Guidance. Available online: https://docs.aws.amazon.com/prescriptive-guidance/latest/security-reference-architecture/ai-ml.html (accessed on 9 November 2025).
Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. Available online: https://www.jmlr.org/papers/volume12/pedregosa11a/pedregosa11a.pdf (accessed on 21 November 2025).
Bodea, C.N.; Dascălu, M. Modeling Research Project Risks with Fuzzy Maps. Quantitative Methods in Enterprises. Behav. Anal. Under Risk Uncertain. 2009, 22, 53–68. [Google Scholar]
KPMG. AI Transforming the Enterprise; KPMG: Amstelveen, The Netherlands, 2019; Available online: https://assets.kpmg.com/content/dam/kpmg/tr/pdf/2021/03/ai-trends-transforming-the-enterprise.pdf (accessed on 22 November 2025).
Fireteanu, V.-V.; Ciuc, M. Designing Risk Assessment Applications for Internet of Things Projects. Sci. Bull. UPB Ser. C Comput. Sci. Eng. 2021, 83, 145–156. [Google Scholar]
Nieto-Rodriguez, A.; Vargas, R.V. How AI Will Transform Project Management. Harv. Bus. Rev. 2023, 2. Available online: https://hbr.org/2023/02/how-ai-will-transform-project-management (accessed on 21 November 2025).

Figure 1. Component Diagram of the Integrative Tool for Risk Predictions in Projects.

Figure 2. Machine Learning Tool to Predict Risks in Projects: Prediction Dashboard.

Figure 3. Machine Learning Tool to Predict Risks in Projects: Prediction Report.

Figure 4. Activity Diagram of the Integrative Tool.

Figure 5. Risk Management Methods used by Responders.

Figure 6. Features for a Risk Management App according to Survey.

Table 1. Risk types.

Risk Type	Description
Technical risks	Risks related to the technology or tools being used in the project, such as compatibility issues or technical failures.
Organizational risks	Risks related to the organization or company carrying out the project, such as lack of resources or poor communication.
External risks	Risks related to external factors outside of the organization’s control, such as changes in regulations or natural disasters.
Schedule risks	Risks related to the project timeline, such as delays or unexpected changes in the schedule.
Cost risks	Risks related to the project budget, such as unexpected expenses or cost overruns.
Resource risks	Risks related to the availability or allocation of resources needed for the project, such as staff or equipment.
Quality risks	Risks related to the quality of the project deliverables, such as defects or errors.
Scope risks	Risks related to the project scope, such as changes in requirements or unclear project goals.
Stakeholder risks	Risks related to the project stakeholders, such as conflicts or misunderstandings.
Communication risks	Risks related to communication within the project team or with external stakeholders, such as misunderstandings or lack of information sharing.
Legal risks	Risks related to legal or regulatory compliance, such as violations or lawsuits.
Environmental risks	Risks related to the impact of the project on the environment, such as pollution or damage to natural resources.
Safety risks	Risks related to the safety of project team members or other stakeholders, such as accidents or injuries.

Table 2. Input and Output Features for our ML Algorithm for Risk prediction.

Feature Type	Feature Name	Measurement/Interpretation
input	Project ID	Unique identifier for each project
	Project Industry (Construction, Software, Manufacturing, etc.)	The sector in which the project is taking place
	Project Duration (in months)	Total planned duration of the project
	Percentage of Completion: Numeric (0–100%)	The progress of the project if it is ongoing
	Initial Budget (in $)	The budget allocated at the start of the project
	Cost Variance (+/− number of $)	The difference between the projected budget and the actual spending
	Scope Creep (number of changes in scope)	Quantifies how many times the project scope has expanded beyond the original plan
	Technical Complexity (score 1–5)	Assesses the level of technical difficulty
	Operational Efficiency (score 1–5)	Assesses how well the projects’ operations are going
	Budget Changes (score 1–5)	Indicates frequency and magnitude of budget adjustments
	Regulations’ Impact (score 1–5)	Measures the effect of regulatory requirements on the project
	Deviations from Strategic Goals (score 1–5)	Captures how much the project deviates from its initial strategic objectives
	Market Fluctuations (score 1–5)	Evaluates how market changes could impact the project
	Reputational Impact (score 1–5)	Indicates the potential effect of the project on the organization’s reputation
	Environmental Impact (score 1–5)	Analyses the project’s environmental aspects
	Legal Challenges (score 1–5)	Reflects the intensity of legal issues encountered
	Security Threats (score 1–5)	Rates the severity of security concerns related to the project
	Supply Chain Disruptions (score 1–5)	Measures the impact of supply chain issues on the project
	HR Issues (score 1–5)	Evaluates human resources challenges within the project
	Schedule Changes (score 1–5)	Captures the frequency and impact of changes to the project schedule
	Schedule Variance (+- number of days)	Quantifies the difference between planned and actual timelines
	Stakeholder Issues (score 1–5)	Reflects challenges arising from stakeholder interactions
	Stakeholder Engagement Level (score 1–3)	Scores the degree of involvement of stakeholders
	Stakeholder Influence Level (score 1–3)	Calculates the level of influence stakeholders have over the project
	External Dependencies (score 1–3)	Evaluates the project’s reliance on external factors or third parties
	Economic Conditions (0 for unstable, 1 for stable)	Binary indicator of the economic environment’s stability
	Quality Issues (0 for no, 1 for yes)	Binary indicator of whether the project has faced any significant quality issues
output	Risk Type	Descriptive category of the most significant risk faced
	Risk Impact	Scale of 1–10 rating the potential impact of the risk
	Risk Probability	Percentage of likelihood of the risk to occur
	Risk Response Plan	The approach taken to control recognized risks

Table 3. Experimental Results for Risk Prediction.

Metric	Decision Tree	Random Forest	Comments
Hamming Loss	0.00	0.0005	Near-perfect classification due to deterministic dataset
Exact Match Ratio	1.00	0.999	Both models replicate the rule-based assignments almost perfectly
MAE—Impact	1.67	1.28	Random Forest performs better due to variance reduction.
MAE—Probability	12.61	9.46	Random Forest again outperforms Decision Tree for regression outputs

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Geamanu, A.; Dascalu, M.-I.; Neagu, A.-M.; Guica, R.I. An Integrated Artificial Intelligence Tool for Predicting and Managing Project Risks. Mach. Learn. Knowl. Extr. 2026, 8, 1. https://doi.org/10.3390/make8010001

AMA Style

Geamanu A, Dascalu M-I, Neagu A-M, Guica RI. An Integrated Artificial Intelligence Tool for Predicting and Managing Project Risks. Machine Learning and Knowledge Extraction. 2026; 8(1):1. https://doi.org/10.3390/make8010001

Chicago/Turabian Style

Geamanu, Andreea, Maria-Iuliana Dascalu, Ana-Maria Neagu, and Raluca Ioana Guica. 2026. "An Integrated Artificial Intelligence Tool for Predicting and Managing Project Risks" Machine Learning and Knowledge Extraction 8, no. 1: 1. https://doi.org/10.3390/make8010001

APA Style

Geamanu, A., Dascalu, M.-I., Neagu, A.-M., & Guica, R. I. (2026). An Integrated Artificial Intelligence Tool for Predicting and Managing Project Risks. Machine Learning and Knowledge Extraction, 8(1), 1. https://doi.org/10.3390/make8010001

Article Menu

An Integrated Artificial Intelligence Tool for Predicting and Managing Project Risks

Abstract

1. Introduction

2. Related Work

2.1. AI in Project Management

2.2. AI in Risk Management

3. Methodology

3.1. Research Flow

3.2. Dataset Generation and Structure

3.3. Model Training and Multi-Output Formulation

3.4. Evaluation Metrics

3.5. System Integration

4. Machine Learning Tool to Predict Risks in Projects

4.1. Dataset Construction

4.2. Prediction Problem Formulation and Machine Learning Models

4.3. Evaluation Metrics and Experimental Results

4.4. Interpretation of Results

4.5. Integrative Tool

4.6. Survey Feedback

5. Discussions

6. Conclusions

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Abbreviations

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI