Data Science Project Barriers—A Systematic Review
Abstract
1. Introduction
2. Literature Review
2.1. Data Science
2.2. Barriers for Data Science Projects
3. Methods
3.1. Systematic Literature Review
3.2. Quantitative Methods for Clustering Barriers
4. Results
4.1. Systematic Literature Review
Barriers Identified
4.2. Clustering the Barriers
5. Discussion
Limitations
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- The World Bank GDP Growth (Annual %). Available online: https://data.worldbank.org/indicator/NY.GDP.MKTP.KD.ZG (accessed on 28 January 2022).
- Statista Research Department Total Data Volume Worldwide 2010–2025. Available online: https://www.statista.com/statistics/871513/worldwide-data-created/ (accessed on 20 November 2022).
- Arthur, C. Tech Giants May Be Huge, but Nothing Matches Big Data. The Guardian, 23 August 2013. [Google Scholar]
- Statista Research Department Biggest Companies in the World by Market Cap 2021. Available online: https://www.statista.com/statistics/263264/top-companies-in-the-world-by-market-capitalization/ (accessed on 28 January 2022).
- BusinessWire Global $243 Billion Big Data Market Trajectory & Analytics to 2027. Available online: https://www.businesswire.com/news/home/20201208005685/en/Global-243-Billion-Big-Data-Market-Trajectory-Analytics-to-2027-Age-of-Analytics-Provides-the-Cornerstone-for-the-Disruptive-Growth-Proliferation-of-Big-Data-Technologies---ResearchAndMarkets.com (accessed on 28 January 2022).
- Davenport, T.H.; Patil, D.J. Data Scientist: The Sexiest Job of the 21st Century. Harvard Business Review, 1 October 2012. [Google Scholar]
- LinkedIn US Jobs on the Rise Report. Available online: https://business.linkedin.com/talent-solutions/resources/talent-acquisition/jobs-on-the-rise-us (accessed on 28 January 2022).
- NewVantage Partners LLC. Big Data and AI Executive Survey 2021: Executive Summary of Findings. Available online: https://www.newvantage.com/_files/ugd/e5361a_d59b4629443945a0b0661d494abb5233.pdf (accessed on 28 January 2022).
- Capgemini Consulting Cracking the Data Conundrum: How Successful Companies Make Big Data Operational 2014. Available online: https://www.capgemini.com/gb-en/wp-content/uploads/sites/3/2019/01/Cracking-the-Data-Conundrum-How-Successful-Companies-Make-Big-Data-Operational.pdf (accessed on 28 January 2022).
- White, A. Our Top Data and Analytics Predicts for 2019. Gartner Blog Network 2019. Available online: https://blogs.gartner.com/andrew_white/2019/01/03/our-top-data-and-analytics-predicts-for-2019/ (accessed on 28 January 2022).
- Fleming, O.; Fountaine, T.; Henke, N.; Saleh, T. Getting Your Organization’s Advanced Analytics Program Right. Available online: https://www.mckinsey.com/business-functions/mckinsey-analytics/our-insights/ten-red-flags-signaling-your-analytics-program-will-fail (accessed on 28 January 2022).
- Kelleher, J.D.; Tierney, B. Data Science; The MIT Press: Cambridge, MA, USA, 2018; ISBN 978-0-262-53543-4. [Google Scholar]
- Breiman, L. Statistical Modeling: The Two Cultures. Stat. Sci. 2001, 16, 199–215. [Google Scholar] [CrossRef]
- Habib ur Rehman, M.; Liew, C.S.; Abbas, A.; Jayaraman, P.P.; Wah, T.Y.; Khan, S.U. Big Data Reduction Methods: A Survey. Data Sci. Eng. 2016, 1, 265–284. [Google Scholar] [CrossRef]
- Brown, M.S. Transforming Unstructured Data into Useful Information. In Big Data, Mining, and Analytics; Auerbach Publications: Boca Raton, FL, USA, 2014; ISBN 978-0-429-09529-0. [Google Scholar]
- L’Heureux, A.; Grolinger, K.; Elyamany, H.F.; Capretz, M.A.M. Machine Learning with Big Data: Challenges and Approaches. IEEE Access 2017, 5, 7776–7797. [Google Scholar] [CrossRef]
- Krasteva, I.; Ilieva, S. Adopting Agile Software Development Methodologies in Big Data Projects—A Systematic Literature Review of Experience Reports. In Proceedings of the 2020 IEEE International Conference on Big Data (Big Data), Atlanta, GA, USA, 10–13 December 2020; pp. 2028–2033. [Google Scholar]
- Project Management Institute (PMI). The Standard for Project Management and a Guide to the Project Management Body of Knowledge (PMBOK Guide), 7th ed.; Project Management Institute: Newtown Square, PA, USA, 2021; ISBN 978-1-62825-667-3. [Google Scholar]
- Bi, W.; Cai, M.; Liu, M.; Li, G. A Big Data Clustering Algorithm for Mitigating the Risk of Customer Churn. IEEE Trans. Ind. Inform. 2016, 12, 1270–1281. [Google Scholar] [CrossRef]
- Jagabathula, S.; Subramanian, L.; Venkataraman, A. A Model-Based Embedding Technique for Segmenting Customers. Oper. Res. 2018, 66, 1247–1267. [Google Scholar] [CrossRef]
- Thennakoon, A.; Bhagyani, C.; Premadasa, S.; Mihiranga, S.; Kuruwitaarachchi, N. Real-Time Credit Card Fraud Detection Using Machine Learning. In Proceedings of the 2019 9th International Conference on Cloud Computing, Data Science & Engineering (Confluence), Noida, India, 10–11 January 2019; pp. 488–493. [Google Scholar]
- Mittal, S.; Tyagi, S. Performance Evaluation of Machine Learning Algorithms for Credit Card Fraud Detection. In Proceedings of the 2019 9th International Conference on Cloud Computing, Data Science & Engineering, Noida, India, 10–11 January 2019; pp. 320–324. [Google Scholar]
- Liou, F.-M.; Yang, C.-H. Predicting Business Failure under the Existence of Fraudulent Financial Reporting. Int. J. Account. Inf. Manag. 2008, 16, 74–86. [Google Scholar] [CrossRef]
- Tian, F.; Lan, T.; Chao, K.-M.; Godwin, N.; Zheng, Q.; Shah, N.; Zhang, F. Mining Suspicious Tax Evasion Groups in Big Data. IEEE Trans. Knowl. Data Eng. 2016, 28, 2651–2664. [Google Scholar] [CrossRef]
- Raut, R.; Yadav, V.S.; Cheikhrouhou, N.; Narwane, V.S.; Narkhede, B.E. Big Data Analytics: Implementation Challenges in Indian Manufacturing Supply Chains. Comput. Ind. 2021, 125, 103368. [Google Scholar] [CrossRef]
- Park, J.-H.; Kim, Y.B. Factors Activating Big Data Adoption by Korean Firms. J. Comput. Inf. Syst. 2021, 61, 285–293. [Google Scholar] [CrossRef]
- Nayal, K.; Raut, R.D.; Queiroz, M.M.; Yadav, V.S.; Narkhede, B.E. Are Artificial Intelligence and Machine Learning Suitable to Tackle the COVID-19 Impacts? An Agriculture Supply Chain Perspective. Int. J. Logist. Manag. 2021, 34, 304–335. [Google Scholar] [CrossRef]
- Piatetsky, G. CRISP-DM, Still the Top Methodology for Analytics, Data Mining, or Data Science Projects. KDnuggets 2014. Available online: https://www.kdnuggets.com/2014/10/crisp-dm-top-methodology-analytics-data-mining-data-science-projects.html (accessed on 7 May 2023).
- Martinez, I.; Viles, E.; Olaizola, I.G. Data Science Methodologies: Current Challenges and Future Approaches. Big Data Res. 2021, 24, 100183. [Google Scholar] [CrossRef]
- Morlock, F.; Boßlau, M. Concept for Enabling Customer-Oriented Data Analytics via Integration of Production Process Improvement Methods and Data Science Methods. Procedia CIRP 2021, 104, 542–546. [Google Scholar] [CrossRef]
- Rameezdeen, R.; Chileshe, N.; Hosseini, M.R.; Lehmann, S. A Qualitative Examination of Major Barriers in Implementation of Reverse Logistics within the South Australian Construction Sector. Int. J. Constr. Manag. 2016, 16, 185–196. [Google Scholar] [CrossRef]
- Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 Statement: An Updated Guideline for Reporting Systematic Reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef]
- Xiao, Y.; Watson, M. Guidance on Conducting a Systematic Literature Review. J. Plan. Educ. Res. 2019, 39, 93–112. [Google Scholar] [CrossRef]
- Haddaway, N.R.; Page, M.J.; Pritchard, C.C.; McGuinness, L.A. PRISMA2020: An R Package and Shiny App for Producing PRISMA 2020-compliant Flow Diagrams, with Interactivity for Optimised Digital Transparency and Open Synthesis. Campbell Syst. Rev. 2022, 18, e1230. [Google Scholar] [CrossRef]
- Brock, V.F.; Khan, H.U. Are Enterprises Ready for Big Data Analytics? A Survey-Based Approach. Int. J. Bus. Inf. Syst. 2017, 25, 256. [Google Scholar] [CrossRef]
- Saltz, J.; Shamshurin, I.; Connors, C. Predicting Data Science Sociotechnical Execution Challenges by Categorizing Data Science Projects. J. Assoc. Inf. Sci. Technol. 2017, 68, 2720–2728. [Google Scholar] [CrossRef]
- Bernardi, L.; Mavridis, T.; Estevez, P. 150 Successful Machine Learning Models: 6 Lessons Learned at Booking.Com. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA, 4–8 August 2019; pp. 1743–1751. [Google Scholar]
- Alalawneh, A.A.F.; Alkhatib, S.F. The Barriers to Big Data Adoption in Developing Economies. Electron. J. Inf. Syst. Dev. Ctries. 2021, 87, e12151. [Google Scholar] [CrossRef]
- Rosenthal, R.; DiMatteo, M.R. Meta-Analysis: Recent Developments in Quantitative Methods for Literature Reviews. Annu. Rev. Psychol. 2001, 52, 59–82. [Google Scholar] [CrossRef]
- Börner, K.; Sanyal, S.; Vespignani, A. Network Science. Annu. Rev. Inf. Sci. Technol. 2007, 41, 537–607. [Google Scholar] [CrossRef]
- Newman, M.E.J. Communities, Modules and Large-Scale Structure in Networks. Nat. Phys. 2012, 8, 25–31. [Google Scholar] [CrossRef]
- Jaccard, P. The Distribution of the Flora in the Alpine Zone.1. New Phytol. 1912, 11, 37–50. [Google Scholar] [CrossRef]
- Kosub, S. A Note on the Triangle Inequality for the Jaccard Distance. Pattern Recognit. Lett. 2019, 120, 36–38. [Google Scholar] [CrossRef]
- Thorndike, R.L. Who Belongs in the Family? Psychometrika 1953, 18, 267–276. [Google Scholar] [CrossRef]
- Rousseeuw, P.J. Silhouettes: A Graphical Aid to the Interpretation and Validation of Cluster Analysis. J. Comput. Appl. Math. 1987, 20, 53–65. [Google Scholar] [CrossRef]
- Riveros, C.; Salas, J.; Skibski, O. How to Choose the Root: Centrality Measures over Tree Structures. arXiv 2021, arXiv:2112.13736. [Google Scholar]
- Kavre, M.; Gardas, B.; Narwane, V.; Jafari Navimipour, N.; Yalcin, S. Evaluating the Effect of Human Factors on Big Data Analytics and Cloud of Things Adoption in the Manufacturing Micro, Small, and Medium Enterprises. IT Prof. 2022, 24, 17–26. [Google Scholar] [CrossRef]
- Sharma, M.; Luthra, S.; Joshi, S.; Kumar, A. Implementing Challenges of Artificial Intelligence: Evidence from Public Manufacturing Sector of an Emerging Economy. Gov. Inf. Q. 2022, 39, 101624. [Google Scholar] [CrossRef]
- Gangadhari, R.K.; Khanzode, V.; Murthy, S.; Dennehy, D. Modelling the Relationships between the Barriers to Implementing Machine Learning for Accident Analysis: The Indian Petroleum Industry. Benchmarking 2022, 30, 3357–3381. [Google Scholar] [CrossRef]
- Gupta, A.K.; Goyal, H. Framework for Implementing Big Data Analytics in Indian Manufacturing: ISM-MICMAC and Fuzzy-AHP Approach. Inf. Technol. Manag. 2021, 22, 207–229. [Google Scholar] [CrossRef]
- Bahrami, F.; Kanaani, F.; Turkina, E.; Moin, M.S.; Shahbazi, M. Key Challenges in Big Data Startups: An Exploratory Study in Iran. Iran. J. Manag. Stud. 2021, 14, 273–289. [Google Scholar] [CrossRef]
- Raut, R.; Narwane, V.; Kumar Mangla, S.; Yadav, V.S.; Narkhede, B.E.; Luthra, S. Unlocking Causal Relations of Barriers to Big Data Analytics in Manufacturing Firms. Ind. Manag. Data Syst. 2021, 121, 1939–1968. [Google Scholar] [CrossRef]
- Bag, S.; Gupta, S.; Wood, L. Big Data Analytics in Sustainable Humanitarian Supply Chain: Barriers and Their Interactions. Ann. Oper. Res. 2020, 319, 721–760. [Google Scholar] [CrossRef]
- Zhang, X.; Lam, J.S.L. A Fuzzy Delphi-AHP-TOPSIS Framework to Identify Barriers in Big Data Analytics Adoption: Case of Maritime Organizations. Marit. Policy Manag. 2019, 46, 781–801. [Google Scholar] [CrossRef]
- Moktadir, M.A.; Ali, S.M.; Paul, S.K.; Shukla, N. Barriers to Big Data Analytics in Manufacturing Supply Chains: A Case Study from Bangladesh. Comput. Ind. Eng. 2019, 128, 1063–1075. [Google Scholar] [CrossRef]
- Shukla, M.; Mattar, L. Next Generation Smart Sustainable Auditing Systems Using Big Data Analytics: Understanding the Interaction of Critical Barriers. Comput. Ind. Eng. 2019, 128, 1015–1026. [Google Scholar] [CrossRef]
- Kastouni, M.Z.; Ait Lahcen, A. Big Data Analytics in Telecommunications: Governance, Architecture and Use Cases. J. King Saud. Univ. Comput. Inf. Sci. 2022, 34, 2758–2770. [Google Scholar] [CrossRef]
- Aho, T.; Kilamo, T.; Lwakatare, L.; Mikkonen, T.; Sievi-Korte, O.; Yaman, S. Managing and Composing Teams in Data Science: An Empirical Study. In Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), Orlando, FL, USA, 15–18 December 2021; pp. 2291–2300. [Google Scholar]
- Escobar, C.A.; McGovern, M.E.; Morales-Menendez, R. Quality 4.0: A Review of Big Data Challenges in Manufacturing. J. Intell. Manuf. 2021, 32, 2319–2334. [Google Scholar] [CrossRef]
- Wang, S.; Wang, H. Big Data for Small and Medium-Sized Enterprises (SME): A Knowledge Management Model. J. Knowl. Manag. 2020, 24, 881–897. [Google Scholar] [CrossRef]
- Saltz, J.S.; Shamshurin, I. Achieving Agile Big Data Science: The Evolution of a Team’s Agile Process Methodology. In Proceedings of the 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 9–12 December 2019; pp. 3477–3485. [Google Scholar]
- Jensen, M.H.; Nielsen, P.A.; Persson, J.S. Managing Big Data Analytics Projects: The Challenges of Realizing Value. In Proceedings of the 27th European Conference on Information Systems (ECIS), Muenster, Germany, 8–14 June 2019. [Google Scholar]
- Kim, M.; Zimmermann, T.; DeLine, R.; Begel, A. Data Scientists in Software Teams: State of the Art and Challenges. IEEE Trans. Softw. Eng. 2018, 44, 1024–1038. [Google Scholar] [CrossRef]
- Barham, H.; Daim, T. Identifying Critical Issues in Smart City Big Data Project Implementation. In Proceedings of the SCC ‘18: The 1st ACM/EIGSCC Symposium on Smart Cities and Communities, Portland, OR, USA, 20–22 June 2018. [Google Scholar]
- Barham, H. Achieving Competitive Advantage through Big Data: A Literature Review. In Proceedings of the 2017 Portland International Conference on Management of Engineering and Technology (PICMET), Portland, OR, USA, 9–13 July 2017; pp. 1–7. [Google Scholar]
- Becker, D.K. Predicting Outcomes for Big Data Projects: Big Data Project Dynamics (BDPD): Research in Progress. In Proceedings of the 2017 IEEE International Conference on Big Data (Big Data), Boston, MA, USA, 11–14 December 2017; pp. 2320–2330. [Google Scholar]
- Chipidza, W.; George, J.; Koch, H. Chartering Predictive Analytics: A Case Study. In Proceedings of the 22nd Americas Conference on Information Systems, AMCIS 2016, San Diego, CA, USA, 11–14 August 2016. [Google Scholar]
- Klosowski, T. The State of Consumer Data Privacy Laws in the US (And Why It Matters). Wirecutter. 6 September 2021. Available online: https://www.nytimes.com/wirecutter/blog/state-of-privacy-laws-in-us/ (accessed on 20 November 2022).
- Wang, R.Y.; Strong, D.M. Beyond Accuracy: What Data Quality Means to Data Consumers. J. Manag. Inf. Syst. 1996, 12, 5–33. [Google Scholar] [CrossRef]
Search Strategy | Topic | Search Terms * | Result | Search Date |
---|---|---|---|---|
Strategy 1 | Data Science | “data science” OR “big data” OR “data analytics” OR “machine learning” AND | 213 | 20 October 2022 |
Barrier | “barrier*” OR “obstacle*” OR “challenge*” OR “hindrance*” AND | |||
MCDM | “multi-criteria decision-making” OR “MCDM” OR “AHP” OR “TOPSIS” OR “VIKOR” OR “ANP” OR “DEMATEL” OR “PROMETHEE” OR “ELECTRE” OR “ISM” OR “TISM” OR “MICMAC” | |||
Strategy 2 | Data Science Project | “data science project” OR “big data project” OR “data analytics project” OR “machine learning project” AND | 239 | 21 October 2022 |
Barrier or project failure | “barrier*” OR “obstacle*” OR “challenge*” OR “hindrance*” OR “fail*” |
# | Author | Context | Methods | Keypoints |
---|---|---|---|---|
A | Kavre et al. (2022) [47] | Small and medium-sized enterprises (SME), India | Literature review, expert opinion, ISM, DEMATEL | Barriers to BDA adoption. Analyzes the impact and relationships among the barriers. |
B | Sharma et al. (2022) [48] | Manufacturing, public sector, India | DEMATEL | Barriers to machine learning adoption. Analyzes the impact and relationships between the barriers. |
C | Gangadhari et al. (2022) [49] | Oil industry, India | Literature review (PRISMA), DEMATEL, COPRAS, MOORA, Delphi (n = 10) | Barriers to machine learning adoption. Analyzes the impact and relationships between the barriers. |
D | Nayal et al. (2021) [27] | Agriculture, COVID-19, India | Delphi, ISM, Fuzzy MICMAC, ANP | Barriers to machine learning adoption. Analyzes the impact and relationships between the barriers. |
E | Raut, Yadav, et al. (2021) [25] | Manufacturing, India | ISM, Delphi, Fuzzy MICMAC, DEMATEL, expert opinion (n = 47) | Barriers to BDA adoption. Analyzes the impact and relationships between the barriers. |
F | Park & Kim (2021) [26] | No specific industry, Korea | AHP (n = 50), regression analysis (n = 226) | Barriers to BD adoption. Categorizes and analyzes the impact of the barriers. |
G | Gupta & Goyal (2021) [50] | Manufacturing, India | Literature review, survey, ISM, MICMAC, Fuzzy AHP, expert opinion (n = 16) | Barriers to BDA adoption. Categorizes and analyzes the impact and relationships among the barriers. |
H | Bahrami et al. (2021) [51] | Startups, Iran | Interviews, survey, Fuzzy AHP | Barriers faced by BD startups. Categorizes and analyzes the impact of the barriers. |
I | Raut, Narwane, et al. (2021) [52] | Supply chain, India | Literature review, survey, DEMATEL, ANP, expert opinion (n = 15) | Barriers to BDA adoption. Analyzes the impact and relationships among the barriers. |
J | Bag et al. (2020) [53] | Third sector, supply chain, Africa | Literature review, Fuzzy TISM (n = 5), survey (n = 108), SEM | Barriers to BDA adoption. Analyzes the impact and relationships among the barriers. |
K | Alalawneh & Alkhatib (2020) [38] | Financial, industrial, services, public and supply chain sectors, Jordan | Literature review, Semi-structured interviews, survey (n = 23), AHP, TOPSIS | Barriers to BD adoption. Categorizes and analyzes the impact and relationships among the barriers. |
L | Zhang & Lam (2019) [54] | Maritime industry | Fuzzy-Delphi (n = 6), Fuzzy AHP (n = 20), TOPSIS | Barriers to BDA adoption. Categorizes and analyzes the impact of the barriers. |
M | Moktadir et al. (2019) [55] | Supply chain, Bangladesh | Literature review, Delphi (n = 15), AHP, expert opinion | Barriers to BDA adoption. Categorizes and analyzes the impact of the barriers. |
N | Shukla & Mattar (2019) [56] | Agriculture | Literature review, ISM, MICMAC, expert opinion | Barriers to BDA adoption. Analyzes the impact and relationships between the barriers. |
O | Kastouni & Lahcen (2022) [57] | Telecommunications | Case Study (n = 1) | Barriers for the studied BDA project. Categorizes the barriers. |
P | Martinez et al. (2021) [29] | No specific context | Literature review | DS project management methodologies. Proposes a framework for framing the methodologies. Raises challenges for DS projects. |
Q | Aho et al. (2021) [58] | No specific context | Survey (n = 50) | Barriers and other issues for DS projects. |
R | Escobar et al. (2021) [59] | Manufacturing | Literature review | Barriers to DS projects. |
S | Wang & Wang (2020) [60] | Small and medium-sized enterprises | Literature review, case studies (n = 8) | Proposes a knowledge management model for BD. Raises barriers for BD projects. |
T | Saltz & Shamshurin (2019) [61] | Undisclosed | Case study (n = 1) | Agile methodologies for DS projects. Raises barriers for DS projects. |
U | Jensen et al. (2019) [62] | Energy | Case study (n = 1) | Barriers for the BD project studied. |
V | Kim et al. (2018) [63] | Technology | Survey (n = 793) | Barriers and other issues faced by data scientists in projects. Categorizes the barriers. |
W | Barham & Daim (2018) [64] | Smart cities | Literature review | Barriers for BD projects. Categorizes the barriers. |
X | Barham (2017) [65] | No specific context | Literature review | Barriers and other issues for DS projects. |
Y | Becker (2017) [66] | No specific context | Survey (n = 19), system modelling and simulation | Model for predicting outcomes of BD projects. Raises causes of failure for projects of this type. |
Z | Chipidza et al. (2016) [67] | Supply chain | Case study (n = 1) | Barriers for the BD project studied. |
# | Barrier | Description |
---|---|---|
B1 | Insufficient skills | The organization does not have individuals with the necessary skills to execute the project. |
B2 | Poor data quality | The data provided is of poor quality (i.e., lacking accuracy, relevancy, and representation) |
B3 | Insufficient IT infrastructure | The organization’s IT infrastructure does not support the project’s needs. |
B4 | Data privacy and security | Privacy and security risks are not managed properly. |
B5 | IT illiteracy | Employees involved with the project do not have basic technology knowledge in order to be able to cooperate or use the tools developed. |
B6 | Lack of support from top management | Top management does not provide the necessary support for the development of the project. |
B7 | Complexity of data or technology | Data or technology characteristics are too complex. |
B8 | Lack of an integrated data environment | There is a lack of an integrated data environment from which the project can load and write information. |
B9 | Insufficient funding | There is not enough funding for the project’s needs. |
B10 | Immature technology and lack of appropriate tools | The technology is immature in the sense of well-established responsibilities, processes, and tools. |
B11 | Strategy mismatch | There is a mismatch between company strategy and project goals. |
B12 | Inadequate data sharing policy | The organization’s data sharing policy hinders the team’s access to needed data. |
B13 | Scalability issues | The established infrastructure does not have sufficient scalability capacity to support data processing and storage over time. |
B14 | Resistance to change and other cultural barriers | There is unwillingness on the part of the organization to adapt to the new processes. This includes other cultural barriers such as fear of technology and behavioral problems. |
B15 | Insufficient ROI or business case | The proposed project does not have sufficient ROI or business case (justification) to gain the support of the decision-makers. |
B16 | Inadequate training programs and facilities | The organization’s training programs and/or facilities are inadequate. |
B17 | High investment and maintenance cost | Project implementation requires a high investment cost and also leads to high maintenance costs. |
B18 | Government policies and regulation | Government regulation and/or policies are insufficient, counterproductive, or unclear. |
B19 | Poor data management and architecture | The organization’s data management and architecture are inadequate, making it difficult to acquire data and integrate the new processes into the existing architecture. |
B20 | Lack of coordination, collaboration, and communication | There is a lack of coordination, collaboration, or communication between the parties involved in the project. |
B21 | Data availability | The necessary data is not available, either due to a lack of access, or other issues, such as data loss, lack of record keeping, and physical (non-electronic) storage. |
B22 | Inadequate or inconsistent methodology | The methodology used is inadequate, inconsistent, or immature. |
B23 | External sources of data | External sources of data are not available to the organization, either due to legal issues, high costs, etc. |
B24 | Scope, objectives, and expected results unclear | Scope, objectives and expected results were poorly defined at the beginning of the project. |
B25 | Uncertainty about benefits | The organization is uncertain about the benefits of the project. There is a lack of support to start the project or to develop the activities. |
B26 | Deployment and sustainability issues | Deployment and/or sustainability are inadequate leading to low usage of project deliveries, possibly due to a lack of an implementation plan to manage the changes in the business processes. |
B27 | Associated risks | Risks associated with the project are high or overestimated. |
Search strategies | ||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Strategy 1 | Strategy 2 | |||||||||||||||||||||||||||
# | Freq. | F. (x) | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z |
B1 | 20 | 5 | ∙ | x | ∙ | x | x | ∙ | ∙ | ∙ | ∙ | ∙ | x | ∙ | x | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ||||||
B2 | 18 | 2 | x | ∙ | ∙ | ∙ | x | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ||||||||
B3 | 15 | 4 | ∙ | x | ∙ | ∙ | x | x | ∙ | ∙ | x | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | |||||||||||
B4 | 15 | 4 | x | x | x | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | x | ∙ | ∙ | ∙ | ∙ | ∙ | |||||||||||
B5 | 15 | 2 | ∙ | x | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | x | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | |||||||||||
B6 | 14 | 4 | x | ∙ | ∙ | x | ∙ | x | ∙ | ∙ | ∙ | x | ∙ | ∙ | ∙ | ∙ | ||||||||||||
B7 | 11 | 2 | ∙ | ∙ | ∙ | ∙ | ∙ | x | x | ∙ | ∙ | ∙ | ∙ | |||||||||||||||
B8 | 11 | 2 | ∙ | x | ∙ | ∙ | ∙ | x | ∙ | ∙ | ∙ | ∙ | ∙ | |||||||||||||||
B9 | 10 | 3 | ∙ | x | x | x | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ||||||||||||||||
B10 | 9 | 3 | ∙ | x | ∙ | x | x | ∙ | ∙ | ∙ | ∙ | |||||||||||||||||
B11 | 9 | 2 | ∙ | x | x | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | |||||||||||||||||
B12 | 9 | 1 | x | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | |||||||||||||||||
B13 | 9 | 0 | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | |||||||||||||||||
B14 | 9 | 0 | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | |||||||||||||||||
B15 | 9 | 0 | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | |||||||||||||||||
B16 | 8 | 3 | x | ∙ | x | x | ∙ | ∙ | ∙ | ∙ | ||||||||||||||||||
B17 | 8 | 2 | x | ∙ | ∙ | x | ∙ | ∙ | ∙ | ∙ | ||||||||||||||||||
B18 | 8 | 2 | x | ∙ | ∙ | ∙ | ∙ | ∙ | x | ∙ | ||||||||||||||||||
B19 | 8 | 1 | x | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ||||||||||||||||||
B20 | 8 | 1 | x | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ||||||||||||||||||
B21 | 8 | 0 | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ||||||||||||||||||
B22 | 6 | 1 | x | ∙ | ∙ | ∙ | ∙ | ∙ | ||||||||||||||||||||
B23 | 6 | 0 | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ||||||||||||||||||||
B24 | 6 | 0 | ∙ | ∙ | ∙ | ∙ | ∙ | ∙ | ||||||||||||||||||||
B25 | 5 | 2 | x | x | ∙ | ∙ | ∙ | |||||||||||||||||||||
B26 | 4 | 0 | ∙ | ∙ | ∙ | ∙ | ||||||||||||||||||||||
B27 | 4 | 0 | ∙ | ∙ | ∙ | ∙ |
B1 | B2 | B3 | B4 | B5 | B6 | B7 | B8 | B9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
B2 | 1.0 | |||||||||||||||||||||||||
B3 | 1.0 | 0.7 | ||||||||||||||||||||||||
B4 | 1.0 | 0.6 | 0.9 | |||||||||||||||||||||||
B5 | 0.6 | 1.0 | 1.0 | 1.0 | ||||||||||||||||||||||
B6 | 0.8 | 1.0 | 1.0 | 0.9 | 0.8 | |||||||||||||||||||||
B7 | 1.0 | 0.6 | 0.8 | 0.7 | 1.0 | 1.0 | ||||||||||||||||||||
B8 | 1.0 | 0.3 | 0.8 | 0.7 | 1.0 | 1.0 | 0.4 | |||||||||||||||||||
B9 | 0.9 | 1.0 | 1.0 | 1.0 | 1.0 | 0.9 | 1.0 | 1.0 | ||||||||||||||||||
B10 | 1.0 | 0.9 | 0.6 | 0.9 | 1.0 | 1.0 | 0.8 | 1.0 | 1.0 | |||||||||||||||||
B11 | 0.9 | 1.0 | 1.0 | 1.0 | 0.9 | 0.6 | 1.0 | 1.0 | 1.0 | 1.0 | ||||||||||||||||
B12 | 0.9 | 0.9 | 1.0 | 0.9 | 1.0 | 0.8 | 1.0 | 0.9 | 1.0 | 1.0 | 0.9 | |||||||||||||||
B13 | 1.0 | 0.6 | 0.7 | 0.7 | 1.0 | 1.0 | 0.6 | 0.7 | 1.0 | 0.6 | 1.0 | 1.0 | ||||||||||||||
B14 | 1.0 | 1.0 | 1.0 | 0.9 | 1.0 | 0.8 | 1.0 | 1.0 | 1.0 | 1.0 | 0.8 | 0.7 | 1.0 | |||||||||||||
B15 | 0.9 | 1.0 | 1.0 | 1.0 | 0.9 | 0.9 | 1.0 | 1.0 | 0.9 | 1.0 | 0.9 | 1.0 | 1.0 | 1.0 | ||||||||||||
B16 | 0.7 | 1.0 | 0.9 | 1.0 | 0.9 | 0.9 | 1.0 | 1.0 | 0.9 | 0.9 | 1.0 | 0.9 | 1.0 | 0.9 | 1.0 | |||||||||||
B17 | 0.8 | 1.0 | 1.0 | 1.0 | 1.0 | 0.9 | 1.0 | 1.0 | 0.3 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 0.9 | 0.9 | ||||||||||
B18 | 1.0 | 1.0 | 1.0 | 0.9 | 1.0 | 1.0 | 1.0 | 1.0 | 0.9 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | |||||||||
B19 | 1.0 | 0.8 | 0.9 | 0.9 | 1.0 | 1.0 | 0.9 | 0.8 | 1.0 | 1.0 | 1.0 | 0.9 | 0.9 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | ||||||||
B20 | 0.9 | 1.0 | 1.0 | 1.0 | 1.0 | 0.9 | 1.0 | 1.0 | 1.0 | 1.0 | 0.8 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | |||||||
B21 | 1.0 | 0.6 | 0.9 | 0.9 | 1.0 | 1.0 | 0.9 | 0.7 | 1.0 | 0.8 | 1.0 | 0.9 | 0.9 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | ||||||
B22 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 0.6 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | |||||
B23 | 1.0 | 1.0 | 1.0 | 0.9 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 0.3 | 1.0 | 1.0 | 1.0 | 1.0 | ||||
B24 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 0.9 | 1.0 | 1.0 | 1.0 | 1.0 | 0.8 | 0.9 | 1.0 | 0.8 | 0.8 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 0.8 | 1.0 | |||
B25 | 0.9 | 0.9 | 0.9 | 0.9 | 0.8 | 0.9 | 0.9 | 0.9 | 1.0 | 1.0 | 0.8 | 1.0 | 1.0 | 1.0 | 0.8 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | ||
B26 | 0.9 | 1.0 | 1.0 | 1.0 | 0.9 | 0.9 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 0.9 | 1.0 | 1.0 | 1.0 | 1.0 | 0.8 | 1.0 | 0.6 | 1.0 | 0.8 | 1.0 | |
B27 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 0.8 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 |
# | Cluster | Closeness | Degree |
---|---|---|---|
People | |||
B1 | Insufficient skills | 1.38 | 11 |
B5 | IT illiteracy | 1.28 | 7 |
B16 | Inadequate training programs and facilities | 1.16 | 9 |
B25 | Uncertainty about benefits | 1.10 | 10 |
Data and technology | |||
B2 | Poor data quality | 1.57 | 10 |
B8 | Lack of an integrated data environment | 1.52 | 9 |
B7 | Complexity of data or technology | 1.42 | 9 |
B13 | Scalability issues | 1.42 | 8 |
B3 | Insufficient IT infrastructure | 1.29 | 10 |
B4 | Data privacy and security | 1.27 | 14 |
B21 | Data availability | 1.22 | 8 |
B10 | Immature technology and lack of appropriate tools | 1.20 | 7 |
B19 | Poor data management and architecture | 1.10 | 7 |
Management | |||
B6 | Lack of support from top management | 1.31 | 14 |
B11 | Strategy mismatch | 1.27 | 9 |
B14 | Resistance to change and other cultural barriers | 1.22 | 6 |
B12 | Inadequate data sharing policy | 1.17 | 11 |
B20 | Lack of coordination, collaboration, and communication | 1.09 | 4 |
Economic | |||
B9 | Insufficient funding | 1.72 | 7 |
B17 | High investment and maintenance cost | 1.50 | 5 |
B27 | Associated risks | 1.09 | 1 |
Project | |||
B22 | Inadequate or inconsistent methodology | 1.54 | 3 |
B15 | Insufficient ROI or business case | 1.33 | 10 |
B26 | Deployment and maintenance issues | 1.33 | 7 |
B24 | Scope, objectives, or expected results unclear | 1.28 | 7 |
External | |||
B18 | Government policies and regulation | 4.00 | 3 |
B23 | External sources of data | 4.00 | 2 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Labarrère, N.; Costa, L.; Lima, R.M. Data Science Project Barriers—A Systematic Review. Data 2025, 10, 132. https://doi.org/10.3390/data10080132
Labarrère N, Costa L, Lima RM. Data Science Project Barriers—A Systematic Review. Data. 2025; 10(8):132. https://doi.org/10.3390/data10080132
Chicago/Turabian StyleLabarrère, Natan, Lino Costa, and Rui M. Lima. 2025. "Data Science Project Barriers—A Systematic Review" Data 10, no. 8: 132. https://doi.org/10.3390/data10080132
APA StyleLabarrère, N., Costa, L., & Lima, R. M. (2025). Data Science Project Barriers—A Systematic Review. Data, 10(8), 132. https://doi.org/10.3390/data10080132