4.1. IT Systems
Hypothesis 1. The necessary data to provide a sufficiently detailed view on the current and past condition of processes and objects in the factory are available in a digital format and of an accurate quality.
The existence of digital, accurate data for the area of the company under consideration is obviously an essential basis for the idea of data democratization. If there are no data to be shared to draw conclusions from, there is no sense in democratizing them.
To investigate this hypothesis, we examined the two capabilities “data quality” and “decision support (IT)”. In this context, we understand data quality as the property of the data to be complete, accurate, current and consistent [
28]. The maturity of this capability was characterized by the following statements (see
Table 1). The percentage of answers per described scenario is depicted as well in the following table.
In order to comply with the concept of data democratization, the minimum target level is 4. However, most of the examined factories are currently at a lower level. More than 75% of the surveyed factories do not have the appropriate level of data quality to enable an automated use of data.
A second capability that is related to the first hypothesis is the decision support by IT systems. Decision support systems have been discussed in the literature for decades and are, for instance, defined as “interactive computer-based systems, which help decision makers utilize data and models to solve unstructured problems” [
29] (pp. 1–26). Other authors confirm that these systems support decision-making in “semistructured tasks” [
30] and emphasize that they “enhance the traditional information access and retrieval functions with support for model building and model-based reasoning” [
31] (p. 6). The capability to create meaningful data-based insights to improve decision-making is at the center of the concept of data democratization and, thus, needs to be considered in this study.
Table 2 contains the possible scenarios for this capability. The minimum level for this capability that is necessary to implement the approach of data democratization is level 3. According to the described scenario, factories at level 3 are able to visualize operational data. At levels 1 and 2, data are not yet visually processed or decision-making is not supported with data. The data set shows that more than 68% of the assessed factories are not at that level. However, the fact that 47.82% of factories at least collect data on the current machine status indicates that some basics for databased decisions on their operations are present.
Comparing these results with other studies on data quality and databased decision support in manufacturing, the results even indicate a larger gap in these capabilities compared to the findings of other studies. For instance, Spath et al
. found that 58.9% of surveyed companies are not able to detect relevant events in their production automatically and, thus, based on data, 43.7% of participants see the missing actuality of production data as the root cause of manual interventions to production planning [
32]. Lanza et al. found, in their study on digital shopfloor management, that today, in more than 50% of the examined cases, operational KPIs were calculated manually; in more than 66%, the visualization of these KPIs done in an analog way [
33].
In conclusion, the hypothesis cannot be considered validated. Thinking of this as a prerequisite for decisions based on democratized data, one central field of action for most companies is the systematic improvement in the quality of their operational data in combination with systems that make them available for decisions.
Hypothesis 2. The data are organized in a way that it is accessible for ad-hoc analyses.
“Data Democratization” and “Industrie 4.0” place the stakeholders at the center. They are expected to optimize their area of responsibility through the use of data, e.g., by making decisions themselves on the basis of their own data analyses. These analyses often arise spontaneously (ad hoc) due to unexpected events stakeholders need to cope with but can also be caused by the pursuit of general performance or quality improvements.
In many cases, analyses involve several data sources, e.g., order data, material flow data or quality data. Data mostly originate from IT systems that are used for controlling or monitoring processes. However, other data sources might also be used, e.g., feedback from operators on process or product quality. In hypothesis 1, we showed that these data must be of sufficient quality and that the type and scope of the data must be sufficient so that they can be used for data-driven decision-making. However, further requirements must be met in the area of data organization so that stakeholders can perform analyses on their own. Data from different sources must be associated with one another, and data must be accessible to the user. The organization of data must meet these principles so that data can be used for ad-hoc analysis.
To investigate this hypothesis based on the available data set, we focus on two capabilities. “Data model” describes the conceptual approach to structure data. It includes information on data objects, references between objects and it defines valid operations [
34]. In this study, we focus primarily on the scope of the data model, which can encompass individual processes up to an entire production network. The maturity of the capability “data model” was characterized by the following statements. The percentage of answers per described scenario is depicted as well in the following table (see
Table 3).
The results show that in more than 18% of the performed assessments, no data model was available and data points to monitor or control the process were unknown. Companies that meet this maturity level are not yet able to offer their stakeholders the basis for their own data analyses. In almost 60% of cases, data models exist that refer to individual processes and, thus, enable analyses at process level. Practice shows that in production processes, Scada/Level 2 (ISA 95) data are often used (time-series data that can include machine states, sensor values or information on product quality). However, the analysis options are limited depending on the question. In cases where interfaces to other processes are to be examined or for the determination of performance indicators, data from higher layers (ISA 95), such as from Manufacturing Execution Systems and Enterprise Resource Planning Systems, are missing. This is made possible by cross-process data models, which are currently available to almost 21% of the plants examined. It can be observed that data models are limited to the plant itself in almost 98% of the cases surveyed. Only 1.5% state that they have integrated data models for all sites. The integration of suppliers and customers into company-wide data models has not yet played a role according to our data set.
The second capability we focus on is what we call “Collaboration in IT”. This expresses whether social software or enterprise collaboration systems are being used for stakeholder communication and collaboration. A digital collaboration environment with a uniform user interface is important for collaboration between users, including working together on documents and other use cases [
35]. The use of such solutions has two advantages. On the one hand, data are made available in a central location and can usually be accessed and consumed by users with the appropriate rights. On the other hand, they promote the formalization of tacit knowledge [
36].
Table 4 contains the possible scenarios for this capability.
The minimum level for this capability that is necessary to implement the approach of data democratization is level 4. The data set shows that, so far, only 2.6% of the companies surveyed have such a solution in place. However, just under a third (28.3%) are on their way there and have at least a solution in place that facilitates the management of tasks and projects. However, a large proportion of the companies surveyed still rely on classic file sharing via a network drive (61.36%). Just under 8% forgo the use of collaboration solutions altogether.
The research shows that we are still in the early stages of organizing data for use in ad-hoc analyses. Although data are available, they can usually only be correlated at the individual process level. Overarching models that encompass the entire value-creation process of a factory or production network are scarce. A similar picture can be drawn for the joint creation and use of data. Although data are shared in many cases, classic approaches, such as network drives, are used for this purpose. Our data set shows that comprehensive collaboration platforms have not been used much so far. However, due to the long duration of the study, it is possible that collaboration platforms are underrepresented in our results. Other studies seem to indicate that many companies adopted such platforms during the COVID-19 pandemic [
37].
4.2. Organizational Structure
Hypothesis 3. The organizational structure in the factory empowers employees to proactively improve their routines and initiate and implement improvements on their own.
The effective use of data democratization requires that employees are involved in improving their working conditions, tasks and routines. It even goes one step further and relies on employees themselves identifying potential for improvement and implementing measures. The organization should not only tolerate this behavior but should actively promote it. It has long been recognized that improvements brought in by employees contribute to the company’s success. Continuous improvement approaches are, for example, an important part of management and work organization methods, such as Lean [
38] and Kaizen [
39,
40,
41]. Our hypothesis is that the same mechanisms could be applied to the ideas of democratization of data. It is important here that stakeholders accompany both the analysis and the implementation of measures.
To examine our hypothesis, we look at two capabilities in the data set. First, we look at whether companies have implemented an “innovation process” and how far reaching it is. This capability provides a good understanding of whether and how employees are involved in improvement processes. The maturity level of the “innovation process” capability was characterized by the following state characteristics. The percentage of responses per described scenario is also shown in the following table (see
Table 5).
In order to comply with the concept of data democratization, the minimum target level is 4. About 23% of the examined factories currently are, at least, at this level. We observe that almost all of the factories studied have implemented an innovation process, with only around 8% not doing so at all. In just under 5% of the plants surveyed, idea generation and idea development are based on the open innovation principle. Employees at all levels contribute to idea generation and evaluation. Measures and their effectiveness are systematically monitored, documented and, if necessary, adjusted in accordance with the PDCA cycle. As such, 33% have at least one process for collecting suggestions for improvement (level 2), and another 35% circulate information about ideas submitted, evaluation and implementation (level 3). Thus, in more than half of the companies, there is an active innovation process that leads to implemented improvements. Only the participation of employees in the implementation of measures is not yet sufficiently well developed. Overall, however, these are good conditions for data democratization.
“Decision power and responsibility” is the second capability we examine. This is about how and by whom decisions are made in the organization and how decision-making is supported. We are particularly interested in the hierarchical level at which decisions are made for operational activities. The defined maturity levels and the associated scenarios are listed in
Table 6.
At least level 3 is required to use data democratization. Decisions concerning operational activities must be able to be made by operators. In addition, however, there must also be support from IT systems that hold data relevant to decision-making. We see that just under half of the factories studied meet these requirements (49.71%). Among them, 3.92% even choose holocratic approaches for decision-making. In around 35% of the factories surveyed, decisions are made as a team (level 2). However, this group lacks the necessary IT support. Only slightly more than 15% still rely on distinct hierarchical decision-making structures.
Both capabilities studied show that we are at a turning point. It is apparent that organizations want to involve employees more and are increasingly granting them freedom to do so. Based on the available data, it can be seen that both capabilities can still be expanded in the majority of the factories surveyed. The most important factors include employee participation in the implementation of measures and the creation of organizational conditions for the use of data in decision-making processes. The potential of employee participation in terms of data democracy still relies on the maturity of the data dimension, though.
4.3. Culture
Hypothesis 4. The culture among the associates in factories encourages data-driven decision-making and an open sharing of knowledge and insights.
Besides formal processes, organizational structure and assigned responsibilities, the culture among the associates has a paramount influence on how the concept of data democratization is embraced in a factory. In general, we see culture as the commonly accepted set of values within the organization that guides the actions of employees [
17,
42]. Specifically focusing on the concept of data democratization, these values need to comprise the “willingness to share information” [
11] (p. 5). For that reason, the capability “knowledge and knowledge responsibility” is selected from the data set. In addition, the capability “data-based decision processes” was analyzed to understand to what extent the companies base their decisions on data [
43].
In
Table 7, the possible scenarios for the capability “Knowledge and knowledge responsibility” are listed. With every level, the importance to formalize and to share knowledge increases. Thus, this capability describes to what extent “tacit knowledge” or “explicit knowledge” guide the course of actions of the employees. “Tacit knowledge” solely exists in the minds of individuals and, thus, is not shareable and available to the rest of the organization [
44]. For the idea of data democratization, the willingness to share knowledge is crucial. Due to that, the minimum level of this capability is level 3. At this level, a company collects and shares knowledge in a structured way. In nearly 30% of the assessed companies, such a culture can be observed. However, about two-thirds of the companies have a culture that does not actively encourage the formalization and sharing of knowledge.
The results in
Table 8 for the capability “data-based decision processes” confirm the impression of the previous capability: About 40% of the companies already use data as the basis for their decision-making. Still, around 60% rely on intuition and individual knowledge or only partly consider data in their decision-making processes.
Based on the analysis of the two capabilities, the hypothesis cannot be confirmed. Even if around two-thirds of the companies have already adopted a culture of data-based decision-making and of an initiative-taking sharing of insights, still, most of the evaluated companies are lacking such a culture. Other studies partly confirm these findings. For instance, Lanza et al. found that in five out of nine analyzed case studies on digital shopfloor management, there is an exchange of knowledge, but it is an analog exchange without the support of digital tools [
33]. A recent study of YouGov, with more than 3500 surveyed decision-makers, found that 56% of the companies consider themselves as data driven, which is a significantly higher share than in the analyzed data set [
45]. Another study with more than 1000 participants, conducted by Capgemini in 2020, found that 50% consider their decision-making as data driven [
16]. However, the results in the present analysis are based on external, independent observations, whereas the cited studies rely on a self-evaluation of the surveyed participants.
As a conclusion, it is a central field of action for these companies to create such a culture on their way towards a democratization of data, to not only have the technology ready, but also create a “pull” for data in their workforce.
4.4. Resources
Hypothesis 5. The employees have the appropriate capabilities to work with data and interpret them in the context of their domain.
As Belli et al. state in their definition of data democratization, it is the “ability of users to access all data using well-defined and easily used analytic patterns to answer unexpected questions” [
12] (pp. 1362–1368). The definition implies, besides organizational aspects, such as the access to data, and technological aspects, such as the easily used analytic patterns, that the users themselves need to be capable. To examine the necessary skills of the employees, the two capabilities “IT competencies” and “Interdisciplinary skills” were analyzed.
As depicted in
Table 9, the surveyed scenarios range from a rudimentary utilization of existing system functionalities with manual bypasses to the system to a full utilization of the features and an independent covering of information needs. In only 3.69% of the companies, the employees can cover their information needs themselves, which would be the required level for the concept of data democratization. The vast majority are only capable of handling basic system features. Still, in around one-quarter of companies, it can be observed that the associates are able to understand the logics and dependencies of the systems they are using. This leads to the conclusion that, at least in this group of companies, a certain awareness for the multiple data in their IT systems and its value exists.
Table 10 contains the scenarios for the capability “interdisciplinary skills of employees”. The scenarios range from employees having a very limited scope for their own considerations to employees who understand the dependencies with and their impact on other processes and activities in the company. This capability is important for a successful implementation of a data democratization approach, as the whole idea is based on accessing the relevant data from the whole organization and not only from their own limited domain. In order to make sense of these data, it is inevitable that the context the data is sourced from is understood.
The results show that the associates of only around 25% of the companies strictly focus on their own domain. A large group either is aware and considers neighboring processes or even proactively involves them in their activities.
Looking at these two capabilities, a slightly higher readiness can be observed compared to other examined capabilities in this study. In addition, it is not visible in the data whether the capability “IT competencies of employees” would be on an even higher level if the companies were to provide their employees with more capable systems. After all, employees were not able to demonstrate the capabilities required for the higher levels, even if they had had them. It is obvious that the competencies required to work with data effectively rely on the data itself being available and structured, which cannot be considered true, as already mentioned earlier.
The comparatively high maturity level in the capability “Interdisciplinary skills of employees” might be related to an increasingly high maturity level in lean methodologies [
46]. The concepts of interdisciplinary collaboration and process-oriented thinking are core ideas of lean management [
47]. This, once again, supports the importance of considering Industry 4.0 and Lean Management or comparable approaches in an integrated way [
48].