Next Article in Journal
Gamit! Icing on the Cake for Mathematics Gamification
Next Article in Special Issue
Educational Loss at Times of Crisis: The Role of Games in Students’ Learning in Palestine and Iraq
Previous Article in Journal
Regenerative Agriculture—A Literature Review on the Practices and Mechanisms Used to Improve Soil Health
Previous Article in Special Issue
Determining Ergonomic Appraisal Factors Affecting the Learning Motivation and Academic Performance of Students during Online Classes
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Innovation Performance Prediction of University Student Teams Based on Bayesian Networks

1
Department of Industrial Engineering, Northwestern Polytechnical University, Xi’an 710072, China
2
School of Public Policy and Administration, Northwestern Polytechnical University, Xi’an 710072, China
3
School of Life Sciences, Northwestern Polytechnical University, Xi’an 710072, China
*
Author to whom correspondence should be addressed.
Sustainability 2023, 15(3), 2335; https://doi.org/10.3390/su15032335
Submission received: 14 November 2022 / Revised: 21 January 2023 / Accepted: 21 January 2023 / Published: 27 January 2023

Abstract

:
Many studies have been conducted on the impact of dualistic learning, knowledge sharing, member heterogeneity, and their influencing factors on team performance in enterprises. However, research on the substantial differences between university student teams and enterprise teams is scarce. To address this void, this empirical study explores how the mechanism of dualistic learning affects university student teams’ learning performance facing rapid changes in higher education. Using the questionnaire, two modules of dualistic learning were identified through reliability and validity tests, and the research data set was formed. After preprocessing the data set, two team innovation performance prediction models were established based on the Bayesian network (BN). According to the characteristics of BN, the probability reasoning of the model was calculated and the posterior probability table was obtained under different dualistic learning levels. The results show that dualistic learning has significant impacts on innovation performance, and the improvement of dualistic learning can stimulate team innovation performance. This research can provide important theoretical guidance for teams to improve their ability, gain competitive advantages, and stimulate the creative enthusiasm of college students. Hopefully, this research will enrich the existing theoretical connotation to a certain extent and promote the development of relevant empirical research.

1. Introduction

Innovation is the most important driving engine for development and is the strategic support for building a modern economic system. University students have the desire for interdisciplinary cooperation and the autonomy to choose courses and research fields that provide convenience for their innovative behavior. Moreover, as the inheritors of knowledge and the builders of the future society, university students play an important role in the development of innovation. In general, their innovative behavior has been paid more and more attention to by society and researchers. Team innovation performance has become a key indicator to measure the team’s technological innovation ability and overall performance. Therefore, research on the prediction of team innovation performance of university students can provide an effective method for the team to improve its ability and gain competitive advantages, so as to stimulate university students’ creativity and enthusiasm.
We found that in addition to the research on the factors that affect the innovation performance of enterprise employees in organizations [1], at the educational level some scholars have conducted indepth research on the relationship between creativity and intelligence in middle school students [2] and used regression analysis to verify that scientific creative capabilities can accurately predict academic performance [3]. However, the existing literature lacks research on the relationships between dualistic learning, knowledge sharing, and innovation performance of university student teams from the perspective of organizational learning and knowledge management. Meanwhile, research on predicting innovation performance using Bayesian models is relatively scarce.
Hence, we planned to use the Bayesian network (BN) model to establish two performance prediction models, which compared the impacts of different levels of dualistic learning on university student teams’ learning performance. Firstly, 228 questionnaires were designed and collected. Secondly, the prediction model was established through the steps of reliability and validity tests and confirmatory factor analysis. Finally, the importance degree of each influencing factor was calculated and each research variable was ranked according to the experimental results. Generally, it was of great significance to explore the relationship between the main factors affecting university students’ team learning and accurately predict the innovation performance of university student teams by the BN model. On the one hand, the research results can theoretically uncover how the mechanism of university student teams’ dualistic learning affects their learning performance and reveal a predictive method of team innovation performance. On the other hand, in practical terms, the results can provide management enlightenment for improving the innovation performance of university student teams.

2. Literature Review

Dualistic learning is divided into exploration learning and exploitation learning, in which exploration learning, characterized by “pursuing and acquiring knowledge in new fields, is manifested in the initiative of the organization to create new knowledge, explore new technologies and strategies, and abilities to find new opportunities and new rules.” Exploitation learning is characterized by “learning and using existing knowledge” [4]; that is, in a mature or new environment, existing knowledge is fully exploited to better apply it to practice, and the original knowledge can be improved and adjusted according to actual needs, so as to improve the implementation efficiency and performance of the established scheme in the specific environment. As for the relationship between dualistic learning and team performance, Katila and Ahuja [1] found that the combination of the two learning methods can enhance the viability of the organization, improve the financial performance of the enterprise, and enhance the ability of organizational learning and innovation. Colbert [5] believes that the competitive advantage generated by the interaction of two learning methods is better than the competitive advantage obtained by carrying out one learning method alone. Some scholars have also analyzed the characteristics of complex network structures and believed that a dualistic balanced network structure can better promote organizational performance [6]. However, the impact of dualistic capability on organizational performance is flexible [7] and the change in network situation may significantly affect organizational performance. Dualistic learning plays a more important role in improving the performance of enterprise alliances with higher network centrality. It can also help enterprise alliances with fewer structural holes achieve better performance. Although the relationship between dualistic learning and team performance has been widely studied, it is mainly concentrated in the industrial and business sectors. At present, scholars have empirically studied the impacts of dualistic learning, knowledge sharing, member heterogeneity, and their influencing factors on team performance in an enterprise [8,9,10]. The results from these researchers have an important guiding significance for human resource management practices and innovation management practices in the enterprise field. However, there are substantial differences between university student teams and enterprise teams. There are also substantial differences in learning objects and innovation pursuits. This study focuses on the theoretical issue of “the mechanism of dualistic learning impacts on university student team learning performance” to explore the impacts of dualistic learning on innovation performance in university student teams. Hopefully, this research will enrich the existing theoretical connotation to a certain extent and promote the development of relevant theoretical research.
With the rapid development of computers and chips, the limits of computing power are constantly broken through, and a large number of artificial intelligence (AI) methods, including Bayesian methods, support vector machine (SVM), integrated learning, and deep learning, are emerging and being gradually applied to data mining and predictive modeling [11,12,13,14,15]. Compared with the traditional generalized linear model, the Bayesian network has fewer restrictions on modeling variables and is more robust in the nonlinear relationship between variables. It can better fit the data distribution and give more accurate prediction results. At present, the Bayesian network (BN) is widely used in industrial applications, financial analysis, and military applications. Sun et al. [16] proposed an evolutionary algorithm based on BN to optimize the task allocation strategy, which to some extent solved the response delay when the industrial internet of things is used to monitor industrial plants. Bashar et al. [17] proposed three new network management solutions based on BN, which to some extent meet the requirements of automation and efficient management of telecommunications networks. Mahadevan et al. [18] put forward a structural reliability evaluation method combining BN, and also considered the multiple failure sequence characteristics of large structures and the correlation between element limit states. The effectiveness of this evaluation method was verified by comparison with the traditional series parallel reliability method. In order to control the quality risk of cold storage construction projects, Song et al. [19] constructed a BN-based cold storage construction quality risk assessment model and obtained key quality risk factors through reverse reasoning analysis and sensitivity analysis. Johnson et al. [20] created a math app to provide students in the Emirates in grade six with the opportunity to explore mathematics and then used BN to examine the educational implications and explore the impacts of different factors on their enthusiasm for learning mathematics.
To summarise, most researchers rarely use AI methods to explore the factors that affect university student teams’ innovation performance and predict their innovation performance. To achieve this aim, first of all, our study conducted a questionnaire survey on university students’ innovation teams to measure research variables such as dualistic learning, knowledge sharing, and task interdependence. We also verified the measurement model and data fitting of this study with confirmatory factor analysis in the next step. Then, we used the questionnaire (a five-point Likert scale) to quantify all variables and made the scale transform into the innovation performance dataset. Next, based on the dataset, we established two BN prediction models of university student team performance using a tree-enhanced Bayesian algorithm. After that, we calculated the accuracy of these models. Finally, we quantitatively analysed the importance degree of each influencing factor and ranked each research variable according to the experimental results. The theoretical solution could be viewed in Figure 1.

3. Methods

3.1. Questionnaire

This study uses the mature scale developed by domestic and foreign scholars to ensure the validity and reliability of the questionnaire. Through literature collection and reading, we found a mature scale for dualistic learning, knowledge sharing, team innovation performance, and task interdependence of university student teams. Among them, the ambidexterity learning scale [21] includes two dimensions of exploitative learning and exploratory learning, both of which were developed by Atuahene-Gema and Murray. Each dimension includes three measurement items. The knowledge-sharing behavior scale [22] was developed by Collins and translated by Tian Lifa, including four dimensions. The team task performance scale adopted the scale [23] developed by Chen Wei and others, which has four dimensions. The task interdependence scale [24] was developed by Liden, Wayne, and Broadway, and was translated by Liu Jun, with a total of three items.
To meet the needs of this study, we made corresponding modifications and improvements and formed the first draft of this research questionnaire. Then, based on the first draft of the questionnaire, we consulted with professionals on the rationality, logical relationship, wording, and grammar of each item. In order to avoid the fact that the respondents cannot answer the questionnaire due to the academic, professional, and obscure wording, we piloted on several college students in the university’s innovation team. According to their opinions, we modified and improved the relevant questions.
The five-point Likert scale was used to measure the relevant items of the research variables such as dualistic learning, knowledge sharing, and task interdependence of university student teams. Then we analyzed the reliability and validity of the predictive test data of the questionnaire and deleted the items according to the analysis results to form a formal questionnaire. The finalized questionnaire (see Appendix A) contained 11 basic personal information, 4 measurement scale dimensions, and 23 specific questions.

3.2. Confirmatory Factor Analysis

In this study, we used confirmatory factor analysis (CFA) to verify the convergent validity and discriminant validity of the scale for research variables, including dualistic learning, task interdependence, knowledge sharing, and learning performance. Convergent validity refers to the similarity degree of the same abstract concept or trait measured by different measurement methods. It is usually judged by the standardized factor loadings of each item and the composite reliability. If the value of the standardized factor loading of each item is greater than 0.5 and the value of composite reliability is greater than 0.6, it indicates that the scale has good convergent validity.
Discriminant validity refers to the degree to which a construct is truly different from other constructs according to empirical criteria. In an experiment, if it can be proved statistically that the indicators which are not related to the preset construct are indeed not related to the construct, then the experiment has discriminant validity. In this study, the structural equation model was used to obtain the CFA results of variables. Kline believes that multiple fitting indicators should be used to verify the fitting degree of research samples and theoretical models, and indicators such as x 2 / d f , GFI, AGFI, and RMSEA can be used to evaluate the fitting degree of structural equation models [25].

3.3. Bayesian Network (BN)

The BN originated from Bayesian statistical analysis theory, which is a classical machine learning method. BN uses a directed acyclic graph to succinctly represent the joint probability distribution on a group of random variables. The graph is annotated with the conditional probability table of any instantiated node probability distribution for a given parent node [26]. As an effective tool that combines probability theory and graph theory to deal with uncertainty reasoning and data analysis, BN can not only reveal the causal relationship of the problem through the graph structure but also analyze the uncertainty of the problem according to the principles of probability theory, thus greatly reducing the complexity of reasoning calculation.
The Tree Augmented naïve Bayes network model (TAN), which considers the relationship between variables, is an extension of the NB classifier. It retains the complexity and robustness needed to facilitate the mining of associations between variables, thus improving the accuracy of the model [27]. An example of a TAN model is shown in Figure 2.
In this model, C represents the target variable and A 1 , A 2 , , A n represents the attribute variable. The joint probability distribution of the TAN classifier can be expressed as Equation (1)
P ( C , A 1 , A 2 , A n ) = P ( C ) i = 1 n P ( A i | p a ( A i ) )
A posteriori probability refers to the probability that is more realistic after obtaining new additional information through the Bayesian formula and investigation, then modifying the prior probability. The Bayesian formula can be expressed as Equation (2)
P ( M i | N ) = P ( M i ) P ( N | M i ) i = 1 n P ( M i ) P ( N | M i )
In the formula, P M i is the prior probability obtained through data statistics. This formula is to explore the probability of each event that causes N to occur when it is observed that event N has occurred. A posteriori probability is mainly used for predictive inference—to predict the probability of occurrence of a certain result node in the network through the information of the network node, and diagnostic inference—to infer the probability of occurrence of the network condition node through the information of the result node.

3.4. Confusion Matrix and ROC Curve

To ensure the high accuracy of the prediction model, the accuracy of the BN model must be evaluated.
The confusion matrix is a basic method to evaluate the reliability of predictive classification models. The column represents the actual situation, while the row represents the prediction result of the classifier. The possible results produced by the classifier are true positive (TP), false positive (FP), false negative (FN), and true negative (TN).
The model accuracy is determined by the following, Equation (3).
A ccuracy = T P + T N T P + F P + T N + F N
At the same time, the Bayesian classifier classifies the samples into zero or one, according to the classification threshold. Moreover, in these classifiers, different thresholds determine different classification results and classifier evaluation indexes. Considering the above situation, we used the ROC curve and the area under the curve (AUC) to measure the model performance. The ROC curve is called the receiver operating characteristic curve, which is commonly used to measure the overall reliability of the classification model. It takes multiple pairs of FPR rates and TPRs as the horizontal axis and vertical axis and obtains the ROC curve by connecting each intersection point. The larger the area under the curve, the higher the accuracy of the model, and the better the classification effect [28]. AUC is the calculated value of the area under the ROC curve, and the value range of AUC is between 0.5 and 1. The greater the AUC value, the better the effect of the machine learning classifier. This method is very convenient and accurate for evaluating the performance of a prediction model.

3.5. Importance Measure

Importance refers to the degree to which the state change of single or multiple components in the system affects the reliability of the system [29]. It is used to evaluate the relative importance of various factors. Birnbaum first put forward the importance analysis theory in 1969, which is also called probability importance [30].
The classic Birnbaum importance depends on the structure of the system model and has no relationship with the probability of current basic events.
At moment t, the Birnbaum importance of the i unit is defined as Equation (4)
I B ( i | t ) = h ( t ) p i ( t ) , i = 1 , 2 , , n
Birnbaum importance is obtained from the partial derivative of system reliability to unit reliability P i t . This is the traditional sensitivity analysis method.
For the whole system unit, the importance of Birnbaum can also be expressed as Equation (5)
I B ( i | t ) = R ( 1 i , p ) R ( 0 i , p )
where R 1 i , p AA represents the probability of the top event when the bottom event i must occur, and R 0 i , p represents the probability of the top event when the bottom event i must not occur.

4. Results and Discussions

4.1. Reliability and Validity Test of the Questionnaire

The reliability of the scale is tested by Cronbach’s α value and the CR value of the scale. The calculation shows that the Cronbach’s α of all latent variables in this study was greater than 0.7, and the CR value of the composite reliability was greater than 0.6, indicating that the scale has a high level of internal consistency reliability. The scale used in this paper is formed by referring to existing literature, expert interviews, and preresearch, which can ensure its content validity. Table 1 shows that the standardized factor loading value of each item was greater than 0.5, and the AVE value of each latent variable was greater than 0.5, indicating that the scale had good convergent validity.
Finally, the null model, single-factor model, six-factor model, and seven-factor model were compared by confirmatory factor analysis to test the discriminant validity of all variables. Table 2 shows that the seven-factor model fits well ( x 2 = 320.868, p < 0.001; x 2 / d f = 1.535; RMSEA = 0.049; GFI = 0.897; AGFI = 0.864; TLI = 0.936; CFI = 0.947) and was significantly better than the fitness of other models, so all variables have good discriminant validity.
Based on the experimental results, we divided the question module in the scale into five predictive variables, namely, exploration learning, exploitation learning, sharing willingness, sharing behavior, task interdependence, and a predictive goal, learning performance.

4.2. Establishment of University Student Team Innovation Performance Data Set

This study collected 280 sample data about the team learning of university students. By sample integrity verification and rationality evaluation, we finally screened 228 sample data to establish the BN prediction model. Among the respondents, 121 teams (53.1%) come from world-class universities, 24 teams (10.5%) come from world-class disciplines, and 83 teams (36.4%) come from other universities. Some cases of the data set are shown in Figure 3.
After a single factor analysis of the basic conditions of 228 selected samples, we selected some basic information that has an impact on team learning performance into the model, including 121 (53.07%) from world-class universities, 83 (36.40%) world-class discipline universities, and 24 (10.53%) from other universities; The team’s final results were 15 research papers (6.58%), 114 application software or equipment developments (50.00%), and 99 business plans (43.42%); 90 (39.47%) of the core team members were sophomores and below, 76 (33.33%) were juniors, and 62 (27.19%) were juniors and above; 80 (35.09%) respondents participated in the innovation team once, 75 (32.89%) respondents participated twice, and 73 (32.05%) respondents participated three or more times. In addition, after sorting out the questionnaire information through the equal frequency division, we received the following prior probability information of the dualistic learning part and the learning performance part. In the two-level classification model, the prior probability of learning performance was not higher than 27 points is 62.72% (143/228), and the prior probability of learning performance higher than 27 points is 37.28% (85/228). In another prediction model, the prior probabilities of learning performance were lower than 25 points, 25–27 points, 27–29 years, and higher than 29 points are 28.51% (65/228), 34.21% (78/228), 15.79% (36/228), and 21.49% (49/228), respectively. Later, based on the prior probability and the change of dualistic learning proportion distribution, the posterior probability distribution of the model is discussed.

4.3. TAN Model for Learning Performance Prediction

Based on the university student team database, we established two TAN models for experiments. Primarily based on the equal frequency principle, we divided the learning performance score into two intervals ≤27 and >27, and four intervals ≤25, 25–27, 27–29, and >29. After establishing the data set of the university students’ innovation team and converting all continuous prognostic factors into discrete variables, a prediction model was established using the TAN algorithm. In this model, team learning performance scores were set as target variables to be predicted, while other factors were considered to be the attribute variables that affect the status of target variables. Since the two models are built using the same database, the Bayesian network model diagram was consistent (Figure 4).

4.4. Accuracy of TAN Model

The confusion matrix, reliability, and accuracy of the TAN model are shown in Table 3 and Table 4. The independent variables of the data set were used to establish the TAN model to obtain the predicted team performance, compare it with the data in the original record, and then use the confusion matrix evaluation indicators to determine the reliability and accuracy of the TAN prediction model. The two-level classification model of learning performance is shown in Table 3. The reliability and accuracy of prognosis prediction were obtained by using the confusion matrix evaluation index with a default probability threshold of 0.5.
In the two-level classification model, the number of respondents with a learning performance score of ≤27 was 143, of which 125 were classified correctly, and the reliability was 84.46% (125/143). The number of respondents with a learning performance score of >27 was 85, of which 62 were classified correctly, and the reliability was 77.50% (62/85). A total of 125 respondents with a learning performance of ≤27 points and 62 respondents with a learning performance of >27 points were correctly classified. Therefore, the overall model accuracy is 82.02% (Figure 5).

5. Research on the Effect of Dualistic Learning on Innovation Performance

5.1. Research Strategy Design

First, we locked one of the initial states of exploration learning or exploitation learning, and then changed the state of the other to observe the probability distribution of learning performance. Through experiments, the posterior probabilities of both models preliminarily confirmed that the promotion of dualistic learning can improve team innovation performance. For example, the initial state of exploration learning or exploitation learning is in the middle score state. When exploration learning was improved to >12, the probability of the learning performance score >27 increased from 43.26% to 71.07% in the two-level classification model. When exploitation learning was improved to >12, the probability continued to increase to 78.63% (Figure 6).

5.2. A Posterior Probability Table Based on the Distribution of Dualistic Learning States

After experimenting with all state combinations of exploration learning and exploitation learning, we obtained a complete posterior probability table of the TAN model that dualistic learning affects innovation performance, as shown in Table 5.
With the improvement of exploration learning and exploitation learning, the rating of learning performance was also improved. In other words, dualistic learning had a positive feedback effect on learning performance, which is consistent with the research conclusions of previous scholars.

5.3. Calculation of Influence Factor Importance

To quantitatively analyze the importance degree of each influencing factor, we introduced the importance calculation method. The general importance calculation method takes into account the occurrence time of the transition, so it was necessary to introduce the probability of the state occurrence into the system model. However, the design of the feature importance calculation method in this paper does not consider the failure function of the feature. It only considers the impact of a factor on the overall system from the perspective of the state change of the feature itself in order to calculate the importance of each influence factor.
We chose Formula (4) as the basic formula for the Birnbaum importance calculation. The calculation method of the Birnbaum importance of the influencing factors in the innovation performance prediction model is given: we first locked all variables in the model, and then changed the prior probability distribution of one of the variables, such as setting the sharing behavior to ≤15, and then observed the difference between the probability of innovation performance >27 and its prior probability. Then, we set the sharing behavior to ≤18 to continue recording the difference between the probability of innovation performance >27 and its prior probability. Finally, we added up the absolute values of the differences between all states, and calculated the importance by referring to Formula (4). Based on this method, the importance of each influencing factor is shown in Table 6.
Through the experiment, we found that the importance of the five scale features of sharing behavior, task interdependence, exploration learning, sharing willingness, and exploitation learning were close and at a high level, which was much higher than the three kinds of basic information we obtained through the single factor analysis. The results indicated the importance of dualistic learning for innovation performance. In addition, the study suggested that the importance of sharing behavior ranked first. It can be preliminarily judged that knowledge sharing played an important intermediary role in dualistic learning and team learning performance. The reason was that the knowledge-sharing behavior realised the knowledge circulation flow and knowledge spillover effect within the team, further improving the knowledge structure of team knowledge receivers and disseminators, enhancing the innovation ability and scientific research ability of university students, and promoting the improvement of their team task performance. In follow-up research, based on this experiment, we can further confirm this hypothesis and explore the impacts of other influencing factors on innovation performance and the interrelationship between factors.

6. Discussion

In this study, we established a prediction model based on the Bayesian network and designed a method that can analyze the impact of dualistic learning on the innovation performance of university students in different situations. We explored the knowledge construction and learning performance influencing factors of university students’ team learning from the perspective of knowledge construction and performance management. On this basis, we further discussed the mechanism of university student teams’ dualistic learning affecting learning performance through empirical research.
The existing literature lacks research on the relationship between team dualistic learning, knowledge sharing, and learning performance from the perspective of predictive modeling. Therefore, we aimed to use statistical methods and importance measure theory to supplement the shortage of existing literature. This research focused on university student teams, constructed a theoretical model of the relationship between dualistic learning, knowledge sharing, and team learning performance, and revealed the “black box” of the mechanism of dualistic learning on team learning performance in the context of university students’ innovative team learning by using quantitative and qualitative methods, indicating that the improvement of dualistic learning can significantly promote innovation performance. It is basically consistent with the results of other scholars’ research, that is, dualistic learning, as well as knowledge sharing, can positively affect innovation performance or task performance in most cases. Although the research objects and methods are different, there are many similarities between our conclusions and other studies’ conclusions.
The theoretical contribution of this study is mainly reflected in two aspects. First, from the perspective of university students’ research-based learning, the research results of the enterprise ambidexterity learning theory are expanded to the university students’ innovative team learning situation. For university students’ learning, the traditional individual level is extended to the team level, the general creativity is shifted to scientific creativity, and the enterprise organizational situation is stretched to the educational organizational situation, which further promotes the integration of team level and scientific dimension and further enriches the theoretical system and research content of organizational learning. Secondly, dividing the ambidexterity learning into exploitative learning and exploratory learning not only explores the direct effects on team task performance directly, however, it also reveals the indirect effect of ambidexterity learning on team task performance through the mediating effect knowledge sharing behavior, revealing the university students’ innovation team learning situation and the dual role of team task performance of the “black box”.
This study has several limitations. Firstly, the data set only contains 228 samples, so more detailed data screening should be carried out for the questionnaires obtained, key information should be identified, and the dataset should be expanded for future research. Secondly, this study doesn’t fully consider the impacts of knowledge sharing and task interdependence on team innovation performance. In the future, based on our calculations of the importance of each influencing factor, we will conduct supplementary research on the mediating role of knowledge sharing and the moderating role of task interdependence.

7. Conclusions

This empirical study explored how the mechanism of dualistic learning affects university student teams’ learning performance in the face of rapid changes in higher education. We obtained 228 samples of university student teams from three types of universities. Among them, 121 samples (53.1%) were obtained from world-class universities, 24 samples (10.5%) were obtained from world-class disciplines, and 83 samples (36.4%) were obtained from other universities. We evaluated these samples and established two BN prediction models of university student team performance using a tree-enhanced Bayesian algorithm. The accuracy rates of the two-level classification model and the four-level classification model were 82.02% and 64.91%, respectively. In both models, the posterior probability of high learning performance scores increased with the increase of exploitation learning and exploration learning scores. This study shows that the improvement of dualistic learning had a significant impact on innovation performance.
The practical uses of this study are as follows. From the perspective of policy making by innovation team managers, the innovation and entrepreneurship management departments of colleges and universities should actively build an academic community for university students’ team knowledge sharing, formulate a scientific and reasonable system, stimulate students’ knowledge-sharing motivation, improve students’ knowledge sharing ability, promote university students’ knowledge resource sharing, and improve the team’s innovation output. From the perspective of university students’ team building, we should pay attention to establishing good interpersonal interaction and mutual learning relationships with other members, constantly optimise the team knowledge-sharing environment, promote the knowledge circulation flow within the team, further create a strong academic atmosphere to stimulate college students’ learning motivation and enthusiasm. From the perspective of individual students, it is necessary to associate the expectation of success with the teams’ subjective task value and pay more attention to the selection and application of learning strategies, so as to promote their teams to achieve higher task performance.

Author Contributions

Conceptualization, Z.C.; methodology, X.Z.; formal analysis, J.L. and C.Z.; investigation, J.L. and C.Z.; data curation, D.S.; writing—original draft preparation, X.Z.; writing—review and editing, Z.C.; supervision, Z.C.; project administration, X.Z.; funding acquisition, X.Z. and D.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Social Science Foundation’s Social Science Academic Association Themed Academic Activity Funding Program [Grant Nos. 22STA010]; the China Association of Higher Education Program [Grant Nos. 2022ZD03]; Development Strategy Research Fund of Northwestern Polytechnical University [Grant Nos. 2023FZZ09] and Innovation and Entrepreneurship Program of Graduate School of Northwestern Polytechnical University [Grant Nos. D5204210513].

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The experimental data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflict of interest regarding this work.

Appendix A

Table A1. Questionnaire.
Table A1. Questionnaire.
Dimension Indicators
Dualistic
learning
1–3 refer to exploitation learning (using existing knowledge); 4–6 refer to exploration learning (exploration of unknown knowledge)
1. In the task completion, the team is more inclined to search for the team’s task target information in its field
2. When encountering problems, the team pays more attention to searching for valuable information and solutions from familiar fields
3. The team is more inclined to refine and excavate the existing knowledge
4. In the task completion, the team is more inclined to seek the team task target information in the new field
5. When encountering problems, the team pays more attention to seeking information or methods in new fields to be verified, even if faced with uncertain risks
6. Teams are more inclined to learn and master new knowledge
Knowledge
sharing
1–3 refer to sharing willingness; 4–7 refer to sharing behavior
1. In order to keep up with the requirements of team tasks, members are willing to share work inspiration and creative ideas with others
2. Members are willing to share their experience and knowledge to help the team implement new ideas
3. Members are willing to share knowledge and information acquired with others
4. Knowledge sharing among members helps to improve the level of understanding
5. It is faster to complete learning tasks through knowledge sharing and communication among members than by individuals
6. Members can skillfully solve problems encountered through sharing knowledge
7. When completing the task, I found that I learned a lot of new knowledge from others through knowledge sharing.
Learning
performance
1–4 refer to task performance; 5–8 refer to cooperation performance
1. The team can complete the task efficiently in a certain time
2. The team can complete the task within the budget
3. Team members can work efficiently
4. The team’s work results are basically consistent with the expected goals
5. Team members cooperate happily at work
6. Team members are trained and promoted in the cooperation process of work tasks
7. I hope to continue to cooperate with others and complete learning tasks together
Task interdependence1. Team members trust each other’s work-related information
2. Different members of the team have good coordination in work tasks
3. Team members can cooperate well on complex tasks

References

  1. Katila, R.; Ahuja, G. Something Old, Something New: A Longitudinal Study of Search Behavior and New Product Introduction. Acad. Manag. J. 2002, 45, 1183–1194. [Google Scholar] [CrossRef]
  2. Ferrando, M.; Gloria, S.; Prieto, L.; Sáinz, M.; Ferrándiz, C. Synthetic-Creative Intelligence and Psychometric Intelligence: Analysis of the Threshold Theory and Creative Process. Turk. J. Gift. Educ. 2016, 6, 88–98. [Google Scholar]
  3. Ruiz, M.J.; Bermejo, R.; Ferrando, M.; Prieto, M.D.; Sáinz, M. Intelligence and scientific-creative thinking: Their convergence in the explanation of students’ academic performance. Electron. J. Res. Educ. Psychol. 2014, 12, 282–302. [Google Scholar]
  4. March, J.G. Exploration and Exploitation in Organizational Learning. Organ. Sci. 1991, 2, 71–87. [Google Scholar] [CrossRef]
  5. Colbert, B.A. The Complex Resource-Based View: Implications for Theory and Practice in Strategic Human Resource Management. Acad. Manag. Rev. 2004, 29, 341–358. [Google Scholar] [CrossRef]
  6. Lazer, D.; Friedman, A. The Network Structure of Exploration and Exploitation. Adm. Sci. Q. 2007, 52, 667–694. [Google Scholar] [CrossRef] [Green Version]
  7. Lin, Z.; Yang, H.; Demirkan, I. The Performance Consequences of Ambidexterity in Strategic Alliance Formations: Empirical Investigation and Computational Theorizing. Manag. Sci. 2007, 53, 1645–1658. [Google Scholar] [CrossRef] [Green Version]
  8. Hansen, M.T. Knowledge Networks: Explaining Effective Knowledge Sharing in Multiunit Companies. Organ. Sci. 2002, 13, 232–248. [Google Scholar] [CrossRef]
  9. Davenport, T.H.; Prusak, L. Working Knowledge: How Organizations Manage What They Know; Harvard Business School Press: Cambridge, MA, USA, 1998. [Google Scholar]
  10. Williams, K. Demography and Diversity in Organizations: A Review of 40 Years of Research. Res. Organ. Behav. 1998, 8, 70–140. [Google Scholar]
  11. Wang, C.; Zhu, K.; Hedström, P.; Li, Y.; Xu, W. A Generic and Extensible Model for the Martensite Start Temperature Incorporating Thermodynamic Data Mining and Deep Learning Framework. J. Mater. Sci. Technol. 2022, 128, 31–43. [Google Scholar] [CrossRef]
  12. Gu, Z.; Cao, M.; Wang, C.; Yu, N.; Qing, H. Research on Mining Maximum Subsidence Prediction Based on Genetic Algorithm Combined with XGBoost Model. Sustainability 2022, 14, 10421. [Google Scholar] [CrossRef]
  13. Ponnusamy, V.K.; Kasinathan, P.; Madurai Elavarasan, R.; Ramanathan, V.; Anandan, R.K.; Subramaniam, U.; Ghosh, A.; Hossain, E. A Comprehensive Review on Sustainable Aspects of Big Data Analytics for the Smart Grid. Sustainability 2021, 13, 13322. [Google Scholar] [CrossRef]
  14. Jiang, Y.; Jin, C.; Yu, H.; Wu, J.; Chen, C.; Yuan, Q.; Huang, W.; Hu, Y.; Xu, Y.; Zhou, Z.; et al. Development and Validation of a Deep Learning CT Signature to Predict Survival and Chemotherapy Benefit in Gastric Cancer: A Multicenter, Retrospective Study. Ann. Surg. 2021, 274, e1153–e1161. [Google Scholar] [CrossRef] [PubMed]
  15. Pang, J.; Pu, X.; Li, C. A Hybrid Algorithm Incorporating Vector Quantization and One-Class Support Vector Machine for Industrial Anomaly Detection. IEEE Trans. Ind. Inform. 2022, 18, 8786–8796. [Google Scholar] [CrossRef]
  16. Sun, L.; Wang, J.; Lin, B. Task Allocation Strategy for MEC-Enabled IIoTs Via Bayesian Network Based Evolutionary Computation. IEEE Trans. Ind. Inform. 2021, 17, 3441–3449. [Google Scholar] [CrossRef]
  17. Bashar, A.; Parr, G.; McClean, S.; Scotney, B.; Nauck, D. Application of Bayesian Networks for Autonomic Network Management. J. Netw. Syst. Manag. 2014, 22, 174–207. [Google Scholar] [CrossRef]
  18. Ma, J.; Su, H.; Zhao, W.; Liu, B. Predicting the Remaining Useful Life of an Aircraft Engine Using a Stacked Sparse Autoencoder with Multilayer Self-Learning. Complexity 2018, 2018, 3813029. [Google Scholar] [CrossRef] [Green Version]
  19. Song, Y.; Wei, Z. Quality Risk Management Algorithm for Cold Storage Construction Based on Bayesian Networks. Comput. Intell. Neurosci. 2022, 2022, 6830090. [Google Scholar] [CrossRef]
  20. Johnson, J.D.; Smail, L.; Corey, D.; Jarrah, A.M. Using Bayesian Networks to Provide Educational Implications: Mobile Learning and Ethnomathematics to Improve Sustainability in Mathematics Education. Sustainability 2022, 14, 5897. [Google Scholar] [CrossRef]
  21. Atuahene-Gima, K.; Murray, J.Y. Exploratory and Exploratative learning in new product development: A social capital perspective on new technology ventures in China. J. Int. Mark. 2007, 15, 1–29. [Google Scholar] [CrossRef]
  22. Collins, C.J.; Smith, K.G. Knowledge exchange and combination: The role of human resource practices in the performance of high-technology ferms. Acad. Manag. J. 2006, 49, 544–560. [Google Scholar] [CrossRef] [Green Version]
  23. Chen, W.; Yang, Z.; Lang, Y. An Empirical Study on the Effect of Team Fracture Zone on Team Effectiveness: Mediation between Relational Leadership Behavior and Interactive Memory System. Bus. Rev. 2015, 27, 99–110. [Google Scholar] [CrossRef]
  24. Liden, R.C.; Wayne, S.J.; Broadway, L.K. Task interdependence as a moderator of the relation between group control and performance. Hum. Relat. 1997, 50, 169–181. [Google Scholar] [CrossRef]
  25. Kline, R.B. Principles and Practice of Structural Equation Modeling; Guilford Publications: New York, NY, USA, 2015. [Google Scholar]
  26. Pearl, J. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference; Morgan Kaufmann: San Fransisco, CA, USA, 1988. [Google Scholar]
  27. Friedman, N.; Geiger, D.; Goldszmidt, M. Bayesian Network Classifiers. Mach. Learn. 1997, 29, 131–163. [Google Scholar] [CrossRef] [Green Version]
  28. Bamber, D. The Area Above the Ordinal Dominance Graph and the Area Below the Receiver Operating Characteristic Graph. J. Math. Psychol. 1975, 12, 387–415. [Google Scholar] [CrossRef]
  29. Ma, C.; Wang, W.; Cai, Z.; Zhao, J. Maintenance optimization of reconfigurable systems based on multi-objective Birnbaum importance. Part O J. Risk Reliab. 2022, 236, 277–289. [Google Scholar] [CrossRef]
  30. Birnbaum, Z.W. On the Importance of Different Components in a Multicomponent System; Academic Press: New York, NY, USA, 1969. [Google Scholar]
Figure 1. Theoretical Solution.
Figure 1. Theoretical Solution.
Sustainability 15 02335 g001
Figure 2. Tree Augmented Naive Bayes network model.
Figure 2. Tree Augmented Naive Bayes network model.
Sustainability 15 02335 g002
Figure 3. University students’ team innovation performance dataset.
Figure 3. University students’ team innovation performance dataset.
Sustainability 15 02335 g003
Figure 4. Relationship of Bayesian network prediction model.
Figure 4. Relationship of Bayesian network prediction model.
Sustainability 15 02335 g004
Figure 5. ROC curve of learning performance with a survival time of >27 months under the two-level classification model.
Figure 5. ROC curve of learning performance with a survival time of >27 months under the two-level classification model.
Sustainability 15 02335 g005
Figure 6. The change of dualistic learning results in the change of learning performance probability in the two-level classification model.
Figure 6. The change of dualistic learning results in the change of learning performance probability in the two-level classification model.
Sustainability 15 02335 g006
Table 1. Reliability and validity test results of the scale.
Table 1. Reliability and validity test results of the scale.
VariableItemFactor LoadingCronbach’s αCRAVE
Exploration learningQ10.850.740.850.65
Q20.84
Q30.72
Exploitation learningQ40.830.740.840.64
Q50.81
Q60.77
Sharing behaviorQ70.850.840.870.69
Q80.83
Q90.82
Sharing willingnessQ100.790.770.0.
Q110.79
Q120.70
Q130.62
Task performanceQ140.810.730.800.50
Q150.72
Q160.66
Q170.62
Cooperation performanceQ180.870.730.810.60
Q190.84
Q200.57
Task interdependenceQ210.880.810.810.60
Q220.88
Q230.80
Table 2. Results of discriminant validity analysis.
Table 2. Results of discriminant validity analysis.
ModelχdfRMSEATLICFI
null model940.5772300.1170.6310.664
seven-factor model a320.8682090.0490.9360.947
six-factor model b448.4662150.0690.8700.890
six-factor model c481.3442150.0740.8520.874
six-factor model d417.2442150.0640.8870.904
six-factor model e431.4512150.0670.8800.898
six-factor model f396.6082150.0610.8990.914
six-factor model g375.8362150.0570.9110.924
single-factor model h786.3942300.1030.7110.737
Note: a In the null model, there was no relationship between all measured items. b Combining exploration learning and exploitation learning into a potential factor. c Combining exploitation learning and sharing willingness into a potential factor. d Combining sharing willingness and sharing behavior into a potential factor. e Combining sharing behavior and task interdependence into a potential factor. f Combining task interdependence and task performance into a potential factor. g Combining task performance and cooperation performance into a potential factor. h Integrate all items into a potential factor.
Table 3. The confusion matrix, reliability, and accuracy of the two-level classification model.
Table 3. The confusion matrix, reliability, and accuracy of the two-level classification model.
Two-Level Classification Model
≤27(148)>27(80)
Confusion matrix
≤27(143)12518
>27(85)2362
Reliability
≤27(143)84.46%22.50%
>27(85)15.54%77.50%
Accuracy
≤27 (143)87.41%12.59%
>27 (85)27.06%72.94%
Table 4. The confusion matrix, reliability, and accuracy of the four-level classification model.
Table 4. The confusion matrix, reliability, and accuracy of the four-level classification model.
Four-Level Classification Model
≤25(70)25–27(83)27–29(32)>29(43)
Confusion matrix
≤25(65)471422
25–27(78)155265
27–29(36)310185
>29(49)57631
Reliability
≤25(65)67.14%16.87%6.25%4.65%
25–27(78)21.43%62.65%18.75%11.63%
27–29(36)4.29%12.05%56.25%11.63%
>29(49)7.14%8.43%18.75%72.09%
Accuracy
≤25(65)72.31%21.54%3.08%3.08%
25–27(78)19.23%66.67%7.69%6.41%
27–29(36)8.33%27.78%50%13.89%
>29(49)10.20%14.29%12.24%63.27%
Table 5. Bayesian network posterior probability.
Table 5. Bayesian network posterior probability.
Exploitation LearningExploration LearningPosterior Probability
Two-Level Classification ModelFour-Level Classification Model
≤11≤10≤2781.28%≤2549.57%
>278.72%25–2741.44%
27–295.71%
>293.29%
≤1110–12≤2781.28%≤2542.03%
>2718.72%25–2739.46%
27–297.54%
>2910.96%
≤11>12≤2756.30%≤2515.80%
>2743.70%25–2741.06%
27–2911.10%
>2932.04%
11–12≤10≤2778.43%≤2536.66%
>2721.57%25–2741.26%
27–2911.91%
>2910.17%
11–1210–12≤2756.74%≤2527.60%
>2743.26%25–2728.16%
27–2918.79%
>2925.45%
11–12>12≤2728.93%≤2510.74%
>2771.07%25–2719.80%
27–2926.59%
>2942.87%
>12≤10≤2761.98%≤2519.91%
>2738.02%25–2743.37%
27–2917.01%
>2919.71%
>1210–12≤2737.30%≤2511.32%
>2762.70%25–2726.08%
27–2930.98%
>2931.62%
>12>12≤2721.37%≤253.04%
>2778.63%25–2716.95%
27–2925.53%
>2954.48%
Table 6. Feature importance ranking of two-level classification model.
Table 6. Feature importance ranking of two-level classification model.
StatePrior
Probability
Posterior
Probability
MBMRank
Sharing behavior≤150.35090.11250.28891
≤180.48680.4955
>180.16230.5676
Task interdependence≤110.44740.19610.27512
≤120.35960.4146
>120.19300.7045
Exploration learning≤100.42980.19390.26163
≤120.35960.4024
>120.21050.6875
Sharing willingness≤120.58770.25370.24724
≤130.13600.6129
>130.27630.5079
Exploitation learning≤110.37720.16280.23485
≤120.32020.4110
>120.30260.5942
University category10.53070.41320.10486
20.10530.4583
30.36400.2892
Grade≤20.39470.28890.09307
≤30.33330.4605
>30.27190.3871
Times10.35090.30000.07628
20.32890.3867
30.32020.4384
Team result10.06580.33330.03109
20.50000.3860
30.43420.3636
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, X.; Liu, J.; Zhang, C.; Shao, D.; Cai, Z. Innovation Performance Prediction of University Student Teams Based on Bayesian Networks. Sustainability 2023, 15, 2335. https://doi.org/10.3390/su15032335

AMA Style

Zhang X, Liu J, Zhang C, Shao D, Cai Z. Innovation Performance Prediction of University Student Teams Based on Bayesian Networks. Sustainability. 2023; 15(3):2335. https://doi.org/10.3390/su15032335

Chicago/Turabian Style

Zhang, Xueliang, Jiawei Liu, Chi Zhang, Dongyan Shao, and Zhiqiang Cai. 2023. "Innovation Performance Prediction of University Student Teams Based on Bayesian Networks" Sustainability 15, no. 3: 2335. https://doi.org/10.3390/su15032335

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop