1. Introduction
Logistics management education deals with systems that are complex and changing [
1,
2]. Decisions about transportation, warehousing, and inventory are closely connected, and choices made in one stage of the logistics process often significantly affect other stages [
3,
4]. Because of these connections, delays, and trade-offs, students cannot rely on simple reasoning and linear thinking when analyzing logistics problems [
5,
6]. Thus, cultivating systems thinking becomes a central goal of logistics management education [
1,
7].
Systems thinking refers to the paradigm for observing and understanding things from a holistic, interconnected, and dynamic perspective [
8,
9]. It focuses on the interactions between parts and how these interactions produce an overall effect. For logistics practices, this competence is essential for analyzing system-wide consequences of operational decisions and for managing complex supply chain relationships [
1,
2,
5,
8].
With the rapid diffusion of generative artificial intelligence (GenAI), these tools are increasingly embedded into logistics education [
10]. However, exposure to advanced technology does not automatically lead to the development of higher-order cognitive abilities such as systems thinking [
11]. When learners treat GenAI outputs as ready-made answers, learning becomes fragmented and disconnected from deeper systemic understanding [
7,
12]. This suggests that the existence of GenAI may act as a double-edged sword. While offering immense data affordances, it may inadvertently undermine the active mental modeling capabilities required for systems thinking.
Despite the growing body of research on GenAI-supported learning, existing studies have primarily focused on issues such as technology adoption, general learning performance, or user perceptions [
10,
13,
14,
15,
16]. Limited attention has been paid to the cognitive and behavioral mechanisms through which higher-order thinking skills could be developed in the GenAI-supported learning context [
17]. Furthermore, most existing studies employ a linear analytical framework, directly treating technology use as an antecedent variable of learning outcomes [
18,
19], overlooking the nonlinear interactions between the environment, behavior, and cognitive processing [
20].
To bridge this gap, this study aims to explore how students behave in a GenAI-supported context to enhance their higher-order abilities [
10,
21,
22]. Self-Regulated Learning (SRL) theory offers a useful theoretical perspective for explaining this process [
23]. It emphasizes students’ active goal setting, monitoring of comprehension, and strategy adjustment throughout the task. These conscious regulatory behaviors increase the likelihood that students will develop systematic reasoning abilities when faced with complex or uncertain logistics problems [
24,
25,
26].
Furthermore, participating in complex logistics systems requires coordinating multiple interacting variables, feedback mechanisms, and delayed consequences. These demands place enormous stress on limited cognitive resources [
5,
27]. Cognitive Load Theory (CLT) provides a framework for understanding the allocation of these cognitive resources during learning [
12]. Learning processes supported by GenAI might contribute to systems thinking development if cognitive resources are allocated toward germane processing rather than toward managing extraneous cognitive load [
22,
28].
Accordingly, this study explores the underlying mechanisms of students’ systems thinking development within a logistics management learning environment supported by GenAI. We construct an analytical framework based on SRL, CLT, and the triadic reciprocal determinism configuration. First, we explore the influence of environmental conditions, behavioral regulation, and cognitive processes on cultivating students’ systems thinking, revealing potential mediating effects. Furthermore, we emphasize the interactions among these factors, providing a configuration explanation for the development of systems thinking. Within this framework, self-regulated learning is conceptualized as a central behavioral factor enacted by students in GenAI-supported settings, which captures goal setting, ongoing process monitoring, and reflective adjustment. Cognitive load is considered a personal-level cognitive factor used to characterize the allocation of students’ limited cognitive resources across different processing activities. Teacher GenAI scaffolding is introduced as a learning environment factor to characterize the role of instructional support in shaping cognitive processing. We argue that systems thinking is a higher-order learning outcome that emerges under specific conditions with combinations of behavioral regulation, cognitive processing and teacher scaffolding. Specifically, this study aims to answer the following overarching research question:
How do behavioral, cognitive, and environmental factors shape the development of students’ systems thinking in GenAI-supported logistics management education?
Building on the overarching research question, this study further addresses the following specific research questions:
Research Question 1: How does self-regulated learning influence systems thinking?
Research Question 2: How do extraneous and germane cognitive loads differently affect systems thinking?
Research Question 3: How does self-regulated learning shape the cognitive load allocation (extraneous vs. germane)?
Research Question 4: How does teacher GenAI-supported scaffolding influence students’ cognitive load allocation (extraneous vs. germane)?
Research Question 5: What combinations of these factors lead to high-level systems thinking?
With respect to the methodology, this study employs a hybrid analysis approach, integrating Partial Least Squares Structural Equation Modeling (PLS-SEM) and Fuzzy Set Qualitative Comparative Analysis (fsQCA). PLS-SEM is used to estimate the net effect between key constructs and assess the proposed structural relationships. fsQCA is used to reveal combinations of conditions associated with high- and low-level systems thinking, identifying multiple adequate configurations that may lead to the same outcome. The integration of these two methods contributes to a more comprehensive understanding of the phenomena studied, revealing the interactions between environmental, behavioral, and cognitive factors in GenAI-supported learning environments.
This study contributes to the existing literature in three ways. First, this study advances research on GenAI-supported learning by unpacking the cognitive and behavioral mechanisms underlying higher-order thinking development. The existing studies have typically viewed the use of GenAI as a direct antecedent of learning outcomes [
13,
14,
15,
21]. Limited research has examined the interactions among behavioral regulation, cognitive conditions, and instructional guidance during the learning process. In GenAI-supported learning environments, where information is generated rapidly and learning processes become more self-regulated, learning outcomes largely depend on self-regulation and cognitive load allocation. In this research, we integrate SRL and CLT, providing detailed explanations of how self-regulated learning processes, different types of cognitive load, and teacher scaffolding jointly influence the development of high-order abilities in GenAI-supported learning environments.
Second, we expand the scope of research on GenAI-supported learning by examining systems thinking as a key educational outcome. Existing research has explored the pedagogical potential and general academic performance of GenAI [
10,
13,
15], with limited empirical evidence of how it promotes the development of higher-order cognitive skills such as systems thinking. As systems thinking is crucial for understanding complex and dynamic decision-making in logistics management, this study empirically uncovers the key factors and pathways that promote systems thinking in GenAI-supported learning environments, offering new insights for logistics management education.
Third, we adopt a configurational perspective, while most prior GenAI education studies rely on net-effect models that assume symmetric and linear relationships between learning factors and outcomes. We indicate that systems thinking enhancement stems from a combination of multiple conditions, emphasizing the heterogeneity of developmental paths rather than focusing on a stand-alone average effect. The findings offer important references for course design and instructional guidance in GenAI-supported logistics management education.
3. Research Methodology
This study adopts a quantitative mixed-analytical research design to investigate how systems thinking develops in logistics management education under GenAI-supported learning. Data were collected through a questionnaire survey and analyzed using PLS-SEM and fsQCA. These two analytical approaches serve complementary purposes. While PLS-SEM is employed to identify average net effects across the sample [
54], fsQCA captures the configurational and asymmetric mechanisms through which different combinations of learning conditions give rise to similar levels of systems thinking [
20]. Their combined use provides a more comprehensive understanding of how SRL and cognitive load dynamics jointly shape systems thinking in logistics management learning within a GenAI-supported instructional context.
First, PLS-SEM was employed to assess the structural relationships among the latent constructs specified in the proposed model. As a variance-based structural equation modeling technique, PLS-SEM is well suited for prediction-oriented research, theory building, and models that incorporate multiple mediators or hierarchical component structures [
55]. In addition, this approach is robust to deviations from normality and performs adequately with moderate sample sizes, which makes it appropriate for survey-based educational studies [
56]. The analysis was conducted in two stages. The measurement model was evaluated to confirm internal consistency reliability, convergent validity, and discriminant validity. After establishing satisfactory measurement properties, the structural model was examined. Bootstrapping procedures were applied to test the statistical significance of the hypothesized paths, thereby providing empirical evidence to address research questions RQ1 to RQ4.
Second, fsQCA was conducted to complement the linear and net-effect logic of PLS-SEM by examining configurational pathways leading to high levels of systems thinking. Rooted in set-theoretic reasoning, fsQCA is well suited for identifying multiple sufficient combinations of conditions that jointly produce the same outcome [
20]. In contrast to regression-based approaches, fsQCA is designed to capture causal asymmetry and equifinality. It recognizes that multiple causal configurations can produce comparable outcomes rather than assuming a single net effect [
57]. In the present study, teacher GenAI scaffolding, self-regulated learning, germane cognitive load, extraneous cognitive load, and systems thinking were calibrated as fuzzy sets in accordance with established guidelines [
20]. Configuration analysis was then performed to identify distinct combinations associated with high systems thinking, thereby addressing RQ5.
3.1. Survey Instrument Design
All constructs in this study were measured using multi-item Likert-type scales. Items were rated on a five-point Likert scale ranging from 1 (“strongly disagree”) to 5 (“strongly agree”). To ensure content validity, the measurement items were derived from previously validated scales reported in the literature and adapted to the context of GenAI-supported logistics management learning. Consistent with prior research, most constructs in this study were specified as reflective measurement models, as the observed indicators were assumed to represent manifestations of their underlying latent variables [
56,
58]. For example, TAS has been adapted to capture the instructional guidance provided by instructors in the use of GenAI (e.g.,
“Instructors provide clear guidance on appropriate GenAI use at different learning stages”); GCL reflects the extent to which learners actively invest mental effort in understanding logistics concepts when interacting with GenAI (e.g.,
“When using GenAI, I invest substantial mental effort in analyzing underlying structures and relationships among logistics variables.”).
SRL was modeled as a second-order formative construct comprising three first-order dimensions: planning, monitoring, and reflection. This specification reflects the conceptualization that SRL stems from the combined action of distinct but complementary self-regulatory processes [
24,
26]. Each first-order dimension was operationalized reflectively using multiple items, while the second-order SRL construct was formed by these dimensions [
58].
The scale items were translated from English into Chinese by two researchers familiar with the research topic. Then, an independent bilingual researcher performed a back-translation from Chinese into English to ensure semantic equivalence between the original and translated versions. Any discrepancies between the two versions were discussed and resolved through joint review.
We conducted an expert validation procedure to further refine the instruments. To enhance content validity, four experts with substantial experience in logistics management education were invited to evaluate the questionnaire. The panel included a professor of logistics management who also serves as a college dean and has extensive experience in undergraduate instruction. The second expert was the chair of the Department of Logistics and Supply Chain Management, bringing in depth knowledge of industry practices and disciplinary development. The third expert, an associate professor of logistics management, specializes in scale development and validation. The fourth expert was a faculty member with extensive teaching experience in logistics management. These review experts came from different levels of organizations, and each evaluated the questionnaire items in terms of relevance, clarity, and representativeness to determine their suitability for measuring constructs in the field of logistics management. Based on their evaluations, we revised items that were vaguely worded or lacked precise concepts.
Based on the expert-reviewed questionnaire, we conducted a pre-test of understanding with 14 master’s students majoring in Logistics Engineering and Management. The purpose of the test is to examine the clarity and comprehensibility of the questionnaire from the students’ perspective. These students first completed a draft questionnaire and then provided detailed feedback on the wording, potential ambiguities, and whether each question could reflect their actual learning experience. We focused on checking whether core concepts were consistently understood and whether scale items conformed to their expected theoretical definitions. Based on their feedback, we revised overly technical terminology and vaguely defined statements. The complete list of measurement items for all constructs is provided in
Appendix A.
Following the adjusted scale, we conducted a pilot study with 52 anonymous respondents from the logistics management major. We assessed indicator performance using Cronbach’s alpha, composite reliability, factor loadings and cross-loadings. The pilot results demonstrated satisfactory internal consistency and acceptable factor loadings across all constructs, providing initial support for the scale quality and justifying its use in the subsequent full-sample analysis.
3.2. Data Collection
We collect empirical data through Wenjuanxing (
https://www.wjx.cn/ (accessed on 14 February 2026)), which is a widely used online survey platform in questionnaire survey research. To align with the research context, we used the platform’s paid customized distribution service to send questionnaires to students registered on the platform and tagged as Logistics Management majors. We also added a “What is your major?” item to the questionnaire for double confirmation. The final sample included respondents from eastern, central, western, and northeastern China. Participants were informed that the data would be used solely for academic research, all answers would be anonymous, and there were no right or wrong answers, encouraging them to answer based on the real learning experiences and perspectives.
The online survey had three parts. The first part included screening and identification questions, checking whether the participant had ever used GenAI in logistics management learning and when they first started to use it. These items are applied to confirm that participants had relevant experience in GenAI-supported learning and are not used as constructs in the analysis model. The second part collected demographic information, including age, gender, and geographic region. The third part included the main measurement items for the key constructs.
We apply several methods to ensure the quality of the collection. First, we set a minimum time limit for each page of the online questionnaire response. Participants who did not complete the questionnaire in this time were not allowed to continue. After collection, we checked and removed 28 responses that included duplicates, missing data, and invalid answers. Moreover, two attention check questions were placed randomly in the survey to evaluate respondent attentiveness. Thirty-one responses that failed attention checks were further excluded, resulting in 236 valid responses. We also calculated the required sample size for SEM analysis, using an online statistical power analysis tool (
https://www.danielsoper.com/statcalc/calculator.aspx?id=89 (accessed on 14 February 2026)). The inputs included 35 observed variables, 7 latent constructs, an expected medium effect size of 0.30, a significance level of 0.05 for a 95 percent confidence level, and a target statistical power of 0.85. The calculation showed that at least 187 cases were needed to test the proposed relationships with enough reliability. Our final sample included 236 responses, which is higher than the minimum size.
Figure 2 and
Table 2 illustrate the basic information of the respondents and their experience using GenAI. Regarding the geography, the largest group came from the eastern region, including Beijing, Shanghai, Zhejiang, and Guangdong. Other students came from the western region, such as Chongqing and Sichuan, and the central region, such as Hunan and Hubei.
4. Analysis Results
4.1. Common Method Bias
This study employed measures to mitigate concerns about common method bias. First, the questionnaire underwent expert review and small-sample pretesting to improve scale clarity. Second, respondents were informed that their answers were anonymous and confidential. The questionnaire instructions also explicitly stated that there were no right or wrong answers. These measures aimed to reduce evaluation bias and social expectation bias, encouraging respondents to answer truthfully [
59]. Third, we conducted Harman’s single-factor test, including all measures in an unrotated exploratory factor analysis. The first factor explained 27.2% of the total variance, well below the commonly used threshold of 40%, indicating that no single factor dominated and common method bias was not a serious problem in this study. Fourth, we also calculated variance inflation factors (VIFs) for all latent constructs to test possible multicollinearity. All VIF values were below 3.0, lowering the strict threshold of 5 [
54], indicating that multicollinearity is not a concern and further reducing the possibility that common method bias distorts the estimated relationships among the constructs [
60].
4.2. Measurement Model Assessment
The study tested reliability and convergent validity of the measurement model by following the guidelines of Hair et al. (2019) [
55]. The results are shown in
Table 3. All constructs show good internal consistency. Cronbach’s alpha values range from 0.805 to 0.879, which are higher than the threshold of 0.70. Composite reliability values range from 0.873 to 0.916, further confirming strong reliability across the constructs.
We also tested convergent validity and the results are listed in
Table 3. The average variance extracted for each construct ranges from 0.608 to 0.734, which is higher than 0.50. This shows that each construct explains more than half of the variance of its indicators on average.
Table 3 shows adequate internal consistency and convergent validity of the measurement model, supporting the suitability of the model for subsequent structural analysis.
Discriminant validity was assessed using a dual approach: the heterotrait–monotrait ratio (HTMT) and the Fornell–Larcker criterion (
Table 4 and
Table 5). First, the HTMT results indicate that most values fall below the common threshold of 0.90, demonstrating sufficient distinction among the reflective constructs [
61]. Although HTMT values for the higher-order SRL construct and its three dimensions exceed this threshold, this is theoretically expected, as SRL is conceptualized as a higher-order formative construct composed of these components. Second, as shown in
Table 5, the Fornell–Larcker criterion was satisfied across all first-order constructs [
62]. These results provide robust empirical support for the discriminant validity of the measurement model.
4.3. Structural Model Assessment
The structural model was analyzed using partial least squares structural equation modeling (PLS-SEM) with SmartPLS 4 to examine the hypothesized relationships among the study constructs. The analysis used the full sample of 236 valid responses. To test the significance and robustness of the path coefficients, the study used a nonparametric bootstrapping method with 5000 resamples. The study did not include control variables in the structural model because the main goal was to test the direct effects among the key theoretical constructs rather than to estimate net demographic influences.
Table 6 reports the results of the path analysis for the main effects specified in the proposed model.
As shown in
Table 6, SRL has a significant and positive effect on ST (β = 0.423,
p < 0.001), indicating that higher self-regulated learning is linked to higher levels of systems thinking ability. The magnitude of this effect is relatively large, suggesting that SRL plays a central role in promoting ST, which supports H1.
For cognitive load effects, ECL has a negative relationship with ST (β = −0.118, p = 0.095). Although the magnitude of this effect is relatively small, its direction aligns with theoretical expectations. Accordingly, H2 receives marginal empirical support. In contrast, GCL has a positive and significant effect on ST (β = 0.219, p = 0.026), indicating that meaningful cognitive engagement enhances systems thinking and therefore supports H3.
SRL has a negative relationship with ECL (β = −0.166, p = 0.022), showing that students with stronger self-regulation are better able to reduce unnecessary or distracting cognitive effort. In contrast, SRL has a strong positive effect on GCL (β = 0.550, p < 0.001), indicating the key role of SRL in increasing meaningful cognitive effort. Thus, H4 and H5 are supported.
TAS has a significant negative effect on ECL (β = −0.200, p = 0.025). This result shows that effective teacher GenAI scaffolding reduces extraneous cognitive load, supporting H6. TAS also has a positive but weak effect on GCL (β = 0.132, p = 0.073). Although the effect size is modest and reaches only marginal significance, its positive direction suggests that teacher GenAI scaffolding may facilitate germane cognitive processing. Therefore, H7 receives marginal support.
The effect size analysis further reveals the relative contribution of each predictor to the endogenous constructs. SRL has a large effect on GCL (f2 = 0.407) and a moderate effect on ST (f2 = 0.185), while its effect on ECL is small (f2 = 0.025). Other relationships, including GCL → ST (f2 = 0.049) and ECL → ST (f2 = 0.021), show small effect sizes.
We assessed the coefficient of determination (R
2) and predictive relevance (Q
2) of constructs, as shown in
Table 7. The results indicate that the model demonstrates satisfactory explanatory and predictive power. Specifically, the R
2 values show that the model explains 39.1% of the variance in ST and 37.9% in GCL, suggesting moderate explanatory power, as well as a weak 9.5% in ECL. We further conducted the blindfolding procedure with an omission distance of 7. All Q
2 values are greater than zero, indicating that the structural model has adequate predictive relevance.
Table 8 shows the bootstrap results of the mediation analysis. The indirect effect of SRL on ST via GCL was positive and significant (β = 0.120,
p = 0.020), and the 95% confidence interval did not include zero. This result indicates that SRL could enhance systems thinking by promoting relevant cognitive processing and sustained cognitive engagement, supporting the mediating effect of GCL. Conversely, the indirect effect of SRL on ST via ECL was not significant (β = 0.020,
p = 0.202). Although the indirect effect is positive, its small magnitude and lack of statistical significance suggest that reductions in extraneous cognitive load do not constitute a stable transmission mechanism linking SRL to ST. Similarly, neither GCL nor ECL mediates the relationship between the TAS and ST. The indirect effects of GCL and ECL on TAS and ST relationship are both non-significant.
To further explore the internal mechanisms of SRL, chain mediation effects involving different SRL stages were examined. The results indicate that the ECL-based chain mediation effects are not supported across all SRL stages. In contrast, all chain mediation paths through GCL are positive and statistically significant, indicating robust support for GCL-based mediation mechanisms. The indirect effects of SR-M → SRL → GCL → ST (β = 0.052, p = 0.021), SR-P → SRL → GCL → ST (β = 0.046, p = 0.020), and SR-R → SRL → GCL → ST (β = 0.047, p = 0.022) are all significant, with confidence intervals excluding zero. These positive indirect effects suggest that each SRL stage contributes to enhanced ST by increasing learners’ investment in germane cognitive processing. Accordingly, the GCL-based chain mediation effects are supported for SR-M, SR-P, and SR-R.
Taken together, the mediation results demonstrate a clear asymmetry between the two cognitive load pathways. While reductions in ECL do not form a significant mediating mechanism, increases in GCL constitute the primary cognitive pathway through which SRL and its constituent stages exert positive indirect effects on ST.
Figure 3 presents the final structural model, with R
2 values displayed inside the circles and the P values shown in parentheses.
4.4. Configurations Analysis: FsQCA Results
To complement the identification of net effects obtained from the PLS-SEM analysis, this study further employed fsQCA to uncover multiple equifinal configurations leading to high levels of systems thinking and to examine potential asymmetry [
20]. In the fsQCA setting, ST was specified as the outcome condition. Separate sufficiency analyses were conducted for high ST (fST) and low ST (~fST) in order to assess possible asymmetry. The analytical models were specified as
for the high-outcome analysis and
for the low-outcome analysis. All continuous variables were converted into fuzzy-set membership scores from 0 to 1 by using the direct calibration method. Based on earlier studies, the 5th percentile was set as full non-membership, the 50th percentile was set as the crossover point, and the 95th percentile was set as full membership. This method maps the original scores to fuzzy-set values in a smooth way and keeps the relative order of the cases. The calibrated data were then used in the necessity and sufficiency analyses.
Table 9 shows the necessity analysis results for high systems thinking (ST) and low systems thinking (~ST). Using the conventional threshold of 0.90 [
20], the results show that none of the stay-alone conditions, including SRL, GCL, ECL, and TAS, reach this level. Their opposite forms also do not reach this level for either high ST or low ST. This result demonstrates that the emergence of high- or low-level systems thinking cannot be attributed to any isolated factor. We should turn to a configurational setting, examining how combinations of conditions collectively produce results.
Then, we built a truth table and tested sufficiency using the Quine–McCluskey algorithm. This study refines the truth table with a frequency threshold of 3 and a consistency threshold of 0.80 to improve the robustness and interpretability of the results. These standards were set according to our sample size. We also removed solutions with PRI consistency below 0.50 to reduce contradictory patterns.
Table 10 shows the fsQCA solution and it finds two paths that lead to high ST. The overall coverage is 0.756 and the consistency is 0.821. These results show that the two paths are sufficient and reliable for explaining high ST. We can see that SRL and GCL appear in both paths, highlighting their function as shared foundational conditions in the development of high ST. The first configuration (S1: TAS + SRL
+ GCL) can be understood as a scaffolding–regulation–processing pathway. In this path, teacher GenAI support strengthens the positive effects of high SRL and high GCL, leading to high ST. This configuration demonstrates substantial explanatory relevance, with a raw coverage of 0.684 and a unique coverage of 0.179. The second path (S2: SRL + GCL + ECL) can be seen as a regulation-based compensation path, with a raw coverage of 0.577 and a unique coverage of 0.072. This path indicates that high ST can still develop even when high extraneous cognitive load is present, and this happens when students also show strong self-regulation and high germane cognitive effort.
Table 11 shows the fsQCA results for low systems thinking (~ST). Two equifinal configurations are identified, with an overall coverage of 0.587 and a consistency of 0.915. The absence of self-regulated learning (~SRL) appears in both paths, indicating that weak self-regulation is a basic condition for low ST. The two pathways leading to low ST do not represent simple inversions of those associated with high ST, indicating the presence of pathway asymmetry. The first path (S1: ECL + ~GCL + ~SRL) shows that low ST is more likely in poor cognitive conditions. In this path, a high extraneous load appears together with low germane processing and weak self-regulation. This configuration has a raw coverage of 0.499 and a unique coverage of 0.276. The second path (S2: ~ECL + ~SRL + GCL + TAS) also shows the key role of self-regulation. Even when extraneous load is low, and germane effort and teacher GenAI support are present, low ST can still occur if SRL is absent.
6. Theoretical and Practical Implications
6.1. Theoretical Implications
First, this study expands the theoretical boundaries of higher-order competence development in GenAI-supported learning environments. Existing research on GenAI in education largely focuses on outcomes such as motivation, engagement, and performance, while empirical studies on its impact on higher-order competence remain limited [
10,
63,
64]. Previous studies indicate that simply relying on GenAI use cannot improve higher-order competence [
21]. This study applies systems thinking as a core dependent variable, examining how behavioral, cognitive, and environmental pathways interact to influence systems thinking for logistics management students, providing new evidence for the cultivation of higher-order competence in a GenAI learning context.
Moreover, this study expands the explanatory power of SRL and CLT in GenAI-supported education. On the one hand, cognitive processing exhibits asymmetric effects. The positive driving force of deep processing willingness (i.e., GCL) outweighs the hindering effect of external cognitive load. This indicates that in GenAI-assisted learning, enhancing learners’ germane cognition is more crucial for the formation of systems thinking than simply simplifying the interface or task (i.e., ECL). On the other hand, this study finds that SRL is a core driving force for the development of systems thinking. Learners’ goal-setting, monitoring, and reflection determine whether the outputs of GenAI can be transformed into effective cognitive schemas.
Last, this study provides configuration insights through fsQCA. These nonlinear paths explain the diversity of antecedent combinations for learning outcomes, replenishing the single-variable determinism in previous studies. We find that even under the adverse condition of ECL, high SRL can also compensate for developing systems thinking by activating GCL, i.e., the compensatory effect. The comparative analysis from low systems thinking configuration further shows that its antecedents are not simply the reverse of those of high systems thinking. In particular, high-level teacher-assisted instructions and weak self-regulated learning are associated with low systems thinking, which indicates that instructional scaffolding may even limit the development of higher-order abilities if it cannot stimulate students’ self-regulation behaviors.
6.2. Practical Implications
From a practical perspective, these findings offer guidance for cultivating student systems thinking in GenAI-supported higher education. First, SRL is the most critical factor in enhancing systems thinking, and interaction with GenAI may even impair it in its absence. Therefore, teachers should shift the focus from the GenAI-generated solution to the regulation process of student–GenAI collaboration. For example, in the logistics case study assignment, teachers could require students to submit a Self-regulation Log as part of their final report. In this log, students must record: (1) the initial task breakdown (planning); (2) how they improved their prompts when GenAI provided false or unnecessary information (monitoring); and (3) how the GenAI output changed their final assessment of their cognitive model of the logistics system (reflection). This could prompt students to shift from passive acceptance to active regulating learning.
Second, this study finds that reducing ECL is insufficient to enhance systems thinking, and the key lies in activating GCL. Instructors should use GenAI to expose students to system-wide tensions rather than providing optimal answers. For example, mini-scenarios of trade-offs can be designed, requiring students to use the GenAI tools to generate strategies with conflicting objectives. Students must identify feedback loops and trade-offs between multiple objectives, forcing them to engage in deep processing and schema-building rather than seeking off-the-shelf solutions.
Third, the primary role of teacher scaffolding is to reduce interference, but it cannot replace students’ own efforts in promoting GCL. Therefore, teachers should focus on how to use scaffolding to help students filter out the illusory or noisy information generated by GenAI, creating a clean and efficient digital exploration environment. Following this line, teachers could design a task on GenAI-Output Verification and System Mapping. When students use GenAI to analyze a complex logistics network, the teacher could provide a set of criteria or a checklist specifically designed to help students identify common GenAI hallucinations regarding regional infrastructure, international regulations or specific maritime regulations. Moreover, teachers could require students to iteratively rewrite and evolve their prompts based on the quality of GenAI feedback, guiding them to incorporate specific systemic constraints into the revised prompt, such as lead-time volatility or cost–service trade-offs. This process encourages students to engage in in-depth mental modeling and active reflection toward the task objectives.
Finally, tiered interventions can be designed for different student groups. For students with weak SRL, it is essential to prioritize addressing their SRL weaknesses. For example, teachers could require students to write down the intention and logic behind the prompt before inputting it; to draw or create mind maps online while interacting with the GenAI; and to compare their initial questions with their final satisfactory questions to analyze why earlier questions failed to elicit in-depth answers. For students with high SRL, a moderate level of complexity can be tolerated in the targeted instructional scaffolding design. Even if the information output by GenAI is slightly redundant or complex, these students can transform it into learning motivation through the “compensation effect”. In these ways, teaching scaffolding design focuses on guiding students with different levels of SRL to engage in deep cognitive input, thus achieving a high level of systems thinking.
7. Conclusions
This study explores how to enhance systems thinking in GenAI-supported logistics management education. The integration of PLS-SEM and fsQCA provides a comprehensive understanding of these mechanisms. The PLS-SEM results reveal the linear relationships among key variables, indicating that self-regulated learning and germane cognitive load promote systems thinking, while extraneous cognitive load constrains it. In addition, teacher GenAI scaffolding helps reduce extraneous cognitive load. fsQCA findings complement the linear models by showing how the key variables identified in PLS-SEM combine into multiple pathways. Among them, stronger self-regulated learning and deeper cognitive engagement are fundamental conditions across multiple pathways, leading to high systems thinking. In contrast, the absence of self-regulated learning consistently leads to low systems thinking, even when instructional support or favorable cognitive conditions are present.
This study has some limitations that may guide future research. First, while the combined application of PLS-SEM and fsQCA enhances the explanatory power of the analysis, the lack of a time dimension in the cross-sectional survey data limits causal inferences. Future research could employ longitudinal or experimental designs to examine the causal mechanisms between factors. Second, this study used a self-report scale, which may introduce potential bias when assessing complex competencies such as systems thinking. Future research could incorporate objective behavioral data to improve the objectivity of the measurement. Third, this study focuses on enhancing systems thinking in logistics management. Whether similar configuration patterns could exist among disciplines with different knowledge structures warrants further investigation. Finally, the fsQCA model include a limited number of variables, and future research could incorporate more variables such as individual differences, task complexity characteristics, or different GenAI interaction patterns to construct a more comprehensive capability-development configuration model.