Smart-PLS 4.1.0.0 supported the analysis conducted in this research and applied structural equation modeling (SEM) as the primary analytical strategy. SEM is the best variance-based modeling technique used mainly in social sciences and management, as noted by Nitzl et al. [
65]. PLS-SEM was warranted given the capacity of the method to estimate causal models that may provide the maximum explained variance for endogenous latent variables [
66,
67]. MGA was also utilized to examine sub-group differences and estimate relationship heterogeneity in other contexts that were not examined through conventional regression approaches [
66,
67]. Analysis was conducted based on methodological standards outlined in Wong [
68] to ensure the proper estimation of path coefficients, reliability, and standard errors. For reflective measurement model item reliability, outer loadings of 0.70 and above were used as acceptable, meaning that the observed indicators had a good fit with the corresponding latent constructs.
4.2. Measurement Model
The first stage of the PLS-SEM process is the assessment of the measurement model, with reflective indicators being used to represent each construct. Key psychometric properties such as composite reliability, indicator reliability, convergent validity, and discriminant validity are assessed, as suggested by Hair et al. [
71].
Indicator reliability is the degree to which the variance of a measurable variable is captured by its corresponding latent construct. It is typically measured based on outer loadings in a manner such that values greater than 0.70 are found to be acceptable based on criteria by Wong [
68] and Chin [
72]. Yet, as Vinzi et al. [
73] observe, smaller values of loadings are not new to social science studies. Hence, such indicators with loadings of 0.40 to 0.70 would be deleted only if dropping them would appreciably increase the composite reliability or average variance extracted (AVE) of the construct, as recommended by Hair et al. [
74].
According to the procedure illustrated by Gefen et al. [
75], the end model dropped three items—PE5, TECH4, and RTC4—because their factor loadings were below the cut-off point of 0.50, as presented in
Table 3.
Cronbach’s alpha, rho_A, and composite reliability were utilized in this study to estimate reliability. As suggested by Wasko et al. [
76], coefficients for constructs like behavioral intention (BI), perceived ease of use (PE), perceived risk (PR), perceived usefulness (PU), and resistance to change (RTC) were more than 0.70 and showed very good internal consistency. For the rest of the constructs, reliability coefficients were moderately to highly consistent with previous empirical evidence [
71,
74]. The rho_A coefficient, which is between Cronbach’s alpha and composite reliability conceptually, was also above 0.70 for most of the constructs, consistent with the reliability results following Sarstedt et al. [
67] and Vinzi et al. [
73].
Convergent validity was confirmed by employing the average variance extracted (AVE), and all the constructs were found to have an AVE above the 0.50 mark recommended by Fornell et al. [
77]. Whenever the AVE was below this value, convergent validity was also deemed satisfactory if composite reliability was higher than 0.60, as given by Fornell et al. Discriminant validity was verified on the basis of the Fornell–Larcker criterion that demands that the square root of the AVE of each construct should be higher than correlations with all other constructs. This was met by all the constructs. Additional validation was performed on the basis of the heterotrait–monotrait (HTMT) ratio of correlations, where all were below the conservative threshold value of 0.85 as suggested by Vinzi et al. [
73]. The results are shown in
Table 4 and
Table 5 in more detail.
Item-level descriptives yielded full data on all measures (N = 608). All distributions were well-obedient for PLS-SEM: skewness −1.503 to +0.589 and kurtosis −1.329 to +1.414, within typical robustness thresholds for SEM. Some items had moderate negative skew (e.g., BI2, BI3; PR2–PR3), indicating greater endorsement, while RTC2 had the highest positive skew (0.589). The ranges observed were 2–5 or 3–5, without floor effects and minimal ceiling tendencies on individual items (
Appendix A,
Table A2). Cross-loadings were checked (
Appendix A,
Table A3), and it was found that each item loaded the highest on the construct for which it was intended, with the lowest margin to any cross-loading ≥ 0.10 and no cross-loading greater than its primary loading, asserting discriminant validity as well as HTMT and Fornell–Larcker standards. These distributional characteristics and the reliability/validity measures (CR > 0.80; AVE > 0.50; HTMT < 0.85) ensure the indicators’ redundancy for later structural analysis.
4.3. Structural Model
The structural model was assessed by examining the coefficient of determination (R
2), predictive relevance (Q
2), and statistical significance of the path coefficients, as suggested by Hair et al. [
71]. R
2 values of 0.431 for behavioral intention, 0.28 for resistance to change, and 0.388 for technostress were all within the acceptable limit (0–1), thus reflecting moderate explanatory power. As such, the Q
2 values reflected moderate to high predictive importance, with values of 0.358 for behavioral intention, 0.269 for resistance to change, and 0.388 for technostress. As
Figure 2 illustrates, the model accounts for 43.1% of BI, with PU and PE directly and positively influencing BI, TECH and RTC (β = 0.264) contributing additionally to BI, and PR not directly affecting BI but indirectly through TECH and RTC.
Hypothesis testing also complemented model validation, in which the statistical significance of the hypothesized construct relationships was ascertained. Path coefficient estimation through the bootstrapping procedure was congruent with conventional standards based on Hair et al. [
71]. Mediation effects were tested via the bias-corrected bootstrap procedure with 10,000 resamples, adhering to the methodological recommendations of Preacher et al. [
78] and Streukens et al. [
79]. The results of the structural model are summarized in an overview in
Table 6.
Perceived usefulness (PU) was significantly positively related to BI, β = 0.312, t = 8.39, p < 0.001, which was very strong evidence for H1. Likewise, perceived ease of use (PE) also significantly predicted BI, β = 0.123, t = 2.50, p = 0.006, which was in line with H2. The influence of perceived risk (PR) on BI, however, was not statistically significant, β = 0.050, t = 1.54, p = 0.061, and hence H3 was not supported. Technostress (TECH) was also found to have a large and significant positive correlation between it and BI, β = 0.138, t = 3.57, p < 0.001, with evidence for H4a. Likewise, resistance to change (RTC) also showed a large and significant influence on BI, β = 0.264, t = 6.22, p < 0.001, with evidence for H4b. These findings mean that both cognitive appraisals (PU, PE) and emotional barriers (TECH, RTC) were significant predictors of behavioral intention, while perceived risk was not an essential deterrent in this case.
4.3.1. Mediation Analysis
The indirect effects of perceived usefulness (PU), perceived ease of use (PE), and perceived risk (PR) on behavioral intention (BI) via technostress (TECH) and resistance to change (RTC) were also investigated with a bias-corrected bootstrapping procedure with 10,000 samples. All of the hypothesized paths of mediation were found to be statistically significant, as presented below in
Table 7.
Mediation analyses with BCa 95% CIs (two-tailed, 10,000 bootstraps) indicated that PU significantly directly affected BI (β = 0.312) and boasted two significant positive specific indirect effects via TECH (β = 0.037, CI [0.018, 0.060]) and RTC (β = 0.043, CI [0.024, 0.066]), summing to ≈ 0.080—a show of partial (complementary) mediation, with slightly higher RTC than TECH. PE also showed some partial mediation: a moderate direct path to BI (β = 0.123) and some indirect paths through TECH (β = 0.058, CI [0.032, 0.087]) and RTC (β = 0.103, CI [0.069, 0.142]), for a total indirect effect of ≈ 0.161; RTC again contributed the greater percentage. Conversely, PR was mediated only indirectly: the direct pathway with BI was not significant, but the indirect pathways through TECH (β = −0.015, CI [−0.026, −0.007]) and RTC (β = −0.045, CI [−0.067, −0.024]) were significant and negative (total indirect ≈ −0.060), where increased perceived risk reduces intention through lower readiness for technology and increased resistance to change, and RTC is the main mediator.
Perceived usefulness had strong indirect impacts on BI via both TECH (β = 0.037, t = 2.84, p = 0.002) and RTC (β = 0.043, t = 3.37, p < 0.001), supporting H5a and H5b, respectively. Since the direct impact of PU on BI was still significant, it reflects partial mediation via both technostress and resistance to change. Likewise, perceived ease of use had strong indirect effects on BI through TECH (β = 0.058, t = 3.43, p < 0.001) and RTC (β = 0.103, t = 4.70, p < 0.001), confirming H6a and H6b. Since the direct relationship from PE to BI was also significant, the results imply partial mediation by both the mediators. For perceived risk, both indirect routes were significant and in the negative: through TECH (β = −0.015, t = 2.59, p = 0.005) and through RTC (β = −0.045, t = 3.31, p < 0.001), confirming H7a and H7b.
As the direct impact of PR on BI was not statistically significant (
p = 0.061), the results above confirm full mediation through the two affective constructs. Briefly, the results indicate that technostress and change resistance mediating effects are substantial in the explanation of cognitive appraisals’ influence (usefulness, ease, and risk) on students’ behavioral intention to use mobile learning applications. Given the non-significant direct effect PR → BI (
p = 0.061), these findings are consistent with full mediation. For transparency,
Table 6 relabels the former “Total Effects” rows as “Total Indirect Effects” and additionally reports “Total Effects (Direct + Total Indirect)”.
4.3.2. Multi-Group Analysis
We confirmed measurement invariance through the MICOM procedure before conducting multi-group comparisons. Configural invariance was confirmed because the same indicators, data treatment, and algorithm parameters were employed for all groups, and the model specifications were equal. To test compositional invariance, we utilized the permutation test and compared the starting correlation of composite scores with the 5% quantile of the empirical distribution. In each of the groups and models, the initial correlation was larger than the 5% quantile, thus ensuring compositional invariance. Lastly, the equality of means and variances of composites was confirmed by testing whether the Mean Original Difference (MOD) and Variance Original Difference (VOD) were within the 95% confidence limits derived through permutation (p-values > 0.05). All reflective composites were within this, and this was evidence of full measurement invariance (Steps 1–3) across the research groups. We report the multi-group results as between-group differences (Δβ), where Δβ = β(Group A) − β(Group B). Signs indicate relative magnitude across groups, not the sign of any group’s path coefficient.
To check whether the structural relationships among the variables in the model were significantly different for male and female respondents, a multi-group analysis (MGA) was conducted. Statistical differences were indicated on several paths (
Table 8). The effect of technostress on behavioral intention (TECH → BI) was weaker in females (β = –0.298,
p < 0.001), indicating the more intense negative effects of technostress on adoption behavior by female students. On the other hand, men demonstrated significantly larger effects of perceived usefulness on behavioral intention (PU → BI; Δβ = 0.218,
p = 0.002) and on resistance to change (PU → RTC; Δβ = 0.233,
p = 0.003), which reflect more motivational and affective impact from perceived utility. In addition, perceived risk negatively predicted behavioral intention stronger in females (PR → BI; Δβ = –0.244,
p = 0.004), whereas perceived ease of use had a stronger positive effect in males (PE → BI; Δβ = 0.185,
p = 0.031). Finally, there was also a gender difference in the way perceived risk influences technostress (PR → TECH; Δβ = –0.143,
p = 0.039). These results imply that gender is a moderator of the direct relationships between the processing of technostress and perceived usefulness by the students in the mobile learning adoption model.
A multi-group test was used to examine whether age was a moderator of structural relationships between constructs in the model of mobile learning adoption. Statistically significant differences between age groups were present for several constructs. The influence of perceived ease of use (PE) on behavioral intention (BI) varied across age groups. In particular, the correlation was greater for the age range of 18–24 years than for 25–30 (Δβ = –0.386, p = 0.001), 31–40 (Δβ = 0.211, p = 0.036), and 40+ (Δβ = 0.255, p = 0.060) years. The findings indicate that ease of use would be a stronger predictor for younger users in creating adoption intentions for mobile learning technologies. Additionally, the behavioral intention impact of technostress (TECH) was significantly greater in the 25–30-year-old group than in the younger (18–24; Δβ = 0.294, p = 0.001) and older groups (31–40; Δβ = –0.419, p < 0.001). This indicates that there is higher stress-related impediment sensitivity in the 25–30-year-old group. Perceived usefulness (PU) had much stronger effects on decreasing technostress among the younger age group of respondents (18–24) than in the 25–30 age group (Δβ = –0.335, p = 0.003). The same pattern was observed in the perceived ease of use affecting technostress, where greater effects were observed in the 18–24 age group compared to the 25–30 age group (Δβ = 0.307, p = 0.004). Lastly, perceived risk (PR) had a stronger effect on resistance to change (RTC) on the oldest age group (40+) than on the other age groups. This was especially observed between 18–24 and 40+ (Δβ = 0.392, p = 0.001) and between 31–40 and 40+ (Δβ = 0.202, p = 0.026), and this shows that older learners resist mobile learning when they experience a higher risk.
To investigate whether the confidence of users to employ mobile learning applications as a moderator of relationships in the structural model, multi-group analysis was carried out across three levels of self-reported confidence (high, moderate, and low). Most paths were found to have substantive differences. The effect of resistance to change (RTC) on behavioral intention (BI) was more pronounced for high-confidence than for low-confidence (Δβ = –0.253, p = 0.003) and moderate-confidence (Δβ = 0.266, p = 0.028) participants, which means that confident users are more adversely affected by perceived behavior rigidity. Perceived usefulness (PU) had a greater influence on behavior intention among high-confidence users compared to low-confidence (Δβ = 0.186, p = 0.023) and moderate users (Δβ = –0.210, p = 0.016), suggesting that confidence in the usability of mobile applications has a greater influence on high-confidence users. There was also a difference found in the impact of perceived ease of use (PE) on resistance to change (RTC) between the high- and low-confidence groups (Δβ = –0.163, p = 0.043) and the low- and moderate-confidence groups (Δβ = 0.255, p = 0.021), meaning that ease of use can specifically lower resistance among less confident users. Also, technostress (TECH) had a negative impact on behavioral intention in low-confidence users much more compared to moderately (p = 0.010) and highly confident (p = 0.002) users, highlighting their susceptibility to stress factors when operating an app. Lastly, the impact of PE on behavioral intention was stronger in the high-confidence group than in the moderate-confidence group (Δβ = 0.249, p = 0.028), indicating that users with high confidence convert usability perceptions directly into intentions to use.
Multi-group analysis (MGA) across years of mobile learning application usage identified a set of large structural differences, but frequency of use identified no significant differences. In particular, the influence of perceived usefulness (PU) on resistance to change (RTC) was very contrasting for those with 1–2 years and less than 1 year of experience and less than 1 year and more than 2 years of experience (Δβ = –0.313, p < 0.001, and Δβ = 0.384, p = 0.007). Perceived ease of use (PE) was a better predictor of behavioral intention (BI) for the 1–2 year group than less experienced users (Δβ = 0.318, p = 0.001), but technostress (TECH) more negatively affected BI for the same group than for less experienced (p = 0.001) and more experienced (p = 0.050) users. Privacy risk (PR) affected BI stronger in the 1–2 year group than in the <1 year group (Δβ = 0.161, p = 0.019), and resistance to change (RTC) affected BI more negatively in the 1–2 year group than in both the <1 year (Δβ = –0.143, p = 0.034) and >2 year (Δβ = –0.277, p = 0.027) groups. Moreover, PR had a stronger impact on RTC in the 1–2 year user group compared to more beginners (Δβ = 0.187, p = 0.037), and PE influenced TECH stronger in the 1–2 versus >2 year category (Δβ = 0.426, p = 0.018). Lastly, PU was better explained for users with experience levels exceeding 2 years than for those with 1–2 years (p = 0.006) or <1 year (p = 0.001).
To investigate whether or not structural relationships were different among groups based on the frequency of application of mobile learning apps (i.e., rarely, sometimes, often, very often), a multi-group analysis (MGA) was performed. The findings indicated no statistically significant difference in path coefficients between frequency groups. This implies that the structural relationships between the constructs—perceived usefulness, perceived ease of use, perceived risk, technostress, resistance to change, and behavioral intention—are invariant irrespective of the frequency or otherwise with which students utilize mobile learning applications.