1. Introduction
The contemporary era of Artificial Intelligence (AI), characterized by unprecedented advances in machine learning algorithms and large language models, has pervaded many decision-making domains [
1], enabling AI systems to imitate and augment human cognitive functions in reasoning, learning, and problem solving [
2,
3,
4]. As this technology has matured, the healthcare sector has emerged as a particularly promising domain for AI applications [
5], witnessing a revolutionary advancement in complex medical reasoning and decision support [
6,
7]. In this vein, Chatbots, also known as conversational agents or virtual agents, represent a type of AI application designed to simulate human-like conversations with users in natural language, providing information or services in specific domains [
2,
3,
4]. The extant literature of healthcare services broadly acknowledges the superiority of chatbot applications in healthcare services over other AI technological alternatives, particularly in reshaping healthcare service delivery models by enhancing accessibility, personalization, and efficiency [
5,
8,
9], while simultaneously recasting patient education, interaction, and engagement with health information [
10,
11,
12].
In the extant literature, chatbot applications in the health services domain are neither novel nor marginal; rather, they have gained ground in the existing body of literature, reflecting a sustained and evolving scholarly interest in their potential and applications [
13,
14,
15]. Empirical studies appear to affirm the growing role of chatbots in promoting public health messages to enhance vaccine literacy and mitigate vaccine hesitancy [
16,
17,
18,
19] improving diabetic patient education [
20,
21], providing emotional support for patient [
22,
23], assisting in remote mental health monitoring [
24,
25,
26], aiding tobacco cessation [
27,
28,
29,
30], enhancing drug compliance, and alleviating symptoms of depression and anxiety [
7,
31,
32].
A great achievement of the mainstream research has been to convey the message that chatbots have the toolkit to assist users with various healthcare services yet remains less conclusive regarding the extent to which users find them useful for delivering context-sensitive healthcare service advice, which needs to be thoroughly evaluated [
14,
33,
34,
35]. For instance, prior studies have been grounded in adoption driven studies, predominantly informed by Technology Acceptance Model (TAM) and Unified Theory of Acceptance and Use of Technology (UTAUT) perspectives, and focuses primarily on users’ behavioral intentions [
36,
37,
38,
39,
40,
41,
42,
43,
44], leaving limited explanation on how healthcare chatbots function as effective service systems for use in practice. Others have gone so far as to demonstrate benefits and utilization of AI-powered chatbots in various healthcare environments [
5,
35,
45,
46,
47], but how effectively it is utilized to provide appropriate service support to the users remains an unresolved concern in the extant literature.
With a bolder move, some scholars argued that we are overwhelmed by contributions that produced a proliferation of research on users’ behavioural intentions and acceptance studies [
35], we nonetheless produce little, if at all, clear understanding about how core information system attributes contribute to user satisfaction, usage, and overall system success when engaging with AI-powered healthcare services [
14,
48,
49,
50,
51]. Against this backdrop, this paper penetrated the ‘black-box’ of the factors implicated in determining the success of chatbot applications in healthcare services; an area that has hitherto been overlooked by the bulk of scholars. Following a thorough analysis of the literature, this paper seeks to fill the gap by adopting the modified DeLone and McLean Information System Success Model (D&M ISS Model) [
52] to examine the mechanisms through which core information system attributes drive users’ satisfaction and subsequent use, which together serve as key indicators of overall system success in healthcare chatbot services.
While the original D&M ISS model emphasizes constructs like perceived usefulness, system quality, and service quality, it largely overlooks the role of trust. However, in the context of healthcare services, omitting trust would be a critical limitation, as it has long been fundamental to healthcare delivery, underpinning users and provider relationships and shaping reliance on healthcare outcomes [
53,
54]. In this vein, as chatbots become increasingly embedded in healthcare workflows, establishing trust from both patients and providers is essential for adoption and meaningful use [
55,
56]. Prior studies articulated that trust functions as a central mechanism that introduces perceptions of safety and credibility beyond what quality dimensions alone can ensure [
57,
58,
59]. Users may acknowledge a chatbot’s technical quality yet still require trust before engaging in repeated use or acting upon its health-related recommendations [
60]. Accordingly, trust operates as a critical mechanism through which system quality dimensions translate into user satisfaction, continued use, and net benefits. On this basis, this study was undertaken to examine the underlying mechanisms that drive users’ behavioral intention and subsequent use of healthcare chatbots and poses the following question: What are the effects of information quality, system quality, service quality, and trust on use, user satisfaction, and net benefits in the context of healthcare chatbot services?
The primary objective of this study is to evaluate the success of the Healthcare Chatbot Service from the user’s perspective by using the D&M ISS Model as a guiding framework for this research [
52]. Recognized as a type of IS, the healthcare chatbot service can be assessed using the D&M ISS Model [
61]. Numerous studies have employed and validated the D&M ISS Model across various domains. This model has been extensively tested in the context of e-commerce systems [
62], e-government systems [
63,
64], and technology adoption in tourism [
65]. The adaptability and robustness of the D&M ISS Model have also been demonstrated in several health IT success projects in both developing and developed nations [
63,
66,
67,
68,
69]. The D&M ISS Model provides a practical framework for evaluating the effectiveness of health IS [
70].
The enhancement of the original model, through the addition of the service quality variable, addresses the evolving nature of IS and the consequent shifts in the definition of “success” [
71]. This modification underscores the necessity of incorporating service quality when evaluating IS, as previously advocated by several scholars [
72]. Consequently, the updated D&M ISS model posits that three core components—service quality, information quality, and system quality—collectively influence system utilization and user satisfaction, thereby explaining the success of the IS platform [
73]. The D&M ISS Model delineates six interconnected constructs of IS success, with a particular emphasis on the quality dimensions (service quality, information quality, and system quality). These dimensions are critical as they potentially affect users’ intent to use, actual usage, and overall satisfaction. The model posits that user satisfaction derived from these quality dimensions can lead to net benefits, which in turn reinforce continued use of the information system. The next section elaborates more on these factors and posits the research hypotheses for this study.
3. Materials and Methods
A structured survey was developed to evaluate the relationships within the research model, containing indicators directly related to each construct. These indicators were adopted from pertinent literature [
52,
63,
64]. The measurement items were tailored to specifically assess users’ experience and perceptions regarding the Ministry of Health (MOH) chatbot service within its functional context. The chatbot integrates automated responses with optional live chat support and provides key services, including locating primary healthcare and on-duty centers, managing users’ appointments, directing users to teleconsultation platforms, providing guidance on healthcare procedures (e.g., medical leave application and timelines), delivering prescription-related services, and connecting users to service representatives for inquiries, complaints, and reports.
3.1. Pilot Study
Prior to the main data collection, a pilot study was conducted with 10 experts who hold a master’s degree or higher. The pilot’s aim was to evaluate the clarity, reliability, and face validity of the survey items through an iterative process. Because the items were adapted from established measures in prior studies, the primary aim of the pilot was to ensure that the wording was clear, the items were understandable, and the instrument was suitable for the target participants. Feedback from the experts led to minor revisions in the item wording to improve clarity and comprehensibility. No major changes to the instrument’s structure were required.
3.2. Measurement Items
Each of the seven constructs in our research model was measured using four items, selected and adapted from established scales in previous studies. The precise source for each construct’s items is provided in
Table 1.
All items were measured using a seven-point Likert-type scale, ranging from (1) “Strongly Disagree” to (7) “Strongly Agree”. Additionally, a demographic section was included to capture gender, age, and education level.
3.3. Sampling Method
We employed a non-probability sampling method, specifically convenience sampling, to recruit participants from the Kingdom of Saudi Arabia. This approach was chosen because the target population was not centrally registered and could only be reached via online platforms [
123,
124]. However, it may introduce selection bias and limit the extent to which the sample can be considered representative of the broader population [
125,
126]. The study was advertised through social media and WhatsApp groups. The inclusion criteria required respondents to be at least 18 years old, Saudi residents, and to have used at least one healthcare chatbot service in the past. Any participants who did not meet these criteria were excluded from further analysis.
3.4. Sample Size
The minimum recommended sample size for Partial Least Squares Structural Equation Modeling (PLS-SEM) is commonly determined using the “10-times rule,” which advises that the minimum sample should be ten times the largest number of indicators used to measure any single construct, or ten times the largest number of structural paths directed at any endogenous construct [
127,
128].
In our model, each construct is measured by 4 items, and our structural model includes 7 constructs and 10 hypotheses. The maximum number of predictors pointing to any construct is 6. Therefore, the minimum sample size requirement is 60. However, in most management, health informatics, and IS studies using PLS-SEM, a sample size between 200 and 300 is generally considered robust [
129].
3.5. Data Collection
Data collection was performed via an electronic survey built with Google Forms. The survey link was distributed between July and September 2024, accompanied by an invitation message specifying the study’s purpose and inclusion criteria. The survey began with an introductory section outlining the study’s purpose and assuring participants that no personally identifiable information would be collected. Participation was entirely voluntary, with respondents able to withdraw from the survey at any time without penalties.
3.6. Ethical Considerations
Ethical approval for this study was waived by the Research Ethics Committee at King Abdulaziz University, which granted an exemption under Reference No. (37-25). Participation was completely voluntary, and all potential participants were informed they could withdraw at any time without providing a reason or facing any penalty. No identifying information was collected, and all survey data remained anonymous. Participants did not receive any compensation for participating in the study. The research was funded by the KAU Endowment (WAQF) at King Abdulaziz University, Jeddah, Saudi Arabia. The funder had no role in the study design, data collection, data analysis, interpretation of results, manuscript preparation, or the decision to submit the manuscript for publication.
3.7. Quality and Analysis
Collected responses were downloaded and labeled using IBM SPSS Statistics (Version 29) for descriptive analysis. Responses were screened for incompleteness, duplication, extreme values, and outliers. No evidence of significant bias or systematic data issues was detected in the retained responses. The data were subsequently processed using Smart PLS 3.3 software for structural equation modeling analysis. Smart PLS was selected because it is a suitable approach for examining a model with multiple latent constructs, reflective indicators, and a relatively complex structural framework. Indices and factor loading thresholds were set according to established standards [
130,
131,
132,
133]. Detailed results of the analysis are presented in the following section.
4. Results
This study collected responses from 321 participants (n = 321), which exceeded the minimum requirements implied by the structural complexity of the model and is considered adequate for PLS-SEM estimation. To further justify sample adequacy, a post hoc power analysis was conducted using G*Power 3 [
134] for an F-test (R
2 increase) with 10 predictors. The analysis confirmed that n = 321 provides >0.99 power to detect small effects (f
2 = 0.02), 1.00 power for medium effects (f
2 = 0.15), and 1.00 power for large effects (f
2 = 0.35) at α = 0.05.
4.1. Demographic Data
The demographic composition revealed that 79.13% (254/321) of the respondents were male. The age distribution was as follows: 18–19 years old (24/321, 7.47%), 20–29 years old (145/321, 45.16%), 30–39 years old (69/321, 21.5%), 40–49 years old (43/321, 13.4%), 50–59 years old (23/100, 7.16%), and over 60 years old (17/321, 5.3%).
4.2. The Assessment of the Measurement Model
The updated guidelines for assessing the measurement model include calculating individual factor loadings, cross-loadings, Cronbach’s alpha, composite reliability, convergent validity (average variance extracted or AVE) [
132], inter-construct correlations, and latent variable scores.
Table 2 presents the findings of the measurement model analysis. The results indicate that the factor loading (FL) values meet the minimum required level (FL > 0.7). Composite reliability (CR) values indicate that all variables exhibit an acceptable level of reliability (CR > 0.7). Similarly, Cronbach’s alpha (α) values demonstrate that all variables maintain an acceptable level of reliability (α > 0.7). Additionally, the AVE values for all variables exceed the threshold of 0.5 (AVE > 0.5). These results confirm that the measurement model satisfies loadings, cross-loadings, reliability, and validity requirements.
The Fornell–Larcker criterion stipulates that the square root of AVE for each latent variable should be greater than the correlations with other variables [
135]. Thus, discriminant validity is established for all constructs.
Table 3 presents the outcomes of the Fornell–Larcker criterion.
In addition, the HTMT results indicate that most construct pairs meet the recommended threshold. However, the values between Satisfaction and Net Benefit, Use and Satisfaction, Service quality and Satisfaction, and Use and Net Benefit exceed 0.90, suggesting possible overlap among these constructs and warranting cautious interpretation.
Table 4 presents the outcomes of the Heterotrait–Monotrait Ratio (HTMT).
4.3. Testing of the Structural Model
The structural model assessment follows the updated guidelines provided by [
132]. The assessment applies the following threshold values:
Table 5 presents the statistical analyses of direct, total, and effect size. The path coefficient values quantify the relationships between each independent variable and the dependent variables. The analysis identified two variables with path coefficient values lower than 1.65 (
p > 0.05), indicating non-significant relationships. Consequently, hypotheses H2 (
p = 0.234) and H3 (
p = 0.289) are not supported. Conversely, system quality has a significant positive impact on satisfaction (
p = 0.018), supporting H4. Information quality has a significant positive impact on use (
p = 0.003), supporting H1. Similarly, service quality has a significant positive impact on both satisfaction and use (
p = 0.001), supporting H5 and H6. Finally, satisfaction has a significant positive influence on net benefits (
p < 0.001), trust has a significant positive influence on use (
p < 0.001), and intention to use has a significant positive influence on both net benefits and satisfaction (
p < 0.001), supporting hypotheses H7, H8, H9, and H10.
Figure 2 depicts all constructs with their respective path coefficients.
Effect size measures the impact of each predictor on the dependent construct [
138]. While significance tests indicate whether a relationship exists, effect sizes convey the magnitude of these effects [
127]. At the structural level, the effect of the predictor is categorized as large, medium, or small if F
2 is 0.35, 0.15, or 0.02, respectively [
139].
The R2 analysis revealed that the latent variables representing quality and trust explained a substantial 75% of the variance observed in user satisfaction. Additionally, the combined effects of quality and intention to use explained a high 86% of the variance in satisfaction. Finally, both intention to use and satisfaction jointly accounted for 85% of the variance in the net benefit dependent variable. These findings suggest a strong explanatory power of the model for predicting user satisfaction and net benefits.
A high Q2 value (>0) for endogenous variables signifies the model’s ability to explain a substantial proportion of the variance observed in these variables, exceeding the contribution of random error.
In this study, all three endogenous variables exhibited VIF values greater than 4 and tolerance values less than 0.25, indicating the potential presence of multicollinearity [
140,
141]. However, prior D&M ISS-based studies show that user satisfaction and net benefits are closely related, with user satisfaction often functioning as an important antecedent of net benefits [
142,
143,
144]. Thus, the model was retained to maintain theoretical fidelity to the original framework, with the elevated VIF levels acknowledged as a limitation regarding parameter precision rather than a fundamental structural distortion [
145].
Table 6 shows all the variance inflation factors.
5. Discussion
This study set out to examine the determinants of success for healthcare chatbot services through the lens of the D&M ISS Model. The discussion that follows interprets the empirical findings in light of the model’s theoretical propositions and articulates how the study’s findings contribute to both theoretical refinement and practical understanding of chatbot service success in healthcare—emphasizing the contextual dynamics that govern user trust, system use, and the realization of net benefits.
The demographic characteristics of the respondents are presented first to provide context for interpreting the findings. Although the sample was skewed toward male respondents, this characteristic should be considered when interpreting the findings. The results still offer useful evidence from the surveyed participants, while the gender imbalance limits the extent to which the findings can be generalized [
146,
147].
Drawing on SmartPLS Structural Equation Modeling (SEM), ten hypotheses were tested to evaluate the relationships among the model’s key dimensions. At the outset, the study results revealed that eight out of ten hypotheses were supported, as shown in
Table 2. Among these, information quality emerged as a pivotal construct influencing users’ intention to use chatbot services. Such a finding, on one hand, underscores that enhancing the accuracy, relevance, and reliability of information provided to users significantly increases their willingness to rely on chatbot-based health services. On the other hand, this finding is in sympathy with previous findings in the mainstream literature [
63,
76], reaffirming that perceived information quality remains central to the success of chatbot services in the healthcare context.
While prior studies have linked increased user satisfaction to higher levels of information quality [
82,
85,
87,
111], this study diverges from that established relationship. Indeed, the findings of this study assert that information quality did not exert a significant influence on user satisfaction with healthcare chatbot services and instead indicate that it might be specific experiential factors that have become necessary for users’ satisfaction more than others that seemed to depend upon users’ perceived system quality in terms of responsiveness and chatbot fluency. This outcome aligns with [
83], who similarly observed that information quality alone does not guarantee user satisfaction. It also aligns with empirical evidence from [
148], who reported similar results in student information systems, where baseline informational expectations were adequately met. The lack of significance may reflect the specific characteristics of healthcare chatbot services, where users expect accurate medical information as a basic requirement rather than a differentiating factor for satisfaction. When information quality meets minimum standards of accuracy and relevance, additional improvements may yield diminishing returns on satisfaction, particularly when relational factors like trust and service quality dominate the user experience. Therefore, this study contributes to the IS success literature by highlighting that, in chatbot service applications in healthcare contexts, satisfaction may be driven less by informational quality and more by experiential qualities that shape users’ engagement with chatbot services. This underscores the need for context-specific adaptation of the D&M ISS model when applied to AI-mediated services, where technical quality dimensions may play secondary roles relative to human-like interaction capabilities.
The findings in this study throw a spotlight on the pivotal role of system quality in shaping users’ behavioral intention and satisfaction with healthcare chatbot services. Users perceive that the overall quality of the chatbot system, as reflected in its reliability, response speed, and technical stability, directly influences their willingness to engage with the technology. This outcome is in alignment with the principle that the easier the system is to use, the greater the likelihood that users will adopt and continue to use healthcare chatbot services. Here, the mainstream research is dominated by a high proportion of studies that envisage system quality as a significant predictor of both user satisfaction and intention to use, echoing the propositions of D&M ISS Model [
74]. Thus, the findings of this study encapsulate the mainstream thinking and assert that enhancing system quality does not merely improve technical efficiency but also strengthens user-perceived value, ultimately fostering continued system engagement.
Contrary to expectations, system quality did not significantly influence the intention to use healthcare chatbot services. This finding highlights that providing a high-quality system alone may not be sufficient to drive the adoption of chatbots in healthcare services. While healthcare chatbots are normally developed to offer personalized services, users may prioritize other factors—such as trust and ease of use when deciding whether to continue engaging with these systems. This result is consistent with [
149], who explicitly reported that system quality has no significant effect on system use in their analysis of government information systems. Similarly, ref. [
150] confirmed that system quality does not significantly influence behavioral intention to use online services, suggesting that technical attributes become less salient when trust and service interactions dominate adoption decisions. The findings in this paper showed instances in which users adapted to basic system functionalities and prioritized relational factors, such as trust and service quality, over incremental improvements in technical performance. Once minimum usability thresholds are met, additional system quality enhancements may yield limited returns in terms of adoption intention, particularly in healthcare contexts where perceived reliability and human-like interaction outweigh technical sophistication. This pattern reinforces the context-dependent nature of D&M ISS model relationships and highlights trust as a critical mediator in chatbot healthcare applications.
Nevertheless, the results confirm that service quality remains an important determinant of user satisfaction, consistent with prior research. This indicates that satisfaction with healthcare chatbots depends not only on service efficiency but also on the perceived quality embedded in system interactions—elements that reinforce trust and long-term acceptance of chatbot in the healthcare environment.
Trust emerged as a strong predictor of intention to use healthcare chatbot services. The findings of this study reveal that trust exerts a strong effect on usage intention, consistent with recent systematic reviews [
80] and prior IS success literature. Indeed, the absence of trust often leads to hesitation in using technology in the healthcare context, underscoring its indispensable role in sustaining continuous use. Additionally, the findings suggest that trust reinforces user satisfaction, as users who perceive chatbot interactions as credible are more likely to evaluate the service positively. This outcome indicates that fostering trust among chatbot users could enhance both their intention to use and overall satisfaction with healthcare chatbot services.
Furthermore, the results demonstrate that intention to use has a direct impact on net benefits, corroborating recent findings in the IS success literature [
115,
116]. Likewise, user satisfaction emerged as a strong predictor of net benefits, advocating the notion that sustained engagement with chatbot services is essential for realizing meaningful value for users. As previously emphasized by [
113] that the specific benefits to be assessed are contingent upon the specific system and level of impact under examination, the finding of this study suggest that when users perceive value and satisfaction in their interactions, they are more likely to translate this experience into tangible benefits both at the individual level (e.g., service accessibility) and the organizational level (e.g., enhanced efficiency and users engagement).
As alluded to before, a great achievement of the extant studies has been the conceptualization that surmises the success of healthcare chatbot services in assisting users with various healthcare tasks. Indeed, the nature of success was taken for granted in the extant literature and somehow passed unnoticed, with no questions examining the extent to which different factors influence chatbot success. Against this backdrop, findings from this study demonstrate that the primary objective of evaluating the success of a healthcare chatbot service from the user’s perspective using the D&M ISS Model was informed by the reported results. Specifically, the result empirically indicated that chatbot success is not merely underpinned by technical quality alone, but by an integrated process in which information quality, service quality, system quality, and trust shape users’ engagement with this type of system, which in turn drives their satisfaction and perceived net benefits. Accordingly, the study provides empirical support for conceptualizing user-perceived chatbot success as a multidimensional construct encompassing quality perceptions, behavioral use, experiential satisfaction, and realized benefits.
6. Conclusions
This paper was concerned with the principal research question of: What are the effects of information quality, system quality, service quality, and trust on use, user satisfaction, and net benefits in the context of healthcare chatbot services? Rather than merely assuming that chatbots are inherently successful in assisting users with various healthcare tasks, this research sought to unpack the underlying mechanisms driving such success. The paper delved into the “black box” of interrelated constructs to uncover how they collectively shape user experience and perceived chatbot effectiveness in the healthcare context. By empirically examining these constructs, the study addresses a gap in the literature, where prior research has largely overlooked the interactions among these constructs in explaining chatbot success within healthcare services. Contrary to theoretical expectations, information quality significantly predicted intention to use but not user satisfaction, while system quality significantly influenced satisfaction but not intention to use. These findings indicate that accurate, relevant information primarily drives adoption decisions, whereas technical performance contributes more substantially to users’ evaluative experience of the service. Trust emerged as the strongest predictor of intention to use, underscoring its pivotal role in healthcare chatbot adoption. Collectively, these results demonstrate that information quality and system quality operate through distinct pathways in the healthcare chatbot context, with implications for the contextual adaptation of the D&M ISS Model. Moreover, both intention to use and user satisfaction were found to have a direct impact on net benefits, confirming that continued usage and satisfaction are crucial pathways through which users derive value from chatbot services. The results provided meaningful insights into the determinants of chatbot success and the dynamics shaping users’ experience when engaging with chatbot services.
6.1. Contribution
As the findings emerged from this research were tied back to the extant literature, the contributions of this research are now highlighted and organized into the following points. First, the present study advances the theoretical understanding of the D&M ISS Model within the context of chatbot healthcare services. The paper unearths the interrelated systems and perceptual practices—notably trust, user experience, and quality integration mechanisms—that collectively constitute the enactment of chatbot success. These practices were found to shape how users internalize chatbot effectiveness and how such perceptions translate into satisfaction, use, and ultimately, net benefits. The paper argues for nuanced changes to the extant literature that has been reluctant to treat chatbot success beyond system dimensions—such as information quality, system quality, and service quality—as discrete predictors of success. Although not dismissing their importance, the study envisages chatbot success as emerging from the dynamic interplay among quality dimensions and trust, which together mediate users’ cognitive and affective responses to system use. This reconceptualization extends the D&M ISS Model by embedding it within the socio-technical realities of healthcare, offering a more holistic explanation of how users perceive and derive value from chatbot health services.
Second, the paper offers a theoretical reframing of IS success in intelligent medical care systems—illustrating that the realization of net benefits depends not only on technical excellence but also on users’ cognitive trust, and perceived relational quality of interaction. This integrated perspective broadens the explanatory scope of the D&M ISS Model and sets a foundation for future inquiries into the success dynamics of AI-driven, trust-sensitive healthcare innovations. Such a contribution, while contextualizing D&M ISS Model application within the rapidly evolving domain of healthcare chatbot services, also serves as theoretical groundings to bridge the traditional divide between system-centric measures of success and the human-centered realities of healthcare technology adoption.
Third, the application of the D&M ISS Model in the context of this study embodies a sense of a diagnostic and interpretive device for healthcare professionals in different healthcare contexts. While the D&M ISS Model was operationalized within the Saudi public healthcare sector, it offers a transferable analytical lens that invites healthcare professionals to embark on experiential insights to tailor chatbot success strategies to their unique environments. In practice, this means that the enactment of IS success in intelligent healthcare systems is contextually constructed, reflecting the interplay of technical, organizational, and human factors. In this sense, the study sheds light on the contextual conditions under which information quality, system quality, service quality, and trust interact to influence user satisfaction, system use, and net benefits.
6.2. Limitation
Several limitations were acknowledged in this study. First, the use of convenience sampling may have introduced selection bias. This non-probability design supports feasibility in an otherwise hard-to-reach population, but it also constrains external validity and limits generalization beyond the study sample [
125,
126]. Second, the sample was heavily male-dominated, with women underrepresented. Because prior research suggests that gender can shape technology-related perceptions and adoption behavior, the observed relationships may not be identical in a more gender-balanced sample [
146,
147]. We therefore avoid claiming that the results would remain unchanged under balanced gender parity. Instead, the gender imbalance should be viewed as a boundary condition on the interpretation of the findings. Third, although the sample size was adequate for PLS-SEM, statistical sufficiency does not resolve representativeness concerns. Future research should replicate the model using stratified sampling or a more balanced design across gender groups, and it may also be useful to test for measurement or structural differences across male and female respondents [
125]. Such work would help determine whether the present model is stable across subgroups and whether the effects observed here generalize more broadly. Finally, although the measurement and structural models met the main evaluation criteria, some VIF values exceeded the more conservative threshold of 4.0, suggesting a degree of multicollinearity among predictors that may affect the precision of the estimated relationships. While this does not necessarily invalidate the structural results, it indicates that the model should be interpreted with caution. Future research should therefore further examine this issue using larger and more diverse samples, and, where appropriate, apply additional diagnostic tests or alternative model specifications to assess the stability of the path estimates.