1. Introduction
Digital harassment has become a growing and deeply consequential challenge within contemporary higher education. As universities increasingly rely on digital and AI-enhanced platforms for teaching, communication, and assessment, students’ academic and social lives are now embedded within data-driven environments that shape visibility, interaction, and social participation [
1,
2].
Beyond being a peripheral online is associated with higher levels of cyberbullying, and related forms of digital harassment are widespread and severe: global evidence indicates that approximately 14–57% of students have experienced cyberbullying, depending on context and measurement approaches [
3]. Significantly, cyberbullying is strongly associated with severe mental health outcomes, including anxiety, depression, emotional distress, and social withdrawal [
4,
5,
6,
7].
Large-scale epidemiological research further shows that students exposed to cyberbullying are more than four times as likely to report suicidal thoughts and suicide attempts compared with non-victims, even after controlling for social and family factors [
3]. In parallel, digital harassment has been shown to undermine educational participation, contributing to reduced concentration, avoidance of online platforms, absenteeism, and disengagement from learning activities [
8,
9].
These findings demonstrate that digital harassment within AI-mediated educational environments is not merely a behavioural inconvenience but a serious threat to student safety, mental health, and educational continuity, raising urgent concerns about equity, well-being, and institutional responsibility in data-driven higher education [
3].
AI-driven systems—such as algorithmic recommendations, automated content ranking, behavioural profiling, and AI-supported communication tools—play an increasingly active role in structuring online interaction [
1]. Rather than serving as neutral infrastructures, these systems influence what students see, whom they interact with, and how long content or interactions remain visible [
2].
Consequently, digital environments do not merely host social behaviour; they actively organise it. Within higher education, this shift has profound implications for students’ exposure to online harm and their capacity to engage safely and sustainably in learning [
3,
4]. Although digital technologies have expanded access to education and collaboration, they have also intensified students’ exposure to digital harassment [
4,
5,
6].
Cyberbullying, cyber hate, sexual extortion, impersonation, and digital stalking are no longer marginal phenomena but persistent issues associated with higher levels that undermine students’ psychological well-being, sense of safety, academic engagement, and willingness to remain active in online learning environments [
7,
8,
9,
10]. Empirical research consistently links exposure to such harms with anxiety, emotional exhaustion, reduced concentration, lower academic performance, and, in severe cases, withdrawal from digital learning communities and diminished educational motivation [
7,
11].
Crucially, digital harassment in contemporary learning environments cannot be understood solely as the product of individual interpersonal behaviour. In AI-mediated contexts, algorithmic systems may unintentionally reshape exposure conditions by amplifying visibility, facilitating repeated or unsolicited contact, and extending the persistence and circulation of harmful content [
12,
13]. This sociotechnical dynamic aligns with scholarship in digital sociology and algorithmic governance, which emphasises that technological infrastructures actively configure social relations rather than merely reflecting them [
3,
14].
Accordingly, understanding digital harassment requires moving beyond individual-level explanations to examine how algorithmic mediation restructures the conditions under which harm becomes more likely, sustained, or difficult to escape [
15]. These challenges are particularly acute in developing and emerging countries, where disparities in technological literacy, uneven access to digital resources, and culturally embedded norms influence both vulnerability and reporting behaviour [
16,
17].
In such contexts, digital inequalities translate into stratified patterns of exposure, reinforcing existing social divides and producing uneven access to safe and inclusive online learning spaces [
11]. Universities operating within these environments therefore face heightened ethical and institutional responsibilities, especially under international commitments such as the Sustainable Development Goals—most notably SDG 4 (Quality Education) and SDG 5 (Gender Equality)—which call for safe, inclusive, and non-discriminatory educational settings [
18,
19].
Despite a growing body of research on cyberbullying and online harassment (e.g., [
20,
21,
22]), significant gaps remain. Much of the existing literature focuses on general social media contexts or individual behavioural factors associated with higher levels, often overlooking how technological literacy, social media engagement intensity, digital identity visibility, and cultural norms interact within AI-driven educational environments. Moreover, sociotechnical dimensions—such as algorithmic curation, platform governance, automated moderation, and data-driven interaction patterns—remain underexamined, particularly in higher education settings where digital participation is frequently compulsory rather than optional.
In addition, relatively few studies explicitly distinguish between established behavioural factors associated with higher levels of factors and the amplification associations introduced by AI-mediated interactions. As AI becomes increasingly embedded in learning management systems, communication platforms, and academic support tools, understanding how algorithmic mediation reshapes exposure to digital harassment is essential for developing associationive, context-sensitive prevention and intervention strategies.
To address these gaps, the present study examines university students’ exposure to digital harassment within AI-enhanced learning environments using an expanded Unified Theory of Acceptance and Use of Technology (UTAUT) framework. The study investigates five key associated factors—technological literacy and cybersecurity awareness, social media engagement intensity, digital identity visibility, AI-mediated interactions, and cultural norms and social expectations—and analyses how these factors jointly shape vulnerability to online harassment. It further examines the consequences of exposure for students’ mental health and continuity in e-learning, while exploring the moderating roles of academic specialisation and cultural context to capture disciplinary and socio-cultural variation associated with higher levels of patterns.
Accordingly, the inferential population of this study comprises both Saudi nationals and international students enrolled in Saudi Arabian universities, allowing examination of AI-mediated experiences within a shared institutional and technological environment. In this study, developing and emerging countries are defined as low-, lower-middle-, or upper-middle-income economies according to the World Bank’s income classification. Country-of-origin classifications are used solely for descriptive and analytical purposes. References to the Human Development Index (HDI) and the IMF World Economic Outlook (WEO) are included only for contextual background and do not affect sample inclusion, which is governed exclusively by World Bank income categories.
Saudi Arabia provides a shared host-country educational environment with relatively uniform digital infrastructure and institutional governance. At the same time, students’ countries of origin reflect diverse sociocultural and technological conditions relevant to the dynamics of digital harassment. By focusing on AI-mediated educational ecosystems, this study advances existing research beyond general online interaction contexts. It conceptualises digital harassment as a structural sociotechnical phenomenon associated with higher levels of vulnerability, inequality, and student well-being within contemporary higher education, shaped by behavioural practices, cultural constraints, and algorithmic mediation, offering a more comprehensive understanding of how digital societies are reshaping vulnerability, inequality, and student well-being within contemporary higher education.
3. The Research Methods
3.1. Research Population and Sample
The World Economic Outlook (WEO) classifies countries into two broad categories: advanced economies (e.g., the United States, the United Kingdom, Italy, and Japan) and emerging and developing economies (e.g., Saudi Arabia, Egypt, Qatar, and Nigeria) [
47]. In the present study, WEO terminology is used solely for descriptive contextualisation of global economic groupings. Country inclusion and analytical classification are governed exclusively by World Bank income categories, while Human Development Index (HDI) levels and IMF classifications are referenced only for contextual interpretation.
The Kingdom of Saudi Arabia serves as a regional hub for multicultural higher education and digitally mediated learning environments. During the 2024–2025 academic year, Saudi universities enrolled approximately 98,453 international students, in addition to Saudi nationals, representing a wide range of national and cultural backgrounds [
48].
Accordingly, this study focused on Saudi and international students enrolled in Saudi Arabian universities. Saudi Arabia was deliberately selected as a shared host country to ensure consistency in key institutional and technological conditions, including digital infrastructure, learning management systems (LMS), platform governance, and institutional policies regulating online learning and digital interaction. This design enables examination of exposure to digital harassment within a relatively uniform regulatory and technological environment, while preserving meaningful cross-national variation associated with students’ countries of origin.
Universities were included in the sample if they met the following criteria: (i) provision of AI-enhanced or digitally mediated learning environments, such as institutionally managed LMS platforms, online assessments, or AI-supported educational tools; (ii) enrolment of a substantial population of Saudi nationals and international students studying within the same Saudi Arabian higher-education context; and (iii) reliance on formally governed, institutionally managed digital systems rather than informal or ad hoc platforms for teaching, learning, and academic communication. These criteria ensured that all participating students were embedded in comparable AI-mediated educational ecosystems aligned with the study’s objectives.
The final analytical sample comprised 2185 students (991 males and 1194 females) originating from Saudi Arabia and 32 other developing and emerging countries (33 countries in total; see
Appendix A). This sampling context provides a unique opportunity to examine digital interaction patterns within a unified host-country environment while simultaneously capturing sociocultural heterogeneity linked to students’ national backgrounds. Such variation is particularly relevant because exposure to digital harassment is shaped by sociotechnical systems, culturally embedded norms, and differing expectations regarding online behaviour and reporting. Consequently, the use of a culturally heterogeneous sample enhances the study’s capacity to identify differential patterns of vulnerability across social and cultural groups.
Data were collected over an extended period, from January 2024 to November 2025, to facilitate recruitment of a large, culturally diverse sample across multiple institutions. Although data collection spanned several academic semesters, the study employed a cross-sectional design, as each participant completed the survey only once and no repeated measurements were obtained (see
Figure 1).
Consequently, the dataset captures a single observational snapshot of students’ experiences and perceptions rather than longitudinal change. (The dataset is available at the following link:
https://2u.pw/mRgWI (accessed on 18 January 2026))
To assess potential temporal heterogeneity arising from the extended data collection window, additional robustness checks were conducted by comparing key latent constructs across different collection phases. No statistically significant differences were observed across periods (all p > 0.05), indicating that the estimated relationships are stable and not driven by time-varying contextual conditions.
To ensure sufficient statistical power for the complex structural model, which includes moderation associations, power analysis was conducted using inverse-square-root and gamma-exponential methods, indicating a minimum sample size of approximately 150 for medium-sized associations, with alpha = 0.05 and power = 0.80 [
47]. The sample size achieved of 2185 participants exceeds the required number. Post hoc power analysis, using the smallest observed path coefficient (β = 0.083,
p = 0.219), confirmed a power greater than 0.95, verifying the sample’s adequacy for detecting minor to moderate associations, including moderator interactions [
48].
3.2. Moderating Constructs
In this study, Cultural Context (CC) is operationalised as a macro-cultural grouping variable based on respondents’ country of origin (demographic item). It is used exclusively for moderation and heterogeneity assessment [
49]. CC does not represent an individual-level scale score and is conceptually distinct from Cultural Norms and Social Expectations (CNSE), which captures respondents’ personal perceptions of reporting stigma, honour-related concerns, and social expectations surrounding digital harassment [
50]. CC was coded into three regional groups: Middle East, Africa, and Eastern Asia, with percentages calculated relative to the total sample (N = 2185) (see
Table A1).
Moderation by CC was examined using interaction terms in the primary PLS-SEM specification. To enhance the interpretability and robustness of cross-cultural comparisons, complementary group-based analyses were conducted using PLS multigroup analysis (MGA) after establishing measurement invariance [
48]. Measurement invariance was assessed using the MICOM procedure, including configural, compositional, and composite-mean and variance equality. Cross-group differences in structural path estimates were interpreted only after MICOM requirements were satisfied, using permutation- and bootstrap-based MGA procedures implemented in SmartPLS [
51].
Therefore, because respondents may be nested within universities and national contexts, ignoring within-cluster correlation could inflate Type I error rates. Although full multilevel SEM is not always feasible in variance-based SEM, this potential bias was explicitly considered. Accordingly, robust analytical strategies commonly recommended for clustered data were adopted to ensure that standard errors and statistical inferences were not unduly affected by the sample’s nested structure.
Furthermore, Academic Specialisation (AS) was operationalised as a categorical moderator based on students’ self-reported field of study. To ensure theoretical coherence and adequate group sizes for moderation analysis, individual majors were aggregated into three analytically meaningful clusters commonly used in higher-education research: (i) STEM (Sciences, Engineering, Computer Sciences, and Health-Related Disciplines), (ii) Social Sciences and Humanities (Education, Law, Arts, and Related fields), and (iii) Business and Applied Studies (Business Administration, and Related fields).
Moreover, these categories reflect systematic differences in digital workload, online visibility, and platform-mediated interaction intensity reported in prior digital-behaviour research. For PLS-SEM moderation, AS was dummy-coded using a binary contrast, with Social Sciences and Humanities as the reference group. This coding strategy enables direct interpretation of interaction associations as differences in structural path strengths relative to the reference category.
In addition, moderation hypotheses (H8–H12) were tested using the product-indicator interaction approach implemented in SmartPLS. All correlate constructs in interaction terms were mean-centred prior to construction to reduce multicollinearity and facilitate interpretation of conditional associations. Interaction terms were entered into the structural model only after the significance and stability of the corresponding main associations had been established. Likewise, the statistical significance of interaction paths was assessed using bootstrapping with 5000 resamples. In addition to path coefficients and
p-values, interaction association sizes (f
2) were computed to evaluate the substantive relevance of the moderation associations, following established PLS-SEM reporting guidelines [
52].
For all statistically significant interaction associations, simple slope analyses were conducted to visualise and interpret how the relationship between each correlation and Exposure to Digital Harassment (EDH) varies across levels of the moderator. Continuous correlates were plotted at ±1 standard deviation from the mean, while categorical moderators (academic specialisation and cultural context) were visualised using group-specific conditional associations derived from the interaction model. Hence, simple-slope plots were generated directly in SmartPLS using the interaction-plot function, and the latent-variable scores were cross-validated. (See
Figure 2 and
Figure 3).
3.3. Data Collection and Instrument Design
Data were collected using an anonymous online questionnaire designed to minimise social desirability bias and address the sensitive nature of reporting digital harassment. Participation was entirely voluntary, and respondents were assured that all responses would remain confidential. The survey link was distributed through official institutional email channels, with an invitation emphasising the study’s relevance to improving digital safety and informing the future of AI-enhanced higher education.
The questionnaire comprised three sections; the first collected demographic information, including gender, academic specialisation, and cultural context. The second section presented an informed consent statement outlining the study’s purpose, ethical safeguards, and participants’ rights. The third section included Likert-scale items measuring the constructs of the proposed theoretical model.
All measurement items were adapted from established and validated instruments to ensure conceptual accuracy and reliability. Technological Literacy and Cybersecurity Awareness (TL&CA) was measured using five items (TL&CA1–TL&CA5) adapted from [
27,
28,
37]. Social Media Engagement Intensity (SMEI) was measured using five items (SMEI1–SMEI5) adapted from [
29,
39]. Digital Identity Visibility (DIV) was evaluated using three items (DIV1–DIV3) as described in [
22,
29,
30,
53]. AI-Mediated Interactions (AIMI) was measured using four items (AIMI1–AIMI4) derived from [
4,
30,
32,
54]. Cultural Norms and Social Expectations (CNSE) was assessed using five items (CNSE1–CNSE5) adapted from [
1,
9,
45].
Exposure to Digital Harassment (EDH) and Mental Health Impact (MHI) were measured using items informed by [
3,
4,
5,
6], comprising five EDH items (EDH1–EDH5) and four MHI items (MHI1–MHI4) [
15,
33,
45]. Mental Health Impact (MHI) was measured using four items (MHI1–MHI4) assessing psychological distress, anxiety, reduced sense of safety, and impaired concentration resulting from harassment exposure. MHI was operationalised as a negatively valenced construct, with higher scores indicating poorer mental health outcomes. E-Learning Continuity (ELC) was assessed using four items (ELC1–ELC4) adapted from [
16,
20,
55], capturing sustained participation and persistence in online learning activities.
To ensure cultural and linguistic appropriateness for the Saudi Arabian higher education context, the instrument underwent a rigorous localisation and validation process. Thirteen bilingual experts in higher education and information technology evaluated all items for content validity, cultural relevance, and conceptual clarity. In addition, ten university students reviewed the Arabic and English versions to assess readability and contextual suitability. All items were subjected to a forward–backward translation procedure verified by a bilingual committee to ensure linguistic and conceptual equivalence. Minor wording refinements were made to enhance clarity and contextual precision. A pilot study with 23 participants confirmed satisfactory reliability across all constructs [
48].
All constructs were modelled as reflective latent variables and analysed using Partial Least Squares Structural Equation Modelling (PLS-SEM). Exposure to Digital Harassment (EDH) was specified as a reflective construct, consistent with measurement theory, when indicators represent correlated manifestations of an underlying experiential condition [
56,
57,
58]. In this study, EDH is conceptualised as a frequency-based experiential outcome reflecting the intensity of exposure to harmful online behaviours across AI-mediated educational and social platforms. Accordingly, the EDH indicators are treated as parallel manifestations of a common latent exposure dimension, rather than as independent or formative components.
Furthermore, reflective modelling is appropriate when indicators are expected to be correlated, conceptually interchangeable manifestations of the same phenomenon, and are caused by the underlying latent construct rather than forming it. In contrast, formative modelling is suitable when indicators represent non-substitutable facets whose omission would alter the construct’s meaning [
56,
59,
60].
Given the theoretical framing of EDH as an underlying exposure intensity in AI-mediated digital environments, the reflective specification aligns with measurement theory. It supports the interpretation of EDH as a unified experiential outcome.
Empirically, the reflective EDH model demonstrated strong psychometric properties consistent with expectations for reflective measurement, including high and relatively homogeneous indicator loadings and strong internal consistency. These results support the interpretation that the EDH items capture a standard latent exposure dimension. While EDH encompasses multiple forms of harassment, the intent is not to treat these as interchangeable social phenomena, but as co-occurring manifestations of a shared exposure condition that is appropriately modelled as a single latent construct.
3.4. Common Method Variance (CMV) Concerns
Common Method Variance (CMV) is a frequent concern in social science research, particularly when independent and dependent variables are self-reported by the same participants [
61]. Such bias may inflate relationships among variables and affect the explanatory power and validity of the structural model [
61].
To mitigate this, the study followed the recommendations of Podsakoff et al. [
51] and implemented both procedural and statistical remedies. Procedurally, the questionnaire was carefully designed to minimise response bias by balancing item ordering and avoiding leading or patterned sequences. Dependent variables were deliberately positioned at later stages of the survey to reduce priming associations and mitigate respondents’ tendency to infer relationships across items [
62]. The questionnaire’s overall length was also monitored to reduce fatigue-related bias.
Statistically, CMV was first assessed using Harman’s single-factor test. The results showed that the first unrotated factor explained 41% of the total variance—well below the 50% threshold—indicating that CMV was unlikely to pose a significant threat to the study’s findings. To further reinforce this conclusion, additional, more robust statistical techniques were employed. A marker-variable technique using an unrelated construct (attitude toward SDGs) revealed that the marker accounted for less than 2.5% of shared variance with the primary constructs, signalling no inflation of correlations. A latent method factor model was also evaluated using confirmatory factor analysis (CFA), and the model fit did not improve relative to the baseline model (ΔCFI = 0.004; ΔRMSEA = 0.003), confirming the absence of substantial method bias.
Data integrity checks were implemented throughout the collection period. Automated validation screened for straight-lining, extreme outliers, and duplicate IP submissions. Responses with completion times below 40% of the median duration were excluded (n = 11), and no duplicate entries were retained. Enumerator-assisted completions were monitored through independent logs to ensure adherence to standardised administration procedures. Missing data were minimal (below 2% across all items), and Little’s MCAR test was not statistically significant (χ2 = 14.82, p = 0.26), supporting the assumption that missingness was random. Because of the low proportion, missing values were imputed using expectation–maximisation (EM) prior to PLS-SEM analysis.
To avoid order associations, questionnaire sections were counterbalanced during pilot testing, and no significant mean differences were observed between split forms (p > 0.05). The final version adopted the empirically validated sequence, beginning with demographic items and concluding with construct measures, ensuring a logical flow and sustained respondent engagement. Collectively, these procedural and statistical safeguards confirm that CMV, response bias, and order associations did not pose significant threats to the validity of the study’s data or its structural model estimates.
3.5. Hypothesis Examining Approach
A structured, sequential analytical strategy was adopted to examine the hypothesised relationships within the expanded UTAUT-based framework. The analysis proceeded in two phases. First, the direct associations of the five associated factors—Technological Literacy and Cybersecurity Awareness (TL&CA), Social Media Engagement Intensity (SMEI), Digital Identity Visibility (DIV), AI-Mediated Interactions (AIMI), and Cultural Norms and Social Expectations (CNSE)—on Exposure to Digital Harassment (EDH) were evaluated (H1–H5). This step isolated the primary behavioural, technological, and cultural correlations of exposure, controlling for moderating variables. Consistent with methodological guidelines, moderation terms were excluded during the examination of main associations to avoid confounding and multicollinearity between direct and interaction terms.
In the second phase, the outcome pathways (H6–H7) and moderator hypotheses (H8–H12) were analysed. Exposure to Digital Harassment (EDH) was tested for its correlation with Mental Health Impact (MHI) and E-Learning Continuity (ELC), with psychological and academic consequences assessed. Academic Specialisation (AS) was modelled as a moderator of the relationships between TL&CA, SMEI, and DIV with EDH (H8–H10), while Cultural Context (CC) was tested as a moderator of the relationships between AIMI, CNSE, and EDH (H11–H12). Interaction terms were mean-centred and entered the model in accordance with recommended PLS-SEM moderation procedures.
Furthermore, the measurement model was first evaluated to confirm psychometric adequacy. Indicator reliability was assessed using outer loadings, and internal consistency was assessed using Cronbach’s alpha and Composite Reliability. Convergent validity was examined via the Average Variance Extracted (AVE) [
52]. Discriminant validity was assessed using both the Fornell–Larcker criterion and HTMT ratios, and all constructs met established thresholds [
47]. Additionally, moderation hypotheses were tested by examining the significance of interaction terms, calculating moderation association sizes (f
2), and interpreting simple slope visualisations for significant interactions. All structural analyses were conducted in SmartPLS 4, chosen for its robustness in modelling complex frameworks with multiple moderators, non-normal data, and reflective constructs [
63].
Throughout the hypothesis testing and reporting of structural relationships, the term “association” is used solely as a statistical descriptor of estimated associations. Owing to the study’s cross-sectional, observational design, statistical inference is neither assumed nor implied.
Likewise, to minimise common method bias, both procedural and statistical remedies were implemented. Anonymity, randomisation of item sequences, and separation of correlated and outcome items were applied during instrument design. Statistically, Harman’s single-factor test indicated no general factor, full collinearity VIFs were below the recommended 3.3 threshold, and a latent method factor test revealed minimal loading shifts (<0.20). Together, these results suggest that standard method variance was not a substantive threat to the validity of the findings.
3.6. Ethical Approvals
Before initiating data collection, formal institutional approval was obtained. Furthermore, this procedure certified that our employed methods were consistent with the institutional criteria and the ethical concerns outlined in the Declaration of Helsinki [
64]. Additionally, several measures were implemented to protect participants’ rights. Participation was entirely voluntary and free from coercion, with written informed consent obtained from each respondent. Participants were assured of their right to withdraw at any time without providing a reason. All data were anonymised to safeguard confidentiality. Respondents were informed that their answers would remain anonymous, securely stored on encrypted institutional servers, and used solely for academic research purposes. No identifiable information was collected.
3.7. Software, Versions, and Reproducibility Details
All analyses were conducted using the following software and settings to support reproducibility. SmartPLS 4.0 (SmartPLS GmbH, Bönningstedt, Germany) was used for the primary PLS-SEM analysis, moderation modelling, bootstrapping with 5000 subsamples, MICOM measurement invariance testing, and simple-slope visualisation. IBM SPSS Statistics 29 was used for data screening, descriptive statistics, missing-data diagnostics, and preliminary robustness checks.
AMOS 29 was used to run a confirmatory factor analysis (CFA) with maximum likelihood estimation to assess standard method variance. Key analytical settings included the path weighting scheme, mean-centring of indicators for interaction terms, bias-corrected bootstrap confidence intervals, and two-tailed significance testing at α = 0.05. These details are reported to enable independent replication of the whole analytical workflow.
4. Data Analysis Methods and Results
To evaluate the hypothesised model, Partial Least Squares Structural Equation Modelling (PLS-SEM) was employed as the primary analytical technique. This choice reflects the study’s conceptualisation of digital harassment as a multidimensional sociotechnical phenomenon, shaped by behavioural, technological, and cultural associated factors operating within algorithmically mediated environments [
62].
Contrasting the covariance-based SEM (CB-SEM), which typically requires large sample sizes and strict adherence to multivariate normality, PLS-SEM can accommodate smaller samples and does not rely on the normality assumption [
61]. The data analysis was conducted employing SmartPLS (version 4) [
51]. To guarantee robustness of the findings, a bootstrapping process with 5000 subsamples was performed with a reflective measurement procedure [
65]. Following the recommendation from Hair et al. (2019) [
52], the model assessment process was performed in two successive stages.
Table 2 presents the assessment of the measurement model, which demonstrates strong reliability and convergent validity across all constructs. All indicator loadings exceeded the recommended threshold of 0.70, confirming that each item reliably represents its corresponding latent variable. For instance, loadings for Technological Literacy and Cybersecurity Awareness (TL&CA) ranged from 0.750 to 0.832, indicating robust indicator reliability.
Similarly, items measuring Social Media Engagement Intensity (SMEI) showed loadings ranging from 0.792 to 0.851, indicating that these indicators consistently capture the intensity of students’ social media use. Constructs with fewer items, such as Digital Identity Visibility (DIV), also demonstrated strong loadings (0.783–0.831), supporting the stability and clarity of these reflective measures.
In addition, internal consistency reliability was further supported by Cronbach’s α and Composite Reliability (CR), with all constructs exceeding the recommended minimum of 0.70. Notably, Exposure to Digital Harassment (EDH) displayed excellent reliability (α = 0.90; CR = 0.92), reflecting the consistency of students’ reported experiences across cyberbullying, cyber hate, sexual extortion, and digital stalking.
Furthermore, comparable reliability levels were observed for Cultural Norms and Social Expectations (CNSE) (α = 0.87; CR = 0.90) and E-Learning Continuity (ELC) (α = 0.87; CR = 0.90). Even constructs with moderate item counts, such as Mental Health Impact (MHI) and AI-Mediated Interactions (AIMI), demonstrated acceptable reliability, with CR values of 0.88.
Likewise, convergent validity was supported by the Average Variance Extracted (AVE), with all constructs surpassing the minimum threshold of 0.50. AVE values ranged from 0.55 to 0.64, indicating that each construct explains more than half of the variance in its observed indicators.
Additionally, EDH showed the strongest convergent validity (AVE = 0.64). At the same time, both TL&CA and SMEI demonstrated strong convergent validity (AVEs of 0.61 and 0.62, respectively). Additionally, the combined evidence indicates strong reliability and validity of the measurement model, providing a robust foundation for subsequent evaluation of the structural model.
Figure 1 presents the structural model with the estimated path coefficients, illustrating the relationships among the associated factors, EDH, and student outcomes. (See
Figure 4).
Table 3 presents the assessment of discriminant validity using the Fornell–Larcker criterion, and
Table 4 shows that all constructs exhibit satisfactory discriminant validity. For each latent variable, the square root of AVE, shown on the diagonal, is greater than the correlations with all other constructs in the corresponding row and column. This indicates that each construct accounts for more variance within its indicators than across indicators of other constructs—an essential requirement for reflective measurement models.
For example, TL&CA exhibited a square root of AVE of 0.781, which exceeds its correlations with SMEI (0.42), DIV (0.38), AIMI (0.40), CNSE (0.36), EDH (0.44), MHI (0.46), and ELC (0.41); similar patterns were observed for other constructs. Additionally, the square root of AVE for SMEI was 0.787, noticeably higher than its correlations with DIV (0.51), AIMI (0.49), CNSE (0.55), EDH (0.62), MHI (0.47), and ELC (0.45). Likewise, DIV demonstrated a diagonal value of 0.755, exceeding its correlations with SMEI (0.51), AIMI (0.48), CNSE (0.50), EDH (0.54), MHI (0.43), and ELC (0.40).
Furthermore, the same pattern holds for AIMI, CNSE, and EDH. Notably, EDH had a diagonal loading of 0.800, which was higher than its correlations with all other constructs, including SMEI (0.62) and CNSE (0.60), the strongest correlations. Hence, this confirms that although these correlations are theoretically related, they remain empirically distinct.
In addition, the outcome variables—MHI and ELC—satisfied the Fornell–Larcker criterion, with diagonal loadings of 0.742 and 0.775, respectively, exceeding all interconstruct correlations. Collectively, these results provide strong evidence that all constructs in the model meet the discriminant validity requirements according to the Fornell–Larcker criterion. Similarly, the findings support the conclusion that each latent variable is unique and captures a distinct conceptual domain, thereby justifying its use in the subsequent structural model analysis.
As presented in
Table 4, the HTMT values between the associated factor variables and exposure are moderate but acceptable. For instance, the HTMT value between SMEI and EDH is 0.74. In contrast, the values for DIV (0.68), AIMI (0.70), and CN&SE (0.72) also remain below the threshold of 0.85. These results are theoretically expected, as these associated factors are conceptually related to students’ digital behaviours and social experiences, yet they remain distinct constructs. Similarly, the HTMT value between EDH and MHI is 0.68, indicating a meaningful but not excessive association, consistent with theoretical expectations of the negative psychological consequences of EDH. The HTMT value between ELC remains within acceptable limits (0.62), indicating that although exposure influences ELC persistence, the two constructs measure distinct aspects of the student experience.
As presented in
Table 5, items associated with TL&CA show substantially higher loadings on their own construct (0.75–0.83) than on any other construct, where cross-loadings range only between 0.26 and 0.38. This pattern is consistent across all TL&CA items, confirming that the indicators uniquely measure students’ technological capabilities and cybersecurity awareness.
Similarly, items measuring SMEI exhibit high loadings on their own construct (0.79–0.85) and noticeably weaker cross-loadings with other constructs, ranging from 0.30 to 0.55. These results indicate that SMEI items associationively capture students’ engagement intensity with digital platforms without substantially overlapping with other constructs, such as DIV or AIMI. Likewise, items for DIV and AIMI follow the same pattern: primary loadings remain strong (DIV = 0.78–0.83; AIMI = 0.77–0.82), while cross-loadings remain considerably lower, reinforcing their conceptual independence within the model.
Additionally, the CNSE also demonstrates strong discriminant validity, with primary loadings ranging from 0.76 to 0.83 and cross-loadings that never exceed the highest loading for any item. This finding supports the argument that CNSE captures distinct cultural pressures, reporting expectations, and social constraints that shape experiences of digital harassment. In addition, items for EDH—which represent behaviours such as cyberbullying, cyber hate, sexual extortion, and digital stalking—exhibited high loadings (0.78–0.85) and consistently lower cross-loadings, indicating that the indicators collectively reflect a coherent construct relating to online harassment.
Furthermore, the outcome constructs—MHI and ELC—also demonstrate strong discriminant validity. MHI loadings fall between 0.77 and 0.82, while ELC loadings range from 0.79 to 0.83, with all cross-loadings remaining well below the primary loading values. This pattern indicates that the indicators for emotional strain, anxiety, and reduced learning persistence are uniquely associated with their respective constructs rather than overlapping with correlated variables.
As presented in
Table 6, a structural model assessment indicates strong support for most hypothesised relationships within the proposed framework. Among the five associated factors of digital harassment, four demonstrated significant positive associations with students’ exposure to online harm. AIMI (H4) showed the strongest statistical association with exposure (β = 0.31,
p < 0.001), suggesting that algorithmically driven communication, automated content, and AI-enhanced digital environments significantly increase the likelihood of encountering harmful behaviour.
Additionally, SMEI (H2) also exhibited a substantial positive association (β = 0.28, p < 0.001), highlighting the role of high-frequency online activity and social platform interactions in increasing vulnerability to harassment. Likewise, CNSE (H5) (β = 0.27, p < 0.001) and DIV (H3) (β = 0.24, p < 0.001) significantly increased exposure levels, indicating that cultural pressures, social constraints, and the openness of one’s digital identity contribute meaningfully to being associated with higher levels of patterns. In contrast, TL&CA (H1) showed a significant negative relationship with EDH (β = –0.21, p < 0.001). This finding confirms that students with more substantial knowledge of cybersecurity practices, online privacy management, and safe digital behaviours are less likely to experience harassment.
Furthermore, the protective association of technological literacy underscores the importance of digital hygiene education and student training to reduce the associated risks. The consequences of exposure to digital harassment were also strongly supported. Exposure to digital harassment (EDH) significantly worsened mental health impact (MHI) (H6; β = −0.33, p < 0.001), indicating that higher levels of harassment are associated with increased emotional distress, anxiety, and poorer psychological well-being. Given the negative valence of the MHI scale, this negative coefficient reflects more severe adverse mental health outcomes as exposure is associated with higher levels.
Exposure also significantly reduced ELC (H7) (β = –0.29, p < 0.001), demonstrating that students who experience harassment are more likely to disengage from digital learning environments, skip online sessions, or withdraw from online activities. These results emphasise the severe academic and emotional implications of digital harassment within AI-mediated educational contexts.
Moreover, regarding moderation associations, Academic Specialisation (AS) partially moderates the relationships among the associated factors and exposure to harassment. While its interaction with technological literacy (H8) was not statistically significant (β = 0.07, p = 0.055), AS significantly moderated the associations of Social Media Engagement (H9) (β = 0.11, p = 0.014) and DIV (H10) (β = 0.09, p = 0.035). These findings suggest that students across fields of study may experience and interpret digital engagement and visibility in distinct ways, thereby influencing their vulnerability to online harm.
Cultural Context (CC) also showed selective moderation associations. It significantly strengthened the relationship between AIMI and Exposure (H11) (β = 0.13, p = 0.006), indicating that cultural differences may shape how students experience AI-driven digital environments and their higher levels of association. However, CC did not significantly moderate the relationship between cultural norms and exposure (H12) (β = 0.05, p = 0.139), suggesting that the direct association of cultural norms is sufficiently strong and consistent across cultural groups, regardless of contextual variations.
Taken together, these results provide strong empirical support for the proposed model. Most associated factors significantly influence exposure to digital harassment; exposure, in turn, significantly predicts mental health outcomes and e-learning persistence. The moderating roles of AS and CC highlight significant socio-academic variations associated with higher levels of patterns, adding nuance to the understanding of how digital harassment operates within AI-mediated learning environments.
As presented in
Table 7, the structural model quality metrics, including R
2, f
2, and Q
2 values, collectively assess the explanatory power, association sizes, and predictive relevance of the proposed PLS-SEM model. Overall, the findings indicate that the model demonstrates strong explanatory capacity and meaningful predictive accuracy for the central constructs of interest—EDH, MHI, and ELC.
Additionally, the model explains a substantial proportion of variance in EDH, with an R2 value of 0.648, indicating that approximately 65% of the variability in exposure is accounted for by the five associated factors and the moderating variables. This level of explanatory power is considered high for behavioural and educational research, especially within complex digital environments.
Furthermore, the association sizes (f2) indicate that AIMI exerted the most significant relationship with EDH (f2 = 0.14), followed by SMEI (f2 = 0.12) and CNSE (f2 = 0.11). Likewise, DIV (f2 = 0.09) demonstrated a meaningful association, whereas TL&CA (TL&CA) exhibited a smaller yet significant protective association (f2 = 0.06). The moderating variables—AS (f2 = 0.02) and CC (f2 = 0.03)—showed small but theoretically relevant contributions, consistent with prior research on contextual and disciplinary influences in digital behaviour.
Predictive relevance, as assessed by Q2, further validates the model. Moreover, the Q2 value for EDH (Q2 = 0.411) is well above zero, indicating strong predictive relevance and confirming that the model accurately predicts the levels of digital harassment exposure among students. This finding underscores the robustness of the structural pathways linking behavioural, technological, and cultural associated factors with harassment outcomes.
The outcome constructs—MHI (MHI) and ELC (ELC)—also demonstrated meaningful explanatory power. EDH explained 38.2% of the variance in mental health outcomes (R2 = 0.382) and 33.6% of the variance in ELC (R2 = 0.336). These results indicate moderate to strong explanatory power, highlighting the central role of harassment experiences in shaping both psychological well-being and students’ ability to sustain participation in online learning.
Furthermore, association sizes confirm these relationships: f2 = 0.18 for MHI and f2 = 0.16 for ELC, both indicating moderate associations. Predictive relevance values (Q2 = 0.272 for MHI and Q2 = 0.234 for ELC) support the conclusion that EDH meaningfully predicts declines in mental health and disruptions in online learning engagement. Hence, the structural model demonstrates strong explanatory capabilities, meaningful association sizes, and robust predictive relevance. The results validate the theoretical framework proposed in this study and highlight the central role of exposure to digital harassment in shaping students’ psychological and academic experiences within AI-mediated educational environments.
5. Discussion
This study examined the behavioural, technological, and cultural factors associated with university students’ EDH in AI-mediated learning environments. The findings extend prior research by demonstrating how AI-driven interactions, digital identity patterns, and social engagement behaviours collectively contribute to online harm. This is consistent with earlier work on technology-mediated education (e.g., [
20,
66,
67]) and contributes to current debates on how digital societies and algorithmic systems reshape social reality. The results confirm that digital harassment remains a pervasive challenge for students, particularly in developing countries undergoing rapid digital expansion, where platform penetration and AI-supported tools grow faster than regulatory, pedagogical, and ethical safeguards.
The strong reliability and validity of the measurement model further confirm that constructs such as technological literacy, cultural norms, identity visibility, and harassment exposure are conceptually distinct and empirically robust, consistent with prior psychometric research in digital safety and online, which is associated with higher levels of these domains (e.g., [
28,
29,
37,
53]). Moreover, a key empirical contribution of this study is the identification of AI-Mediated Interactions (AIMI) as the strongest correlation of EDH. This finding aligns with growing concerns that algorithmic systems may unintentionally amplify harmful content, facilitate repeated or unsolicited contact, or weaken traditional cues of accountability through automated mediation (e.g., [
30,
54,
68]). In line with prior research on algorithmic curation, AI-driven recommendation and ranking mechanisms can increase exposure to extreme or undesirable interactions, thereby intensifying associations with factors beyond individual behavioural factors.
The significant association between Social Media Engagement Intensity (SMEI) and frequent participation in online spaces is consistent with earlier findings showing that frequent participation in online spaces is associated with greater exposure to cyber aggression [
3,
5,
21]. Additionally, High posting activity, real-time interaction, and presence across multiple platforms enhance visibility and accessibility to potential perpetrators. In AI-mediated environments, this visibility is further shaped by algorithmic amplification, which surfaces, promotes, or recommends highly active users. Consequently, SMEI is associated with higher levels of mechanisms operating at the intersection of user behaviour and platform-driven exposure dynamics.
Similarly, Digital Identity Visibility (DIV) was found to significantly increase vulnerability to harassment. This result is consistent with studies demonstrating that publicly accessible profiles, extensive self-disclosure, and weak privacy controls are associated with higher levels of stalking, impersonation, and targeted harassment [
28,
30,
33]. Even limited publicly available information—such as identifiable images or activity traces—can be exploited by malicious actors. The findings confirm that visible and weakly protected digital identities constitute a persistent structural vulnerability within AI-enhanced educational ecosystems.
The influence of Cultural Norms and Social Expectations (CNSE) highlights the central role of stigma, honour-related concerns, and reporting cultures in shaping prolonged exposure to digital harassment. In many contexts, students may refrain from reporting harassment due to fear of social repercussions, reputational damage, or perceptions of institutional inaction—patterns well documented in prior research [
3,
5,
16,
20,
55,
66]. These socio-cultural pressures can normalise online aggression and discourage help-seeking, allowing harmful behaviours to persist and escalate. The present findings underscore that digital harassment cannot be understood solely as a technological or individual issue but must be situated within broader cultural and social frameworks that shape tolerance, silence, and vulnerability.
The protective association of Technological Literacy and Cybersecurity Awareness (TL&CA) parallels findings from digital resilience research, which consistently show that knowledge of privacy controls, threat detection, and secure communication reduces victimisation [
15,
53]. Students equipped with digital safety skills are better able to navigate AI-mediated environments, manage exposure, and respond to emerging threats. Within developing contexts, TL&CA also reflects a critical dimension of digital inequality: disparities in skills and awareness translate into unequal protection against algorithmically amplified associations.
The study further confirms that EDH has significant adverse associations with mental health, consistent with extensive evidence linking online harassment to anxiety, emotional exhaustion, fear, and depressive symptoms [
16,
17,
55]. In addition, the negative association with e-learning continuity aligns with research showing that harassment leads to disengagement, avoidance of online platforms, and reduced academic motivation [
6,
15]. In AI-enhanced learning environments—where digital participation is central to assessment, collaboration, and communication—harassment can reshape educational trajectories and exacerbate inequalities in academic persistence and success. Moderation analyses provide additional insight into contextual variation. Academic specialisation moderated the associations of social media engagement and digital identity visibility, suggesting that discipline-specific cultures and patterns of digital exposure shape harassment [
69].
By contrast, the absence of moderation by specialisation for TL&CA indicates that cybersecurity awareness offers broadly protective benefits across fields. Cultural context moderated the relationship between AI-mediated interactions and EDH, reinforcing evidence that perceptions of AI, platform trust, and digital norms vary across cultural environments. However, cultural context did not moderate the association of CNSE, indicating that stigma and reporting pressures may operate as shared structural constraints rather than context-specific cultural traits, consistent with cross-cultural cyberbullying research [
70,
71,
72].
Within developing countries, the results suggest that students often engage with AI-enhanced platforms through shared digital infrastructure, limited privacy controls, and fragmented governance frameworks. These conditions can increase both the visibility and persistence of harmful online interactions. High levels of social media engagement and digital identity visibility further increase the association, particularly when online participation substitutes for academic and social interaction. In such contexts, students are frequently exposed to large digital audiences in environments characterised by weak cyberregulation, a pattern consistent with earlier studies [
5,
9].
Cultural norms surrounding reporting, combined with higher tolerance of online aggression, may further intensify vulnerability. Social stigma, honour-related concerns, and limited trust in institutional response mechanisms can discourage disclosure, allowing harassment to persist and escalate. The absence of a moderating association of cultural context on cultural norms suggests that these pressures reflect shared structural constraints rather than isolated cultural characteristics. By contrast, technological literacy and cybersecurity awareness emerged as protective factors, indicating that individual digital competence can mitigate the effects associated with limited institutional safeguards. In developing contexts, gaps in digital skills therefore represent a form of digital inequality that translates into unequal exposure to algorithmically amplified harm [
45,
71].
Overall, these findings indicate that digital harassment in AI-driven learning environments in developing countries should be understood as a structural sociotechnical issue. It is shaped by rapid AI adoption, constrained governance, cultural barriers to reporting, and unequal access to knowledge about digital safety, rather than by isolated acts of individual misconduct.