1. Introduction
Chronic diseases such as diabetes and cardiovascular conditions present a major public health challenge in Saudi Arabia, contributing significantly to morbidity, healthcare utilization, and economic burden [
1]. The need for comprehensive health system reforms became increasingly evident in recent years, as the country faced rising rates of noncommunicable diseases (NCDs), fragmented services, and inefficiencies in care delivery [
2].
In response, the Saudi government introduced the Saudi Health Sector Transformation Program (SHSTP) in 2016 as a central component of Vision 2030, a national strategic framework aimed at economic diversification and social development [
3]. Within the health sector, SHSTP was designed to modernize service delivery through integrated care networks, improved chronic disease management, expanded digital health adoption, and enhanced operational efficiency [
4]. The program also sought to institutionalize patient-centered care principles by promoting shared decision-making, personalized care pathways, and active patient engagement [
5].
To operationalize these goals, SHSTP introduced hospital clustering, a reform that reorganizes hospitals into geographically defined networks to enable better coordination across primary, secondary, and tertiary care [
6]. These clusters are expected to act as Accountable Care Organizations (ACOs) that assume responsibility for cost, quality, and outcomes. Digital health innovations such as telemedicine, the Sehat app, and electronic medical records (EMRs) are key enablers of this shift [
7]. Furthermore, performance-based contracting and public–private partnerships have been introduced to support these reforms.
Despite the ambitious scope of SHSTP, few empirical evaluations have assessed its real-world effectiveness, especially regarding its impact on chronic disease care, patient experience, and digital health adoption. The existing literature largely remains descriptive or policy-focused, lacking robust data on patient-level outcomes [
8]. Moreover, the comparative performance of clustered versus non-clustered hospitals, which have not yet fully adopted SHSTP reforms, has not been systematically explored. Internationally, comparable reforms have been implemented across Europe, North America, and Asia, where integrated care and digital transformation programs such as the JADECARE initiative in Europe and the U.S. Accountable Care Organization (ACO) model have demonstrated improvements in continuity and efficiency. However, SHSTP represents one of the first large-scale digital integration programs in the Gulf region, offering a unique perspective on how such models can be adapted within Middle-Eastern health system contexts [
9].
This study addresses that gap by evaluating the implementation and impact of SHSTP reforms in two key regions, Mecca and the Eastern Province, which served as pilot sites between 2016 and 2024. The study investigates whether clustered hospitals demonstrate improvements in patient-centered care, care coordination, and digital health integration compared to non-clustered counterparts [
10].
Objective and Hypothesis
The primary objective is to evaluate the effectiveness of SHSTP in embedding patient-centered practices and enhancing digital health use in chronic disease care. We hypothesize that clustered hospitals will report higher patient satisfaction, improved care coordination, and greater digital tool usage than non-clustered hospitals.
By providing the first mixed-methods assessment of SHSTP at the patient and provider level, this study contributes critical evidence to inform ongoing health system reforms and support efforts to optimize chronic care delivery in alignment with Vision 2030.
2. Materials and Methods
2.1. Study Design
This study employed a convergent parallel mixed-methods observational design to evaluate the implementation and impact of the Saudi Health Sector Transformation Program (SHSTP) across two regions (Mecca and Eastern Province) on chronic disease care. Quantitative and qualitative data were collected concurrently, analyzed independently, and integrated during interpretation through triangulation to identify areas of convergence, complementarity, and divergence [
11]. Quantitative analysis utilized longitudinal performance data [
12] from 2016 to 2024, while qualitative data were derived from cross-sectional semi-structured interviews [
13] conducted in 2024. The study compared patient-centered care, digital health adoption, and care coordination between clustered and non-clustered hospitals in the Mecca and Eastern Province regions. The mixed-methods approach was chosen to capture both measurable outcomes and the contextual realities of implementation. While quantitative data provided standardized indicators of SHSTP performance, qualitative insights uncovered barriers and facilitators, such as cultural resistance and informal practices, not detectable through conventional metrics. Walt and Gilson’s Health Policy Analysis Triangle served as the guiding conceptual framework, allowing for comparative analysis across policy content, context, and process [
14]. Although the quantitative dataset covered nine years (2016–2024) of hospital-level indicators, the small number of hospitals (
n = 4) limited the feasibility of repeated measures or panel regression modeling. Accordingly, longitudinal records were aggregated by year and examined descriptively with time-series plots, while between-group comparisons (clustered vs. non-clustered hospitals) were tested using
t-tests and ANOVA. This clarified analytic structure, documented in the full dissertation, ensures that the term longitudinal refers to the multi-year nature of data collection rather than a repeated measures statistical model.
2.2. Setting
The study was conducted in four hospitals located within the Mecca and Eastern Province regions of Saudi Arabia. These regions were purposively selected because they represent distinct implementation contexts within SHSTP, Mecca serving as one of the earliest pilot zones with advanced cluster integration, and the Eastern Province representing a comparatively transitional region where implementation was still evolving. This diversity allowed the study to capture both mature and developing models of reform within the same national framework.
The selection was also guided by practical considerations of data accessibility, institutional cooperation, and alignment with Ministry of Health pilot phase reporting. While these two regions do not represent the entire national healthcare landscape, they provide analytically rich contrasts that illustrate how clustering reforms function across differing administrative and infrastructural environments.
Potential regional bias is acknowledged; therefore, findings are interpreted as indicative rather than nationally generalizable.
2.3. Population and Sampling
Two groups were included: (1) physicians, senior-level professionals directly involved in SHSTP implementation and chronic disease care; and (2) patients, individuals receiving care for chronic conditions such as diabetes or cardiovascular disease. A purposive sampling strategy was employed to capture participants with direct experience of SHSTP reforms, ensuring diversity in age, gender, and digital literacy [
15].
Inclusion criteria:
Patients: Adults (≥18 years) with one or more chronic diseases, receiving care at participating hospitals.
Physicians: Practising for at least two years in hospitals under study and directly involved in chronic disease management.
Sample Size Justification
A total of 80 participants (40 physicians and 40 patients) were recruited for the qualitative component, with equal distribution across clustered and non-clustered hospitals. A priori power analysis conducted using G*Power 3.1.9.7 indicated that a minimum sample of 64 would be sufficient to detect a medium effect size (Cohen’s d = 0.5) at an alpha level of 0.05 and power of 0.80 [
16]. The final sample of 80 allowed for adequate statistical power while accounting for potential attrition and enabling subgroup analyses, though limitations related to small subgroup sizes were acknowledged. For the qualitative component, 53 participants (30 physicians and 23 patients) were purposively selected from the quantitative sample. Thematic saturation was achieved by the 25th physician and 18th patient interview, with no new themes emerging thereafter. This supported the decision to stop data collection, aligning with established criteria for qualitative saturation [
17]. This confirmed adequacy of the qualitative sample for thematic depth and variation across both clustered and non-clustered hospital contexts. Although both quantitative and qualitative components used purposive sampling, the analytic units differed. Quantitative indicators such as readmission rate and operational efficiency were aggregated annually at the hospital level (
n = 4), while patient satisfaction, PCCI, and digital literacy data were collected at the individual level (
n = 80). Because these observations represented independent hospital groups rather than hierarchical clusters and the number of hospitals was small, multilevel or clustering adjustments were not statistically feasible. This structure follows the analytic rationale documented in the dissertation and ensures comparability across the two data sources. Although purposive sampling may limit statistical representativeness, it aligns with the study’s exploratory and implementation-focused objectives. The sampling rationale prioritized information-rich cases capable of reflecting SHSTP performance diversity rather than population generalization. By selecting hospitals and participants across both clustered and non-clustered settings, the study mitigated regional bias through comparative analysis. The diagram below illustrates the sequential selection of hospitals (clustered vs. non-clustered), participant grouping (physicians and patients), and the derivation of final quantitative (
n = 80) and qualitative (
n = 53) samples following thematic saturation. Although the qualitative sample (
n = 53) was larger than typical saturation thresholds, it was deliberately designed to represent both hospital types and regional contexts, strengthening depth and validity rather than reflecting convenience sampling.
2.4. Variables and Data Sources
The primary exposure variable was hospital type (clustered vs. non-clustered). Key confounders adjusted for in the analysis included age, gender, socioeconomic status, ethnicity, comorbidities, digital literacy, and hospital bed size. These factors were selected due to their known influence on health outcomes, patient engagement, and technology adoption [
18].
Outcome measures included
Patient-Centered Care Index (PCCI), computed as a weighted composite of the following:
Patient Satisfaction (0.4)
Shared Decision-Making (0.3)
Care Coordination (0.3)
Digital Health Use, measured through the following:
Telemedicine usage (binary variable)
Sehat app usage (binary variable)
Technology Adoption Score, calculated as the following:
EMR Adoption Rate (0.6)
Teleconsultations per 100 patients (0.4)
Operational Efficiency, assessed using a composite score based on the following:
30-day readmission rate
Mean consultation time
Patient throughput, benchmarked against SHSTP indicators.
Chronic Disease Management Index (CDMI), derived from the following:
Follow-up Compliance (0.3)
Disease Control Rate (0.3)
Readmission Rate (0.2)
Patient Self-Management (0.2)
Disease control metrics were condition-specific: HbA1c for diabetes and BP control for hypertension.
Data sources included the following:
Hospital records (for operational metrics, satisfaction, PCCI, digital usage, and digital engagement)
Both the PCCI and digital literacy composite scores underwent internal consistency checks (Cronbach’s α = 0.86 and 0.81, respectively), confirming acceptable reliability for inclusion in regression analyses.
Semi-structured, structured interviews (for qualitative feedback and interpretation)
2.5. Data Collection
Quantitative Data:
Quantitative data were obtained from hospital records at administration level as performance metrics and patient satisfaction scores from the routine hospital surveys and the EMR statistics, telemedicine through IT department, readmission rate from medical records, and operational metrics through other systems. Quantitative data were collected to assess patient satisfaction, care coordination (via the PCCI), and use of digital tools. The data collection period extended from January 2016 to August 2024, categorized by three phases: pre-clustring implementation phase from 2016 to 2018, clustring phase from 2018 to 2020, and post-clustering phase from 2020 to 2024. These outcomes align with SHSTP objectives outlined in the Introduction.
Qualitative Data:
Qualitative data was collected through semi-structured and structured interviews (hybrid) with 80 sample-designed participants—patients (40) and senior physicians (40) with chronic conditions; due to the saturation being achieved, patients (30) and senior physicians (23) had no new emerging themes after the 18th patient and 25th physician has been interviewed, which took place in 2024. The interviews focused on their experiences with SHSTP reforms, including barriers and facilitators to patient engagement, the impact of digital health tools, the effectiveness of resource allocation strategies, and the views of changes in care quality.
2.6. Data Analysis
2.6.1. Quantitative Analysis
Quantitative data were aggregated monthly from 2016 to 2024 across four hospitals (two clustered and two non-clustered). Descriptive statistics (mean ± standard deviation) were calculated for key performance metrics, including patient satisfaction, care coordination (PCCI), digital tool usage, and operational efficiency.
Independent two-sample t-tests were used to compare outcomes between clustered and non-clustered hospitals, while analysis of variance (ANOVA) assessed differences across hospital type, healthcare level, and implementation phases: pre-clustering (2016–2018), clustering rollout (2018–2020), and post-clustering (2020–2024).
To identify predictors of patient-centered care (PCCI scores), a multiple linear regression model was applied, adjusting for hospital type, region, hospital size, prior performance, and respondent role (physician vs. patient) using a binary variable (is_physician). Effect sizes were reported using Cohen’s d, and 95% confidence intervals were calculated to convey the magnitude and precision of observed differences in line with established guidelines. Prior to analysis, quantitative datasets were screened for completeness and normality. Missing data comprised less than 5% of all variables and were handled through pairwise deletion to preserve sample independence. Normality and homogeneity of variance were verified using the Shapiro–Wilk and Levene tests, confirming the suitability of t-tests and ANOVA. Multicollinearity among predictors was examined through variance inflation factors (VIF < 2.5), while residuals were checked for linearity, independence, and homoscedasticity using scatter-plot and Durbin–Watson diagnostics. Given the exploratory, theory-building purpose of this study, formal multiplicity corrections such as Bonferroni were not applied; instead, interpretation emphasized effect sizes and confidence intervals rather than strict p-value thresholds, aligning with recommendations for exploratory health services research.
Finally, time-series analysis was conducted to project potential trends in care performance metrics through 2025, providing insights into the sustained impact of SHSTP reforms. All statistical analyses were performed using SPSS v26 and R v4.3.2. To illustrate temporal change from 2016 to 2024, smoothed time-series plots were produced for patient satisfaction, PCCI, digital health use (telemedicine and Sehat app), and readmission rate. Exploratory checks of autocorrelation and stationarity were carried out, and candidate ARIMA(p,d,q) models were evaluated using the Akaike information criterion to identify representative fits. In line with the dissertation analysis, a parsimonious ARIMA(5,1,1) specification provided the best in-sample fit for operational efficiency trends. Because of the limited number of hospitals (
n = 4), the time-series results are presented descriptively through
Figure 1,
Figure 2 and
Figure 3 rather than formal inferential modeling. These figures visualize pre-clustering, clustering rollout, and post-clustering phases to convey the direction and sustainability of SHSTP performance over time.
2.6.2. Qualitative Analysis
The qualitative component was designed and analyzed under the supervision of an experienced qualitative researcher with doctoral-level expertise in interpretive and policy-oriented inquiry. This ensured methodological alignment with established qualitative traditions and consistency between the epistemological stance of thematic analysis and the overall mixed-methods framework. The approach drew explicitly on Braun and Clarke’s reflexive thematic analysis and Lincoln and Guba’s criteria of trustworthiness credibility, dependability, confirmability, and transferability to maintain methodological coherence and analytical depth throughout the study. Interview transcripts were coded using NVivo software 12 Pro (QSR International;
https://www.qsrinternational.com/nvivo-qualitative-data-analysis-software/home; accessed on 10 September 2025) following thematic analysis methodology. Initial codes were based on SHSTP objectives and the Health Policy Analysis Triangle. Themes were iteratively refined through multiple coding rounds, with triangulation used to identify convergence between quantitative and qualitative findings. Thematic saturation was reached after 25 physician and 18 patient interviews, with no new codes emerging. To ensure rigor, coding consistency was enhanced through peer debriefing and reflective memoing. The first author conducted iterative code–recode checks, while an independent qualitative advisor reviewed a sample of coded transcripts for alignment. Any differences in interpretation were discussed until conceptual agreement was reached, ensuring dependability and transparency in the final themes. These steps enhanced the credibility and dependability of the thematic findings in accordance with qualitative research standards [
19]. The qualitative work was conducted under the guidance of senior researchers experienced in health policy and qualitative inquiry, ensuring methodological alignment and credibility.
2.7. Ethical Considerations
The Research Ethics Committee at Swansea University’s College of Health Sciences and the Umm Alqura university granted ethical approval for this study. All participants gave written informed consent prior to participation. In fact, their data were anonymized to fully protect privacy and confidentiality. Cultural sensitivity was maintained by respecting local norms, including gender-related protocols during interviews. All participants gave written informed consent, and their identities were anonymized. The study received ethical approval from the Research Ethics Committee at Swansea University’s College of Health Sciences and the Saudi Ministry of Health. Standardized procedures were documented to ensure transparency and replicability.
4. Discussion
This study may offer evidence that the Saudi Health Sector Transformation Program (SHSTP), through hospital clustering, has yielded meaningful improvements in chronic disease care. Clustered hospitals outperformed non-clustered counterparts in patient-centered care (higher satisfaction and PCCI scores), digital health adoption (greater telemedicine and Sehat app utilization), and operational outcomes (reduced readmission rates and shorter consultation times) [
20]. Regression analysis further established hospital cluster status (β = 12.5,
p = 0.022), physician role (β = 19.9,
p < 0.001), and digital literacy (β = 4.0,
p = 0.047) as significant predictors of perceived care coordination. Qualitative insights reinforced these trends, revealing enhanced communication, continuity, and structured care pathways in clustered hospitals, while highlighting barriers such as digital literacy gaps and workforce instability [
21]. Given multiple comparisons and the exploratory design, we emphasized effect sizes and CIs over
p-values, and did not apply formal multiplicity corrections.
International experiences with digitally enabled integrated care offer strong parallels. For example, a cross-national evaluation of 17 integrated chronic care programs across Europe found that embedding digital health tools into coordinated care networks improved care delivery and facilitated longitudinal program assessment [
22]. Additionally, the JADECARE project, aimed at scaling integrated care strategies across Europe, reinforces how structured regional networks supplemented with interoperable digital systems can enhance outcomes in patients with complex conditions [
23]. Similarly, in North America, accountable care frameworks and telehealth expansion initiatives have demonstrated improvements in coordination and cost containment, providing additional comparative insight into SHSTP’s alignment with global integrated care trajectories. These insights resonate with findings that clustering enabled sharing of digital infrastructure, improved care continuity, and fostered accountability, echoing these successful integrated models.
Domestically, efforts to advance digital health under Vision 2030 have accelerated transformation efforts. For instance, a study assessing digital health readiness in Eastern Province hospitals found robust implementation of governance and workforce structures, though predictive analytics remained less mature [
24]. Another comprehensive review shows that Saudi Arabia has rapidly expanded digital health infrastructure, including EMRs, telemedicine, and national health information exchanges, which aligns with the operational improvements observed in our clustered hospitals under SHSTP [
21]. These developments help explain the clustered hospitals’ higher rates of telemedicine and Sehat app usage, and the more efficient patient outcomes achieved.
In summary, SHSTP’s cluster-based reform approach demonstrates clear benefits in enhancing patient-centered care, boosting digital engagement, and improving operational efficiency in chronic care contexts [
25]. Sustaining and building upon these gains will require continued investments in digital literacy, workforce development, and change management [
26]. These findings provide crucial evidence for policymakers and healthcare systems seeking to leverage integrated, digitally supported models of care in national transformation agendas.
The findings underscore the importance of structured clustering as a model for scaling integrated, digitally supported chronic care systems. Other countries in transition or reform phases may adapt similar frameworks by investing in workforce digital competencies, establishing interoperable EMRs, and adopting cluster-based accountability mechanisms. In clinical practice, the study highlights how digital platforms and multidisciplinary care pathways can enhance continuity, efficiency, and patient trust.
Despite the strengths of its mixed-methods design, several methodological limitations should be acknowledged: this study has several limitations. Its cross-sectional design restricts causal inference and limits the ability to establish directionality in the observed associations. Thus, correlation has been found between the clustered hospital and improved performance. These improvements cannot be linked to the SHSTP process alone. Other underlying factors, such as ongoing quality assurance initiatives or variations in hospital management, may be relevant to the differences. The modest sample size and regional focus on Mecca and the Eastern Province reduce generalizability to other areas of Saudi Arabia. Although key confounders were adjusted for, unmeasured variables may have influenced the findings. Additionally, manual recordkeeping in non-clustered hospitals could have introduced inconsistencies in data quality, and the purposive sampling approach, while methodologically appropriate for qualitative depth, may have introduced selection bias. Broader, longitudinal studies are warranted to validate these findings on a national scale. In addition, several analyses were conducted, including statistical multiplicity assessments (e.g., Bonferroni correction) due to the exploratory focus of the study. Furthermore, the quantitative dataset was derived from administrative hospital records, which are inherently subject to variability in data-entry quality, completeness, and coding practices across facilities. Although standardized SHSTP indicators were used to enhance comparability, heterogeneity between hospital record systems—particularly between clustered and non-clustered sites—may have influenced data accuracy. These limitations were mitigated through consistency checks, aggregation at the annual level, and triangulation with qualitative evidence. The alignment between administrative data trends and interview findings strengthens confidence in the validity of results, yet future studies should consider harmonized digital reporting frameworks to ensure cross-site data reliability.
5. Conclusions
This study provides early evidence suggesting that SHSTP implementation is enhancing patient-centered care, digital adoption, and care coordination in clustered hospitals across Saudi Arabia. Patients in these settings reported higher satisfaction, better follow-up, and improved digital access, while providers described stronger collaboration and clearer care pathways. The alignment between statistical and narrative data confirms that clustering fosters more responsive and efficient care delivery.
At the same time, challenges such as digital literacy gaps, cultural resistance to change, and reliance on expatriate staff remain barriers to sustained success. Policymakers should prioritize targeted investments in workforce development, digital health education, and inclusive implementation strategies to ensure that reforms benefit all segments of the population. Future longitudinal studies with expanded geographic coverage will be essential to validate these findings and assess the long-term impact of SHSTP on the broader Saudi healthcare system. These outcomes may inform other health systems undergoing digital or structural reform, particularly in middle-income and rapidly developing contexts. Lessons from SHSTP may guide international policymakers seeking scalable frameworks for integrated, patient-centered digital health transformation.