1. Introduction
Before the 1990s, China’s hospital payment system primarily relied on fee-for-service (FFS) retrospective payments. This approach created perverse incentives, such as over-treatment, excessive medical expenditure, and irrational pricing behaviors, which contributed to the disorderly expansion of the healthcare market. Both public and private healthcare expenditures soared, at times exceeding GDP growth rates [
1]. In response, the Chinese government launched a new round of healthcare reforms in 2009, aiming to provide safe, efficient, and affordable basic healthcare services to the population. Since then, basic health insurance coverage has steadily exceeded 95% [
2,
3]. However, amid slowing economic growth, an aging population, and rising healthcare demands, the healthcare system faces mounting challenges, including inefficiencies, unsustainable expenditures, and increasing public dissatisfaction [
4]. Thus, it is imperative to explore more effective models of governance, organizational, and management and to improve the use and allocation of medical insurance funds to deliver higher quality and more efficient hospital services.
The concept of Diagnosis-Related Groups (DRG) was first introduced in the late 1960s in the United States as a mechanism to control healthcare costs more effectively. DRGs classify patients into clinically and economically comparable groups based on diagnosis, severity, and complications. Under this system, payments are based on the group assigned after diagnosis rather than the actual costs incurred during treatment. While DRG systems have been widely studied and applied across a variety of healthcare systems worldwide over the past few decades, robust empirical evidence on their effectiveness in developing countries remains limited, and the impacts across different medical institutions and insured populations are still debated [
5,
6,
7,
8].
Z City, located in eastern China, had a per capita GDP of CNY 183,000 and a population of 9.547 million in 2023. On 1 January 2022, Z City officially launched DRG payment reform for all secondary and tertiary healthcare institutions (excluding psychiatric hospitals). The reform applied to patients enrolled in both the Urban Employee Basic Medical Insurance (UEBMI) and the Urban-Rural Resident Basic Medical Insurance (URRBMI), creating a natural exogenous policy shock.
This study utilized inpatient data for colorectal cancer patients in Z City from 2020 to 2023 to quantitatively evaluate the impact of DRG payment reform on medical outcomes. It also examined heterogeneity in effects across insurance types and hospital levels, aiming to inform evidence-based policymaking to optimize health insurance payment mechanisms and promote more equitable and efficient healthcare resource allocation.
This study is organized as follows:
Section 2 presents the contributions relative to previous studies and the research questions. In
Section 3, we introduce the data collection and methodology used.
Section 4 provides the results. In
Section 5, we provide an interpretation and discussion of the results. And then discuss research implications and limitations. The conclusions are presented in
Section 6.
2. Background
2.1. The Chinese DRG System
As the world’s largest developing country, China faces rising medical expenditures and inefficiencies in healthcare delivery. In response, a comprehensive reform of the medical insurance payment system is underway. In 2009, the State Council issued the “Opinions on Deepening the Reform of the Medical and Health System,” which set the overarching goal of establishing a basic medical and healthcare system that covers both urban and rural residents, ensuring access to safe, effective, convenient, and affordable care. Before this reform, most public hospitals in China operated under FFS payment models, which fueled uncontrolled growth in healthcare costs and incentivized hospitals to overprovide services for additional revenue, leading to widespread over-treatment and inefficiency [
9]. With the onset of population aging, these structural problems increasingly challenges the short- and long-term financial sustainability of China’s healthcare insurance system [
4,
10].
China began experimenting with DRGs in the 1980s. In 2008, the first domestic DRG classification system—BJ-DRG was developed mainly based on the U.S. and Australian models [
11]. Since then, regional governments have piloted and refined DRG systems. In 2017, the State Council issued further guidelines to promote medical insurance payment reform, mandating DRG-based payment pilots and encouraging alternatives such as capitation to replace traditional FFS approaches [
12]. After more than two decades of experimentation and development, four main DRG variants emerged: BJ-DRG (focused on payment, used in 12 provinces), CN-DRG (focused on performance and quality, used in 29 provinces), CR-DRG (targeting the New Rural Cooperative Medical Scheme and urban-rural resident programs), and C-DRG (piloted by the National Health Commission in cities like Sanming and Karamay) [
13]. In October 2019, China introduced its first unified DRG system, CHS-DRG, which integrates the BJ, CN, and CR versions [
14]. Implementation followed a three-stage process—top-level design, simulated operation, and full implementation in 30 cities’ pilot programs. By leveraging DRG, China aims to shift from retrospective to prospective payment models, realign the interests of hospitals, insurers, and patients, and ultimately improve the efficiency of insurance fund utilization. The DRG framework also facilitates payment negotiations between healthcare institutions and insurers, supports financial balance, incentivizes clinical staff, standardizes medical practices, improves service efficiency, and promotes the sustainable development of the healthcare system [
15].
As an early adopter of China’s Diagnosis-Related Groups (DRG) payment reform, Z City has extended DRG-based payments to all 98 of its secondary-level and above medical institutions since 2022. This city-wide reform encompasses all inpatient care categories, characterized by extensive coverage, substantial scale of implementation, and a high degree of data transparency. Key strengths of Z City’s DRG model include standardized policy enforcement and exceptional data transparency. The healthcare insurance bureau releases operational indicators—including weight adjustments, Case Mix Index (CMI), and cost structures—on a quarterly basis, providing researchers with high-quality longitudinal datasets. Moreover, Z City implemented the reform simultaneously across all institutions from the outset, avoiding potential selection bias associated with staggered rollout. This natural experiment setting offers a robust opportunity to evaluate the causal effects of DRG policies on treatment practices for specific diseases.
2.2. Existing Research and Evidence
As the continuous rise in medical costs poses severe challenges to the healthcare security system, the Diagnosis-Related Group (DRG) payment reform has become a crucial direction for China’s healthcare policy reform. Existing research primarily unfolds from two dimensions: On one hand, a large number of studies based on macro data have confirmed that DRG plays a positive role in controlling the growth of medical expenses, shortening average hospitalization days, and optimizing medical resource allocation [
16,
17]; On the other hand, some scholars have begun to focus on the impact of DRG on clinical diagnosis and treatment behaviors, finding that it may induce medical institutions to adjust patient admission structures and lead to phenomena such as diagnostic code upgrades [
18]. In recent years, the academic community has gradually extended its research focus to specific disease areas, analyzing the impact of DRG on medical quality [
19]. These studies provide important references for understanding the policy effects of DRG.
However, current research still faces significant limitations. First, for colorectal cancer—a highly heterogeneous malignancy with complex treatment protocols—systematic evaluations of DRG payment reform’s impact on clinical practices, cost structures, and healthcare quality remain lacking. Second, existing studies predominantly employ policy before-after comparisons or cross-sectional analyses, which fail to adequately control for temporal trends and other confounding factors, potentially introducing bias in policy effect estimation. More importantly, while City Z has achieved full regional coverage, synchronized implementation, and high data transparency in its DRG pilot program, creating an ideal natural experiment setting for rigorous policy evaluation, no study has fully utilized these conditions to apply causal inference methods like Regrettable Difference (RDD) for identifying DRG’s net effects on colorectal cancer outcomes. Therefore, leveraging City Z’s reform practices and RDD methodology to scientifically assess DRG’s influence on colorectal cancer treatment not only helps fill existing research gaps but also provides empirical evidence and policy references for expanding DRG adoption in complex disease management.
3. Methodology
3.1. Dataset
This study investigates the impact of DRG payment reform on public hospitals from the perspectives of medical expenditure, efficiency, and quality. Colorectal cancer, characterized by high global incidence and mortality rates, entails complex treatment pathways and substantial healthcare costs. Accordingly, we utilize inpatient data of colorectal cancer patients from City Z, China, spanning the years 2020 to 2023. Based on the coverage scope of relevant policies, records from primary healthcare institutions, psychiatric hospitals, and cross-regional hospitalizations were excluded. The final sample comprises 66,533 inpatients covered by basic medical insurance.
The selection of outcome variables is informed by the Performance Evaluation Manual for Tertiary Public Hospitals (2023 Edition) published by the National Health Commission of China, along with statistical indicators from the World Health Organization and the World Bank. The length of stay (LOS) is used as the primary indicator of healthcare efficiency. Medical expenditure variables include total expenditure, medical costs paid by medical insurance, out-of-pocket, and expenditures on medications, diagnostics, and consumables. Healthcare quality was measured using the 30-day readmission rate and the mortality rate among low-risk patients. All cost variables are reported in nominal CNY according to the year of hospitalization. We did not adjust expenditures to constant prices because the study period (2020–2023) was relatively short and inflation in medical services was modest.
Control variables include demographic characteristics (e.g., age, gender), institutional attributes (e.g., hospital type, hospital grade, tertiary-A classification), type of medical insurance, and the age-adjusted Charlson Comorbidity Index (CCI). To address heteroscedasticity in regression analysis, all expenditure-related variables are log-transformed using the natural logarithm. To reflect the severity of illness, we adopt the age-adjusted CCI, where higher index scores indicate greater disease severity. Given that disease severity also varies with age, we further adjust the CCI according to age groups using the method proposed by Quan [
20] (see
Appendix A).
Table 1 summarizes and illustrates the variables used in this study.
In defining the low-risk mortality indicator, patients with pathological stage I or II and a CCI score ≤ 1 are classified as low-risk. Considering that severely ill patients with extended hospital stays and high medical costs are not suitable for DRG-based payments, we excluded extreme outliers in both hospitalization days and costs. Additionally, 1% winsorization was applied to the sample for regression analysis. All statistical analyses were performed using Stata/MP version 17.0 (StataCorp LLC, College Station, TX, USA).
Table 2 presents the summary statistics of the main variables in our study. A total of 66,533 colorectal cancer inpatients from City Z between 2020 and 2023 were included as the study sample. The average age of the patients was 65 years, with a predominance of male patients, accounting for 62.69%, which aligns with the known epidemiological characteristics of colorectal cancer. Regarding insurance coverage, 51,285 patients (77.08%) were enrolled in the Urban Employee Basic Medical Insurance (UEBMI) scheme, while 15,248 patients (22.92%) were covered under the Urban and Rural Resident Basic Medical Insurance (URRBMI) scheme. Most patients received treatment in general hospitals, with a high concentration in tertiary care institutions, which constituted 87.44% of the sample. Among these, tertiary-A hospitals accounted for 80.39%, indicating that the majority of patients were treated in high-level healthcare facilities.
3.2. Analysis Strategy
Since the 1990s, regression discontinuity design (RDD) has gradually become one of the key methods for policy evaluation. Given that the probability of adopting the new payment reform in Z City changed completely from 0 to 1, it meets the criteria for a sharp regression discontinuity design. Therefore, this study uses the policy implementation date as the cutoff point and constructs the following regression discontinuity model for empirical analysis:
where Y
i is the dependent variable such as LOS, expenditures, readmission, mortality and so on. And D
i is the treatment group indicator. The running variable d represents the difference between the settlement date and the policy implementation date. The model includes a polynomial function of both the running variable and the treatment indicator. Specifically, D = 1 when d ≥ 0, and D = 0 when d < 0. The coefficient β captures the local average treatment effect at the cutoff, which is the primary estimate of the policy impact. X
i denotes a vector of control variables, and standard errors are clustered at the hospital level. H is bandwidth window around the cutoff, selected by mean squared error minimization. ε
i is error term, clustered at the hospital level. This study primarily adopts a non-parametric estimation approach, applying a triangular kernel weighting function. The optimal bandwidth is selected by minimizing the mean squared error (MSE). For robustness checks, the results of linear parametric regressions are also reported.
To examine the Changes in trends associated with the policy reform, this study employs an Interrupted Time Series (ITS) analysis to estimate both the immediate and sustained effects of the reform. Using 1 January 2022 as the intervention point, an ITS regression model is constructed to analyze the trend in the length of hospital stay before and after the implementation of the DRG-based payment reform. Given that the policy took effect immediately, the model does not account for any time lag in the response. A single-group ITS analysis is conducted using monthly aggregated data, with the regression model specified as follows:
Yi represents the length of hospital stay. β0 denotes the estimated initial level of the outcome variable during the observation period. β1 captures the estimated pre-reform trend in the outcome variable, serving as the baseline slope. β2 reflects the estimated immediate change in the outcome variable at the point of policy intervention, while β3 represents the difference in the slope of the outcome trend before and after the reform. εi denotes the random error term. The Newey-West method is applied to address potential autocorrelation in the error terms.
RDD focuses on local causal effects around the cutoff, while ITS captures broader temporal dynamics. By combining these methods, we provide a more robust assessment of policy impact.
5. Discussion
5.1. The Impact of DRG Reform on the Structure of Medical Expenditure
Data and textual analyses indicate a significant reconfiguration in the distribution of healthcare resource utilization following the implementation of the DRG-based payment system. In particular, the relative proportion of pharmaceutical spending has exhibited a consistent downward trend, whereas the expenditure shares allocated to diagnostic services and medical consumables have shown a gradual increase. To align total healthcare outlays with the expenditure ceilings embedded within the DRG reimbursement framework, clinical practitioners have adopted more stringent prescribing behaviors, actively reducing the use of non-essential medications. This shift is plausibly associated with concurrent systemic policy measures—such as centralized drug procurement programs—that have significantly compressed the profit margins historically linked to pharmaceutical sales. As a result, hospitals have shifted away from drug-centered revenue models, leading to a more structurally balanced allocation of medical expenditure components [
21].
Nonetheless, the downward adjustment in pharmaceutical disbursements has been accompanied by a corresponding rise in the financial burden associated with diagnostic interventions and consumable materials, suggesting a potential reorientation in institutional charging practices. This observed redistribution underscores the imperative for healthcare financing authorities to recalibrate regulatory mechanisms and extend supervision to these increasingly salient expenditure domains.
5.2. The Impact of DRG Reform on Medical Expenditures
The introduction of DRG-based payment mechanisms has played a significant role in incentivizing hospitals to control healthcare costs. Under the constraint of fixed payment, hospitals are required to optimize resource allocation, avoid unnecessary interventions, and eliminate inefficient or redundant services.
Empirical analysis demonstrates that DRG implementation significantly influences various categories of medical expenditures. Notably, it has led to a measurable decrease in total hospitalization costs and reduced the financial burden on the pooled social insurance fund. These outcomes underscore the reform’s effectiveness in curbing excessive medical spending, avoiding overtreatment, and safeguarding the sustainability of the insurance pool.
However, the observed increase in patients’ out-of-pocket spending raises concerns. This trend may be attributable to the relatively severe disease profiles treated post-reform and to provider behaviors such as recommending non-reimbursable drugs or services to compensate for lost revenues. Such dynamics underscore the importance of strengthening oversight of provider practices and aligning hospital incentives with the broader goals of payment reform [
22]. To this end, policymakers should consider expanding the reimbursement scope of the pooled fund and adjusting copayment ratios to prevent undue financial burdens on patients, thereby enhancing the mutual aid function of the insurance system [
23]. In parallel, reforms should strengthen catastrophic health insurance mechanisms by offering additional subsidies for high-cost conditions, thus preventing cost-shifting to vulnerable patients.
Heterogeneity analyses indicate that cost reductions are concentrated in tertiary hospitals and among urban employee insurance beneficiaries. Therefore, differentiated incentive mechanisms should be designed to tailor reform strategies to institutional and population-specific contexts [
24].
Moreover, the current study is limited to a single disease category within one city, which may not reflect the reform’s broader impact. The effectiveness of DRG payment reforms in controlling costs can vary across healthcare systems. In more developed regions, hospitals benefit from stronger managerial capabilities and higher levels of digital infrastructure, allowing them to leverage DRG mechanisms to improve efficiency and reduce overall expenditures. Conversely, in under-resourced areas, the implementation of DRG reforms may face significant barriers, including poor data quality and limited information systems, making it difficult for payment standards to reflect real-world treatment costs accurately. In such contexts, reforms may inadvertently compromise care quality. Hence, a comprehensive and context-sensitive evaluation of cost-containment outcomes is essential [
25].
In conclusion, while DRG-based payment systems hold promise for healthcare cost containment, their ultimate effectiveness hinges on the nuanced details of implementation, such as the appropriateness of payment benchmarks, the administrative capacity of healthcare providers, and the robustness of information systems. Ultimately, cost containment should not be viewed as the end goal of reform. Instead, DRG-based payment systems should serve as managerial instruments and policy levers to guide rational resource allocation and facilitate value-based, precision-oriented healthcare management.
5.3. The Impact of DRG Reform on Healthcare Efficiency
The implementation of DRG payment reform has demonstrated considerable potential in enhancing healthcare delivery efficiency by restructuring provider payment mechanisms. Central to the DRG approach is the use of prospective, case-based payments that incentivize healthcare institutions to optimize service provision and reduce unnecessary utilization of medical resources. In contrast to traditional FFS models—where provider revenue is positively correlated with the volume of procedures and services, often resulting in overutilization and inefficiency—the DRG model establishes predetermined payment rates, thereby encouraging providers to deliver care within budgetary constraints. This mechanism promotes the adoption of streamlined clinical pathways, minimizes redundant diagnostics and treatments, and contributes to more efficient inpatient care management [
26].
Moreover, DRG reform has catalyzed innovation in hospital administration and altered the prevailing incentive structures within healthcare delivery. Faced with tighter financial margins, hospitals are prompted to improve internal management, refine clinical workflows, and strengthen oversight of both service quality and expenditure. These reforms also necessitate enhanced data governance and greater investment in health information systems, fostering greater transparency and accountability in care delivery. Under such a payment regime, healthcare institutions must not only ensure clinical effectiveness but also design cost-effective therapeutic regimens and optimize bed utilization—ultimately reducing average length of stay (LOS) and enhancing overall system efficiency. Notably, the adoption of DRG systems has contributed to the institutionalization of more scientific and standardized performance assessment mechanisms, with average LOS increasingly regarded as a critical indicator of resource consumption and service efficiency.
Nonetheless, realizing the full efficiency potential of DRG-based payment requires overcoming several structural and operational challenges. Chief among them is the need to establish equitable and clinically appropriate reimbursement rates that reflect the true cost of care delivery, particularly for complex or high-risk patient populations. Failure to do so may inadvertently compromise care quality or discourage providers from admitting patients with more severe conditions. Additionally, designing nuanced payment models that balance cost containment with care adequacy remains essential to avoid under-provision and prevent unintended consequences, such as care avoidance or premature discharge. Therefore, the success of DRG reform is contingent not only upon sound policy design but also upon coordinated efforts from government regulators, healthcare institutions, and other stakeholders to support system-wide transformation.
Finally, the ITS analysis reveals an immediate reduction in average LOS following DRG implementation, suggesting short-term efficiency gains. However, this trend gradually reverses over time, highlighting the need for sustained regulatory oversight to mitigate policy fatigue and preserve the long-term efficacy of reform. Continuous monitoring, periodic adjustment of regulatory instruments, and the introduction of supplementary policy measures will be essential to ensure that DRG reforms yield durable improvements in healthcare efficiency and cost control.
5.4. The Impact of DRG Reform on Healthcare Quality
Empirical evidence suggests that the implementation of DRG payment reform has not compromised the quality of care, despite its emphasis on cost containment and efficiency enhancement. Notably, observed reductions in 30-day readmission rates and mortality rates among low-risk patients indicate that the reform has achieved initial success in promoting value-based care without sacrificing clinical outcomes [
22]. Furthermore, the absence of patient selection behavior among hospitals—such as rejecting or avoiding less profitable cases—demonstrates that providers have not pursued short-term financial gains at the expense of equitable patient access [
27].
This outcome may reflect the effectiveness of regulatory oversight by health insurance authorities in China, who have implemented mechanisms to influence provider behavior within a reasonable range while maintaining healthcare quality. Quality indicators have been incorporated into routine performance evaluations and integrated into the assessment frameworks of public hospitals, forming part of a broader accountability system. These efforts serve to continuously monitor clinical practice, reinforce professional standards among healthcare workers, and deter unethical or non-compliant behaviors.
However, the long-term impact of DRG reform on healthcare quality warrants further investigation. As the reform matures, it is essential to ensure that payment mechanisms continue to incentive quality improvement alongside efficiency. Future iterations of the reform must prioritize the design of clinically appropriate reimbursement standards and strengthen quality assurance systems to guard against unintended adverse effects. Sustained monitoring and adaptive regulation will be critical to ensure that hospitals remain focused on patient-centered outcomes, even as they strive for operational efficiency and cost control.
Ultimately, the success of DRG-based payment reform depends not only on its ability to curb excessive spending but also on its capacity to drive improvements in care delivery. When effectively designed and regulated, DRG reform can serve as a catalyst for hospitals to optimize clinical pathways, enhance management practices, and deliver higher-quality care at a sustainable cost.
5.5. Limitations of the Study
This study has several limitations that should be acknowledged. First, The study focus on colorectal cancer patients in a single city (Z City) limits generalizability. Although the sample size is relatively adequate, it lacks horizontal comparisons across different pilot cities and disease types. As a result, the conclusions drawn are context-specific and may not be fully applicable to other diseases or regions. Future research should include multi-site and multi-disease analyses to enhance the external validity and compare with more findings from other diseases or regions, thereby providing more comprehensive evidence for policy evaluation. Second, methodological scope. This study relied on quasi-experimental econometric approaches, namely regression discontinuity design (RDD) and interrupted time series (ITS). While these methods provide credible causal inference in the absence of randomized experiments, they are limited in handling complex nonlinear relationships and high-dimensional interactions. Recent advances in machine learning (e.g., Random Forest, Gradient Boosting) and interpretability tools (e.g., SHAP, LIME) offer promising avenues for capturing heterogeneity in treatment effects and improving predictive performance. Future studies could consider employing more rigorous research designs, such as quasi-experimental approaches or natural experiments, and incorporate data from control cities to generate stronger empirical evidence or integrate econometric and machine learning approaches to strengthen both inference and prediction. Third, choice of outcome measures. We used length of stay (LOS) as the primary indicator of efficiency and defined quality of care based on 30-day readmission and low-risk mortality. These measures are widely applied in health services research and supported by policy practice in China. However, they inevitably capture only certain aspects of efficiency and quality. Other important dimensions, such as patient-reported outcomes, complication rates, and longer-term survival, were not available in our dataset. Future research could incorporate richer quality indicators to provide a more comprehensive evaluation.
5.6. Implications for Payment Reform and Future Research
This study provides empirical support for the effectiveness of DRG-based payment reforms in improving hospital efficiency and controlling medical costs without compromising care quality. The observed reductions in length of stay, medical expenditures, and readmission and mortality rates suggest that DRG implementation can enhance resource utilization and promote value-based care. However, the increase in patients’ out-of-pocket burden and the attenuation of policy effects over time highlight the need for continuous policy refinement and monitoring.
For future payment reform, policymakers should consider strategies to mitigate the financial burden on patients while sustaining cost-control incentives for providers. Differentiated policy designs may be necessary to account for variations across insurance types and hospital tiers, as evidenced by the heterogeneity in outcomes. Moreover, the gradual weakening of reform effects underscores the importance of institutionalizing performance evaluation mechanisms and introducing adaptive policy tools to maintain long-term efficacy.
Future research should expand beyond a single-city, single-disease focus to include multi-regional and multi-disease analyses, integrate control groups where feasible, and explore the behavioral responses of providers and patients over longer time horizons. Such efforts will contribute to a more comprehensive understanding of the dynamic impacts of DRG reform and inform evidence-based policy decisions in China and other developing health systems undergoing similar transitions.