Next Article in Journal
An Extrinsic Enriched Finite Element Method Based on RBFs for the Helmholtz Equation
Previous Article in Journal
Parastrophe of Some Inverse Properties in Quasigroups
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Effect of Bundled Payment Schemes on Cost–Speed Tradeoff for Outpatient Service: A Queueing-Game Analysis

School of Insurance, Guangdong University of Finance, Guangzhou 510521, China
*
Author to whom correspondence should be addressed.
Mathematics 2026, 14(1), 199; https://doi.org/10.3390/math14010199
Submission received: 26 November 2025 / Revised: 21 December 2025 / Accepted: 27 December 2025 / Published: 5 January 2026
(This article belongs to the Section D2: Operations Research and Fuzzy Decision Making)

Abstract

In recent years, payment schemes in healthcare have garnered attention for their potential impact on service delivery and cost management. This paper explores the impact of the bundled payment scheme (BP) on hospital outpatient services, focusing on the cost–speed tradeoff. Specifically, a higher service rate increases patient demand but also raises medical costs. We consider a queueing-game theoretical model to analyze servers’ service rate behaviors under different payment schemes (fee-for-service and BP) and the payer’s optimal payment scheme setting. Our study shows that achieving the first-best outcome under centralized decision making using the BP requires specific conditions. When the medical budget is sufficiently high, the payer can guide hospitals toward the first-best decision by setting an optimal price under the BP. However, when the budget is at an intermediate level, hospitals may set slower equilibrium service rates to control costs. To address this issue, the payer can implement service level regulation based on the BP scheme to achieve the first-best outcome. This scheme encourages hospitals to choose higher service rates by limiting expected waiting times. When the budget is too low, hospitals may be unwilling to provide service due to unprofitability. Moreover, as competition between hospitals intensifies, it becomes easier to maximize social welfare under the BP scheme.

1. Introduction

The aging of the population and the evolution of the medical insurance system have led to an increase in demand for healthcare services, particularly for outpatient services. A key factor influencing the quality and satisfaction of outpatient services is consultation time (consultation time refers to the time that doctors spend on patients while providing outpatient consultation services). Unfortunately, many doctors still spend limited time with patients during consultations. For example, a study analyzing consultation times in 67 countries found that in 18 countries, about 50% of the global population experiences consultations of less than 5 min with primary care physicians [1]. In 2019, a report highlighted that waiting for up to two hours for a two-minute consultation with a doctor had become a major issue in the pediatric department of public hospitals (https://finance.sina.cn/chanjing/gdxw/2019-04-17/detail-ihvhiqax3547366.d.html (accessed on 15 October 2025)). In 2022, at a tertiary comprehensive hospital in Zhejiang Province, China, a doctor, during a 7 h workday, has to conduct consultations for up to 150 patients. The average consultation time allocated to each patient was less than 3 min (https://new.qq.com/rain/a/20220823A0976T00 (accessed on 15 October 2025)).
The short consultation time in outpatient care can reduce patients’ waiting time; however, it may also lead to increased medical costs. Outpatient services cover a wide range of diagnostic tests, prescriptions, treatment services, and minor outpatient surgical procedures when necessary [2]. When healthcare providers do not conduct careful inquiries and examinations due to time constraints, it can lead to a decline in the quality of diagnosis and treatment, including even misdiagnosis. As a result, during follow-up treatment, the service provider may need to perform more high-level medical tests, prescribe more effective drugs and equipment, and carry out more complex surgeries, resulting in higher medical costs for an entire episode of care [3]. Furthermore, Guan et al. discovered a correlation between inappropriate prescriptions and workload, indicating that as the workload increases, the incidence of inappropriate prescriptions is higher [4]. Therefore, the rapid pace of outpatient services poses a high risk of prescribing inappropriately, which may further increase medical costs.
A key factor contributing to short consultation times and high medical costs is the payment scheme used to reimburse healthcare providers. Payment schemes regulate provider behavior, control costs, and improve care quality [5]. The most commonly used medical insurance payment scheme is fee-for-service (FFS). Under FFS, service providers are paid based on the volume of services performed. This payment structure incentivizes providers to treat more patients quickly [6]. While FFS has been effective in reducing waiting times [7], it is also associated with lower service quality and higher medical costs [8].
Bundled payment (BP), a prospective payment, is currently a key medical insurance payment scheme being promoted in China. The principle is that the provider is compensated with one fixed lump sum for the entire episode of care, regardless of the specific tests and procedures conducted or any complications that may arise [9]. Under the BP scheme, hospitals assume all financial risks during the episode of care. If the patient is not cured within the allotted payment, the hospital covers the excess medical costs, while any surplus becomes the hospital’s profit [10]. Therefore, hospitals are motivated to provide high-quality medical services while controlling medical costs. The purpose of this payment scheme is to transfer the responsibility of controlling medical costs and ensuring service quality to hospitals, which would have an impact on the hospital’s operational management. The BP scheme originated in the United States in 1984, and, since then, many countries, including the Netherlands, Sweden, the United Kingdom, Canada, and China, have adopted the BP scheme as part of their medical reforms (https://healthcarefunding.ca/key-issues/bundle-test-2/ (accessed on 16 October 2025)). For instance, Sweden uses BP for total hip and knee replacement, and the National Health Services in the UK uses BP for cystic fibrosis and maternal care [11]. In China, the BP scheme for chronic and special critical diseases in outpatient settings implemented in Liuzhou City, Guangxi, has effectively controlled the unreasonable growth of outpatient medical expenses [12]; the implementation of the Big Data Diagnosis Intervention Packet in Guangzhou, Guangdong, and Diagnosis Related Groups in Shenzhen, Guangdong, have achieved the dual goals of controlling medical insurance fees and incentivizing medical institutions [13]. However, studies have shown that compared with FFS, the BP scheme may increase patients’ waiting time [14].
Under the BP scheme, the hospitals bear the total medical cost for a whole episode of care. As mentioned above, short outpatient consultation times may not only lead to high medical costs, but they also attract more patients at the same time because of the short wait time. As a result, under the BP, hospitals face a cost–speed tradeoff. Despite observed implementation effects in practice, the broader implications and impact of the BP scheme on healthcare operational efficiency, such as how to approach the cost–speed tradeoff, remain unclear. Therefore, many public healthcare systems continue to use the FFS.
To examine the impact of the BP scheme on the equilibrium behavior of hospitals providing outpatient elective care, we develop a three-stage Stackelberg game to characterize the public outpatient service system, which includes a payer, n hospitals, and a population of patients. The payer, acting as a social welfare maximizer or social cost minimizer, selects a payment scheme and designs its parameters. The hospitals consider the reimbursement they receive for providing medical services, including diagnostic services, medical tests, and drug therapy, as well as the total medical cost per patient for an entire episode of care, when making decisions about their service rates. The objective of the hospitals is to maximize profits, with higher service rates leading to higher equilibrium arrival rates and increased medical costs, while patients choose which hospital to seek medical care from based on the expected utility of each hospital, which takes into account the service value and the expected delay cost.
Based on these settings, we constructed a game-theoretical queueing model to capture the three-stage Stackelberg game and the interactions among all three parties. This framework allows us to analyze how hospitals choose service rates under different payment schemes and how patients respond through their hospital choices and compare these decentralized outcomes with a centralized benchmark. On this basis, the main contributions of this paper are summarized as follows:
(1) We incorporate a cost–speed tradeoff into a competitive public outpatient service system and develop a three-stage Stackelberg queueing-game model with one payer, n competing hospitals, and a pool of patients under both FFS and BP schemes. In our model, hospitals’ service rates simultaneously affect patients’ waiting time and the total medical cost for an entire episode of care.
(2) We analytically characterize the equilibrium service rates and patient arrival rates under FFS and BP and compare them with a centralized first-best benchmark in which the payer directly chooses hospitals’ service rates to minimize social cost. We show that FFS always induces excessively high service rates and can never achieve the first-best outcome, whereas the ability of BP to reach the first-best outcome critically depends on the level of the medical budget. We further identify how competition intensity among hospitals and the market size affect the payer’s optimal price and the resulting equilibrium service rates.
(3) Motivated by the limitations of BP, we propose and analyze a new payment mechanism, bundled payment with service level regulation (BPW), in which the payer combines a bundled price with an explicit waiting time guarantee. We derive the optimal price and waiting time guarantee, prove that BPW can implement the first-best outcome when the budget is at an intermediate level, and use numerical examples to illustrate how BPW improves total social cost and waiting performance compared with standard BP.
The remainder of this paper is organized as follows. Section 2 reviews the relevant literature. In Section 3, we introduce our queueing model and the problems of patients, hospitals, and the payer. In Section 4, we examine patients’ and hospitals’ equilibrium behavior and the payer’s pricing strategy under the BP scheme. Section 5 explores the equilibrium behavior of hospitals under the FFS scheme. Section 6 presents an analysis of a “benchmark” scenario—the first-best case and the performance comparison between the first-best case and the FFS and BP schemes. In Section 7, we provide a payment scheme, the BP scheme with service level regulation, which can achieve the first-best outcome. Conclusions are provided in Section 8.

2. Literature Review

Our paper is related to research on queueing economics, specifically focusing on competition in service levels (capacity levels). Hassin and Haviv have conducted extensive comprehensive studies and literature reviews in this field [15,16]. Chen and Wan [17] analyze equilibrium behavior when two identical service providers simultaneously compete on price and capacity. Allon and Federgruen [18] generalize the M/M/1 queueing model to a G/GI/s system, where firms compete based on price and waiting time. Lu et al. [19] build both analytical and empirical models to explore the impact of competition on the service quality of nursing homes under different payer settings. Han et al. [20] study quality competition between two hospitals under a payment scheme set by one insurer. However, our work differs from these studies as we focus on competition related to waiting time (service rate), a critical issue affecting patient choice in public healthcare service systems [21]. Therefore, considering waiting time competition is particularly relevant in outpatient care.
Queueing models are widely used in healthcare service systems to address patient waiting time problems. For example, Yu et al. [22] develop a queueing model to characterize the referral process in China’s tiered hospital system, which includes both general hospitals and community health centers, with a green referral channel based on priority theory. Lim et al. [23] use queueing theory to analyze staffing and operational efficiency in obstetric operating rooms, evaluating the impact of various staffing models on patient waiting times and operating room efficiency. Wang et al. [24] establish a queueing-game-theoretical model to capture the interactions between a comprehensive hospital and a primary hospital, examining the tradeoff between efficiency and quality. Guo et al. [14] use an M/M/1 queueing model with endogenous arrivals and revisits to investigate the impact of different payment schemes on the performance of a public healthcare system consisting of a public funder, a public healthcare provider, and a pool of patients. Their findings suggest that the size of the patient pool is an important factor affecting which payment scheme is more desirable for social welfare, revisit rate, and waiting time. Our contribution to this stream of literature is that we consider the impact of service rates on medical costs when building the queueing model to describe patient waiting times, which is crucial for customer-intensive services like outpatient care.
Our study also contributes to the literature on healthcare operation management and payment schemes. Many scholars have discussed the impact of BP and other payment schemes on healthcare system performance and the behavioral decisions of all parties involved in healthcare services. Adida et al. [9] compare the impact of FFS and BP on doctors’ behavior in patient selection and diagnosis quality. Andritsos and Tang [25] examine the incentive programs—FFS, Pay-for-Performance (P4P), and BP—for reducing readmissions when patient care is co-produced. Han et al. [20] examine quality competition between two hospitals under BP and FFS payment schemes. Ghamat et al. [26] study a healthcare system consisting of four parties (a payer, a hospital, a service provider, and patients) and analyze the impact of a “target price minimum quality” (TPMQ) bundled payment model on the hospital’s profit, the payer’s medical expense, and nursing quality, as well as the optimal conditions for this care coordination. Fan et al. [11] conduct an empirical study using insurance claim data from China to investigate the impact of bundled payment schemes on medical cost, efficiency, quality, and shift of care. Li et al. [27] conduct an experimental analysis of how internal compensation incentives based on medical insurance payment schemes—FFS and Diagnosis Related Group (DRG)—influence physician behavior. The results showed that different compensation structures significantly affect physicians’ service provision behaviors, thereby impacting patient welfare. In contrast to these operational management studies, this paper focuses on the impact of service rate competition on the equilibrium behavior of hospitals, patients, and the payer under the BP scheme, an issue widely observed in healthcare markets under BP [28].
This paper also relates to research on the quality–speed tradeoff in operations management, which considers the relationship between waiting time and service quality. A higher service rate reduces waiting time but simultaneously reduces service quality. Anand et al. [29] build a queueing model to explore the quality–speed tradeoff strategy of a service provider when demand is affected by service quality, waiting time, and price. In contrast to the static model of Anand et al., Kostami and Rajagopalan [30] build a dynamic model in a monopoly environment to describe such a quality–speed tradeoff and discuss the optimal decision of the supplier when customer demand is determined by the service quality of previous suppliers. The above two models assume that customers are rational. In contrast, Li et al. [31] expand the model to allow for boundedly rational customers. Furthermore, Li et al. [32] extend the service rate competition game with boundedly rational customers to the case with a general reward function and multiple servers. In customer-intensive services, Zhang and Wang [33], considering boundedly rational customers and the quality–speed tradeoff, analyze the optimal service rate and service price for revenue maximization and social welfare maximization. Sun et al. [34] consider the tradeoff between improving diagnostic accuracy (service quality) and waiting costs (service speed) in healthcare service systems and explore the impact of blockchain implementation on service quality, service price, and system congestion. In contrast to the aforementioned work on the quality–speed conundrum, we investigate the cost–speed tradeoff for hospitals in a competitive setting. The key difference is that our study focuses on the supply-side tradeoff (hospitals’ decisions related to medical costs and demand), while the quality–speed tradeoff is on the demand side (patients’ decisions related to waiting costs and service quality). Wang et al. [3] also consider the cost–speed tradeoff, specifically in terms of service rates and medical consumption costs in customer-intensive services. However, our research differs in that we focus on whether the BP scheme can achieve the first-best outcome (maximum social welfare) when hospitals face the cost–speed tradeoff. To the best of our knowledge, this is the first study to evaluate the BP scheme based on the relationship between medical costs and service rates.
As shown in Table 1, the comparison between this study and previous related literature highlights the key features and distinctions of the research conducted.

3. Model Setup

3.1. Basic Assumptions

At the beginning of this section, we summarize the main notation used in the model in Table 2.
We consider a public healthcare system that consists of a payer, n identical hospitals denoted by H i ,   i = 1 ,   ,   n , providing outpatient elective services (e.g., hernia repairs, knee and hip replacement), and a pool of homogeneous patients. The payer designs the payment scheme, the hospitals decide the service rates, and patients determine which hospital to seek treatment from. The assumption of homogeneous patients and hospitals is realistic. For example, in Germany and the United States, patients are divided into different diagnosis-related groups according to different symptoms. Patients in the same group need similar medical service resources [35]. This assumption is reasonable and supported by the literature on healthcare using the queueing economy model [14].
The hospitals provide diagnosis and treatment services of a particular disease for patients throughout a whole episode of care, including outpatient consultation services, diagnostic tests, treatment services, and minor surgical procedures [20]. We consider a super-M/M/1 system for each hospital with s physicians/servers. We assume that when s independent and identical single-server systems, each with an arrival rate λ d i , and service rate μ d i , are pooled together, the resulting system of each hospital is another single server system with arrival rate λ i = s λ d i and service rate μ i = s μ d i (refer to [36]). We model potential patient arrivals according to a Poisson process with a maximum rate of Λ . The arrival rate of hospital H i formed by those joining patients is called the initial arrival rate λ i . For developing analytical results and general insights, we consider the simpler super-M/M/1 model. We find that the outpatient service under the BP scheme is consistent with most practical settings. (There exists literature that considers the pooling of s M/M/1 systems as an M/M/s system. However, how to set a specific model mainly depends on the actual coordination mechanism of the servers. If s customers are independently served by s servers at the same time, then the system can be characterized as an M/M/s system. If a single customer is served by s servers jointly, the service configuration is a super-M/M/1 system rather than an M/M/s system). Under the BP scheme, multiple physicians who provide consultation, medical tests, medicine, or other procedures are all for one patient during a whole episode of care. Figure 1 provides a schematic representation of the model.
In this queueing model, hospital H i serves patients on a first-come–first-service (FCFS) basis, and the service takes an exponentially distributed service time at rate s μ d i . The Poisson arrival process and exponential service time have been well-tested in the healthcare operations management literature. For example, Kim et al. [37] empirically verify that the arrival process to a hospital intensive care unit follows a Poisson process, while Guo et al. [14] consider it theoretically. They also find that service time follows an exponential distribution.
Outpatient service is an example of customer-intensive services that require a high level of diligence and attention for good customer service [29]. The hospital/doctor can spend less time with patients to increase demand ([38], p. 27). Anand et al. [29] also propose that service value could be improved by increasing the time spent serving each customer. Therefore, hospitals can adjust servers’ service rates by adjusting the service time per patient, maximizing their profits. In other words, for more profit, hospitals shall induce more/fewer patients to join the queue while offering a shorter/longer service time. The adjustable service rate settings have been applied to a large variety of settings [3]. We assume that the service rates are bounded, i.e., s μ d i μ ¯ for i = 1, …, n, where µ ¯ is the upper bound of the service rate with 0 < μ ¯ < . The rationale behind this assumption is three-fold. First, the case s μ d i = could lead to trivial/nonexistent solutions. Second, in the context of outpatient service problems, it is difficult to conceive of an unlimited service rate. Third, excessive consultation services may result in high misdiagnosis rates, poor service quality, and low patient satisfaction. To ensure high-quality service and sufficient demand, the service rate must not be too high.
Due to the high value of healthcare services and the consideration that all medical expenses are covered by medical insurance, we assume that all patients would opt to enroll in this outpatient service system (or with no balking and reneging) to ensure the accessibility of treatment for all patients. That is, all patients would join and eventually complete their outpatient service for a whole episode of care in the system, i.e., i = 1 n λ i = Λ [39]. This assumption is rational in medical practice, particularly for non-urgent or elective surgical cases. Taking some provinces in Canada as an example, almost all patients waiting for knee or hip replacement surgery join the wait lists of public hospitals, with an extremely low abandonment rate that can be ignored. This is mainly attributed to the fact that knee or hip problems do not pose a direct threat to life, and thus patients usually do not choose to seek faster but more expensive services elsewhere. Moreover, this assumption helps to simplify the analysis and obtain some interesting and important theoretical results in the three-tier service system. This assumption is consistent with public healthcare systems where patients face minimal out-of-pocket costs and have limited alternative providers, especially for non-urgent elective procedures (e.g., knee or hip replacements in Canada) [39]. It simplifies the analysis by ensuring market coverage and allows us to focus on the strategic interaction between payment schemes and hospital behavior. However, in systems where patients bear significant costs or have private alternatives, some may opt out, potentially moderating the demand effects. Our results should thus be interpreted as applicable to publicly funded, comprehensive coverage contexts.
The expected waiting time represents general queuing processes, the expected waiting plus service time in a steady state, which is denoted by W . The physical meaning and calculation method are in line with the classic literature of queueing theory [40]. The expected waiting time of patients at hospital H i satisfies
W i ( μ d i , λ i ) = 1 s μ d i λ i .
To ensure the stability of the service system, we further assume s μ d i > λ i . Because patients’ waiting time cannot be infinitely long and customers will not join a queue if their expected waiting time exceeds their patience time [41], we consider that there is an upper limit to the expected waiting time w ¯ , which is a sufficiently large positive number; therefore, W i ( s μ d i , λ i ) w ¯ .
We adopt the classic Naor model [42] to characterize the utility function of patients in this outpatient service system. Patients receive a reward v after being served and incur a linear waiting cost with rate d when they join the queueing system. Specifically, if the patient joins the service system, after the entire episode of care, the patient will receive a reward v for being cured. We assume that all patients have the same service reward, which represents the perceived value of achieving the service goal. Throughout the outpatient service process, patients receive consultation and treatment services and are discharged from the hospital after being cured. Therefore, no matter which hospital the patient chooses to receive service at, the perceived value of a cured patient is equal. This assumption is consistent with the relevant literature on healthcare service systems (e.g., [39]).
When a patient is waiting in the queue, a linear cost with rate d is incurred. To ensure the model’s tractability, we adopt the conventional assumption that for homogeneous patients, d is a fixed value [43]. In general, waiting cost represents the opportunity cost of the patient in the waiting process, especially in the outpatient service, and it may also reflect the monetary value of the patient’s anxiety and pain before the consultation. Although waiting cost heterogeneity exists, it is not as apparent as the heterogeneity in the benefits of receiving outpatient service. For tractability purposes, we assume that patients are homogeneous in delay sensitivity. For the whole episode of care, we model the patient’s waiting cost as a linear function of waiting time. This assumption is common in the relevant literature on queueing systems (e.g., [14,44]). The patient’s expected waiting time is W , and thus the expected waiting cost (delay cost) can be expressed as d W . We consider that patients’ medical out-of-pocket expenses are reimbursed by medical insurance, which we refer to as a “zero” co-payment. This situation has been studied in the healthcare operations management literature. For example, Qian et al. [44] assume that public healthcare services do not have any user fees but incur a delay cost. Therefore, the patient’s utility function when joining the hospital H i , i = 1,2 ,   ,   n is
u i λ i , s μ d i = v d W i λ i , s μ d i .
The arriving patients decide which hospital to join to maximize their expected utility, that is, the patient’s joining decision. When patients make their joining decisions, they observe the waiting time and anticipate the expected utility of receiving service from each hospital. An implicit assumption is that patients know the expected waiting time [40]. In practice, although patients do not know in advance their actual waiting time, they have an expectation based on past experiences of what the expected waiting time will be at each service provider. Knowledge of the expected waiting time can also be accumulated from experiences of repeated visits and various information channels, such as patient reviews, medical studies, and the news.

3.2. The Hospital’s Problem

Consider the medical costs of patients for the whole episode of care incurred by the hospital. We focus on a context where a longer service time with more inquiry and examination can reduce the total medical cost [3]. In other words, if the hospital provides slow healthcare outpatient service to patients, the probability of misdiagnosis and medical expense for the whole episode of care (e.g., service cost, medical test cost, and prescription cost) will be low. In line with the setting in Wang et al. [3], we model the expected medical cost of a patient for the whole episode of care in H i , i = 1,2 ,   ,   n through a monotonic increasing function of the service rate. (For analytical tractability, although our medical cost model assumes that the medical cost C ( μ ) is linear in service rate μ , our main conclusions and the major insights still hold if an S-shaped logistic function is considered).
C μ d i = C 0 + C u s μ d i
where C 0 > 0 represents the unit benchmark medical cost and C u > 0 is the constant unit medical cost, which represents the rate by which the medical cost changes at the actual service rate. (Although we do not discuss the results of a case where the unit medical cost is negative C u < 0 in this study, meaning that a longer service time results in higher medical costs, our technical analysis carries over to that case, and the conclusions are mostly reversed).
Remark 1. 
Our model assumes that the expected medical cost is a monotonic increasing function of the service rate μ d i , as stated in Equation (3). This relationship is consistent with the healthcare operations literature [3,45,46], particularly Wang et al. [3], who model the expected medical cost as an increasing function of the service rate, where longer service times (i.e., slower service rates) lead to more thorough examination and reduced downstream medical consumption, such as fewer medical tests and prescriptions [3]. We adopt a linear form for analytical tractability, which is a common simplification in the literature. While we acknowledge that the true relationship between the service rate and medical costs may be nonlinear, Wang et al. [3] note that their key findings are robust to nonlinear forms, such as S-shaped logistic functions. This suggests that the primary insights of our model are not highly sensitive to the specific functional form.
The problem of hospital H i is to maximize its profit π i by designing the service rates of servers µ d i . Hospital H i earns income p , referred to as the service price, by providing outpatient services for the whole episode of care. This can be formulated as follows:
m a x μ d i π i ( μ d i ) = ( p C ( μ d i ) ) λ i e ( μ d i , μ d i ) s . t . π i ( μ d i ) 0 , λ i e ( μ d i , μ d i ) + 1 w ¯ s μ d i μ ¯ .
where µ d i is the service rate of servers/physicians in any hospital other than H i and λ i e ( μ d i , μ d i ) is the equilibrium arrival rate of patients at hospital H i .

3.3. The Payer’s Problem

In this outpatient service system, the payer acts as a medical insurance bureau, funder, or social planner who is responsible for making decisions on medical insurance payment schemes, including choosing the appropriate scheme and setting parameters. The goal of the payer is to maximize social welfare. Based on the definition of social welfare in medical services, we define social welfare as the patients’ service reward minus their waiting cost and the hospitals’ medical costs. Combined with the previous assumption of patient reward homogeneity, the patient’s reward is a fixed value. Consequently, the goal of maximizing social welfare is equivalent to minimizing social cost.
The social cost is composed of two parts: (i) the expected waiting cost of patients, i.e., d Λ s μ d i ( p ) λ i e ( p ) , i = 1,2 ,   ,   n , where d is the unit delay cost/unit waiting cost; (ii) the medical costs incurred by the hospitals, i.e., C μ d i Λ . Note that μ d i e p represents the optimal service rate of servers in hospital H i . Taking the payer’s budget B for all patients into consideration [14], the optimization problem of the payer can be formulated as follows:
m i n p S C ( p ) = d Λ s μ d i e ( p ) λ i e ( p ) + Λ C ( μ d i e ( p ) ) s . t . Λ p B .
This paper investigates a three-stage Stackelberg game that involves the payer, n hospitals, and a pool of patients. The sequence of decisions is as follows. First, the payer designs the payment scheme, taking into account the three different payment schemes proposed in the introduction—FFS, BP, and BP with service level regulation (BPW for simplicity in this paper). The objective of the payer is to minimize the social cost. Next, given the payment scheme setting and anticipating patients’ reactions, n hospitals decide their servers’ service rates µ d i simultaneously. The hospitals aim to maximize their respective profits. Finally, based on the service rates, the patients choose one of the hospitals to seek service at to maximize their utilities. Figure 2 describes the game sequence.

4. Bundled Payment

We consider the BP scheme under which hospitals are compensated with a fixed service price for a whole episode of care for a certain disease, regardless of what kind of examinations, tests, and procedures are adopted and irrespective of eventual complications [9]. Each hospital is paid a uniform price set by the payer p B P for providing outpatient services and bears the total medical cost incurred for their respective services. Therefore, for more profits, hospitals have the incentive to minimize their medical costs as much as possible.
We use the backward introduction to analyze the three-stage Stackelberg game. First, patients decide which hospital to seek service at based on their expected waiting time. Second, anticipating the patients’ arrival rates in equilibrium, hospitals determine their service rates simultaneously to maximize their profits in a competitive setting. Finally, we derive the payer’s payment scheme setting that minimizes social cost.

4.1. The Patients’ Joining Decision

From Equation (1), we know that patient demand depends solely on waiting time, meaning that patients choose the hospital with the shortest expected waiting time. We assume that except for hospital H i , n 1 hospitals (unified as H i ) adopt a symmetric strategy s μ d i and the responding arrival rate is λ i . For patients, if the expected waiting times of n services are equal, there is no difference between their choices, leading to W μ d i , λ i = W μ d i , λ i in equilibrium. Therefore, the patients’ equilibrium arrival rates satisfy
W i ( μ d i , λ i ) = W i ( μ d i , λ i ) , λ i + ( n 1 ) λ i = Λ , s μ d i > λ i , s μ d i > λ i , s μ d i μ ¯ , s μ d i μ ¯ , 1 s μ d i λ i w ¯ , 1 s μ d i λ i w ¯ .
The second equation of (6) ensures that all patients will join the public outpatient service system. The values of s μ d i > λ i and s μ d i > λ i are set to ensure the stability of the queueing system. The last inequality of (6) is the regulation of waiting time. The expected waiting time of patients at hospital H i or hospital H i cannot exceed w ¯ .
Denote λ i e and λ i e as the equilibrium arrival rates of hospitals H i and H i , respectively. According to (6), λ i e and λ i e mainly depend on the service rates of the hospitals, the number of the hospitals, and the market size. In Proposition 1, we provide the equilibrium arrival rates for different conditions of the service rate μ d i .
Proposition 1. 
The equilibrium arrival rates λ i e   a n d   λ i e can be given as follows:
(i)
I f μ d i [ 0 , Y 1 μ d i , λ i e = λ i e = 0 ;
(ii)
I f μ d i [ Y 1 μ d i , Y 2 μ d i , λ i e = 0 , λ i e = Λ n 1 ;
(iii)
I f μ d i [ Y 2 μ d i , Y 3 μ d i , λ i e = Λ + s ( n 1 ) μ d i μ d i n , λ i e = Λ + s μ d i s μ d i n ;
(iv)
I f μ d i [ Y 3 μ d i , μ ¯ , λ i e = Λ , λ i e = 0 .
Note. The functions Y 1 μ d i , Y 2 μ d i and Y 3 μ d i   are defined as piecewise functions, which make their expressions relatively complex and lengthy. To maintain the readability and flow of the main text, we provide the detailed definitions and derivations in Appendix A, proof of Proposition 1, and Figure 3. However, its graphics can be found in Figure 3.
Proof. 
See Appendix A. □
Combining Proposition 1 and Figure 3, we find that when the service rate is low enough (case (i)), no patient will be willing to enter the queue to wait for treatment service, that is, λ i e = λ i e = 0 ; when the hospitals’ service rate s μ d i is low enough while the others s μ d i are high (case (ii)), then no patient will enter this hospital to wait for service, and patient demands will be evenly distributed among other n − 1 hospitals, i.e., λ i e = 0 , λ i e = Λ n 1 ; conversely, when the hospital’s service rate is high enough, while the others’ is low (case (iv)), all patients will choose to join because the expected waiting cost will be low enough, i.e., λ i e = Λ , λ i e = 0 ; only when the service rate of all hospitals is high enough (case (iii)) will each hospital have patients choose to join.

4.2. The Hospitals’ Service Rate Decision

The objective of each hospital is to set its service rate for servers to maximize its profit within the competitive setting. From the hospital’s profit function (given by model (4)), we find that under the BP scheme with a fixed service price determined by the payer, the hospital’s profit depends on the medical cost and the equilibrium demand. Recall the situation that when the hospital serves more slowly and spends more time on patient consultation, the total medical cost decreases. However, slow service also results in longer waiting time and decreased demand (Li et al., [31]). Therefore, under the BP scheme with a given service price, the hospitals face a cost–speed tradeoff. The problem of hospital H i can be shown as model (4) in the model’s setup, as follows:
m a x μ d i π i ( μ d i ) = ( p C ( μ d i ) ) λ i e ( μ d i , μ d i ) s . t . π i ( μ d i ) 0 , λ i e ( μ d i , μ d i ) + 1 w ¯ s μ d i μ ¯ .
The equilibrium service rate of each server in hospital H i , μ d i B P , i = 1,2 ,   ,   n for a given service price under the BP is provided in the following proposition.
Proposition 2. 
Under the BP scheme, for a given service price p B P , the optimal service rates of the servers in hospital  H i  are
μ d i B P = 0 i f p B P [ 0 , p 1 ) , Λ s n + 1 s w ¯ i f p B P [ p 1 , p 4 ) , p B P C 0 s C u Λ s ( n 1 ) i f p B P [ p 4 , + ) .
The equilibrium arrival rate of hospital  H i  is
λ i B P = 0                       i f p B P [ 0 , p 1 ) , Λ n                   i f p B P [ p 1 , + ) .
where  p 1 = C 0 + C u w ¯ , p 4 = C 0 + ( 2 n 1 ) C u Λ n ( n 1 )   +   C u w ¯ .
Proof. 
See Appendix A. □
Proposition 2 provides the optimal service rates of servers and equilibrium demands for a given service price p B P . When the service price is low enough, i.e., p B P [ 0 , p 1 , hospitals will not provide services. When the service price is within a moderate range, i.e., p B P [ p 1 , p 4 , hospitals’ strategies are independent of the service price. However, when the price is high , p B P [ p 4 , , the optimal service rates increase the service price and are at least as high as those in the previous case. The rationale for the behavior of hospitals under different service price levels is as follows. When the service price is very low, hospitals are unprofitable and therefore opt to withdraw from the market. When the price is within a moderate range, hospitals would not earn much profit from serving patients. Therefore, the only way for them to be profitable is to slow down the service rate to lower the medical cost. In this case, this results in very high patient waiting times. However, when the price is sufficiently high, hospitals would earn more profit by serving more patients. The optimal strategy in this case is to improve the service rate to shorten the waiting time and attract more patients.
When the service price is sufficiently low, i.e., p B P [ 0 , p 1 , no hospitals are willing to provide healthcare service and therefore no patients will enter the market, i.e., λ i B P = 0 . When the service price is high due to the homogeneous hospitals and focusing on symmetrical equilibrium, all hospitals will provide services and share the total market demand equally λ i B P = Λ n . As the competition intensity n increases, the demand for each hospital decreases.

4.3. The Payer’s Pricing Strategy

Under the BP scheme, the goal of the payer is to minimize the total social cost SC by setting a unified price p B P for the hospitals. Recall Proposition 2, which states that for any p B P [ 0 , p 1 , the marginal profit of the hospitals is negative, and hence no hospitals are willing to provide service; thus, we focus on the situation where p B P [ p 4 , . In addition, according to the value range of the price, the budget needs to be high enough, i.e., B [ Λ p 1 , + . Therefore, combining the problem of the payer (5) and the equilibrium strategy of the hospital μ d i B P in Proposition 2, the specific optimization model of the payer can be given by
m i n p B P S C ( p B P ) s . t . p 1 p B P B Λ .
where
S C ( p B P ) = d Λ w ¯ + ( C 0 + C u ( Λ n + 1 w ¯ ) ) Λ i f p B P [ p 1 , p 4 ) , d Λ p B P C 0 C u ( 2 n 1 ) Λ n ( n 1 ) + ( p B P C u Λ n 1 ) Λ i f p B P [ p 4 , + ) .
By solving the model (9), we can obtain the following Proposition 3.
Proposition 3. 
Under the BP scheme, the optimal service price of the payer that minimizes social cost  p ~ B P and the corresponding equilibrium service rates  μ ~ d i B P  can be given as follows:
p ~ B P = B Λ i f B [ Λ p 1 , B 1 ) , C 0 + C u ( d C u + ( 2 n 1 ) Λ n ( n 1 ) ) i f B [ B 1 , + ) .
μ ~ d i B P = Λ s n + 1 s w ¯   if B [ Λ p 1 , B 0 ) , B Λ C 0 s C u Λ s ( n 1 ) if B [ B 0 , B 1 ) , 1 s d C u + Λ s n i f B [ B 1 , + ) .
where  B 0 = Λ p 4 = Λ ( C 0 + C u Λ ( 2 n 1 ) n ( n 1 ) + C u w ¯ ) , B 1 = Λ ( C 0 + C u ( d C u + ( 2 n 1 ) Λ n ( n 1 ) ) ) .
Proof of Proposition 3. 
When p B P [ p 1 , p 4 ) , the social cost function S C ( p B P ) is not affected by p B P and is larger than that for p B P [ p 4 , + ) due to sufficiently large w ¯ ; therefore, we next focus on the case p B P [ p 4 , + ) . In this case, the second-derivative of the social cost function is
2 S C ( p B P ) 2 = 2 d Λ C u 2 [ p B P C 0 C u Λ n ( n 1 ) ] 3 > 0 .
Hence, S C ( p B P ) is convex in p B P . The first-order condition S C p B P = d Λ C u [ p B P C 0 C u Λ n ( n 1 ) ] 2 + Λ = 0 leads to p B P = C 0 + C u ( d C u + ( 2 n 1 ) Λ n ( n 1 ) ) , noted by p d , and so we have p d > p 4 .
By considering the budget constraint Λ p B P B , p B P [ p 4 , B Λ ) . Next, we analyze the optimal price based on the relationship between p d and B Λ . When B < Λ p d (below, we note that B 1 = Λ p d ), S C ( p B P ) is strictly decreasing in p B P ; thus, the optimal price is p ~ B P = B Λ and the corresponding equilibrium service rate   μ ~ d i B P = μ d i B P ( B Λ ) = B Λ C 0 s C u Λ s ( n 1 ) ; when B B 1 , S C ( p B P ) is first decreasing and then increasing in p B P , and thus p ~ B P = p d = C 0 + C u ( d C u + ( 2 n 1 ) Λ n ( n 1 ) ) , μ ~ d i B P = μ d i B P ( p d ) = 1 s d C u + Λ s n . □
We can further observe the impact of parameters (the number of hospitals n , the market size Λ , and the unit medical cost C u ) on equilibrium results (the optimal service price p ~ B P and service rates of the hospitals μ ~ i B P = s μ ~ d i B P ). We formalize our discussion in Corollary 1 and Table 3.
Corollary 1. 
Under the BP scheme,
As the number of hospitals n increases, the optimal service price is unaffected if the budget is low  B [ Λ p 1 , B 1  and decreases if the budget is relatively high  B [ B 1 , + .
As the market size  Λ  increases, the optimal service price decreases if the budget is low  B [ Λ p 1 , B 1  and increases if  B [ B 1 , + ;
As the unit medical cost  C u  increases, the optimal service price is unaffected if the budget is low  B [ Λ p 1 , B 1  and increases if  B [ B 1 , + .
Proof. 
See Appendix A. □
According to case (i) of Corollary 1 and Table 3, the impact of the number of hospitals n on equilibrium results is closely related to the budget. When the budget is low B [ Λ p 1 , B 1 , the payer sets a service price that is not influenced by n, because the pricing is constrained by the budget. However, as n increases, for low budget B [ Λ p 1 , B 0 , hospitals will provide slower services to achieve positive profits while still meeting the basic waiting time constraint; for relatively high budget B [ B 0 , B 1 , hospitals will choose higher service rates to compete for demand, as this behavior reduces patient waiting costs. When the budget is high enough B [ B 1 , + , as n increases, the greater the competition, and the payer sets a lower service price to control the service rate of hospitals, thereby reducing the hospitals’ total medical costs.
Consider case (ii) next. When the budget is low B [ Λ p 1 , B 1 , the impact of Λ on medical costs dominates that of waiting costs. Hence, as Λ increases, the payer sets a lower price to control the hospitals’ service rate, thereby reducing medical costs. However, as n increases, for B [ Λ p 1 , B 0 , the hospitals would choose higher service rates for achieving the basic waiting time guarantee and reducing the waiting cost; for B [ B 0 , B 1 , the hospitals would serve more slowly to reduce medical costs. When the budget is high enough B [ B 1 , + , the impact of Λ on the waiting cost dominates that of the medical cost. Hence, as the market size increases, to control the total waiting cost of the patients, the payer would set a high price to encourage the hospitals to increase the service rate.
From case (iii) and Table 3, when the budget is low B [ Λ p 1 , B 1 , the pricing is constrained by the budget, and thus the optimal price setting is only related to the unit budget B Λ , regardless of the unit medical cost C u . When the budget is high, the pricing is no longer limited by the budget, and thus as the unit medical cost C u becomes higher, the payer would set a higher price to ensure the profit of the hospitals. Moreover, for a low enough budget B [ Λ p 1 , B 0 , the equilibrium service rate is not affected by the unit medical cost; for B [ B 0 , + , the equilibrium service rate decreases because to control the total medical cost, the hospitals will be willing to spend more time on consulting patients.

5. Fee-for-Service

After analyzing the BP scheme, we now turn to examine the fee-for-service (FFS) scheme, where patients’ equilibrium arrival rates remain the same as in the BP case, as shown in Proposition 1. Next, we would like to examine the equilibrium service rate decision of the hospitals under the FFS scheme.
The FFS scheme is currently the predominant and influential payment scheme in healthcare system, under which the payment depends on service item and the number of services. Assuming that the hospitals have a profit margin of α ∈ [0, 1] on the total medical cost [26], i.e., p = 1 + α C μ i . Therefore, along with (3), the problem of the hospital H i is to maximize the profit by designing service rates for its servers, which can be formulated as follows:
m a x μ d i π i ( μ d i ) = α C ( μ d i ) λ i e ( μ d i , μ d i ) s . t . π i ( μ d i ) 0 , λ i e ( μ d i , μ d i ) + 1 w ¯ s μ d i μ ¯ .
Solving problem (12) leads to an optimal service rate for the hospital, as shown in Proposition 4.
Proposition 4. 
Under the FFS, the optimal service rate of the servers in hospital  H i  is  μ d i F F S = μ ¯ s .
Proof of Proposition 4. 
Following from that, both the medical cost function C ( μ d i ) and the equilibrium demand function λ i e ( μ d i ) are strictly increasing in μ d i , and therefore the optimal service rate is the maximum, μ d i F F S = μ ¯ . □
Proposition 4 shows that under the FFS scheme, the hospitals will choose the maximum service rate. Because the service rate has the same positive effect on medical cost and demand, the higher the hospital’s service rate, the higher the misdiagnosis rate and the higher the follow-up medical cost, and, at the same time, the greater the patients’ demand because of the lower waiting cost. Therefore, it follows that the higher the total medical cost, the greater the hospital’s profit. As a result, the hospitals will provide higher service rates for more demand under FFS.
Having established the optimal service rate under the FFS scheme, we next conduct a comparative analysis of service rates and system performance between the FFS and BP schemes.
Proposition 5. 
μ ~ d i F F S > μ d i B P ,W( μ ~ d i F F S ) < W( μ ~ d i B P ), C ( μ ~ d i F F S ) > C ( μ ~ d i B P ) , S C F F S S C B P .
Proof of Proposition 5. 
Under FFS, o p t i m a l   s e r v i c e   r a t e   μ d i F F S = μ ¯ which is the upper bound of the service rate. However, the optimal service rate under the BP scheme is a piecewise function. I f B [ Λ p 1 , B 0 ) , μ ~ d i B P = Λ s n + 1 s w ¯ ; i f B [ B 0 , B 1 ) , μ ~ d i B P = B Λ C 0 s C u Λ s ( n 1 ) ; i f B [ B 1 , + ) , μ ~ d i B P = 1 s d C u + Λ s n , where w ¯ is an upper limit of the expected waiting time.
Because the waiting time is W ( μ d i , λ i ) = 1 s μ d i λ i , a larger μ d i leads to a shorter waiting time, so W( μ ~ d i F F S ) < W( μ ~ d i B P ); because the expected medical cost C μ d i = C 0 + C u s μ d i , a larger μ d i leads to a larger medical cost C ( μ ~ d i F F S ) > C ( μ ~ d i B P ) ; because the social cost S C = d Λ W ( μ d i , λ i ) + C μ d i ,   i f B [ Λ p 1 , B 0 ) , the waiting cost under BP is d Λ w ¯ , and the medical cost under FFS is also the upper bound, so it can be approximated that two social costs are equal and both are very large, i . e . , S C F F S = S C B P ; i f B [ B 0 , + ) , the social cost under BP is a fixed positive value, but it is very large under FFS, and thus S C F F S > S C B P . □
Proposition 5 shows that FFS induces hospitals to choose higher service rates, resulting in shorter waiting times but higher medical and social costs, whereas BP internalizes medical costs and leads to lower overall inefficiency. The result highlights that a higher service rate does not necessarily imply greater efficiency, and cost-internalizing payment schemes, such as BP, are more effective in balancing service rates and cost controls.

6. First-Best Case and Performance Comparison

To better explore the impact of different payment schemes on the operational efficiency of the outpatient service system, we establish a “benchmark” as a comparison standard under ideal conditions, which is called the first-best case. In this section, we first analyze the optimal service rates of each server in hospitals in the first-best case and then compare the results with those under different schemes.

6.1. First-Best Case

In the first-best case, the entire outpatient service system is fully owned by the payer in a centralized public outpatient service system, and the payer sets the optimal service rates for the servers to minimize social costs. Based on the previous assumptions of homogeneous hospitals and no balking behavior of patients, the payer equally distributes the patients’ demand, that is, the arrival rate of patients to each hospital is Λ n . According to the definition of social costs in model setup, the optimization problem of the payer is
m i n μ d i S C ( μ d i ) = d Λ s μ d i Λ n + C ( μ d i ) Λ s . t . Λ n + 1 w ¯ s μ d i μ ¯ .
By solving the optimization model (13), we have the first-best optimal service rate of each server in hospital H i , μ d i f , as shown in the following proposition.
Proposition 6. 
The first-best solution is to set  μ d i f = 1 s d C u + Λ s n .
Proof of Proposition 6. 
According to the optimization model (13), the first-derivative and the second-derivative of the social cost function SC with respect to μ d i are
S C μ d i = s d Λ ( s μ d i Λ n ) 2 + s C u Λ , 2 S C μ d i 2 = 2 s 2 d Λ ( s μ d i Λ n ) 3 .
Because 2 s 2 d Λ ( s μ d i Λ n ) 3 > 0 , S C ( μ d i ) is a convex function with respect to μ d i , the first-order condition leads to μ d i = 1 s d C u + Λ s n and the value is in the domain [ Λ s n + 1 s w ¯ , μ ¯ s ] ; hence, the first-best solution μ d i f = 1 s d C u + Λ s n . □
Proposition 6 shows that the first-best service rate decreases in the number of hospitals and the unit medical cost while it increases in the unit waiting cost and the market size. The results are driven by the impact of the service rate on the medical cost and the waiting cost; as the provider increases the service rate, the medical cost increases, and the waiting cost decreases. Therefore, the larger the number of hospitals or the higher the unit medical cost, the payer sets a lower service rate for the servers to control the medical cost. As the unit waiting cost or the market size increases, the payer would set a higher service rate for the servers to reduce the total waiting cost.

6.2. Performance Comparison

In this subsection, we examine whether the first-best outcome can be achieved under the FFS and BP schemes by comparing the equilibrium service rates of the hospitals under the above two schemes and the first-best solution.
Proposition 7. 
If  B [ Λ p 1 , B 1 ) , μ ~ d i B P < μ d i f < μ d i F F S ; i f B [ B 1 , + ) , μ ~ d i B P = μ d i f < μ d i F F S .
Proof of Proposition 7. 
If B [ Λ p 1 , B 1 ) , μ d i B P μ d i f = ( B Λ C 0 s C u Λ s ( n 1 ) ) 1 s d C u + Λ s n . Let f ( p B P ) = p B P C 0 s C u Λ s ( n 1 ) , then μ d i B P = f ( B Λ ) , μ d i f = f ( p d ) , and f ( p B P ) is strictly increasing in p B P . Because B < Λ p d , we have μ d i B P μ d i f = f ( B Λ ) f ( p d ) < 0 . Thus, μ d i B P < μ d i f < μ d i F F S .
If B [ B 1 , + ) , μ ~ d i B P = μ d i f = d C u + Λ n < μ d i F F S .□
Proposition 7 reveals that under the FFS payment scheme, the first-best outcome cannot be implemented. In addition, higher healthcare service rate brings high medical costs with the FFS scheme, which is consistent with Calsyn and Lee [8]. For the BP scheme, it turns out that only when the budget is high enough B [ B 1 , + can the payer regulate the hospitals’ service rate decision to the first-best solution by adjusting the price. However, when the budget is low B [ Λ p 1 , B 1 , the service rate under the BP scheme is slower than the first-best solution.
Compared with BP, we find that FFS is effective in increasing the service rate (i.e., μ ~ d i B P < μ d i F F S ) and reducing waiting time, which is consistent with the results shown in Guo et al. [14]. In other words, the BP scheme leads to a slower service rate of the servers and more waiting time for patients. At the same time, it also realizes lower medical costs for the whole episode of care because the hospital pays more time and patience to serve patients, improving the efficiency and quality of diagnosis and treatment. To minimize the total waiting cost and medical cost, in the next section, we design a coordination mechanism, the BP with service level regulation (BPW), to adjust the hospitals’ behavior.

7. Bundled Payment with Service Level Regulation

For many service industries, providers often advertise a kind of service level regulation, the expected waiting time guarantee of their services [47]. For example, for non-emergency medical services, many countries have implemented some form of a maximum waiting time. Taking Canada as an example, in 2005, provinces agreed to establish common waiting time guarantees nationwide to reduce patient waiting times [48]. Ameritrade promises that trade takes no more than 10 s to be executed. Some call centers guarantee that customers can be serviced within an hour. Some studies characterize this waiting time guarantee through the queueing model [47,49]. Therefore, in line with the above commonly observed phenomenon regarding the waiting time guarantee, we assume that the expected waiting time of patients needs to be shorter than a waiting time guarantee w 0 , and it is set by the payer. Because w ¯ is a sufficiently large positive number, and thus w 0 < w ¯ .
Combining the waiting time guarantee setting and the basic assumptions, the patients’ equilibrium arrival rates satisfy the following condition:
W ( μ d i , λ i ) = W ( μ d i , λ i ) , λ i + ( n 1 ) λ i = Λ , s μ d i > λ i , s μ d i > λ i , s μ d i μ ¯ , s μ d i μ ¯ , 1 s μ d i λ i w 0 , 1 s μ d i λ i w 0 .
Let λ i e w and λ i e w denote the equilibrium arrival rates of hospitals H i and H i under the BPW scheme, respectively. By deducing the above model, we have the λ i e w and λ i e w specified in following Proposition 8.
Proposition 8. 
Under the BPW scheme, the equilibrium arrival rates of the hospital  H i  and the other n − 1 hospitals  H i  are as follows:
(i)
I f μ d i [ 0 , Y 1 w ( μ d i ) ) , λ i e w = λ i e w = 0 ;
(ii)
i f μ d i [ Y 1 w ( μ d i ) , Y 2 w ( μ d i ) ) , λ i e w = 0 , λ i e w = Λ n 1 ;
(iii)
i f μ d i [ Y 2 w ( μ d i ) , Y 3 w ( μ d i ) ) , λ i e w = Λ + s ( n 1 ) ( μ d i μ d i ) n , λ i e w = Λ + s μ d i s μ d i n ;
(iv)
i f μ d i [ Y 3 w ( μ d i ) , μ ¯ ) , λ i e w = Λ , λ i e w = 0 .
Proof. 
Note: The definitions of Y 1 w ( μ d i ) , Y 2 w ( μ d i ) ,   a n d   Y 3 w ( μ d i ) ) are presented in Appendix A. □
Compared with the BP without service level regulation, under the BP with service level regulation, the equilibrium demands of the patients are the same, while the regional boundaries are different. Due to the higher requirement for the expected waiting time, only when the server’s service rate is higher will patients choose to join the service.

7.1. The Hospitals’ Service Rate Decision

After observing the given service price p w B P and the waiting time guarantee w 0 , the hospitals maximize their profits by determining the service rates. Given the n − 1 hospitals’ service rates decisions μ d i and the corresponding arrival rate λ i e w , the hospital H i ’s maximization problem is
m a x μ d i π i ( μ d i ) = ( p w B P C 0 C u · s μ d i ) λ i e w ( μ d i , μ d i ) s . t . π i ( μ d i ) 0 , λ i e w ( μ d i , μ d i ) + 1 w 0 s μ d i μ ¯ .
Applying the same analytical method to that under the BP scheme, we can derive the equilibrium under BPW, μ d i B P w , and the corresponding equilibrium arrival rate λ i e w as follows.
Proposition 9. 
Under BPW, the equilibrium service rate of the servers in hospital  H i   is
μ d i B P w = 0 i f p w B P [ 0 , p 1 w ) , Λ s n + 1 s w 0 i f p w B P [ p 1 w , p 4 w ) , p w B P C 0 s C u Λ s ( n 1 ) i f p w B P [ p 4 w , + ) .
the corresponding equilibrium arrival rate is
λ i e w = 0 i f p w B P [ 0 , p 1 w ) , Λ n i f p w B P [ p 1 w , + ) .
where  p 1 w = C 0 + C u w 0 , p 4 w = C 0 + ( 2 n 1 ) C u Λ n ( n 1 ) + C u w 0 .
Proof. 
See Appendix A. □
Compared with the BP (Proposition 3), under the BPW, the equilibrium service rate of the servers in hospital H i is different in the following ways. (1) The service price interval changes, and the equilibrium service rates shift to the right according to the segment interval of the price ( p 1 w > p 1 , p 4 w > p 4 ). It can be found that the payer needs to provide a higher service price and the hospitals are willing to provide outpatient services. The reason is that because of the setting of the waiting time guarantee, the hospitals need to increase the service rate to meet the guarantee, which leads to higher medical costs. Therefore, the hospitals will only provide services when the price is higher. (2) When the price is low enough, p w B P [ 0 , p 1 w ) , the equilibrium service rate is higher ( Λ s n + 1 s w 0 > Λ s n + 1 s w ¯ ). Due to the regulation of waiting time, the hospitals need to improve the service rate while guaranteeing non-negative profits. In this case, the waiting time of the patients just meets the guarantee w 0 .

7.2. The Payer’s Price and Waiting Time Guarantee Strategy

Under the BPW, the payer designs the service price p w B P and waiting time guarantee w 0 to minimize the total social cost. The total social cost includes the delay cost of the patients and the medical cost of the hospitals. We focus on cases where the hospitals are willing to provide services, that is, when the service price is high enough, i.e., p w B P [ p 1 w , + ) . Accordingly, we consider that the budget is high enough, i.e., B [ B 1 w , + ) , where B 1 w = Λ p 1 w . In this case, substituting the equilibrium service rates μ d i B P w in Proposition 8 into the model (5), we have the problem of the payer as follows:
M i n p w B P , w 0 S C ( p w B P , w 0 ) s . t . p 1 w p w B P B Λ , w 0 0 .
where the social cost function is
S C ( p w B P , w 0 ) = d Λ w 0 + ( C 0 + C u ( Λ n + 1 w 0 ) ) Λ i f p w B P [ p 1 w , p 4 w ) , d Λ p w B P C 0 C u ( 2 n 1 ) Λ n ( n 1 ) + ( p w B P C u Λ n 1 ) Λ i f p w B P [ p 4 w , + ) .
We solve the payer’s social cost minimization problem in two steps. First, we find the optimal service price p w B P ( w 0 ) for a given waiting time guarantee w 0 . Then, using p w B P ( w 0 ) , we find the optimal waiting time guarantee, which minimizes the social cost w 0 * , and further the optimal service price p ~ w B P . The following proposition presents the outcomes.
Proposition 10. 
Under the BPW, the payer’s optimal strategies for service price and waiting time guarantee  ( p ~ w B P , w 0 * ) , and the corresponding service rate  μ ~ d i B P w  satisfies the following:
i f t h e   b u d g e t B B 1 w , B 1 , w 0 * = C u d , p ~ w B P = C 0 + d C u , μ d i B P w = Λ s n + 1 s d C u ;
i f t h e b u d g e t B [ B 1 , + ) , w 0 * [ C u d , + ) , p ~ w B P = C 0 + C u ( d C u + ( 2 n 1 ) Λ n ( n 1 ) ) ,   μ ~ d i B P w = Λ s n + 1 s d C u .
Proof. 
See Appendix A. □
Proposition 10 indicates that under the BPW scheme, the optimal strategy is related to the medical budget. When the budget is immediate, the optimal waiting time guarantee increases in unit medical cost and decreases in patient delay sensitivity. This is because as the unit medical cost increases, the payer sets more a lenient waiting time guarantee to relax constraints on the hospital service rates. This encourages the hospitals to provide slower services to control overall medical costs. When patients are more sensitive to waiting times, the payer needs to set a stricter time guarantee to incentivize higher service rates, thereby enhancing patient satisfaction with waiting times. With a higher budget, there is a broader range of choices for optimal waiting time guarantee setting. However, from the perspective of patient satisfaction with waiting time, the same value as the immediate budget scenario can be set. In this case, patients can maximize their waiting time savings. The optimal service price under the immediate budget is lower than that under the high budget because the lower budget restricts setting of the price. Therefore, the payer needs to carefully consider the medical budget when establishing healthcare payment schemes.
Comparing the optimal service rates of the servers under the three payment schemes (FFS, BP, BPW) and the first-best service rate, we have the result in the following proposition.
Proposition 11. 
I f B [ B 1 w , B 1 ) , μ d i F F S > μ ~ d i B P w = μ d i f > μ ~ d i B P ; i f B [ B 1 , + ) , μ d i F F S > μ ~ d i B P w = μ ~ d i B P = μ d i f .
Proof of Proposition 11. 
When B [ B 1 w , B 1 ) ,
μ ~ d i B P = B Λ C 0 s C u Λ s ( n 1 ) < B 1 Λ C 0 s C u Λ s ( n 1 ) = p d C 0 s C u Λ s ( n 1 ) = Λ s n + 1 s d C u = μ ~ d i B P w = μ d i f < μ ¯ s = μ d i F F S
when B [ B 1 , + ) , μ ~ d i B P = μ ~ d i B P w = Λ s n + 1 s d C u = μ d i f < μ ¯ s = μ d i F F S . □
Proposition 11 shows that regardless of the budget value, the optimal service rate of the servers under the FFS is the highest. The reason behind this is as follows. Under the FFS, the hospitals obtain income according to a certain proportion of the total medical cost, and hence the hospitals will aim to maximize the service rate to attract patient demand and increase medical costs to maximize revenue. Under the BP, the optimal service rate is lower than the first-best outcome when the medical budget is in the middle, and it achieves the first-best service rate when the budget is high enough. Under the BPW, the optimal service rates of hospitals can achieve optimal social welfare outcomes.
To facilitate our exposition, we numerically illustrate the comparison results here—the comparison of optimal service rates, waiting cost, medical cost, and social cost under first-best, BP, and BPW. Because the extreme results under FFS are easy to compare, the numerical analysis does not discuss the FFS case. From our numerical analysis, shown in Figure 4a and Proposition 11, we observe the optimal service rates’ comparison.
When the budget is immediate with B [ B 1 w , B 1 ) , the hospitals are motivated to provide services under the BPW, and the first-best outcome can be picked. In addition, the optimal service rate is higher than that under the BP scheme. The corresponding social cost under BP is higher than that under first-best and BPW. The reasons behind this can be explained by Figure 4c,d. Under BP, the hospitals provide slower services. Although the medical cost is relatively low, the waiting cost is much higher than that under BPW because the setting of the waiting time regulation can urge the hospitals to improve the service rate under BPW. Therefore, the BPW scheme dominates the BP scheme in terms of the social cost and the waiting cost. This result indicates that for the payer, reducing patient waiting time and waiting cost and improving patient satisfaction should be some of the important indicators for evaluating bundled payment schemes.
When the budget is high enough with B [ B 1 , + ) , the optimal service rates under both two BP schemes (BP and BPW) are the same, which is equal to the first-best service rate (Figure 4a). At the same time, the medical cost and waiting cost of the social costs under both BP schemes are the same as those in the first-best case (Figure 4b–d). The reason is that in this case, the price interval is wider. Even if there is no service level regulation, the payer can stimulate the hospitals to improve the service rate by increasing the service price.
To further assess the robustness of our results, we conduct additional numerical experiments by varying key system parameters over wider ranges, including the total potential arrival rate Λ , the number of hospitals n, the unit waiting cost d, and so on. The qualitative patterns observed in Figure 4 remain unchanged across all tested parameter ranges. For brevity, the detailed robustness results are reported in Appendix B.
According to Proposition 11, we have the optimal payment scheme strategy of the payer shown in Theorem 1.
Theorem 1. 
When the budget  B [ B 1 w , B 1 ) , the payer can implement the first-best outcome only by introducing the BPW; when the budget  B [ B 1 , + ) , the first-best outcome can be achieved by the BP scheme. However, the first-best outcome cannot be achieved when  B [ 0 , B 1 w ) .
Proof of Theorem 1. 
Following from Proposition 9, we know that when B [ 0 , B 1 w ) , no hospitals will provide service, and thus the first-best solution cannot be picked; when B [ B 1 w , B 1 ) , under the BP with service level regulation, by setting p w B P = C 0 + d C u and w 0 = w 0 * , the first-best solution can be picked; when B [ B 1 , + ) , under the BP scheme, by setting p B P = p d , the first-best outcome can be picked. □
From Theorem 1 and Figure 5, we find that when the budget is low enough B [ 0 , B 1 w ) , the hospitals are not profitable because the service price is too low, and hence no hospital is willing to provide service. In this case, the first-best solution cannot be picked. When the budget is intermediate, from Proposition 11, under the BP scheme, the payer sets a uniform price, which will make the service rate too slow. Meanwhile, under the BPW scheme, the payer sets the price p ~ w B P = C 0 + d C u and sets waiting time constraint w 0 * = C u d to encourage the hospitals to increase the service rate to achieve the first-best outcome. When the budget is high enough, the payer can implement the first-best outcome through the BP scheme because the price can be set higher to induce a high service rate. Our study provides management insight for the payer when making payment scheme decisions.
Next, we analyze the impact of the parameters on the boundaries B 1 and B 1 w . Recall that B 1 = Λ [ C 0 + C u ( d C u + ( 2 n 1 ) Λ n ( n 1 ) ) ] , B 1 w = Λ ( C 0 + C u w 0 ) . Let Δ B = B 1 B 1 w denote the difference between the two boundaries, which represents the area where the first-best solution cannot be implemented with only the BP scheme and the BP with service level regulation is required.
Corollary 2. 
For the number of hospitals n, B 1  and  Δ B  are decreasing in  n   a n d   B 1 w  is independent in  n , i.e., B 1 n < 0 , B 1 w n = 0 , Δ B n < 0 ;
For the market size  Λ , B 1 , B 1 w a n d Δ B  are increasing in Λ, i.e., B 1 Λ > 0 , B 1 w Λ > 0 , Δ B Λ > 0 .
Corollary 2 reveals that as the number of hospitals n increases, the budget boundary that the payer needs for added wait time regulation to implement the first-best outcome B 1 decreases, the budget boundary of the hospital’s willingness to serve B 1 w keeps constant, and the area where only by adding service level regulation based on the BP scheme can first-best outcomes be implemented ∆B decreases. The reasons can be given as follows. Under the BP scheme with an intermediate budget B [ B 1 w , B 1 ] , as the competition becomes greater, the hospitals would increase the service rate and the first-best service rate becomes smaller; hence, in such cases, the first-best outcome can be picked for the relatively high budget in [ B 1 w , B 1 ] . That is, B 1 decreases. In addition, this also means that under the BP scheme, it is easier to realize the first-best outcomes, which also corresponds to the conclusion that the area that needs added waiting time regulation is smaller.
Corollary 2 shows that as the market size Λ increases, two budget boundaries and the area where the BPW can implement the first-best outcomes increase. Because under the BP scheme with an intermediate budget B [ B 1 w , B 1 ] , as the market size becomes larger, the hospitals would serve more slowly and the first-best service rate becomes larger; hence, in such cases, it is more difficult to realize the first-best outcomes, which also corresponds to the conclusion that the area that needs added waiting time regulation is larger.

8. Conclusions

This paper investigates the impact of the FFS and BP schemes on service rate decisions of hospitals and the performance of the service system. Hospitals face a cost–speed tradeoff when making decisions, where accelerating service rates increases medical costs for the whole episode of care but can attract more patient demand. In such a situation, we focus on how to configure healthcare payment schemes to achieve the first-best outcome—a benchmark scenario that maximizes social welfare ideally. The objective is to explore the inherent mechanisms and performance of the BP scheme, aiming to offer theoretical guidance for operational management strategies for the payer and hospitals. To the best of our knowledge, this paper is the first to explore the impact of the BP scheme on hospital service rates and system performance from an operations management perspective, considering the dependency of medical costs on service rates.
This paper employs a game-theoretic queueing model to investigate a Stakelberg game involving three participants: (1) the payer, responsible for deciding the payment scheme; (2) hospitals, which determine their service rates in a competitive environment under different schemes; and (3) patients, who choose one hospital for treatment. By comparing the performance of FFS, BP, and the first-best case, the study yields the following managerial insights. First, as expected, FFS is unlikely to be an optimal payment scheme, as rapid increases in service rates lead to lower quality of care and higher medical costs. Furthermore, whether the BP scheme can achieve the minimum sum of medical and waiting costs is mainly related to the medical budget. When the budget is low, hospitals are unwilling to provide services, because the payer cannot set a price that allows hospitals to be profitable. When the budget is moderate, the hospitals slow down service rates to control marginal medical costs, resulting in lower medical costs but higher and more significant waiting costs for patients. In such cases, we propose a coordination mechanism—BP with service level regulation—whereby setting guarantees for average waiting time can achieve optimal service rates, minimizing social costs. However, when the budget is sufficiently high, the payer can achieve the same incentivizing effect by setting a uniform optimal service price under the BP scheme. This is because a high budget corresponds to a high service price, achieving an optimal service rate that minimizes overall social costs. Therefore, when formulating a healthcare payment scheme, the payer needs to carefully consider the healthcare budget.
In addition, several factors influence the service rate strategies of the hospitals under BP, including the intensity of competition and market size. The results reveal that intense competition facilitates a more straightforward achievement of the first-best outcome under BP. Intense competition leads to higher service rates, and the socially optimal service rate is slower, even with a lower budget, making it possible to minimize social costs. On the other hand, a larger market size hinders BP from achieving the first-best outcome, necessitating a higher budget for hospitals to be willing to provide services and incentivize hospitals to reach the optimal service rate.
There are several possible directions along which the models in the paper can be extended. Our paper implicitly assumes risk neutrality, meaning that the hospitals aim to maximize expected profits. While this assumption is standard in the operations management literature, extending our analysis to the risk-averse utility functions of the hospitals would be useful. We assume no information asymmetry. Considering asymmetric information, where the hospitals may have better information about medical costs and patient waiting time than the payer, is also a valuable extension to our work. Moreover, our model assumes that all patients join the system, which is a reasonable approximation for publicly funded, non-urgent elective care systems with minimal patient out-of-pocket costs. In systems where patients face significant financial or waiting time barriers, patient attrition could moderate demand responses and affect equilibrium outcomes. Future work could incorporate balking or reneging assumptions to further generalize the analysis. Lastly, while we use a linear cost function for simplicity, we acknowledge that the true relationship between service rate and medical cost may be nonlinear in some settings. Nonlinearities could shift quantitative outcomes but are unlikely to alter the key qualitative conclusions of the model.

Author Contributions

X.L.: conceptualization, methodology, formal analysis, writing the original draft, reviewing and editing the subsequent drafts, and funding acquisition. M.F.: validation and reviewing and editing the drafts. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (72201071) and the General Program of Guangdong Basic and Applied Basic Research Foundation (2023A1515011837).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Proof

Proof of Proposition 1 and Figure 3. 
Following from 1 s μ d i λ i = 1 s μ d i λ i and λ i + ( n 1 ) λ i = Λ , we have     λ i = Λ + s ( n 1 ) ( μ d i μ d i ) n , λ i = Λ + s μ d i s μ d i n .
Note that λ i a = Λ + s ( n 1 ) ( μ d i μ d i ) n , λ i a = Λ + s μ d i s μ d i n . There are three cases for different values of the service rate μ d i . From the model setup, we have μ i = s μ d i , μ i = s μ d i . To simplify the derivation process, the service rate μ i shall be analyzed first, and then the target service rate of servers shall be displayed.
(1)
If μ i Λ n 1 + μ i , we have λ i a 0 , and hence λ i e = 0 and λ i e = Λ n 1 . For other conditions, because the waiting time’s upper limit ( 1 μ i λ i w ¯ ) is more stringent than the system’s stable condition μ i > λ i , we just analyze the waiting time’s upper limit. In this case, the waiting time condition 1 μ i Λ n 1 w ¯ can be rewritten as μ i Λ n 1 + 1 w ¯ . Hence, the comprehensive conditions of this case are μ i μ i Λ n 1 , μ i Λ n 1 + 1 w ¯ , μ i μ ¯ , μ i μ ¯ .
(2)
If μ i > μ i Λ n 1 and μ i < Λ + μ i , we have 0 < λ i a < Λ ; hence, λ i e = λ i a , λ i e = λ i a . In this case, the waiting time conditions 1 μ i λ i a w ¯ and 1 μ i λ i a w ¯ can be unified and rewritten as μ i n w ¯ + Λ ( n 1 ) μ i . Hence, the comprehensive conditions of this case are max { μ i Λ n 1 , n w ¯ + Λ ( n 1 ) μ i } μ i < Λ + μ i , μ i μ ¯ , μ i μ ¯ .
(3)
If μ i Λ + μ i , we have λ i a Λ ; hence,   λ i e = Λ , λ i e = 0 . In this case, 1 μ i Λ w ¯ can be rewritten as μ i 1 w ¯ + Λ . Hence, the comprehensive conditions of this case are μ i max { Λ + μ i , 1 w ¯ + Λ } , μ i μ ¯ , μ i μ ¯ .
Let L 1 μ i = μ i Λ n 1 , L 2 μ i = μ i + Λ , L 3 μ i = n w ¯ + Λ ( n 1 ) μ i . Because μ ¯ is a sufficiently large positive number, we have L 3 μ i < μ ¯ . Note that
Y 1 μ i = Λ + 1 w ¯   if   μ i [ 0 , 1 w ¯ ) , L 3 μ i   if   μ i [ 1 w ¯ , Λ n 1 + 1 w ¯ ) , 0   if   μ i [ Λ n 1 + 1 w ¯ , μ ¯ ] .
Y 2 μ i = Λ + 1 w ¯   if   μ i [ 0 , 1 w ¯ ) , L 3 μ i   if   μ i [ 1 w ¯ , Λ n 1 + 1 w ¯ ) , min L 1 μ i , μ ¯   if   μ i [ Λ n 1 + 1 w ¯ , μ ¯ ] .
Y 3 μ i = Λ + 1 w ¯   if   μ i [ 0 , 1 w ¯ ) , min L 2 μ i , μ ¯   if   μ i [ 1 w ¯ , μ ¯ ] .
we have Y 1 μ d i = Y 1 μ i s , Y 2 μ d i = Y 2 μ i s , Y 3 μ d i = Y 3 μ i s . According to the definition of Y 1 , Y 2 , Y 3 , and μ ¯ , we have Y 1 μ d i < Y 2 μ d i < Y 3 μ d i < μ ¯ s .
In all, by integrating the conditions of μ d i and μ d i , we can obtain the equilibrium arrival rates of the patients at hospitals H i and H i , i = 1,2 ,   ,   n , as follows:
(i)
I f μ d i [ 0 , Y 1 μ d i , λ i e = λ i e = 0 ;
(ii)
I f μ d i [ Y 1 μ d i , Y 2 μ d i , λ i e = 0 , λ i e = Λ n 1 ;
(iii)
I f μ d i [ Y 2 μ d i , Y 3 μ d i , λ i e = Λ + s n 1 μ d i μ d i n , λ i e = Λ + s μ d i s μ d i n ;
(iv)
I f μ d i [ Y 3 μ d i , μ ¯ , λ i e = Λ , λ i e = 0 .
Proof of Proposition 2. 
We first analyze the best response strategy of H i when other n − 1 hospitals adopt service rate strategy μ i , μ i * ( μ i ) . Then, we calculate the equilibrium service rates of the hospitals.
Lemma A1. 
The best response of the hospital  H i , μ i * ( μ i ) , is a piecewise function in four cases with different values of the service price  p B P (Figure A1):
  when   p B P [ 0 , p 1 , μ i * μ i = 0 ;
  when   p B P [ p 1 , p 3 , μ i * μ i = 0     i f   μ i [ 0 , μ i g , Λ ( n 1 ) μ i + n w ¯   i f   μ i [ μ i g , μ i a , μ i 2 + p B P C 0 2 C u Λ 2 ( n 1 )   i f   μ i [ μ i a , μ i b ) , 0     i f   μ i [ μ i b , μ ¯ ) .
  when   p B P [ p 3 , p 2 ) , μ i * μ i = Λ + 1 w ¯     i f   μ i [ 0 , 1 w ¯ ) , Λ ( n 1 ) μ i + n w ¯ i f   μ i [ 1 w ¯ , μ i a , μ i 2 + p B P C 0 2 C u Λ 2 ( n 1 )   i f   μ i [ μ i a , μ i b ) ,   0           i f   μ i [ μ i b , μ ¯ ) .
  when   p B P [ p 2 , + ) , μ i * μ i = Λ + 1 w ¯   i f   μ i [ 0 , 1 w ¯ ) , μ i + Λ     i f   μ i [ 1 w ¯ , μ i c ) , μ i 2 + p B P C 0 2 C u Λ 2 ( n 1 ) i f   μ i [ μ i c , μ i b ) , 0   i f   μ i [ μ i b , μ ¯ ) .
where  p 1 = C 0 + C u w ¯ , p 2 = C 0 + ( 2 n 1 ) C u Λ n 1 + C u w ¯ , p 3 = C 0 + C u Λ + C u w ¯ , μ i a = Λ n 1 p B P C 0 ( 2 n 1 ) C u + 2 n ( 2 n 1 ) w ¯ , μ i b = p B P C 0 C u + Λ n 1 , μ i c = p B P C 0 C u ( 2 n 1 ) Λ n 1 , μ i g = Λ n 1 p B P C 0 ( n 1 ) C u + n ( n 1 ) w ¯ , and we have  p 1 < p 3 < p 2 , μ i c < μ i b , μ i g μ i a μ i b   for   p B P [ p 1 , + . Note that  μ i * = 0  means the hospital  H i  would quit the market.
Figure A1. Best response curves.
Figure A1. Best response curves.
Mathematics 14 00199 g0a1
Proof of Lemma A1. 
For given n − 1 hospitals’ service rates μ i , the optimization model of H i , i = 1,2 ,   ,   n , can be shown as
max π i ( μ i ) = ( p B P C ( μ i ) ) λ i e ( μ i , μ i ) s . t . π i ( μ i ) 0 , λ i e ( μ i , μ i ) + 1 w ¯ μ i μ ¯ .                                                                                                
For the other n − 1 hospitals’ service rates μ i , the profit function of hospital H i when all hospitals have patients choose to join is
π i ( μ i ) = ( p B P C 0 C u μ i ) Λ + ( n 1 ) ( μ i μ i ) n .
Because 2 π i ( μ i ) μ i 2 = 2 C u n 1 n < 0 , π i ~ ( μ i ) is concave in μ i . Note that μ i d ( μ i ) is H i ’s best response, which maximizes the profit π i ~ ( μ i ) . According to the first-order condition (FOC), μ i d ( μ i ) = μ i 2 + p B P C 0 2 C u Λ 2 ( n 1 ) .
According to Proposition 1, the equilibrium arrival rates in cases (ii)–(iv), the profit function of H i can be shown as
π i μ i = 0 μ i Y 1 μ i , Y 2 μ i , π i ~ μ i μ i Y 2 μ i , Y 3 μ i , π i T μ i μ i Y 3 μ i , μ ¯ .
where π i T ( μ i ) = ( p B P C 0 C u μ i ) Λ . Because π i ~ ( Y 2 ) = 0 , π i ~ ( Y 3 ) = π i T ( Y 3 ) , it follows that π i ( μ i ) is continuous in μ i .
Note that μ i a , μ i b and μ i c are the intersections of μ i d ( μ i ) with three boundaries (L3, L1, and L2, respectively), μ i g = L 3 1 ( L 1 ( μ i b ) ) . Through calculation, we can obtain
μ i a = Λ n 1 p B P C 0 ( 2 n 1 ) C u + 2 n ( 2 n 1 ) w ¯ , μ i b = p B P C 0 C u + Λ n 1 , μ i c = p B P C 0 C u ( 2 n 1 ) Λ n 1 , μ i g = Λ n 1 p B P C 0 ( n 1 ) C u + n ( n 1 ) w ¯ .
Note that p 1 = C 0 + C u w ¯ , p 2 = C 0 + ( 2 n 1 ) C u Λ n 1 + C u w ¯ , p 3 = C 0 + C u Λ + C u w ¯ . For p B P [ p 1 , + ) , we have   p 1 < p 3 < p 2 , μ i c < μ i b , μ i g μ i a μ i b .
Next, we analyze the best response of H i for different p B P .
Equation (A1) When p B P [ 0 , p 1 ) , μ i b < Λ n 1 + 1 w ¯ , the H i ’s marginal profit p B P C 0 C u μ i < 0 ; therefore, for any μ i [ 0 , + ) , the best response of H i in this case is to quit the market.
Equation (A1a) When p B P [ p 1 , p 3 ) (see Figure A1a), μ i b Λ n 1 + 1 w ¯ , μ i a [ 1 w ¯ , Λ n 1 + 1 w ¯ ) , μ i g [ 1 w ¯ , μ i a ] ; therefore, when μ i [ 0 , μ i g ) , for μ i min { Λ + 1 w ¯ , Λ ( n 1 ) μ i + n w ¯ } , the marginal profit p B P C 0 C u μ i < 0 ; therefore, for any μ i [ 0 , + ) , the best response of H i in this case is to quit the market.
When μ i [ μ i g , μ i a ) , μ i d ( μ i ) < L 3 ( μ i ) < L 2 ( μ i ) , which means π i ~ ( μ i ) is strictly decreasing in μ i [ L 3 , L 2 ) = [ Λ ( n 1 ) μ i + n w ¯ , μ i + Λ ) . Moreover, for μ i > μ i + Λ , π i T is also strictly decreasing in μ i [ μ i + Λ , + ) . Hence,
m a x Λ ( n 1 ) μ i + n w ¯ μ i < μ i + Λ π i ( μ i ) = π i ( Λ ( n 1 ) μ i + n w ¯ ) > π i T ( μ i + Λ ) = m a x μ i + Λ μ i π i ( μ i )
therefore, the best response of H i in this case is L 3 ( μ i ) = Λ ( n 1 ) μ i + n w ¯ .
when μ i [ μ i a , μ i b ) , max { L 1 ( μ i ) , L 3 ( μ i ) } < μ I d ( μ i ) < L 2 ( μ i ) , and thus
max 0 μ i < max { Λ ( n 1 ) μ i + n w ¯ , μ i Λ n 1 } π i ( μ i ) = 0 < max max { Λ ( n 1 ) μ i + 1 w ¯ , μ i Λ n 1 } μ i < μ i + Λ π i ( μ i ) = π i ( μ i d ( μ i ) ) > π i T ( μ i + Λ ) = max μ i + Λ μ i π i ( μ i )
therefore, the best response of H i is μ i d ( μ i ) .
When μ i [ μ i b , + ) , μ i d ( μ i ) < L 1 ( μ i ) < L 2 ( μ i ) , for p B P [ μ i Λ n 1 , + ) , H i ’s marginal profit p B P C 0 C u μ i < 0 . Therefore, the best response of H i in this case is to quit the market.
Equation (A1b) When p B P [ p 3 , p 2 ) (see Figure A1b), μ i b Λ n 1 + 1 w ¯ , μ i a [ 1 w ¯ , Λ n 1 + 1 w ¯ ) , μ i g ( , 1 w ¯ ) ; therefore, when μ i [ 0 , 1 w ¯ ) , for μ i [ Λ + 1 w ¯ , + ) , π i T is also strictly decreasing in μ i ; hence, the best response of H i in this case is Λ + 1 w ¯ .
When μ i [ 1 w ¯ , μ i a ) , μ i d ( μ i ) < L 3 ( μ i ) < L 2 ( μ i ) , which means that for μ i [ Λ ( n 1 ) μ i + n w ¯ , μ i + Λ ) , π i ~ ( μ i ) is strictly decreasing in μ i . Moreover, for μ i [ μ i + Λ , + ) , π i T is also strictly decreasing in μ i . Hence,
m a x Λ ( n 1 ) μ i + n w ¯ μ i < μ i + Λ π i ( μ i ) = π i ( Λ ( n 1 ) μ i + n w ¯ ) > π i T ( μ i + Λ ) = m a x μ i + Λ μ i π i ( μ i )
therefore, the best response of H i in this case is L 3 ( μ i ) = Λ ( n 1 ) μ i + n w ¯ .
When μ i [ μ i a , + ) , the same analysis as in case (A1a), the best response of H i is μ i d ( μ i ) for μ i [ μ i a , μ i b ) , and H i quits the market for μ i [ μ i b , + ) .
Equation (A1c) When p B P [ p 2 , + ) (see Figure A1c) , μ i b Λ n 1 + 1 w ¯ , μ i a < 1 w ¯ , μ i c 0 ; therefore, when μ i [ 0 , 1 w ¯ ) , for μ i [ Λ + 1 w ¯ , + ) , π i T is also strictly decreasing in μ i . Hence, the best response of H i in this case is Λ + 1 w ¯ .
When μ i [ 1 w ¯ , μ i c ) , L 3 ( μ i ) < L 2 ( μ i ) < μ i d ( μ i ) , which means that for μ i [ Λ ( n 1 ) μ i + n w ¯ , μ i + Λ ) ,   π i ~ ( μ i ) is strictly increasing in μ i . Moreover, for μ i ( Λ + μ i , + ) , π i T ( μ i ) is decreasing in μ i ; hence,
m a x Λ ( n 1 ) μ i + n w ¯ μ i < μ i + Λ π i ( μ i ) = π i ( μ i + Λ ) = π i T ( μ i + Λ ) = m a x μ i + Λ μ i π i ( μ i )
therefore, the best response of H i is L 2 ( μ i ) = μ i + Λ .
When μ i [ μ i c , + ) , the same analysis as in the case μ i [ μ i a , + ) , the best response of H i is μ i d ( μ i ) for μ i [ μ i c , μ i b ) , and H i quits the market for μ i [ μ i b , + ) .
This completes the Proof of the Lemma A1. □
Next, we calculate the optimal service rates of the hospitals. Because n hospitals are identical, we focus on the symmetric equilibrium for them; hence, we can analyze the equilibrium service rate decision of the hospitals by finding the intersection of the best response curve of H i and line μ i = μ i .
Note that μ i h is the intersection of μ i = μ i with μ i = L 3 ( μ i ) ; hence, μ i h = Λ n + 1 w ¯ . Let p a = C 0 + ( 2 n 1 ) C u Λ n ( n 1 ) , p 4 = C 0 + ( 2 n 1 ) C u Λ n ( n 1 ) + C u w ¯ . According to the Lemma A1, we analyze the equilibrium point for different p B P .
(1) When p B P [ 0 , p 1 ) , all hospitals quit the market; this is the unique Nash equilibrium.
(2) When p B P [ p 1 , min { p 3 , p a } ) , μ i b > Λ n 1 + 1 w ¯ , μ i a μ i h , L 1 ( μ i b ) < Λ + 1 w ¯ , the best response of H i is the result of case (A1a) in Lemma A1 (Figure A1a); hence, there are two intersection points. One is μ i = μ i = 0 , and another one is at the intersection of μ i = Λ ( n 1 ) μ i + n w ¯ and μ i = μ i . It follows that μ i = μ i = Λ n + 1 w ¯ . The profit is positive when the hospitals adopt the strategy Λ n + 1 w ¯ ; hence, μ i = μ i = Λ n + 1 w ¯ is the unique equilibrium.
(3) When p B P [ min { p 3 , p a } , p a ) , μ i b > Λ n 1 + 1 w ¯ , μ i a μ i h , L 1 ( μ i b ) Λ , the best response of H i is the result of case (A1b) in Lemma A1 (Figure A1b); hence, the equilibrium point is at the intersection of μ i = Λ ( n 1 ) μ i + n w ¯ and μ i = μ i . It follows that, in equilibrium, the hospitals choose the service rate Λ n + 1 w ¯ .
(4) When p B P [ p a , max { p 3 , p a } ) , μ i b > Λ n 1 + 1 w ¯ , 1 w ¯ μ i a < μ i h , L 1 ( μ i b ) < Λ , the best response of H i is the result of case (A1a) in Lemma A1 (Figure A1a); hence, there are two intersection points. One is μ i = μ i = 0 , and another one is at the intersections of μ i d ( μ i ) and μ i = μ i , which brings positive profits to the hospitals. Hence, in equilibrium, the hospitals decide the service rate p B P C 0 C u Λ n 1 .
(5) When p B P [ max { p 3 , p a } , p 2 ) , μ i b > Λ n 1 , 1 w ¯ μ i a < μ i h , L 1 ( μ i b ) Λ , the best response of H i is the result of case (A1b) in Lemma A1 (Figure A1b); hence, the equilibrium point is at the intersection of μ i d ( μ i ) and μ i = μ i . It follows that, in equilibrium, the hospitals choose the service rate p B P C 0 C u Λ n 1 .
(6) When p B P [ p 2 , + ) , μ i b > Λ n 1 + 1 w ¯ , μ i a < 1 w ¯ , the best response of H i is the result of case (A1c) in Lemma A1 (Figure A1c); hence, the equilibrium point is at the intersection of μ i d ( μ i ) and μ i = μ i . It follows that, in equilibrium, the hospitals choose the service rate p B P C 0 C u Λ n 1 .
Together with the stability conditions, the following results can be derived. When the service price p B P [ 0 , p 1 ) , no hospitals are willing to provide service, and hence the equilibrium service rate of the hospital μ d i B P = 0 , i = 1,2 ,   ,   n ; when the service price   p B P [ p 1 , p 4 ) , μ d i B P = Λ s n + 1 s w ¯ ; when p B P [ p 4 , + ) , μ d i B P = p B P C 0 s C u Λ s ( n 1 ) . According to Proposition 2, substituting μ d i B P into λ i e yields the equilibrium arrival rate λ i B P ; when p B P [ 0 , p 1 ) , λ i B P = 0 ; when p B P [ p 1 , + ) , λ i B P = Λ n . □
Proof of Corollary 1. 
For the optimal solution ( p ~ B P , μ ~ d i B P ) in Proposition 3, we calculate the first derivative with respect to the parameters ( n , Λ , C u ) . It is not difficult to judge the sign of the first derivative, and the results are shown in Corollary 1. □
Proof of Proposition 8. 
We use the same method of the proof of Proposition 2. L 3 w ( μ i ) = n w 0 + Λ ( n 1 ) μ i , as μ ¯ is a sufficiently large positive number, and we have L 3 w ( μ i ) < μ ¯ . Note that
Y 1 w ( μ i ) = Λ + 1 w 0                           i f μ i [ 0 , 1 w 0 ) , L 3 w μ i           i f μ i [ 1 w 0 , Λ n 1 + 1 w 0 ) , 0       i f μ i [ Λ n 1 + 1 w 0 , μ ¯ ] . Y 2 w ( μ i ) = Λ + 1 w 0 i f μ i [ 0 , 1 w 0 ) , L 3 w ( μ i ) i f μ i [ 1 w 0 , Λ n 1 + 1 w 0 ) , min { L 1 ( μ i ) , μ ¯ } i f μ i [ Λ n 1 + 1 w 0 , μ ¯ ] . Y 3 w ( μ i ) = Λ + 1 w 0 i f μ i [ 0 , 1 w 0 ) , min { L 2 ( μ i ) , μ ¯ } i f μ i [ 1 w 0 , μ ¯ ] .
We have Y 1 w μ d i = Y 1 w μ i s , Y 2 w μ d i = Y 2 w μ i s , Y 3 w μ d i = Y 3 w μ i s . According to the definition of Y 1 w , Y 2 w , Y 3 w , and μ ¯ , we have Y 1 w μ d i < Y 2 w μ d i < Y 3 w μ d i < μ ¯ s .
By integrating the conditions of μ d i and μ d i , we can obtain the equilibrium arrival rates of the patients at hospitals H i and H i , i = 1,2 ,   ,   n , as follows:
(i)
I f μ d i [ 0 , Y 1 w μ d i , λ i e = λ i e = 0 ;
(ii)
I f μ d i [ Y 1 w μ d i , Y 2 w μ d i , λ i e = 0 , λ i e = Λ n 1 ;
(iii)
I f μ d i [ Y 2 w μ d i , Y 3 w μ d i , λ i e = Λ + s n 1 μ d i μ d i n , λ i e = Λ + s μ d i s μ d i n ;
(iv)
I f μ d i [ Y 3 w μ d i , μ ¯ , λ i e = Λ , λ i e = 0 .
Proof of Proposition 9. 
For the BP with service level regulation, we use the same analysis method as the BP scheme case. First, we analyze the optimal service rate decision of hospital H i given the remaining n − 1 hospitals’ service rates, and we obtain the best response of H i , μ i * w ( μ i ) as follows.
w h e n p w B P [ 0 , p 1 w ) , μ i * w ( μ i ) = 0 ;
w h e n p w B P [ p 1 w , p 3 w ) , μ i * w ( μ i ) = 0 i f μ i [ 0 , μ i g w ) , Λ ( n 1 ) μ i + n w 0 i f μ i [ μ i g w , μ i a w ) , μ i 2 + p w B P C 0 2 C u Λ 2 ( n 1 ) i f μ i [ μ i a w , μ i b ) , 0 i f μ i [ μ i b , μ ¯ ] .
w h e n p w B P [ p 3 w , p 2 w ) , μ i * w ( μ i ) = Λ + 1 w 0   if μ i [ 0 , 1 w 0 ) , Λ ( n 1 ) μ i + n w 0 if μ i [ 1 w 0 , μ i a w ) , μ i 2 + p w B P C 0 2 C u Λ 2 ( n 1 )   if μ i [ μ i a w , μ i b ) , 0   if μ i [ μ i b , μ ¯ ] .
w h e n p w B P [ p 2 w , + ) , μ i * w ( μ i ) = Λ + 1 w 0 if μ i [ 0 , 1 w 0 ) , μ i + Λ if μ i [ 1 w 0 , μ i c ) , μ i 2 + p w B P C 0 2 C u Λ 2 ( n 1 )   if μ i [ μ i c , μ i b ) , 0   if μ i [ μ i b , μ ¯ ] .
where p 1 w = C 0 + C u w 0 , p 2 w = C 0 + ( 2 n 1 ) C u Λ n 1 + C u w 0 , p 3 w = C 0 + C u Λ + C u w 0 , μ i a w = Λ n 1 p w B P C 0 ( 2 n 1 ) C u + 2 n ( 2 n 1 ) w 0 , μ i g w = Λ n 1 p w B P C 0 ( n 1 ) C u + n ( n 1 ) w 0 .
Next, we find the intersections of the best responses of H i and the line μ i = μ i and μ i = s μ d i to analyze the equilibrium service rates of the servers, the same as the case with the BP scheme. Then, we can obtain the result, as shown in Proposition 9. □
Proof of Proposition 10. 
We solve the social cost minimization problem of the payer through two steps. (1) First, we calculate the optimal service price p w B P ( w 0 ) for a given waiting time guarantee w0. (2) Next, using p w B P ( w 0 ) , we find the optimal waiting time guarantee w 0 * and the optimal service price p ~ w B P .
(1) For the given waiting time guarantee w0, from the optimization model (12), we know that the social cost is fixed when p w B P [ p 1 w , p 4 w ) ; hence, we first analyze the case p w B P [ p 4 w , + ) . In this case, the social cost function is the same as the case under the BP scheme because the second-derivative of the social cost function in the service price 2 S C ( p w B P ) 2 > 0 , S C is a convex function in p w B P . l i m p w B P + S C p w B P = Λ , S C is first decreasing and then increasing in p w B P or strictly increasing in p w B P .
The FOC leads to p w B P = p d = C 0 + C u ( d C u + ( 2 n 1 ) Λ n ( n 1 ) ) .
Next, we analyze the optimal service price for a given waiting time guarantee by the relationship between p d and p 4 w .
When w 0 > C u d , p d > p 4 w ; hence, the social cost function S C ( p w B P ) is first decreasing and then increasing in p w B P for p w B P [ p 4 w , + ) . In this case, if the unit budget B Λ [ p 1 w , p d ) , the optimal price for the given waiting time guarantee is B Λ ; if B Λ [ p d , + ) and the optimal price for the given waiting time guarantee is p d .
When   w 0 C u d , p d p 4 w ; hence, the social cost function   S C ( p w B P ) is strictly increasing in p w B P for p w B P [ p 4 w , + ) . In such a case, for any B Λ [ p 1 w , + ) , the optimal price for the given waiting time guarantee is p 1 w ( w 0 ) = C 0 + C u w 0 .
(2) Next, substituting the optimal service price for w0 into the optimization model (12), we have the social cost function as follows:
when   w 0 C u d , S C ( w 0 ) = S C a ( w 0 ) ; when   w 0 > C u d , S C ( w 0 ) = S C b , i f B Λ [ p 1 w , p d ) S C c , i f B Λ [ p d , + ) .
where
S C a ( w 0 ) = d Λ w 0 + [ C 0 + C u ( Λ n + 1 w 0 ) ] Λ , S C b = S C ( p w B P = B Λ ) = d Λ B Λ C 0 C u ( 2 n 1 ) Λ n ( n 1 ) + [ C 0 + C u ( B Λ C 0 C u Λ n 1 ) ] Λ , S C c = S C ( p w B P = p d ) = d Λ p d C 0 C u ( 2 n 1 ) Λ n ( n 1 ) + [ C 0 + C u ( p d C 0 C u Λ n 1 ) ] Λ .
For w 0 C u d , following from S C a w 0 = d Λ C u Λ w 0 2 < 0 and   2 S C a w 0 2 > 0 , we know that S C ( w 0 ) is decreasingly convex in w 0 ; hence, the minimal social cost in this case is   S C ( C u d ) .
For w 0 > C u d , both SCb and SCc are independent of w0. Because p d is the first-order solution, S C b = S C ( B Λ ) > S C ( p d ) = S C c . In addition, because   S C a ( w 0 = C u d ) = S C c ; hence, S C b > S C a ( w 0 = C u d ) .
In all, for B [ Λ p 1 w , Λ p d ) , we have S C b > S C a ( w 0 = C u d ) ; hence, the optimal waiting time guarantee is w 0 * = C u d , and the corresponding optimal service price   p ~ w B P = C 0 + d C u ; for B [ Λ p d , + ) , w 0 * [ C u d , + ) , and p ~ w B P = p d = C 0 + C u ( d C u + ( 2 n 1 ) Λ n ( n 1 ) ) . □

Appendix B. Robustness Analysis with Respect to System Parameters

In this appendix, we examine the robustness of the numerical results reported in Figure 4 by varying key system parameters over wider ranges. Because the service rate and the medical cost exhibit consistent monotonic patterns, and the social cost already incorporates both waiting costs and medical costs, we mainly report the results for the service rate and the social cost to avoid redundancy and for brevity. Specifically, we repeat the numerical experiments for different values of the total potential arrival rate Λ , the number of hospitals n, and the patients’ waiting cost parameter d while keeping all other parameters unchanged, as in the baseline settings in Section 7.

Appendix B.1. Robustness with Respect to Λ

Figure A2. Robustness analysis with respect to the total potential arrival rate Λ. Panel (a) reports the optimal service rates under different payment schemes when Λ = 0.1; panel (b) reports the corresponding social costs when Λ = 0.1; panel (c) reports the optimal service rates when Λ = 10; panel (d) reports the social costs when Λ = 10.
Figure A2. Robustness analysis with respect to the total potential arrival rate Λ. Panel (a) reports the optimal service rates under different payment schemes when Λ = 0.1; panel (b) reports the corresponding social costs when Λ = 0.1; panel (c) reports the optimal service rates when Λ = 10; panel (d) reports the social costs when Λ = 10.
Mathematics 14 00199 g0a2
The results show that scaling the market size does not alter the relative efficiency ranking among different payment schemes.

Appendix B.2. Robustness with Respect to n

Figure A3. Robustness analysis with respect to the number of hospitals n. Panel (a) reports the optimal service rates under different payment schemes when n = 2; panel (b) reports the corresponding social costs when n = 2; panel (c) reports the optimal service rates when n = 10; panel (d) reports the social costs when n = 10.
Figure A3. Robustness analysis with respect to the number of hospitals n. Panel (a) reports the optimal service rates under different payment schemes when n = 2; panel (b) reports the corresponding social costs when n = 2; panel (c) reports the optimal service rates when n = 10; panel (d) reports the social costs when n = 10.
Mathematics 14 00199 g0a3aMathematics 14 00199 g0a3b

Appendix B.3. Robustness with Respect to d

Figure A4. Robustness analysis with respect to the unit waiting cost d. Panel (a) reports the optimal service rates under different payment schemes when d = 0.1; panel (b) reports the corresponding social costs when d = 0.1; panel (c) reports the optimal service rates when d = 10; panel (d) reports the social costs when d = 10.
Figure A4. Robustness analysis with respect to the unit waiting cost d. Panel (a) reports the optimal service rates under different payment schemes when d = 0.1; panel (b) reports the corresponding social costs when d = 0.1; panel (c) reports the optimal service rates when d = 10; panel (d) reports the social costs when d = 10.
Mathematics 14 00199 g0a4
We also vary other parameters, including C 0 , C u , and s, and find that the qualitative results remain unchanged. This further confirms the robustness of our main conclusions.

References

  1. Irving, G.; Neves, A.L.; Dambha-Miller, H.; Oishi, A.; Tagashira, H.; Verho, A.; Holden, J. International variations in primary care physician consultation time: A systematic review of 67 countries. BMJ Open 2017, 7, e017902. [Google Scholar] [CrossRef]
  2. Du, Y.; Wu, J. Practice and improvement of process optimization in hospital outpatient management. Mod. Hosp. J. 2020, 20, 827–829. (In Chinese) [Google Scholar]
  3. Wang, X.; Wu, Q.; Lai, G.; Scheller-Wolf, A. Offering discretionary healthcare services with medical consumption. Prod. Oper. Manag. 2019, 28, 2291–2304. [Google Scholar] [CrossRef]
  4. Guan, X.; Ni, B.; Zhang, J.; Zhu, D.; Cai, Z.; Meng, W.; Shi, L.; Ross-Degnan, D.S. Association between physicians’ workload and prescribing quality in one tertiary hospital in China. J. Patient Saf. 2021, 17, e1860–e1865. [Google Scholar] [CrossRef] [PubMed]
  5. Zorc, S.; Chick, S.E.; Hasija, S. Outcomes-Based Reimbursement Policies for Chronic Care Pathways. INSEAD Working Paper, 2023. Available online: http://ssrn.com/abstract=2973048 (accessed on 20 October 2025).
  6. Rabin, R. 15-Minute Doctor Visits Take a Toll on Patient-Physician Relationships; PBS NewsHour: Arlington, VA, USA, 2014; Available online: https://www.pbs.org/newshour/health/need-15-minutes-doctors-time (accessed on 20 October 2025).
  7. Blomqvist, A.; Busby, C. Paying Hospital-Based Doctors: Fee for Whose Service? CD Howe Institute Commentary: Toronto, ON, Canada, 2013; p. 392. [Google Scholar]
  8. Calsyn, M.; Lee, E.O. Alternatives to Fee-For-Service Payments in Health Care: Moving from Volume to Value; Center for American Progress: Washington, DC, USA, 2012; Volume 18. [Google Scholar]
  9. Adida, E.; Mamani, H.; Nassiri, S. Bundled payment vs. fee-for-service: Impact of payment scheme on performance. Manag. Sci. 2017, 63, 1606–1624. [Google Scholar] [CrossRef]
  10. Chen, X.; Zeng, Y.; Chen, W. Comparisons of the fee-for-service and diagnosis-related groups payment modes based on patients’ delay-sensitivity. J. Manag. Sci. China 2022, 25, 1–21. (In Chinese) [Google Scholar]
  11. Fan, Y.; Wang, J.; Xie, J.; Yu, L.; Cao, L. The impact of bundled payment on hospital operations. Serv. Sci. 2023, 15, 129–155. [Google Scholar] [CrossRef]
  12. Liao, C. Design of Policy Mechanism for Head Package Payment in Outpatient Chronic and Special Disease Medical Insurance. China Hum. Resour. Soc. Secur. 2021, 11, 59. (In Chinese) [Google Scholar]
  13. Liu, J. Study on the impact of dip payment reform on the economic operation of medical institutions–taking a tertiary hospital in Guangzhou as an example. Health Econ. Res. 2022, 39, 29–33. [Google Scholar]
  14. Guo, P.; Tang, C.S.; Wang, Y.; Zhao, M. The impact of reimbursement policy on social welfare, revisit rate, and waiting time in a public healthcare system: Fee-for-service versus bundled payment. Manuf. Serv. Oper. Manag. 2019, 21, 154–170. [Google Scholar] [CrossRef]
  15. Hassin, R.; Haviv, M. To Queue or not to Queue: Equilibrium Behavior in Queueing Systems; Springer: Boston, MA, USA, 2003. [Google Scholar]
  16. Hassin, R. Rational Queueing; CRC Press: Boca Raton, FL, USA, 2016. [Google Scholar]
  17. Chen, H.; Wan, Y. Capacity competition of make-to-order firms. Oper. Res. Lett. 2005, 33, 187–194. [Google Scholar] [CrossRef]
  18. Allon, G.; Federgruen, A. Service competition with general queueing facilities. Oper. Res. 2008, 56, 827–849. [Google Scholar] [CrossRef]
  19. Lu, S.F.; Serfes, K.; Wedig, G.; Wu, B. Does competition improve service quality? the case of nursing homes where public and private payers coexist. Manag. Sci. 2021, 67, 6493–6512. [Google Scholar] [CrossRef]
  20. Han, Z.; Arikan, M.; Mallik, S. Competition between hospitals under bundled payments and fee-for-service: An equilibrium analysis of insurer’s choice. Manuf. Serv. Oper. Manag. 2022, 24, 1821–1842. [Google Scholar] [CrossRef]
  21. Zhou, W.; Li, X.; Qian, Q. Comparison of gatekeeping and non-gatekeeping designs in a service system with delay-sensitive customers. J. Syst. Sci. Syst. Eng. 2021, 30, 125–150. [Google Scholar] [CrossRef]
  22. Yu, M.; Zhou, W.; Jiang, B. Referral strategies and capacity decisions in a tiered hospital system with gatekeeping designs–exemplified with Chinese healthcare system. Comput. Ind. Eng. 2022, 171, 108447. [Google Scholar] [CrossRef]
  23. Lim, G.; Lim, A.J.; Quinn, B.; Carvalho, B.; Zakowski, M.; Lynde, G.C. Obstetric operating room staffing and operating efficiency using queueing theory. BMC Health Serv. Res. 2023, 23, 1147. [Google Scholar] [CrossRef]
  24. Wang, J.; Wang, Z.; Zhang, Z.G.; Wang, F. Efficiency-quality trade-off in allocating resource to public healthcare systems. Int. J. Prod. Res. 2022, 60, 6469–6490. [Google Scholar] [CrossRef]
  25. Andritsos, D.A.; Tang, C.S. Incentive programs for reducing readmissions when patient care is co-produced. Prod. Oper. Manag. 2018, 27, 999–1020. [Google Scholar] [CrossRef]
  26. Ghamat, S.; Zaric, G.S.; Pun, H. Care-coordination: Gain-sharing agreements in bundled payment models. Prod. Oper. Manag. 2021, 30, 1457–1474. [Google Scholar] [CrossRef]
  27. Li, X.; Teng, J.; Li, X.; Han, Y. The effect of internal salary incentives based on insurance payment on physicians’ behavior: Experimental evidence. BMC Health Serv. Res. 2023, 23, 1410. [Google Scholar] [CrossRef]
  28. Siciliani, L.; Chalkley, M.; Gravelle, H. Policies towards hospital and gp competition in five european countries. Health Policy 2017, 121, 103–110. [Google Scholar] [CrossRef]
  29. Anand, K.S.; Paç, M.F.; Veeraraghavan, S. Quality–speed conundrum: Trade-offs in customer-intensive services. Manag. Sci. 2011, 57, 40–56. [Google Scholar] [CrossRef]
  30. Kostami, V.; Rajagopalan, S. Speed–quality trade-offs in a dynamic model. Manuf. Serv. Oper. Manag. 2014, 16, 104–118. [Google Scholar] [CrossRef]
  31. Li, X.; Guo, P.; Lian, Z. Quality-speed competition in customer-intensive services with boundedly rational customers. Prod. Oper. Manag. 2016, 25, 1885–1901. [Google Scholar] [CrossRef]
  32. Li, X.; Li, Q.; Guo, P.; Lian, Z. On the uniqueness and stability of equilibrium in quality-speed competition with boundedly-rational customers: The case with general reward function and multiple servers. Int. J. Prod. Econ. 2017, 193, 726–736. [Google Scholar] [CrossRef]
  33. Zhang, Y.; Wang, J. Quality-speed trade-offs in customer-intensive services with boundedly rational customers and retrials. Comput. Ind. Eng. 2022, 167, 107983. [Google Scholar] [CrossRef]
  34. Sun, M.; Chai, Q.; Ng, C.T. Managing the quality-speed tradeoff in blockchain-supported healthcare diagnostic services. Omega 2023, 120, 102911. [Google Scholar] [CrossRef]
  35. Street, A.; O’Reilly, J.; Ward, P.; Mason, A. Diagnosis-Related Groups in Europe: Moving Towards Transparency, Efficiency and Quality in Hospitals; Open University Press: Maidenhead, UK, 2011. [Google Scholar]
  36. Zhou, W.; Wang, D.; Huang, W.; Guo, P. To pool or not to pool? the effect of loss aversion on queue configurations. Prod. Oper. Manag. 2021, 30, 4258–4272. [Google Scholar] [CrossRef]
  37. Kim, S.C.; Horowitz, I.; Young, K.K.; Buckley, T.A. Analysis of capacity management of the intensive care unit in a hospital. Eur. J. Oper. Res. 1999, 115, 36–46. [Google Scholar] [CrossRef]
  38. Surowiecki, J. What Ails Us. The New Yorker. 2003. Available online: https://www.newyorker.com/magazine/2003/07/07/what-ails-us (accessed on 20 October 2025).
  39. Hua, Z.; Chen, W.; Zhang, Z.G. Competition and coordination in two-tier public service systems under government fiscal policy. Prod. Oper. Manag. 2016, 25, 1430–1448. [Google Scholar] [CrossRef]
  40. Edelson, N.M.; Hilderbrand, D.K. Congestion tolls for poisson queuing processes. Econom. J. Econom. Soc. 1975, 43, 81–92. [Google Scholar] [CrossRef]
  41. Wu, C.; Bassamboo, A.; Perry, O. Service system with dependent service and patience times. Manag. Sci. 2019, 65, 1151–1172. [Google Scholar] [CrossRef]
  42. Naor, P. The regulation of queue size by levying tolls. Econom. J. Econom. Soc. 1969, 37, 15–24. [Google Scholar] [CrossRef]
  43. Yang, L.; Guo, P.; Wang, Y. Service pricing with loss-averse customers. Oper. Res. 2018, 66, 761–777. [Google Scholar] [CrossRef]
  44. Qian, Q.; Guo, P.; Lindsey, R. Comparison of subsidy schemes for reducing waiting times in healthcare systems. Prod. Oper. Manag. 2017, 26, 2033–2049. [Google Scholar] [CrossRef]
  45. Gawande, A. The Cost Conundrum. The New Yorker. 2009. Available online: https://www.newyorker.com/magazine/2009/06/01/the-cost-conundrum (accessed on 20 October 2025).
  46. Gawande, A. Overkill: An Avalanche of Unnecessary Medical Care is Harming Patients Physically and Financially. What Can We Do About it? The New Yorker. 2015. Available online: https://www.plc-network.com/wp-content/uploads/2025/05/Americas-Epidemic-of-Unnecessary-Care-_-The-New-Yorker.pdf (accessed on 20 October 2025).
  47. Allon, G.; Federgruen, A. Competition in service industries. Oper. Res. 2007, 55, 37–55. [Google Scholar] [CrossRef]
  48. Babashov, V.; Sauré, A.; Ozturk, O.; Patrick, J. Setting wait time targets in a multi-priority patient setting. Prod. Oper. Manag. 2023, 32, 1958–1974. [Google Scholar] [CrossRef]
  49. Allon, G.; Federgruen, A. Competition in service industries with segmented markets. Manag. Sci. 2009, 55, 619–634. [Google Scholar] [CrossRef]
Figure 1. A schematic of the model.
Figure 1. A schematic of the model.
Mathematics 14 00199 g001
Figure 2. Game sequence of decisions: payer, hospitals, and patients.
Figure 2. Game sequence of decisions: payer, hospitals, and patients.
Mathematics 14 00199 g002
Figure 3. Equilibrium arrival rates.
Figure 3. Equilibrium arrival rates.
Mathematics 14 00199 g003
Figure 4. Performance comparison under different payment schemes (first-best case, BP, and BPW) ( Λ = 1 , C 0 = 2 , C u = 0.5 , n = 5 , d = 1 , μ ¯ = 150 , w ¯ = 150 , s = 3 ). Panel (a) reports the optimal service rates under different payment schemes; panel (b) reports the corresponding social costs; panel (c) reports the expected waiting costs; panel (d) reports the medical costs.
Figure 4. Performance comparison under different payment schemes (first-best case, BP, and BPW) ( Λ = 1 , C 0 = 2 , C u = 0.5 , n = 5 , d = 1 , μ ¯ = 150 , w ¯ = 150 , s = 3 ). Panel (a) reports the optimal service rates under different payment schemes; panel (b) reports the corresponding social costs; panel (c) reports the expected waiting costs; panel (d) reports the medical costs.
Mathematics 14 00199 g004
Figure 5. Payment schemes for the first-best outcome.
Figure 5. Payment schemes for the first-best outcome.
Mathematics 14 00199 g005
Table 1. Comparison of this study’s main features and previous related literature. The ✔ symbol indicates "features that were present in the study”.
Table 1. Comparison of this study’s main features and previous related literature. The ✔ symbol indicates "features that were present in the study”.
ResearchQueueing ModelProvider CompetitionHealthcare Payment SchemesQuality–Speed Tradeoff
/Cost–Speed Tradeoff
Chen and Wan [17]--
Allon and Federgruen [18]--
Lu et al. [19] -
Han et al. [20] -
Yu et al. [22]---
Lim et al. [23]---
Wang et al. [24]---
Guo et al. [14]--
Adida et al. [9] --
Andritsos and Tang [25] --
Ghamat et al. [26] --
Fan et al. [11] --
Li et al. [27] --
Anand et al. [29]--
Kostami and Rajagopalan [30] --
Li et al. [31]--
Li et al. [32]-
Zhang and Wang [33]--
Sun et al. [34]--
Wang et al. [3]-
This study
Table 2. Notations.
Table 2. Notations.
SymbolDescription
n Number of hospitals in the public outpatient system
H i Hospital H i , i = 1 , . . . , n
Λ Total potential arrival rate
λ i The initial arrival rate of patients who join hospital H i
μ i Service rate of hospital H i
µ ¯ The upper bound of the service rate
d Unit delay cost/unit waiting cost
W i Expected waiting time of patients at hospital H i
w ¯ An upper limit of the expected waiting time
v Service value that a patient receives after being served
C 0 The unit benchmark medical cost
C u The constant unit medical cost
p Service price for the whole episode of care
B Medical budget level
SCSocial cost of the whole healthcare system (waiting cost + medical cost)
w 0 Waiting time guarantee (service level requirement) under the BP with service level regulation (BPW) scheme
α A profit margin of α ∈ [0, 1] on the total medical cost under the FFS scheme
Table 3. The impact of parameters n , Λ , C u on the equilibrium p ~ B P , μ ~ i B P .
Table 3. The impact of parameters n , Λ , C u on the equilibrium p ~ B P , μ ~ i B P .
( 1 )   B [ Λ p 1 , B 0 ( 2 )   B [ B 0 , B 1 ( 3 )   B [ B 1 , +
n Λ C u n Λ C u n Λ C u
p ~ B P
μ ~ i B P
The symbol “–” indicates no change, “ ” indicates a monotonic increase, and “ ” indicates a monotonic decrease.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, X.; Fu, M. The Effect of Bundled Payment Schemes on Cost–Speed Tradeoff for Outpatient Service: A Queueing-Game Analysis. Mathematics 2026, 14, 199. https://doi.org/10.3390/math14010199

AMA Style

Li X, Fu M. The Effect of Bundled Payment Schemes on Cost–Speed Tradeoff for Outpatient Service: A Queueing-Game Analysis. Mathematics. 2026; 14(1):199. https://doi.org/10.3390/math14010199

Chicago/Turabian Style

Li, Xiuzhang, and Minghui Fu. 2026. "The Effect of Bundled Payment Schemes on Cost–Speed Tradeoff for Outpatient Service: A Queueing-Game Analysis" Mathematics 14, no. 1: 199. https://doi.org/10.3390/math14010199

APA Style

Li, X., & Fu, M. (2026). The Effect of Bundled Payment Schemes on Cost–Speed Tradeoff for Outpatient Service: A Queueing-Game Analysis. Mathematics, 14(1), 199. https://doi.org/10.3390/math14010199

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop