Article

A Novel Method for Assessing Risk-Adjusted Diagnostic Coding Specificity for Depression Using a U.S. Cohort of over One Million Patients

1 School of Data Science, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
2 Department of Public Health Sciences, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
3 ITS Data Science, Premier, Inc., Charlotte, NC 28277, USA
4 School of Public Health, Faculty of Medicine, Imperial College London, London W6 8RP, UK
* Author to whom correspondence should be addressed.
Diagnostics 2024, 14(4), 426; https://doi.org/10.3390/diagnostics14040426
Submission received: 1 January 2024 / Revised: 7 February 2024 / Accepted: 8 February 2024 / Published: 15 February 2024
(This article belongs to the Special Issue New Advances in the Diagnosis and Treatment of Mental Disorders)

Abstract

Depression is a prevalent and debilitating mental health condition that poses significant challenges for healthcare providers, researchers, and policymakers. The diagnostic coding specificity of depression is crucial for improving patient care, resource allocation, and health outcomes. We propose a novel approach to assess risk-adjusted coding specificity for individuals diagnosed with depression using a vast cohort of over one million inpatient hospitalizations in the United States. Considering various clinical, demographic, and socioeconomic characteristics, we develop a risk-adjusted model that assesses diagnostic coding specificity. Results demonstrate that risk-adjustment is necessary and useful to explain variability in the coding specificity of principal (AUC = 0.76) and secondary (AUC = 0.69) diagnoses. Our approach combines a multivariate logistic regression at the patient hospitalization level to extract risk-adjusted probabilities of specificity with a Poisson Binomial approach at the facility level. This method can be used to identify healthcare facilities that over- and under-specify diagnostic coding when compared to peer-defined standards of practice.

1. Introduction

The International Classification of Diseases (ICD) is a continuously updated medical coding system used to catalog health conditions, grouping similar diseases into categories that contain more specific conditions [1]. The World Health Organization has been responsible for ICD since 1992, providing a standardized method of recording and tracking diseases worldwide [1]. ICD-10 (10th revision) coding affects healthcare delivery, payments and reimbursements, and disease surveillance. ICD-10-CM, the clinical modification (CM) developed and maintained by the Centers for Disease Control and Prevention (CDC) and introduced shortly after, provided a 400% increase in diagnosis codes and an 18-fold increase in procedure codes [2]. This enhancement aimed to add granularity and specificity to clinical records for diagnoses and procedures, though some clinical specialties have been more directly affected than others [1,2,3]. However, a larger catalog of ICD-10 codes does not directly imply widespread use of the more granular coding options [4].
Medical coding is expected to be accurate, complete, and specific to the finest degree possible. Rigorous coding ultimately benefits patients, healthcare providers, and payors [5]. Coding errors can occur when the physician documentation is insufficient [6] or the coding staff is improperly trained. One essential aspect of the coding process is the concept of coding specificity, which can be regarded as medical coding to the greatest level of precision supported by a clinical diagnosis code [5]. Unspecified codes should be the last resort when a more specific diagnosis is not viable. The United States (U.S.) Centers for Medicare and Medicaid Services’ (CMS) guidelines indicate that “When sufficient clinical information is not known or available about a particular health condition to assign a more specific code, it is acceptable to report the appropriate unspecified code” [7]. While increasing coding specificity rates has been recommended when appropriate [5], the focus on rates alone can be problematic. Rates can be sensitive to hospital volume, patient mix, and other factors, which may result in varying degrees of specificity, whether clinically supported or not. Additionally, coding to the highest degree of specificity is not always clinically justified, such as in instances where a specified secondary diagnosis may not be needed for the provision of treatment for the principal diagnosis during an inpatient or emergency stay or when resources are not available to provide that additional level of specificity [8,9,10].
In the inpatient setting, coding specificity is the responsibility of the provider and coder, who must work together to record a detailed clinical description of the diagnosis or procedure [5,11]. Accurate levels of diagnostic coding specificity, when possible, help align reimbursements with healthcare costs and provide patients with accurate medical records to more effectively guide treatment plans. Detailed documentation and coding of diagnoses influence reimbursement but also represent an additional cost for healthcare providers and payors. Higher degrees of coding specificity, especially upon introducing ICD-10-CM, require additional levels of expertise among coders [5], which may be a larger burden in facilities providing more general services or those with limited resources or personnel. Practices may also suffer financial loss if coding specificity is insufficient or inappropriate [5]. In some cases, payors may determine that codes lacking specificity are used improperly (or overused), potentially leading to a denied claim [5,7]. Conversely, over-specificity, when not clinically warranted, is problematic, as such coding may overstate patient care needs, unduly inflate reimbursement through Diagnosis Related Group (DRG) creep, and exaggerate a patient’s clinical risk.
Despite the importance of accurate coding for patient conditions, enforcing and maintaining a high, yet appropriate, level of coding specificity has remained an issue since the inception of the ICD-10 system [5,11]. While studies have been conducted to identify sources of coding errors throughout a patient encounter or episode, there is minimal literature examining how best to quantify coding specificity as an independent metric or how to predict or identify where unspecified codes are (or have the potential to be) most misaligned with the patient’s true diagnosis, that is, when the diagnosis may be accurate but the level of coded specificity is not appropriate for the clinical diagnosis [11,12]. Even in studies where coding specificity is observed or utilized as an analytical component, the methods used to develop the specificity metric are often vague, overlooked entirely, and/or narrowly defined in a disease-specific form, resulting in additional complexity to generalize across conditions [13,14,15]. The lack of standardized methods for measuring, quantifying, and analyzing coding specificity represents a significant gap in knowledge for the healthcare community.
While similar methodologies have been developed in the literature for measuring coding intensity [16,17], we aim to create a metric to measure and risk-adjust coding specificity that allows for comparative analysis of facilities, thus identifying where coding specificity may need improvement against healthcare industry standards or aspirational peers. A metric with potential for widespread implementation should not require clinical inputs regarding the appropriateness of specificity, since such inputs may not be agreed upon across physicians, may change substantially over time, may be costly to obtain and maintain, and are less generalizable across health conditions. Such a metric should also be relatively easy to implement without major costs and with readily available patient and facility data, such as administrative claims data, while remaining sufficiently flexible to account for other information when available.
Depression, affecting 18.5% of the U.S. adult population in 2020 [18], has been identified as one of the conditions that is commonly reported with unspecified diagnosis codes [5]. Three of the most common codes produced by the ICD-10 criteria for depression include: major depressive disorder (F32); dysthymic disorder (F34.1); and unspecified depression (F32.A) [5,19,20]. ICD-10 codes related to depression are also grouped within the DRG list of depressive neuroses (DRG 881) [20]. Recommendations for an initial diagnosis using ICD-10 codes require identifying five symptoms of depression lasting two weeks or more and must include depressed mood or loss of interest [19,21]. However, depression should only be considered after accounting for the absence of medical conditions that can mimic symptoms of depression (e.g., thyroid problems or brain tumors) and after ruling out bereavement or sadness caused by life-altering events [21].
The degree to which coding specificity varies across providers for depression patients remains unclear. Facilities do not have a standard against which they can measure their levels of coding specificity of depression diagnoses during inpatient hospitalizations, particularly because of potential case mix differences across facilities. This calls for a method that risk-adjusts for such differences and provides an objective and standardized metric against which each facility can measure variation in coding specificity. This study aims to demonstrate a novel approach for measuring the risk-adjusted probability of coding specificity controlling for patient and facility characteristics, both across principal and secondary diagnoses of depression, while building an aggregated metric that can be used at coarser levels. Such an approach can be used by quality control personnel to enhance standards of practice around coding specificity, not only for individuals diagnosed with depression but also across a wide spectrum of health conditions.

2. Materials and Methods

Data were obtained from the Premier Healthcare Database (PHD), a national, hospital-based, service-level, and private all-payor database that contains information on inpatient discharges [22]. The analysis comprises N = 1,071,575 observations of acute inpatient hospitalizations of first-patient stays with discharge dates in 2022 with an identified principal or secondary diagnosis of depression. Specificity for a depression principal diagnosis was identified, and, when multiple depression secondary diagnoses occurred, specificity for the secondary diagnosis was defined as specificity for at least one of these depression secondary diagnoses. The ICD-10 codes defining the patient cohort consisted of the F32 (depressive episode) and F33 (major depressive disorder, recurrent) codes.
The data consist of the following information, in addition to masked patient and facility identifiers: (1) binary response variables representing specificity of principal and secondary diagnoses of depression; (2) patient characteristics, which include age, sex, race, length of stay (log-transformed due to its large right-skewness), primary payor, point of origin, discharge status, count of procedures performed during the inpatient stay, CMS fiscal year indicator, five county-level Agency for Toxic Substances and Disease Registry (ATSDR) social vulnerability indices (SVIs) [23], COVID-19 indicator, and Medicare Severity (MS)-DRG type; and (3) facility characteristics including teaching status, academic status, urban/rural status, ownership status, bed count, hospital-level case mix index (CMI), and state. The primary payor variable refers to the insurance provider that assumes the primary responsibility for covering the costs of a healthcare claim. For example, “Medicare traditional” indicates that the patient is covered under Medicare, the U.S. government’s insurance plan for patients aged 65 or older, while “Medicaid traditional” refers to the U.S. government’s insurance plan for low-income patients and their families.
Descriptive statistics were calculated for all aforementioned variables, including means/counts and standard deviations/percentages. Categories with low counts and similar meanings (e.g., charity and indigent primary payor types) or adjacent ordered categories (e.g., ages 0–9) with low counts were grouped together.
Univariate and multivariate logistic regression analyses were used to extract associations between patient-level and facility-level variables and the coding specificity of depression principal and secondary diagnoses. Odds ratios (ORs), 95% confidence intervals (CIs), and p-values were calculated and tabulated for all four analyses. The receiver operating characteristic (ROC) curve was constructed, and the corresponding area under the curve (AUC) was computed for both the depression principal and secondary diagnosis specificity multivariate logistic regression models.
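As an illustration of the quantities reported from these regressions, the following minimal Python sketch converts a logistic regression coefficient into an odds ratio with a 95% confidence interval and computes an AUC from predicted probabilities. It assumes Wald-type intervals and a rank-based AUC, neither of which is stated in the text, and the function names (odds_ratio_with_ci, auc) and all numeric inputs are hypothetical rather than taken from the study.

```python
import math


def odds_ratio_with_ci(beta, se, z=1.959964):
    """Turn a logistic regression coefficient and its standard error into
    an odds ratio with a Wald 95% confidence interval (assumed here)."""
    return math.exp(beta), math.exp(beta - z * se), math.exp(beta + z * se)


def auc(y_true, y_score):
    """Area under the ROC curve via the rank-sum identity: the share of
    (positive, negative) pairs in which the positive has the higher score,
    counting ties as one half. Quadratic time, intended for small examples."""
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = 0.0
    for sp in pos:
        for sn in neg:
            if sp > sn:
                wins += 1.0
            elif sp == sn:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical coefficient and standard error (not values from the study).
or_value, ci_low, ci_high = odds_ratio_with_ci(beta=-0.274, se=0.040)
print(f"OR = {or_value:.2f} (95% CI {ci_low:.2f} to {ci_high:.2f})")

# Hypothetical outcomes (1 = specified code) and fitted probabilities.
y = [1, 1, 0, 1, 0, 0, 1, 0]
p_hat = [0.90, 0.70, 0.60, 0.65, 0.30, 0.20, 0.40, 0.50]
print(f"AUC = {auc(y, p_hat):.3f}")
```

In practice a statistical package would supply the coefficients, standard errors, and ROC curve directly; the sketch only makes the arithmetic behind the tabulated ORs, CIs, and AUC values explicit.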
The unknown probability ($\pi_{p,f}$) of coding specificity of the principal, or secondary, diagnosis for patient hospitalization $p$ in facility $f$ ($S_{p,f}$) was modeled with a multiple logistic regression including covariates ($X_{p,f}$) that represent both patient and facility characteristics using the equation below:
$$\operatorname{logit}\left(\pi_{p,f}\left[S_{p,f} \mid X_{p,f}\right]\right) = \alpha + \beta^{T} X_{p,f},$$
where $\alpha$ is the intercept and the vector $\beta^{T}$ contains the corresponding regression coefficients for $X_{p,f}$. Each patient hospitalization’s coding specificity within a facility $f$ was assumed to be independently distributed with an unequal, unknown probability $\pi_{p,f}$. Since the coding specificity events were not identically distributed, the total count for each facility $f$ (i.e., $\sum_{p \in f} S_{p,f}$) was assumed to follow a Poisson Binomial (PB) distribution with a probability vector $\hat{\pi}_{p \in f}$, composed of the probabilities for each patient hospitalization $p$ within facility $f$ (i.e., $p \in f$). Upon extracting the estimated probabilities $\hat{\pi}_{p,f}$ for each patient hospitalization $p$ and facility $f$, these were used to assess whether each facility’s total count of specified diagnoses was under-, in line with, or over-specified compared with their healthcare industry peers via a user-defined probability threshold $t$.
Without loss of generality, we applied a common threshold $t = 0.025$ to identify facilities operating outside peer standards’ confidence bounds (2.5th and 97.5th percentiles), denoted by $Q_L$ and $Q_U$, representing under- and over-specificity, respectively, for facility $f$:
$$P\left(\sum_{p \in f} S_{p,f} \sim PB\left[\{\hat{\pi}_{p \in f}\}\right] < Q_L\right) = t$$
$$P\left(\sum_{p \in f} S_{p,f} \sim PB\left[\{\hat{\pi}_{p \in f}\}\right] > Q_U\right) = t$$
Visualizations of the facility-specific metrics were produced to demonstrate under-specifying (p < 0.025) and over-specifying (p > 0.975) facilities using the cumulative distribution function of the facility-specific Poisson Binomial distribution and the observed specificity count across patient hospitalizations for that facility. Geospatial U.S. maps of adjusted odds ratios of coding specificity by state were also produced across both outcomes, with New York selected as the reference state based on its largest healthcare expenditure (per capita) in the U.S. [24].
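The facility-level step described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names (poisson_binomial_pmf, classify_facility) are hypothetical, the probabilities are made up rather than fitted, the distribution is computed with a simple dynamic-programming recursion, and the handling of the discrete boundaries (strict inequalities on the cumulative probability) is an assumption.

```python
def poisson_binomial_pmf(probs):
    """PMF of the number of successes among independent Bernoulli trials
    with unequal success probabilities (dynamic programming, O(n^2))."""
    pmf = [1.0]  # with zero trials, P(count = 0) = 1
    for p in probs:
        nxt = [0.0] * (len(pmf) + 1)
        for k, mass in enumerate(pmf):
            nxt[k] += mass * (1.0 - p)  # this hospitalization is unspecified
            nxt[k + 1] += mass * p      # this hospitalization is specified
        pmf = nxt
    return pmf


def classify_facility(observed, probs, t=0.025):
    """Compare a facility's observed count of specified diagnoses with the
    Poisson Binomial distribution implied by its risk-adjusted probabilities."""
    pmf = poisson_binomial_pmf(probs)
    cdf = sum(pmf[: observed + 1])  # P(count <= observed)
    if cdf < t:
        return "under-specifying"
    if cdf > 1.0 - t:
        return "over-specifying"
    return "in line with peers"


# Hypothetical facility with 20 hospitalizations and made-up probabilities
# of a specified code (stand-ins for the fitted logistic probabilities).
probs = [0.9] * 10 + [0.5] * 10  # expected count = 14

for observed in (8, 14, 19):
    print(observed, classify_facility(observed, probs))
```

Because each hospitalization carries its own probability, two facilities with the same raw specificity rate can be classified differently once their patient mix is taken into account.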

3. Results

Table 1 provides descriptive statistics for all variables across N = 1,071,575 unique inpatient hospital admissions where depression was recorded as the principal or secondary diagnosis. Of these hospitalizations, 16,437 had depression as a principal diagnosis. Of the principal diagnoses, 4736 (28.8%) were coded as unspecified.
The most common age group was 65 to 69 years (12%), and most patients were female (65%) and identified as White (80%). The median length of stay was 4 days, the average number of procedures per hospitalization was 2.9 (SD 2.7), and most hospitalizations occurred in the CMS 2022 fiscal year (75%). Traditional Medicare was the most common primary payor (29%), the most common point of origin was a non-healthcare facility (80%), and most patients were discharged to home or self-care (53%). Average scores for patients’ SVI values were 0.53 (SD 0.26) for socioeconomic status, 0.51 (SD 0.25) for household characteristics, 0.66 (SD 0.24) for racial and ethnic minority status, 0.60 (SD 0.25) for housing type and transportation, and 0.58 (SD 0.25) for overall vulnerability. Seven percent of patients experienced COVID-19 during their hospitalization, and 74% of patients had a medical MS-DRG type.
Most of the hospitals were non-teaching (73%), non-academic (83%), located in an urban setting (88%), and voluntary non-profit private (65%), and the most common bed-size category was more than 400 beds (43%). The average patient case mix index was 1.7 (SD 0.29). Data were collected from facilities in all fifty states; the five states with the largest numbers of observed hospitalizations were Florida (9%), New York (7%), Texas (6%), North Carolina (6%), and Ohio (5%).
Table 2 contains the univariate and multivariate logistic regression results, including odds ratio estimates, 95% confidence intervals, and p-values for the specificity of a principal depression diagnosis. Patient characteristics such as age, primary payor, and SVI had a significant association with depression principal diagnosis coding specificity across multiple categories based on the multivariate logistic regression model. The odds of depression principal diagnosis coding specificity were at least 46% higher among patients aged less than 80 years, with the exception of those less than 10 years old, when compared to patients 85+ years old (OR ≥ 1.459; p ≤ 0.041). Males experienced 24% lower odds of depression principal diagnosis specificity compared to females (OR = 0.76; p < 0.001). No significant differences in odds of specificity were found by race upon accounting for all other factors. However, every additional unit in the Racial and Ethnic Minority Status SVI was associated with approximately 49% lower odds of specificity (OR = 0.506; p = 0.003). Length of stay (log-transformed) was also positively associated with higher odds of principal diagnosis specificity (OR = 1.82; p < 0.001). Differences were found across some categories of primary payor, point of origin, and discharge status. However, there was no significant association between COVID-19 status, CMS fiscal year period, count of procedures, or other SVI measures with depression principal diagnosis specificity. Patients grouped with a surgical MS-DRG type experienced substantially lower odds of depression-related principal diagnosis specificity (OR = 0.288; p < 0.001) when compared to those with a medical MS-DRG.
Patients attending rural facilities did not experience statistically different odds of specificity of a depression principal diagnosis compared to those attending urban facilities (OR = 1.009; p = 0.917). Patients attending teaching facilities experienced lower odds of depression principal diagnosis specificity (OR = 0.680; p < 0.001), whereas those attending facilities with an academic status experienced higher odds of depression principal diagnosis specificity (OR = 1.465; p = 0.001). All significant ownership categories were associated with lower odds of depression principal diagnosis specificity compared to the reference category (voluntary nonprofit private). No clear pattern emerged by bed size, and the case mix index was found to be non-significantly associated with principal diagnosis coding specificity. However, substantial differences were detected by state when compared to New York as the reference state. For example, states like California experienced much higher odds of specificity of a depression principal diagnosis (OR = 1.995; p < 0.001), while others like New Jersey experienced substantially lower odds of principal diagnosis specificity (OR = 0.247; p < 0.001).
Table 3 contains the univariate and multivariate logistic regression results, including odds ratio estimates, 95% CIs, and p-values, for the specificity of depression-related secondary diagnoses. The multivariate analysis demonstrates that individuals of all age groups experienced significantly higher odds of specificity of depression secondary diagnoses compared to those 85 and older (OR ≥ 1.116; p ≤ 0.036). Black individuals experienced 12.5% lower odds of secondary diagnosis specificity than White patients (OR = 0.875; p < 0.001). Males experienced approximately 5% higher odds of secondary diagnosis specificity than females (OR = 1.054; p < 0.001). Length of stay (log-transformed) was also positively associated with higher odds of depression secondary diagnosis specificity (OR = 1.237; p < 0.001). Primary payor type, point of origin, and discharge status all contained categories with statistically significant associations with the outcome. Patients who experienced larger numbers of procedures also experienced higher odds of secondary diagnosis specificity (OR = 1.007; p < 0.001). Those discharged in the 2023 CMS fiscal year experienced 4.2% increased odds of depression secondary diagnosis specificity (OR = 1.042; p < 0.001). All SVI indices were also significant, as was COVID-19 status, with COVID-19-positive patients experiencing approximately 7% lower odds of depression secondary diagnosis specificity (OR = 0.929; p < 0.001). Patients with a surgical MS-DRG also experienced 14.5% lower odds of depression secondary diagnosis specificity compared to those with a medical MS-DRG type (OR = 0.855; p < 0.001).
Patients admitted to teaching facilities experienced significantly higher odds of depression secondary diagnosis specificity (OR = 1.177; p < 0.001), while those attending academic facilities experienced lower odds of secondary diagnosis specificity (OR = 0.790; p < 0.001). Patients attending rural facilities experienced higher odds of secondary diagnosis specificity than those attending urban facilities (OR = 1.409; p < 0.001). Some differences were found by ownership status, and patients attending facilities with lower bed counts had lower odds of depression secondary diagnosis specificity compared to those with over 400 beds (OR ≤ 0.937; p < 0.001). Patients attending hospitals with larger case mix index values experienced lower odds of secondary diagnosis specificity (OR = 0.873; p < 0.001). Finally, substantial state-based differences were detected, with most states experiencing higher odds of depression secondary diagnosis specificity than New York. For example, individuals in Minnesota experienced substantially larger odds of depression secondary diagnosis specificity compared to New York (OR = 11.255; p < 0.001).
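The percentage statements in the regression results follow from the odds ratios by simple arithmetic: the percent change in odds is (OR − 1) × 100. The short Python sketch below applies this to a few odds ratios quoted in the text; the helper name percent_change_in_odds is hypothetical and introduced only for illustration.

```python
def percent_change_in_odds(odds_ratio):
    """Percent change in odds relative to the reference category:
    negative values mean lower odds, positive values mean higher odds."""
    return (odds_ratio - 1.0) * 100.0


# Odds ratios quoted in the Results section.
examples = {
    "Male vs. female, principal diagnosis": 0.76,
    "Racial and Ethnic Minority Status SVI, principal diagnosis": 0.506,
    "Black vs. White, secondary diagnosis": 0.875,
    "2023 CMS fiscal year, secondary diagnosis": 1.042,
}

for label, odds_ratio in examples.items():
    print(f"{label}: {percent_change_in_odds(odds_ratio):+.1f}% change in odds")
```

Note that these are changes in odds, not in the probability of a specified code; the two only approximately coincide when the outcome is rare.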
Figure 1 contains the ROC curves resulting from the multivariate logistic regression analyses of the coding specificity of the principal (a) and secondary (b) diagnoses of depression. The corresponding AUC values were 0.7555 and 0.6874, respectively, indicating somewhat better discrimination for the model assessing the specificity of a depression principal diagnosis.
Figure 2 contains a visual representation of the use of the Poisson Binomial metric for identification of facilities’ specificity of depression principal (a) and secondary (b) diagnoses against healthcare industry peers upon adjusting for patient and facility characteristics. A sample of 20 facilities is portrayed in each plot, with colors denoting coding specificity performance versus peers. Observed counts below the 95% CIs identify facilities that under-specify depression diagnoses compared with their peers (blue), while observed counts above the 95% CIs identify facilities that over-specify depression diagnoses versus peers (orange). Finally, those depicted in black represent facilities that specify depression diagnoses in line with their healthcare industry peers.
Finally, Figure 3 contains U.S. maps representing adjusted odds ratios for the two outcomes. States portrayed in grayscale represent those in which patients have similar odds of coding specificity of depression diagnoses compared with the reference state (New York). States where the odds of diagnosis specificity are below those of New York are represented in blue, while the other color scales represent different degrees of state-level over-specificity of depression diagnoses (see Figure 3 legend). Both maps indicate that New York generally under-specifies both principal and secondary depression diagnoses when compared to most states.

4. Discussion

We propose a two-step approach for modeling coding specificity at the facility level. First, a multivariate logistic regression model is proposed to measure, at the patient hospitalization level, the association between the coding specificity of principal and secondary diagnoses of depression and a set of patient- and facility-level characteristics. In a second step, a Poisson Binomial approach builds upon the risk-adjusted logistic-derived patient-level specificity probabilities to estimate the anticipated 95% confidence interval for coding specificity counts per facility across patient hospitalizations if facilities were to operate in line with healthcare industry standards. Over- and under-specifying facilities are then identified upon comparing their observed coding specificity counts across patient hospitalizations and the aforementioned 95% confidence intervals. We then visualize the facility-specific metrics to demonstrate under- and over-specifying facilities. While outside the scope of this manuscript, facilities can also be ranked by risk-adjusted specificity since the p-value-based metric already adjusts for both size (i.e., counts) and strength of evidence.
Patient characteristics were associated with the coding specificity of both the principal and secondary diagnoses. Higher odds of specificity for both principal and secondary diagnoses were generally associated with lower ages compared with those 85+ years old. This may be related to a larger complexity in diagnosis or the presence of more comorbidities. However, it could also be related to a lower quality of coding and/or care provided to older populations [25]. Race was not associated with differences in odds of specificity, with the exception of the secondary diagnosis, where Black patients experienced substantially lower odds of coding specificity, which may relate to differences in coding practices by practitioners and/or differences in information-seeking behaviors by patients [26]. This could reflect findings in prior research showing that disparities in the treatment of depression by race/ethnicity among older adults may still be present [27]. Males experienced substantially lower odds of principal diagnosis specificity but higher odds of secondary diagnosis specificity for depression compared to females. It is unclear whether this is confounded by other factors, such as age, due to differentials in life expectancy and sex-related imbalances in the age-sex pyramid, particularly in the U.S. [28].
Patients with longer stays experienced higher levels of specificity in both principal and secondary diagnoses. This could be due to the additional time and resources employed during the inpatient stay or to the complexity of their cases; clinicians may also spend less time documenting patients with shorter stays. Some differences were observed by primary payor. However, the patient mix by payor could also be heterogeneous. For example, those with employer contracts as the primary payor may be experiencing higher odds of principal and secondary diagnosis coding specificity because they are a younger population than those receiving healthcare through Medicare, which is the reference category, though it could also relate to requirements related to workers’ compensation. Some social vulnerability indices were also related to differing degrees of coding specificity. However, the information content in these variables likely overlaps with other variables such as age and race/ethnicity. Patients grouped with a surgical MS-DRG experienced lower odds of principal and secondary diagnosis specificity when compared to those with a medical MS-DRG. One possible explanation for this discrepancy is that surgical patients may receive a principal diagnosis that is primarily focused on their surgical condition, which can overshadow or lead to a less detailed assessment and diagnosis of mental health conditions such as depression. For surgical patients, who undergo a range of medical tests and evaluations specific to their surgical procedures, mental health concerns may be addressed and documented as the principal diagnosis to a more limited extent during the inpatient hospitalization. Additionally, the physicians performing surgical procedures, who may be responsible for the patient during the inpatient stay, may not be the same physicians identifying and/or treating any underlying depression diagnosis. Multiple procedures may require increased attention and precision, leading to more detailed physician consultations and billing practices that may impact coding specificity.
Facility-level characteristics were also associated with the specificity of both principal and secondary diagnoses. However, differences by diagnosis type were found. For example, patients who attended teaching facilities experienced lower odds of principal diagnosis specificity yet higher odds of secondary diagnosis specificity, whereas the reverse was seen for academic status. This could relate to high levels of collinearity affecting some of the facility-level variables, so cautious interpretation is advisable. Facilities’ case mix index was significantly associated with lower odds of specificity for secondary diagnoses, although its association with principal diagnosis specificity was not significant. This suggests that hospitals dealing with more complex cases tend to under-specify depression when it is recorded as a secondary diagnosis. This could relate to the severity of cases and the potential need to allocate resources unevenly across health conditions. Substantial differences were observed by state, with the odds of coding specificity higher across multiple states when compared to New York. This again could reflect differences in patient composition or complexity by state, but also the variability in spending per capita, price levels, overall healthcare affordability, and differences in uptake of Medicaid by state [29].
While these associations are of some interest, the ultimate purpose of this study is not to explore associations between patient and facility characteristics and coding specificity outcomes but to leverage them to build a risk-adjusted estimate of the probability of coding specificity that can be used to evaluate facilities’ standards of practice. The purpose of the multivariate logistic regression is to capture the probability of specificity, and the combination of patient and facility characteristics led to high levels of explanatory power, even though a large number of clinical factors were not included in this study. The AUC was 0.76 and 0.69 for the principal and secondary diagnosis specificity models, respectively. It would be reasonable to expect that principal diagnoses are specified at a higher level, since secondary diagnoses could be entirely unrelated to the primary reason for the inpatient hospitalization, and an accurate diagnosis may not be needed to treat the patient’s condition. However, good levels of explanatory power were also found among patient and facility characteristics for the secondary diagnosis model, which covers the larger of the two samples. This explanatory power was achieved with relatively little clinical information about the patient. Additional variables describing the clinical characteristics of the patient hospitalization are likely to increase the AUC further.
The AUC values across both types of diagnoses highlight that risk-adjustment of specificity outcomes is important when evaluating hospital coding specificity performance. Otherwise, facilities could be unfairly compared and evaluated. For example, a hospital treating a large population of younger patients may demonstrate high levels of overall coding specificity while actually providing low levels of risk-adjusted specificity. Risk-adjustment allows practitioners to adjust for industry-level differences, while it also allows policymakers to explore whether such differences are warranted or demonstrate disparities or inappropriate standards of practice at the industry level that need to be addressed.
Upon risk-adjusting for patient and facility characteristics, we demonstrate that substantial differences in coding specificity by facility still remain. These differences are more likely to be due to idiosyncrasies and facility-specific processes and practices. We demonstrate these differences in risk-adjusted specificity with a sample of facilities. Our proposed metric can help identify facilities that, upon adjusting for common factors that affect variability in coding specificity, still perform substantially away from common healthcare practice.
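The facility-level step can be sketched as follows. Given the risk-adjusted probabilities of specificity for a facility's hospitalizations, the number of specified diagnoses follows a Poisson Binomial distribution if hospitalizations are treated as independent. The sketch below computes that distribution by dynamic programming and flags a facility whose observed count falls in either 2.5% tail. The function names, the use of two tail probabilities, and the toy inputs are assumptions made for illustration; they are not the authors' implementation.

```python
def poisson_binomial_pmf(probs):
    """PMF of the number of successes among independent Bernoulli trials
    with (possibly different) success probabilities `probs`."""
    pmf = [1.0]  # with zero trials, the count is 0 with probability 1
    for p in probs:
        nxt = [0.0] * (len(pmf) + 1)
        for k, mass in enumerate(pmf):
            nxt[k] += mass * (1.0 - p)   # this trial is not specified
            nxt[k + 1] += mass * p       # this trial is specified
        pmf = nxt
    return pmf


def flag_facility(probs, observed, alpha=0.05):
    """Compare a facility's observed count of specified diagnoses with the
    count expected from its risk-adjusted probabilities."""
    pmf = poisson_binomial_pmf(probs)
    lower_tail = sum(pmf[: observed + 1])  # P(X <= observed)
    upper_tail = sum(pmf[observed:])       # P(X >= observed)
    if lower_tail < alpha / 2:
        return "under-specifying"
    if upper_tail < alpha / 2:
        return "over-specifying"
    return "in line with peers"


if __name__ == "__main__":
    # Hypothetical facility: 20 hospitalizations, each with predicted
    # probability 0.9 of a specified code, but only 10 specified codes seen.
    print(flag_facility([0.9] * 20, observed=10))
```

The dynamic program costs on the order of n² operations for a facility with n hospitalizations, which is manageable per facility; for very large facilities a normal or FFT-based approximation could be substituted.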
From a practical standpoint, the model outcomes can serve multiple purposes toward enhancing clinical data abstraction: (1) as flags for facilities that, upon risk-adjusting for their patient mix, may be operating at standards that differ widely from those of their peers, in the form of either under-specificity or over-specificity; (2) as an intra-facility flag for physicians or units who may be under- or over-specifying when measured against peers, who may be internal or external to the facility; (3) as an intra-facility flag for specificity practices across health conditions; and (4) as a measure of the clinical abstractors themselves, to support practical root cause analysis. In all cases, the actionable step after flagging such differences against peers could be a more in-depth review of whether diagnoses are insufficiently precise, whether personnel are insufficiently versed in the granularity offered by ICD-10 codes, whether clinical abstraction could be enhanced (e.g., where clinical diagnoses are insufficient or incorrectly recorded), or whether the diagnoses are overly precise given the information within the respective clinical records. Our approach can be applied across health conditions and units, thus serving as an automated and low-cost first-warning system for coding specificity practices. Both quality-control personnel within the facilities and those outside of them (e.g., claims personnel) can therefore assess coding practices that may depart from standard practice, with or without cause, without the need for a full clinical assessment across patients, which would be substantially more costly.
While false positives may occur and coding specificity practices may be warranted on a clinical basis, this approach can serve to identify facilities, units, or physicians most likely to be true positives (intended or unintended) and who may be departing from such practices in ways that may need to be addressed. Ultimately, this would result in a benefit for both the facilities and patients, enhancing the quality of medical records and identifying and resolving inefficiencies where present. Facilities could benefit from the maximization of reimbursement (when under-specifying) and the minimization of risks (e.g., reputational or financial) due to over-specification [2,5].
As the U.S. and other countries look toward the implementation of ICD-11, standardized methods to measure variation in coding, such as those proposed here, will have an important role in providing hospitals with a fair benchmark against which coding practices can be evaluated. Since it is unclear whether there may already exist coding specificity differences between the U.S. and other countries due to the lack of literature, further studies are needed across healthcare delivery systems to assess whether the findings in our study also apply to systems that may be more centralized, such as the United Kingdom’s National Health Service. The effect of the changes from ICD-10 to ICD-11 on such potential differences across healthcare systems is also unclear. However, our model allows for a rolling estimation of specificity levels. Thus, the impact of interventions, such as those derived from quality-control actions or from transitions from ICD-10 to ICD-11, could be measured with approaches such as interrupted time series analyses.
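As a rough illustration of the interrupted time series idea, the sketch below fits a segmented linear regression to a series of periodic specificity rates, estimating a baseline level and trend plus a level change and a trend change at the intervention. Everything here is an assumption made for illustration: the function names, the four-term model, and the noise-free toy series are hypothetical, and a real analysis would also need to handle autocorrelation and seasonality.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular design matrix")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_interrupted_time_series(rates, intervention_index):
    """Least-squares fit of
    rate_t = b0 + b1*t + b2*post_t + b3*post_t*(t - intervention_index)."""
    rows = []
    for t in range(len(rates)):
        post = 1.0 if t >= intervention_index else 0.0
        rows.append([1.0, float(t), post, post * (t - intervention_index)])
    k = 4
    # Normal equations: (X'X) beta = X'y
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, rates)) for i in range(k)]
    b0, b1, b2, b3 = solve_linear_system(xtx, xty)
    return {"baseline_level": b0, "baseline_trend": b1,
            "level_change": b2, "trend_change": b3}


if __name__ == "__main__":
    # Hypothetical monthly specificity rates with an intervention at month 6.
    pre = [0.60 + 0.01 * t for t in range(6)]
    post = [0.60 + 0.01 * t + 0.05 + 0.02 * (t - 6) for t in range(6, 12)]
    print(fit_interrupted_time_series(pre + post, intervention_index=6))
```

The fit requires at least two periods on each side of the intervention; otherwise the design matrix is singular and the function raises an error.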
While our approach does not provide a raw measure to define ‘correct’ levels of coding specificity, it provides the user with a peer-based metric. Institutions that aspire to perform in line with industry standards (or standards defined by a subset of peers) can compare themselves with these standards through the counterfactual outcomes of this model. To our knowledge, the approach demonstrated in this manuscript is the first to address, in a fully extrapolatable way, the issue of diagnostic coding specificity in a large, population-based study. Finally, while our approach is built on a logistic regression model, alternative approaches are possible. We proposed a logistic regression approach due to the additional interpretability of the intermediate model outcomes. Also, this approach is useful as it serves as a natural intermediate outcome (estimated probability of specificity) for grouping/clustering across hospitalizations that share common underlying traits, such as hospitals, physicians, states, or any other clustering variable. However, other artificial intelligence/supervised learning approaches may be better suited when predictability at the hospitalization level is more relevant than the analysis of coding specificity practices.
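In symbols, the two-stage construction can be summarized as follows. The notation is introduced here for illustration only and does not appear in the article: $Y_{ij}$ indicates whether hospitalization $i$ at facility $j$ received a specified depression code, $x_{ij}$ collects its patient and facility characteristics, $\hat{\beta}$ are the fitted logistic regression coefficients, and $n_j$ is the number of hospitalizations at facility $j$. The Poisson Binomial step treats hospitalizations within a facility as independent.

```latex
\begin{align}
\hat{p}_{ij} &= \Pr(Y_{ij} = 1 \mid x_{ij})
              = \frac{1}{1 + \exp\!\left(-x_{ij}^{\top}\hat{\beta}\right)}, \\
S_j &= \sum_{i=1}^{n_j} Y_{ij}
      \sim \text{Poisson Binomial}\!\left(\hat{p}_{1j}, \dots, \hat{p}_{n_j j}\right), \\
\mathbb{E}[S_j] &= \sum_{i=1}^{n_j} \hat{p}_{ij}, \qquad
\operatorname{Var}(S_j) = \sum_{i=1}^{n_j} \hat{p}_{ij}\,\bigl(1 - \hat{p}_{ij}\bigr).
\end{align}
```

Replacing the facility index $j$ with a physician, state, or ownership group gives the same construction at any other grouping level.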

Strengths and Limitations

Claims data are generally more readily available and standardized to a greater degree than medical records, allowing for a larger observation cohort and greater generalizability of methods and results across diseases/patient cohorts. Our cohort, which comprises over one million observations, is, to our knowledge, the largest cohort in the literature for measuring and modeling the coding specificity practices of any disease. The primary limitation of relying on claims data is the lack of patient-level clinical data that would be contained in an electronic health record (EHR) or similar medical record. Clinical factors such as patient underlying health conditions, severity of patient health concerns, or whether procedures are urgent or elective likely play a role in the way patient diagnoses are coded and would serve to improve the robustness of our evaluation metrics. However, the information contained in our claims data, including patient- and facility-level characteristics, is sufficient to develop a metric by which to evaluate coding specificity, and the metric would only be improved by this additional information when it is available. Also, limiting a model to only be usable when such EHRs are available would hamper its practical utility. Some variable categories were also grouped due to low value counts (e.g., ages 1–4 and 5–9 combined into a single 0–9 category), but arguably some of these groupings could be deemed subjective. Regardless, their impact on the results is unlikely to be relevant, especially given the very low counts as a proportion of the overall sample size.
The facility type and distribution of physician specialties are not considered but could be relevant factors. Facilities that provide healthcare across a wide range of health conditions may not have the level of specialization among their physicians and coders compared with those in more specialized facilities.
Race was included to measure potential inequity of care (i.e., coding) and to demonstrate the approach in general terms among practitioners. However, the inclusion of this variable in the construction of risk-adjusted metrics continues to be a debatable topic, with practitioners still using it to guide clinical decision-making [30]. Because of the nature of this ongoing debate between recommendable and currently implemented practice, the variable was included to demonstrate differences by race, whether warranted by clinical diagnosis or not. The model can easily be adapted to exclude race and cluster practices, thus demonstrating differences by race and practice (or grouping across practices by race). This is outside the scope of this study and is left for future research.
While a facility may contain multiple hospitalizations per patient, we restricted our dataset to one hospitalization per patient (specifically, their first with a 2022 discharge date) to avoid excessive influence by patients who may have large numbers of inpatient stays due to recurring needs. This exclusion helped mitigate concerns that subsequent stays would no longer be independent hospitalizations. This cohort definition can be relaxed by including additional patient hospitalizations and random effects per patient, or by including a factor to account for second or later hospitalizations. However, the computational complexity and burden of such an approach should also be considered, as well as the heterogeneity of such a population. Ultimately, coding specificity during the first inpatient stay is likely to be a lower bound for the specificity of further inpatient stays with the same diagnoses if adequate records are maintained and clinical staff carefully review them, thus providing a conservative metric for each facility. At the facility level, random effects could be used for facilities; however, this would increase the computational complexity substantially.
Multicollinearity was observed for several variables, both at the patient and facility levels, so caution is recommended when drawing conclusions about individual variable relevance (or directionality of any association) for risk adjustment. However, to account for this limitation, we also performed univariate analyses in addition to multivariate analyses, providing additional information to measure variable associations. It is also important to note that this multicollinearity does not impact overall model performance or the development of a metric to assess variations in coding specificity. The multivariate models’ AUCs and the subsequent Poisson Binomial metrics would not be affected by multicollinearity, and, therefore, the model is flexible enough to be expanded with additional variables, if available, even if highly correlated with existing ones. Also, state-level clustering of facilities was not considered in this study, where hospitals may be part of a shared health system using centralized teams of coders or commonly defined standards. This may result in inter-facility correlations. In this case, the borrowing of information across facilities could be explored, though outside the scope of this study.
Observations are likely not independent since common latent factors could exist. For example, shared coders or physicians who may operate across facilities could breach the assumption of independence. Also, facilities may have common ownerships, which, in turn, could lead to similar standards of practice. However, these issues do not invalidate the methodology proposed. The grouping was demonstrated at the facility level, but it could be performed at any level, including at the physician or facility owner levels.
The definition of the secondary diagnosis was made to reflect any specified secondary diagnosis of depression. However, when multiple secondary diagnoses of depression are present, this binary definition could be subjective. Regardless, only a small number of hospitalizations reflected multiple secondary diagnoses of depression, and an analysis using an alternative definition of ‘all specified diagnoses of depression’ as the outcome rendered very small differences in AUC.

5. Conclusions

This study demonstrates a novel approach for measuring risk-adjusted coding specificity for principal and secondary diagnoses of depression, controlling for patient- and facility-level characteristics. This approach is extended to create an aggregate metric that can be used at coarser levels, grouping by any observable common factor, and is demonstrated at the facility level. We propose a multivariate logistic regression model for the risk-adjusted coding specificity of depression principal and secondary diagnoses. Our findings demonstrate that both patient and facility characteristics commonly available in claims data are relevant to explaining variability in the coding specificity of both the principal and secondary diagnoses of depression. This approach represents one of the building blocks for designing a risk-adjusted, facility-specific index that can be used by quality control personnel to compare facilities’ coding specificity practices with peers across diseases. While we demonstrate our novel approach with a large patient cohort diagnosed with depression during hospitalizations, the method can be applied to any disease cohort and any grouping-level variable. Therefore, our approach fills a gap in the already scarce literature on coding specificity.

Author Contributions

Conceptualization, J.M., M.K. and L.H.G.; methodology, L.H.G.; formal analysis, A.G., N.C.M., C.M., S.M., K.M., A.H., K.P., K.T. and L.H.G.; data curation, M.K.; writing—original draft preparation, A.G., A.H., N.C.M., S.M., C.M., K.M., K.P., K.T. and L.H.G.; writing—review and editing, A.G., A.H., N.C.M., S.M., C.M., K.M., K.P., K.T., J.M., M.K. and L.H.G.; visualization, A.G., A.H., N.C.M., S.M., C.M., K.M., K.P., K.T. and L.H.G.; supervision, L.H.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

This study is exempt from UNC Charlotte Institutional Review Board review.

Informed Consent Statement

Data were de-identified and provided by Premier, Inc. No informed consent was required.

Data Availability Statement

The datasets presented in this article are not readily available because they are the property of Premier, Inc. Requests to access the datasets should be directed to Premier, Inc. via pinc-ai.com.

Conflicts of Interest

John Martin and Michael Korvink work for and own stock in Premier, Inc. All other authors declare no conflicts of interest.

References

  1. Hirsch, J.A.; Nicola, G.N.; McGinty, G.; Liu, R.W.; Barr, R.M.; Chittle, M.D.; Manchikanti, L. ICD-10: History and Context. Am. J. Neuroradiol. 2016, 37, 596–599.
  2. Centers for Disease Control and Prevention (CDC). International Classification of Diseases, (ICD-10-CM/PCS) Transition-Background. CDC. 2015. Available online: https://www.cdc.gov/nchs/icd/icd10cm_pcs_background.htm (accessed on 18 October 2023).
  3. Boyd, A.D.; Li, J.; Burton, M.; Jonen, M.; Gardeux, V.; Achour, I.; Luo, R.Q.; Zenku, I.; Bahroos, N.; Brown, S.B.; et al. The Discriminatory Cost of ICD-10-CM Transition between Clinical Specialties: Metrics, Case Study, and Mitigating Tools. J. Am. Med. Inform. Assoc. 2013, 20, 708–717.
  4. Grasso, M.A.; Dezman, Z.D.W.; Jerrard, D.A. Coding Disparity and Specificity during Emergency Department Visits after Transitioning to the Tenth Version of the International Classification of Diseases. In AMIA Annual Symposium Proceedings; American Medical Informatics Association: Washington, DC, USA, 2022; pp. 495–501.
  5. Zegan, J. Improving Specificity in ICD-10 Diagnosis Coding. American Health Information Management Association (AHIMA). Available online: https://library.ahima.org/doc?oid=302473 (accessed on 18 October 2023).
  6. Rangachari, P. Coding for Quality Measurement: The Relationship between Hospital Structural Characteristics and Coding Accuracy from the Perspective of Quality Measurement. Perspect. Health Inf. Manag. 2007, 4, 3.
  7. Department of Health and Human Services. Information and Resources for Submitting Correct ICD-10 Codes to Medicare. Medicare Learning Network (MLN) Matters 2014: SE1518. Available online: https://www.hhs.gov/guidance/sites/default/files/hhs-guidance-documents/SE1518.pdf (accessed on 11 November 2023).
  8. American Hospital Association (AHA). Using the X-ray Report for Specificity. AHA Coding Clinic for ICD-10-CM and ICD-10-PCS (First Quarter 2013); AHA Central Office: Chicago, IL, USA, 2013; p. 28.
  9. American Hospital Association (AHA). Use of Imaging Reports for Greater Specificity. AHA Coding Clinic for ICD-10-CM and ICD-10-PCS (Third Quarter 2014); AHA Central Office: Chicago, IL, USA, 2014; p. 5.
  10. American Hospital Association (AHA). Use of X-ray to Determine Site of Pain. AHA Coding Clinic for ICD-10-CM and ICD-10-PCS (Fourth Quarter 2016); AHA Central Office: Chicago, IL, USA, 2016; p. 143.
  11. Mendez, C.M.; Harrington, D.W.; Christenson, P.D.; Spellberg, B. Impact of Hospital Variables on Case Mix Index as a Marker of Disease Severity. Popul. Health Manag. 2014, 17, 28–34.
  12. O’Malley, K.J.; Cook, K.F.; Price, M.D.; Wildes, K.R.; Hurdle, J.F.; Ashton, C.M. Measuring Diagnoses: ICD Code Accuracy. Health Serv. Res. 2005, 40, 1620–1639.
  13. Horsky, J.; Drucker, E.A.; Ramelson, H.Z. Accuracy and Completeness of Clinical Coding Using ICD-10 for Ambulatory Visits. In AMIA Annual Symposium Proceedings; American Medical Informatics Association: Washington, DC, USA, 2017; pp. 912–920.
  14. Beam, K.S.; Lee, M.; Hirst, K.; Beam, A.; Parad, R.B. Specificity of International Classification of Diseases Codes for Bronchopulmonary Dysplasia: An Investigation Using Electronic Health Record Data and a Large Insurance Database. J. Perinatol. 2021, 41, 764–771.
  15. Quan, H.; Li, B.; Saunders, D.L.; Parsons, G.A.; Nilsson, C.; Alibhai, A.; Ghali, W.A. Assessing Validity of ICD-9-CM and ICD-10 Administrative Data in Recording Clinical Conditions in a Unique Dually Coded Database. Health Serv. Res. 2008, 43, 1424–1441.
  16. Rios, N.G.; Oldiges, P.E.; Lizano, M.S.; Daucet-Wadford, D.S.; Quick, D.L.; Martin, J.K.; Korvink, M.; Gunn, L.H. Modeling Coding Intensity of Procedures in a U.S. Population-Based Hip/Knee Arthroplasty Inpatient Cohort Adjusting for Patient- and Facility-Level Characteristics. Healthcare 2022, 10, 1368.
  17. Mishra, R.; Verma, H.; Aynala, V.B.; Arredondo, P.R.; Martin, J.K.; Korvink, M.; Gunn, L.H. Diagnostic Coding Intensity among a Pneumonia Inpatient Cohort Using a Risk-Adjustment Model and Claims Data: A U.S. Population-Based Study. Diagnostics 2022, 12, 1495.
  18. Lee, B.; Wang, Y.; Carlson, S.A.; Greenlund, K.J.; Lu, H.; Liu, Y.; Croft, J.B.; Eke, P.I.; Town, M.; Thomas, C.C. National, State-Level, and County-Level Prevalence Estimates of Adults Aged ≥ 18 Years Self-Reporting A Lifetime Diagnosis of Depression—United States, 2020. MMWR Morb. Mortal. Wkly. Rep. 2023, 72, 644–650.
  19. Cuncic, A.; Block, D.B. What Are the ICD-10 Criteria for Depression? Available online: https://www.verywellmind.com/icd-10-criteria-for-depression-5308497 (accessed on 11 November 2023).
  20. 2024 ICD-10-CM Diagnosis Code F32.9: Major Depressive Disorder, Single Episode, Unspecified. Available online: https://www.icd10data.com/ICD10CM/Codes/F01-F99/F30-F39/F32-/F32.9 (accessed on 11 November 2023).
  21. Torres, F. What Is Depression? American Psychiatric Association. October 2020. Available online: https://www.psychiatry.org/patients-families/depression/what-is-depression (accessed on 18 October 2023).
  22. PINC AI Applied Sciences. PINC AI Healthcare Database White Paper: Data That Informs and Performs; Premier Inc.: Charlotte, NC, USA, 2023; Available online: https://offers.pinc-ai.com/PINC-AI-Healthcare-Database-White-Paper-LP.html (accessed on 18 October 2023).
  23. Centers for Disease Control and Prevention (CDC), Agency for Toxic Substances and Disease Registry. CDC SVI Documentation. 2020. Available online: https://www.atsdr.cdc.gov/placeandhealthsvi/documentation/SVI_documentation_2020.html (accessed on 14 November 2023).
  24. Centers for Medicare & Medicaid Services, Office of the Actuary, National Health Statistics Group. National Health Expenditure Data: Health Expenditures by State of Residence. 2022. Available online: https://www.cms.gov/data-research/statistics-trends-and-reports/national-health-expenditure-data/state-residence (accessed on 14 November 2023).
  25. Higashi, T.; Shekelle, P.G.; Solomon, D.; Knight, E.L.; Roth, C.P.; Chang, J.T.; Kamberg, C.; MacLean, C.; Young, R.; Adams, J.L.; et al. Quality of Health Care Received by Older Adults; RAND Corporation: Santa Monica, CA, USA, 2004; Available online: https://www.rand.org/pubs/research_briefs/RB9051.html (accessed on 15 November 2023).
  26. Richardson, A.; Allen, J.A.; Xiao, H.; Vallone, D. Effects of Race/Ethnicity and Socioeconomic Status on Health Information-Seeking, Confidence, and Trust. J. Health Care Poor Underserved. 2012, 23, 1477–1493.
  27. Vyas, C.M.; Donneyong, M.; Mischoulon, D.; Chang, G.; Gibson, H.; Cook, N.R.; Manson, J.E.; Reynolds III, C.F.; Okereke, O.I. Association of Race and Ethnicity with Late-Life Depression Severity, Symptom Burden, and Care. JAMA Netw. Open 2020, 3, e201606.
  28. United States Census Bureau. Age-Sex Pyramid for the United States. Available online: https://www.census.gov/library/visualizations/interactive/age-sex-pyramid-for-the-united-states.html (accessed on 11 November 2023).
  29. Johnson, E.K.; Wojtesta, M.A.; Crosby, S.W.; Duber, H.C.; Jun, E.; Lescinsky, H.; Nguyen, P.; Sahu, M.; Thomson, A.; Tsakalos, G.; et al. Varied Health Spending Growth Across US States Was Associated with Incomes, Price Levels, and Medicaid Expansion, 2000–2019. Health Aff. 2022, 41, 1088–1097.
  30. Vyas, D.A.; Eisenstein, L.G.; Jones, D.S. Hidden in Plain Sight—Reconsidering the Use of Race Correction in Clinical Algorithms. N. Engl. J. Med. 2020, 383, 874–882.
Figure 1. Receiver operating characteristic (ROC) curves of specificity of a depression-related principal diagnosis (a) and secondary diagnosis (b) using the multivariate logistic regression model.
Figure 2. Observed counts of specificity of depression principal (a) and secondary (b) diagnoses by facility (dots) for two samples of 20 facilities, together with 95% confidence intervals based on the Poisson Binomial model. Facilities that under-specify depression diagnoses compared to healthcare industry peers are depicted in blue (p < 0.025), while those that over-specify depression diagnoses compared to peers are depicted in orange (p > 0.975). Facilities that specify depression diagnoses in line with peers are depicted in black.
Figure 3. U.S. map representing the adjusted odds ratios, by state, of specificity of depression-related principal (a) and secondary (b) diagnoses against the reference state of New York. Non-significant adjusted odds ratios are represented in gray. Under-specificity is represented in blue, while over-specificity is clustered across three different groups (yellow, orange, and brown) based on adjusted odds ratio ranges.
Table 1. Summary statistics, including counts (%) and means/proportions (standard deviations [SD]) of study outcomes as well as patient- and facility-level characteristics.
| Study Variables | Count or Mean/Proportion (% or Standard Deviation [SD]) |
| --- | --- |
| Outcomes | |
| Specificity of the depression principal diagnosis (count, proportion) | 11,701 (71%) |
| Specificity of depression secondary diagnoses (count, proportion) | 80,116 (8%) |
| Patient Characteristics | |
| Age (Years) | |
|   0–9 | 208 (<1%) |
|   10–14 | 5408 (1%) |
|   15–19 | 16,017 (1%) |
|   20–24 | 28,158 (3%) |
|   25–34 | 86,719 (8%) |
|   35–44 | 90,578 (8%) |
|   45–54 | 120,430 (11%) |
|   55–59 | 91,701 (9%) |
|   60–64 | 114,939 (11%) |
|   65–69 | 124,098 (12%) |
|   70–74 | 123,580 (12%) |
|   75–79 | 108,929 (10%) |
|   80–84 | 77,339 (7%) |
|   85+ | 83,471 (8%) |
| Sex | |
|   Female | 698,038 (65%) |
|   Male | 373,537 (35%) |
| Race | |
|   Asian | 11,726 (1%) |
|   Black | 118,837 (11%) |
|   Other | 63,219 (6%) |
|   Unable to determine | 24,007 (2%) |
|   White | 853,786 (80%) |
| Log (Length of Stay) (Days) (mean, SD) | 1.4 (0.87) |
| Primary Payor | |
|   Charity/Indigent | 2068 (<1%) |
|   Commercial indemnity | 59,773 (6%) |
|   Direct employer contract | 3106 (<1%) |
|   Managed care capitated | 2466 (<1%) |
|   Managed care non-capitated | 143,667 (13%) |
|   Medicaid-managed care capitated | 22,545 (2%) |
|   Medicaid-managed care non-capitated | 120,312 (11%) |
|   Medicaid traditional | 57,926 (5%) |
|   Medicare-managed care capitated | 38,172 (4%) |
|   Medicare-managed care non-capitated | 251,837 (24%) |
|   Medicare traditional | 312,783 (29%) |
|   Other | 11,980 (1%) |
|   Other government payors | 21,483 (2%) |
|   Self-pay | 21,213 (2%) |
|   Workers compensation | 2244 (<1%) |
| Point of Origin | |
|   Clinic | 83,151 (8%) |
|   Court/Law enforcement | 1330 (<1%) |
|   Information not available | 14,016 (1%) |
|   Non-healthcare facility | 856,254 (80%) |
|   Other | 859 (<1%) |
|   Transfer from ambulatory surgery center | 1187 (<1%) |
|   Transfer from department unit in same hospital, separate claim | 5690 (1%) |
|   Transfer from health facility | 12,915 (1%) |
|   Transfer from hospice and under hospice | 94 (<1%) |
|   Transfer from hospital (different facility) | 68,715 (6%) |
|   Transfer from SNF¹ or ICF² | 27,364 (3%) |
| Discharge Status | |
|   Acute inpatient readmission | 2038 (<1%) |
|   Discharged to home health organization | 171,209 (16%) |
|   Discharged to home or self-care | 571,921 (53%) |
|   Discharged to hospice-home | 13,897 (1%) |
|   Discharged to hospice-medical facility | 14,015 (1%) |
|   Discharged/Transferred to another rehab facility | 33,314 (3%) |
|   Discharged/Transferred to cancer center/children’s hospital | 292 (<1%) |
|   Discharged/Transferred to court/law enforcement | 2061 (<1%) |
|   Discharged/Transferred to critical access hospital | 196 (<1%) |
|   Discharged/Transferred to federal hospital | 307 (<1%) |
|   Discharged/Transferred to ICF² | 8244 (1%) |
|   Discharged/Transferred to long-term care hospital | 6954 (1%) |
|   Discharged/Transferred to nursing facility | 1394 (<1%) |
|   Discharged/Transferred to other facility | 16,660 (2%) |
|   Discharged/Transferred to other health institute not in the list | 2823 (<1%) |
|   Discharged/Transferred to psychiatric hospital | 16,276 (2%) |
|   Discharged/Transferred to SNF¹ | 153,708 (14%) |
|   Discharged/Transferred to swing bed | 2706 (<1%) |
|   Expired | 28,121 (3%) |
|   Information not available | 3794 (<1%) |
|   Left against medical advice | 21,555 (2%) |
|   Still a patient, expected to return | 90 (<1%) |
| Count of Procedures (mean, SD) | 2.9 (2.7) |
| CMS³ Fiscal Year | |
|   2022 | 804,999 (75%) |
|   2023 | 266,576 (25%) |
| Social Vulnerability Indices (mean, SD) | |
|   Household characteristics | 0.51 (0.25) |
|   Housing type and transportation | 0.60 (0.25) |
|   Overall | 0.58 (0.25) |
|   Racial and ethnic minority status | 0.66 (0.24) |
|   Socioeconomic status | 0.53 (0.26) |
| COVID-19 Status | |
|   Not identified | 994,349 (93%) |
|   Positive | 77,226 (7%) |
| MS-DRG⁴ Type | |
|   Medical | 794,118 (74%) |
|   Surgical | 277,407 (26%) |
|   Unknown | 50 (<1%) |
| Facility Characteristics | |
| Teaching Status | |
|   No | 780,718 (73%) |
|   Not available | 15,313 (1%) |
|   Yes | 275,544 (26%) |
| Academic Status | |
|   No | 885,959 (83%) |
|   Yes | 185,616 (17%) |
| Rural/Urban Status | |
|   Rural | 132,668 (12%) |
|   Urban | 938,907 (88%) |
| Ownership | |
|   Government—federal | 1439 (<1%) |
|   Government—hospital district/authority | 62,451 (6%) |
|   Government—local | 34,529 (3%) |
|   Government—state | 10,281 (1%) |
|   Not available | 4771 (<1%) |
|   Physician | 1135 (<1%) |
|   Proprietary | 43,045 (4%) |
|   Voluntary non-profit—church | 156,542 (15%) |
|   Voluntary non-profit—other | 61,799 (6%) |
|   Voluntary non-profit—private | 695,583 (65%) |
| Bed Count | |
|   1–50 | 27,588 (3%) |
|   51–100 | 52,634 (5%) |
|   101–150 | 76,610 (7%) |
|   151–200 | 71,842 (7%) |
|   201–250 | 96,827 (9%) |
|   251–300 | 105,830 (10%) |
|   301–350 | 106,327 (10%) |
|   351–400 | 75,304 (7%) |
|   >400 | 458,613 (43%) |
| Hospital Case Mix Index (mean, SD) | 1.7 (0.29) |
| State Abbreviation | |
|   AK | 460 (<1%) |
|   AL | 7755 (1%) |
|   AR | 11,427 (1%) |
|   AZ | 33,032 (3%) |
|   CA | 50,806 (5%) |
|   CO | 9128 (1%) |
|   CT | 12,879 (1%) |
|   DE | 7421 (1%) |
|   FL | 99,679 (9%) |
|   GA | 13,079 (1%) |
|   HI | 5284 (<1%) |
|   IA | 18,412 (2%) |
|   ID | 6538 (1%) |
|   IL | 50,471 (5%) |
|   IN | 20,789 (2%) |
|   KS | 9021 (1%) |
|   KY | 21,224 (2%) |
|   LA | 7732 (1%) |
|   MA | 10,212 (1%) |
|   MD | 14,701 (1%) |
|   ME | 85 (<1%) |
|   MI | 41,774 (4%) |
|   MN | 9933 (1%) |
|   MO | 14,874 (1%) |
|   MS | 11,179 (1%) |
|   MT | 6795 (1%) |
|   NC | 59,473 (6%) |
|   ND | 2704 (<1%) |
|   NE | 6775 (1%) |
|   NH | 3185 (<1%) |
|   NJ | 18,251 (2%) |
|   NM | 5922 (1%) |
|   NV | 6944 (1%) |
|   NY | 73,612 (7%) |
|   OH | 56,422 (5%) |
|   OK | 22,716 (2%) |
|   OR | 18,000 (2%) |
|   PA | 54,495 (5%) |
|   RI | 3212 (<1%) |
|   SC | 25,162 (2%) |
|   SD | 4187 (<1%) |
|   TN | 33,118 (3%) |
|   TX | 59,602 (6%) |
|   UT | 119 (<1%) |
|   VA | 35,871 (3%) |
|   VT | 3534 (<1%) |
|   WA | 22,441 (2%) |
|   WI | 30,770 (3%) |
|   WV | 28,164 (3%) |
|   WY | 2206 (<1%) |

¹ SNF: Skilled Nursing Facility; ² ICF: Intermediate Care Facility; ³ CMS: Centers for Medicare and Medicaid Services; ⁴ MS-DRG: Medicare Severity Diagnosis Related Group.
Table 2. Odds ratios (ORs), 95% confidence intervals (CIs), and p-values for univariate and multivariate logistic regression analyses for coding the specificity of a depression principal diagnosis.
| Variable | Univariate OR | Univariate 95% CI | Univariate p | Multivariate OR | Multivariate 95% CI | Multivariate p |
|---|---|---|---|---|---|---|
| Intercept | - | - | - | 0.622 | 0.315–1.228 | 0.171 |
| Age (Ref: 85+) | | | | | | |
| 0–9 | 4.449 | 1.964–10.076 | <0.001 | 1.812 | 0.746–4.403 | 0.190 |
| 10–14 | 5.179 | 3.888–6.899 | <0.001 | 2.141 | 1.483–3.091 | <0.001 |
| 15–19 | 4.758 | 3.605–6.279 | <0.001 | 2.333 | 1.634–3.331 | <0.001 |
| 20–24 | 2.930 | 2.204–3.895 | <0.001 | 1.982 | 1.383–2.841 | <0.001 |
| 25–34 | 2.292 | 1.742–3.017 | <0.001 | 1.578 | 1.113–2.238 | 0.010 |
| 35–44 | 2.040 | 1.547–2.690 | <0.001 | 1.459 | 1.030–2.067 | 0.034 |
| 45–54 | 2.518 | 1.904–3.330 | <0.001 | 1.760 | 1.243–2.490 | 0.001 |
| 55–59 | 2.346 | 1.748–3.147 | <0.001 | 1.660 | 1.160–2.375 | 0.006 |
| 60–64 | 2.275 | 1.690–3.063 | <0.001 | 1.699 | 1.184–2.438 | 0.004 |
| 65–69 | 2.128 | 1.567–2.889 | <0.001 | 1.628 | 1.137–2.331 | 0.008 |
| 70–74 | 2.207 | 1.612–3.021 | <0.001 | 1.815 | 1.258–2.617 | 0.001 |
| 75–79 | 1.795 | 1.291–2.497 | 0.001 | 1.488 | 1.015–2.179 | 0.041 |
| 80–84 | 1.455 | 1.013–2.091 | 0.043 | 1.248 | 0.821–1.898 | 0.299 |
| Sex (Ref: Female) | | | | | | |
| Male | 0.735 | 0.686–0.786 | <0.001 | 0.760 | 0.702–0.821 | <0.001 |
| Race (Ref: White) | | | | | | |
| Asian | 1.387 | 1.095–1.755 | 0.007 | 1.221 | 0.924–1.615 | 0.161 |
| Black | 0.874 | 0.791–0.965 | 0.008 | 0.948 | 0.841–1.069 | 0.386 |
| Other | 1.008 | 0.898–1.133 | 0.889 | 1.004 | 0.874–1.153 | 0.954 |
| Unable to determine | 1.222 | 1.013–1.474 | 0.037 | 0.958 | 0.766–1.197 | 0.703 |
| Log (Length of Stay) | 1.665 | 1.589–1.745 | <0.001 | 1.820 | 1.724–1.922 | <0.001 |
| Primary Payor (Ref: Medicare traditional) | | | | | | |
| Charity/indigent | 2.838 | 1.169–6.891 | 0.021 | 1.650 | 0.599–4.549 | 0.333 |
| Commercial-indemnity | 1.380 | 1.197–1.591 | <0.001 | 1.394 | 1.141–1.704 | 0.001 |
| Direct employer contract | 2.167 | 1.234–3.804 | 0.007 | 2.226 | 1.215–4.077 | 0.010 |
| Managed care capitated | 2.960 | 1.684–5.203 | <0.001 | 1.332 | 0.718–2.470 | 0.363 |
| Managed care non-capitated | 1.905 | 1.686–2.153 | <0.001 | 1.199 | 1.009–1.425 | 0.039 |
| Medicaid-managed care capitated | 1.035 | 0.883–1.212 | 0.675 | 1.366 | 1.093–1.707 | 0.006 |
| Medicaid-managed care non-capitated | 1.939 | 1.706–2.204 | <0.001 | 1.050 | 0.876–1.260 | 0.597 |
| Medicaid traditional | 1.180 | 1.030–1.351 | 0.017 | 0.838 | 0.692–1.015 | 0.071 |
| Medicare-managed care capitated | 0.828 | 0.626–1.095 | 0.185 | 1.007 | 0.717–1.414 | 0.970 |
| Medicare-managed care non-capitated | 1.501 | 1.275–1.767 | <0.001 | 1.313 | 1.090–1.582 | 0.004 |
| Other | 2.177 | 1.700–2.787 | <0.001 | 1.182 | 0.846–1.650 | 0.327 |
| Other government payors | 2.148 | 1.700–2.714 | <0.001 | 1.303 | 0.989–1.718 | 0.060 |
| Self-Pay | 1.460 | 1.220–1.746 | <0.001 | 1.158 | 0.922–1.454 | 0.207 |
| Workers compensation | 1.622 | 0.429–6.135 | 0.476 | 3.314 | 0.753–14.592 | 0.113 |
| Point of Origin (Ref: Non-healthcare facility) | | | | | | |
| Clinic | 0.806 | 0.691–0.940 | 0.006 | 1.031 | 0.826–1.287 | 0.786 |
| Court/Law enforcement | 0.645 | 0.457–0.911 | 0.013 | 0.540 | 0.368–0.792 | 0.002 |
| Information not available | 1.432 | 1.213–1.689 | <0.001 | 0.843 | 0.652–1.090 | 0.193 |
| Transfer from ambulatory surgery center | 1.315 | 0.477–3.620 | 0.596 | 0.820 | 0.286–2.348 | 0.711 |
| Transfer from dept unit in same hospital, separate claim | 1.757 | 1.406–2.196 | <0.001 | 1.371 | 1.056–1.780 | 0.018 |
| Transfer from health facility | 1.199 | 0.935–1.538 | 0.153 | 1.135 | 0.854–1.509 | 0.383 |
| Transfer from hospice and under hospice program | 0.876 | 0.079–9.670 | 0.914 | 2.441 | 0.172–34.625 | 0.510 |
| Transfer from hospital (different facility) | 1.487 | 1.346–1.642 | <0.001 | 1.098 | 0.968–1.246 | 0.145 |
| Transfer from SNF¹ or ICF² | 0.712 | 0.428–1.186 | 0.192 | 0.793 | 0.444–1.417 | 0.433 |
| Discharge Status (Ref: Discharged to Home or Self Care) | | | | | | |
| Acute inpatient readmission | 0.719 | 0.369–1.402 | 0.333 | 1.278 | 0.612–2.669 | 0.513 |
| Discharged to home health organization | 0.523 | 0.335–0.816 | 0.004 | 1.067 | 0.646–1.763 | 0.800 |
| Discharged to hospice-home | 0.411 | 0.149–1.135 | 0.086 | 0.516 | 0.161–1.657 | 0.266 |
| Discharged to hospice-medical facility | 0.432 | 0.186–1.000 | 0.050 | 0.280 | 0.108–0.721 | 0.008 |
| Discharged/Transferred to another rehab facility | 0.617 | 0.443–0.859 | 0.004 | 0.766 | 0.522–1.124 | 0.173 |
| Discharged/Transferred to cancer ctr/children’s hospital | 1.919 | 0.559–6.589 | 0.301 | 1.282 | 0.354–4.646 | 0.705 |
| Discharged/Transferred to court/law enforcement | 0.523 | 0.335–0.816 | 0.004 | 1.067 | 0.646–1.763 | 0.800 |
| Discharged/Transferred to critical access hospital | 0.360 | 0.022–5.753 | 0.470 | 0.293 | 0.014–6.096 | 0.428 |
| Discharged/Transferred to federal hospital | 0.360 | 0.022–5.753 | 0.470 | 0.491 | 0.025–9.699 | 0.641 |
| Discharged/Transferred to ICF² | 0.648 | 0.414–1.013 | 0.057 | 0.459 | 0.279–0.756 | 0.002 |
| Discharged/Transferred to long term care hospital | 0.899 | 0.282–2.869 | 0.858 | 0.603 | 0.171–2.126 | 0.432 |
| Discharged/Transferred to nursing facility | 1.239 | 0.589–2.605 | 0.572 | 0.574 | 0.258–1.277 | 0.173 |
| Discharged/Transferred to other facility | 0.801 | 0.613–1.048 | 0.106 | 1.088 | 0.806–1.469 | 0.582 |
| Discharged/Transferred to other health institute not in list | 0.658 | 0.484–0.894 | 0.007 | 0.848 | 0.599–1.201 | 0.354 |
| Discharged/Transferred to psychiatric hospital | 0.719 | 0.638–0.812 | <0.001 | 0.974 | 0.843–1.125 | 0.719 |
| Discharged/Transferred to SNF¹ | 0.352 | 0.285–0.434 | <0.001 | 0.473 | 0.362–0.617 | <0.001 |
| Discharged/Transferred to swing bed | 0.360 | 0.022–5.753 | 0.470 | 0.331 | 0.017–6.273 | 0.461 |
| Expired | 1.199 | 0.330–4.360 | 0.783 | 1.249 | 0.295–5.295 | 0.763 |
| Information not available | 0.642 | 0.334–1.237 | 0.186 | 1.867 | 0.841–4.144 | 0.125 |
| Left against medical advice | 0.371 | 0.278–0.495 | <0.001 | 0.576 | 0.414–0.800 | 0.001 |
| Still a patient-expected to return | 0.180 | 0.016–1.984 | 0.161 | 0.160 | 0.014–1.897 | 0.146 |
| Count of Procedures | 0.945 | 0.903–0.989 | 0.015 | 1.049 | 0.978–1.125 | 0.184 |
| CMS³ Fiscal Year (Ref: 2022) | | | | | | |
| 2023 | 0.931 | 0.861–1.007 | 0.074 | 0.958 | 0.877–1.047 | 0.349 |
| Social Vulnerability Index | | | | | | |
| Household characteristics | 0.668 | 0.586–0.760 | <0.001 | 0.640 | 0.356–1.149 | 0.135 |
| Housing type and transportation | 0.585 | 0.506–0.676 | <0.001 | 0.731 | 0.359–1.488 | 0.388 |
| Overall | 0.673 | 0.589–0.768 | <0.001 | 6.316 | 0.807–49.465 | 0.079 |
| Racial and ethnic minority status | 0.717 | 0.623–0.825 | <0.001 | 0.506 | 0.321–0.798 | 0.003 |
| Socioeconomic status | 0.764 | 0.671–0.871 | <0.001 | 0.349 | 0.119–1.020 | 0.054 |
| COVID-19 Status (Ref: Not identified) | | | | | | |
| Positive | 0.933 | 0.819–1.064 | 0.300 | 0.943 | 0.811–1.096 | 0.441 |
| MS-DRG⁴ Type (Ref: Medical) | | | | | | |
| Surgical | 0.135 | 0.102–0.178 | <0.001 | 0.288 | 0.204–0.408 | <0.001 |
| Teaching Status (Ref: No) | | | | | | |
| Not Available | 1.038 | 0.625–1.725 | 0.885 | 1.246 | 0.672–2.311 | 0.485 |
| Yes | 0.596 | 0.554–0.641 | <0.001 | 0.680 | 0.572–0.810 | <0.001 |
| Academic Status (Ref: No) | | | | | | |
| Yes | 0.698 | 0.636–0.765 | <0.001 | 1.465 | 1.169–1.836 | 0.001 |
| Rural/Urban Status (Ref: Urban) | | | | | | |
| Rural | 1.118 | 1.022–1.223 | 0.014 | 1.009 | 0.854–1.193 | 0.917 |
| Ownership (Ref: Voluntary non-profit private) | | | | | | |
| Government—hospital district/authority | 1.530 | 1.323–1.769 | <0.001 | 0.893 | 0.716–1.114 | 0.317 |
| Government—local | 1.106 | 0.920–1.316 | 0.256 | 0.512 | 0.378–0.693 | <0.001 |
| Government—state | 0.565 | 0.469–0.681 | <0.001 | 0.408 | 0.290–0.574 | <0.001 |
| Not available | 0.776 | 0.070–8.558 | 0.836 | 2.302 | 0.174–30.375 | 0.527 |
| Physician | >100 | 0.000–Inf | 0.936 | >100 | 0.000–Inf | 0.958 |
| Proprietary | 0.547 | 0.484–0.619 | <0.001 | 0.815 | 0.650–1.022 | 0.076 |
| Voluntary non-profit—church | 1.135 | 1.000–1.289 | 0.050 | 0.576 | 0.457–0.727 | <0.001 |
| Voluntary non-profit—other | 0.688 | 0.592–0.800 | <0.001 | 0.865 | 0.688–1.088 | 0.215 |
| Bed Count (Ref: >400) | | | | | | |
| 1–50 | 1.389 | 0.995–1.941 | 0.054 | 0.979 | 0.615–1.557 | 0.928 |
| 51–100 | 1.386 | 1.225–1.568 | <0.001 | 0.914 | 0.714–1.171 | 0.477 |
| 101–150 | 0.276 | 0.223–0.342 | <0.001 | 0.471 | 0.352–0.628 | <0.001 |
| 151–200 | 1.628 | 1.361–1.947 | <0.001 | 1.728 | 1.340–2.229 | <0.001 |
| 201–250 | 0.992 | 0.892–1.104 | 0.889 | 0.936 | 0.775–1.132 | 0.497 |
| 251–300 | 1.674 | 1.502–1.865 | <0.001 | 1.250 | 1.035–1.510 | 0.021 |
| 301–350 | 1.235 | 1.088–1.402 | 0.001 | 1.407 | 1.110–1.783 | 0.005 |
| 351–400 | 1.040 | 0.914–1.182 | 0.553 | 1.266 | 1.002–1.600 | 0.048 |
| Hospital Case Mix Index | 0.881 | 0.780–0.996 | 0.043 | 0.836 | 0.637–1.097 | 0.196 |
| State Abbreviation (Ref: NY) | | | | | | |
| AK | 1.086 | 0.302–3.909 | 0.899 | 1.988 | 0.450–8.773 | 0.364 |
| AL | 0.724 | 0.345–1.522 | 0.394 | 0.871 | 0.386–1.965 | 0.740 |
| AR | 1.037 | 0.751–1.430 | 0.827 | 3.641 | 2.073–6.395 | <0.001 |
| AZ | 0.269 | 0.196–0.368 | <0.001 | 0.402 | 0.264–0.614 | <0.001 |
| CA | 1.181 | 0.870–1.605 | 0.286 | 1.995 | 1.393–2.857 | <0.001 |
| CO | 0.603 | 0.255–1.429 | 0.251 | 1.037 | 0.409–2.628 | 0.939 |
| CT | 0.996 | 0.392–2.529 | 0.993 | 1.145 | 0.422–3.104 | 0.790 |
| DE | 1.150 | 0.607–2.178 | 0.668 | 1.600 | 0.803–3.187 | 0.181 |
| FL | 2.118 | 1.654–2.711 | <0.001 | 2.652 | 1.895–3.712 | <0.001 |
| GA | 1.159 | 0.590–2.276 | 0.669 | 1.760 | 0.835–3.711 | 0.138 |
| HI | 2.106 | 1.625–2.730 | <0.001 | 2.883 | 1.927–4.313 | <0.001 |
| IA | 1.791 | 1.014–3.163 | 0.045 | 2.056 | 1.083–3.904 | 0.028 |
| ID | 1.563 | 0.876–2.788 | 0.131 | 1.819 | 0.888–3.723 | 0.102 |
| IL | 6.770 | 4.882–9.389 | <0.001 | 5.792 | 3.905–8.593 | <0.001 |
| IN | 2.325 | 1.339–4.037 | 0.003 | 2.956 | 1.583–5.519 | 0.001 |
| KS | 3.645 | 2.349–5.655 | <0.001 | 4.520 | 2.686–7.606 | <0.001 |
| KY | 1.970 | 1.195–3.246 | 0.008 | 2.527 | 1.368–4.667 | 0.003 |
| LA | 4.918 | 3.067–7.887 | <0.001 | 3.168 | 1.744–5.754 | <0.001 |
| MA | 1.320 | 0.708–2.464 | 0.383 | 1.138 | 0.565–2.291 | 0.718 |
| MD | 2.367 | 1.637–3.423 | <0.001 | 2.499 | 1.567–3.985 | <0.001 |
| ME | 0.724 | 0.045–11.660 | 0.820 | 0.692 | 0.041–11.753 | 0.799 |
| MI | 1.696 | 1.118–2.571 | 0.013 | 2.822 | 1.724–4.619 | <0.001 |
| MN | 2.858 | 1.664–4.911 | <0.001 | 3.433 | 1.916–6.152 | <0.001 |
| MO | 3.899 | 2.088–7.280 | <0.001 | 7.573 | 3.844–14.917 | <0.001 |
| MS | 0.800 | 0.417–1.536 | 0.503 | 1.041 | 0.509–2.130 | 0.911 |
| MT | 2.685 | 2.004–3.596 | <0.001 | 1.520 | 0.981–2.355 | 0.061 |
| NC | 3.280 | 2.553–4.215 | <0.001 | 4.336 | 3.099–6.067 | <0.001 |
| ND | 3.352 | 2.351–4.781 | <0.001 | 3.286 | 1.912–5.648 | <0.001 |
| NE | 2.723 | 2.054–3.611 | <0.001 | 3.339 | 2.092–5.327 | <0.001 |
| NH | 0.483 | 0.080–2.921 | 0.428 | 0.263 | 0.040–1.714 | 0.163 |
| NJ | 0.210 | 0.152–0.289 | <0.001 | 0.247 | 0.170–0.358 | <0.001 |
| NM | 0.840 | 0.475–1.486 | 0.549 | 1.046 | 0.536–2.041 | 0.895 |
| NV | 0.661 | 0.354–1.234 | 0.194 | 1.188 | 0.584–2.416 | 0.635 |
| OH | 2.763 | 2.178–3.505 | <0.001 | 3.738 | 2.682–5.209 | <0.001 |
| OK | 2.525 | 1.993–3.198 | <0.001 | 2.917 | 2.029–4.192 | <0.001 |
| OR | 1.673 | 1.036–2.700 | 0.035 | 1.543 | 0.890–2.672 | 0.122 |
| PA | 0.623 | 0.500–0.776 | <0.001 | 0.673 | 0.497–0.911 | 0.010 |
| RI | 0.921 | 0.711–1.194 | 0.536 | 0.671 | 0.429–1.049 | 0.080 |
| SC | 1.798 | 1.367–2.365 | <0.001 | 1.575 | 1.085–2.285 | 0.017 |
| SD | 3.331 | 1.241–8.940 | 0.017 | 4.060 | 1.423–11.586 | 0.009 |
| TN | 1.977 | 1.210–3.232 | 0.007 | 2.223 | 1.269–3.896 | 0.005 |
| TX | 6.753 | 5.012–9.099 | <0.001 | 7.051 | 4.861–10.227 | <0.001 |
| UT | 0.724 | 0.045–11.660 | 0.820 | 0.569 | 0.030–10.809 | 0.707 |
| VA | 1.034 | 0.665–1.609 | 0.880 | 1.535 | 0.928–2.538 | 0.095 |
| VT | >100 | 0.000–Inf | 0.921 | >100 | 0.000–Inf | 0.918 |
| WA | 1.228 | 0.811–1.859 | 0.332 | 1.755 | 1.059–2.909 | 0.029 |
| WI | 3.911 | 3.026–5.056 | <0.001 | 3.042 | 2.129–4.345 | <0.001 |
| WV | 2.286 | 1.696–3.081 | <0.001 | 1.513 | 1.017–2.251 | 0.041 |
| WY | 0.677 | 0.393–1.167 | 0.160 | 0.696 | 0.362–1.341 | 0.279 |
1 SNF: Skilled Nursing Facility; 2 ICF: Intermediate Care Facility; 3 CMS: U.S. Centers for Medicare and Medicaid Services; 4 MS-DRG: Medicare Severity Diagnosis Related Group.
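As a reading aid for Table 2, the following minimal Python sketch shows how a reported odds ratio and its 95% CI relate to the underlying logistic regression coefficient. It assumes the intervals are Wald-type (symmetric on the log-odds scale), which the table does not state, so the recovered standard errors and p-values are approximations only. The function name `wald_summary` is hypothetical and is not taken from the authors' code.

```python
import math


def wald_summary(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Recover the log-odds coefficient, standard error, z statistic, and
    two-sided p-value from a reported OR and 95% CI.

    Assumption: the CI is a Wald interval, exp(beta +/- z_crit * SE).
    """
    beta = math.log(odds_ratio)
    # The CI width on the log scale equals 2 * z_crit * SE under the Wald assumption.
    se = (math.log(ci_high) - math.log(ci_low)) / (2.0 * z_crit)
    z = beta / se
    # Two-sided tail probability of a standard normal variate.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return beta, se, z, p


# Multivariate rows from Table 2 (principal diagnosis).
rows = {
    "Sex: Male": (0.760, 0.702, 0.821),
    "Race: Black": (0.948, 0.841, 1.069),
}
for label, (or_, lo, hi) in rows.items():
    beta, se, z, p = wald_summary(or_, lo, hi)
    print(f"{label}: beta={beta:.3f}, SE={se:.3f}, z={z:.2f}, p={p:.3g}")
```

Under this assumption, an OR below 1 corresponds to a negative coefficient (lower odds of a specific code than the reference category), and a CI that spans 1 corresponds to a p-value above 0.05, as in the Black race row.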
Table 3. Odds ratios (ORs), 95% confidence intervals (CIs), and p-values for the univariate and multivariate logistic regression analyses for coding the specificity of depression-related secondary diagnoses.
| Variable | Univariate OR | Univariate 95% CI | Univariate p | Multivariate OR | Multivariate 95% CI | Multivariate p |
|---|---|---|---|---|---|---|
| Intercept | - | - | - | 0.026 | 0.024–0.029 | <0.001 |
| Age (Ref: 85+) | | | | | | |
| 0–9 | 1.950 | 1.195–3.182 | 0.007 | 1.729 | 1.036–2.883 | 0.036 |
| 10–14 | 6.860 | 6.345–7.416 | <0.001 | 4.036 | 3.688–4.416 | <0.001 |
| 15–19 | 3.957 | 3.756–4.169 | <0.001 | 3.002 | 2.822–3.192 | <0.001 |
| 20–24 | 1.806 | 1.718–1.898 | <0.001 | 1.804 | 1.704–1.909 | <0.001 |
| 25–34 | 1.410 | 1.356–1.465 | <0.001 | 1.466 | 1.401–1.535 | <0.001 |
| 35–44 | 1.517 | 1.461–1.575 | <0.001 | 1.554 | 1.487–1.624 | <0.001 |
| 45–54 | 1.447 | 1.396–1.499 | <0.001 | 1.490 | 1.430–1.553 | <0.001 |
| 55–59 | 1.394 | 1.342–1.448 | <0.001 | 1.434 | 1.375–1.496 | <0.001 |
| 60–64 | 1.285 | 1.238–1.333 | <0.001 | 1.316 | 1.264–1.370 | <0.001 |
| 65–69 | 1.283 | 1.238–1.331 | <0.001 | 1.273 | 1.225–1.322 | <0.001 |
| 70–74 | 1.269 | 1.224–1.316 | <0.001 | 1.269 | 1.222–1.317 | <0.001 |
| 75–79 | 1.210 | 1.166–1.257 | <0.001 | 1.211 | 1.166–1.259 | <0.001 |
| 80–84 | 1.117 | 1.072–1.164 | <0.001 | 1.116 | 1.070–1.164 | <0.001 |
| Sex (Ref: Female) | | | | | | |
| Male | 1.098 | 1.082–1.114 | <0.001 | 1.054 | 1.038–1.071 | <0.001 |
| Race (Ref: White) | | | | | | |
| Asian | 1.148 | 1.074–1.227 | <0.001 | 0.972 | 0.905–1.043 | 0.423 |
| Black | 0.921 | 0.899–0.943 | <0.001 | 0.875 | 0.853–0.898 | <0.001 |
| Other | 1.143 | 1.110–1.177 | <0.001 | 1.032 | 1.000–1.065 | 0.051 |
| Unable to determine | 1.058 | 1.009–1.110 | 0.020 | 1.039 | 0.988–1.093 | 0.139 |
| Log (Length of Stay) | 1.177 | 1.168–1.187 | <0.001 | 1.237 | 1.225–1.250 | <0.001 |
| Primary Payor (Ref: Medicare traditional) | | | | | | |
| Charity/indigent | 1.853 | 1.619–2.120 | <0.001 | 1.501 | 1.305–1.727 | <0.001 |
| Commercial-indemnity | 1.060 | 1.024–1.097 | 0.001 | 0.846 | 0.813–0.880 | <0.001 |
| Direct employer contract | 2.057 | 1.849–2.288 | <0.001 | 1.497 | 1.338–1.676 | <0.001 |
| Managed care capitated | 0.745 | 0.621–0.894 | 0.002 | 0.549 | 0.455–0.662 | <0.001 |
| Managed care non-capitated | 1.136 | 1.109–1.164 | <0.001 | 0.923 | 0.896–0.951 | <0.001 |
| Medicaid-managed care capitated | 1.205 | 1.146–1.268 | <0.001 | 0.951 | 0.898–1.007 | 0.086 |
| Medicaid-managed care non-capitated | 1.342 | 1.310–1.375 | <0.001 | 0.979 | 0.949–1.010 | 0.187 |
| Medicaid traditional | 1.403 | 1.359–1.448 | <0.001 | 1.011 | 0.972–1.050 | 0.593 |
| Medicare-managed care capitated | 0.957 | 0.917–0.999 | 0.047 | 1.025 | 0.979–1.073 | 0.286 |
| Medicare-managed care non-capitated | 1.110 | 1.088–1.133 | <0.001 | 1.124 | 1.100–1.148 | <0.001 |
| Other | 1.061 | 0.988–1.140 | 0.106 | 0.920 | 0.853–0.993 | 0.032 |
| Other government payors | 0.992 | 0.938–1.048 | 0.764 | 0.846 | 0.798–0.896 | <0.001 |
| Self-Pay | 1.489 | 1.419–1.562 | <0.001 | 1.149 | 1.089–1.212 | <0.001 |
| Worker’s compensation | 0.775 | 0.645–0.933 | 0.007 | 0.779 | 0.646–0.940 | 0.009 |
| Point of Origin (Ref: Non-healthcare facility) | | | | | | |
| Clinic | 0.801 | 0.778–0.825 | <0.001 | 0.914 | 0.886–0.943 | <0.001 |
| Court/Law enforcement | 1.482 | 1.237–1.777 | <0.001 | 0.964 | 0.786–1.183 | 0.728 |
| Information not available | 0.974 | 0.913–1.040 | 0.430 | 1.375 | 1.281–1.476 | <0.001 |
| Other | 1.009 | 0.786–1.295 | 0.944 | 0.906 | 0.702–1.169 | 0.448 |
| Transfer from ambulatory surgery center | 0.878 | 0.699–1.102 | 0.261 | 1.071 | 0.849–1.352 | 0.561 |
| Transfer from dept unit in same hospital, separate claim | 1.481 | 1.357–1.616 | <0.001 | 1.418 | 1.293–1.555 | <0.001 |
| Transfer from health facility | 1.083 | 1.016–1.154 | 0.014 | 0.952 | 0.891–1.018 | 0.153 |
| Transfer from hospice and under hospice program | 1.150 | 0.556–2.375 | 0.707 | 1.213 | 0.576–2.555 | 0.612 |
| Transfer from hospital (different facility) | 1.003 | 0.974–1.033 | 0.828 | 0.886 | 0.859–0.915 | <0.001 |
| Transfer from SNF¹ or ICF² | 0.694 | 0.659–0.732 | <0.001 | 0.754 | 0.713–0.797 | <0.001 |
| Discharge Status (Ref: Discharged to Home or Self Care) | | | | | | |
| Acute inpatient readmission | 2.125 | 1.872–2.413 | <0.001 | 1.864 | 1.634–2.126 | <0.001 |
| Discharged to home health organization | 0.996 | 0.975–1.017 | 0.683 | 0.996 | 0.973–1.019 | 0.732 |
| Discharged to hospice-home | 0.941 | 0.880–1.007 | 0.077 | 0.893 | 0.833–0.957 | 0.001 |
| Discharged to hospice-medical facility | 0.874 | 0.816–0.937 | <0.001 | 0.880 | 0.819–0.945 | <0.001 |
| Discharged/Transferred to another rehab facility | 1.020 | 0.977–1.064 | 0.369 | 0.967 | 0.924–1.011 | 0.410 |
| Discharged/Transferred to cancer ctr/children’s hospital | 1.919 | 1.346–2.737 | <0.001 | 1.476 | 1.021–2.136 | 0.039 |
| Discharged/Transferred to court/law enforcement | 1.466 | 1.266–1.698 | <0.001 | 1.245 | 1.059–1.462 | 0.008 |
| Discharged/Transferred to critical access hospital | 1.335 | 0.822–2.168 | 0.244 | 1.331 | 0.811–2.184 | 0.257 |
| Discharged/Transferred to federal hospital | 0.867 | 0.545–1.379 | 0.547 | 1.020 | 0.637–1.635 | 0.933 |
| Discharged/Transferred to ICF² | 1.275 | 1.181–1.377 | <0.001 | 1.127 | 1.040–1.221 | 0.004 |
| Discharged/Transferred to long term care hospital | 1.013 | 0.924–1.110 | 0.781 | 0.858 | 0.781–0.944 | 0.002 |
| Discharged/Transferred to nursing facility | 1.596 | 1.345–1.894 | <0.001 | 1.749 | 1.469–2.083 | <0.001 |
| Discharged/Transferred to other facility | 1.136 | 1.073–1.203 | <0.001 | 1.138 | 1.074–1.207 | <0.001 |
| Discharged/Transferred to other health institute not in list | 1.661 | 1.472–1.874 | <0.001 | 1.632 | 1.442–1.848 | <0.001 |
| Discharged/Transferred to psychiatric hospital | 7.779 | 7.514–8.053 | <0.001 | 6.630 | 6.387–6.883 | <0.001 |
| Discharged/Transferred to SNF¹ | 1.012 | 0.990–1.034 | 0.283 | 0.990 | 0.965–1.016 | 0.462 |
| Discharged/Transferred to swing bed | 1.048 | 0.907–1.210 | 0.525 | 0.942 | 0.812–1.092 | 0.429 |
| Expired | 0.876 | 0.834–0.921 | <0.001 | 0.807 | 0.766–0.850 | <0.001 |
| Information not available | 1.439 | 1.293–1.602 | <0.001 | 2.121 | 1.885–2.386 | <0.001 |
| Left against medical advice | 0.970 | 0.919–1.023 | 0.263 | 1.025 | 0.970–1.083 | 0.380 |
| Still a patient-expected to return | 1.695 | 0.877–3.275 | 0.117 | 1.484 | 0.754–2.920 | 0.253 |
| Count of Procedures | 1.012 | 1.010–1.015 | <0.001 | 1.007 | 1.004–1.009 | <0.001 |
| CMS³ Fiscal Year (Ref: 2022) | | | | | | |
| 2023 | 1.033 | 1.016–1.050 | <0.001 | 1.042 | 1.025–1.060 | <0.001 |
| Social Vulnerability Index | | | | | | |
| Household characteristics | 0.864 | 0.840–0.889 | <0.001 | 0.776 | 0.704–0.856 | <0.001 |
| Housing type and transportation | 0.941 | 0.914–0.969 | <0.001 | 0.821 | 0.729–0.925 | 0.001 |
| Overall | 0.732 | 0.712–0.753 | <0.001 | 2.350 | 1.672–3.305 | <0.001 |
| Racial and ethnic minority status | 0.943 | 0.915–0.971 | <0.001 | 1.111 | 1.034–1.192 | 0.004 |
| Socioeconomic status | 0.624 | 0.607–0.642 | <0.001 | 0.529 | 0.442–0.634 | <0.001 |
| COVID-19 Status (Ref: Not identified) | | | | | | |
| Positive | 0.986 | 0.959–1.014 | 0.333 | 0.929 | 0.902–0.957 | <0.001 |
| MS-DRG⁴ Type (Ref: Medical) | | | | | | |
| Surgical | 0.843 | 0.829–0.858 | <0.001 | 0.855 | 0.839–0.871 | <0.001 |
| Unknown | 1.296 | 0.515–3.266 | 0.582 | 1.735 | 0.679–4.433 | 0.250 |
| Teaching Status (Ref: No) | | | | | | |
| Not Available | 1.848 | 1.762–1.940 | <0.001 | 1.713 | 1.622–1.810 | <0.001 |
| Yes | 1.064 | 1.047–1.082 | <0.001 | 1.177 | 1.143–1.212 | <0.001 |
| Academic Status (Ref: No) | | | | | | |
| Yes | 1.042 | 1.022–1.062 | <0.001 | 0.790 | 0.763–0.819 | <0.001 |
| Rural/Urban Status (Ref: Urban) | | | | | | |
| Rural | 1.081 | 1.058–1.105 | <0.001 | 1.409 | 1.371–1.447 | <0.001 |
| Ownership (Ref: Voluntary non-profit private) | | | | | | |
| Government—federal | 0.491 | 0.378–0.639 | <0.001 | 0.232 | 0.177–0.304 | <0.001 |
| Government—hospital district/authority | 1.345 | 1.309–1.382 | <0.001 | 1.457 | 1.411–1.505 | <0.001 |
| Government—local | 0.992 | 0.953–1.033 | 0.707 | 0.987 | 0.942–1.034 | 0.577 |
| Government—state | 0.782 | 0.720–0.849 | <0.001 | 1.076 | 0.981–1.180 | 0.122 |
| Not available | 0.782 | 0.696–0.880 | <0.001 | 0.835 | 0.739–0.943 | 0.004 |
| Physician | 0.562 | 0.425–0.742 | <0.001 | 0.423 | 0.318–0.563 | <0.001 |
| Proprietary | 0.698 | 0.669–0.728 | <0.001 | 0.793 | 0.757–0.831 | <0.001 |
| Voluntary non-profit—church | 0.751 | 0.734–0.768 | <0.001 | 0.846 | 0.825–0.868 | <0.001 |
| Voluntary non-profit—other | 0.918 | 0.890–0.948 | <0.001 | 0.894 | 0.863–0.926 | <0.001 |
| Bed Count (Ref: >400) | | | | | | |
| 1–50 | 0.973 | 0.931–1.017 | 0.230 | 0.578 | 0.547–0.610 | <0.001 |
| 51–100 | 0.925 | 0.894–0.957 | <0.001 | 0.884 | 0.850–0.920 | <0.001 |
| 101–150 | 0.605 | 0.585–0.626 | <0.001 | 0.571 | 0.550–0.593 | <0.001 |
| 151–200 | 0.967 | 0.939–0.995 | 0.021 | 0.753 | 0.728–0.779 | <0.001 |
| 201–250 | 0.873 | 0.850–0.896 | <0.001 | 0.885 | 0.858–0.912 | <0.001 |
| 251–300 | 0.735 | 0.715–0.755 | <0.001 | 0.692 | 0.670–0.714 | <0.001 |
| 301–350 | 0.843 | 0.822–0.865 | <0.001 | 0.937 | 0.909–0.965 | <0.001 |
| 351–400 | 0.867 | 0.841–0.893 | <0.001 | 0.875 | 0.847–0.905 | <0.001 |
| Hospital Case Mix Index | 0.966 | 0.942–0.990 | 0.006 | 0.873 | 0.843–0.904 | <0.001 |
| State Abbreviation (Ref: NY) | | | | | | |
| AK | 2.142 | 1.520–3.017 | <0.001 | 3.013 | 2.124–4.274 | <0.001 |
| AL | 0.742 | 0.647–0.851 | <0.001 | 0.829 | 0.719–0.954 | 0.009 |
| AR | 0.997 | 0.899–1.105 | 0.954 | 1.091 | 0.975–1.220 | 0.130 |
| AZ | 1.119 | 1.049–1.194 | 0.001 | 1.121 | 1.044–1.204 | 0.002 |
| CA | 2.437 | 2.322–2.557 | <0.001 | 2.778 | 2.634–2.931 | <0.001 |
| CO | 1.492 | 1.355–1.642 | <0.001 | 1.691 | 1.529–1.869 | <0.001 |
| CT | 2.142 | 1.990–2.306 | <0.001 | 2.227 | 2.059–2.409 | <0.001 |
| DE | 1.806 | 1.637–1.991 | <0.001 | 1.503 | 1.356–1.666 | <0.001 |
| FL | 1.802 | 1.723–1.885 | <0.001 | 2.000 | 1.900–2.104 | <0.001 |
| GA | 1.592 | 1.468–1.727 | <0.001 | 1.739 | 1.594–1.898 | <0.001 |
| HI | 3.092 | 2.800–3.416 | <0.001 | 2.786 | 2.493–3.112 | <0.001 |
| IA | 6.113 | 5.803–6.440 | <0.001 | 7.055 | 6.647–7.488 | <0.001 |
| ID | 7.100 | 6.625–7.610 | <0.001 | 10.95 | 10.09–11.88 | <0.001 |
| IL | 1.766 | 1.677–1.859 | <0.001 | 2.278 | 2.153–2.409 | <0.001 |
| IN | 1.133 | 1.050–1.223 | 0.001 | 1.420 | 1.310–1.540 | <0.001 |
| KS | 1.042 | 0.932–1.166 | 0.471 | 1.167 | 1.038–1.312 | 0.010 |
| KY | 1.477 | 1.378–1.583 | <0.001 | 1.717 | 1.594–1.850 | <0.001 |
| LA | 2.890 | 2.661–3.139 | <0.001 | 2.943 | 2.696–3.212 | <0.001 |
| MA | 1.699 | 1.557–1.854 | <0.001 | 1.801 | 1.642–1.975 | <0.001 |
| MD | 1.351 | 1.244–1.467 | <0.001 | 1.540 | 1.413–1.679 | <0.001 |
| ME | 1.579 | 0.639–3.903 | 0.323 | 2.124 | 0.856–5.268 | 0.104 |
| MI | 1.977 | 1.876–2.083 | <0.001 | 2.502 | 2.360–2.653 | <0.001 |
| MN | 9.702 | 9.158–10.277 | <0.001 | 11.25 | 10.57–11.98 | <0.001 |
| MO | 3.475 | 3.268–3.696 | <0.001 | 3.435 | 3.212–3.673 | <0.001 |
| MS | 0.924 | 0.831–1.028 | 0.146 | 0.962 | 0.862–1.075 | 0.494 |
| MT | 2.855 | 2.611–3.121 | <0.001 | 3.751 | 3.400–4.139 | <0.001 |
| NC | 2.421 | 2.310–2.538 | <0.001 | 2.409 | 2.285–2.540 | <0.001 |
| ND | 1.862 | 1.586–2.187 | <0.001 | 2.505 | 2.118–2.962 | <0.001 |
| NE | 3.548 | 3.262–3.858 | <0.001 | 5.412 | 4.944–5.923 | <0.001 |
| NH | 0.924 | 0.764–1.117 | 0.415 | 1.292 | 1.064–1.567 | 0.010 |
| NJ | 1.544 | 1.436–1.661 | <0.001 | 1.749 | 1.622–1.887 | <0.001 |
| NM | 4.086 | 3.763–4.436 | <0.001 | 4.193 | 3.839–4.580 | <0.001 |
| NV | 1.645 | 1.482–1.826 | <0.001 | 2.445 | 2.184–2.737 | <0.001 |
| OH | 1.444 | 1.371–1.522 | <0.001 | 1.733 | 1.636–1.835 | <0.001 |
| OK | 2.270 | 2.135–2.412 | <0.001 | 2.311 | 2.159–2.474 | <0.001 |
| OR | 3.359 | 3.168–3.562 | <0.001 | 4.143 | 3.884–4.419 | <0.001 |
| PA | 3.021 | 2.885–3.165 | <0.001 | 3.723 | 3.536–3.919 | <0.001 |
| RI | 1.033 | 0.849–1.258 | 0.743 | 1.381 | 1.125–1.695 | 0.002 |
| SC | 1.896 | 1.783–2.015 | <0.001 | 2.065 | 1.933–2.206 | <0.001 |
| SD | 4.671 | 4.264–5.116 | <0.001 | 5.327 | 4.829–5.876 | <0.001 |
| TN | 1.037 | 0.970–1.108 | 0.286 | 1.280 | 1.191–1.376 | <0.001 |
| TX | 1.898 | 1.807–1.993 | <0.001 | 2.120 | 2.006–2.241 | <0.001 |
| UT | 1.808 | 0.881–3.710 | 0.107 | 1.775 | 0.841–3.748 | 0.132 |
| VA | 1.344 | 1.266–1.427 | <0.001 | 1.383 | 1.295–1.477 | <0.001 |
| VT | 0.644 | 0.521–0.797 | <0.001 | 0.639 | 0.512–0.797 | <0.001 |
| WA | 2.034 | 1.912–2.164 | <0.001 | 2.837 | 2.653–3.035 | <0.001 |
| WI | 3.424 | 3.253–3.603 | <0.001 | 4.216 | 3.985–4.461 | <0.001 |
| WV | 1.624 | 1.527–1.727 | <0.001 | 2.011 | 1.878–2.155 | <0.001 |
| WY | 2.462 | 2.115–2.865 | <0.001 | 3.007 | 2.565–3.524 | <0.001 |
1 SNF: Skilled Nursing Facility; 2 ICF: Intermediate Care Facility; 3 CMS: U.S. Centers for Medicare and Medicaid Services; 4 MS-DRG: Medicare Severity Diagnosis Related Group.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Glass, A.; Melton, N.C.; Moore, C.; Myrick, K.; Thao, K.; Mogaji, S.; Howell, A.; Patton, K.; Martin, J.; Korvink, M.; et al. A Novel Method for Assessing Risk-Adjusted Diagnostic Coding Specificity for Depression Using a U.S. Cohort of over One Million Patients. Diagnostics 2024, 14, 426. https://doi.org/10.3390/diagnostics14040426
