1. Introduction
The total number of living kidney transplant recipients with a functioning graft is expected to pass 250,000 in the next 1–2 years [
1]. In 2019, there were 23,401 renal transplant procedures performed, which represents a 10.6% increase from 2018 [
2]. Kidney transplantation is the best treatment option for end-stage renal disease. However, the growing number of transplant allograft rejection continues to be a major problem in maintaining the value of the kidney transplant benefits. Incidence of acute rejection (AR) by one-year posttransplant among adult kidney transplant recipients is about 7.8% [
1]. Scientific Registry of Transplant Recipients (SRTR) data showed that 97% of kidney transplants are working at the end of a month, 93% are working at the end of a year, and 83% are working at the end of 3 years [
1]. Despite life-long immunosuppressive maintenance regimens designed to optimize the outcome, approximately 20–30% of patients experience overall renal graft failure within the first 5 years, and only 55% of transplanted kidneys last to 10 years [
3].
Before molecular transplant rejection tests, such as tests to measure donor-derived cell-free DNA (dd-cfDNA) or mRNA expression profiling [
4,
5,
6,
7], became available, renal allograft injuries were commonly monitored by measuring serum creatinine (SCr) levels. Although elevated serum creatinine correlates with low estimated glomerular filtration rate (eGFR) and functional decline, SCr is an insensitive biomarker for diagnosing allograft rejection. In recent clinical validation studies of blood-based transplant rejection tests measuring dd-cfDNA, SCr has been consistently shown to be inferior to the molecular tests in identifying active rejection [
5,
6].
Renal biopsy is the standard in diagnosing allograft rejection. However, renal biopsy is not a convenient surveillance method for detecting rejections and is commonly used to provide a definitive confirmation for rejections. In addition to high costs, there are procedure-related risks and discomfort associated with invasive biopsies, and the histological analysis of biopsies is subjective to intra- and interobserver variations [
8]. About 20% of the U.S. kidney transplant centers use a protocol-based biopsy approach to detect subclinical renal pathology. Protocol biopsy may enable early graft abnormalities detection at the stage when effective treatment can change rejection trajectory and improve long-term outcomes. However, the procedure carries a finite risk of complications and about 25% biopsies yield an inadequate specimen to determine histological rejections [
9]. The disadvantages of histological evaluation for routine monitoring also include the inability to predict rejection or determine treatment response. Therefore, there is a critical clinical need for a sensitive, quantitative, non-invasive diagnostic test that can detect changes in graft status to guide clinically indicated biopsy decisions and monitor rejection risk longitudinally to improve allografts lifespan and long-term outcomes.
The QSant six biomarkers had previously been studied in the Kidney Injury Test, and the corresponding Kidney Injury Score (KIT Score) was shown to be able to assess the stage of chronic kidney disease (CKD) much earlier than proteinuria and renal function measurements alone [
10]. In a foundational clinical study encompassing running 601 urine samples at the University of California San Francisco (from patients without kidney disease, with chronic kidney disease and both stable and dysfunctional kidney transplants), the same six biomarkers were adapted to generate the transplant quiescence assay called QSant, formerly known as QiSant [
11], and the combination of the six biomarkers and clinical variables generated a scaled Q-Score that allowed for accurate detection of acute kidney transplant rejection and differentiated acute rejection injury from stable allografts in both children and adults [
11].
In this study, we propose a further re-validation of the foundational clinical study [
11] that developed, validated, and cross-validated the performance of the Q-Score for non-invasive diagnosis of acute rejection by the QSant assay. The primary objective of this validation study is to re-assess the performance of the QSant assay on allograft kidney rejection status, after assay transfer, assay lock, and algorithm lock in the Clinical Laboratory Improvement Amendments (CLIA) lab of NephroSant. The Q-Score scaled from 0–100 was similarly generated as discussed in the foundational clinical paper by Yang et al. [
11], with the same fixed Q-Score cut-off of greater than or equal to 32 for diagnosis of acute rejection, and then applied to two separate datasets, generated in the CLIA Lab, to re-evaluate the sensitivity, specificity, negative predictive value (NPV), positive predictive value (PPV), and accuracy of assessment of allograft rejection status, as assessed by a renal allograft biopsy, paired with the urine sample processed for QSant.
The results of this study, combined with our initial foundational study by Yang et al. [
11], demonstrate that the accuracy and the non-invasive nature of a QSant urine test makes it most relevant for the routine and repeated monitoring of transplant health for the lifetime of a renal allograft. The Q-Score provides detection and quantitative evaluation of rejection injury, which can be tracked over time to influence choices/dose of immunosuppression. An accurate measure of changes in kidney transplant health over time can inform clinical decision-making and support choices of treatment regimens. Increasing use and applications of the Q-Score in prospective clinical trials may further support the Q-Score as a surrogate endpoint for acute rejection therapy effectiveness and kidney transplant rejection outcome measurements without the need of repeated invasive transplant biopsies.
2. Experimental Section
2.1. Study Population and Sample Collection
The urine samples used in the clinical validation study were prospectively collected between 2010 and 2018 from adult (18 to 76 years of age) and pediatric (3 to 18 years of age) kidney transplant recipients who had transplant surgeries at the Stanford University Medical Center, the University of California San Francisco, or the Instituto Nacional de Ciencias Medicas y Nutricion in Mexico.
The studies were approved by the institutional review boards (IRBs) of all three institutions. Stanford University and Institute Nacional de Ciencias Medicas y Nutricion samples were obtained through a materials transfer agreement. The relevant University of California San Francisco IRB for this research was IRB #14-13573, with the most recent reapproval date of 29 June 2020. Informed consent for participation in the research was obtained from all patients or their legal guardians. The study protocol conforms to the ethical guidelines of the 1975 Declaration of Helsinki. The clinical and research activities being reported are consistent with the Principles of the Declaration of Istanbul, as outlined in the Declaration of Istanbul on Organ Trafficking and Transplant Tourism.
Urine samples were collected before the performance of an indicated or protocol biopsy in sterile containers with a pre-defined collection protocol. On arrival to the lab, urine samples were centrifuged at 2000× g at 4 °C for 30 min. The supernatant was aliquoted, and pH was adjusted to 7.0 by adding 1 part 1 M tris-HCl to 10 parts urine supernatant. The urine was stored at −80 °C until it was ready to use.
2.2. Kidney Biopsy
All samples were biopsy-matched and had urine collected at the time of clinical dysfunction, for-cause biopsy, or protocol biopsy. All kidney biopsies were analyzed by a pathologist who was blinded to the clinical course and graded using the 2017 Banff classification [
1]. Intragraft C4d stains were performed to assess for antibody-mediated rejection (ABMR). AR was defined, at minimum, by the following criteria: (1) T-cell-mediated rejection (TCMR) consisting of either a tubulitis (t) score > 2 accompanied by an interstitial inflammation (i) score > 2 or vascular changes (v) score > 0; (2) C4d-positive ABMR consisting of positive donor-specific antibodies (DSAs) (MFI > 1500) with a glomerulitis (g) score > 0 or peritubular capillaritis score (ptc) > 0 or v > 0 with unexplained acute tubular necrosis/thrombotic microangiopathy (ATN/TMA) with C4d = 2; or (3) C4d-negative ABMR consisting of positive DSA with unexplained ATN/TMA with g + ptc ≥ 2 and C4d = 0 or 1. Subclinical AR histology on a surveillance biopsy is similar to acute rejection. Stable transplant allografts (STAs) were defined by an absence of substantial injury on the matched biopsy pathology and definitions of the inflammation or i score and the tubulitis or t score.
2.3. QSant Assay
The QSant assay measures six urinary biomarkers including cfDNA, methylated cfDNA (m-cfDNA), CXCL10, clusterin, creatinine, and total protein with measurements performed as previously described [
10,
11]. Briefly, the cfDNA was measured using a proprietary biotinylated probe complementary to the Arthrobacter luteus (ALU) human element [
11] and streptavidin–horseradish peroxidase (HRP) (R&D Systems). Custom generated ELISAs for m-cfDNA, CXCL10, and clusterin concentration were used for these biomarkers. Both DNA assays used SuperSignal ELISA Femto Substrate (Thermo Fisher Scientific) for luminescent detection. FDA-approved tests on the Beckman Coulter AU400 analyzer were used for the measurements of creatinine and total protein. All biomarker assay and measurement interpolation calculations were performed blinded to the clinical information.
2.4. Statistical Analysis and Algorithm Development
The primary objective of this analysis was to assess the performance of these six urine-based biomarkers to inform on allograft kidney rejection status. The study involved 223 kidney transplant urine samples collected from 215 pediatric and adult recipients who had undergone transplant surgery at three different centers. Out of the 223 samples, acute kidney allograft rejection was observed in 71 samples, while 152 samples displayed stable allografts. Additionally, urine samples were collected from these patients from 1 to 1539 days post-transplant.
Random sampling was used to split the 223 samples into a training (
n = 157 with 45 AR cases and 112 STA controls) set and a validation set (
n = 66 with 26 AR cases and 40 STA controls). Sample phenotypes were based on the pathology of the paired kidney transplant biopsy, utilizing the Banff pathology classification [
12]. Non-parametric Wilcoxon Mann–Whitney test and chi-squared test were used to test the significant difference between training and test cohorts for demographic variables. Random forest model based on the measurements of the six biomarkers and days post-transplant was built to predict kidney injury status. Additionally, the Q-Score obtained using the random forest model was used for predicting allograft kidney rejection. To evaluate the performance of the model, we used area under the curve (AUC), sensitivity, specificity to discriminate the acute kidney rejection group from the non-rejection group. Methods such as fivefold cross validation and bootstrapping were used to assess the performance of the random forest model. All the above analysis was performed with the use of R 1.2.5 or Python 3.7.0. Visualizations were performed in GraphPad Prism 8.4.3 (GraphPad, Carlsbad, CA, USA).
4. Discussion
Renal transplant recipients generally require life-long immunosuppression to prevent graft rejection. However, despite improved immunosuppressive maintenance regimens designed to optimize the outcome, approximately 20–30% of patients experience overall renal graft failure within the first 5 years, and only 55% of transplanted kidneys last to 10 years [
3]. Our goal w to create a safe and high-performance clinical test that can be used at sufficient frequency to detect early stage graft dysfunctions where intervention can alter the progression course and extend allograft lifespan.
In previous clinical studies [
10,
11], the six selected urinary biomarkers were able to effectively measure the burden of chronic kidney injury and detect acute renal allograft rejection with much higher sensitivity and specificity than eGFR, protein/creatinine, or SCr. The focus of this paper was to refine on the basis of optimized assays and validate the clinical performance of Q-Score for the accurate detection of acute allograft rejection as defined by Banff classification criteria using biopsy-matched clinical samples and optimized CLIA-validated assays.
Our validation study results confirmed our previous findings that the QSant test enables objective and early clinical assessment of renal allograft [
11]. The biomarker panel is robust and has superior performance when compared with existing tests for detecting risk for AR in all ages of renal transplant patients. Q-Score quiescent threshold also provides additional clinically relevant information.
In a large serial protocol biopsy study, renal transplant protocol biopsies were shown to have a 1% rate of causing major complications, as well as having other less severe complications, such as 3.5% risk of gross hematuria and 2.5% risk of perirenal hematomas [
13]. QSant can effectively reduce the need to implement a screening protocol biopsy to monitor subclinical allograft rejections. This study, similar to the foundation study by Yang et al. [
11], shows that the majority of protocol biopsies done to screen for acute rejection could be avoided; 65% were unnecessary in the Yang et al. [
11] study and 73% of protocol biopsies could have been avoided in the current study cohort, as all these patients had a Q-Score below the 32 (rejection) threshold. Our study data suggested that the inclusion of a high-performance biomarker test can effectively reduce the use of for-cause and surveillance biopsies by 69%.
AR covers a wide range of complex syndromes and underlying etiologies. However, the Q-Score quiescent threshold was able to effectively detect both TCMR and ABMR in the study cohort, showing that the two pathologies were not distinctly different from one another. At this time, the QSant assay cannot discriminate between ABMR and TCMR. While ABMR manifests with microcirculation damage and DSA, TCMR manifests with interstitial–epithelial changes [
2]. These differences are clinically meaningful because the treatment for each type of rejection is different For TCMR, pulse high-dose intravenous glucocorticoids and rabbit anti-thymocyte globulin (rATG/Thymoglobulin) are treatments of choice [
3]. For ABMR, a combination of glucocorticoids, plasmapheresis, and intravenous immune globulin is used [
4]. We imagine that the QSant assay would be used to indicate if an episode of acute rejection is occurring, at which time a biopsy could be ordered to determine the exact type of acute rejection.
In addition, QSant can be safely performed 1 day post transplantation and is applicable to detect early risk of acute rejection. The inclusion of Q-Score information obtained from QSant assays in clinical management can allow early and proactive use of immunosuppressive therapy while titrating the drug levels to suit each patient’s need.
Ultimately the integration of objective molecular testing into the existing histopathology criteria would create a true gold standard for allograft rejection diagnosis that is robust and has strong histological rationale. QSant can add real value to the healthcare system by reducing unnecessary biopsies with accurate diagnosis of acute rejection, irrespective of a change in the serum creatinine. The high sensitivity of the QSant urine test can avoid the risk of an invasive biopsy when a rise in serum creatinine is due to a cause other than acute rejection, and the high specificity of the urine QSant test can avoid an unnecessary protocol biopsy by identifying patients who have normal serum creatinine and no rejection.
The main limitation of the study was that even though we used one pathologist to conduct centralized pathohistological lesion grading to minimize observer variations, this gold standard’s innate sampling limitations is known to have a false negative rate up to 30% in detecting rejections. The cohort had a single urine collection from the subclinical disease patients. Had we had subsequent biopsy confirmation data of STA patients with initial high Q-Score in this study, we could potentially reclassify these STA patients as scAR instead of false-positive AR. Another limitation of this cross-sectional study was not having longitudinal samples to demonstrate the progression or improvement of Q-Scores to reflect the impact of test-guided treatment on immune rejection risk and outcomes.
5. Conclusions
In the validation cohort, the levels of six urinary biomarkers—cfDNA, m-cfDNA, CXCL-10, clusterin, total protein, and creatinine—along with days post-transplant were used to generate a composite Q-Score. The study demonstrated multiple benefits in aiding the management of renal transplant patients: (i) the overall benefit in detecting clinical allograft rejection without performing unnecessary biopsies, (ii) the net reduction in unnecessary biopsies without missing any of the clinical rejections, (iii) detection of TCMR and ABMR with equally high accuracy, (iv) detection of subclinical rejection with high accuracy, and (v) applicability to all ages (3 to 76 years) of transplantation patients.
The QSant test is intended to supplement evaluation and management of kidney injury and acute rejection in patients who have undergone renal transplantation, regardless of recipient age or whether the patient is the recipient of a repeat or multi-organ transplant. It can be used by physicians considering the diagnosis of acute rejection, helping to rule in or out this condition. In conclusion, the QSant assay is a multimodal risk assessment approach that can improve the diagnosis and management of renal transplant patients and reduce unnecessary biopsy.