1. Introduction
Over the past decades, there has been increased interest in assessing pharmacokinetic drug–drug interaction (DDI) risks of molecules in the early phases of drug development. This is particularly important in therapeutic areas where patients are on polypharmacy, as it is imperative that potential drug candidates have minimal DDI risks which could lead to decreased efficacy or increased safety events or off-target effects of the developmental drug and/or comedications.
In drug discovery, it is recommended to promptly assess the probability of pharmacokinetic DDIs to limit the need for clinical DDI studies and to obtain a drug that patients can take without significant DDI risks. The best drug candidates typically have high pharmacological activity and low potential for DDIs. Furthermore, drug interaction risks can halt a promising compound from further clinical development. The ultimate goal in this field of research is to limit the prediction of false positive and false negative interactions, while also minimizing the resources needed to make robust DDI assessments.
Pharmacokinetic DDIs are generally caused by the effect of a drug on the activity of one or more subtypes of the Cytochrome P450 (CYP) family. This can be through either inhibition or induction of CYPs, and is most often related to the activity of CYP3A4. Besides CYPs, DDIs can be propagated through other mechanisms including attenuation of UDP-glucuronosyltransferases and drug transporters interactions [
1].
In 2006, the US Food and Drug Administration (FDA) provided guidance on studying DDIs in in vitro models; the formulas for DDI predictions were similar to basic models [
2]. In the following 15 years, many studies were published comparing calculated theoretical data of groups of compounds with established clinical data using various calculation methods or available software [
1,
3,
4]. These methods to improve the in vitro to in vivo extrapolation (IVIVE) were mostly based on commercially available approved therapeutics; the FDA subsequently updated the DDI guidance (in vitro and clinical) in 2012 as the draft guideline and in 2020 as final guidelines based on these research findings [
5,
6]. Efforts have been made to evaluate these models of potential pharmacokinetic DDIs mediated by CYPs and/or drug transporters to improve the accuracy of IVIVEs. Shardlow et al. (2011) [
3] retrospectively studied 26 small molecule drugs and drug candidates from GlaxoSmithKline with diverse physiochemical properties, including both reversible and potentially irreversible CYP inhibitors. Besides basic static models, mechanistic static models (MSMs) that differentiate between reversible and metabolism-dependent inhibition were also used, and the contribution of intestinal metabolism for CYP3A4 substrates was studied as well. The results suggested that the estimated unbound liver concentration or unbound hepatic inlet concentration, with consideration of the intestinal contribution, offered the most accurate predictions of DDIs (occurrence and magnitude) for the drugs in this dataset.
Subsequently, Peters et al. (2012) [
4] used 11 AstraZeneca proprietary compounds to evaluate the effectiveness of a mathematically revised static model equation (including the hepatic first pass metabolism, etc.), and to test the applicability of SimCYP for DDIs-associated variabilities. Interestingly, the static calculations using an unbound average steady-state systemic inhibitor concentration (I
sys) and a fixed fraction of gut extraction performed better than SimCYP (84% compared with 58% of the interactions predicted within a 2-fold range). Differences in the prediction outcomes between the static and dynamic models are attributable to differences in the first-pass contribution to DDI. These investigations support the use of static models when elimination routes of the victim compound and the role of gut extraction for the victim and/or inhibitor in humans are unknown.
In a similar approach, Einolf et al. (2014) [
7] tested various DDI models for their performance in predicting DDI potential via CYP3A substrates with data from 28 clinical trials based mostly on approved drugs. All models worked with high fidelity and low false negative or false positive rates. The correlation approaches (ratio of the in vivo peak plasma concentration (C
max) to the in vitro half-maximal effective concentration (EC
50); and relative induction score) and basic static model (calculated R
3-value) resulted in no false negatives when the total C
max was incorporated. Mechanistic static models (net effect) and mechanistic dynamic models (physiologically based pharmacokinetic modeling, PBPK) that included both CYP inactivation and induction resulted in less accurate DDI predictions, likely due to an overprediction of the effect of inactivation.
As continuous efforts are made to minimize false positive and false negative predictions, and, thus, improve overall prediction rates, research groups have been trying to improve the static models through various modifications (e.g., using higher cut-off values, refining the effective concentration of the perpetrator, etc.). For SLC transporters, Rodrigues (2023) [
1] demonstrated that only compounds with high R values (>1.5) or higher maximal concentration/mean half-maximal inhibitory concentration (C
max,u/IC
50) ratios (>0.5) are likely to significantly influence hepatic (area under the curve ratio, AUCR > 1.25) and renal (ΔCL
renal > 25%) endogenous biomarkers in DDIs. Consequently, he suggested raising the cutoff values for DDI assessment to reduce the rate of false positive predictions.
Efforts in improving pharmacokinetic DDI predictions have also been made using dynamic models, where the time-course for the inhibition process as well as active metabolites are characterized and integrated into the model to generate more physiologically relevant scenarios. In a study to identify DDIs with methadone and buprenorphine, Miano et al. (2022) [
8] included the factor (1 − f
e) (f
e is the fraction of the target drug that is excreted unchanged in the urine) in the formula to improve the prediction. However, these modifications did not always significantly improve the predicted outcome when compared against clinical data [
9]. All of these efforts to improve model predictions appear to be limited for screening of DDI potential as described in the guidelines published by the FDA in 2020 [
5]. Hence, a further refinement of the models is necessary to increase the rate of true DDI predictions.
In this work, we evaluated the performance of the FDA static models (both basic model (BSM) and mechanistic static models (MSMs)) in predicting pharmacokinetic DDI risks using Boehringer Ingelheim (BI) proprietary development compounds. In addition, based on our work, we suggested an improvement in the workflow for assessing DDIs in preclinical and clinical development. The results from this study demonstrated that the FDA BSM and MSM performed well in pharmaceutical industry settings to support drug development; this supports the use of basic static models to rule-out DDI risks in the early phases of drug development, as the rate of false negative predictions are negligible. The use of the more resource-intensive mechanistic models should be reserved when additional assessments are necessary. In addition, when available, clinical renal excretion data can be incorporated into MSMs to improve the precision in predicting the compound of interest as a victim of DDIs; excretion data from preclinical species can potentially be used in lieu of clinical data to refine prediction accuracy.
2. Materials and Methods
For the current investigation, a total of 44 clinical studies were used to assess the predictive performance of basic and MSMs outlined in the FDA guidance on in vitro drug interaction studies [
5]. These studies encompass 18 BI proprietary compounds that have clinical DDI data. The dataset includes small molecules with a diverse range of physicochemical properties, spanning various therapeutic areas. BI compounds were assessed as both the object drug and precipitant of reversible inhibition, time-dependent inhibition, and induction of CYP1A2, CYP2C8, CYP2C9, CYP2D6, and/or CYP3A4. For these 18 BI proprietary compounds investigated, in vitro CYP inhibition, inactivation, and induction studies were performed in similar fashion as previously reported (Sabo et al. (2015) [
10] or Sane et al. (2016) [
11]). A brief outline of the methodology is described below.
2.1. CYP Inhibition Studies
The IC
50 values for BI developmental compounds were determined using standard inhibition assays as previously described [
10,
11,
12,
13,
14]. In brief, test articles were incubated at 37 °C with relevant substrate (i.e., midazolam, dextromethorphan, diclofenac, S-mephenytoin), and human liver microsomes (HLMs).
2.2. CYP Induction Studies
CYP3A induction parameters were determined using sandwich-cultured primary human hepatocyte as described [
11,
14]. In brief, hepatocytes were treated with test articles for 48 h. Induction was assessed via two methods: quantification of 6β-hydroxytestosterone and mRNA level. Test articles were compared against the control (25 μM rifampin).
The IC50 values were generated using CYP isoform-specific reactions in HLMs. The following CYP probe substrates were used at concentrations equal to the apparent Michaelis–Menten kinetic constant (Km) of that substrate: phenacetin (CYP1A2), bupropion (CYP2B6), diclofenac (CYP2C9), (S)-mephenytoin (CYP2C19), bufuralol (CYP2D6), and midazolam/testosterone (CYP3A). Each isoform-specific CYP probe substrate was incubated in the presence of various concentrations of the BI test compounds with pooled HLM in potassium phosphate buffer and respective isoform-specific metabolites, and respective isoform-specific metabolites were monitored.
2.3. CYP Inactivation Studies
The methods used were published previously [
10,
11,
13]. In brief: The BI compounds were assessed for their potential to inactivate CYP2B6, CYP2C9, CYP2D6, and CYP3A using recombinant human CYP isoforms (rCYP) or HLMs. The following CYP probe substrates were used: bupropion (CYP2B6), diclofenac (CYP2C9), dextromethorphan (CYP2D6), and midazolam/testosterone (CYP3A). Primary incubations consisted of buffer, rCYP or HLMs, and various concentrations of the 18 BI test compounds. Samples were preincubated at 37 °C for 5 min. Reactions were initiated by adding NADPH (2 mM), with control reactions conducted without NADPH. At specified time points (0–35 min), a 50 µL aliquot was removed and diluted 1:20 into secondary incubation wells, which contained the isoform-selective probe substrate. After the secondary incubation (37 °C, 6 min), samples were filtered using centrifugation and filtrates were analyzed using liquid chromatography–tandem mass spectrometry. Rate constants for loss of CYP activity, normalized to the corresponding controls without the test compound, were used in Equation (1) to obtain inactivation parameters:
where k
obs is the observed rate constant for inactivation, k
inact is the maximal inactivation rate constant, [I] is the concentration of inactivator in the primary incubation, and K
I is the concentration of inactivator at which the rate of inactivation is half maximal.
2.4. P450 Induction and Cytotoxicity
The methods used were published previously [
10,
11,
14]. In brief: To assess induction of CYP1A2, CYP2B6, and CYP3A4 mRNA, three separate lots of cryoplateable human hepatocytes, precharacterized for prototypical induction response, were used. When concentration dependent induction was observed, the data were fitted to determine EC
50 values using Equation (2):
where X is the logarithm of concentration, Top is the maximal response, and Bottom is the baseline value.
2.5. Clinical Data
Clinical DDI data (AUC, Cmax) were obtained from BI clinical trial reports.
Informed consent was obtained from all participants of the clinical trials and, and all clinical studies were conducted in accordance with the Declaration of Helsinki and approved by the respective Institutional Review Board.
Various static and mechanistic models suggested by the FDA guidance [
5] were applied to retrospectively evaluate the DDI potential.
2.6. Basic Static Models
The basic static models for reversible inhibition were used according to the FDA guidelines [
5] to determine if the compound is an inhibitor of metabolizing enzymes.
In short, for reversible inhibition, the ratio (R
1) of intrinsic clearance values for an enzymatic pathway in the absence and in the presence of the interacting drug was calculated as follows:
where I
max,u is the maximal unbound plasma concentration of the interacting drug at steady state and K
i,u is the unbound inhibition constant determined in vitro.
For time-dependent inhibition, the ratio R
2 was calculated as
where
; k
obs is the observed (apparent first order) inactivation rate of the affected enzyme; k
deg is the apparent first-order degradation rate constant of the affected enzyme; k
inact is the maximal inactivation rate constant; K
I,u is the unbound inhibitor concentration causing half-maximal inactivation; and I
max,u is the maximal unbound plasma concentration of the interacting drug at steady state.
A compound is regarded to have a DDI potential if R1 ≥ 1.02 or R2 ≥ 1.25.
For determination, if the compound is an inducer of metabolizing enzymes, the following ratio of intrinsic clearance values of a probe substrate for an enzymatic pathway in the absence and presence of an inducer (R
3) was calculated:
where d is the scaling factor and is assumed to be 1, E
max is the maximum induction effect determined in vitro, I
max,u is the maximal unbound plasma concentration of the interacting drug at steady state, and EC
50 is the concentration causing half-maximal effect determined in vitro. A compound is regarded to have a DDI potential if R
3 ≤ 0.8.
2.7. MSM
Static mechanistic models incorporate more detailed drug disposition and drug interaction mechanisms for both interacting and substrate drugs [
5].
The Equation assumes that the drug has negligible extrahepatic clearance. AUC
i is the AUC in presence of the investigational drug; AUC
r is the AUC in absence of the investigational drug; A is the effect of reversible inhibitions; B is the effect of time-dependent inhibition; C is the effect of induction; F
g is the fraction available after intestinal metabolism; and f
m is the fraction of hepatic clearance of the substrate mediated by the CYP enzyme that is subject to inhibition/induction:
where subscript ‘h’ denotes liver and subscript ‘g’ denotes gut.
Each value can be estimated with the following Equations:
| Gut | Liver |
Reversible inhibition | | |
Time-dependent inhibition | | |
Induction | | |
where
;
;
fu,p is the unbound fraction in plasma;
Cmax is the maximal total (free and bound) inhibitor concentration in the plasma at steady state;
Fa is the fraction absorbed after oral administration; a value of 1 should be used when the data are not available;
Fg is the fraction available after intestinal metabolism; a value of 1 should be used when the data are not available;
ka is the first order absorption rate constant in vivo; a value of 0.1 min−1 can be used when the data are not available;
Qen is the blood flow through enterocytes (e.g., 18 L/h/70 kg);
Qh is the hepatic blood flow (e.g., 97 L/h/70 kg);
RB is the blood-to-plasma concentration ratio;
d is the scaling factor and is assumed to be 1. A different value can be used if supported by prior experience with the system used.
The Drug–drug Interaction Risk Calculator (DDRIC) (PharmaPendium, Elsevier, Amsterdam, The netherlands) [
15], developed based on the FDA model, is a computer system used to assess the MSM. This tool can calculate a predicted DDI between proprietary substances and marketed drugs. The DDRIC is built based on published results from in vitro and in vivo experiments in the drug library in the program. If the proprietary drug is a victim (substrate), the DDI calculation is made against perpetrators and related data: (K
i, IC
50, k
inact, EC
50, E
max, C
max, etc.) from the DDIRC database. If the proprietary compound is a perpetrator (inhibitor and/or inducer), the calculation is made with victims and the corresponding mandatory data: fm(E), Fg. The result is a profile of drug–drug interactions between the proprietary drug and potential co-medication drugs on the market.
The DDIRC Drug Library mainly includes drugs that are on the market, but also includes a small number of drugs that are withdrawn. More than 400 perpetrators and 200 victims, along with their own in vitro and in vivo data, are included in the DDIRC. This Drug Library covers the most popular cytochrome P450s (CYP3A4, 2D6, 2C9, 2C19, 2C8, 2B6, 1A2, and 2E1) and, to a lesser extent, UGTs involved in the metabolism of drugs on the market. The Drug Library covers more than 100 therapeutic classes, including the most commonly prescribed ones. The workflow is depicted in
Figure 1.
Input parameters for the BI compounds, i.e., inhibition (Ki, IC50, KI, Kinact) and induction (Emax, EC50) parameters, were derived from standard metabolic assays (conducted in HLMs and/or hepatocytes) as described above. Pharmacokinetic parameters (e.g., Cmax, AUC0–-t, half-life (t1/2)), determined via noncompartmental analysis, were obtained from in-house clinical studies. BI compounds were assessed as both the object drug and precipitant of reversible inhibition, time-dependent inhibition, and induction of CYP1A2, CYP2C8, CYP2C9, CYP2D6, and/or CYP3A4.
Based on the idea from Miano et al. [
8], a comparison was made between the predicted DDI potential when this was calculated via the basic MSM method as in the DDIRC and when this MSM was enhanced by incorporating human excretion data or excretion data of preclinical species.
In the latter case, the MSM formula,
is changed such that f
m (the fraction of hepatic clearance of the substrate mediated by the CYP enzyme that is subject to inhibition/induction) now also incorporates human urine excretion data:
in which f
e is the fraction excreted as parent compound.
Comparison of model predictability was calculated using the following formulas:
Root Mean Square Error (RMSE) [
7]:
Geometric Mean Fold Error (GMFE) [
16]:
3. Results
In this investigation, a total of 44 clinical studies, encompassing 18 BI proprietary compounds, was used to assess the predictive performance of basic and MSMs outlined in the FDA guidance on in vitro DDI studies [
5]. This clinical dataset includes small molecules with diverse range of physicochemical properties and various therapeutic areas (
Table 1).
Table 2 and
Table 3 summarize the input parameters of these BI development compounds as a perpetrator and a victim of a DDI in various models, respectively.
To verify the predicted DDI potential of the compounds, comparisons between the predicted value and the observed data from clinical studies are necessary. Therefore, an overview of the clinical studies, describing the compound, the interacting drug, the dose and dosing regimen for studies where the BI test compound is a perpetrator or a victim, as well as some characteristic of the population in the study, are depicted in
Table 4.
The pharmacokinetic data were obtained in clinical studies in which either single-dose or multiple-dose treatment was given. To quantify the accuracy of our predictions, we compared several parameters between the predicted values and the observed values, including the numbers of true positives, true negatives, false positives, and false negatives, along with specificity, sensitivity, the percentage of predictions within a 2-fold range, RMSE, and GMFE values, as summarized in
Table 5.
Basic vs. mechanistic static model: A comparison of the basic static models, with the calculation of R
1 and or R
2, and the mechanistic static models (MSMs), with the BI compound as perpetrator, is graphically shown in
Figure 2 and
Table 5.
While the false negative rate remained the same at 12% (R
1), the percentage of false positive predictions was significantly lower, decreasing from 31% to 25% when calculated with the MSM (competitive inhibition). For combined competitive and mechanism-based inhibition (n = 20), the specificity increased from 50% to 57%, while the sensitivity remained at 66.7% (
Table 5). Despite the limited sample size (n = 8), the MSM eliminated false negatives for induction (
Table 5,
Figure 3), though there was an increase in false positive predictions, as shown in
Figure 3 and
Table 5. For induction, the sensitivity markedly increased from 50% to 100%, while the specificity decreased from 50% to 30% (
Table 5). Notably, for induction, two compound–enzyme pairs predicted to be positive in DDIs with the basic model were predicted as negative in DDIs with the MSM. Clinical data confirmed that the induction DDI was negative. These results demonstrated that MSMs should be used for the decision to perform clinical studies to avoid unnecessary DDI studies.
Predictive performance for BI compounds as perpetrator: When evaluating BI compounds as the perpetrator in DDIs, the MSM model successfully predicted the exposure ratio (AUC in the presence-to-absence of inhibitor) to within 2-fold of that observed in clinical studies for 80% for all inhibitors, 50% for inducers, and 75% for all perpetrators (
Table 5,
Figure 4). This demonstrated a good performance of MSMs in the prediction of DDIs for the prediction of inhibition.
Predictive performance for BI compounds as victim: When calculating the effects with the BI compound as a victim, two methods were used for the MSM: with or without considering the fraction of the compound excreted in human urine, or the corresponding data in preclinical species, such as rats. A clear difference of the predicted values between values from the two methods was observed, as illustrated in
Figure 5 (with human excretion vs. without) and
Figure 6 (with preclinical animal excretion vs. without). The MSM successfully predicted the exposure ratio (AUC in the presence-to-absence of inhibitor) within 2-fold of that observed in clinical studies for 60% of the compounds (
Table 5). The inclusion of the urinary excretion data from Phase I studies (i.e., inclusion of the factor (1 − f
e) in the formula) resulted in 75% of the tested compounds falling within the 2-fold range (
Table 5); if included in the excreta from preclinical species, 60% of the AUC ratios were within a 2-fold range (
Table 5). However, based on the criteria of RMSE and GMFE to evaluate the predictive performance, MSM + human urine data was better than MSM + preclinical species excretion, while they were all better than MSM alone (
Table 5). Since GMFE < 2 indicates a good predictive performance [
16], except for the MSM only model, the other two models with excreta both had GMFE < 2 (
Table 5).
4. Discussion
The interest in reducing the costs of development of new drugs and reducing the failure rate during the development process have been growing for several decades. One of the factors leading to the early failure of the development of new drugs is the degree of pharmacokinetic DDIs between the developing drug and any comedication that is being taken by the patient.
Many researchers from academia and pharmaceutical industry have been trying to improve pharmacokinetic DDI predictions as early as possible in the development of new drugs. It is commonly believed that IVIVE is the best technique to determine whether a compound has a risk for DDIs. Over the years, several methods and formulas have been used to calculate the risk for DDIs. Extensive investigations on the parameters to include in the formulas for the calculations of DDI predictions have been performed more than 10 years ago by Fahmi et al., who developed a more comprehensive mathematical model that accounts for the simultaneous influences of competitive inhibition, time-dependent inactivation, and induction of CYP3A in both the liver and intestine [
17], and a model that simultaneously combines reversible inhibition, time-dependent inactivation, and induction data with static estimates of relevant in vivo concentrations [
18], which both served as the input for the rules in the FDA guidelines for DDIs published in 2012 (draft guideline) and 2020 (final guidelines). Subsequently, Yoshida et al. [
19] pointed out that for a more accurate and quantitative prediction of DDIs based on in vitro observations, sophisticated model-based predictions, such as PBPK modeling, which incorporate mechanistic considerations of victim and perpetrator drug disposition, should be explored, especially for DDIs involving multiple elimination pathways.
The FDA’s 2012 draft DDI guidance and the subsequent 2020 DDI guidance recommended the use of both basic and mechanistic models (BSMs and MSMs) to predict DDIs; the guidelines were based on studies with approved drugs that were developed decades ago. To our knowledge, our study is the first to systematically evaluate the predictive performance of these FDA-recommended models using proprietary development compounds. This approach provides a much-needed “reality check” on the applicability of these formulas in current drug development settings, highlighting the performance differences between BSMs and MSMs—an analysis that has not been performed before with development-stage compounds. Our findings offer significant insights for industry peers and regulatory bodies, and may inform future refinements to DDI prediction guidance. There are two papers published more than 13 years ago that also used proprietary drugs (Shardlow et al. (2011) [
3] and Peterset al. (2011) [
4]); however, they were published before the FDA published their 2012 draft guidance; thus, they do not entirely follow the FDA-recommended methods.
By evaluating current FDA DDI formulas with proprietary development compounds, we offer the scientific community valuable data on how these models perform with novel and emerging drug classes. This extension beyond traditionally marketed drugs allows scientists to assess the robustness and applicability of FDA-recommended models across a broader spectrum of chemical entities. Our study could thus catalyze further research and collaborative improvements, refining pharmacokinetic DDI predictions for a wide array of new compounds.
To test for improvement of the prediction via IVIVE, a collection of 18 BI proprietary small molecule compounds which had been developed at least until Phase I clinical studies were selected (Phase I given that human PK data have been accrued for these compounds). Pharmacokinetic parameters (e.g., Cmax, AUC0–t, t1/2), were determined via noncompartmental analysis in in-house clinical studies. This clinical dataset included small molecules with a diverse range of physicochemical properties and various therapeutic areas. Basic in vitro data, including the IC50, Ki, EC50, and Emax for specific CYP enzymes, have been collected in earlier studies for these compounds.
The potential for pharmacokinetic DDIs was then calculated according to the FDA guidelines for the BI compounds as a perpetrator or a victim of reversible inhibition, time-dependent inhibition, and induction of CYP1A2, CYP2C8, CYP2C9, CYP2D6, and/or CYP3A4. The guidelines describe basic methods of calculation as well as an MSM. Within our group, we thought that incorporating more extensive data related to the degree of excretion of the compound in humans may influence the prediction accuracy. The DDIRC [
15], which was developed based on the FDA MSM, was used to assess the MSM.
For each set of data, we applied both basic static models (BSMs) and mechanistic static models (MSMs) in drug–drug interaction predictions. The basic static models (BSMs R1, R2, and R3) are easy to apply and understand, with fewer parameters and assumptions needed. They calculate risk ratios (R-values) for potential DDIs using readily available data. Due to their simplicity, BSMs save time and resources when flagging possible interaction risks. The BSMs may overestimate or underestimate the DDI risk due to their oversimplified nature. Basic static models do not account for complex interactions, such as multiple enzyme pathways, or specific tissue distributions.
MSMs provide a deeper mechanistic understanding of the DDI, helping to identify which specific enzymes are involved, thereby offering insights into mitigating strategies (e.g., dose adjustment, alternative pathways). MSMs can handle more complex interactions, including competitive and non-competitive inhibition, and multiple metabolic pathways. This flexibility enhances their predictive power for drugs with complex metabolic pathways. MSMs require extensive data on enzyme kinetics, tissue distribution, and sometimes transporter activity, which may not always be readily available in early drug development stages. MSMs are computationally more intensive and may require more time and resources to implement and interpret accurately. With numerous parameters involved, mechanistic models carry a risk of overfitting to specific data sets, which may reduce the generalizability of the predictions to broader populations or different study conditions.
For the prediction of DDIs, from a regulatory agencies point of view, it is important to minimize the false negative prediction, as it leads to missed potential clinical DDIs; on top of that, a reduction in the false positive rate is beneficial as it leads to unnecessary clinical DDIs [
20]. In our set of data, despite a small sample size, we have shown that for induction (n = 8), the MSM significantly reduces (eliminates) the false negative rate (sensitivity = 100%) (
Figure 3). For inhibition (n = 20), while the false negative rate remains the same, the MSM significantly reduced the false positive rate (
Figure 2). These results demonstrated that MSM should be used for the decision to perform clinical studies. These results appear to be consistent with the findings of Einolf [
21] using approved drugs.
Although the MSM provided greater prediction accuracy, the input required to generate robust extrapolations can be inefficient and costly. The current study demonstrated that the basic models for drug interactions should be used in early drug discovery to “rule out” DDI risks because of the minimal inputs required and the low rate of false negative predictions. Mechanistic static models can then be implemented for compounds requiring additional evaluation, whereas clinical studies are reserved only for compounds predicted to have DDI liabilities in both models.
For BI compound as the perpetrator, if judged based on GMFE and RMSE values, as compared to the literature [
7,
12,
17] using approved drugs, our prediction based on the FDA formula using BI development compounds had a good prediction performance for all perpetrators, all inhibitors, and all inducers (
Table 5). The only GMFE value > 2 was associated with induction (
Table 5); for all others, the GMFE values < 2, and GMFE < 2 indicates a good predictive performance [
16]. In BI, we took a conservative approach and fixed the d value to 1 for the prediction of induction [
7,
22], which may lead to a slightly higher GMFE value. In the investigation, the MSMs successfully predicted the exposure ratio (AUC in the presence-to-absence of inhibitor) to within 2-fold of that observed in clinical studies for 80% for all inhibitors, 50% for inducers, and 75% for all perpetrators (
Table 5,
Figure 4). This also demonstrated the good performance of MSM (based on the FDA formula) using BI development compounds.
In this investigation, we have demonstrated that including excretion data from preclinical species significantly improves the predictive performance for BI compounds (as victim) compared to not using excretion data (
Table 5,
Figure 6). At the preclinical development stage, only preclinical data, such as the percentage contribution of a specific CYP enzyme to overall metabolism and excretion data (e.g., rat urine and bile excretion), were available for the development compounds. Using preclinical excretion data as a surrogate for human urine data yields a better predictive performance based on GMFE and RMSE values comparable to those obtained with human urine data (
Table 5). This method can help the project teams select candidate drugs for clinical development.
Recently, machine learning-based computational models have been applied in the prediction of DDIs (e.g., ref. [
23]). Computational models using machine learning analyze large datasets, including pharmacokinetics, pharmacodynamics, biochemical pathways, and molecular interaction data. They can discover hidden patterns and interactions that may not be readily apparent through traditional rule-based methods, which have higher screening efficiency and integrate heterogeneous data sources to build predictive frameworks to predict DDIs. However, these computational models may raise concerns of potentially inconsistent data sources, data imbalance issues, overfitting, high complexity, and resource intensity, etc. Therefore, computational models may play a complementary role, not a replacing role, to traditional working models (BSMs and MSMs) in predicting DDIs by offering enhanced efficiency, accuracy, and insights into complex biological interactions.
It is acknowledged that the calculations performed have some limitations, as the dataset used is somewhat biased for three reasons. First, the compounds that were identified as being most potent inhibitors in in vitro studies did not make it to the clinical stage of development, because for various reasons they had a poor outlook as drug candidates. Secondly, since the “no-effect” range in the FDA guideline is very narrow (0.8–1.25), only a limited amount of data fell within the true negative category. Furthermore, because of a lack of metabolite characterization for the 18 compounds in the study, a metabolite-mediated DDI was not considered as part of the calculations. The small sample size, in particular for induction (n = 8) and mechanism-based inhibition (n = 4), is a limitation of this research.
In summary, the current research evaluated the performance of the FDA static models in predicting pharmacokinetic DDI risks using BI proprietary compounds. The DDIRC (PharmaPendium®) was used to assess the MSMs and data were correlated with human pharmacokinetic studies from Phase I clinical trials. The results showed that the FDA formulas for both basic static modes and mechanistic static models performed well in an industrial setting in support of drug development to predict DDIs. The results also demonstrated that basic models for drug interactions should be used in early drug discovery to “rule out” DDI risks because of the minimal inputs required and the low rate of false negative predictions. The MSMs showed good performance in predicting clinical DDIs and should be implemented for compounds needing further clinical evaluation. Furthermore, the inclusion of human urine or preclinical species excretion data in the formula for the MSM enhanced the predictivity for the DDI potential of (candidate) drugs.