Abstract
We describe a comprehensive retrospective analysis in which the abilities of several methods by which human pharmacokinetic parameters are predicted from preclinical pharmacokinetic data and/or in vitro metabolism data were assessed. The prediction methods examined included both methods from the scientific literature as well as some described in this report for the first time. Four methods were examined for their ability to predict human volume of distribution. Three were highly predictive, yielding, on average, predictions that were within 60% to 90% of actual values. Twelve methods were assessed for their utility in predicting clearance. The most successful allometric scaling method yielded clearance predictions that were, on average, within 80% of actual values. The best methods in whichin vitro metabolism data from human liver microsomes were scaled to in vivo clearance values yielded predicted clearance values that were, on average, within 70% to 80% of actual values. Human t1/2 was predicted by combining predictions of human volume of distribution and clearance. The best t1/2 prediction methods successfully assigned compounds to appropriate dosing regimen categories (e.g., once daily, twice daily and so forth) 70% to 80% of the time. In addition, correlations between humant1/2 and t1/2values from preclinical species were also generally successful (72–87%) when used to predict human dosing regimens. In summary, this retrospective analysis has identified several approaches by which human pharmacokinetic data can be predicted from preclinical data. Such approaches should find utility in the drug discovery and development processes in the identification and selection of compounds that will possess appropriate pharmacokinetic characteristics in humans for progression to clinical trials.
The process by which new drug candidates are discovered and developed is both time consuming and expensive (DiMasi, 1994; DiMasi et al., 1994). This is due in part to the high rate of attrition of drug candidates that enter clinical development, such that only ∼10% of drug candidates that are selected for clinical development eventually become marketed drugs. In analyzing the reasons for attrition of drug candidates that enter clinical development, it has been reported that the clinical development of 40% of drug candidates was discontinued due to unacceptable pharmacokinetic properties (Prentis et al., 1988).
These observations strongly suggest that the process by which new drugs are discovered and developed could benefit greatly if drug candidates were advanced to clinical development when predicted human pharmacokinetic characteristics were deemed to be acceptable (e.g., oral bioavailability and duration of exposure are projected to be appropriate for conducting pivotal efficacy studies). Thus, the development and application of reliable methods to predict human drug disposition may decrease the overall attrition of drug candidates during clinical development by decreasing the number of candidates lost due to unacceptable pharmacokinetic characteristics. Furthermore, the eventual clinical utility as well as market success of a newly approved drug could be maximized by selecting for development only those compounds with optimal, rather than acceptable, pharmacokinetic characteristics for the intended therapeutic use.
The best described technique to predict human pharmacokinetics fromin vivo preclinical pharmacokinetic data is allometric scaling. In its original form, allometry was a technique developed to explain observed relationships between organ size and body weight of mammals (Dedrick et al., 1970; Mordenti, 1986). Additional studies demonstrated further relationships between mammalian body weight and physiological parameters. Considerations of the relationship between drug elimination and physiological parameters such as hepatic or renal blood flow inevitably led to the application of allometric scaling in correlating human pharmacokinetics with pharmacokinetic parameters in preclinical species (Boxenbaum, 1982, 1984). Allometric scaling of pharmacokinetic data typically focuses on interspecies relationships between clearance or volume of distribution of unbound drug and species body weight; the relationships for these parameters established in preclinical species are then extrapolated to humans, allowing for predictions of human clearance and volume of distribution. Although a number of physiologically rather than allometrically based approaches have also been developed for interspecies scaling of pharmacokinetic data (Iwatsubo et al., 1996; Suzuki et al., 1995), allometry continues to be the most widely used approach due to its simplicity.
In recent years, there has been a resurgence in the use of allometric scaling to establish relationships among preclinical species and humans for both compounds that are metabolically and nonmetabolically cleared (Boxenbaum and DiLea, 1995; Mahmood and Balian, 1995, 1996a, 1996b). The major drawback in allometric scaling is its empirical nature. For example, traditional allometric scaling of plasma clearance does not allow for an understanding of species differences in pathways of metabolic clearance that may have significant impact on the ability to accurately extrapolate human clearance from preclinical data. However, recent publications have proposed novel methods of combining allometric scaling with knowledge of species differences in metabolism derived from in vitro metabolism data to improve the utility of allometry for compounds prone to major species differences in metabolism (Lave et al., 1995, 1996a, 1996b; Ubeaud et al., 1995)
Methods by which in vivo clearance can be predicted fromin vitro data were first described ∼20 years ago (Rane etal., 1977). The methodologies and mathematics behind approaches to predict in vivo clearance from intrinsic clearance data have been summarized in a recent review by Houston (1994). Although the data described by Houston are from rat, the principles described are applicable to other species, including humans (Iwatsubo et al., 1997). In the seminal work by Raneet al. (1977), it was demonstrated that the extent of hepatic extraction of several drugs in rats could be estimated from enzyme kinetic parameters of the oxidative biotransformation of these drugs in rat liver microsomes. The concept of an in vitro/in vivo correlation that included data from both human and preclinical species was reduced to practice for felodipine 10 years later (Baarnhielm et al., 1986). Various in vitro systems are available to obtain hepatic intrinsic clearance data; those most commonly used are liver microsomes, hepatocytes and precision-cut liver slices. Each system possesses unique advantages and disadvantages in both ease of use and accuracy and completeness of the data obtained. In general, for kinetic experiments, such as determination of intrinsic clearance, the body of data available suggest that hepatocytes are a superior method with regard to accurate predictions of in vivo data, with microsomes also providing good data (Ashforth et al., 1995; Hayes et al., 1995; Vickers et al., 1993; Zomorodi et al., 1995).
In this article, we describe a comprehensive retrospective analysis of preclinical pharmacokinetic and in vitro metabolism data accrued over a 14-year period for Pfizer proprietary compounds. The compounds in the data set used for this analysis cover a broad range of small-molecule (e.g., molecular weight <600) organic compounds designed for therapeutic use in a variety of disease states. Thus, use of this data set presents a great challenge to pharmacokinetic prediction methods because each method must not only be applicable to a close-in homologous series of compounds but also be broadly applicable to compounds of all types and physicochemical properties. These data were used in several methods, described herein, designed to predict the pharmacokinetics (clearance, volume of distribution, t1/2 and oral bioavailability) of drugs in humans. The methods include a battery ofin vitro, in vivo and combined in vivo/in vitro approaches both obtained from the scientific literature and described for the first time here. A comparison of the predicted values to authentic human pharmacokinetic data was made to compare the accuracies and uses of these prediction methods.
Methods
Sources of Pharmacokinetic and In Vitro Data
The original pool of compounds included in this analysis were all of those brought into preclinical development at Pfizer over a 14-year period from 1981 through 1994 (n = 83). From this set, those compounds for which no human data were available were removed (n = 30). Another three were excluded because they were developed as prodrugs. Thus, the data used in this analysis included all available preclinical pharmacokinetic and in vitro metabolism data for those compounds for which a minimum of a human in vivo t1/2 value was available (n = 50; table1). The amount of preclinical data available for each compound ranged from extensive (in which case, all prediction methods could be tested) to scant (in which case, only one or two prediction methods could be applied). Human in vivoclearance and oral bioavailability data used for a given compound were from the lowest dose in which sufficient plasma concentration-vs.-time data were available to adequately describe the terminal phase. This was done to minimize the potential of including CL and F values that could be confounded by saturation of CL and/or F or limitations on oral absorption at high doses.
Methods for Predicting Human Volume of Distribution
Four methods were examined for their ability to accurately and successfully predict human volume of distribution (table2): (1) a method in which an average fraction unbound in tissue in preclinical species is used with human plasma protein binding data to calculate human VDss (method V1), (2) a method in which a proportionality is established between VDss and fu in dog and human (method V2) and (3) allometric scaling without (method 3a) and with (method 3b) considerations for interspecies differences in plasma protein binding. This yielded a total of four methods, which are further described below.
Average fraction unbound in tissues method (method V1).
In this method, experimentally determined values for volume of distribution (in units of liters/kg) and plasma protein binding for each species were used, along with standard values for extracellular fluid volumes, plasma volumes and so forth, to calculate the fraction unbound in tissues in animal species. The following equation, which is a rearranged form of one previously described by Oie and Tozer (1979), was used to calculate the fraction unbound in tissues for each preclinical species for each compound:
After fut was calculated for each of the preclinical species, all values for a given compound were averaged. This averaged animal value for fut is assumed to be equal to fut in humans and, along with the value experimentally determined for human fu(fraction unbound in human serum/plasma), was used in the prediction of human VDss (in units of liters/kg) using the following equation (rearranged version of equation 1) and using appropriate human values for Vp, Re/i and so forth:
Proportionality (method V2).
This method simply states that a proportionality could be set up between the free-fraction of drug in plasma in dog and human and the volume of distribution in these two species. [In other words, free VD(human) = free VD(dog).] Implicit to this method was the assumption that tissue binding of drugs is similar in dogs and humans and that physiological parameters, such as extracellular fluid volumes, are similar between the two species on a per-weight basis. Solving for the human volume of distribution (in units of liters/kg) yielded the following equation:
Allometry without protein binding (method V3a).
In allometric scaling of volume of distribution, the physiological parameter used in the scaling was total body weight (Boxenbaum, 1982). In this method, plots were constructed of total volume of distribution in preclinical species (in units of liters per animal) vs. animal body weight (table 3) on a log-log scale for each compound in the analysis. Allometric equations in the form:
Allometry corrected for protein binding (method V3b).
An identical approach was taken as described above except that animal volume of distribution values were corrected for plasma protein binding using the following equation:
Methods for Predicting Human Clearance
Three approaches were examined for their ability to accurately and successfully predict human CL, with each approach possessing important variations, leading to a total of 12 prediction methods (table 2): (1) methods in which first-order consumption of parent drug was monitored in liver microsomal incubations to yield in vitro t1/2 values (methods C1a–C1d), (2) methods in which Vmax andKMapp were determined and used in the calculation of CL′int (methods C2a–C2d) and (3) allometric scaling methods with and without considerations of interspecies differences in plasma protein binding and/or MLP (methods C3a–C3d).
In vitro t1/2 methods.
With methods C1a, C1b, C1c and C1d, values for intrinsic CL (CL′int) were calculated from in vitro t1/2 data obtained in an appropriate system (e.g., liver microsomes), which were then scaled up to represent the CL expected in an entire organism. The fundamental basis behind this simple approach lies in the derivation of the integrated Michaelis-Menten equation (Segel, 1975):
Methods C1b and C1d use equations 13 and 14, respectively, as written above. Methods C1a and C1c use equations 16 and 17, which represent variations on equations 13 and 14 in which fraction unbound (fu) was removed:
Enzyme kinetic methods.
With methods C2a, C2b, C2c and C2d, the enzyme kinetic parameters KMapp andVmax measured in liver microsomal incubations were used to define intrinsic CL as:
Allometric scaling with protein binding and MLP correction factor (method C3a).
In allometric scaling of CL, the physiological parameter used in the scaling was total body weight. In the case of this method, corrections for interspecies differences in both plasma protein binding and MLP (Boxenbaum, 1982) were applied. For plasma protein binding, free CL is defined as:
Allometric scaling without protein binding and without MLP correction factor (method C3b).
This allometric method was carried out as described above using total CL and body weight, with no correction for interspecies differences in MLP.
Allometric scaling with protein binding and without MLP correction factor (method C3c).
This allometric method was carried out as described in C3b using free CL values and body weight, with no correction for interspecies differences in MLP.
Allometric scaling without protein binding and with MLP correction factor (method C3d).
This allometric method was carried out as described in C3a, except that CL values were not converted to free CL values before regression.
Methods for Predicting Human t1/2
Two approaches were examined for their ability to accurately and successfully predict human t1/2 (table 2): (1) methods that rely on direct correlations between animal and humant1/2 values (methods T1–T3) and (2) methods in which individual volume of distribution and CL predictions are combined to yield t1/2 predictions (methods TV1C1a, TV1C1b and so forth).
Animal correlations (methods T1–T3).
Assessment of animal/human t1/2 correlations were undertaken with a data set containing both data for in-house proprietary compounds and data from the scientific literature for whicht1/2 data was available for rat, dog, monkey and human. Only compounds with t1/2data for all four species were used in these analyses. To construct correlations, measured t1/2 values in rat, dog or monkey were plotted vs. human t1/2values, and functions were derived from 1/x-weighted linear regression. The predictions of human t1/2were then obtained by inserting the animalt1/2 value into the regression equation.
Combinations of human volume and clearance predictions [methods Tv(x)c(X)].
In this approach, each method for predicting the volume of distribution was combined with each method of predicting CL to generate predictions of humant1/2 using the following formula:
Methods for Predicting Human Oral Bioavailability (Methods FC1a–FC3d)
The methods for predicting human oral bioavailability used those described for CL (table 2), with a rearranged equation that accounted only for first-pass hepatic CL and accounted for neither the potential limitations on absorption from the GI tract (i.e., fraction absorbed, Fa, was assumed to be unity) nor potential first pass extraction by the gut wall tissue (Fg = 1):
Success criteria.
For volume of distribution and CL predictions, success was assessed by the geometric mean of the ratio of predicted and actual values. Thus:
For t1/2, a similar calculation was made. In addition, a second success criterion was applied that was applicable to drug development and compound selection. In this criterion, the success rate of correctly placing compounds into an appropriatet1/2 zone was measured. These predetermined zones were based on dosing regimens associated with half-lives (when considerations of disparate PK/PD relationships and wide therapeutic indices are ignored). The zones were 0 to 4 hr (three times daily), 4 to 12 hr (twice daily), 12 to 48 hr (once daily) and >48 (once daily or less often). In addition, if a compound was predicted to have a value that was outside of the appropriate zone but the prediction was still within 2-fold of the actual t1/2 , the prediction was also considered to be successful. The success rate of a t1/2 prediction method is simply the number of compounds successfully predicted by the method divided by the total number of predictions made using the method and then multiplied by 100.
For oral bioavailability, success of prediction methods was assessed by the percentage of compounds that were appropriately predicted to be <10%, 10% to 30% and >30%; these zones represent categories of unacceptable, intermediate and satisfactory, respectively, as defined by general decision making criteria typically used in drug discovery and development processes.
Results
Predictions of volume of distribution.
There were 16 compounds that had adequate preclinical data and human intravenous pharmacokinetic data suitable for assessment of VD predictions. Predictions of VD by methods V1, V2, V3a and V3b are presented in figure 1, A–D, respectively, and the identities of outlier compounds are indicated.
Method V1 predicted human VD within 2-fold of actual for 14 of 16 compounds (88%; fig. 1A). The simplest approach, method V2, predicted human VD within 2-fold of actual for 13 of 16 compounds (81%; fig.1B). The accuracy of predictions using V1 or V2 was typically much better than 2-fold as indicated by a geometric mean accuracy value of 1.56 for each method (table 4). Method V3a was the poorest predictor of human VD (fig. 1C) in this analysis in that only 8 of 15 predictions (53%) were within 2-fold of actual. The geometric mean prediction accuracy value for method V3a was 2.78. Method V3b predicted human VD within 2-fold of actual for 10 of 13 compounds (77%; fig. 1D). The geometric mean prediction accuracy value for method V3b was 1.83. No similarities (e.g.,physicochemical properties, VD, fu and so forth) were readily apparent for the compounds for which poor predictions of VD were obtained.
Predictions of clearance.
There were as many as 14 compounds that had adequate preclinical data and human intravenous pharmacokinetic data suitable for assessment of CL predictions.
Methods C1a to C1d used easily obtainable in vitro t1/2 data in the calculation of in vitro CL′int and scale-up to in vivo CL values. The four variations of this method applied thein vitro CL′int values in the well-stirred and parallel tube models of hepatic extraction both with and without considerations for protein binding. Of these four methods, C1a and C1c, which use the well-stirred and parallel tube models, respectively, without considerations for plasma protein binding, yielded, on average, accurate predictions of human CL (table 4). In each case, predictions of the CL of six of seven compounds were within 2-fold of actual values (fig. 2, A and C). However, when protein binding values were included in the equations for the well-stirred or parallel tube models, large underpredictions of CL were obtained (fig. 2, B and D), resulting in an overall inaccuracy for methods C1b and C1d. The compounds that were underpredicted using C1b and C1d were all highly protein bound (fu<0.04).
The predictions of CL by commonly applied in vitro enzyme kinetic methods, using the well-stirred model (C2a and C2b) and the parallel tube model (C2c and C2d), are presented in figure3, A–D, respectively. Methods C2a and C2c predicted human CL within 2-fold of actual CL for seven of eight compounds (88%). The geometric mean accuracy values for prediction methods C2a and C2c were 1.63 and 1.67, respectively (table 4). As with methods C1b and C1d, inclusion of plasma protein binding corrections into the well-stirred or parallel tube models for methods C2b and C2d resulted in significant underpredictions of CL for some compounds and thereby decreased the predictive power of these approaches. Thus, only four of eight compounds (50%) were within 2-fold of actual CL values for methods C2b and C2d, and geometric mean prediction accuracy was poor, with values of 8.12 and 7.81, respectively.
The predictions of CL by allometric scaling methods C3a, C3b, C3c and C3d are presented in figure 4, A–D, respectively. The most predictive allometric method was C3c (allometric scaling of CL/fu without MLP consideration), in which nine of 13 predictions (69%; fig. 4C) were within 2-fold of actual, and the geometric mean prediction accuracy was 1.79 (table 4). When allometric scaling was done without inclusion of protein binding and MLP considerations (method C3b), nine of 14 compounds (64%; fig.4B) were predicted to have CL within 2-fold of actual and the mean geometric prediction value was 1.91. Inclusion of MLP considerations in methods C3a and C3d resulted in poor predictions of CL such that the geometric mean values were 2.67 and 3.36, respectively. Predictions of CL were within 2-fold of actual for three of 13 compounds (23%; fig.4A) using method C3a and three of 14 (21%; fig. 4D) compounds using method C3d. No readily apparent trend could be discerned among the outlier compounds (i.e., those identified in figure 3 for which allometric methods did not predict CL).
Predictions of humant½.
Combination of each of the four volume of distribution methods with the 12 CL prediction methods resulted in a total of 48 t1/2prediction methods. A histogram of success rates for each of these prediction methods is given in figure 5, and the mean accuracy values are listed in table5. Mean accuracies for the various methods ranged from 2.13 (TV3aC2a and TV3aC2c) to 8.25 (TV3aC2d). In general, the more accurate CL prediction methods yielded more accurate t1/2 prediction methods. In vitro CL methods that disregarded the impact of protein binding generally yielded t1/2 prediction methods with mean accuracies between 2- and 3-fold of actualt1/2 values, whereas those that included protein binding were generally inaccurate. Allometric CL prediction methods, when combined with any volume of distribution prediction methods, gave t1/2 predictions that were less accurate than in vitro methods that disregarded protein binding but more accurate than in vitro methods that included protein binding. Compared with CL and volume of distribution prediction methods alone, the combination of these two parameters fort1/2 prediction methods yielded generally less accurate predictions.
When assessed by success rate criteria, the best volume and CL combinations were those that included in vitro metabolic rate data but disregarded protein binding in the prediction of CL (e.g., TV2C1a, TV2C2a, TV3aC1c, TV3aC2a; fig. 5). Such methods yielded success rates in the 70% to 80% range. Methods that combined allometric volume and CL prediction methods were generally successful 50% of the time (e.g., TV3bC3d). Allometric CL prediction methods appeared to be somewhat improved when combined with method V2; in this case, success rates exceeded 50%.
Human t1/2 values were also predicted directly from animal t1/2 data. In figure6, plots of monkey, dog and ratt1/2vs. humant1/2 are presented for both in-house data and data from the scientific literature. The data were subjected to 1/x-weighted linear regression, and the functions obtained were used with individual animal t1/2 data to calculate a predicted human t1/2. (Functions are listed in the caption of fig. 6.) When subjected to the success criteria from equation 22, average -fold errors were 1.94, 2.19 and 1.79 using monkey (T1), dog (T2) and rat (T3) data, respectively. Success rates for prediction of appropriate dosing regimen using this procedure were 87%, 72% and 83% for the monkey (T1), dog (T2) and rat (T3) methods, respectively.
Predictions of human oral bioavailability.
Each of the 12 CL prediction methods were used in predictions of oral bioavailability, assuming that all compounds were 100% absorbed. Success rates for the prediction methods are given in figure 7, with values ranging from 60% to 100% success. Allometric CL methods predicted oral bioavailability in the proper zone (i.e.,unacceptable, intermediate, or satisfactory, see Methods) 71% to 93% of the time. In vitro metabolism prediction methods were comparably accurate.
Discussion
This retrospective analysis was successful in identifying methods that were generally applicable for the prediction of human pharmacokinetic parameters using preclinical pharmacokinetic andin vitro drug metabolism data. The methods examined represent a wide array of techniques, but certainly are not an exhaustive list of all possibilities. Some of these methods were taken directly from the literature (e.g., V3b, C2b), and some were developed as variations of literature methods (e.g., C2a); some involved customizing ideas and equations from the literature for prediction purposes (e.g., V1), whereas others were newly developed and described for the first time here (e.g., V2, C1a).
To our knowledge, methods by which human pharmacokinetic parameters can be accurately (e.g., within 10% of actual values) predicted for a wide range of compounds do not presently exist. In general, many of the prediction methods that were tested in this report yielded an adequate level of accuracy when success criteria were applied that represent suitable decision-making metrics in the drug discovery process. For example, it is our contention that a goal to be strived for in predicting human t1/2 should be prediction of a dosing regimen and not a preciset1/2 value. Similarly, it may be unnecessary to predict a precise value for human oral bioavailability but rather to answer the question of whether a given compound will have satisfactory or unsatisfactory oral bioavailability. There are numerous reports in the literature of a particular method accurately predicting the pharmacokinetics of an individual compound, but the purpose of our analysis was to identify methods that would be most broadly applicable in the prediction of human pharmacokinetics. Literature reports typically describe only successes, and it is expected that failed attempts at the prediction of human pharmacokinetics are neither reported nor published. The purpose of this work was to compare various prediction methods in as objective a fashion as was possible. We relied on in-house data for this purpose, rather than relying on the scientific literature for accounts of successful prediction methods. However, in many cases, the absence of authentic human CL, volume of distribution and oral bioavailability data for compounds used in this analysis confounded our efforts to obtain a large number of data points for some methods. This paucity is due to a lack of human intravenous pharmacokinetic data. The data pool is rich in humant1/2 data obtained after oral administration, and therefore predictions of CL and VD could be assessed indirectly when combined to generatet1/2 predictions.
Methods of predicting volume of distribution were, for the most part, highly successful. Our intention was not to predict this parameterper se but rather to use these predicted values in combination with CL projections for predicting humant1/2, which is a more meaningful parameter with decision-making impact on the drug discovery and development processes. The importance of plasma protein binding should be emphasized in the successful prediction of volume of distribution. Of the four methods of predicting volume of distribution described here, three contained protein-binding data as an essential element. These three represented the most successful methods, whereas the method in which protein binding was disregarded was substantially less successful. Method V2 represents a novel, simple method to accurately predict human volume of distribution. The only experiments needed for this method are determinations of protein binding in dog and human plasma (or serum) and intravenous pharmacokinetics in dog. A similar method using rat data (VD and fu) was examined (data not shown) but did not appear to approach the accuracy of the dog method and was not further pursued.
The other two volume of distribution prediction methods that appeared to have similar accuracy to method V2 were methods V1 and V3b. Both of these methods require more data than method V2 but do not appear to offer any advantages with regard to general accuracy of predictions. Method V1 is a more elaborate version of V2 (or V2 is a simplified version of V1) using the same principles regarding the relationship between plasma protein binding and volume of distribution. It was derived from an expression first described by Oie and Tozer (1979)relating plasma protein binding, tissue binding and various compartmental volumes. The equation was rearranged to calculate the free fraction in tissues (fut) of preclinical species; these values were then averaged as an estimate of tissue binding in humans, and this average fut value was used in conjunction with human plasma protein binding to compute the volume of distribution. Thus, the experiments needed to generate the minimum data required for this method include intravenous pharmacokinetics in at least two preclinical species and plasma protein binding in these species and humans. Method V3b required the same data as method V1 and was similarly predictive as this method. Clearly, allometric scaling of volume of distribution required correction to free volume of distribution (method V3b) because method V3a, which did not incorporate protein binding data, was substantially less successful than V3b.
On the whole, methods of predicting human CL were less accurate than those for predicting volume of distribution. However, some appeared to be adequate for combination with predictions of volume of distribution for subsequent predictions of t1/2 (see below). The recent increase in availability, characterization and utility of human reagents, such as human liver microsomes, has added another dimension to the prediction of human pharmacokinetics by allowing for prediction of human metabolic CL from in vitrometabolism studies. The use of in vitro hepatic microsomal intrinsic CL data to predict systemic CL carries several assumptions and caveats: (1) metabolic CL is the primary CL mechanism (i.e., CLm > CLrenal + CLbiliary + CLother), (2) the liver is the major CL organ, (3) oxidative microsomal metabolism is the predominant route of metabolism (compared with nonmicrosomal metabolism and conjugative metabolism) and (4) metabolic rates and enzyme activities measuredin vitro are truly reflective of those that occur in intact systems in vivo. Two types of in vitro methods were examined: one type that used simple in vitrodegradation rate data (1a–d) and one type that used the more elaborate enzyme kinetic data (2a–d). In both cases, human hepatic intrinsic CL values were calculated from in vitro data in human liver microsomes. The in vitro t1/2method of determining intrinsic CL carried several caveats. First, the experiment must be conducted at a substrate (drug) concentration below the apparent KM value (which is not known a priori to conducting this type of analysis). In the data we present, substrate concentrations were typically 1 μM. Second, no significant enzyme inactivation can occur during the incubation period for an accurate determination. For cytochrome P450-catalyzed reactions, enzyme inactivation due to the concomitant formation of reactive reduced oxygen species (e.g., H2O2) is typically observed. Finally, the reaction cannot approach equilibrium (which is not a problem for cytochrome P450-catalyzed reactions). Despite these caveats, the predictions of human CL obtained using in vitro t1/2 data were fairly comparable to those made using the more extensive enzyme kinetic data (Vmax/KMapp).
Four variations were applied for in vitro t1/2 and enzyme kinetic approaches using two different models of hepatic extraction (well-stirred and parallel tube) with or without inclusion of plasma protein-binding data. The inclusion of protein-binding data is traditionally a cornerstone of these models (Pang and Rowland, 1977; Wilkinson and Shand, 1975); however, in our analysis, disregarding this factor yielded superior predictions of CL (e.g., C1a vs. C1b, C1cvs. C1d, C2a vs. C2b, C2c vs. C2d). This was primarily due to highly protein bound compounds for which CL was severely underpredicted when the very low values (<0.1) for free fraction in plasma were included. Interestingly, most of these compounds were lipophilic amines. Our current working hypothesis is that these compounds were also highly bound to the liver microsomes used in in vitro incubations, leading to underestimates of free intrinsic CL (Obach, 1996). If binding to microsomes and binding to plasma proteins are equivalent, the unbound fraction terms will cancel and equations 13 and 14 will degenerate to equations 16 and 17, respectively. Experiments are under way to address this. Geometric means of predicted CL/actual CL were <2 for each of methods C1a, C1c, C2a and C2c, suggesting that CL will be predicted within 2-fold of actual values. The low number of data points (six to eight) preclude conclusions regarding the anticipated success of prospective application of these methods at this time.
In predictions of human CL, two of the four allometric scaling methods yielded accurate predictions (C3b and C3c). Interestingly, inclusion of corrections for MLP were less accurate than corresponding methods that lacked this correction. Of the compounds examined in this analysis using allometric scaling, most are cleared via hepatic oxidative metabolism. Interspecies differences in intrinsic abilities to metabolize compounds (Lin, 1995) can confound allometric scaling. Despite this, allometry generally provided good predictions of human CL. Method C3c, allometric scaling of CL that included corrections for interspecies differences in plasma protein binding but did not correct for MLP, was the most successful of the allometric methods yielding predictions of human CL that were typically within 2-fold of actual values. It should be further noted that the knowledge of regression coefficients of allometric relationships for each compound was not used. We did not discard compounds for which such a regression value was low; all data were included. In a prospective manner, allometric methods of predicting human CL would likely not be used for compounds demonstrating poor allometric relationships. Furthermore, observation of outlier species on allometric plots would suggest that such data be removed from the relationship, whereas in our analysis all data points were included, regardless of how well each of the species “lined-up” in allometric plots. Furthermore, compounds for which data were available in only two preclinical species were still included in the assessment of allometric scaling. In a pure sense, allometry should only be used when data from at least three preclinical species are available (Mahmood and Balian, 1995); however, in the drug discovery process, one is frequently faced with situations in which such extensive data are not available. In our efforts to apply these methods as they could be applied in the “real world” situations of drug discovery (as opposed to having all the data truly needed for the best predictions), assessments of the predictive abilities of allometric scaling (and other methods as well) in this report represent an underestimate of predictive use. Application of drug metabolism/pharmacokinetic insight to the process of prospectively predicting the human pharmacokinetics of any individual compounds would lead to improvements in the accuracy of predictions.
The predictions of human CL and volume of distribution, although interesting and useful in their own right, represent a means to predicting human t1/2, a parameter that is better understood by nonpharmacokineticist colleagues in the drug discovery and development field (e.g., medicinal chemists, pharmacologists, clinicians). Furthermore, thet1/2, along with knowledge of the therapeutic index and pharmacokinetic/pharmacodynamic relationships, dictates the dosing frequency. A frequently asked question in the preclinical drug discovery process is, “Will compound X be a once-per-day drug?” Thus, an ability to predict dosing regimen by predicting human t1/2 will provide tremendous value to drug discovery efforts in the compound selection process. To this end, we targeted prediction of half-lives not as absolute values but rather as an ability to place compounds into appropriate dosing regimen zones. These zones were preset before the analysis at 0 to 4, 4 to 12, 12 to 48 and >48 hr, which, in cases of “average” therapeutic index and straightforward relationships between PK and PD, approximately correlate to dosing regimens of three times a day or more often, twice a day, once a day and potentially less than once a day, respectively. Furthermore, so as not to restrict predictions to absolute cutoff values for success (e.g., not to classify a prediction of 3.8 hr a failure when the actualt1/2 was 4.5 hr), a 2-fold accuracy criterion was overlaid on the dosing regimen success criterion. In this work, we described two types of approaches to predicting humant1/2: a combination of CL and distribution volume predictions and a direct correlation of animal and humant1/2 values.
In combining CL and volume of distribution predictions to predictt1/2, all methods were combined, again to remain unbiased and comprehensive in the assessment of methods. Thus, methods that were unsuccessful in the prediction of CL and volume were not excluded from being examined in combinations to predictt1/2. Also, methods were “mixed”; for example, an allometric (in vivo) volume prediction method was combined with an in vitro CL prediction method. As might be expected, methods that were more successful in predicting the independent parameters of CL and volume were generally more successful in combination in predicting t1/2. The best combination methods yielded success rates of 70% to 80%. In light of the cost of drug development, the attrition rate of new chemical entities in clinical development and the extent to which inadequate pharmacokinetics contributes to this attrition rate, such at1/2 prediction success rate would result in an effective strategy for compound selection. An additional consideration in the prediction of humant1/2 is the fact that thet1/2 values for compounds exhibiting multiphasic plasma concentration-vs.-time profiles (i.e., multicompartmental kinetic models) will be underpredicted using combinations of CL and VD predictions. In the data set used, there were no compounds that exhibited such a behavior. In the case of multiphasic behavior, the parameter VDβ should probably be used in the prediction of terminal phase t1/2. However, this should also be done cautiously because long multiphasic terminal phaset1/2 values seldom have an impact on dosing regimens, and the underlying purpose of predicting humant1/2 is primarily to assess potential dosing regimens.
In addition to testing the ability of preclinical pharmacokinetic data to predict human pharmacokinetics by the use of allometric scaling, evaluations were conducted to determine the ability to predict human pharmacokinetics from animal data using simple correlations of pharmacokinetic parameters. Although previous reports have retrospectively described the predictability of humant1/2 and volume of distribution from rat pharmacokinetics using simple correlations, that research did not compare the success rates of these correlations between the species commonly used in preclinical evaluations (Bachmann et al., 1996). In the present research, a population of compounds was identified from both in-house and literature sources for which there was intravenously derived t1/2 data for rat, dog and monkey and t1/2 data from either intravenous or oral studies in humans. The current analysis was limited to evaluation of t1/2 predictions to take advantage of the relatively large amounts of human oral pharmacokinetic data compared to intravenous data. For this set of 46 compounds, the success rates for prediction of dosing regimen ranged from 72% for the dog to 87% for the monkey. Thus, for the prediction of human t1/2, there potentially exist relatively simple animal/human correlation methods based on preclinical intravenous pharmacokinetic data, which have success rates approaching or exceeding those for more complex techniques involving in vitro metabolism data or pharmacokinetic data from multiple species. Because this evaluation was retrospective in nature, using existing preclinical and clinical data, future research will focus on testing the usefulness of these correlations to prospectively predict humant1/2 from preclinical data.
Predictions of oral bioavailability were generally successful for the small number of compounds for which in vivo data were available. This success was despite the fact that only hepatic microsomal metabolism was considered as a limitation in these methods. Clearly, this undervalues the potential impact of limitations of absorption and first-pass metabolism mediated by the intestinal mucosa. However, our objective was to place a compound into an oral bioavailability category as dictated by drug development decision making criteria and not to predict precise values. It is anticipated that in vitro methods (e.g., Caco-2 cells) by which fraction absorbed values can be quantitatively predicted will be available in the near future, and these can be incorporated into more refined oral bioavailability predictions. In addition, recent reports have shown that intestinal metabolism can have a pronounced first-pass effect on oral bioavailability, especially for CYP3A substrates (Thummel et al., 1996; Wu et al., 1995). Thus, future methods to accurately predict of oral bioavailability may involve the prediction of intestinal first-pass effects as well.
The population of compounds used in this analysis represented all Pfizer compounds to enter Phase I clinical trials from 1981 through 1994. Over this time period, there were substantially more human pharmacokinetic data generated after oral administration than after intravenous administration. For this reason, the current data set available to assess prediction techniques for parameters requiring intravenous administration (volume of distribution, plasma CL and absolute bioavailability) was smaller than that available for techniques to predict parameters that can be adequately determined after oral administration (t1/2).
In conclusion, several methods using preclinical pharmacokinetic andin vitro human metabolism data have been found to be useful in the prediction of human pharmacokinetic parameters. Future extensions of this research will focus on increasing the population of compounds with human CL, bioavailability andt1/2 predictions derived from in vitro metabolism data. In addition, future research will include assessments of the usefulness of in vitro data to predict oral CL, thereby reducing the need for human intravenous pharmacokinetic data to determine the usefulness of these in vitro techniques.
Acknowledgments
The authors wish to acknowledge the many Pfizer Drug Metabolism Department Scientists of Groton, CT, and Sandwich, UK, past and present, who have generated data used in these analyses and Drs. Robert Ronfeld and Dennis Smith for critical evaluation of this work.
Footnotes
-
Send reprint requests to: Dr. R. S. Obach, Department of Drug Metabolism, Pfizer Central Research, Eastern Point Road, Groton, CT 06340.
- Abbreviations:
- fut
- fraction unbound in tissues
- fu
- unbound fraction in plasma (or serum)
- VDss
- steady state volume of distribution (in liters/kg)
- Vp
- plasma volume (in liters/kg), Ve, extracellular fluid volume (in liters/kg)
- Vr
- “remainder of the fluid” volume (in liters/kg)
- Re/i
- ratio of binding proteins in extracellular fluid (except plasma) to binding proteins in plasma
- CL
- clearance
- F
- oral bioavailability
- MLP
- maximum lifespan potential
- Received March 4, 1997.
- Accepted June 23, 1997.
- The American Society for Pharmacology and Experimental Therapeutics