|
|
||||||||
RESEARCH REPORT |
1,4 Department of Oral and Maxillofacial Surgery, Massachusetts General Hospital and Harvard School of Dental Medicine, 55 Fruit Street, Warren 1201, Boston, MA 02114;
1 Department of Oral Health Policy and Epidemiology, Harvard School of Dental Medicine, 188 Longwood Avenue, Boston, MA 02115; and
1,2,3 Department of Biostatistics, Harvard School of Public Health, 677 Huntington Avenue, Boston, MA 02115;
*corresponding author, PO Box 67376, Chestnut Hill Station, Chestnut Hill, MA 02467, schuang{at}hsph.harvard.edu
| ABSTRACT |
|---|
|
|
|---|
KEY WORDS: survival predictions dental implants clustered data correlated survival analysis proportional hazards model marginal approach Aalen-Breslow estimator
| INTRODUCTION |
|---|
|
|
|---|
Statisticians have significant interest in developing and applying statistical methods and models to address correlated or clustered multivariate survival data (Spiekerman and Lin, 1998). The analytical challenge lies in accounting for the correlation of the observations for valid statistical inferences. To this end, two classes of models have been proposed: (1) proportional hazards frailty and (2) marginal proportional hazards models (Vaida and Xu, 2000). The former approach formulates the dependence structure explicitly. Some investigators had mainly utilized the gamma frailty for the dependence structure of the proportional hazards frailty model discussed by Vaida and Xu (2000). The latter class of models, marginal proportional hazards, does not specify the dependence structure in the model formulation but adjusts for it in the inference by means of sandwich-type estimators (Lee et al., 1992; Lin, 1994; Spiekerman and Lin, 1998).
In dental implant research, multiple implants placed into the same patient produces challenging analytic problems, because the dataset is composed of multiple, correlated observations. To address the issue of correlated, dependent observations, many authors recommended randomly selecting one implant per patient for analysis, resulting in inefficient estimation because not all of the data are used (Haas et al., 1996; Herrmann et al., 1999; Lekholm et al., 1999; Gomez-Roman et al., 2001; Weibrich et al., 2001). The purpose of this study is to compare and contrast two methods for producing survival prediction estimates and their associated confidence intervals for a dataset composed of clustered observations. The first method, commonly used but statistically invalid, assumes independence of clustered observations. The second method is an innovative analytic survival prediction method, i.e., modified Aalen-Breslow estimates, designed to produce statistically valid, efficient prediction models of implant survival for clustered observations using the marginal approach by Spiekerman and Lin (1998). We believe that this is the first report with integrated clinical applications of theoretical clustered survival methodologies to predict dental implant survival.
| MATERIALS & METHODS |
|---|
|
|
|---|
Statistical Issues
In this article, we considered the survival prediction problem for the Cox proportional hazards model (Cox, 1972). Specifically, let T be the failure time, and let Z be the corresponding covariate vector. Also, let SZ() be the survival function of T given Z. The Cox model can be written as log(-log(SZ(t)) = h(t) + ßTZ, which is a member (special case) of the general linear transformation model g(SZ(t)) = h(t) + ßTZ for link function g (x) = log(-log(x)), where h(t) is a completely unspecified strictly increasing function and ß is a p x 1 vector of unknown regression coefficients. (Cheng et al., 1995, 1997). We propose pointwise procedures for Sz0(t), the "t-year" (or "t-months") survival probability of future patients with a given covariate vector z0 with clustered implant observations. The survival prediction method proposed by Spiekerman and Lin (1998) was integrated and applied to predict survival probabilities of correlated dental implants for a given set of covariates via the Aalen-Breslow estimators. The set of covariates selected to be used in this study, i.e., current tobacco use, implant staging, and timing of implant placement, was previously identified by statistically valid and efficient methods that adjusted for clustered observations (Chuang et al., 2002). Databases were stored in Epiinfo 2000 (Centers for Disease Control and Prevention, Atlanta, GA, USA) and SAS-PC 8.0 (Cary, NC, USA) files. Advanced statistical computations for clustered survival predictions were utilized by S-plus (Version 3.4, 1996) with special functions and coding with programming in the Unix environment provided by our co-authors (Tian and Wei).
Statistical Notation
As discussed by Spiekerman and Lin (1998), if each patient consists of Li implants, the marginal hazard function for the lth implant of the ith patient is related to the corresponding possibly time-dependent covariate vectors Zil(t) by
![]() |
For i = 1,...,N, and l = 1,...,Li, let Til and Cil be the failure and censoring times with respect to the lth implant of the ith patient, and Zil = (Z1il,...,Zpil)T be the corresponding (and possibly time-varying) covariate vector. The marginal distribution of Til is related to Zil through model (*). Define Ti = {Til; l = 1,...,Li}, with Ci and Zi defined similarly. Suppose that (Ti, Ci, Zi) (i = 1,...,N) are independent, identically distributed (i.i.d.), and that Ti is independent of Ci conditional on Zi. The patients are allowed to have different sizes of dental implants (i.e., Li, which denotes that the cluster sizes of dental implants in the same patient can vary). We define Xil = minimum (Til, Cil) and
il = 1 (Til
Cil), where 1() is the indicator function.
Under the independence working assumption, the "quasi-partial likelihood" for ß0 is:

![]() |
![]() |
L(ß) is the partial likelihood function, the observed information matrix is I(
) = -
2logL(ß)/
ß2, given that ß =
, n is the total number of dental implants, and N is the total number of patients.
To make survival predictions and the variance-covariance matrix of the survival function Sz0(t), we need to derive the Aalen-Breslow type estimators for
0(t), which is the baseline cumulative hazard function. The variance-covariance matrix for the survival function also needs to be derived and computed. [For statistical methodologies with vigorous mathematical details on the convergence and asymptotic properties of these estimators, please refer to Spiekerman and Lin (1998, pp. 1167-1169).]
The Aalen-Breslow-type estimator for
0(t) is:
![]() |
The variance of the Aalen-Breslow type estimators adjusted for clustered observations for
0(t) is:
![]() |


We are interested in constructing pointwise confidence intervals for
0(t), and the corresponding survival function S0(t) = e-
0(t), for a given time point t. Then, the corresponding confidence intervals for
0(t) and S0(t) are
![]() |
![]() |
(t) = exp(-
(
(·),t)). To make inferences about the cumulative hazard and survival functions with a given covariate vector Z0, one can simply replace ZilZ0, i=1,...,N, l=1,...,Li in the original dataset and obtain the confidence intervals for the underlying cumulative hazard and survival function with this modified dataset.
| RESULTS |
|---|
|
|
|---|
We previously identified the variables statistically associated with implant survival: timing of implant placement (delayed vs. immediate), current tobacco use (yes or no), and implant staging (one- or two-stage) (Chuang et al., 2002). For patients who had a delayed procedure, did not smoke, and underwent a two-stage implant procedure (best-case scenario), the predicted one- and five-year survival rates were 97.2% and 93.4%, respectively. For patients who had an immediate implant placed, smoked, and underwent a one-stage procedure (worst-case scenario), the predicted survival rates at one and five years were 58.5% and 27.6%, respectively. The details of various combinations are shown in Table 1
. As an example of producing valid Aalen-Breslow variance of the predicted one-year survival point estimates, we used the cohort of non-smoking patients having immediate implants placed in one stage. The predicted one- and five-year survival point estimates were 83.7%. and 65.2%, respectively. The 95% confidence intervals were 76.5% to 91.6% when computed by the naïve method and 73.3% to 94.2% when computed by the dependence method. The 95% confidence interval computed by the dependence method was 5.9% wider (absolute difference) than the confidence interval computed by the naïve method (Table 2
). For the five-year predicted survival estimate, the 95% confidence intervals were 51.2% to 83.0% computed by the naïve method and 42.2% to 88.2% computed by the dependence method. The 95% confidence interval computed by the dependence method was 14.2% wider (absolute difference) than the confidence interval computed by the naïve method. The dependence method for computing variance produced an adjusted standard error that was larger and with wider confidence intervals than with the naïve method, illustrating the positive, strongly correlated association of the survival of implants within the same subject given specific important predictors (Table 2
). The results of the above analyses suggest that, in the setting of clustered observations, the naïve method variance estimate was invalid. As such, the choice of method for variance estimation is an important consideration when survival predictions are performed.
|
|
| DISCUSSION |
|---|
|
|
|---|
To produce statistically valid prediction models adjusted for clustered failure time data, we applied the method described by Spiekerman and Lin (1998). Based on that method, patients who were non-smokers, had implants placed in two stages, and did not have immediate implants had the best survival estimates (one- and five-year survival estimates were 97.2% and 93.4%, respectively). Patients who were smokers and had implants placed immediately in one stage had the worst survival estimates (one- and five-year survival estimates were 58.5% and 27.6%, respectively).
To compare and contrast how modifying the assumptions of implant observations (independent or dependent) affected the predicted survival point and variance estimates, we utilized a common method for analyzing implant survival (Higuchi et al., 1995; Wheeler, 1996; Buser et al., 1997; Brocard et al., 2000; Testori et al., 2001), ignoring the issue of dependency among implant observations from the same subject (naïve model). In the second model, we utilized all implants but adjusted for correlation among implants from the same subject for survival predictions (dependent model). When comparing the two analytic strategies, we found that the one- and five-year survival point estimates were similar, but the variance estimates were drastically different. The 95% confidence intervals for the naïve model were narrower, by approximately 5.8% to 14.2%, than the dependent model. In the setting of clustered observations, we believe that the standard error estimates calculated by the Spiekerman and Lin (1998) method is the most statistically valid and efficient choice for analysis of correlated survival predictions in implant research, because it accounts for the correlation of implants within subjects.
The marginal modeling methodology as described by Wei et al. (1989), Lee et al. (1992), Lin (1994), and Spiekerman and Lin (1998) is likely to increase its popularity for the analysis of clustered multivariate survival data. Lin (1994) also presented a general theoretical statistical methodology for analyzing such data structure, an idea analogous to that of Liang and Zeger (1986), without censoring in longitudinal data analysis. Lin's approach formulates the marginal distributions of multivariate failure times with the familiar Cox proportional hazards models, while leaving the nature of dependence among related failure times completely unspecified. The baseline hazards functions for the marginal models may be identical or different. Generalized estimating equations investigated by Lin (1994) from the marginal approach for the regression parameters revealed consistent and asymptotically normal estimators, and robust variance-covariance estimators are constructed to account for the intra-class correlation.
Further simulation results by Lin (1994) demonstrated that the large-sample approximations are adequate for practical use, and that ignoring the intra-class correlation could yield rather misleading variance estimators which are similar in our investigation. The theoretical work by Spiekerman and Lin (1998) provided additional modeling capabilities by allowing for separate baseline hazard functions among different strata and imposing the same baseline hazard function within each stratum. Second, it provides a rigorous asymptotic theory for the estimation of the regression parameters, filling several important gaps in the existing proofs for the Lee et al. (1992) and Lin (1994) method. Third, it establishes the asymptotic properties of the Aalen-Breslow-type estimators for the cumulative baseline hazard functions and develops the corresponding inference procedures. We have utilized these new ideas, methodologies, and analytic techniques, which will facilitate further research and applications of statistical methodology and methods for analyzing multivariate dental failure time data, such as dental implant research.
In summary, clustered survival observations are frequently encountered in many different areas of patient-oriented dental research. Commonly, the key issue of dependence structure of clustered observations is ignored in analyses, resulting in statistically invalid estimates and inflated type I, risking erroneous rejection of the null hypotheses. Additionally, most current methods to adjust for clustered survival observations, i.e., randomly selecting one observation per patient, result in inefficient estimation. The methods used in this study produce statistically valid and efficient estimates. In their current state, however, these survival methods are resource-intensive, requiring high-level programming and methodological biostatistical expertise with intensive computer time. Regardless, in the setting of correlated survival observations, we recommend adjusting for the correlation of the observations to provide statistically valid variance of the Aalen-Breslow estimator under investigation to predict survival for a given set of covariates. Future efforts are focused on making these important analytic methods more accessible to the average patient-oriented researcher. Predicting survival estimation based on clustered multivariate data continues to be an important, challenging, but under-investigated, biostatistical problem in patient-oriented dental research. Future research efforts will focus on the application of mixed-effects (frailty) and accelerated failure time (AFT) survival models to datasets composed of clustered survival observations.
| ACKNOWLEDGMENTS |
|---|
Received January 14, 2002; Last revision August 7, 2002; Accepted September 10, 2002
| REFERENCES |
|---|
|
|
|---|
Buser D, Mericske-Stern R, Bernard JP, Behneke A, Behneke N, Hirt HP, et al. (1997). Long-term evaluation of non-submerged ITI implants. Part 1: 8-year life table analysis of a prospective multi-center study with 2359 implants. Clin Oral Implants Res 8:161172.[Medline]
Cheng SC, Wei LJ, Ying Z (1995). Analysis of transformation models with censored data. Biometrika 82:835845.
Cheng SC, Wei LJ, Ying Z (1997). Prediction of survival probabilities with semi-parametric transformation models. J Am Statist Assoc 92:227235.
Chuang SK, Tian L, Wei LJ, Dodson TB. (2001). Kaplan-Meier analysis of dental implant survival: a strategy for estimating survival with clustered observations. J Dent Res 80:20162020.
Chuang SK, Wei LJ, Douglass CW, Dodson TB (2002). Risk factors for dental implant failure: a strategy for the analysis of clustered failure time observations. J Dent Res 81:572577.
Cox DR (1972). Regression models and life-tables (with discussion). J R Statist Soc Series B 34:187220.
Dental Implant Clinical Research Group (1997). Introduction. J Oral Maxillofac Surg 55(Suppl 5):711.[Medline]
Gomez-Roman G, Kruppenbacher M, Weber H, Schulte W (2001). Immediate postextraction implant placement with root-analog stepped implants: surgical procedure and statistical outcome after 6 years. Int J Oral Maxillofac Implants 16:503513.[Medline]
Haas R, Mensdorff-Pouilly N, Mailath G, Watzek G (1996). Survival of 1,920 IMZ implants followed for up to 100 months. Int J Oral Maxillofac Implants 11:581588.[Medline]
Herrmann I, Lekholm U, Holm S, Karlsson S (1999). Impact of implant interdependency when evaluating success rates: a statistical analysis of multicenter results. Int J Prosthodont 12:160166.[Medline]
Higuchi KW, Folmer T, Kultje C (1995). Implant survival rates in partially edentulous patients: a 3-year prospective multicenter study. J Oral Maxillofac Surg 53:264268.[Medline]
Lekholm U, Gunne J, Henry P, Higuchi K, Linden U, Bergstrom C, et al. (1999). Survival of the Brånemark implant in partially edentulous jaws: a 10-year prospective multicenter study. Int J Oral Maxillofac Implants 14:639645.[Medline]
Lee EW, Wei LJ, Amato DA (1992). Cox-type regression analysis for large numbers of small groups of correlated failure time observations. In: Survival analysis: state of the art. Klein JP, Goel PK, editors. Dordrecht, Boston, London: Kluwer Academic, pp. 237-247.
Liang KY, Zeger SL (1986). Longitudinal data analysis using generalized linear models. Biometrika 73:1322.
Lin DY (1994). Cox regression analysis of multivariate failure time data: the marginal approach. Stat Med 13:22332247.[Medline]
SAS Institute Inc. (1999). SAS System. Version 8. Cary, NC: SAS Institute Inc.
S-Plus (1996). StatSci, A division of MathSoft, Inc., Seattle, WA.
Spiekerman CF, Lin DY (1998). Marginal regression models for multivariate failure time data. J Am Stat Assoc 93:11641175.
Testori T, Wiseman L, Woolfe S, Porter SS (2001). A prospective multicenter clinical study of the Osseotite implant: four-year interim report. Int J Oral Maxillofac Implants 16:193200.[Medline]
Vaida F, Xu R (2000). Proportional hazards model with random effects. Stat Med 19:33093324.[Medline]
Vehemente V, Chuang SK, Daher S, Muftu A, Dodson TB (2002). Risk factors affecting dental implant survival. J Oral Implantol 28:7481.[Medline]
Wei LJ, Lin DY, Weissfeld L (1989). Regression analysis of multivariate incomplete failure time data by modeling marginal distributions. J Am Stat Assoc 84:10651073.
Weibrich G, Buch RS, Wegener J, Wagner W (2001). Five-year prospective follow-up report of the Astra tech standard dental implant in clinical treatment. Int J Oral Maxillofac Implants 16:557562.[Medline]
Wheeler SL (1996). Eight-year clinical retrospective study of titanium plasma-sprayed and hydroxyapatite-coated cylinder implants. Int J Oral Maxillofac Implants 11:340350.[Medline]
This article has been cited by other articles:
![]() |
S.-K. Chuang and T. Cai Predicting Clustered Dental Implant Survival Using Frailty Methods. J. Dent. Res., December 1, 2006; 85(12): 1147 - 1151. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. A Gerds and M. Vogeler Endpoints and survival analysis for successful osseointegration of dental implants Statistical Methods in Medical Research, December 1, 2005; 14(6): 579 - 590. [Abstract] [PDF] |
||||
![]() |
M.C.M. Wong, K.F. Lam, and E.C.M. Lo Bayesian Analysis of Clustered Interval-censored Data J. Dent. Res., September 1, 2005; 84(9): 817 - 821. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| IADR Journals | Advances in Dental Research ® |
| Journal of Dental Research ® | Critical Reviews (1990-2004) |