NDT Advance Access originally published online on December 8, 2005
Nephrology Dialysis Transplantation 2006 21(4):945-956; doi:10.1093/ndt/gfi326
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
© The Author [2005]. Published by Oxford University Press on behalf of ERA-EDTA. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org
Original Articles: Clinical Nephrology
The ERA-EDTA cohort studycomparison of methods to predict survival on renal replacement therapy
1 Renal Unit, Western Infirmary, 2 Renal Unit, Glasgow Royal Infirmary, 3 Engineering Department, University of Strathclyde, Glasgow, 4 Renal Unit, Royal Infirmary of Edinburgh, UK, 5 ERA-EDTA Registry, Department of Medical Informatics, and 6 Department of Clinical Epidemiology and Biostatistics, Academic Medical Centre, Amsterdam, The Netherlands
Correspondence and offprint requests to: Dr Colin C. Geddes, Consultant Nephrologist, Renal Unit, Level 7, Western Infirmary, Dumbarton Road, Glasgow G11 6NT, UK. Email: colin.geddes.wg{at}northglasgow.scot.nhs.uk
| Abstract |
|---|
|
|
|---|
Background. Accurate prediction of patient survival from the time of starting renal replacement therapy (RRT) is desirable, but previously published predictive models have low accuracy. We have attempted to overcome limitations of previous studies by conducting an ambidirectional inception cohort study in patients on RRT from centres throughout Europe. A conventional multivariate regression (MVR) model, a self-learning rule-based model (RBM) and a simple co-morbidity score [the Charlson score modified for renal disease (MCS)] were compared.
Methods. In 1996, all 3640 dialysis centres registered with the ERA-EDTA were invited to identify all patients on RRT for end-stage renal failure (ESRF) who died during the 28 days of February 1997 (training cohort) and all patients who started RRT in the same period (validation cohort). Fifty-four clinical and laboratory variables from the time of starting RRT were collected in both cohorts using a two-page questionnaire. The data from the training cohort were given to statisticians at the Amsterdam Academic Medical Centre to create the MVR model and to engineers in Strathclyde University to create the RBM. They were then given the baseline data from patients in the validation cohort to predict how long each patient would survive. Follow-up questionnaires were sent to the centre of each patient in the validation cohort to determine actual survival.
Results. A total of 2310 patients from 793 centres in 37 countries in the ERA-EDTA area were used to construct and validate the models. For predicting 1-year survival, the RBM had the highest positive predictive value (PPV) (84.2%), the MVR model had the highest negative predictive value (NPV) (47%) and the RBM had the highest likelihood ratio (1.59). For predicting 5-year survival, the MCS had the highest PPV (79.4%), the RBM had the highest NPV (74.3%) and the MCS had the highest likelihood ratio (7.0). The proportion of explained variance in survival for MCS, MVR and RBM was 14.6, 12.9 and 3.95%, respectively.
Conclusion. Using the ambidirectional inception cohort design of this ERA-EDTA Registry survey, we have been able to create and validate two novel instruments to predict survival in patients starting RRT and compare them with a simple scoring model. The models tended to predict 5-year survival with more accuracy than 1-year survival. Examples of potential applications include informing clinical decision making about the likely benefit of starting RRT and listing for transplantation, adjusting for baseline risk in comparative studies and identifying specific risk groups to participate in clinical trials.
Keywords: Charlson score; cohort study; Cox model; patient survival; renal replacement therapy; rule-based algorithm
| Introduction |
|---|
|
|
|---|
The 1- and 5-year survivals of patients starting renal replacement therapy (RRT) for end-stage renal failure (ESRF) in the ERA-EDTA Registry 2002 report were 82.1 and 46.9%, respectively [1], but it is recognized that there is wide inter-individual variability. Many factors from the time of starting RRT have been shown to influence survival, including age, race, primary renal diagnosis, late referral to a nephrologist, patient size and co-morbidity. Accurate prediction of patient survival from the time of starting RRT is desirable for reasons that include: informing clinical decision making about the likely benefit of starting RRT; informing health care economic planning; adjusting for baseline risk in comparative studies; and identification of specific risk groups of patients to participate in clinical trials.
Predictive models attempt to create a formal description of complex relationships that allow prediction of future behaviour (e.g. patient survival) that may also provide insight into the relationships described. Several published studies have used scoring systems and multivariate analysis to predict survival from the time of starting RRT [28]. These predictive models have limitations that include the fact that they are usually developed in a small number of patients from one geographical area, low precision of prediction and lack of validation in other patient populations. Most studies have used multivariate regression (MVR) models to predict survival.
We have attempted to overcome these problems by conducting an ambidirectional inception cohort study in a large number of patients on RRT from centres throughout Europe. An ambidirectional inception cohort study involves the comparison of a retrospective cohort and a prospective cohort [9]. In this study, data from the time of starting RRT in patients who died in February 1997 were used to develop models to predict survival on RRT. Thus, the models were developed using a retrospective cohort in which all of the subjects had reached the end-point of interest, i.e. death. The models were then validated in patients who have been followed prospectively since starting RRT in February 1997 and in whom 5-year outcome is now known. The unusual design of the study was intended to address the problems of previous studies described above.
A conventional MVR model, a self-learning rule-based model (RBM) and a simple co-morbidity score were compared. Most clinicians are familiar with the use of MVR modelling to identify risk factors for a particular dependent variable (such as patient survival). The output from a multivariate model can be used to create a score such as probability of survival for an individual patient. Self-learning RBMs have been applied less commonly in medicine [10] but are used in the financial and engineering domains [11,12]. It has been suggested that novel predictive models, such as a self-learning RBM, may have better clinical utility than an MVR model in medicine [13].
The Charlson score was originally developed to adjust for age and co-morbidity in the general population [14] but has recently been modified to apply to patients on RRT. It is a simple scoring system that adds scores for co-morbid conditions to a score for age [8].
The overall aim of the study, therefore, was to compare the clinical applicability of an MVR model, a self-learning RBM and the modified Charlson score (MCS) in predicting survival in patients starting RRT.
| Methods |
|---|
|
|
|---|
In 1996, all 3640 dialysis centres registered with the ERA-EDTA were invited to participate in the study. If they agreed, they were asked to identify all patients on RRT for ESRF, who were registered at their centre, who died during the 28 days of February 1997 and all patients who started RRT in the same period (Figure 1). The patients who died during the 28 days of February 1997 were used to train models to predict duration of survival and are referred to as the training cohort. The patients who started RRT in the same time period were used to validate the predictions of the models and are referred to as the validation cohort.
|
Clinical data from the time of starting RRT were collected in both cohorts using a two-page questionnaire. The data variables collected are shown in Table 1. For the co-morbid data, only the presence or absence of the condition at the time of starting RRT was sought; no detail about the severity of the condition was collected. For the training cohort, duration of survival on RRT was calculated. All data were checked manually for completeness and consistency by two of the coordinating nephrologists. Laboratory values were converted to SI units where appropriate. A series of internal checks was used to identify errors. If the error could not be resolved with reference to the paper record, then the patient or the individual data entry was excluded, whichever was appropriate, since one of the integral features of the design of the study was to minimize the effort required by each centre.
|
The sample size was unlikely to contain enough examples of each EDTA-ERA diagnosis code to avoid potential overfitting of the models. For this reason, primary diagnosis codes were combined into seven previously validated categories [15]. In addition, duration of care by a nephrologist before RRT was categorized as <3, 3.016, 6.0111.99 and >12 months; history of angina or myocardial infarction (MI) were combined to make ischaemic heart disease; history of angina, MI, arterial bypass surgery, symptomatic peripheral vascular disease, cerebrovascular accident or transient cerebral ischaemia were combined to make vascular disease; history of malignancy, ongoing skin malignancy or ongoing non-skin malignancy were combined to make malignancy ever. Furthermore, some variables were also used to make calculated fields: body mass index, estimated creatinine clearance (using the formula of Cockcroft and Gault [16]) and calcium phosphate product. The two groups creating the models were given exactly the same data set and the data variables were explained in detail. The teams creating the predictive models were advised that the combined or calculated fields could be used in the model if the variables used to produce them were omitted to avoid interaction of independent variables in the model (with the exception of age, gender, weight and estimated creatinine clearance). For example, it was permissible to use history of angina and history of MI in the same model but it was not permissible to use history of ischaemic heart disease and history of MI in the same model as these variables were implicitly not independent. Only three patients had renal transplant as mode of first RRT and so mode of first RRT was re-categorized as haemodialysis or not haemodialysis.
Serum cholesterol and parathyroid hormone (PTH) concentrations were also requested but were available for an insufficient number of patients to allow meaningful analysis, and the data have not been used.
For patients starting RRT, we asked participating centres to report all patients that they considered probably had ESRF. During follow-up, we were able to identify and exclude patients who had recovered independent renal function either before or after the 90th day of RRT.
The ERA-EDTA Registry committee decided that the centres that had not participated in 1997 should be again invited to participate in February 1998. The training cohorts (i.e. patients who died on RRT in February 1997 or February 1998) have been amalgamated.
The data from the training cohort were then given to the statisticians in Amsterdam to create an MVR model and the Engineering Department at Strathclyde University, Glasgow (S.M. and M.M.) to create the self-learning RBM (see below). Each team was given exactly the same data set with known duration of survival on RRT and asked to create a model that predicts patient survival. They were then given the baseline data from patients in the validation cohort (i.e. patients who started RRT in February 1997) and asked to predict how long each patient would survive. The predictions from the two groups for the validation cohort were returned to the coordinating nephrologists and held in confidence until 5-year outcome became available.
During the 6 years after inception, follow-up questionnaires were sent to the centre of each patient in the validation cohort (i.e. those patients starting RRT in February 1997) after 3 and 6 months, 1, 2, 3, 4, 5 and 6 years to ask if the patient had recovered independent renal function, was alive, had died (and if so date and cause of death), was lost to follow-up or had moved centre. Patients who recovered independent renal function were excluded from further analysis. Patients who had moved centre were traced to that centre in future follow-up.
Multivariate regression (MVR) model
The time between date of start of dialysis and date of death in the training data set was modelled with a proportional hazards regression model. The hazard function and not the density function was modelled, although there were no censored observations in this cohort. It was assumed that the risk of death at t years after start of RRT [denoted by h(t)] was a function of an unspecified baseline hazard function [h0(t)] multiplied by a relative risk (RR) function: h(t) = h0(t) x RR. It was assumed that the log of the relative risk was a linear function of (i) functions of observed covariate values (X1, X2, ... , Xk); and (ii) a not-observed random region effect (zregion): ln(RR) = ln(zregion) + b1 x f1(X1) + b2 x f2(X2) + ··· + bk x fk(Xk). The random region effect was meant to account for the effects of the different health systems, economic situations and other region-specific variables that may have affected mortality in patients from the same region and was not measured by the other covariates. It was assumed that these random effects (sometimes known as frailty parameters) followed a
(
,
) distribution. The relationship between the log of the RR and each numerical covariate X[f(X)] was assessed by inspection of the marginal residuals; a linear relationship [f(X) = X] was used if possible, and, if needed, a spline function (SAS-macro %PSPLINET) [17] was used to estimate the non-linear effect of each covariate on mortality risk. Interactions between covariates were also inspected, and violation of the assumed proportionality of each covariate was assessed by evaluating interactions between covariates and time. The cumulative baseline hazard function was modelled parametrically with a polynomial function of time t:
. For the prediction of the survival of the patients in the prospective cohort, the 1- and 5-year survival probabilities S(1) and S(5) were calculated as S(t) = exp[H0(t) x exp[
x ln(RR)], where
is a shrinkage factor. This factor was derived from the results on the retrospective cohort, and was meant to guard against over-optimistic predictions.
Self-learning rule-based model (RBM)
The C5.0 rule induction algorithm attempts to derive ifthen prediction rules from a training data set, which can subsequently be used to classify unseen data [18]. Rule sets and decision trees are induced by segmenting data using partitioning lines. The self-learning RBM was created using the C5.0 rule induction facilities within the Clementine software package, developed by SPSS. The rule sets were then exported and integrated within Strathclyde University's bespoke software. The C5.0 rule induction software automatically identifies relationships between the variables within complex data sets and creates re-usable rules. In the present application, the technique predicts outcome in groups and so the nephrologists identified survival time periods of <3, 3.0112, 12.0124, 24.0159.99 and
60 months as being of clinical value.
Modified Charlson score (MCS)
MCS was calculated as described recently by Hemmelgarn et al. [8] (Table 2) with adjustment for age class as in the original description [14]. The co-morbidity score is added to the age score (one additional point for each decade beginning at 40 years). Disseminated malignancy was not specified in our questionnaire and so the only deviation we made from the MCS described in the paper by Hemmelgarn et al. was to award 5 points for history of neoplasia rather than the 10 points for disseminated malignancy. Sensitivity analysis demonstrated that this allocation had no influence on the subsequent results presented (data not shown).
|
Statistical comparisons
The baseline data in the training and validation cohorts were compared by t-test of the mean, MannWhitney test or
2 test where appropriate. After application of Bonferroni correction for multiple comparisons, a P-value of <0.001 was regarded as significant for the comparison of the 50 baseline variables. Patient survival in the training and validation cohorts was compared by log-rank test of a KaplanMeier plot.
Survival predictions of the MVR model for individual patients were brought together in the same pre-defined groups as the RBM (<3, 3.0112, 12.0124, 24.0159.99 and
60 months). The MCSs were combined to achieve the same five predicted groups based on the MCSs in the training cohort.
The positive predictive values (PPVs) and negative predictive values (NPVs) for 1- and 5-year survival in the validation cohort were analysed to compare the utility of the models for individual patient prediction. The likelihood ratios for predicting 1- and 5-year survival were calculated, using the 1- and 5-year survival from the training cohort as the pre-test probability. The ability of the models to identify groups of patients with similar risk was compared by graphing the median and interquartile range (IQR) of the actual survival of groups of validation cohort patients predicted to survive <3, 3.0112, 12.0124, 24.0159.99 and
60 months.
Finally, the proportion of explained variation (PEV) for each model (i.e. how much of the total variation in survival can be explained by each model) was compared using the method of Heinze and Schemper [19].
Statistical comparative tests were performed using SPSS v10.0 (SPSS Inc., 2001) and Microsoft Excel 2002 (Microsoft Corporation, 2001).
| Results |
|---|
|
|
|---|
Of the 3640 dialysis centres registered with the ERA-EDTA in 1997, 1220 (33.5%) centres from 37 countries agreed to participate in the study and either contributed patient data or indicated that no patients started RRT or died on RRT during the inception period.
A total of 1217 patient data forms were returned for patients who died during the inception period (training cohort). Of these, 66 forms contained invalid data (such as date of death outside the inception period, date of starting RRT apparently before birth, etc.), or had important missing data such as date of birth or sex. Only 12 patients were <16 years old and a decision was made to omit these from the models and apply the models only to adults. This left 1139 patients with sufficient data to be used to construct the predictive models. This number includes 296 patients who started RRT in February 1998 as a result of the decision by the ERA-EDTA Registry committee that the centres that had not participated in 1997 should be again invited to participate in February 1998.
A total of 1236 patient data forms were returned for patients who had started RRT in February 1997 (validation cohort). Of these, four forms contained invalid data or had important missing data. Eighteen patients were <16 years old and were omitted. Forty-three patients recovered renal function during the follow-up period and are not included in the analysis. This left 1171 patients with sufficient data to be used to validate the models.
The 2310 adult patients described above, who were used to construct and validate the models, were derived from 793 centres from 37 countries in the ERA-EDTA area. The median number of patients contributed per centre was two (range 115) and the median number of centres per country was 12.5 with a range of 1 (Ireland, Cyprus and Malta) to 117 (Italy). The median number of patients per country was 31 with a range of 1 (Cyprus) to 338 (Italy) (Figure 2). The geographical distribution of patients in the training cohort was similar to that of the validation cohort.
|
Baseline data
The baseline data are shown in Table 1, with a comparison between the training and validation cohorts. At the time of starting RRT, patients in the validation cohort were significantly younger, had been cared for by a nephrologist for longer, were less likely to have ischaemic heart disease, vascular disease or hepatitis C infection, had a higher body mass, a lower serum calcium, a higher serum albumin, and were more likely to have been on erythropoietin before starting RRT.
Patient survival
By definition, survival data were available for 100% of the training cohort. For the validation cohort, 1-year survival status was available for 1021 of the 1171 patients (87.2%) and 5-year survival status for 851 patients (72.7%). In the subsequent analysis of model predictions, only patients with known survival status are included.
KaplanMeier survival curves for the two cohorts are shown in Figure 3. The survival of patients in the validation cohort was significantly longer than in the training cohort (median survival 46.4 vs 26.0 months; P<0.0001 log rank test). The 1- and 5-year actuarial survivals in the validation cohort were 77.5 and 41.7%, respectively, compared with 69.2 and 26.5%, respectively, in the training cohort.
|
MVR model
By means of non-linear curve fitting, it was found that the baseline survivor function was very well described by
![]() |
The variables included in the final model and their influences on survival are shown in Table 3. Twenty-four independent variables were included and their influences on patient survival are consistent with previously published multivariate analyses with the exceptions that history of peripheral vascular disease and hepatitis C infection were both independent predictors of longer survival. The final predictive model based on the results in Table 3 with the incorporation of the frailty factor and theoretical shrinkage factor was:
![]() |
|
RBM
After selection of different rules, the final rule was >450 lines long and is, therefore, not presented here in full. To illustrate the principle of the rule structure, however, Figure 4 provides a visual representation of one of the rule sets generated through the analysis. In that example, the starting point is age at first RRT, indicated in green. From the database, if this was unknown, then the ensuing survival rate was <3 months. If the branch for age at first RRT >75.53 years is considered, then the next variable for classification was PTH. Depending on this measure, the subsequent variables of interest vary until a classification is achieved. The complete rule had >450 clauses within it. When applied to the validation cohort, the rule was incapable of categorizing 11 patients based on their entry data.
|
MCS
The range of Charlson scores was 015 with the median score being 4 (IQR 36). The MCSs in the training cohort were used to identify the previously defined groups as follows: MCS 915 predicts survival <3 months (n = 93) in the validation cohort; MCS 68 predicts survival 3.0112 months (n = 237); MCS 45 predicts survival 12.0124 months (n = 251); MCS 23 predicts survival 24.0159.99 months (n = 173); and MCS 01 predicts survival
60 months (n = 97).
Comparing the three predictive models
For predicting 1-year survival, the RBM had the highest PPV (84.2%), the MVR model had the highest NPV (47%) and the RBM had the highest likelihood ratio (1.59) (Table 4). For predicting 5-year survival, the MCS had the highest PPV (79.4%), the RBM had the highest NPV (74.3%) and the MCS had the highest likelihood ratio (7.0). In other words, 79.4% of patients with a MCS of 0 or 1 survived >5 years; 74.3% of patients predicted by the RBM to have a survival of <5 years died within 5 years; and patients who survived >5 years were seven times more likely to have an MCS of 0 or 1 than patients who died within 5 years.
|
Figure 5 shows the comparison of the ability of the three models to discriminate groups of patients with similar risk. For the MVR model and the MCS, there is a close correlation between predicted survival and actual median survival in these pre-defined groups. The IQRs are narrower in the MVR model, consistent with better discriminatory power. The RBM shows a close correlation between predicted survival and actual median survival for the groups 312, 1224, 2460 and
60 months, but the predicted survival <3 months group's median survival was 26.1 months (IQR 7.854.3). The MVR model and MCS both predicted
60 month survival with precision (median survival 60 months; IQR 6060). It should be emphasized that the maximum possible survival in the validation cohort at the stage of the present analysis was 60 months.
|
The observation in Figure 5 is consistent with the calculated PEV using the method of Heinze and Schemper. This showed that the performance of the MVR model and the MCS was similar, with a PEV of 14.6 and 12.9%, respectively, while the RBM only explained 3.95% of the total variation in survival.
| Discussion |
|---|
|
|
|---|
There have been many previous attempts to predict survival in patients starting RRT [28]. This study was designed in an attempt to improve on previous predictive models that were limited by the fact that they were usually developed in a small number of patients from one geographical area, had low precision of prediction, did not include many variables that have subsequently been shown to influence survival and lacked validation in other patient populations. The large number of patients from many countries makes the results generalizable.
We also wanted to compare multiple regression modelling, the self-learning rule-based algorithm and a simple bedside scoring system. At the time the study was designed, it was being suggested that novel artificial intelligence techniques such as a self-learning rule-based algorithm may be better at predicting outcome in medicine because regression modelling was perceived to make assumptions about the input data and could not resolve relationships between independent and dependent variables that were not linear [20,21]. On the other hand, it is acknowledged that novel self-learning techniques can have limited generalizability due to over-fitting to the training data set. Furthermore, refinements can be made to the MVR technique such as testing for interactions, SPLINE functions and frailty factors, that were applied in the present study, that can overcome many of these perceived limitations. It was important to compare these sophisticated techniques with an established simple scoring system and we selected the Charlson score because it has been validated in renal patients, has a wide range of possible scores and has also recently been modified for patients on RRT [8].
The ambidirectional inception cohort study design is unusual. The design takes advantage of the fact that once a patient starts RRT they remain under the care of nephrologists for the remainder of their lives, with the exception of the few patients who recover renal function unexpectedly. It meant that, unlike previous studies, all of the subjects in the training cohort had reached the end-point of interest, and a prospectively followed validation cohort was included within the same study. We anticipated that, because the inception period was only 1 month, each centre would only be required to provide information on a small number of patients, and the case records of patients who had died while on RRT (training cohort), and patients who had started RRT (validation cohort), would be easily available to the participating nephrologists at the time of sending the questionnaires.
We anticipated that the two cohorts would be comparable, as the majority of patients in the training cohort would have started RRT within the previous 5 years when practices were similar. We were surprised to find that the patients in the validation cohort were significantly younger (mean 60.4 vs 63.4 years), had been cared for by a nephrologist for longer and were less likely to have ischaemic heart disease or vascular disease; though not surprised that they were more likely to have been on erythropoietin before starting RRT. This is not consistent with data from all renal registries that show that with time, the age and co-morbidity of patients starting RRT has increased. The baseline data and 5-year survival of the validation cohort are consistent with the EDTA-ERA Registry report for the same period [22]. There may be several reasons why patients with a high burden of co-morbidity, and hence a shorter survival, are over-represented in the training cohort. First, most of the patients in the training cohort started RRT during the period of rapidly increasing acceptance rates in Europe prior to the inception period of the study [23]. This increased acceptance rate in Europe was largely due to an increased acceptance of older patients with multiple co-morbidities. Since these were the same patients who were likely to have short survival, they are likely to be over-represented in an inception cohort of patients dying in February 1997 and February 1998 compared with a cohort of patients starting RRT in February 1997. Secondly, it is likely that some long survivors (who were young and had low co-morbidity when they started RRT), who happened to die in February 1997, are missing from the training cohort because nephrologists may not have become aware of their death until after the survey was carried out. Thirdly, reported co-morbidity may be higher because of assumptions made about co-morbidity at the time of starting RRT based on events that happened after starting RRT. Finally, the significantly better survival in the validation cohort may be partly explained by changes in practice in recent years (such as more attention to anaemia before RRT is required). Despite these differences in the training and validation cohorts, the underlying biology is likely to be the same; even if the proportions of patients with particular attributes is not the same in both groups, the risk associated with these attributes should still be contained within the data sets and the comparison of the models therefore remains valid.
Comparing predictive models is difficult. The output of the MVR model is continuous, whereas the outputs of the RBM and MCS are categorical. We decided to compare the ability to predict clinically useful aspects of survival: 1- and 5-year survival. The PPV and NPV are useful statistics to quote when applying the results of the models predictions to individual patients. The PPV tells the user the probability that a patient predicted by the test to survive beyond the selected time period will actually survive beyond that time period. The likelihood ratio informs about the utility of the test in the context of the pre-test survival probability. The PPVs, NPVs and likelihood ratios for predicting individual survival were broadly similar for all three predictive models but generally better for the MVR model and MCS. All three models were better at predicting 5-year than 1-year survival. The predictions of 5-year survival could be useful in the clinic setting; for example a patient with an MCS of 0 or 1 can be informed that, although the overall 5-year survival for patients starting RRT is 35.5%, the PPV of the MCS increases their probability of 5-year survival to 79.4% (Table 3). This may be used for example to support decisions regarding transplantation: in the UK, it is stipulated that expected survival of <5 years is a contra-indication to listing for cadaveric renal transplantation [24]. The utility of the MCS in predicting 5-year survival is reflected by the likelihood ratio of 7.0; patients who survived >5 years were seven times more likely to have an MCS of 0 or 1 than patients who died within 5 years. It has been arbitrarily suggested that a likelihood ratio of 2.0 is the minimum for utility in clinical practice [25].
All three models can identify groups of patients with different survival risk (Figure 5). All three models were able to identify a group of patients with predicted survival
60 months with sufficient precision to be clinically useful, although the precision of the RBM was less than that of the other two models.
The ability to identify a group of patients with low survival might be of more value. The MVR model was able to identify a group of patients with very low (<3 months) predicted survival with useful accuracy and precision, but the prediction applied to only seven patients (0.82%) in the validation cohort. Combining the groups predicted by the MVR model to survive <3 months or 3.0112 months into one group predicted to survive <12 months provides 121 patients (14.2%) in the validation cohort with actual median survival of 10.7 months (IQR 326.7). Again this would be useful when discussing transplant listing with individual patients. Furthermore, selecting patients in this way for entry into clinical trials of interventions to reduce mortality on RRT would greatly reduce the sample size required to achieve adequate statistical power. Neither the MCS nor the RBM were able to identify a similar sized group of patients with such low survival with equal precision.
There are other considerations when comparing these predictive models. The MCS is easy to apply and could be calculated quickly on paper. It is also easy to explain to a patient. The MVR model and RBM predictions require computerized calculation although with increasing use of electronic patient records there is no reason why this could not be done in the clinic setting. The clinician needs to consider how representative the patients in the study are compared with their own patients. In this respect, our study is likely to be more representative of European dialysis patients than previous studies, but there is inevitably a bias toward the study being more representative of the countries and centres that contributed the most patients.
There are obvious reasons why the models cannot predict outcome with 100% accuracy. The first is that death is not always a direct result of the illness of interest, e.g. death in a road traffic accident. The models clearly identify co-morbid conditions as important predictors of survival, and this is consistent with all previous studies. This study and most others are limited by the fact that a co-morbid condition such as ischaemic heart disease may be present but not clinically evident and by the fact that severity of co-morbid conditions is not included. In a recent study by van Manen et al., however, adding severity of co-morbid conditions did not seem to increase the predictive power of commonly used co-morbidity indices [6]. Some factors that are likely to have a major influence on survival were included such as intravenous drug abuse but, because of the low prevalence in the training cohort, they were not identified by the models as of major importance. Finally, it is well known that clinical factors after the start of RRT, such as adequacy of dialysis [26] and renal transplant [27], influence mortality.
| Conclusion |
|---|
|
|
|---|
Using the ambidirectional inception cohort design of this ERA-EDTA Registry survey, we have been able to create and validate two novel instruments to predict survival in patients starting RRT. The Cox regression hazards model has become the standard for examining the relationship between independent variables and survival, and our data demonstrate the continued value. We have also validated the recently described modification of the generic Charlson score for patients on RRT.
We have shown that these models are good at predicting long-term survival but not good at predicting short-term survival for individual patients. The MVR model and the MCS could, however, be used to select a group of patients at high risk of short survival.
| Acknowledgments |
|---|
The authors wish to express their gratitude to the representatives from the many centres who generously took the time to complete the questionnaires that made the study possible.
Conflict of interest statement. None declared.
| References |
|---|
|
|
|---|
- ERA-EDTA Registry. ERA-EDTA Registry 2002 Annual Report. 46. Academic Medical Centre, Amsterdam, The Netherlands, 2004
- Wright LF. Survival in patients with end-stage renal disease. Am J Kidney Dis 1991; 17: 2528[Web of Science][Medline]
- Khan IH, Catto GR, Edward N, Fleming LW, Henderson IS, MacLeod AM. Influence of coexisting disease on survival on renal-replacement therapy. Lancet 1993; 341: 415418[CrossRef][Web of Science][Medline]
- Chandna SM, Schulz J, Lawrence C, Greenwood RN, Farrington K. Is there a rationale for rationing chronic dialysis? A hospital based cohort study of factors affecting survival and morbidity. Br Med J 1999; 318: 217223
[Abstract/Free Full Text] - Davies SJ, Phillips L, Naish PF, Russell GI. Quantifying comorbidity in peritoneal dialysis patients and its relationship to other predictors of survival. Nephrol Dial Transplant 2002; 17: 10851092
[Abstract/Free Full Text] - van Manen JG, Korevaar JC, Dekker FW, Boeschoten EW, Bossuyt PM, Krediet RT. How to adjust for comorbidity in survival studies in ESRD patients: a comparison of different indices. Am J Kidney Dis 2002; 40: 8289[CrossRef][Web of Science][Medline]
- Miskulin DC, Martin AA, Brown R et al. Predicting 1 year mortality in an outpatient haemodialysis population: a comparison of comorbidity instruments. Nephrol Dial Transplant 2004; 19: 413420
[Abstract/Free Full Text] - Hemmelgarn BR, Manns BJ, Quan H, Ghali WA. Adapting the Charlson Comorbidity Index for use in patients with ESRD. Am J Kidney Dis 2003; 42: 125132[CrossRef][Web of Science][Medline]
- Grimes DA, Schulz KF. Cohort studies: marching towards outcomes. Lancet 2002; 359: 341345[CrossRef][Web of Science][Medline]
- Tsumoto S. Automated discovery of positive and negative knowledge in clinical databases. IEEE Eng Med Biol 2000; 19: 5662[Medline]
- McArthur SD, Strachan SM, Jahn G. The design of a multi-agent transformer condition monitoring system. IEEE Trans Power Syst 2004; 19: 18451852[CrossRef]
- John GH, Miller P, Kerber R. Stock selection using rule induction. IEEE Expert 1996; 11: 5258
- Kahn CE. Artificial intelligence in radiology: decision support systems. Radiographics 1994; 14: 849861[Abstract]
- Charlson ME, Pompei P, Ales KL, MacKenzie CR. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis 1987; 40: 373383[CrossRef][Web of Science][Medline]
- Van Dijk PC, Jager KJ, de Charro F et al. Renal replacement therapy in Europe: the results of a collaborative effort by the ERA-EDTA registry and six national or regional registries. Nephrol Dial Transplant 2001; 16: 11201129
[Abstract/Free Full Text] - Cockcroft DW, Gault MH. Prediction of creatinine clearance from the serum creatinine. Nephron 1976; 16: 3141[Web of Science][Medline]
- Harrell FE. Survrisk.pak with %PSPLINET MACRO. http://biostat.mc.vanderbilt.edu/twiki/bin/view/Main/SasMacros. Accessed March 23, 2005
- Quinlan R. C5.0 Programs for Machine Learning. Morgan Kaufmann, San Mateo, 1993
- Heinze G, Schemper, M. RELIMPCR and RELIMPLR, SAS-macros for the analysis of relative importance of prognostic factors in Cox and logistic regression. http://www.akh-wien.ac.at/imc/biometrie/programme/relimp/. 2004. Accessed March 23, 2005
- Baxt WG, Skora J. Prospective validation of artificial neural network trained to identify acute myocardial infarction. Lancet 1996; 347: 1215[CrossRef][Web of Science][Medline]
- Dybowski R, Weller P, Chang R, Gant V. Prediction of outcome in critically ill patients using artificial neural network synthesised by genetic algorithm. Lancet 1996; 347: 11461150[CrossRef][Web of Science][Medline]
- ERA-EDTA Registry Annual Report 1998. http://www.era-edta-reg.org/files/annualreports/pdf/AnnRep1998.pdf. 2005. Accessed June 6, 2005
- Stengel B, Billon S, Van Dijk PC et al. Trends in the incidence of renal replacement therapy for end-stage renal disease in Europe, 19901999. Nephrol Dial Transplant 2003; 18: 18241833
[Abstract/Free Full Text] - NHS UK Transplant. Transplant list criteria for potential renal transplant patients. http://www.uktransplant.org.uk/ukt/about_transplants/organ_allocation/kidney_(renal)/national_protocols_and_guidelines/protocols_and_guidelines/transplant_list_criteria.jsp. 2003. Accessed August 28, 2005
- Jaeschke R, Guyatt G, Sackett DL. Users guides to the medical literature. III. How to use an article about a diagnostic test. A. Are the results of the study valid? Evidence-Based Medicine Working Group. J Am Med Assoc 1994; 271: 389391
[Abstract/Free Full Text] - Charra B, Calemard E, Ruffet M et al. Survival as an index of adequacy of dialysis. Kidney Int 1992; 41: 12861291[Web of Science][Medline]
- Wolfe RA, Ashby VB, Milford EL et al. Comparison of mortality in all patients on dialysis, patients on dialysis awaiting transplantation, and recipients of a first cadaveric transplant. N Engl J Med 1999; 341: 17251730
[Abstract/Free Full Text]
Accepted in revised form: 18.11.05
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
P. Saudan, M. Kossovsky, G. Halabi, P. Y. Martin, T. V. Perneger, and for the Western Switzerland Dialysis Study Group Quality of care and survival of haemodialysed patients in western Switzerland Nephrol. Dial. Transplant., June 1, 2008; 23(6): 1975 - 1981. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M Mauri, M. Cleries, E. Vela, and C. R. Registry Design and validation of a model to predict early mortality in haemodialysis patients Nephrol. Dial. Transplant., May 1, 2008; 23(5): 1690 - 1696. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||







