Personal background
Ewout Steyerberg:
Erasmus MC, Rotterdam, the Netherlands
Performance objectives
To be able to explain why validation is necessary for predictive models
To be able to judge the adequacy of a validation procedure
a: intercept
bi: regression coefficient i
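The intercept and coefficients combine into a linear predictor, a + b1*x1 + b2*x2 + ... The sketch below computes it for one patient. The logistic transformation to a probability is an assumption made here because the later slides speak of predicted probabilities of 30-day mortality; the coefficient values and the function names are hypothetical and chosen only for illustration.

```python
import math

def linear_predictor(intercept, coefficients, values):
    """Return a + sum(b_i * x_i) for one patient."""
    return intercept + sum(b * x for b, x in zip(coefficients, values))

def logistic(lp):
    """Map a linear predictor to a probability (logistic regression)."""
    return 1.0 / (1.0 + math.exp(-lp))

# Hypothetical intercept, coefficients, and predictor values.
a = -3.0
b = [0.5, 1.0]
x = [2.0, 1.0]

lp = linear_predictor(a, b, x)  # -3.0 + 0.5*2.0 + 1.0*1.0 = -1.0
risk = logistic(lp)             # roughly 0.27
print(lp, round(risk, 3))
```

Each predictor contributes b_i * x_i to the sum independently of the others, which is what additivity of effects means on the scale of the linear predictor.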
3 examples of regression
Quality of life one year after surgery:
Additivity of effects
Evaluation of predictions
Calibration: is the average of the predictions correct? Are low and high predictions correct?
Discrimination: does the model distinguish low-risk from high-risk patients?
[Figure: plot with predicted probability of 30-day mortality on the x-axis; both axes run from 0.0 to 0.4]
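Both questions can be made concrete with simple summary measures. The sketch below is one possible choice, not necessarily the one used in the lecture: calibration is summarised as the difference between the mean observed outcome and the mean predicted probability, and discrimination as the concordance (c) statistic. The function names and the eight patients are made up for illustration.

```python
def calibration_in_the_large(predicted, observed):
    """Mean observed outcome minus mean predicted probability."""
    n = len(predicted)
    return sum(observed) / n - sum(predicted) / n

def c_statistic(predicted, observed):
    """Fraction of (event, non-event) pairs in which the event has the
    higher prediction; ties count as one half."""
    events = [p for p, y in zip(predicted, observed) if y == 1]
    non_events = [p for p, y in zip(predicted, observed) if y == 0]
    concordant = 0.0
    for pe in events:
        for pn in non_events:
            if pe > pn:
                concordant += 1.0
            elif pe == pn:
                concordant += 0.5
    return concordant / (len(events) * len(non_events))

# Made-up predictions and outcomes (1 = died within 30 days).
predicted = [0.05, 0.10, 0.10, 0.20, 0.30, 0.30, 0.40, 0.15]
observed = [0, 0, 0, 0, 1, 0, 1, 0]

citl = calibration_in_the_large(predicted, observed)
c = c_statistic(predicted, observed)
print(round(citl, 3), round(c, 3))
```

A difference near zero indicates that predictions are correct on average; a c statistic of 0.5 means no discrimination and 1.0 means perfect separation of patients with and without the outcome.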
3 types of validation
Apparent: performance on the sample used to develop the model
Internal: performance on the population underlying the sample
External: performance on a related but slightly different population
Apparent validity
Easy to calculate
Internal validity
More difficult to calculate
Internally validated performance sets an upper limit to what may be expected in other settings (external validity)
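One way to estimate internal validity is a bootstrap correction for optimism. The sketch below assumes that scheme: refit the model in each bootstrap sample, compare its performance in that sample with its performance in the original sample, and subtract the average difference from the apparent performance. To stay self-contained it uses a one-predictor linear model with R-squared as the performance measure; the simulated data and all function names are illustrative assumptions.

```python
import random

def fit_line(xs, ys):
    """Least-squares intercept and slope for one predictor."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return my - slope * mx, slope

def r_squared(model, xs, ys):
    """1 - SSE/SST of a fixed model evaluated on (xs, ys)."""
    a, b = model
    my = sum(ys) / len(ys)
    sse = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))
    sst = sum((y - my) ** 2 for y in ys)
    return 1.0 - sse / sst

def optimism_corrected_r2(xs, ys, n_boot=200, seed=1):
    """Return (apparent R^2, optimism-corrected R^2)."""
    rng = random.Random(seed)
    n = len(xs)
    apparent = r_squared(fit_line(xs, ys), xs, ys)
    optimism = 0.0
    for _ in range(n_boot):
        # Draw n patients with replacement and refit the model.
        idx = [rng.randrange(n) for _ in range(n)]
        bx = [xs[i] for i in idx]
        by = [ys[i] for i in idx]
        model = fit_line(bx, by)
        # Performance in the bootstrap sample minus performance in the original.
        optimism += r_squared(model, bx, by) - r_squared(model, xs, ys)
    optimism /= n_boot
    return apparent, apparent - optimism

# Simulated data: 30 patients, one predictor, modest true effect.
data_rng = random.Random(0)
xs = [data_rng.gauss(0, 1) for _ in range(30)]
ys = [1.0 + 0.5 * x + data_rng.gauss(0, 1) for x in xs]

apparent, corrected = optimism_corrected_r2(xs, ys)
print(round(apparent, 3), round(corrected, 3))
```

The same loop applies to other models and performance measures, provided every modeling step, including any variable selection, is repeated within each bootstrap sample.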
External validity
Moderately easy to calculate when new data are available
Test the model in new data, different from the development population
development / validation
Cross-validation:
[Figure: plot with predicted probability of 30-day mortality on the x-axis; both axes run from 0.0 to 0.4]
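Cross-validation can be sketched as follows, assuming the usual k-fold scheme: the sample is split into k parts, the model is developed on k-1 parts and tested on the remaining part, and this is repeated so that each patient is used once for validation. The example uses a one-predictor linear model and mean squared error; the simulated data and function names are illustrative assumptions.

```python
import random

def fit_line(xs, ys):
    """Least-squares intercept and slope for one predictor."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return my - slope * mx, slope

def cross_validated_mse(xs, ys, k=5, seed=1):
    """Mean squared prediction error from k-fold cross-validation."""
    rng = random.Random(seed)
    idx = list(range(len(xs)))
    rng.shuffle(idx)
    folds = [idx[i::k] for i in range(k)]  # k disjoint validation parts
    sq_err = 0.0
    for fold in folds:
        held_out = set(fold)
        train = [i for i in idx if i not in held_out]
        # Develop the model without the held-out part.
        a, b = fit_line([xs[i] for i in train], [ys[i] for i in train])
        # Validate on the held-out part only.
        sq_err += sum((ys[i] - (a + b * xs[i])) ** 2 for i in fold)
    return sq_err / len(xs)

# Simulated data: 30 patients, one predictor.
data_rng = random.Random(0)
xs = [data_rng.gauss(0, 1) for _ in range(30)]
ys = [1.0 + 0.5 * x + data_rng.gauss(0, 1) for x in xs]

# Apparent error: model developed and tested on the same sample.
a_full, b_full = fit_line(xs, ys)
apparent_mse = sum((y - (a_full + b_full * x)) ** 2 for x, y in zip(xs, ys)) / len(xs)
cv_mse = cross_validated_mse(xs, ys)
print(round(apparent_mse, 3), round(cv_mse, 3))
```

For a least-squares fit the cross-validated error is never smaller than the apparent error, which illustrates why apparent performance is optimistic.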
Summary
Apparent validity gives an optimistic estimate of model performance
Internal validity may be estimated by bootstrapping
External validity should be determined in other populations
Key references
Tutorial and book on multivariable models
(Harrell 1996: Stat Med 15:361-87; Harrell 2001: Regression Modeling Strategies, Springer)
External validation
(Justice 1999: Ann Intern Med 130:515-24; Altman 2000: Stat Med 19:453-73)
Links
Interactive text book on predictive modeling
http://www.neri.org/symptom/mockup/Chapter_8/