Reliability measures for local nodes assessment in classification trees
Data(s) |
01/01/2003
|
---|---|
Resumo |
Most of the modem developments with classification trees are aimed at improving their predictive capacity. This article considers a curiously neglected aspect of classification trees, namely the reliability of predictions that come from a given classification tree. In the sense that a node of a tree represents a point in the predictor space in the limit, the aim of this article is the development of localized assessment of the reliability of prediction rules. A classification tree may be used either to provide a probability forecast, where for each node the membership probabilities for each class constitutes the prediction, or a true classification where each new observation is predictively assigned to a unique class. Correspondingly, two types of reliability measure will be derived-namely, prediction reliability and classification reliability. We use bootstrapping methods as the main tool to construct these measures. We also provide a suite of graphical displays by which they may be easily appreciated. In addition to providing some estimate of the reliability of specific forecasts of each type, these measures can also be used to guide future data collection to improve the effectiveness of the tree model. The motivating example we give has a binary response, namely the presence or absence of a species of Eucalypt, Eucalyptus cloeziana, at a given sampling location in response to a suite of environmental covariates, (although the methods are not restricted to binary response data). |
Identificador | |
Idioma(s) |
eng |
Publicador |
American Statistical Association |
Palavras-Chave | #Statistics & Probability #Binary Classification #Bootstrap #Cart #Local Assessment #Spatial Prediction Maps #Logistic-regression #C1 #300803 Natural Resource Management #780101 Mathematical sciences |
Tipo |
Journal Article |