An Experimental Evaluation of Boosting Methods for Classification

Journal:Methods of Information in Medicine
ISSN:0026-1270
DOI:http://dx.doi.org/10.3414/ME0543
Issue:2010 (Vol. 49): Issue 3 2010
Pages:219-229

An Experimental Evaluation of Boosting Methods for Classification

Original Article

R. Stollhoff (1), W. Sauerbrei (2), M. Schumacher (2)

(1) Max-Planck Institute for Mathematics in the Sciences, Leipzig, Germany; (2) Institute of Medical Biometry and Medical Informatics, University Medical Center, Freiburg, Germany

Summary

Objectives: In clinical medicine, the accuracy achieved by classification rules is often not sufficient to justify their use in daily practice. In order to improve classifiers it has become popular to combine single classification rules into a classification ensemble. Two popular boosting methods will be compared with classical statistical approaches. Methods: Using data from a clinical study on the diagnosis of breast tumors and by simulation we will compare AdaBoost with gradient boosting ensembles of regression trees. We will also consider a tree approach and logistic regression as traditional competitors. In logistic regression we allow to select non- linear effects by the fractional polynomial approach. Performance of the classifiers will be assessed by estimated misclassification rates and the Brier score. Results: We will show that boosting of simple base classifiers gives classification rules with improved predictive ability. However, the performance of boosting classifiers was not generally superior to the performance of logistic regression. In contrast to the computer-intensive methods the latter are based on classifiers which are much easier to interpret and to use. Conclusions: In medical applications, the logistic regression model remains a method of choice or, at least, a serious competitor of more sophisticated techniques. Refinement of boosting methods by using optimized number of boosting steps may lead to further improvement.

Keywords

Classification, simulation study, boosting, generalized additive models, diagnosis of breast tumors

DOI

http://dx.doi.org/10.3414/ME0543

You may also be interested in...

1.
Method and Application to the Classification of Cancer Types Using Gene Expression Data

Focus Theme - Boosting Methodology

Online Supplementary Material

Z. Wang (1)

Methods of Information in Medicine 2012 51 2: 162-167

http://dx.doi.org/10.3414/ME11-02-0020

2.

W. Adler, A. Peters, B. Lausen

Methods of Information in Medicine 2008 47 1: 38-46

http://dx.doi.org/10.3414/ME0348

3.

J. E. Mezzich1, I. M. Salloum2

Die Psychiatrie 2007 4 4: 262-265



Articles

You've 41 Article(s) in your Basket.