Definition
A random forest is an Ensemble of randomized Decision Trees that makes predictions by combining the predictions of the individual trees. Randomness can be introduced into the decision tree construction method in several ways. A random forest can be used to predict nominal target attributes (Classification) or numeric target attributes (Regression). Random forests are among the best-performing predictive models.
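As an illustration of how the individual tree predictions might be combined, the sketch below uses a majority vote for nominal targets and an arithmetic mean for numeric targets. These two combination rules are common choices and are assumed here; the entry does not prescribe a particular rule. The function names and the example values are hypothetical.

```python
from collections import Counter
from statistics import mean


def combine_classification(tree_predictions):
    """Return the label predicted by the largest number of trees (majority vote)."""
    return Counter(tree_predictions).most_common(1)[0][0]


def combine_regression(tree_predictions):
    """Return the average of the trees' numeric predictions."""
    return mean(tree_predictions)


# Hypothetical outputs of five classification trees and four regression trees
# for a single example.
votes = ["tumor", "normal", "tumor", "tumor", "normal"]
estimates = [2.0, 2.5, 3.0, 2.5]

print(combine_classification(votes))
print(combine_regression(estimates))
```

In this sketch every tree carries equal weight; weighted votes or averaged class probabilities are alternative combination schemes.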
Characteristics
Random Forest Construction
The term random forests was introduced by Breiman (2001) and is a collective term for decision tree ensembles in which each tree is constructed using some random process. Random forest variants differ in how randomness is introduced into the tree-building process. In Bagging (Breiman 1996), each tree is constructed from a bootstrap sample of the Training Set. The randomized outputs method (Breiman 1999) randomly permutes the target attributes before constructing the trees....
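The Bagging variant described above can be sketched in a few lines. This is a minimal illustration under simplifying assumptions, not the construction used by any particular package: each "tree" is a one-split stump on a single numeric attribute, the toy training set is invented, and all function names are hypothetical. A bootstrap sample is taken here to be a sample of the same size as the training set, drawn uniformly with replacement.

```python
import random
from collections import Counter


def bootstrap_sample(data, rng):
    """Draw len(data) examples from data uniformly at random, with replacement."""
    return [rng.choice(data) for _ in data]


def train_stump(sample):
    """Fit a one-split tree on (x, label) pairs by choosing the threshold
    that misclassifies the fewest examples in the sample."""
    best = None
    for threshold in sorted({x for x, _ in sample}):
        left = [y for x, y in sample if x <= threshold]
        right = [y for x, y in sample if x > threshold]
        left_label = Counter(left).most_common(1)[0][0]
        # An empty right branch falls back to the left branch's label.
        right_label = Counter(right).most_common(1)[0][0] if right else left_label
        errors = (sum(y != left_label for y in left)
                  + sum(y != right_label for y in right))
        if best is None or errors < best[0]:
            best = (errors, threshold, left_label, right_label)
    _, threshold, left_label, right_label = best
    return lambda x: left_label if x <= threshold else right_label


def train_bagged_ensemble(data, n_trees, seed=0):
    """Train n_trees stumps, each on its own bootstrap sample of data."""
    rng = random.Random(seed)
    return [train_stump(bootstrap_sample(data, rng)) for _ in range(n_trees)]


def predict(ensemble, x):
    """Combine the individual predictions by majority vote."""
    return Counter(tree(x) for tree in ensemble).most_common(1)[0][0]


# Invented toy training set: one numeric attribute, two classes.
training_set = [(0.1, "a"), (0.4, "a"), (0.5, "a"),
                (1.5, "b"), (1.8, "b"), (2.2, "b")]
ensemble = train_bagged_ensemble(training_set, n_trees=25)
print(predict(ensemble, 0.05), predict(ensemble, 3.0))
```

Because every stump sees a different bootstrap sample, the chosen thresholds vary from tree to tree; that variation is the only source of randomness in this sketch.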
References
Amit Y, Geman D (1997) Shape quantization and recognition with randomized trees. Neural Comput 9:1545–1588
Breiman L (1996) Bagging predictors. Mach Learn 24(2):123–140
Breiman L (1999) Using adaptive bagging to debias regressions. Technical report, Statistics Department, University of California, Berkeley
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Díaz-Uriarte R, Alvarez de Andrés S (2006) Gene selection and classification of microarray data using random forest. BMC Bioinformatics 7:3
Dietterich TG (2000) An experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting, and randomization. Mach Learn 40(2):139–157
Ho T (1998) The random subspace method for constructing decision forests. IEEE Trans Pattern Anal Mach Intell 20(8):832–844
Liaw A, Wiener M (2002) Classification and regression by randomForest. R News 2(3):18–22
Wu B, Abbott T, Fishman D, McMurray W, Mor G, Stone K, Ward D, Williams K, Zhao H (2003) Comparison of statistical methods for classification of ovarian cancer using mass spectrometry data. Bioinformatics 19(13):1636–1643
Copyright information
© 2013 Springer Science+Business Media, LLC
About this entry
Cite this entry
Vens, C. (2013). Random Forest. In: Dubitzky, W., Wolkenhauer, O., Cho, KH., Yokota, H. (eds) Encyclopedia of Systems Biology. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-9863-7_612
DOI: https://doi.org/10.1007/978-1-4419-9863-7_612
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-9862-0
Online ISBN: 978-1-4419-9863-7
eBook Packages: Biomedical and Life Sciences; Reference Module Biomedical and Life Sciences