ABSTRACT
Discovering additive structure is an important step towards understanding a complex multi-dimensional function because it allows the function to be expressed as the sum of lower-dimensional components. When variables interact, however, their effects are not additive and must be modeled and interpreted simultaneously. We present a new approach for the problem of interaction detection. Our method is based on comparing the performance of unrestricted and restricted prediction models, where restricted models are prevented from modeling an interaction in question. We show that an additive model-based regression ensemble, Additive Groves, can be restricted appropriately for use with this framework, and thus has the right properties for accurately detecting variable interactions.
- Camacho, R. (1998). Inducing models of human control skills. Proc. ECML'98. Google ScholarDigital Library
- Caruana, R., Elhawary, M., Fink, D., Hochachka, W. M., Kelling, S., Munson, A., Riedewald, M., & Sorokina, D. (2006). Mining citizen science data to predict prevalence of wild bird species. KDD'06. Google ScholarDigital Library
- Christensen, R. (1996). Plane answers to complex questions, the theory of linear models. Springer. Google ScholarDigital Library
- Friedman, J. (2005). RuleFit with R. http://www-stat.stanford.edu/~jhf/R-RuleFit.html.Google Scholar
- Friedman, J. H. (2001). Greedy function approximation: a gradient boosting machine. Annals of Statistics, 29, 1189--1232.Google ScholarCross Ref
- Friedman, J. H., & Popescu, B. E. (2005). Predictive learning via rule ensembles (Technical Report). Stanford University.Google Scholar
- Guyon, I., & Elisseeff, A. (2003). An introduction to variable and feature selection. JMLR, 3. Google ScholarDigital Library
- Hooker, G. (2004). Discovering ANOVA structure in black box functions. Proc. ACM SIGKDD. Google ScholarDigital Library
- Hooker, G. (2007). Generalized functional ANOVA diagnostics for high dimensional functions of dependent variables. JCGS.Google Scholar
- Jakulin, A., & Bratko, I. (2004). Testing the significance of attribute interactions. Proc. ICML'04. Google ScholarDigital Library
- Pace, R. K., & Barry, R. (1997). Sparse spatial autoregressions. Statistics and Probability Letters, 33.Google Scholar
- Rasmussen, C. E., Neal, R. M., Hinton, G., van Camp, D., Revow, M., Ghahramani, Z., Kustra, R., & Tibshirani, R. (2003). Delve. University of Toronto. http://www.cs.toronto.edu/~delve.Google Scholar
- Ruppert, D., Wand, M. P., & Carroll, R. J. (2003). Semiparametric regression. Cambridge.Google Scholar
- Sorokina, D., Caruana, R., & Riedewald, M. (2007). Additive Groves of regression trees. Proc. ECML. Google ScholarDigital Library
- Torgo, L. (2007). Regression DataSets. www.liacc.up.pt/~ltorgo/Regression/DataSets.html.Google Scholar
Index Terms
- Detecting statistical interactions with additive groves of trees
Recommendations
The expected additive weight of trees
We consider a general additive weight of random trees which depends on the structure of the subtrees, on weight functions defined on the number of internal and external nodes and on the degrees of the nodes appearing in the tree and its subtrees. ...
Cohen's weighted kappa with additive weights
Cohen's weighted kappa is a popular descriptive statistic for summarizing interrater agreement on an ordinal scale. An agreement table with $$n\in \mathbb N _{\ge 3}$$ ordered categories can be collapsed into $$n-1$$ distinct $$2\times 2$$ tables by combining adjacent categories. Weighted kappa with ...
Comments