Apr 14, 2021
Speaker · 0 followers
Speaker · 0 followers
Decision trees and their ensembles are endowed with a rich set of diagnostic tools for ranking and extracting relevant input variables in a predictive model. One of the most commonly used in practice is the Mean Decrease in Impurity (MDI), which calculates an importance score for a variable by summing the weighted impurity reductions over all non-terminal nodes split with that variable. Despite the widespread use of tree based variable importance measures such as MDI, pinning down their theoretical properties has been challenging and therefore largely unexplored. To address this gap between theory and practice, we derive rigorous finite sample performance guarantees for variable selection in additive models with MDI for a single-level decision tree (decision stump). We find that the marginal signal strength of each variable can be considerably weaker than those for state-of-the-art nonparametric variable selection methods. We also show that SDI is asymptotically equivalent to Sure Independence Screening when the model is linear with Gaussian variates. Furthermore, unlike previous methods that attempt to directly estimate each marginal regression function via a truncated basis expansion, the fitted model used here is a simple, parsimonious decision stump—eliminating the need for bandwidth calibration. Thus, surprisingly, even though single-level decision trees are highly inaccurate for prediction, they can still be used to perform consistent model selection.Decision trees and their ensembles are endowed with a rich set of diagnostic tools for ranking and extracting relevant input variables in a predictive model. One of the most commonly used in practice is the Mean Decrease in Impurity (MDI), which calculates an importance score for a variable by summing the weighted impurity reductions over all non-terminal nodes split with that variable. Despite the widespread use of tree based variable importance measures such as MDI, pinning down their theoreti…
Account · 63 followers
Category · 2.4k presentations
Category · 10.8k presentations
The 24th International Conference on Artificial Intelligence and Statistics was held virtually from Tuesday, 13 April 2021 to Thursday, 15 April 2021.
Professional recording and live streaming, delivered globally.
Presentations on similar topic, category or speaker
Joe Watson, …
Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%
Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%
Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%
Alden Green, …
Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%
Fan Zhou, …
Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%
Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%