EconPapers    
Economics at your fingertips  
 

Interpretability via Random Forests

Clément Bénard (), Sébastien Da Veiga () and Erwan Scornet ()
Additional contact information
Clément Bénard: Digital Sciences & Technologies, Safran Tech
Sébastien Da Veiga: Digital Sciences & Technologies, Safran Tech
Erwan Scornet: Institut Polytechnique de Paris, CMAP, École Polytechnique

Chapter Chapter 3 in Interpretability for Industry 4.0: Statistical and Machine Learning Approaches, 2022, pp 37-84 from Springer

Abstract: Abstract Although there is no consensus on a precise definition of interpretability, it is possible to identify several requirements: “simplicity, stability, and accuracy”, rarely all satisfied by existing interpretable methods. The structure and stability of random forests make them good candidates to improve the performance of interpretable algorithms. The first part of this chapter focuses on rule learning models, which are simple and highly predictive algorithms, but very often unstable with respect to small data perturbations. A new algorithm called SIRUS, designed as the extraction of a compact rule ensemble from a random forest, considerably improves stability over state-of-the-art competitors, while preserving simplicity and accuracy. The second part of this chapter is dedicated to post-hoc methods, in particular variable importance measures for random forests. An asymptotic analysis of Breiman’s MDA (Mean Decrease Accuracy) shows that this measure is strongly biased using a sensitivity analysis perspective. The Sobol-MDA algorithm is introduced to fix the MDA flaws, replacing permutations by projections. An extension to Shapley effects, an efficient importance measure when input variables are dependent, is then proposed with the SHAFF algorithm.

Date: 2022
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-031-12402-0_3

Ordering information: This item can be ordered from
http://www.springer.com/9783031124020

DOI: 10.1007/978-3-031-12402-0_3

Access Statistics for this chapter

More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2026-05-21
Handle: RePEc:spr:sprchp:978-3-031-12402-0_3