Cluster-Aware Grid Layout

Zhen Li1     Weikai Yang2     Jun Yuan1     Jing Wu3     Changjian Chen4     Yao Ming5     Fan Yang6     Hui Zhang1     Shixia Liu1

1Tsinghua University       2Hong Kong University of Science and Technology (Guangzhou)       3Cardiff University       4Hunan University       5Citadel Securities       5Kuaishou Technology

Teaser Image
Teaser Image

Fig. 1: RuleExplorer: (a) attribute view shows the attribute distribution and enables sample filtering; (b) matrix view shows the representative rules at a certain level of the rule hierarchy; (c) info view shows the overall statistics of the displayed rules and samples; (d) data table lists the samples covered by the displayed rules.

Abstract

The high performance of tree ensemble classifiers benefits from a large set of rules, which, in turn, makes the models hard to understand. To improve interpretability, existing methods extract a subset of rules for approximation using model reduction techniques. However, by focusing on the reduced rule set, these methods often lose fidelity and ignore anomalous rules that, despite their infrequency, play crucial roles in real-world applications. This paper introduces a scalable visual analysis method to explain tree ensemble classifiers that contain tens of thousands of rules. The key idea is to address the issue of losing fidelity by adaptively organizing the rules as a hierarchy rather than reducing them. To ensure the inclusion of anomalous rules, we develop an anomaly-biased model reduction method to prioritize these rules at each hierarchical level. Synergized with this hierarchical organization of rules, we develop a matrix-based hierarchical visualization to support exploration at different levels of detail. Our quantitative experiments and case studies demonstrate how our method fosters a deeper understanding of both common and anomalous rules, thereby enhancing interpretability without sacrificing comprehensiveness.

Materials
ieee arxiv pdf supp video bib