Hubert Baniecki
University of Warsaw, Poland
BIRS Workshop, Banff, Canada
February 12, 2024
PhD student at the University of Warsaw, Poland
explainable machine learning, evaluating explanations, robustness
statistical software: dalex
(JMLR 2021, John M. Chambers Award by ASA), modelStudio
(JOSS 2019), survex
(Bioinformatics 2023)
survex
(Bioinformatics 2023)
For classification (or regression), we have:
What about time-to-event prediction like survival analysis?
(Ribeiro et al. 2016) LIME idea: approximate a black-box function with an interpretable model in the local neighbourhood of observation \(x\).
M. S. Kovalev et al. SurvLIME: A method for explaining machine learning survival models. Knowledge-Based Systems 2020
M. Krzyzinski, M. Spytek, H. Baniecki, P. Biecek. SurvSHAP(t): Time-dependent explanations of machine learning survival models. Knowledge-Based Systems 2023
(Lundberg et al. 2017) SHAP idea: use game theory to estimate additive feature attributions \(\phi_{i}\) to the model’s prediction for observation \(x\).
\[ \phi_{i}(x)=\sum_{S\subseteq P\setminus \{i\}}{\frac {|S|!\;(|P|-|S|-1)!}{|P|!}}(v_{S\cup \{i\}}(x)-v_S(x)) \] \(P\) – feature set, \(v_S(x) = \mathbb{E}_{x|S} f(x)\) – prediction for feature values in set \(S\) that are marginalized over features not included in \(S\).
Over 24 algorithms to estimate Shapley value feature attributions:
KernelSHAP, TreeSHAP, marginal vs conditional..
Idea: explain \(f_t(x)\) – a prediction for an observation \(x\) at time point \(t\) – for all time points separately – \(\phi_t(x, i)\).
survex
, survshap
survex
: dalex
for survival modelsM. Spytek, M. Krzyzinski, S. H. Langbein, H. Baniecki, M. N. Wright, P. Biecek. survex: an R package for explaining machine learning survival models. Bioinformatics 2023
survex
: code exampleTo what extent can the patient’s length of stay in a hospital be predicted using only an X-ray image?
Motivation: explain black-box models predicting hospital LoS
(K. Stone et al. PLOS Digital Health 2022)
Task: time-to-event prediction
instead of a single-value time regression or time-span classification
We manually annotated textual radiology reports of X-ray images:
Predicting the patient’s LoS from an X-ray image is indeed possible, but challenging.
Performance of the interpretable model declines when increasing the number of features.
H. Baniecki, B. Sobieski, P. Bombinski, P. Szatkowski, P. Biecek. Hospital Length of Stay Prediction Based on Multi-modal Data Towards Trustworthy Human-AI Collaboration in Radiomics. AIME 2023
H. Baniecki, B. Sobieski, P. Bombinski, P. Szatkowski, P. Biecek. Hospital Length of Stay Prediction Based on Multi-modal Data Towards Trustworthy Human-AI Collaboration in Radiomics. AIME 2023
How to evaluate explanationsâť“ How to interpret explanationsâť“
One needs to be sure that humans properly interpret explanation visualizations, i.e. physicians and other stakeholders interpreting predictions need to understand that explanations are only an approximation of the black-box model.
🧑 🔬 Shameless plug: I’m open to a 3-month research visit in 2025 (self-)funded by “Polish NSF” as part of my PhD.
Hubert Baniecki – slides: hbaniecki.com/birs2024 – lab: mi2.ai