Hubert Baniecki
h.baniecki (at) uw.edu.pl
I am a PhD candidate in Computer Science at the University of Warsaw, advised by Przemyslaw Biecek. During my PhD, I interned at Meta in New York (‘26), and stayed at LMU Munich, hosted by Eyke Hüllermeier (‘25) and Bernd Bischl (‘24).
My research focuses on machine learning interpretability (a.k.a. explainable AI):
- interpreting multimodal, vision–language models (NeurIPS’25, ICML’25),
- statistical foundations of interpretability (ICLR’25 Spotlight, ICML’26),
- open-source software and benchmarks in this domain (JMLR’21, NeurIPS’24),
- applications of interpretability in medicine (WACV’25) and beyond (PNAS’24).
I actively contribute to the academic community by serving as a reviewer for conferences like NeurIPS, ICML, ICLR, with their workshops on interpretability/xAI, and journals like the Journal of Machine Learning Research, Machine Learning, Nature Communications.
recent news [previous]
| 2026 Jun | I’ve joined Meta as a research scientist intern in New York City for the summer of 2026. |
|---|---|
| 2026 Apr |
A paper |
| 2025 Nov | I stay in Italy until December for a 1-month research visit at the University of Pisa hosted by Riccardo Guidotti. |
| 2025 Sep |
A paper |
| 2025 Sep |
A paper |
| 2025 May | Foundation for Polish Science awarded me the START scholarship for young scientists. |
| 2025 May |
A paper |
| 2025 Mar | I stay in Germany until April for a 1-month research visit at LMU Munich hosted by Eyke Hüllermeier. |
selected publications [full list]
-
Functional decomposition and Shapley interactions for interpreting survival models
International Conference on Machine Learning (ICML) , 2026We propose a principled approach based on functional decomposition and Shapley values to explain time-dependent feature interactions in machine learning survival models. -
Explaining similarity in vision-language encoders with weighted Banzhaf interactions
Advances in Neural Information Processing Systems (NeurIPS) , 2025We introduce faithful interaction explanations of CLIP and SigLIP models (FIxLIP), offering a unique, game-theoretic perspective on interpreting image–text similarity predictions. -
Interpreting CLIP with hierarchical sparse autoencoders
International Conference on Machine Learning (ICML) , 2025We introduce the Matryoshka sparse autoencoder (MSAE) that establishes a state-of-the-art Pareto frontier between reconstruction quality and sparsity for interpreting CLIP models. -
ICLR
Spotlight Efficient and accurate explanation estimation with distribution compressionInternational Conference on Learning Representations (ICLR) , 2025 (Spotlight)We introduce compress then explain (CTE) as a new paradigm for sample-efficient estimation of post-hoc explanations, including feature attributions, importance, and effects. -
WACV
Oral Aggregated attributions for explanatory analysis of 3D segmentation modelsIEEE/CVF Winter Conference on Applications of Computer Vision (WACV) , 2025 (Oral)We discover knowledge acquired by the TotalSegmentator foundation model trained to segment all anatomical structures in computed tomography medical images. -
shapiq: Shapley interactions for machine learning
Advances in Neural Information Processing Systems (NeurIPS) , 2024We develop {shapiq}, an open-source Python package that implements several algorithms and benchmarks for efficiently approximating game-theoretic attribution and interaction indices. -
Increasing phosphorus loss despite widespread concentration decline in US rivers
Proceedings of the National Academy of Sciences , 2024We reveal a paradox in US rivers with deep learning: phosphorus concentration is down over the last 40 years, particularly in urban areas, but total phosphorus loss is up due to climate change. -
The grammar of interactive explanatory model analysis
Data Mining and Knowledge Discovery , 2023We propose to juxtapose multiple complementary explanations, and show that an interactive sequential analysis of a model improves the accuracy and confidence of human decision-making.