A detailed study of interpretability of deep neural network based top taggers

Khot, Ayush and Neubauer, Mark S. and Roy, Avik (2023) A detailed study of interpretability of deep neural network based top taggers. Machine Learning: Science and Technology, 4 (3). 035003. ISSN 2632-2153

Khot_2023_Mach._Learn.__Sci._Technol._4_035003.pdf (Published Version, 4 MB)

Abstract

Recent developments in the methods of explainable artificial intelligence (XAI) allow researchers to explore the inner workings of deep neural networks (DNNs), revealing crucial information about input–output relationships and realizing how data connects with machine learning models. In this paper we explore interpretability of DNN models designed to identify jets coming from top quark decay in high energy proton–proton collisions at the Large Hadron Collider. We review a subset of existing top tagger models and explore different quantitative methods to identify which features play the most important roles in identifying the top jets. We also investigate how and why feature importance varies across different XAI metrics, how correlations among features impact their explainability, and how latent space representations encode information as well as correlate with physically meaningful quantities. Our studies uncover some major pitfalls of existing XAI methods and illustrate how they can be overcome to obtain consistent and meaningful interpretation of these models. We additionally illustrate the activity of hidden layers as neural activation pattern diagrams and demonstrate how they can be used to understand how DNNs relay information across the layers and how this understanding can help to make such models significantly simpler by allowing effective model reoptimization and hyperparameter tuning. These studies not only facilitate a methodological approach to interpreting models but also unveil new insights about what these models learn. Incorporating these observations into augmented model design, we propose the particle flow interaction network model and demonstrate how interpretability-inspired model augmentation can improve top tagging performance.
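The abstract refers to quantitative methods for ranking which input features drive a DNN top tagger's decisions. As a purely illustrative sketch (not the authors' code or models), the following computes a standard gradient-based attribution (input times gradient) for a toy classifier on synthetic jet features; the architecture, feature count, and data are hypothetical stand-ins.

```python
# Illustrative sketch only: gradient-based feature attribution for a toy
# DNN classifier on synthetic "jet" features. The model, feature count,
# and data below are hypothetical, not taken from the paper.
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy tagger: a small MLP over a fixed-length feature vector per jet.
n_features = 8
model = nn.Sequential(
    nn.Linear(n_features, 32), nn.ReLU(),
    nn.Linear(32, 16), nn.ReLU(),
    nn.Linear(16, 1),  # logit for "top jet" vs. background
)
model.eval()

# Synthetic batch of jets (stand-in for real tagger inputs).
jets = torch.randn(64, n_features, requires_grad=True)

# Input-times-gradient attribution: how strongly each input feature
# pushes the top-jet logit, per jet, then averaged over the batch.
logits = model(jets).squeeze(-1)
logits.sum().backward()
attribution = (jets * jets.grad).detach()         # shape: (64, n_features)
mean_importance = attribution.abs().mean(dim=0)   # per-feature score

for i, score in enumerate(mean_importance):
    print(f"feature {i}: importance {score:.4f}")
```

The paper itself compares several such XAI metrics and examines how feature correlations affect them; this snippet only shows the general shape of a single attribution computation.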

Item Type: Article
Subjects: Institute Archives > Multidisciplinary
Depositing User: Managing Editor
Date Deposited: 04 Oct 2023 04:09
Last Modified: 04 Oct 2023 04:09
URI: http://eprint.subtopublish.com/id/eprint/2668
