Author:
Chen Mayee F., Nachman Benjamin, Sala Frederic
Abstract
An important class of techniques for resonant anomaly detection in high energy physics builds models that can distinguish between reference and target datasets, where only the latter has appreciable signal. Such techniques, including Classification Without Labels (CWoLa) and Simulation Assisted Likelihood-free Anomaly Detection (SALAD), rely on a single reference dataset. They cannot take advantage of the multiple reference datasets that are commonly available and thus cannot fully exploit the available information. In this work, we propose generalizations of CWoLa and SALAD for settings where multiple reference datasets are available, building on weak supervision techniques. We demonstrate improved performance in a number of settings with realistic and synthetic data. As an added benefit, our generalizations enable us to provide finite-sample guarantees, improving on existing asymptotic analyses.
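The single-reference idea that the abstract starts from can be illustrated with a toy sketch. This is not the authors' method or code: the one-dimensional Gaussian data, the sample sizes, the signal fraction, and the plain logistic regression are all assumptions chosen for illustration. The classifier sees only dataset-membership labels (reference vs. target), yet its score still tends to rank signal above background, because the target sample is the only one containing signal.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data (assumed for illustration): one feature, Gaussian background and signal.
n_ref, n_bkg, n_sig = 5000, 4500, 500
reference = rng.normal(0.0, 1.0, n_ref)    # reference: background only
target_bkg = rng.normal(0.0, 1.0, n_bkg)   # target: mostly background ...
target_sig = rng.normal(3.0, 0.5, n_sig)   # ... plus a small signal component
target = np.concatenate([target_bkg, target_sig])

# Labels encode dataset membership only (0 = reference, 1 = target),
# never the true signal/background identity of an event.
x = np.concatenate([reference, target])
y = np.concatenate([np.zeros(n_ref), np.ones(len(target))])

# Logistic regression fitted by plain gradient descent (hypothetical model choice).
w, b = 0.0, 0.0
learning_rate = 0.1
for _ in range(2000):
    p = 1.0 / (1.0 + np.exp(-(w * x + b)))
    w -= learning_rate * np.mean((p - y) * x)
    b -= learning_rate * np.mean(p - y)


def score(values):
    """Classifier output: estimated probability that an event is from the target."""
    return 1.0 / (1.0 + np.exp(-(w * np.asarray(values) + b)))


def auc(pos, neg):
    """Fraction of (positive, negative) pairs where the positive scores higher."""
    return np.mean(pos[:, None] > neg[None, :])


# Evaluate with the true labels, which were never used in training.
signal_vs_background_auc = auc(score(target_sig), score(target_bkg))
print(f"signal vs. background AUC: {signal_vs_background_auc:.3f}")
```

In this sketch the true signal labels are used only for the final evaluation. A realistic analysis would use many features and a more flexible classifier, and the multi-reference generalizations proposed in the paper are not represented here.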
Publisher
Springer Science and Business Media LLC
Subject
Nuclear and High Energy Physics
Cited by
5 articles.