1. Proctor: A semi-supervised performance anomaly diagnosis framework for production HPC systems;B Aksar;High Performance Computing: 36th International Conference, ISC High Performance 2021, Virtual Event,2021
2. An evaluation of major fault tolerance used on high performance computing (HPC) applications;M M A Baig;International Journal of Intelligent Systems and Applications in Engineering,2023
3. Machine learning methodologies to support HPC systems operations: Anomaly detection;A Bartolini;Euro-Par 2022: Parallel Processing Workshops: Euro-Par 2022 International Workshops,2022
4. Paving the way toward energy-aware and automated datacentre;A Bartolini;Workshop Proceedings of the 48th International Conference on Parallel Processing,2019
5. Interpretable anomaly detection for monitoring of high performance computing systems;E Baseman;Outlier Definition, Detection, and Description on Demand Workshop at ACM SIGKDD. San Francisco,2016