Author:
Ceccaroni Riccardo,Di Rocco Lorenzo,Ferraro Petrillo Umberto,Brutti Pierpaolo
Abstract
AbstractPersistent homology (PH) is a powerful mathematical method to automatically extract relevant insights from images, such as those obtained by high-resolution imaging devices like electron microscopes or new-generation telescopes. However, the application of this method comes at a very high computational cost that is bound to explode more because new imaging devices generate an ever-growing amount of data. In this paper, we present PixHomology, a novel algorithm for efficiently computing zero-dimensional PH on images, optimizing memory and processing time. By leveraging the Apache Spark framework, we also present a distributed version of our algorithm with several optimized variants, able to concurrently process large batches of astronomical images. Finally, we present the results of an experimental analysis showing that our algorithm and its distributed version are efficient in terms of required memory, execution time, and scalability, consistently outperforming existing state-of-the-art PH computation tools when used to process large datasets.
Funder
Università degli Studi di Roma La Sapienza
Publisher
Springer Science and Business Media LLC
Reference34 articles.
1. Poger D, Yen L, Braet F (2023) Big data in contemporary electron microscopy: challenges and opportunities in data transfer, compute and management. Histochem Cell Biol 160(3):169–192
2. Large synoptic survey telescope (2023)
3. Starck J, Murtagh F (2007) Astronomical image and data analysis. Astronomy and astrophysics library. Springer, Berlin
4. Edelsbrunner H, Harer J (2010) Computational topology: an introduction. American Mathematical Society, New York
5. Edelsbrunner H, Letscher D, Zomorodian A (2003) Topological persistence and simplification. Discrete & Computational Geometry, 01