Scalable Data Mining, Archiving, and Big Data Management for the Next Generation Astronomical Telescopes-Reference-Cited by-同舟云学术

Scalable Data Mining, Archiving, and Big Data Management for the Next Generation Astronomical Telescopes

Published: Issue: Volume: Page:196-221
ISSN:2327-1981
Container-title:Big Data Management, Technologies, and Applications
language:
Short-container-title:

Author:

Mattmann Chris A.¹,Hart Andrew¹,Cinquini Luca¹,Lazio Joseph¹,Khudikyan Shakeh¹,Jones Dayton¹,Preston Robert¹,Bennett Thomas²,Butler Bryan³,Harland David³,Glendenning Brian³,Kern Jeff³,Robnett James³

Affiliation:

1. California Institute of Technology, USA

2. SKA South Africa Project, South Africa

3. National Radio Astronomy Observatory (NRAO), USA

Abstract

Big data as a paradigm focuses on data volume, velocity, and on the number and complexity of various data formats and metadata, a set of information that describes other data types. This is nowhere better seen than in the development of the software to support next generation astronomical instruments including the MeerKAT/KAT-7 Square Kilometre Array (SKA) precursor in South Africa, in the Low Frequency Array (LOFAR) in Europe, in two instruments led in part by the U.S. National Radio Astronomy Observatory (NRAO) with its Expanded Very Large Array (EVLA) in Socorro, NM, and Atacama Large Millimeter Array (ALMA) in Chile, and in other instruments such as the Large Synoptic Survey Telescope (LSST) to be built in northern Chile. This chapter highlights the big data challenges in constructing data management systems for these astronomical instruments, specifically the challenge of integrating legacy science codes, handling data movement and triage, building flexible science data portals and user interfaces, allowing for flexible technology deployment scenarios, and in automatically and rapidly mitigating the difference in science data formats and metadata models. The authors discuss these challenges and then suggest open source solutions to them based on software from the Apache Software Foundation including Apache Object-Oriented Data Technology (OODT), Tika, and Solr. The authors have leveraged these solutions to effectively and expeditiously build many precursor and operational software systems to handle data from these astronomical instruments and to prepare for the coming data deluge from those not constructed yet. Their solutions are not specific to the astronomical domain and they are already applicable to a number of science domains including Earth, planetary, and biomedicine.

Publisher

IGI Global

Reference32 articles.

1. Agrawal, D., Das, S., & El Abbadi, A. (2011). Big data and cloud computing: Current state and future opportunities. In Proceedings of the 14th International Conference on Extending Database Technology (pp. 530-533). ACM.

2. Butler, B. J., & Chandler, C. J. (2012). Data management for the EVLA. In Proceedings of SPIE Astronomical Telescopes+ Instrumentation (pp. 84510A-84510A). International Society for Optics and Photonics.

3. Cinquini, L., Crichton, D., Mattmann, C., Harney, J., Shipman, G., Wang, F., & Schweitzer, R. (2012). The earth system grid federation: An open infrastructure for access to distributed geospatial data. In Proceedings of E-Science (e-Science), (pp. 1-10). IEEE.

4. A Multidisciplinary, Model-Driven, Distributed Science Data System Architecture

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Business-making supported via the application of big data to achieve economic sustainability;Entrepreneurship and Sustainability Issues;2022-06-01

2. New optimized ASIC multiplier in 28 nm CMOS for processing the X-part of FX correlator in radio interferometry;Experimental Astronomy;2019-05-29

3. Big Data Technology and its Importance for Decision-Making in Enterprises;Communications - Scientific letters of the University of Zilina;2016-11-30

4. Transmission of large amounts of scientific data using laser technology;Journal of Physics: Conference Series;2016-08

5. Revisiting the Anatomy and Physiology of the Grid;Journal of Grid Computing;2015-01-29