Towards Migration-Free "Just-in-Case" Data Archival for Future Cloud Data Lakes Using Synthetic DNA

Author:

Marinelli Eugenio1,Yan Yiqing1,Magnone Virginie2,Dumargne Charlotte2,Barbry Pascal2,Heinis Thomas3,Appuswamy Raja1

Affiliation:

1. Eurecom, France

2. IPMC, France

3. Imperial College London, UK

Abstract

Given the growing adoption of AI, cloud data lakes are facing the need to support cost-effective "just-in-case" data archival over long time periods to meet regulatory compliance requirements. Unfortunately, current media technologies suffer from fundamental issues that will soon, if not already, make cost-effective data archival infeasible. In this paper, we present a vision for redesigning the archival tier of cloud data lakes based on a novel, obsolescence-free storage medium-synthetic DNA. In doing so, we make two contributions: (i) we highlight the challenges in using DNA for data archival and list several open research problems, (ii) we outline OligoArchive-DSM (OA-DSM)-an end-to-end DNA storage pipeline that we are developing to demonstrate the feasibility of our vision.

Publisher

Association for Computing Machinery (ACM)

Subject

General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development

Reference35 articles.

1. A Survey of Uncertain Data Algorithms and Applications

2. Patrick Anderson Richard Black Ausra Cerkauskaite Andromachi Chatzieleftheriou James Clegg Chris Dainty Raluca Diaconu Austin Donnelly Rokas Drevinskas Alexander Gaunt Andreas Georgiou Ariel Gomez Diaz Peter G. Kazansky David Lara Sergey Legtchenko Sebastian Nowozin Aaron Ogus Douglas Phillips Ant Rowstron Masaaki Sakakura Ioan Stefanovici Benn Thomsen and Lei Wang. 2018. Glass: A New Media for a New Era?. In HotStorage. Patrick Anderson Richard Black Ausra Cerkauskaite Andromachi Chatzieleftheriou James Clegg Chris Dainty Raluca Diaconu Austin Donnelly Rokas Drevinskas Alexander Gaunt Andreas Georgiou Ariel Gomez Diaz Peter G. Kazansky David Lara Sergey Legtchenko Sebastian Nowozin Aaron Ogus Douglas Phillips Ant Rowstron Masaaki Sakakura Ioan Stefanovici Benn Thomsen and Lei Wang. 2018. Glass: A New Media for a New Era?. In HotStorage.

3. Raja Appuswamy and Vincent Joguin. 2021. Universal Layout Emulation for Long-Term Database Archival. In CIDR. Raja Appuswamy and Vincent Joguin. 2021. Universal Layout Emulation for Long-Term Database Archival. In CIDR.

4. R. Appuswamy Kevin Lebrigand Pascal Barbry Marc Antonini Oliver Madderson Paul Freemont James MacDonald and Thomas Heinis. 2019. OligoArchive: Using DNA in the DBMS storage hierarchy. In CIDR. R. Appuswamy Kevin Lebrigand Pascal Barbry Marc Antonini Oliver Madderson Paul Freemont James MacDonald and Thomas Heinis. 2019. OligoArchive: Using DNA in the DBMS storage hierarchy. In CIDR.

5. Tuundefinedkan Batu Sampath Kannan Sanjeev Khanna and Andrew McGregor. 2004. Reconstructing Strings from Random Traces. In SODA. Tuundefinedkan Batu Sampath Kannan Sanjeev Khanna and Andrew McGregor. 2004. Reconstructing Strings from Random Traces. In SODA.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3