Affiliation:
1. Paris Descartes University
Abstract
Massive data series collections are becoming a reality for virtually every scientific and social domain, and have to be processed and analyzed, in order to extract useful knowledge. Current data series management solutions are ad hoc, requiring huge investments in time and effort, and duplication of effort across different teams. Systems like relational databases, Column Stores, and Array Databases are not a suitable solution either, since none of these systems offers native support for data series. Our vision is to design and develop a generalpurpose Data Series Management System, able to copewith big data series, that is, very large and fast-changing collections of data series, which can be heterogeneous (i.e., originate from disparate domains and thus exhibit very different characteristics), and which can have uncertainty in their values (e.g., due to inherent errors in the measurements). Just like databases abstracted the relational data management problem and offered a black box solution that is now omnipresent, the proposed system will allow analysts that are not experts in data series management, as well as common users, to tap in the goldmine of the massive and ever-growing data series collections they (already) have
Publisher
Association for Computing Machinery (ACM)
Subject
Information Systems,Software
Reference32 articles.
1. Adhd-200. http://fcon_1000.projects.nitrc.org/ indi/adhd200/ 2011. Adhd-200. http://fcon_1000.projects.nitrc.org/ indi/adhd200/ 2011.
2. Orleans: Distributed virtual actors for programmability and scalability. MSR-TR-2014-41 2014. Orleans: Distributed virtual actors for programmability and scalability. MSR-TR-2014-41 2014.
3. Sloan digital sky survey. https://www.sdss3.org/dr10/data_access/volume.php 2015. Sloan digital sky survey. https://www.sdss3.org/dr10/data_access/volume.php 2015.
Cited by
69 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献