Demystifying Graph Databases: Analysis and Taxonomy of Data Organization, System Designs, and Graph Queries-Reference-Cited by-同舟云学术

Demystifying Graph Databases: Analysis and Taxonomy of Data Organization, System Designs, and Graph Queries

Published:2023-09-15 Issue:2 Volume:56 Page:1-40
ISSN:0360-0300
Container-title:ACM Computing Surveys
language:en
Short-container-title:ACM Comput. Surv.

Author:

Besta Maciej¹^ORCID,Gerstenberger Robert¹^ORCID,Peter Emanuel¹^ORCID,Fischer Marc²^ORCID,Podstawski Michał³^ORCID,Barthels Claude¹^ORCID,Alonso Gustavo¹^ORCID,Hoefler Torsten¹^ORCID

Affiliation:

1. Department of Computer Science, ETH Zurich, Switzerland

2. PRODYNA (Schweiz) AG, Switzerland

3. Future Processing, Poland

Abstract

Numerous irregular graph datasets, for example social networks or web graphs, may contain even trillions of edges. Often, their structure changes over time and they have domain-specific rich data associated with vertices and edges. Graph database systems such as Neo4j enable storing, processing, and analyzing such large, evolving, and rich datasets. Due to the sheer size and irregularity of such datasets, these systems face unique design challenges. To facilitate the understanding of this emerging domain, we present the first survey and taxonomy of graph database systems. We focus on identifying and analyzing fundamental categories of these systems (e.g., document stores, tuple stores, native graph database systems, or object-oriented systems), the associated graph models (e.g., Resource Description Framework or Labeled Property Graph), data organization techniques (e.g., storing graph data in indexing structures or dividing data into records), and different aspects of data distribution and query execution (e.g., support for sharding and Atomicity, Consistency, Isolation, Durability). Fifty-one graph database systems are presented and compared, including Neo4j, OrientDB, and Virtuoso. We outline graph database queries and relationships with associated domains (NoSQL stores, graph streaming, and dynamic graph algorithms). Finally, we outline future research and engineering challenges related to graph databases.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science,Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3604932

Reference213 articles.

1. Daniel J. Abadi et al. 2007. Scalable semantic web data management using vertical partitioning. In VLDB. 411–422.

2. Effective Partitioning and Multiple RDF Indexing for Database Triple Store

3. A scalable processing-in-memory accelerator for parallel graph processing

4. Amazon. Amazon Neptune. Retrieved from https://aws.amazon.com/neptune/.

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Unified Graph Framework for Storage-Compute Coupled Cluster and High-Density Computing Cluster;International Workshop on Big Data in Emergent Distributed Environments;2024-06-09

2. Materialized View Selection & View-Based Query Planning for Regular Path Queries;Proceedings of the ACM on Management of Data;2024-05-29

3. Efficient Multi-Query Oriented Continuous Subgraph Matching;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13

4. Knowledge engineering for wind energy;Wind Energy Science;2024-04-12

5. Checking Transaction Isolation Violations Using Graph Queries;Lecture Notes in Computer Science;2024