Abstract
Graph databases are enjoying enormous popularity, through both their RDF and Property Graphs (PG) incarnations, in a variety of applications. To query graphs, query languages provide structured, as well as unstructured primitives. While structured queries allow expressing precise information needs, they are unsuited for exploring unfamiliar datasets, as they require prior knowledge of the schema and structure of the dataset. Prior research on keyword search in graph databases do not suffer from this limitation. However, keyword queries do not allow expressing precise search criteria when users do know some.This tutorial (1.5 hours) builds a continuum between structured graph querying through languages such as SPARQL and GPML, a recently proposed standard for PG querying, on one hand, and graph keyword search, on the other hand. In this space between querying and information retrieval, we analyze the features of modern query languages that go toward unstructured search, discuss their strength, limitations, and compare their computational complexity. In particular, we focus on (i) lessons learned from the rich literature of graph keyword search, in particular with respect to result scoring; (ii) language mechanisms for integratingbothcomplex structured querying and powerful methods to search for connections users do not know in advance. We conclude by discussing the open challenges and future work directions.
Publisher
Association for Computing Machinery (ACM)
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Reference46 articles.
1. B. Aditya , Gaurav Bhalotia , Soumen Chakrabarti , Arvind Hulgeri , Charuta Nakhe , Parag, and S. Sudarshan . 2002 . BANKS: Browsing and Keyword Searching in Relational Databases. In VLDB. B. Aditya, Gaurav Bhalotia, Soumen Chakrabarti, Arvind Hulgeri, Charuta Nakhe, Parag, and S. Sudarshan. 2002. BANKS: Browsing and Keyword Searching in Relational Databases. In VLDB.
2. Christian Aebeloe Gabriela Montoya Vinay Setty and Katja Hose. 2018. Discovering Diversified Paths in Knowledge Bases. In VLDB. Christian Aebeloe Gabriela Montoya Vinay Setty and Katja Hose. 2018. Discovering Diversified Paths in Knowledge Bases. In VLDB.
3. Sanjay Agrawal Surajit Chaudhuri and Gautam Das. 2002. DBXplorer: A System for Keyword-Based Search over Relational Databases. In ICDE. Sanjay Agrawal Surajit Chaudhuri and Gautam Das. 2002. DBXplorer: A System for Keyword-Based Search over Relational Databases. In ICDE.
4. Angelos-Christos Anadiotis , Oana Balalau , Théo Bouganim , Francesco Chimienti , Helena Galhardas , Mhd Yamen Haddad , Stéphane Horel, Ioana Manolescu, and Youssr Youssef. 2021 . Empowering Investigative Journalism with Graph-based Heterogeneous Data Management. In Bulletin of the Technical Committee on Data Engineering. IEEE Computer Society . Angelos-Christos Anadiotis, Oana Balalau, Théo Bouganim, Francesco Chimienti, Helena Galhardas, Mhd Yamen Haddad, Stéphane Horel, Ioana Manolescu, and Youssr Youssef. 2021. Empowering Investigative Journalism with Graph-based Heterogeneous Data Management. In Bulletin of the Technical Committee on Data Engineering. IEEE Computer Society.
5. Angelos Christos Anadiotis Ioana Manolescu and Madhulika Mohanty. 2023. Integrating Connection Search in Graph Queries. In ICDE. Angelos Christos Anadiotis Ioana Manolescu and Madhulika Mohanty. 2023. Integrating Connection Search in Graph Queries. In ICDE.