Blind Queries Applied to JSON Document Stores

Published: 2019-09-21 Issue: 10 Volume: 10 Page: 291
ISSN: 2078-2489
Container-title: Information
Language: en
Short-container-title: Information

Author:

Marrara Stefania, Pelucchi Mauro, Psaila Giuseppe

Abstract

Social media, Web portals and, more generally, information systems offer their own Application Programming Interfaces (APIs), which provide large data sets concerning every aspect of day-to-day life. APIs usually provide data sets as collections of JSON documents. The heterogeneous structure of JSON documents returned by different APIs constitutes a barrier to effectively querying and analyzing these data sets. The adoption of NoSQL document stores, such as MongoDB, is useful for gathering these data sets, but does not solve the problem of querying the resulting heterogeneous repository. The aim of this paper is to provide analysts with a tool, named HammerJDB, that allows blind querying of collections of JSON documents within a NoSQL document database. The underlying idea is that users may know the application domain without being aware of the actual structure of the documents stored in the database; the tool for blind querying tries to bridge this gap by adopting a query rewriting mechanism. This paper builds on a technique for blind querying Open Data portals and on its implementation within the Hammer framework, presented in previous work. Here, that approach is extended to query a NoSQL document database by evolving the Hammer framework into the HammerJDB framework, which is able to work on MongoDB databases. The effectiveness of the new approach is evaluated on a data set (derived from a real-life one) containing job-vacancy ads collected from European job portals.
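To make the idea of query rewriting over heterogeneous JSON documents more concrete, the following is a minimal sketch and not the HammerJDB algorithm, which the abstract does not detail. It assumes that a "blind" query is a flat mapping from guessed field names to values, that documents are plain in-memory Python dictionaries rather than a MongoDB collection, and that field names are matched by simple string similarity from the standard-library `difflib` module. The names `field_names`, `rewrite_query`, `blind_find` and the 0.6 threshold are hypothetical choices made only for illustration.

```python
from difflib import SequenceMatcher


def field_names(doc, prefix=""):
    """Collect the dotted paths of all fields in a nested JSON-like dict."""
    names = set()
    for key, value in doc.items():
        path = prefix + key
        names.add(path)
        if isinstance(value, dict):
            names |= field_names(value, path + ".")
    return names


def similarity(a, b):
    """Case-insensitive string similarity in [0, 1] (an assumed metric)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def rewrite_query(query, documents, threshold=0.6):
    """Rewrite a blind query into candidate queries over actual field paths.

    Each guessed field name is replaced by every real field path whose last
    segment is similar enough; if none is found, the guess is kept as-is.
    """
    known = set()
    for doc in documents:
        known |= field_names(doc)
    rewritten = [{}]
    for guessed, value in query.items():
        candidates = sorted(
            name for name in known
            if similarity(guessed, name.split(".")[-1]) >= threshold
        ) or [guessed]
        rewritten = [{**q, c: value} for q in rewritten for c in candidates]
    return rewritten


def get_path(doc, path):
    """Follow a dotted path into a nested dict; return None if it is absent."""
    for part in path.split("."):
        if not isinstance(doc, dict) or part not in doc:
            return None
        doc = doc[part]
    return doc


def blind_find(query, documents, threshold=0.6):
    """Return the documents that satisfy at least one rewritten query."""
    results = []
    for candidate in rewrite_query(query, documents, threshold):
        for doc in documents:
            matches = all(get_path(doc, p) == v for p, v in candidate.items())
            if matches and doc not in results:
                results.append(doc)
    return results


# Hypothetical job-vacancy documents with differing structures.
docs = [
    {"job_title": "Data Analyst", "country": "IT"},
    {"title": "Data Analyst", "location": {"country": "DE"}},
    {"name": "Data Analyst"},
]
found = blind_find({"title": "Data Analyst"}, docs)
```

In this sketch the guessed field `title` is rewritten to both `title` and `job_title`, so documents from differently structured sources can be retrieved by one query, while a dissimilar name such as `name` is not matched. A real system would presumably also need to handle term meaning, not just spelling, and would issue the rewritten queries against the database itself.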

Publisher

MDPI AG

Subject

Information Systems

Link

https://www.mdpi.com/2078-2489/10/10/291/pdf

References: 41 articles.

1. WoLMIS: a labor market intelligence system for classifying web job vacancies

2. Fuzzy sets as a basis for a theory of possibility

Cited by 6 articles.

1. Soft querying powered by user-defined functions in J-CO-QL; Neurocomputing; 2023-08

2. RADAR: Resilient Application for Dependable Aided Reporting; Information; 2021-11-09

3. Towards Flexible Retrieval, Integration and Analysis of JSON Data Sets through Fuzzy Sets: A Case Study; Information; 2021-06-22

4. J-CO: A Platform-Independent Framework for Managing Geo-Referenced JSON Data Sets; Electronics; 2021-03-07

5. Creating Collections with Embedded Documents for Document Databases Taking into Account the Queries; Computation; 2020-05-15