A Survey on XML Fragmentation-Reference-Cited by-同舟云学术

A Survey on XML Fragmentation

Published:2014-12-04 Issue:3 Volume:43 Page:24-35
ISSN:0163-5808
Container-title:ACM SIGMOD Record
language:en
Short-container-title:SIGMOD Rec.

Author:

Braganholo Vanessa¹,Mattoso Marta²

Affiliation:

1. Fluminense Federal University, UFF, Brazil

2. Federal University of Rio de Janeiro, COPPE/UFRJ, Brazil

Abstract

Efficient document processing is a must when large volumes of XML data are involved. In such critical scenarios, a well-known solution to this problem is to distribute (map) the data among several processing nodes, and then distribute the processing accordingly, taking advantage of parallelism. This is the approach taken by distributed databases and MapReduce environments. Fragmentation techniques play an important role in these scenarios. They provide a way to "cut" the database into pieces and distribute the pieces over a network. This way, queries can also be "cut" into sub-queries that run in parallel, thus achieving better performance when compared to the centralized environment. However, there is no consensus in the database community as to what an XML fragment is. In fact, several approaches in literature present definitions of XML fragments. In addition to query processing, using XML fragmentation techniques may also be helpful when managing XML documents distributed along the web or clouds. This paper surveys the existing XML fragmentation approaches in literature, comparing their features and highlighting their drawbacks. Our contribution resides in establishing a map of the area.

Publisher

Association for Computing Machinery (ACM)

Subject

Information Systems,Software

Link

https://dl.acm.org/doi/pdf/10.1145/2694428.2694434

Reference77 articles.

1. Dynamic XML documents with distribution and replication

2. HadoopDB

3. Efficiently Processing XML Queries over Fragmented Repositories with PartiX

4. XML-based information mediation with MIX

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Integrated method for distributed processing of large XML data;Cluster Computing;2023-05-13

2. XML2HBase: Storing and querying large collections of XML documents using a NoSQL database system;Journal of Parallel and Distributed Computing;2022-03

3. Decomposition of Fuzzy Homogeneous Classes of Objects;Communications in Computer and Information Science;2022

4. Querying XML documents using Prolog engines: When is this a good idea?;Information Processing & Management;2019-09

5. Online Integration of Fragmented XML Documents;Intelligent Information and Database Systems;2017