Affiliation:
1. Computer Engineering College, Jimei University, Xiamen 361021, China
2. Digital Fujian Big Data Modeling and Intelligent Computing Institute, Xiamen 361021, China
Abstract
XPath query is the key part of XML data processing, and its performance is usually critical for XML applications. In the process of XPath query, there is inherent seriality between query steps, which makes it difficult to parallelize the query effectively as a whole. On the other hand, although XPath query has the characteristics of data stream processing and is suitable for pipeline processing, the data flow of each query step usually varies a lot, which results in limited performance under multithreading conditions. In this paper, we propose a pipelined XPath query method (PXQ) based on cost optimization. This method uses pipelined query primitives to process query steps based on relation index. During pipeline construction, a cost estimation model based on XML statistics is proposed to estimate the cost of the query primitive and provide guidance for the creation of a pipeline phase through the partition of query primitive sequence. The pipeline construction technique makes full use of available worker threads and optimizes the load balance between pipeline stages. The experimental results show that our method can adapt to the multithreaded environment and stream processing scenarios of XPath query, and its performance is better than the existing typical query methods based on data parallelism.
Funder
Natural Science Foundation of Fujian Province
Subject
Computer Science Applications,Software
Reference33 articles.
1. Semistructured data;P. Buneman
2. XML path language (XPath);J. Robie,2017
3. Parallelization of XPath queries using multi-core processors;R. Bordawekar
4. Parallel XPath evaluation based on node relation matrix;R. Chen;Journal of Computational Information Systems,2013
5. Parallelization of XPath queries using modern XQuery processors;S. Sato;New Trends in Databases and Information Systems. ADBIS,2018
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献