Affiliation:
1. Wright State University, USA
2. Indiana University, USA
Abstract
This chapter introduces a complete storage and retrieval architecture for a database environment for XML documents. DocBase, a prototype system based on this architecture, uses a flexible storage and indexing technique to allow highly expressive queries without the necessity of mapping documents to other database formats. DocBase is an integration of several techniques that include (i) a formal model called Heterogeneous Nested Relations (HNR), (ii) a conceptual model XER (Extensible Entity Relationship), (ii) formal query languages (Document Algebra and Calculus), (iii) a practical query language (Document SQL or DSQL), (iv) a visual query formulation method with QBT (Query By Templates), and (v) the DocBase query processing architecture. This paper focuses on the overall architecture of DocBase including implementation details, describes the details of the query-processing framework, and presents results from various performance tests. The paper summarizes experimental and usability analyses to demonstrate its feasibility as a general architecture for native as well as embedded document manipulation methods.
Reference62 articles.
1. Abiteboul, S., Cluet, S., & Milo, T. (1993). Querying and updating the file. In Proceedings of the 19th International Conference on Very Large Data Bases, Dublin, Ireland (pp. 73-84).
2. The Lorel query language for semistructured data.;S.Abiteboul;International Journal on Digital Libraries,1997
3. Abiteboul, S., & Viannu, V. (1997). Regular path queries with constraints. In Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, Tucson, AZ (pp. 122-133).
4. Tree pattern query minimization
5. Balmin, A., Ozcan, F., Beyer, K. S., Cochrane, R. J., & Pirahesh, H. (2004). A framework for using materialized XPath views in XML query processing. In Proceedings of the Thirtieth International Conference on Very Large Data Bases, Toronto, ON, Canada (Vol. 30).