Implementation of the Concept of a Repository for Automated Processing of Semi-Structural Data

Author:

Piech Mateusz, Rakoczy Bartosz, Dajda Jacek, Kisiel-Dorohinicki Marek

Abstract

Semi-structural data tend to be problematic because their attributes are sparse and because, regardless of their type, they are immensely diverse. This makes data storage a challenge, especially when the data must be held in a relational database, which is often a strict requirement defined in advance. In this paper, we present a thoroughly described concept of a repository capable of storing and processing semi-structural data. Based on this concept, we establish a database model comprising the architecture and the tools needed to search the data and build relevant processors. The processor described may assign roles and dispatch tasks among users. We demonstrate how this repository overcomes current limitations by creating a system for facilitated digitization of scientific resources. In addition, we show that the repository is suitable for general use and, as such, may be adapted to any domain in which semi-structural data are processed, without any additional work required.

Publisher

National Institute of Telecommunications

Subject

Electrical and Electronic Engineering, Computer Networks and Communications

