Abstract
PurposeUpon the realization of the need for guideline in cross-organizational data integration, in an exploratory manner, this study developed a public data governance framework, specifically, the governance for integrated public data (GIPD) framework and identified the influential factors of its successful implementation. This framework was then subjected to an analysis of a real data integration case in the South Korean public sector to test its efficacy.Design/methodology/approachTo develop the GIPD framework, the authors conducted an extensive meta study, focus group interviews and the analytic hierarchy process involving field experts. Further, the authors performed topic modeling on documents from Korean research and development data integration projects, and compared the extracted factors to those of the GIPD to illustrate the latter's usefulness in a real case.FindingsLegislation, policy goals and strategies, operation organization, decision-making council, financial support size and objective, system development and operation, data integration, data generation, system/data standardization and master data management were derived as the 10 important factors in implementing the GIPD framework. The illustrative case of Korea revealed that decision-making council, financial support size and objective, legislation, data generation and data integration were insufficient.Research limitations/implicationsAlthough this study reveals important findings, it has a few limitations. First, the potential factors for data governance might vary depending on the attribute of the “interviewee” (such as their career or experience period) and the goal and area of GIPD framework building. Second, the inherent limitation of topic modeling in determining topics from groups of extracted keywords means that topics may be interpreted in various ways, depending on the perspective of the expert.Practical implicationsThis study is highly significant in that it provides a starting point for discussions on the issue of data integration among public institutions. Therefore, although this study examined public data governance based on R&D data, it will contribute to providing a sufficient guideline for any type of inter-institutional data governance framework, what to discuss and how to discuss between institutions.Originality/valueThe findings are expected to provide a roadmap to formulate practical guidelines on inter-institutional data cooperation and a diagnostic matrix to improve the existing data governance system, especially in the public sector, from the existing practice of empirical analysis using a mixed methodology approach.
Subject
Library and Information Sciences,Information Systems