Author:
Panzner Melina,von Enzberg Sebastian,Meyer Maurice,Dumitrescu Roman
Abstract
AbstractComprehensive data understanding is a key success driver for data analytics projects. Knowing the characteristics of the data helps a lot in selecting the appropriate data analysis techniques. Especially in data-driven product planning, knowledge about the data is a necessary prerequisite because data of the use phase is very heterogeneous. However, companies often do not have the necessary know-how or time to build up solid data understanding in connection with data analysis. In this paper, we develop a methodology to organize and categorize and thus understand use phase data in a way that makes it accessible to general data analytics workflows, following a design science research approach. We first present a knowledge base that lists typical use phase data from a product planning view. Second, we develop a taxonomy based on standard literature and real data objects, which covers the diversity of the data considered. The taxonomy provides 8 dimensions that support classification of use phase data and allows to capture data characteristics from a data analytics view. Finally, we combine both views by clustering the objects of the knowledge base according to the taxonomy. Each of the resulting clusters covers a typical combination of analytics relevant characteristics occurring in practice. By abstracting from the diversity of use phase data into artifacts with manageable complexity, our approach provides guidance to choose appropriate data analysis and AI techniques.
Funder
Bundesministerium für Bildung und Forschung
Heinz Nixdorf Institute
Publisher
Springer Science and Business Media LLC
Subject
Economics and Econometrics
Reference51 articles.
1. Alpaydın, E. (2004). Introduction to machine learning, adaptive computation and machine learning series. MIT Press.
2. Awad, E., & Ghaziri, H. (2007). Knowledge management. Pearson Education India.
3. Banimustafa, A., & Hardy, N. (2012). A strategy for selecting data mining techniques in metabolomics. Methods in Molecular Biology (Clifton, N.J.), 860, 317–333.
4. Bilalli, B., Abelló, A., Aluja-Banet, T., & Wrembel, R. (2016). Towards intelligent data analysis: The metadata challenge, undefined.
5. Bourdreau, A., & Couillard, G. (1999). Systems integration and knowledge management. Information Systems Management, 16(4), 24–32.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献