Affiliation:
1. V.M. Glushkov Institute of Cybernetics of the NAS of Ukraine, Kyiv
2. United Institute of Informatics Problems of the National Academy of Sciences of Belarus, Minsk
Abstract
Introduction. This publication summarizes the experience of the use of applied containerized software tools in cloud environment, which the authors gained during the project “Development of methods, algorithms and intellectual analytical system for processing and analysis of heterogeneous clinical and biomedical data in order to improve the diagnosis of complex diseases”, accomplished by the team from the United Institute of Informatics Problems of the NAS of Belarus and V.M. Glushkov Institute of Cybernetics of the NAS of Ukraine. In parallel, the features of biomedical data and the main approaches to their processing and classification, implemented within the framework of an intelligent analytical system, and the possibility of their implementation as part of a container application are described.
The purpose of the paper is to describe modern technologies that ensure the reproducibility of numerical experiments in this field and the tools aimed to integrate several sources of biomedical information in order to improve the diagnostics and prognosis of complex diseases. Special attention is also paid to the methods of handling data received from different sources of biomedical information. Particular attention is paid to methods of processing data obtained from various sources of biomedical information and included to the intelligent analytical system.
Results. The experience of the use of applied containerized biomedical software tools in cloud environment is summarized. The reproducibility of scientific computing in relation with modern technologies of scientific calculations is discussed. The main approaches to biomedical data preprocessing and integration in the framework of the intelligent analytical system are described. The developed hybrid classification model presents the basis of the intelligent analytical system and aims to integrate several sources of biomedical information.
Conclusions. The experience of using the developed classification module NonSmoothSVC, which is part of the developed intelligent analytical system, gained during its testing on artificial and real data, allows us to conclude about several advantages provided by the containerized form of the created application. Namely:
• It permits to provide access to real data located in cloud environment,
• It is possible to perform calculations to solve research problems on cloud resources both with the help of developed tools and with the help of cloud services,
• Such a form of research organization makes numerical experiments reproducible, i.e. any other researcher can compare the results of their developments on specific data that have already been studied by others, in order to verify the conclusions and technical feasibility of new results,
• There exists a universal opportunity to use the developed tools on technical devices of various classes from a personal computer to powerful cluster.
The hybrid classification model as a core of the intelligent system will make it possible to integrate multidimensional, heterogeneous biomedical data with the aim to better understand the molecular courses of disease origin and development, to improve the identification of disease subtypes and disease prognosis. Keywords: classifier, cloud service, containerized application, heterogeneous biomedical data
Publisher
V.M. Glushkov Institute of Cybernetics
Reference32 articles.
1. Vorontsov K.V. Mathematical methods of learning by precedents (Machine Learning Theory) (in Russian) http://www.machinelearning.ru/wiki/images/6/6d/Voron-ML-1.pdf
2. Gupal A.M., Sergienko I.V. Symmetry in DNA. Methods for Discrete Sequences Recognition. Kyiv. Naukova Dumka, 2016. 227 p. (in Russian).
3. Baldi P., Wesley Hatfield G. DNA Microarrays and Gene Expression. From Experiments to Data Analysis and Modeling. Cambridge University Press, 2011.
4. Kuhn M., Johnson K. Applied predictive modeling. New York: Springer, 2013. https://doi.org/10.1007/978-1-4614-6849-3
5. Heath L.S., Ramakrishnan N. (Eds.). Problem solving handbook in computational biology and bioinformatics. NY: Springer Science & Business Media, 2010. https://doi.org/10.1007/978-0-387-09760-2
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献