R.ROSETTA: an interpretable machine learning framework

Published:2021-03-06 Issue:1 Volume:22
ISSN:1471-2105
Container-title:BMC Bioinformatics
language:en
Short-container-title:BMC Bioinformatics

Author:

Garbulowski Mateusz, Diamanti Klev, Smolińska Karolina, Baltzer Nicholas, Stoll Patricia, Bornelöv Susanne, Øhrn Aleksander, Feuk Lars, Komorowski Jan

Abstract

Background: Machine learning involves strategies and algorithms that may assist bioinformatics analyses in terms of data mining and knowledge discovery. In several applications, viz. in Life Sciences, it is often more important to understand how a prediction was obtained rather than knowing what prediction was made. To this end so-called interpretable machine learning has been recently advocated. In this study, we implemented an interpretable machine learning package based on the rough set theory. An important aim of our work was provision of statistical properties of the models and their components.

Results: We present the R.ROSETTA package, which is an R wrapper of ROSETTA framework. The original ROSETTA functions have been improved and adapted to the R programming environment. The package allows for building and analyzing non-linear interpretable machine learning models. R.ROSETTA gathers combinatorial statistics via rule-based modelling for accessible and transparent results, well-suited for adoption within the greater scientific community. The package also provides statistics and visualization tools that facilitate minimization of analysis bias and noise. The R.ROSETTA package is freely available at https://github.com/komorowskilab/R.ROSETTA. To illustrate the usage of the package, we applied it to a transcriptome dataset from an autism case–control study. Our tool provided hypotheses for potential co-predictive mechanisms among features that discerned phenotype classes. These co-predictors represented neurodevelopmental and autism-related genes.

Conclusions: R.ROSETTA provides new insights for interpretable machine learning analyses and knowledge-based systems. We demonstrated that our package facilitated detection of dependencies for autism-related genes. Although the sample application of R.ROSETTA illustrates transcriptome data analysis, the package can be used to analyze any data organized in decision tables.

Funder

Foundation for the National Institutes of Health

Uppsala Universitet

Vetenskapsrådet

Polska Akademia Nauk

Uppsala University

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Structural Biology

Link

http://link.springer.com/content/pdf/10.1186/s12859-021-04049-z.pdf

Reference: 81 articles.

1. Molnar C. Interpretable Machine Learning. Lulu.com; 2020.

2. Doshi-Velez F, Kim B. Towards a rigorous science of interpretable machine learning. arXiv preprint arXiv:1702.08608. 2017.

3. Azodi CB, Tang J, Shiu S-H. Opening the Black Box: Interpretable machine learning for geneticists. Trends in Genetics. 2020.

4. Pawlak Z. Rough sets. Int J Comput Inform Sci. 1982;11(5):341–56.

5. Komorowski J, Pawlak Z, Polkowski L, Skowron A. Rough sets: a tutorial. In: Rough fuzzy hybridization: a new trend in decision-making. 1999; pp. 3–98.

Cited by 17 articles.

1. Using machine learning methods to study the tumour microenvironment and its biomarkers in osteosarcoma metastasis;Heliyon;2024-04

2. A Data-Driven and Knowledge-Based Decision Support System for Construction Planning and Control;2024

3. A practical study of methods for deriving insightful attribute importance rankings using decision bireducts;Information Sciences;2023-10

4. The role of artificial neural networks in prediction of severe acute pancreatitis associated acute respiratory distress syndrome: A retrospective study;Medicine;2023-07-21

5. Using Machine Learning Methods to Study Colorectal Cancer Tumor Micro-Environment and Its Biomarkers;International Journal of Molecular Sciences;2023-07-06