Multi-omic integration by machine learning (MIMaL)

Author:

Dickinson Quinn12ORCID,Aufschnaiter Andreas3,Ott Martin34,Meyer Jesse G12ORCID

Affiliation:

1. Department of Biochemistry, Medical College of Wisconsin , Milwaukee, WI 53226, USA

2. Department of Computational Biomedicine, Cedars-Sinai Medical Center , Los Angeles, CA 90048, USA

3. Department of Biochemistry and Biophysics, Stockholm University , Stockholm, Sweden

4. Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, University of Gothenburg , Gothenburg, Sweden

Abstract

Abstract Motivation Cells respond to environments by regulating gene expression to exploit resources optimally. Recent advances in technologies allow for measuring the abundances of RNA, proteins, lipids and metabolites. These highly complex datasets reflect the states of the different layers in a biological system. Multi-omics is the integration of these disparate methods and data to gain a clearer picture of the biological state. Multi-omic studies of the proteome and metabolome are becoming more common as mass spectrometry technology continues to be democratized. However, knowledge extraction through the integration of these data remains challenging. Results Connections between molecules in different omic layers were discovered through a combination of machine learning and model interpretation. Discovered connections reflected protein control (ProC) over metabolites. Proteins discovered to control citrate were mapped onto known genetic and metabolic networks, revealing that these protein regulators are novel. Further, clustering the magnitudes of ProC over all metabolites enabled the prediction of five gene functions, each of which was validated experimentally. Two uncharacterized genes, YJR120W and YDL157C, were accurately predicted to modulate mitochondrial translation. Functions for three incompletely characterized genes were also predicted and validated, including SDH9, ISC1 and FMP52. A website enables results exploration and also MIMaL analysis of user-supplied multi-omic data. Availability and implementation The website for MIMaL is at https://mimal.app. Code for the website is at https://github.com/qdickinson/mimal-website. Code to implement MIMaL is at https://github.com/jessegmeyerlab/MIMaL. Supplementary information Supplementary data are available at Bioinformatics online.

Funder

United States National Institute of Health (NIH) NIGMS

Swedish research council and the Knut and Alice Wallenberg foundation

NIH

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability

Reference50 articles.

1. ClueGO: a cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks;Bindea;Bioinformatics,2009

2. The yeast gene order browser: combining curated homology and syntenic context reveals gene fate in polyploid species;Byrne;Genome Res.,2005

3. The analysis of yeast mitochondrial translation;Carlström;Methods Mol. Biol. (Clifton, N.J.),2021

4. Integrating multi-omics data through deep learning for accurate cancer prognosis prediction;Chai;Computers in Biology and Medicine,2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3