Adaptive digital tissue deconvolution

Author:

Görtler Franziska12ORCID,Mensching-Buhr Malte3,Skaar Ørjan4,Schrod Stefan3ORCID,Sterr Thomas5,Schäfer Andreas5,Beißbarth Tim3ORCID,Joshi Anagha6ORCID,Zacharias Helena U7ORCID,Grellscheid Sushma Nagaraja1ORCID,Altenbuchinger Michael3ORCID

Affiliation:

1. Computational Biology Unit, Department of Biological Sciences, University of Bergen , N-5008 Bergen, Norway

2. Department of Oncology and Medical Physics, Haukeland University Hospital , 5021 Bergen, Norway

3. Department of Medical Bioinformatics, University Medical Center Göttingen , 37075 Göttingen, Germany

4. Department of Informatics, Computational Biology Unit, University of Bergen , N-5008 Bergen, Norway

5. Institute of Theoretical Physics, University of Regensburg , 93053 Regensburg, Germany

6. Department of Clinical Science, Computational Biology Unit, University of Bergen , N-5008 Bergen, Norway

7. Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, Hannover Medical School , 30625 Hannover, Germany

Abstract

Abstract Motivation The inference of cellular compositions from bulk and spatial transcriptomics data increasingly complements data analyses. Multiple computational approaches were suggested and recently, machine learning techniques were developed to systematically improve estimates. Such approaches allow to infer additional, less abundant cell types. However, they rely on training data which do not capture the full biological diversity encountered in transcriptomics analyses; data can contain cellular contributions not seen in the training data and as such, analyses can be biased or blurred. Thus, computational approaches have to deal with unknown, hidden contributions. Moreover, most methods are based on cellular archetypes which serve as a reference; e.g. a generic T-cell profile is used to infer the proportion of T-cells. It is well known that cells adapt their molecular phenotype to the environment and that pre-specified cell archetypes can distort the inference of cellular compositions. Results We propose Adaptive Digital Tissue Deconvolution (ADTD) to estimate cellular proportions of pre-selected cell types together with possibly unknown and hidden background contributions. Moreover, ADTD adapts prototypic reference profiles to the molecular environment of the cells, which further resolves cell-type specific gene regulation from bulk transcriptomics data. We verify this in simulation studies and demonstrate that ADTD improves existing approaches in estimating cellular compositions. In an application to bulk transcriptomics data from breast cancer patients, we demonstrate that ADTD provides insights into cell-type specific molecular differences between breast cancer subtypes. Availability and implementation A python implementation of ADTD and a tutorial are available at Gitlab and zenodo (doi:10.5281/zenodo.7548362).

Funder

Deutsche Forschungsgemeinschaft

German Research Foundation

Digital Tissue Deconvolution—Aus Einzelzelldaten lernen

European Union’s Horizon 2020

Trond Mohn Stiftelse

German Federal Ministry of Education and Research

BMBF

DFG

Publisher

Oxford University Press (OUP)

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3