hist2RNA: An Efficient Deep Learning Architecture to Predict Gene Expression from Breast Cancer Histopathology Images

Author:

Mondol Raktim Kumar1ORCID,Millar Ewan K. A.2345ORCID,Graham Peter H.36,Browne Lois6,Sowmya Arcot1,Meijering Erik1ORCID

Affiliation:

1. School of Computer Science and Engineering, UNSW Sydney, Kensington, NSW 2052, Australia

2. Department of Anatomical Pathology, NSW Health Pathology, St. George Hospital, Kogarah, NSW 2217, Australia

3. St. George and Sutherland Clinical School, UNSW Sydney, Kensington, NSW 2052, Australia

4. Faculty of Medicine and Health Sciences, Sydney Western University, Campbelltown, NSW 2560, Australia

5. University of Technology Sydney, Ultimo, NSW 2007, Australia

6. Cancer Care Centre, St George Hospital, Sydney, NSW 2217, Australia

Abstract

Gene expression can be used to subtype breast cancer with improved prediction of risk of recurrence and treatment responsiveness over that obtained using routine immunohistochemistry (IHC). However, in the clinic, molecular profiling is primarily used for ER+ breast cancer, which is costly, tissue destructive, requires specialised platforms, and takes several weeks to obtain a result. Deep learning algorithms can effectively extract morphological patterns in digital histopathology images to predict molecular phenotypes quickly and cost-effectively. We propose a new, computationally efficient approach called hist2RNA inspired by bulk RNA sequencing techniques to predict the expression of 138 genes (incorporated from 6 commercially available molecular profiling tests), including luminal PAM50 subtype, from hematoxylin and eosin (H&E)-stained whole slide images (WSIs). The training phase involves the aggregation of extracted features for each patient from a pretrained model to predict gene expression at the patient level using annotated H&E images from The Cancer Genome Atlas (TCGA, n = 335). We demonstrate successful gene prediction on a held-out test set (n = 160, corr = 0.82 across patients, corr = 0.29 across genes) and perform exploratory analysis on an external tissue microarray (TMA) dataset (n = 498) with known IHC and survival information. Our model is able to predict gene expression and luminal PAM50 subtype (Luminal A versus Luminal B) on the TMA dataset with prognostic significance for overall survival in univariate analysis (c-index = 0.56, hazard ratio = 2.16 (95% CI 1.12–3.06), p < 5 × 10−3), and independent significance in multivariate analysis incorporating standard clinicopathological variables (c-index = 0.65, hazard ratio = 1.87 (95% CI 1.30–2.68), p < 5 × 10−3). The proposed strategy achieves superior performance while requiring less training time, resulting in less energy consumption and computational cost compared to patch-based models. Additionally, hist2RNA predicts gene expression that has potential to determine luminal molecular subtypes which correlates with overall survival, without the need for expensive molecular testing.

Publisher

MDPI AG

Subject

Cancer Research,Oncology

Reference41 articles.

1. Understanding breast cancer as a global health concern;Wilkinson;Br. J. Radiol.,2022

2. Foundation, N.B.C. (2022, November 20). Breast Cancer Stats in Australia. Available online: https://nbcf.org.au/about-breast-cancer/breast-cancer-stats/.

3. Association between socioeconomic factors at diagnosis and survival in breast cancer: A population-based study;Ji;Cancer Med.,2020

4. Breast cancer classification from histopathological images using patch-based deep learning modeling;Hirra;IEEE Access,2021

5. Radiomics in breast cancer classification and prediction;Conti;Semin. Cancer Biol.,2021

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3