Self-supervised learning for characterising histomorphological diversity and spatial RNA expression prediction across 23 human tissue types-Reference-Cited by-同舟云学术

Self-supervised learning for characterising histomorphological diversity and spatial RNA expression prediction across 23 human tissue types

Published:2024-07-13 Issue:1 Volume:15 Page:
ISSN:2041-1723
Container-title:Nature Communications
language:en
Short-container-title:Nat Commun

Author:

Cisternino Francesco^ORCID,Ometto Sara^ORCID,Chatterjee Soumick,Giacopuzzi Edoardo^ORCID,Levine Adam P.^ORCID,Glastonbury Craig A.^ORCID

Abstract

AbstractAs vast histological archives are digitised, there is a pressing need to be able to associate specific tissue substructures and incident pathology to disease outcomes without arduous annotation. Here, we learn self-supervised representations using a Vision Transformer, trained on 1.7 M histology images across 23 healthy tissues in 838 donors from the Genotype Tissue Expression consortium (GTEx). Using these representations, we can automatically segment tissues into their constituent tissue substructures and pathology proportions across thousands of whole slide images, outperforming other self-supervised methods (43% increase in silhouette score). Additionally, we can detect and quantify histological pathologies present, such as arterial calcification (AUROC = 0.93) and identify missing calcification diagnoses. Finally, to link gene expression to tissue morphology, we introduce RNAPath, a set of models trained on 23 tissue types that can predict and spatially localise individual RNA expression levels directly from H&E histology (mean genes significantly regressed = 5156, FDR 1%). We validate RNAPath spatial predictions with matched ground truth immunohistochemistry for several well characterised control genes, recapitulating their known spatial specificity. Together, these results demonstrate how self-supervised machine learning when applied to vast histological archives allows researchers to answer questions about tissue pathology, its spatial organisation and the interplay between morphological tissue variability and gene expression.

Funder

Impetus Grant - Norm Group. https://impetusgrants.org/

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s41467-024-50317-w.pdf

Reference59 articles.

1. Glastonbury, C. A. et al. Machine Learning based histology phenotyping to investigate the epidemiologic and genetic basis of adipocyte morphology and cardiometabolic traits. PLoS Comput. Biol. 16, e1008044 (2020).

2. Komura, D. et al. Restaining-based annotation for cancer histology segmentation to overcome annotation-related limitations among pathologists. Patterns (N. Y) 4, 100688 (2023).

3. Lu, M. Y. et al. Data-efficient and weakly supervised computational pathology on whole-slide images. Nat. Biomed. Eng. 5, 555–570 (2021).

4. Ferlaino, M. et al. Towards deep cellular phenotyping in placental histology. arXiv [cs.CV] (2018).

5. Fu, Y. et al. Pan-cancer computational histopathology reveals mutations, tumor composition and prognosis. Nat. Cancer 1, 800–810 (2020).