A whole-slide foundation model for digital pathology from real-world data-Reference-Cited by-同舟云学术

A whole-slide foundation model for digital pathology from real-world data

Published:2024-05-22 Issue:8015 Volume:630 Page:181-188
ISSN:0028-0836
Container-title:Nature
language:en
Short-container-title:Nature

Author:

Xu Hanwen,Usuyama Naoto,Bagga Jaspreet,Zhang Sheng,Rao Rajesh,Naumann Tristan^ORCID,Wong Cliff,Gero Zelalem,González Javier,Gu Yu^ORCID,Xu Yanbo^ORCID,Wei Mu,Wang Wenhui,Ma Shuming,Wei Furu,Yang Jianwei,Li Chunyuan,Gao Jianfeng,Rosemon Jaylen,Bower Tucker^ORCID,Lee Soohee,Weerasinghe Roshanthi,Wright Bill J.,Robicsek Ari^ORCID,Piening Brian^ORCID,Bifulco Carlo^ORCID,Wang Sheng^ORCID,Poon Hoifung^ORCID

Abstract

AbstractDigital pathology poses unique computational challenges, as a standard gigapixel slide may comprise tens of thousands of image tiles1–3. Prior models have often resorted to subsampling a small portion of tiles for each slide, thus missing the important slide-level context4. Here we present Prov-GigaPath, a whole-slide pathology foundation model pretrained on 1.3 billion 256 × 256 pathology image tiles in 171,189 whole slides from Providence, a large US health network comprising 28 cancer centres. The slides originated from more than 30,000 patients covering 31 major tissue types. To pretrain Prov-GigaPath, we propose GigaPath, a novel vision transformer architecture for pretraining gigapixel pathology slides. To scale GigaPath for slide-level learning with tens of thousands of image tiles, GigaPath adapts the newly developed LongNet5 method to digital pathology. To evaluate Prov-GigaPath, we construct a digital pathology benchmark comprising 9 cancer subtyping tasks and 17 pathomics tasks, using both Providence and TCGA data6. With large-scale pretraining and ultra-large-context modelling, Prov-GigaPath attains state-of-the-art performance on 25 out of 26 tasks, with significant improvement over the second-best method on 18 tasks. We further demonstrate the potential of Prov-GigaPath on vision–language pretraining for pathology7,8 by incorporating the pathology reports. In sum, Prov-GigaPath is an open-weight foundation model that achieves state-of-the-art performance on various digital pathology tasks, demonstrating the importance of real-world data and whole-slide modelling.

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s41586-024-07441-w.pdf

Reference58 articles.

1. Campanella, G. et al. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nat. Med. 25, 1301–1309 (2019).

2. Lu, M. Y. et al. Data-efficient and weakly supervised computational pathology on whole-slide images. Nat. Biomed. Eng. 5, 555–570 (2021).

3. Song, A. H. et al. Artificial intelligence for digital and computational pathology. Nat. Rev. Bioeng. 1, 930–949 (2023).

4. Ilse, M., Tomczak, J. & Welling, M. Attention-based deep multiple instance learning. In Proc. 35th International Conference on Machine Learning (eds Dy, J. & Krause, A.) 2127–2136 (IMLS, 2018).

5. Ding, J. et al. Longnet: scaling transformers to 1,000,000,000 tokens. Preprint at https://doi.org/10.48550/arXiv.2307.02486 (2023).

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Is Histopathology Deep Learning Artificial Intelligence the Future of Precision Oncology?;Journal of Clinical Oncology;2024-09-11

2. Weakly Supervised Vector Quantization for Whole Slide Images Classification;2024-09-02

3. Foundational Models for Pathology and Endoscopy Images: Application for Gastric Inflammation;Diagnostics;2024-08-30

4. The Most Disruptive Near-Term Use of AI in Cancer Care: Patient Empowerment Through Software Agents;AI in Precision Oncology;2024-08-30

5. CellRegNet: Point Annotation-Based Cell Detection in Histopathological Images via Density Map Regression;Bioengineering;2024-08-10