Comparing deep learning and pathologist quantification of cell-level PD-L1 expression in non-small cell lung cancer whole-slide images

Published:2024-03-26 Issue:1 Volume:14 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

van Eekelen Leander, Spronck Joey, Looijen-Salamon Monika, Vos Shoko, Munari Enrico, Girolami Ilaria, Eccher Albino, Acs Balazs, Boyaci Ceren, de Souza Gabriel Silva, Demirel-Andishmand Muradije, Meesters Luca Dulce, Zegers Daan, van der Woude Lieke, Theelen Willemijn, van den Heuvel Michel, Grünberg Katrien, van Ginneken Bram, van der Laak Jeroen, Ciompi Francesco

Abstract

Programmed death-ligand 1 (PD-L1) expression is currently used in the clinic to assess eligibility for immune-checkpoint inhibitors via the tumor proportion score (TPS), but its efficacy is limited by high interobserver variability. Multiple papers have presented systems for the automatic quantification of TPS, but none report on the task of determining cell-level PD-L1 expression, and they often restrict their evaluation to a single PD-L1 monoclonal antibody or clinical center. In this paper, we report on a deep learning algorithm for detecting PD-L1 negative and positive tumor cells at a cellular level and evaluate it on a cell-level reference standard established by six readers on a multi-centric, multi PD-L1 assay dataset. This reference standard also provides, for the first time, a benchmark for computer vision algorithms. In addition, in line with other papers, we evaluate our algorithm at slide level by measuring the agreement between the algorithm and six pathologists on TPS quantification. We find a moderately low interobserver agreement at the cell level (mean reader-reader F1 score = 0.68), with our algorithm scoring slightly lower (mean reader-AI F1 score = 0.55), especially for cases from the clinical center not included in the training set. Despite this, we find good AI-pathologist agreement on quantifying TPS compared to the interobserver agreement (mean reader-reader Cohen’s kappa = 0.54, 95% CI 0.26–0.81; mean reader-AI kappa = 0.49, 95% CI 0.27–0.72). In conclusion, our deep learning algorithm demonstrates promise in detecting PD-L1 expression at a cellular level and exhibits favorable agreement with pathologists in quantifying the TPS. We publicly release our models for use via the Grand-Challenge platform.
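
As an illustration of the slide-level agreement measure mentioned in the abstract, the following is a minimal Python sketch of how a TPS could be derived from cell counts and how a mean pairwise Cohen's kappa between readers could be computed. It is not the authors' code. The function names, the unweighted form of kappa, and the TPS cutoffs (below 1%, 1 to 49%, 50% and above) are assumptions made for this example; the abstract does not state which categories or weighting were used.

```python
from collections import Counter
from itertools import combinations


def tumor_proportion_score(n_positive, n_negative):
    """TPS as a percentage: PD-L1 positive tumor cells over all tumor cells."""
    total = n_positive + n_negative
    if total == 0:
        raise ValueError("no tumor cells counted")
    return 100.0 * n_positive / total


def tps_category(tps):
    """Bin a TPS percentage into three classes (assumed cutoffs: 1% and 50%)."""
    if tps < 1.0:
        return 0
    if tps < 50.0:
        return 1
    return 2


def cohen_kappa(a, b):
    """Unweighted Cohen's kappa between two equally long lists of labels."""
    if len(a) != len(b) or not a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(a)
    # Fraction of slides on which both raters chose the same category.
    observed = sum(x == y for x, y in zip(a, b)) / n
    # Agreement expected by chance, from each rater's label frequencies.
    count_a, count_b = Counter(a), Counter(b)
    expected = sum(count_a[k] * count_b[k] for k in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used a single identical label throughout
    return (observed - expected) / (1.0 - expected)


def mean_pairwise_kappa(ratings):
    """Average kappa over all rater pairs; `ratings` holds one list per rater."""
    pairs = list(combinations(ratings, 2))
    return sum(cohen_kappa(a, b) for a, b in pairs) / len(pairs)
```

With one list of TPS categories per reader, `mean_pairwise_kappa` gives a reader-reader figure; a reader-AI figure would instead average `cohen_kappa` between the algorithm's categories and each reader's.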

Funder

Nederlandse Organisatie voor Wetenschappelijk Onderzoek

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s41598-024-57067-1.pdf


Cited by 2 articles.

1. Autofluorescence Virtual Staining System for H&E Histology and Multiplex Immunofluorescence Applied to Immuno-Oncology Biomarkers in Lung Cancer;2024-06-13

2. A Pipeline for Evaluation of Machine Learning/Artificial Intelligence Models to Quantify Programmed Death Ligand 1 Immunohistochemistry;Laboratory Investigation;2024-06