Assessing performance of pathogenicity predictors using clinically relevant variant datasets-Reference-Cited by-同舟云学术

Assessing performance of pathogenicity predictors using clinically relevant variant datasets

Published:2020-08-25 Issue:8 Volume:58 Page:547-555
ISSN:0022-2593
Container-title:Journal of Medical Genetics
language:en
Short-container-title:J Med Genet

Author:

Gunning Adam C^ORCID,Fryer Verity,Fasham James^ORCID,Crosby Andrew H,Ellard Sian,Baple Emma L,Wright Caroline F^ORCID

Abstract

BackgroundPathogenicity predictors are integral to genomic variant interpretation but, despite their widespread usage, an independent validation of performance using a clinically relevant dataset has not been undertaken.MethodsWe derive two validation datasets: an ‘open’ dataset containing variants extracted from publicly available databases, similar to those commonly applied in previous benchmarking exercises, and a ‘clinically representative’ dataset containing variants identified through research/diagnostic exome and panel sequencing. Using these datasets, we evaluate the performance of three recent meta-predictors, REVEL, GAVIN and ClinPred, and compare their performance against two commonly used in silico tools, SIFT and PolyPhen-2.ResultsAlthough the newer meta-predictors outperform the older tools, the performance of all pathogenicity predictors is substantially lower in the clinically representative dataset. Using our clinically relevant dataset, REVEL performed best with an area under the receiver operating characteristic curve of 0.82. Using a concordance-based approach based on a consensus of multiple tools reduces the performance due to both discordance between tools and false concordance where tools make common misclassification. Analysis of tool feature usage may give an insight into the tool performance and misclassification.ConclusionOur results support the adoption of meta-predictors over traditional in silico tools, but do not support a consensus-based approach as in current practice.

Funder

Wellcome Trust

Publisher

BMJ

Subject

Genetics(clinical),Genetics

Reference38 articles.

1. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology

2. SIFT web server: predicting effects of amino acid substitutions on proteins

3. Performance of mutation pathogenicity prediction methods on missense variants

4. Ellard S , Baple E , Berry I , Forrester N , Turnbull C , Owens M . ACGS best practice guidelines for variant classification; 2019. https://www.acgs.uk.com/news/acgs-best-practice-guidelines-for-variant-classification-2019/

5. Amino Acid Difference Formula to Help Explain Protein Evolution

Cited by 71 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Preclinical alternative drug discovery programs for monogenic rare diseases. Should small molecules or gene therapy be used? The case of hereditary spastic paraplegias;Drug Discovery Today;2024-10

2. AI-derived comparative assessment of the performance of pathogenicity prediction tools on missense variants of breast cancer genes;Human Genomics;2024-09-11

3. Autosomal dominant stromal corneal dystrophy associated with a SPARCL1 missense variant;European Journal of Human Genetics;2024-08-21

4. Artificial Intelligence-Driven Prediction Revealed CFTR Associated with Therapy Outcome of Breast Cancer: A Feasibility Study;Oncology;2024-07-18

5. A New Era in Missense Variant Analysis: Statistical Insights and the Introduction of VAMPP-Score for Pathogenicity Assessment;2024-07-13