Curating clinically relevant transcripts for the interpretation of sequence variants
Author:
DiStefano Marina T.,Hemphill Sarah E.,Cushman Brandon J.,Bowser Mark J.,Hynes Elizabeth,Grant Andrew R.,Siegert Rebecca K.,Oza Andrea M.,Gonzalez Michael A.,Amr Sami S.,Rehm Heidi L.,Abou Tayoun Ahmad N.
Abstract
AbstractVariant interpretation depends on accurate annotations using biologically relevant transcripts. We have developed a systematic strategy for designating primary transcripts, and applied it to 109 hearing loss-associated genes that were divided into 3 categories. Category 1 genes (n=38) had a single transcript, Category 2 genes (n=32) had multiple transcripts, but a single transcript was sufficient to represent all exons, and Category 3 genes (n=38) had multiple transcripts with unique exons. Transcripts were curated with respect to gene expression reported in the literature and the Genotype-Tissue Expression Project. In addition, high frequency loss of function variants in the Genome Aggregation Database, and disease-causing variants in ClinVar and the Human Gene Mutation Database across the 109 genes were queried. These data were used to classify exons as "clinically relevant", "uncertain significance", or "clinically insignificant". Interestingly, 7% of all exons, containing >124 "clinically significant" variants, were of “uncertain significance”. Finally, we used exon-level next generation sequencing quality metrics generated at two clinical labs, and identified a total of 43 technically challenging exons in 20 different genes that had inadequate coverage and/or homology issues which might lead to false variant calls. We have demonstrated that transcript analysis plays a critical role in accurate clinical variant interpretation.
Publisher
Cold Spring Harbor Laboratory
Reference97 articles.
1. Richards S , Aziz N , Bale S , Bick D , Das S , Gastier-Foster J , Grody WW , Hegde M , Lyon E , Spector E , Voelkerding K , Rehm HL : Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med, 17:405–424. 2. Harrow J , Frankish A , Gonzalez JM , Tapanari E , Diekhans M , Kokocinski F , Aken BL , Barrell D , Zadissa A , Searle S , Barnes I , Bignell A , Boychenko V , Hunt T , Kay M , Mukherjee G , Rajan J , Despacio-Reyes G , Saunders G , Steward C , Harte R , Lin M , Howald C , Tanzer A , Derrien T , Chrast J , Walters N , Balasubramanian S , Pei B , Tress M , Rodriguez JM , Ezkurdia I , van Baren J , Brent M , Haussler D , Kellis M , Valencia A , Reymond A , Gerstein M , Guigo R , Hubbard TJ : GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res, 22:1760–1774. 3. Casper J , Zweig AS , Villarreal C , Tyner C , Speir ML , Rosenbloom KR , Raney BJ , Lee CM , Lee BT , Karolchik D , Hinrichs AS , Haeussler M , Guruvadoo L , Navarro Gonzalez J, Gibson D , Fiddes IT , Eisenhart C , Diekhans M , Clawson H , Barber GP , Armstrong J , Haussler D , Kuhn RM , Kent WJ : The UCSC Genome Browser database: 2018 update. Nucleic Acids Res, 46:D762– D769. 4. Rosenbloom KR , Sloan CA , Malladi VS , Dreszer TR , Learned K , Kirkup VM , Wong MC , Maddren M , Fang R , Heitner SG , Lee BT , Barber GP , Harte RA , Diekhans M , Long JC , Wilder SP , Zweig AS , Karolchik D , Kuhn RM , Haussler D , Kent WJ : ENCODE data in the UCSC Genome Browser: year 5 update. Nucleic Acids Res, 41:D56–63. 5. O’Leary NA , Wright MW , Brister JR , Ciufo S , Haddad D , McVeigh R , Rajput B , Robbertse B , Smith-White B , Ako-Adjei D , Astashyn A , Badretdin A , Bao Y , Blinkova O , Brover V , Chetvernin V , Choi J , Cox E , Ermolaeva O , Farrell CM , Goldfarb T , Gupta T , Haft D , Hatcher E , Hlavina W , Joardar VS , Kodali VK , Li W , Maglott D , Masterson P , McGarvey KM , Murphy MR , O’Neill K , Pujar S , Rangwala SH , Rausch D , Riddick LD , Schoch C , Shkeda A , Storz SS , Sun H , Thibaud-Nissen F , Tolstoy I , Tully RE , Vatsan AR , Wallin C , Webb D , Wu W , Landrum MJ , Kimchi A , Tatusova T , DiCuccio M , Kitts P , Murphy TD , Pruitt KD : Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res, 44:D733–745.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|