Affiliation:
1. Department of Biochemistry, University of Otago, PO Box 56, Dunedin 9054, New Zealand
2. Bio-Protection Research Centre, University of Otago, PO Box 56, Dunedin 9054, New Zealand
Abstract
Abstract
Variants within the non-coding genome are frequently associated with phenotypes in genome-wide association studies. These non-coding regions may be involved in the regulation of gene expression, encode functional non-coding RNAs, or influence splicing and other cellular functions. We have curated a list of characterized non-coding human genome variants based on the published evidence that indicates phenotypic consequences of the variation. In order to minimize annotation errors, two curators have independently verified the supporting evidence for pathogenicity of each non-coding variant in the published literature. The database consists of 721 non-coding variants linked to the published literature describing the evidence of functional consequences. We have also sampled 7228 covariate-matched benign controls, that have a population frequency of over 5%, from the single nucleotide polymorphism database (dbSNP151) database. These were sampled controlling for potential confounding factors such as linkage with pathogenic variants, annotation type (untranslated region, intron, intergenic, etc.) and variant type (substitution or indel). The dataset presented here represents a curated repository, with a potential use for the training or evaluation of algorithms used in the prediction of non-coding variant functionality.
Database URL: https://github.com/Gardner-BinfLab/ncVarDB.
Funder
Dean’s Bequest Fund
New Zealand Tertiary Education Commission Centre of Research Excellence (CoRE) grant to the Bio-Protection Research Centre
Publisher
Oxford University Press (OUP)
Subject
General Agricultural and Biological Sciences,General Biochemistry, Genetics and Molecular Biology,Information Systems
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献