Abstract
ABSTRACTThe human genome contains numerous duplicated regions, such as parent-pseudogene pairs, causing sequencing reads to align equally well to either gene. The extent to which this ambiguity complicates transcriptomic analyses is currently unknown. This is concerning as many parent genes have been linked to disease, includingGBA1,causally linked to both Parkinson’s and Gaucher disease. We find that most of the short sequencing reads that map toGBA1, also map to its pseudogene,GBAP1. Using long-read RNA-sequencing in human brain, where all reads mapped uniquely, we demonstrate significant differences in expression compared to short-read data. We identify novel transcripts from bothGBA1andGBAP1, including protein-coding transcripts that are translatedin vitroand detected in proteomic data, but that lack GCase activity. By combining long-read with single-nuclear RNA-sequencing to analyse brain-relevant cell types we demonstrate that transcript expression varies by brain region with cell-type-selectivity. Taken together, these results suggest a non-lysosomal function for both GBA1 and GBAP1 in brain. Finally, we demonstrate that inaccuracies in annotation are widespread among parent genes, with implications for many human diseases.
Publisher
Cold Spring Harbor Laboratory
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献