Fusion transcripts and their genomic breakpoints in polyadenylated and ribosomal RNA–minus RNA sequencing data

Author:

Hoogstrate Youri12ORCID,Komor Malgorzata A3ORCID,Böttcher René14ORCID,van Riet Job5ORCID,van de Werken Harmen J G16ORCID,van Lieshout Stef7,Hoffmann Ralf8ORCID,van den Broek Evert39,Bolijn Anne S3,Dits Natasja1,Sie Daoud3ORCID,van der Meer David10,Pepers Floor10,Bangma Chris H1ORCID,van Leenders Geert J L H11ORCID,Smid Marcel5ORCID,French Pim J2ORCID,Martens John W M5,van Workum Wilbert12,van der Spek Peter J11,Janssen Bart10,Caldenhoven Eric13,Rausch Christian14,de Jong Mark15,Stubbs Andrew P11,Meijer Gerrit A3,Fijneman Remond J A3ORCID,Jenster Guido W1ORCID

Affiliation:

1. Department of Urology, Erasmus Medical Center Cancer Institute, Wytemaweg 80, Rotterdam 3015GD, The Netherlands

2. Department of Neurology, Erasmus Medical Center Cancer Institute, Wytemaweg 80, Rotterdam 3015GD, The Netherlands

3. Department of Pathology, Netherlands Cancer Institute, Amsterdam 3015GD, The Netherlands

4. Department of Life Sciences, Barcelona Supercomputing Center, Barcelona 08034, Spain

5. Department of Medical Oncology, Erasmus Medical Center, Rotterdam 3015GD, The Netherlands

6. Cancer Computational Biology Center, Erasmus Medical Center, Rotterdam 3015GD, The Netherlands

7. Hartwig Medical Foundation, Amsterdam 1098XH, The Netherlands

8. Philips Research, Eindhoven 5656AE, The Netherlands

9. Department of Pathology and Medical Biology, University Medical Center Groningen, Groningen 9713GZ, The Netherlands

10. GenomeScan, Leiden 2333BZ, The Netherlands

11. Department of Pathology, Erasmus Medical Center, Rotterdam 3015GD, The Netherlands

12. Limes Innovations, Ruigekade 1, Leiderdorp 2351SX, The Netherlands

13. Lygature, Utrecht 3521AL, The Netherlands

14. BioLizard N.V., Ghent 9000, Belgium

15. VHLGenetics, Wageningen 6708PW, The Netherlands

Abstract

Abstract Background Fusion genes are typically identified by RNA sequencing (RNA-seq) without elucidating the causal genomic breakpoints. However, non–poly(A)-enriched RNA-seq contains large proportions of intronic reads that also span genomic breakpoints. Results We have developed an algorithm, Dr. Disco, that searches for fusion transcripts by taking an entire reference genome into account as search space. This includes exons but also introns, intergenic regions, and sequences that do not meet splice junction motifs. Using 1,275 RNA-seq samples, we investigated to what extent genomic breakpoints can be extracted from RNA-seq data and their implications regarding poly(A)-enriched and ribosomal RNA–minus RNA-seq data. Comparison with whole-genome sequencing data revealed that most genomic breakpoints are not, or minimally, transcribed while, in contrast, the genomic breakpoints of all 32 TMPRSS2-ERG–positive tumours were present at RNA level. We also revealed tumours in which the ERG breakpoint was located before ERG, which co-existed with additional deletions and messenger RNA that incorporated intergenic cryptic exons. In breast cancer we identified rearrangement hot spots near CCND1 and in glioma near CDK4 and MDM2 and could directly associate this with increased expression. Furthermore, in all datasets we find fusions to intergenic regions, often spanning multiple cryptic exons that potentially encode neo-antigens. Thus, fusion transcripts other than classical gene-to-gene fusions are prominently present and can be identified using RNA-seq. Conclusion By using the full potential of non–poly(A)-enriched RNA-seq data, sophisticated analysis can reliably identify expressed genomic breakpoints and their transcriptional effects.

Funder

Center for Translational Molecular Medicine

Publisher

Oxford University Press (OUP)

Subject

Computer Science Applications,Health Informatics

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3