Optimizing Exome Captures in Species with Large Genomes Using Species-specific Repetitive DNA Blocker

Author:

Kesälahti RobertORCID,Kumpula Timo A.ORCID,Cervantes SandraORCID,Kujala Sonja T.ORCID,Mattila Tiina M.ORCID,Tyrmi Jaakko S.ORCID,Niskanen Alina K.ORCID,Rastas PasiORCID,Savolainen Outi,Pyhäjärvi TanjaORCID

Abstract

AbstractLarge and highly repetitive genomes are common. However, research interests usually lie within the non-repetitive parts of the genome, as they are more likely functional, and can be used to answer questions related to adaptation, selection, and evolutionary history. Exome capture is a cost-effective method for providing sequencing data from protein-coding parts of the genes. C0t-1 DNA blockers consist of repetitive DNA and are used in exome captures to prevent the hybridization of repetitive DNA sequences to capture baits or bait-bound genomic DNA. Universal blockers target repetitive regions shared by many species, while species-specific c0t-1 DNA is prepared from the DNA of the studied species, thus perfectly matching the repetitive DNA contents of the species. So far the use of species-specific c0t-1 DNA has been limited to a few model species. Here, we evaluated the performance of blocker treatments in exome captures ofPinus sylvestris, a widely distributed conifer species with a large (> 20 Gbp) and highly repetitive genome. We compared treatment with a commercial universal blocker to treatments with species-specific c0t-1 (30,000 ng and 60,000 ng). Species-specific c0t-1 captured more unique exons than the initial set of targets, reduced sequencing of tandem repeats, and produced more target regions with high read coverage and narrower depth distribution than the universal blocker. Based on our results, we recommend optimizing exome captures by using at least 60,000 ng species-specific c0t-1 DNA. It is relatively easy and fast to prepare and can also be used with existing bait set designs.

Publisher

Cold Spring Harbor Laboratory

Reference36 articles.

1. Andrews, S . (2010). FastQC: A Quality Control Tool for High Throughput Sequence Data. [Computer software]. http://www.bioinformatics.babraham.ac.uk/projects/fastqc/

2. Tandem repeats finder: a program to analyze DNA sequences

3. [29] Analysis of repeating DNA sequences by reassociation

4. FISH mapping and molecular organization of the major repetitive sequences of tomato

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3