A data‐driven genome annotation approach for cassava

Author:

Chenna Swetha1,Ivanov Maxim1,Nielsen Tue Kjærgaard1,Chalenko Karina1,Olsen Evy1,Jørgensen Kirsten1,Sandelin Albin23,Marquardt Sebastian1ORCID

Affiliation:

1. Department of Plant and Environmental Sciences, Copenhagen Plant Science Centre University of Copenhagen Thorvaldsensvej 40 Frederiskberg C 1871 Denmark

2. Department of Biology University of Copenhagen Ole Maaløes Vej 5 Copenhagen N DK2200 Denmark

3. Biotech Research and Innovation Centre University of Copenhagen Ole Maaløes Vej 5 Copenhagen N DK2200 Denmark

Abstract

SUMMARYGenome annotation files play a critical role in dictating the quality of downstream analyses by providing essential predictions for gene positions and structures. These files are pivotal in decoding the complex information encoded within DNA sequences. Here, we generated experimental data resolving RNA 5′‐ and 3′‐ends as well as full‐length RNAs for cassava TME12 sticklings in ambient temperature and cold. We used these data to generate genome annotation files using the TranscriptomeReconstructoR (TR) tool. A careful comparison to high‐quality genome annotations suggests that our new TR genome annotations identified additional genes, resolved the transcript boundaries more accurately and identified additional RNA isoforms. We enhanced existing cassava genome annotation files with the information from TR that maintained the different transcript models as RNA isoforms. The resultant merged annotation was subsequently utilized for comprehensive analysis. To examine the effects of genome annotation files on gene expression studies, we compared the detection of differentially expressed genes during cold using the same RNA‐seq data but alternative genome annotation files. We found that our merged genome annotation that included cold‐specific TR gene models identified about twice as many cold‐induced genes. These data indicate that environmentally induced genes may be missing in off‐the‐shelf genome annotation files. In conclusion, TR offers the opportunity to enhance crop genome annotations with implications for the discovery of differentially expressed candidate genes during plant–environment interactions.

Funder

Novo Nordisk Fonden

H2020 European Research Council

Carlsbergfondet

Danmarks Frie Forskningsfond

Publisher

Wiley

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3