Abstract
AbstractThe sequenced genomes in the Drosophila phylogeny is a central resource for comparative work supporting the understanding of the Drosophila melanogaster non-mammalian model system. These have also facilitated studying the selected and random differences that distinguish the thousands of extant species of Drosophila. However, full utility has been hampered by uneven genome annotation. We have generated a large expression profile dataset for nine species of Drosophila and trained a transcriptome assembly approach on Drosophila melanogaster to develop a pipeline that best matched the extensively curated annotation. We then applied this to the other species to add tens of thousands of new gene models per species. We also developed new orthologs to facilitate cross-species comparisons. We validated the new annotation of the distantly related Drosophila grimshawi with an extensive collection of newly sequenced cDNAs. This reannoation will facilitate understanding both the core commonalities and the species differences in this important group of model organisms.
Publisher
Cold Spring Harbor Laboratory
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献