Author:
Peng Xin,Li Qiang,Cheng Zhentao,Huang Xiaolei
Abstract
The biogeography field benefits more and more from the growth and application of genetic data such as nucleotide sequences and whole genomes. It has been perceived by scientists that genetic data may be imbalanced among different geographical regions and taxonomic groups. However, the lack of empirical evidence prevents the understanding of current data volume and distribution of genetic data. Based on the construction of a dataset including records for 365 millions of nucleotide sequences of Animalia, Plantae, and Fungi kingdoms, 6 millions of COI sequences of insects, 77 thousands of COI sequences of mammals, 220 thousands of rbcl sequences of Magnoliopsida, and 44 thousands of ITS sequences of Dothideomycetes, here we present evidence on geographical and taxonomical imbalance of the genetic data, identify major gaps and inappropriate practices in the production, application and sharing of genetic data. We then discuss our perspectives on how to fill up gaps and improve the quantity and quality of genetic data.
Funder
National Natural Science Foundation of China
Subject
Ecology,Ecology, Evolution, Behavior and Systematics
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献