Abstract
Catalogue of Life (COL) brings together the efforts and contributions of taxonomists from around the world, and addresses the needs of researchers, policy-makers, environmental managers and the wider public for a consistent, up-to-date and authoritative listing of all the world’s known species. The names that science gives to species are fundamental tools that allow us to refer to these units of biodiversity. Knowing the name for a species unlocks everything that has been learned about its biology, distribution and relevance to humans. Every day, taxonomists continue to publish new scientific names and refine our understanding of the world’s species.
Over the past 10 years, game-changing progress has been made with the Catalogue of Life and its authoritative taxonomic name indexing services. Major steps were made towards content completion (over 2.1 million species out of all described 2.4 million are now documented and validated through COL (Bánki et al. 2023). The COL data infrastructure and services were rebuilt in collaboration with the Global Biodiversity Information Facility (GBIF). This resulted in the GBIF-COL ChecklistBank, an open global repository for publishing and sharing taxonomic and nomenclatural datasets (> 48,000 datasets, including over 160 Global Species Databases).
The data managing/editing process underlying the integration of data into a single checklist (index) could be more transparent with clearly documented protocols and procedures, and is today far too time consuming. In the ChecklistBank infrastructure, there are thousands of data sources at our disposal and we need a powerful, semi-automated, community-expertise based, quality assurance process to warrant the best possible product for users of taxonomic checklists.
COL aspires to document, with as much detail as possible, the current quality control and quality assurance processes as they relate to activities that occur at the following levels: 1) data providers, 2) pipeline and data conversions, 3) ChecklistBank infrastructure, and 4) checklist output. Ideally, this should happen throughout the consortium of partners that help the taxonomic community with mobilising and editing authoritative taxonomic data, including initiatives like Atlas of Living Australia, Catalogue of Life China, EDIT Platform for Cybertaxonomy, Integrated Taxonomic Information System, Freshwater Animal Diversity Assessment, TaxonWorks / Species Files, World Flora Online / Rhakhis, World Register of Marine Species / LifeWatch Aphia.
In recent years more attention has been given to the principles of creating an authoritative list of the world's species (Garnett et al. 2020). COL has developed and adopted new criteria for the inclusion of taxonomic data sources in the COL Checklist (Hobern et al. 2021). And ChecklistBank, certainly provides more avenues for (automated) quality checks and procedures.
But to guarantee the highest confidence in data, we are in need of a transparent and consistent data quality assurance and data quality control mechanism from the creation of taxonomic data, to data publishing, to the delivery into taxonomic data products and the global list of known species. We will present the current status of quality control and quality assurance processes in Catalogue of Life, and discuss what should constitute a quality control and quality assurance mechanism for species list building.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献