occTest: An integrated approach for quality control of species occurrence data

Author:

Serra‐Diaz Josep M.12ORCID,Borderieux Jeremy2ORCID,Maitner Brian3,Boonman Coline C. F.4ORCID,Park Daniel56,Guo Wen‐Yong7ORCID,Callebaut Arnaud2,Enquist Brian J.89ORCID,Svenning Jens‐C.4ORCID,Merow Cory1ORCID

Affiliation:

1. Department of Ecology and Evolutionary Biology, Eversource Energy Center University of Connecticut Storrs Connecticut USA

2. AgroParisTech, INRAE Silva Université de Lorraine Nancy France

3. Department of Geography University at Buffalo Buffalo New York USA

4. Department of Biology, Center for Ecological Dynamics in a Novel Biosphere (ECONOVO) & Center for Biodiversity Dynamics in a Changing World (BIOCHANGE), Department of Biology Aarhus University Aarhus C Denmark

5. Department of Biological Sciences Purdue University West Lafayette Indiana USA

6. Purdue Center for Plant Biology Purdue University West Lafayette Indiana USA

7. Research Center for Global Change and Complex Ecosystems & Zhejiang Tiantong Forest Ecosystem National Observation and Research Station, School of Ecological and Environmental Sciences East China Normal University Shanghai P.R. China

8. Department of Ecology and Evolutionary Biology University of Arizona Tucson Arizona USA

9. Santa Fe Institute Santa Fe New Mexico USA

Abstract

AbstractAimSpecies occurrence data are valuable information that enables one to estimate geographical distributions, characterize niches and their evolution, and guide spatial conservation planning. Rapid increases in species occurrence data stem from increasing digitization and aggregation efforts, and citizen science initiatives. However, persistent quality issues in occurrence data can impact the accuracy of scientific findings, underscoring the importance of filtering erroneous occurrence records in biodiversity analyses.InnovationWe introduce an R package, occTest, that synthesizes a growing open‐source ecosystem of biodiversity cleaning workflows to prepare occurrence data for different modelling applications. It offers a structured set of algorithms to identify potential problems with species occurrence records by employing a hierarchical organization of multiple tests. The workflow has a hierarchical structure organized in testPhases (i.e. cleaning vs. testing) that encompass different testBlocks grouping different testTypes (e.g. environmental outlier detection), which may use different testMethods (e.g. Rosner test, jacknife,etc.). Four different testBlocks characterize potential problems in geographic, environmental, human influence and temporal dimensions. Filtering and plotting functions are incorporated to facilitate the interpretation of tests. We provide examples with different data sources, with default and user‐defined parameters. Compared to other available tools and workflows, occTest offers a comprehensive suite of integrated tests, and allows multiple methods associated with each test to explore consensus among data cleaning methods. It uniquely incorporates both coordinate accuracy analysis and environmental analysis of occurrence records. Furthermore, it provides a hierarchical structure to incorporate future tests yet to be developed.Main conclusionsoccTest will help users understand the quality and quantity of data available before the start of data analysis, while also enabling users to filter data using either predefined rules or custom‐built rules. As a result, occTest can better assess each record's appropriateness for its intended application.

Funder

Agence Nationale de la Recherche

Danmarks Grundforskningsfond

Advanced Exploration Systems

Conseil régional du Grand Est

Villum Fonden

Publisher

Wiley

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3