How We Developed a Data Exchange Format: Lessons Learned from Camera Trap Data Package (Camtrap DP)

Author:

Desmet Peter, Bubnicki Jakub

Abstract

Camera trapping has revolutionized wildlife ecology and conservation by enabling automated data acquisition, leading to the accumulation of massive amounts of camera trap data worldwide (Steenweg et al. 2016, Kays et al. 2020, Delisle et al. 2021). Although management and processing of camera trap-derived big data are becoming increasingly solvable with the help of scalable infrastructures, harmonization and exchange of the data remain limited, hindering its full potential. We therefore developed a new data exchange format, Camera Trap Data Package (Camtrap DP), to facilitate the exchange, harmonization and archiving of camera trap data at local and global scales (Camtrap DP Development Team 2023). Camtrap DP was developed with two guiding principles: it should allow easy and interoperable data exchange, and it should be developed openly and collaboratively. Camtrap DP structures the data in a simple model consisting of three tables (Deployments, Media, and Observations), which supports a wide range of camera deployment designs, classification techniques and analytical use cases. To describe these tables and the accompanying metadata, we adopted the Frictionless Standards, a collection of open specifications developed by the Frictionless Data project (Fowler et al. 2018), which offer a standardized way to describe datasets, data files and tabular data. By doing so, we did not have to reinvent how to express generic properties such as licenses, contributors, file formats, field names, data types, required values, controlled values, and relationships between tables. We expanded upon those to describe the necessary properties related to camera trapping, relying on existing standards such as Darwin Core (Wieczorek et al. 2012), Audiovisual Core (Audiovisual Core Maintenance Group 2023) and DataCite Metadata Schema (DataCite Metadata Working Group 2021) where possible.
This approach is not only efficient, but also facilitates interoperability: since a Camtrap DP is in essence a Frictionless Data Package, existing software tools can be used to read and validate data. We developed Camtrap DP openly, collaboratively, and with version control from the start. It is licensed under the permissive MIT license, allowing anyone to use it. Suggestions for change were and continue to be discussed in a public issue tracker on GitHub. These are incorporated only after review and automated testing. Once a number of changes have been adopted, a new version of the standard is released using semantic versioning. This allows Camtrap DP to evolve over time, while making sure that software and datasets referring to older versions of the standard are still valid. Equally important to the success of a data exchange format is community-wide adoption, which requires trust and implementation by existing systems. From the start, we have involved researchers as well as maintainers of software tools and management platforms from the camera trapping community. Using an iterative approach, they tested and provided feedback on Camtrap DP to make sure it met their requirements. To aid their understanding of the format, we provided a website, an example dataset that is versioned with the format, and an R package to read, explore and visualize Camtrap DP datasets (Oldoni et al. 2023). Through open development and outreach, we also gained the support of trusted and well-recognized organizations such as Biodiversity Information Standards (TDWG) and the Global Biodiversity Information Facility (GBIF). As a result, Camtrap DP can now be used as a data publication format in the Integrated Publishing Toolkit (Robertson et al. 2014).
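The three-table model and the table relationships described in the abstract can be sketched as a Frictionless-style descriptor. The following is a minimal illustration only, using the Python standard library: the dataset name, the abbreviated field lists, the sample rows, and the helper functions `table` and `broken_references` are assumptions made for this example, not the Camtrap DP specification or any library's API.

```python
import json

def table(name, fields, primary_key, foreign_keys=()):
    """Build one tabular resource with a Table Schema (hypothetical helper)."""
    return {
        "name": name,
        "path": f"{name}.csv",
        "schema": {
            # All fields typed as strings to keep the sketch short.
            "fields": [{"name": f, "type": "string"} for f in fields],
            "primaryKey": primary_key,
            "foreignKeys": [
                {"fields": fk, "reference": {"resource": ref, "fields": fk}}
                for fk, ref in foreign_keys
            ],
        },
    }

# Illustrative descriptor: field lists are abbreviated assumptions.
descriptor = {
    "name": "example-camtrap-dataset",  # hypothetical dataset name
    "licenses": [{"name": "CC0-1.0"}],
    "resources": [
        table("deployments", ["deploymentID", "locationName"], "deploymentID"),
        table("media", ["mediaID", "deploymentID", "filePath"], "mediaID",
              [("deploymentID", "deployments")]),
        table("observations",
              ["observationID", "deploymentID", "mediaID", "scientificName"],
              "observationID",
              [("deploymentID", "deployments"), ("mediaID", "media")]),
    ],
}

# Made-up sample rows standing in for the CSV contents.
rows = {
    "deployments": [{"deploymentID": "dep1", "locationName": "forest edge"}],
    "media": [{"mediaID": "med1", "deploymentID": "dep1",
               "filePath": "IMG_0001.JPG"}],
    "observations": [{"observationID": "obs1", "deploymentID": "dep1",
                      "mediaID": "med1", "scientificName": "Vulpes vulpes"}],
}

def broken_references(descriptor, rows):
    """Return (table, field, value) for every foreign key value with no match."""
    problems = []
    for resource in descriptor["resources"]:
        for fk in resource["schema"]["foreignKeys"]:
            target = fk["reference"]["resource"]
            target_field = fk["reference"]["fields"]
            known = {r[target_field] for r in rows[target]}
            for r in rows[resource["name"]]:
                value = r.get(fk["fields"])
                # Empty values are treated as "no reference" and skipped.
                if value and value not in known:
                    problems.append((resource["name"], fk["fields"], value))
    return problems

descriptor_json = json.dumps(descriptor, indent=2)
print(broken_references(descriptor, rows))
```

Because licenses, field names, keys and relationships live in the descriptor rather than in custom code, a generic check like the one above works without any camera-trap-specific logic, which is the interoperability benefit the abstract attributes to building on the Frictionless Standards.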

Publisher

Pensoft Publishers

Subject

General Engineering
