Biodiversity Observations Miner: A web application to unlock primary biodiversity data from published literature-Reference-Cited by-同舟云学术

Biodiversity Observations Miner: A web application to unlock primary biodiversity data from published literature

Published:2019-01-16 Issue: Volume:7 Page:
ISSN:1314-2828
Container-title:Biodiversity Data Journal
language:
Short-container-title:BDJ

Author:

Muñoz Gabriel,Kissling W. Daniel^ORCID,van Loon E. Emiel

Abstract

A considerable portion of primary biodiversity data is digitally locked inside published literature which is often stored as pdf files. Large-scale approaches to biodiversity science could benefit from retrieving this information and making it digitally accessible and machine-readable. Nonetheless, the amount and diversity of digitally published literature pose many challenges for knowledge discovery and retrieval. Text mining has been extensively used for data discovery tasks in large quantities of documents. However, text mining approaches for knowledge discovery and retrieval have been limited in biodiversity science compared to other disciplines. Here, we present a novel, open source text mining tool, the Biodiversity Observations Miner (BOM). This web application, written in R, allows the semi-automated discovery of punctual biodiversity observations (e.g. biotic interactions, functional or behavioural traits and natural history descriptions) associated with the scientific names present inside a corpus of scientific literature. Furthermore, BOM enable users the rapid screening of large quantities of literature based on word co-occurrences that match custom biodiversity dictionaries. This tool aims to increase the digital mobilisation of primary biodiversity data and is freely accessible via GitHub or through a web server.

Publisher

Pensoft Publishers

Subject

Ecology,Ecology, Evolution, Behavior and Systematics

Link

https://bdj.pensoft.net/article/28737/download/pdf/

Reference37 articles.

1. taxize: taxonomic search and retrieval in R

2. shiny:Web Application Framework for R.;Chang,2017

3. shiny dashboard: Create Dashboards with Shiny.;Chang,2018

4. A handbook of protocols for standardised and easy measurement of plant functional traits worldwide

5. The Global Biodiversity Information Facility : An International Network of Interoperabel Biodiversity Databases

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Automating the Curation Process of Historical Literature on Marine Biodiversity Using Text Mining: The DECO Workflow;Frontiers in Marine Science;2022-07-22

2. Text as Data in Environmental Economics and Policy;Review of Environmental Economics and Policy;2022-06-01

3. Past and future uses of text mining in ecology and evolution;Proceedings of the Royal Society B: Biological Sciences;2022-05-18

4. TaxoNERD: Deep neural models for the recognition of taxonomic entities in the ecological and evolutionary literature;Methods in Ecology and Evolution;2021-12-14

5. TaxoNERD: deep neural models for the recognition of taxonomic entities in the ecological and evolutionary literature;2021-06-09