Abstract
AbstractSediment diatoms are widely used to track environmental histories of lakes and their watersheds, but merging datasets generated by different researchers for further large-scale studies is challenging because of taxonomic discrepancies caused by rapidly evolving diatom nomenclature and taxonomic concepts. We collated five datasets of lake sediment diatoms from the Northeastern USA using a harmonization process which included updating synonyms, tracking the identity of inconsistently identified taxa, and grouping those that could not be resolved taxonomically. Each harmonization step led to an increase in variation explained by environmental variables and a parallel reduction of variation attributable to taxonomic inconsistency. To maximize future use of the data and underlying specimens we provide the original and harmonized counts for 1327 core samples from 607 lakes, name translation schemes, sample metadata, specimen museum locations, and the Northeast Lakes Voucher Flora, which is a set of light microscope images grouped into 1154 morphological operational taxonomic units. Post-hoc harmonization enables data quality control when other approaches (e.g., upfront management of taxonomic consistency) are not possible.
Funder
U.S. Environmental Protection Agency
Publisher
Springer Science and Business Media LLC
Subject
Library and Information Sciences,Statistics, Probability and Uncertainty,Computer Science Applications,Education,Information Systems,Statistics and Probability
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献