New deep learning‐based methods for visualizing ecosystem properties using environmental DNA metabarcoding data

Author:

Lamperti Letizia123ORCID,Sanchez Théophile23,Si Moussi Sara4,Mouillot David56,Albouy Camille23,Flück Benjamin23ORCID,Bruno Morgane1,Valentini Alice7ORCID,Pellissier Loïc23ORCID,Manel Stéphanie16ORCID

Affiliation:

1. CEFE, Univ Montpellier, CNRS, EPHE‐PSL University, IRD Montpellier France

2. Ecosystems and Landscape Evolution Department of Environmental Systems Science, ETH Zürich Zürich Switzerland

3. Ecosystems and Landscape Evolution, Land Change Science Research Unit Swiss Federal Institute for Forest, Snow and Landscape Research (WSL) Switzerland

4. Laboratoire d'Ecologie Alpine Univ. Grenoble Alpes, Univ. Savoie MontBlanc, CNRS Grenoble France

5. MARBEC, Univ Montpellier, CNRS, IFREMER, IRD Montpellier France

6. Institut Universitaire de France Paris France

7. SPYGEN Le Bourget‐du‐Lac France

Abstract

AbstractEnvironmental DNA (eDNA) metabarcoding provides an efficient approach for documenting biodiversity patterns in marine and terrestrial ecosystems. The complexity of these data prevents current methods from extracting and analyzing all the relevant ecological information they contain, and new methods may provide better dimensionality reduction and clustering. Here we present two new deep learning‐based methods that combine different types of neural networks (NNs) to ordinate eDNA samples and visualize ecosystem properties in a two‐dimensional space: the first is based on variational autoencoders and the second on deep metric learning. The strength of our new methods lies in the combination of two inputs: the number of sequences found for each molecular operational taxonomic unit (MOTU) detected and their corresponding nucleotide sequence. Using three different datasets, we show that our methods accurately represent several biodiversity indicators in a two‐dimensional latent space: MOTU richness per sample, sequence α‐diversity per sample, Jaccard's and sequence β‐diversity between samples. We show that our nonlinear methods are better at extracting features from eDNA datasets while avoiding the major biases associated with eDNA. Our methods outperform traditional dimension reduction methods such as Principal Component Analysis, t‐distributed Stochastic Neighbour Embedding, Nonmetric Multidimensional Scaling and Uniform Manifold Approximation and Projection for dimension reduction. Our results suggest that NNs provide a more efficient way of extracting structure from eDNA metabarcoding data, thereby improving their ecological interpretation and thus biodiversity monitoring.

Publisher

Wiley

Subject

Genetics,Ecology, Evolution, Behavior and Systematics,Biotechnology

Reference73 articles.

1. TensorFlow: A system for Large‐Scale Machine Learning;Abadi M.;Proceedings of the 12th USENIX conference on operating systems design and implementation,2016

2. A National Scale “BioBlitz” Using Citizen Science and eDNA Metabarcoding for Monitoring Coastal Marine Fish

3. Visualizing population structure with variational autoencoders

4. Dimensionality reduction for visualizing single-cell data using UMAP

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3