Generalized Linear Models outperform commonly used canonical analysis in estimating spatial structure of presence/absence data

Author:

Carlos-Júnior Lélis A.123,Creed Joel C.4,Marrs Rob2,Lewis Rob J.5,Moulton Timothy P.4,Feijó-Lima Rafael16,Spencer Matthew2

Affiliation:

1. Programa de Pós-Graduação em Ecologia e Evolução, Universidade do Estado do Rio do Janeiro, Rio de Janeiro, Brazil

2. School of Environmental Sciences, University of Liverpool, Liverpool, United Kingdom

3. Departamento de Biologia, Pontifícia Universidade Católica do Rio de Janeiro, Rio de Janeiro, Brazil

4. Departamento de Ecologia, Universidade do Estado do Rio de Janeiro, Rio de Janeiro, Brazil

5. Department of Forest Genetics and Biodiversity, Norwegian Institute of Bioeconomy Research, Bergen, Norway

6. Division of Biological Sciences, University of Montana, Missoula, MT, United States of America

Abstract

Background Ecological communities tend to be spatially structured due to environmental gradients and/or spatially contagious processes such as growth, dispersion and species interactions. Data transformation followed by usage of algorithms such as Redundancy Analysis (RDA) is a fairly common approach in studies searching for spatial structure in ecological communities, despite recent suggestions advocating the use of Generalized Linear Models (GLMs). Here, we compared the performance of GLMs and RDA in describing spatial structure in ecological community composition data. We simulated realistic presence/absence data typical of many β-diversity studies. For model selection we used standard methods commonly used in most studies involving RDA and GLMs. Methods We simulated communities with known spatial structure, based on three real spatial community presence/absence datasets (one terrestrial, one marine and one freshwater). We used spatial eigenvectors as explanatory variables. We varied the number of non-zero coefficients of the spatial variables, and the spatial scales with which these coefficients were associated and then compared the performance of GLMs and RDA frameworks to correctly retrieve the spatial patterns contained in the simulated communities. We used two different methods for model selection, Forward Selection (FW) for RDA and the Akaike Information Criterion (AIC) for GLMs. The performance of each method was assessed by scoring overall accuracy as the proportion of variables whose inclusion/exclusion status was correct, and by distinguishing which kind of error was observed for each method. We also assessed whether errors in variable selection could affect the interpretation of spatial structure. Results Overall GLM with AIC-based model selection (GLM/AIC) performed better than RDA/FW in selecting spatial explanatory variables, although under some simulations the methods performed similarly. In general, RDA/FW performed unpredictably, often retaining too many explanatory variables and selecting variables associated with incorrect spatial scales. The spatial scale of the pattern had a negligible effect on GLM/AIC performance but consistently affected RDA’s error rates under almost all scenarios. Conclusion We encourage the use of GLM/AIC for studies searching for spatial drivers of species presence/absence patterns, since this framework outperformed RDA/FW in situations most likely to be found in natural communities. It is likely that such recommendations might extend to other types of explanatory variables.

Funder

National Council for Technological and Scientific Development,

Science without Borders scholarship scheme

CNPq

Brazilian Coordination for the Improvement of Higher Education Personnel

CAPES- Ciências do Mar

EU’s Marie Skłodowska-Curie action

CAPES program Science Without Borders –project

Publisher

PeerJ

Subject

General Agricultural and Biological Sciences,General Biochemistry, Genetics and Molecular Biology,General Medicine,General Neuroscience

Reference53 articles.

1. Information theory and an extension of the maximum likelihood principle;Akaike,1973

2. Null hypothesis testing problems prevalence and an alternative;Anderson;The Journal of Wildlife Management,2000

3. When and how should biotic interactions be considered in models of species niches and distributions?;Anderson;Journal of Biogeography,2017

4. Disentangling good from bad practices in the selection of spatial or phylogenetic eigenvectors;Bauman;Ecography,2018

5. Optimizing the choice of a spatial weighting matrix in eigenvector-based methods;Bauman;Ecology,2018

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3