A comparison of models for interval-censored plant cover data, with applications to monitoring schemes

Centre for Ecology & Hydrology Wallingford, Crowmarsh Gifford, United Kingdom
Centre for Ecology & Hydrology Lancaster, Lancaster, United Kingdom
Botanical Society of Britain & Ireland, Harrogate, United Kingdom
DOI
10.7287/peerj.preprints.2532v1
Subject Areas
Biodiversity, Conservation Biology, Ecology, Plant Science, Statistics
Keywords
abundance data, Hierarchical Bayesian models, sample size, generalized linear mixed model, random effects, citizen science, long-term monitoring, variance, participatory monitoring, time series
Copyright
© 2016 Pescott et al.
Licence
This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Preprints) and either DOI or URL of the article must be cited.
Cite this article
Pescott OL, Jitlal M, Smart SM, Walker KJ, Roy DB, Freeman SN. 2016. A comparison of models for interval-censored plant cover data, with applications to monitoring schemes. PeerJ Preprints 4:e2532v1

Abstract

Plant cover data collected by monitoring schemes are often expressed on interval-censored scales to reduce field effort. Existing statistical approaches to such data may not make full use of available information, or may both induce bias and assume more precision than may be warranted, e.g. by analysing mid-points and disregarding the spread of observations within a class.

We compare four approaches to modelling such data: two established methods (the proportional odds model and generalised linear mixed models) and two novel methods that explicitly accommodate the interval-censored nature of much data on plant cover. Of the latter, the first is a maximum likelihood (ML) approach that incorporates knowledge of the metric interval in which each datum lies. The second uses a Bayesian approach to incorporate interval-censoring and random effects to account for variation in annual changes between sites. All four methods are compared using data simulated with parameter values derived from analysis of a long-term monitoring dataset.

We demonstrate that model choice can influence the quality of statistical inference, particularly between models that make simplifications for convenience of fitting, and those which combine realistic distributional assumptions with accommodation of imprecise observations. A comparison of three of the methods demonstrated that all provide good accuracy and increasing precision over time. A comparison of power across the three frequentist approaches showed higher power for the novel ML approach. This is likely to be due to this non-hierarchical method underestimating residual variance. The Bayesian model is not directly comparable, but the measure of belief in a negative trend considered here was generally high, providing gradual increases in the believability of a decline with increasing time, number of sites, initial abundance, and larger effect sizes.

Our results suggest that the use of hierarchical models for plant monitoring schemes, conveniently applied in a Bayesian context, will help to bring greater realism and sensitivity to assessments of population change, and allow the use of more of the underlying information contained within cover data. Interval-censored methods will also allow for the integration of long-term plant datasets collected according to different cover scales, as well as presence/absence data.

Author Comment

Version 1 is in review at Ecology & Evolution.

Supplemental Information

R and JAGS code for Models 3 and 4 respectively

DOI: 10.7287/peerj.preprints.2532v1/supp-1