I work with frogs and wetlands and have a large data set for only 12 sites (sampling effort: 181 days, 15 species). Even when collapsing the sampling effort to 20 weeks (which is more biologically relevant), I have a sparse data set...so lots of 0's relative to 1's. As a result, modelling in PRESENCE didn't work (all estimates for individual species had std. errors of zero and CI's of 1). I then moved to generalized linear modelling because I have continuous and categorical site covariates. Results make sense, but I found it interesting that most (if not all) of the covariates in the top models for the GLZ (which also are significant) were the same in top models with PRESENCE.
It is possible to still use the PRESENCE results to say which covariates could matter regarding occupancy and detection since the GLZ cannot separate them?
I have RTFM but not found much guidance on sparse data. Does anyone have any thoughts? I appreciate your time in reading this.
Jackie Guzy