Matt Stevens wrote:Hi all,
I'm attempting survival modeling for a whole suite of African passerine species using MARK (using retrap only data). Some of the species have few retraps, others many. My problem lies with the GOF tests, in that for species with few retraps I'm getting decent c-hats (i.e. <3 using Bootstrapping) but species with more data produce c-hats >4. That in itself doesn't concern me too much but if I run GOF tests using the median c approach, my c-hats very rarely resemble those obtained using the bootstrapping method. e.g. for one species, testing a very basic s(.)p(.) model I get c-hat of 7.02 using bootstrapping but 1.19 using median c!
I've generally been running the median c method using 1-3 as the min and max bounds.
I've RTFM many times now and assessed and re-assessed my data but still can't resolve this. Any thoughts would be very welcome.
Cheers,
Matt
1. don't bother with bootstrap. Use either RELEASE (if your general model is fully time-dependent) or the median c-hat (for that model and anything else).
2. you need only run the GOF on the most general model in the candidate set of approximating models.
3. the median c-hat estimates c on the assumption that lack of fit is due entirely to extra-binomial noise. If that isn't the case (e.g., if the model structure is inappropriate for your data), the estimated c-hat won't mean much.
4. most of the time, if c-hat is estimated >3, the problem is an interaction of (i) sparse data and/or (ii) structural problems (i.e., wrong general model). If (i), then you use a slightly less parameterized general model. If (ii), you need to think hard about what might be sources of variation in your data. Often, fitting a TSM model solves a multitude of problems (since even a single TSM class can help soak up some heterogeneity).
5. If you're stuck with sparse data, and can't figure out a possible structural problem, then there isn't much you can do - you'll be stuck fitting very simple models.
All of the preceding is in the GOF chapter.