To my knowledge, MARK provides a few tools to handle estimability problems, two MARK functions: (i) data cloning (Appendix F) and (ii) simulation annealing (Ch. 10 ; 10.4.3 sidebar), and more than this, they suggest some good practices to be followed when analyzing CMR data.
Estimability problems can arise in several situations: (a) particular model structure (i.e. confounded terminal parameters in CJS), (b) boundary estimates (near to zero or to 1), (c) multiple optima in the likelihood function and (d) data sparseness.
These "good practices" are summarized in Ch.10 (10.4.3 ; 10-38) where it is said that:
1) In some situations (particularly in Multi-state models), it is recommended to start with a simpler model and use the constant parameter estimate as parameter starting point, this can help when there are numerical estimation problems, in particular, to deal with multiple optima in the likelihood function.
2) In some cases it is efficient to use other link function, for instance whit boundary estimates sin link can be more efficient than logit link, even though sin link is not so good to estimate CI when parameters' values are near to the boundary and, also, you cannot use sin link in several situations (other than the manual there are quite a few posts of Gary White on this like: http://www.phidot.org/forum/viewtopic.php?f=1&t=432&p=985&hilit=+boundary#p985).
My analysis is a CJS with an individual covariate (weight), 9 occasions and two groups (males and females) of rabbits in an enclosure. This is a summary of my encounter history:
My best model seems to be {Phi(weight*t) P(sex*t)} (see http://www.phidot.org/forum/viewtopic.php?f=1&t=1966 for some details). I have some trouble with Phi4 and Phi5 as shown below.
One possibility is that there are multiple optima in the Likelihood and this might be tested by using MCMC when running the model, this is explained in the Ch.10 and a further appendix should be soon available on this. To my shame I haven't understood how to create histogram plots like those seen in the GI (10-42 -> 10-44), anyway I have tried to set different starting point when running the model and it didn't the trick in my case (no changes in the results).
I have tried to use other link functions but it didn't the trick either. Regarding to this, you cannot use every link for every analysis, as stated in another post from Gary White, "...The way to figure out if you screwed up is to run the model with both the logit and then sin links, and if the estimates and deviance differ, you probably should not have been running the sin link...". In this case I believe I couldn't use the sin link as I have an individual covariate and thus there is not a monotonic relationship, am I wrong? (the above rule of thumb tells me I should not use sin link in this case)
I have tried to use data cloning (number of replicates=100) and I get this:
This should indicate that parameters 4 and 5 are not estimable (but terminal parameters seem identifiable, more on this in http://www.phidot.org/forum/viewtopic.php?f=34&t=1482) because of the SE ratio (it should be about =10 in this case).
Given that they are not structurally unidentifiable parameters I have tried to calculate Profile likelihoods and after to data cloning on this as suggested in Appendix F. I got this:
Parameter 4 now has a (very small) CI and data cloning on that has produced an even shorter CI. I am not sure about what this result could indicate but it could suggest this parameter is identifiable and very close to the boundary.
After this, considering that parameter 4 would be solved, there would be parameter 5 still unestimated.
I have tried to use the simulated annealing by checking the box "Use Alt. Opt. Method" and this is the result:
While parameter 4 estimate has not profited from this (it would make sense given that its problem should arise from being too close to the boundary and not from multiple optima in the LI), parameter 5 has been estimated diverse from 1 but its CI is so broad that it makes it useless.
Finally, I have tried to force the model to have a constant P ({Phi(peso*t) P(.)}), in order to see what happens whit these estimates and this is what I got:
Now the estimate for the parameter 5 seems to be realistic (also by having a look at the reduced m-array shown above), parameter 4 is useless due to its huge CI as could be expected if its difficult estimation is due to closure to the boundary.
Anyway, this model has a DeltaAIC more than 30 with respect to the "best" model {Phi(weight*t) P(sex*t)}, this is not the way I have been told to make the analysis (firstly model the recapture and after the survival) and, last but not least, I haven't found this as a documented "good practice" in the forum or in GI.
Any suggestion or commentary would be really appreciated.
Simone