One of the new features of Program Mark is the tool for Data Cloning which is a numerical approach for determining if parameters in a model are identifiable or not . For example, CJS models can have nonidentifiable parameters in models with time-dependence in apparent survival (phi) and the encounter rate (p). In a model such as phi(t), p(t) it is not possible to decompose the product of phi and p for the last transition/occasion without additional information to estimate p for the last occasion (pages 3-13, 4-71 of TFM).
The Data Cloning tool can be accessed in Mark through Output | Specific Model Output | Data Cloning. The tool can be applied to any model in the candidate set and the default option for cloning the encounter histories is by a factor of 100.
A couple of observations after working with the tool. First, the useful diagnostic for whether a parameter is estimable is the SE ratio between the SE of the original model vs. the data cloned model. If a factor of 100 is used, the SE ratio for estimable parameters should be ~10, presumably because SE = SD / sqrt(N)?. Nonestimable parameters should have an SE ratio other than 10. For example, in the dipper example with phi(g*t), p(g*t) with the sin link, there is a DIV/0 error message for the nonestimable parameters. Second, the SE ratio is sensitive to which link function is used. For example, the dipper example has a year near the start of the time series where an estimate of p for males is close to the boundary of one. If the model phi(g*t), p(g*t) is run with the sin vs. the logit link, the parameter at the boundary is successfully estimated and tallied with the sin link but not the logit link. Running the data cloning tool on the two models gives different results, an SE ratio of zero under the sin link, and 9.3 under the logit link. Last, one of my students tried the data cloning tool with a closed population model where abundance (N) was included as a estimated parameter. For probabilities like phi and p, data cloning affects the SE ratio but not the point estimates. For N, because data cloning increases the number of encounter histories for marked individuals, the point estimate of N will increase and not just the SE ratio. A caution for the data cloning tool is perhaps it should only be used for parameters that are probabilities bounded 0-1.
Brett K. Sandercock