Dear Gary and all who may be concerned,
Suppose we have a 70-day nesting period over which nest data are collected and a passerine with 15 days egg period and 11 days nestling period. My intent was to see if there are stage-specific differences in DSR. First, I analyzed stage specific DSR with the Mayfield estimator and tested the difference between stages by the method developed by Johnson (1979). A highly significant difference appeared, DSR being much higher at the nestling stage. Obviously, by applying a corresponding model in MARK (B0 + B1Stage) I expect to find similar results if everything is OK.
As we discussed it previously, I have to add individual covariates for stage of each nest for each day of the 70-day nesting period (named stageDay1….stageDay70). In my example, as I have asked in a previous posting, I’ve coded age as Gary advised me with “0”s for egg stage and “1”s for nestling period for each of the 70 days. I applied 2 approaches to code the days before a nest’s individual history entered the 70-day period (PRE-days) and the days after it was no more active in the 70-day period (POST-days):
Coding 1) PRE-days filled with “0”s and POST-days filled with “1s” (if there were any nestling days before the nest became inactive due to either fledging or failure).
Coding 2) PRE-days filled with “-1”s and POST-days filled with “-1”s. In the chapter of Nest survival of the MARK book Dr. J. Rotella explains that a nest does not enter the likelihood until the value of a covariate is “0”. This second coding seems to me better and more natural.
Please, comment on how each of these two types of coding would affect estimates. Which one is theoretically better? I noted that both types of codings yielded similar estimates of stage specific DSR but the second one resulted in DSR values closer to the estimates derived from the Mayfield estimator.
Second, NO MATTER THE CODING ABOVE, we end up with 69 estimates for survival for a 70-day period (day 1=the first observation day with any nest content of the earliest nest), so we have to leave out one of the potential 70 stage covariates, when we fill the design matrix. The question is which one – the first one or the last one? I have tried both ways with my dataset and ended up with very much different results when I filled the design matrix this way:
B1 B2
1 stageDay2 1:S
1 stageDay3 2:S
1 stageDay4 3:S
.
.
.
1 stageDay70 69:S
AND this way:
B1 B2
1 stageDay1 1:S
1 stageDay2 2:S
1 stageDay3 3:S
.
.
.
1 stageDay69 69:S
With the first design matrix estimates of DSR for each stage were very nice, i.e. very close to the ones derived from the Mayfield estimator and a highly significant effect of stage on DSR as should be. With the second design matrix, I got a nonsense (survival decreased with age and no significant effect of stage, CI included 0).
The first coding is intuitive to me as day 1 of the 70-day nesting season is the starting point so we do not have survival parameter at this day, yet why the results are so dramatically different?
Thanks,
Anton