Dear all,
I run two identical simple robust design models (from the same dataset) for which all parameters (S, gam”, gam’, p, c, N) were constrained to be a function of sex.
For one of these models, I created an additional factor variable in the data (cohort variable). Therefore, one had two groups (male and female) and the other had 4 groups (i.e sex and two cohorts). When I run the two models (sex-varying parameters), I obtained two huge different deviances : 3017.302 and 3970.949 respectively.
Why is there a difference in the deviance when additional groups were defined, as these are not involved in parameter specifications ?
If I understand well, the Mt+1 matrix and -2logL(saturated) were changed due to the two additional groups, so the deviance of the current model was changed too because it is defined as the difference in model likelihood of the current model and the saturated model, isn’t it ?
But what are the consequences for model selection then ? Should I predefined all my groups at the first step of the model selection to be able to compare them ?
For example, I begin the model selection by constraining all parameters by time, sex and cohort effects. Then, from my best model, I want to constrain the survival parameter by an additional age effect. If I define the age variable at this step (so a first selection has been done before the creation of age variable), I cannot compare this model to the previous models selection because the saturated model is not the same anymore, right ? If the age variable is created in the beginning, the saturated model is always the same (age models or not). Is it correct ?
Thank you for your answer,
C. Le Coeur