I have been using simulated annealing when a model doesn't converge properly using the default optimization method. Then I compare the results of the models run with simulated annealing to the results of the other models run with the default optimization method.
Is there any reason why it would be necessary to use simulated annealing on all models in the data set, or is OK to compare models in the data set optimized with the two different methods?
Thank you.