Hi, to compare a model against a baseline, we first create a cv strategy.
If we choose ShuffleSplit, it yields random train/test subsets. To train and evaluate the model and the baseline on exactly the same subsets, I think we must pass an integer random_state (as is done in the solution).
Is that correct?
Even though I didn’t pass random_state to the cv, I got results similar to the solution (probably because dummy classifiers/regressors give nearly the same scores whichever data subset they are trained on).
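
To check this for myself, here is a minimal sketch that compares the splits produced by two successive calls to split() on the same cv object, with and without a seed. The toy array X and the helper held_out_indices are my own inventions for illustration, not taken from the solution.

```python
import numpy as np
from sklearn.model_selection import ShuffleSplit

# Toy data (an assumption for illustration): 10 samples, 2 features.
X = np.arange(20).reshape(10, 2)


def held_out_indices(cv, X):
    """Return the test indices of every split as a list of tuples."""
    return [tuple(test) for _, test in cv.split(X)]


# With an integer random_state, each call to split() yields the same
# subsets, so a model and a baseline evaluated with this cv object
# see identical train/test partitions.
cv_seeded = ShuffleSplit(n_splits=5, test_size=0.3, random_state=0)
same_with_seed = held_out_indices(cv_seeded, X) == held_out_indices(cv_seeded, X)

# With random_state=None (the default), each call to split() draws
# fresh random subsets, so the two evaluations would generally differ.
cv_unseeded = ShuffleSplit(n_splits=5, test_size=0.3)
same_without_seed = held_out_indices(cv_unseeded, X) == held_out_indices(cv_unseeded, X)

print("same splits with seed:   ", same_with_seed)
print("same splits without seed:", same_without_seed)
```

If I understand correctly, this is what happens when the same cv object is passed to cross_validate once for the model and once for the baseline: each call invokes split() again, so only the seeded version guarantees identical subsets.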