Caution ... and after

Hello,
I don’t understand this warning:

“Caution!
Be aware that we use train_test_split here for didactic purposes, to show the scikit-learn API.”

In “real life” what would you have used? because in the processing the result of this didactic purpose is really used!

1 Like

In really like, one should use cross-validation as previously demonstrated. Without cross-validation, you will not be aware of the uncertainty of the statistical performance of a model.

1 Like

the uncertainty of your estimation of the statistical performance of a model.

1 Like

Thank you, the link with the cross-validation had escaped me.

I am tagging this for v3 since this we should probably mention that cross-validation may be preferred

Addressed in Improve description in caution message by ArturoAmorQ · Pull Request #633 · INRIA/scikit-learn-mooc · GitHub