Module 3 (Hyperparameter): Overview (clarify)

reshama · 20 June 2021 18:43

@glemaitre58 Let me know your thoughts on this proposed wording related to hyperparameters.

Current wording:

In the previous modules, we show how to create, train, predict, and even evaluate a predictive model. However, we did not change on the models’ parameters that can be given when creating an instance. Indeed, we used the default values chosen by scikit-learn.

These parameters are called hyperparameters : they are parameters used to control the learning process, for instance the parameter k of the k-nearest neighbors. They should not be confused with the other parameters that are inferred during the training process. These parameters define the model itself, for instance coef_ for the linear models.

Can we edit, for clarity:

In the previous modules, we showed how to create, train, predict, and even evaluate a predictive model. We used the models’ default parameters, as chosen by scikit-learn, when creating an instance of the model. For example, for k-nearest neighbors, we initially used this default parameter: 'kneighborsclassifier__n_neighbors': 5 before trying other model parameters.

These parameters are called hyperparameters: they are parameters used to control the learning process, for instance the parameter k of the k-nearest neighbors. They are choices about which numbers we pass to the model for fitting the function. They should not be confused with the other parameters that are inferred during the training process. These parameters define the model itself, for instance coef_ for the linear models.

glemaitre58 · 20 June 2021 20:33

I am confused about the statement. With “numbers”, do you specifically refer to the k parameter or to hyperparameter in general?

For the other changes, they will improve the narration and we should include them. Thanks.

reshama · 21 June 2021 00:27

How about,
instead of:
They are choices about which numbers we pass to the model for fitting the function.

Hyperparameters are specified by the user, often manually tuned, and cannot be estimated from the data.

reshama · 21 June 2021 00:29

If there’s a place for a reference, this article is helpful:
What is the Difference Between a Parameter and a Hyperparameter?

glemaitre58 · 21 June 2021 06:54

Yes this is really clear.

We can probably include the link in the glossary for the hyperparameter section.

glemaitre58 · 22 June 2021 08:57

I made the changes here: FIX improve wording and add link in glossary · INRIA/scikit-learn-mooc@acfa6d2 · GitHub

@lfarhi I don’t recall if the “module overview” is automatically synchronized from our markdown file in GitHub. Could you let me know?

lfarhi · 22 June 2021 09:42

@glemaitre58, the "module overview” pages are dynamically displayed from the jupyterbook pages.

glemaitre58 · 22 June 2021 09:48

OK cool. So we should not have anything else to do thanks.