Formulation of Questions of M2.03

Eolindel · 20 June 2021 20:38

Hi,
For me there is a reversal in causality expressed in the questions of quizz M2.03 :

Instead of
«Fitting a model with a high bias:
a) causes underfitting?
b) causes overfitting? »

It should be reversed :
«Fitting a model with a high bias:
a) is caused by underfitting?
b) is caused by overfitting? »

(same for question 2)

ogrisel · 22 June 2021 09:11

Maybe we should not use the term “cause” in this question. What about:

Fitting a model with a high bias:
a) is a case of underfitting?
b) is a case of overfitting?

or alternatively if we want to keep the “cause”:

Fitting a model with a high bias:
a) causes an underfitted model?
b) causes an overfitted model?

The terms “underfitted” / “overfitted” feel grammatically wrong but they appear in Wikipedia:

lesteve · 22 June 2021 10:04

I’d rather use “is a case of underfitting” (or “corresponds to underfitting”). “causes” feels weird in either direction …

GaelVaroquaux · 22 June 2021 11:19

I think that Olivier’s suggestion is good.

However, to me, one of the problems is that the bias can be either at fit time, or at prediction time. It is legit to talk about biased predictions.

Still, Olivier’s second suggestion seems good to me.

Eolindel · 23 June 2021 06:37

The first solution sounds a little bit better to a novice like me. But the second one is also working.

glemaitre58 · 23 June 2021 07:35

@lfarhi @MarieCollin Could you update FUN.

glemaitre58 · 23 June 2021 07:40

I made the changes and we still need to modify in FUN. Thanks @Eolindel for the feedback.

lfarhi · 23 June 2021 10:11

This is also corrected in FUN now