Hi,
I found the results of question 4 puzzling.
If I’m not mistaken, the previous notebooks on random forests told us that the general idea is to have very deep trees that overfit, and then consider many of them to balance out such side effect.
Now, it seems to me that the results of question 4 “disprove” such general understanding. Indeed, the RF that have a max_depth=5
has a better generalization performance than the one with no limited depth.
Am I missing something here?
Thanks.