In the lesson “speeding-up gradient_boosting”, you claim that using the KBinsDiscretizer in the pipeline before the GradientBoostingRegressor drastically reduced the fit time.
But:
- without the discretizer, the fit time is 6.727 seconds on your server and 5.870 seconds on my computer
- with the discretizer, the fit time is 4.273 seconds on your server and 3.667 seconds on my computer
So the improvement is only about 2 to 2.5 seconds, roughly 36 to 38%. I would not call that a drastic reduction.
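To double-check the arithmetic, here is a minimal sketch in plain Python that works out the saving from the four fit times quoted above. The names `fit_times` and `summarize` are my own, chosen for illustration; they do not come from the lesson.

```python
# Fit times in seconds, copied from the measurements quoted above.
fit_times = {
    "your server": {"without": 6.727, "with": 4.273},
    "my computer": {"without": 5.870, "with": 3.667},
}


def summarize(without, with_discretizer):
    """Return (seconds saved, relative reduction, speed-up factor)."""
    saved = without - with_discretizer
    reduction = saved / without           # fraction of the original fit time
    speedup = without / with_discretizer  # how many times faster
    return saved, reduction, speedup


for machine, t in fit_times.items():
    saved, reduction, speedup = summarize(t["without"], t["with"])
    print(
        f"{machine}: saved {saved:.3f} s, "
        f"reduction {reduction:.1%}, speed-up {speedup:.2f}x"
    )
```

On both machines this comes out to a reduction of a bit more than a third, i.e. a speed-up of about 1.6x.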
If you can confirm the fit times I obtained, and this is not some strange bug, I think you should avoid such a strong word and simply say that the discretizer reduced the fit time. It’s less sexy but more accurate.
PS: for me, the truly drastic improvement comes from using the HistGradientBoostingRegressor at the end of the lesson.