Hello,
Thanks a lot for your content!
I have a question about preprocessing: does it THEORITICALLY improve model performance ?
If we take linear regression, before preprocessing and we assume that the X matrix is full ranked. Il tXX is invertible (and that the difference of scale doesn’t cause rounding issue into the calculus of inverse) I don’t see why preprocessing would improve the prediction.
Same with gradient descent: different scale will cause issues/delay convergence(because the gradient might over focus on one direction), but let’s assume we find a convergence, will it improve prediction ?
If answer is no, but you have exemple of other models that will suffer without preprocessing, I’m more than interested. I took regression as it’s a model where we look over Euclidian distance (so sensitive to scale difference)
Thanks a lot,