hi,
when preprocessing data, and in the context of the current notebook (i.e. scaling data), is it not necessary to preprocess also test data?
If yes, is it right to assume that .transform(X_test)
transforms features of X-test
based on mean and variance learned for each feature in .fit(X_train)
?
Also, and assuming that train_test_split()
would be applied with a 50-50 distribution (unrealistic, I admit) between X_train
and X_test
: would applying parameters learned from X_train
affect transformation of X_test
?
thx in advance for your clarification!