You say:
"Instead of using only the numerical dataset (which was the variable data_numerical), use the entire dataset available in the variable data."
So, for the numerical features, I passed the numerical_features list that you created at the start of the quiz to the preprocessor:
preprocessor = ColumnTransformer(transformers=[
    ("cat-preprocessor", imputer_ordinal_transformer, categorical_columns),
    ("num-preprocessor", scaler_imputer_transformer, numerical_features),
])
But doing that, I only obtained a score of ~0.72, because the numerical_features list you defined contains only some of the numerical column names.
So I propose that you change the first sentence to:
"Instead of using only a part of the numerical dataset (which was the variable data_numerical), use the entire dataset available in the variable data."
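
To illustrate the difference, here is a minimal sketch of selecting every numerical column by dtype instead of relying on a hand-written list. The toy DataFrame, its column names, and the two pipeline definitions are my own assumptions (the quiz's actual definitions may differ); only the transformer names and the ColumnTransformer layout come from the snippet in this post.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer, make_column_selector
from sklearn.impute import SimpleImputer
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import OrdinalEncoder, StandardScaler

# Hypothetical toy frame standing in for the quiz's `data` variable.
data = pd.DataFrame({
    "LotArea": [8450.0, 9600.0, None, 9550.0],
    "YearBuilt": [2003, 1976, 2001, 1915],
    "Street": ["Pave", "Pave", "Grvl", "Pave"],
})

# Select columns by dtype so that no numerical column is left out,
# unlike a manually maintained list such as `numerical_features`.
numerical_columns = make_column_selector(dtype_include="number")(data)
categorical_columns = make_column_selector(dtype_exclude="number")(data)

# Assumed definitions: the post only names these transformers.
imputer_ordinal_transformer = make_pipeline(
    SimpleImputer(strategy="most_frequent"),
    OrdinalEncoder(handle_unknown="use_encoded_value", unknown_value=-1),
)
scaler_imputer_transformer = make_pipeline(
    StandardScaler(),
    SimpleImputer(strategy="mean"),
)

preprocessor = ColumnTransformer(transformers=[
    ("cat-preprocessor", imputer_ordinal_transformer, categorical_columns),
    ("num-preprocessor", scaler_imputer_transformer, numerical_columns),
])

print("numerical:", numerical_columns)
print("categorical:", categorical_columns)
```

With a selection like this, the "num-preprocessor" step receives all numerical columns of data, which is what the exercise statement seems to intend.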