file: 03_categorical_pipeline_column_transformer.ipynb
If I understand the OneHotEncoder()
documentation correctly, "if_binary"
drops a column only when there is two categories in a feature to one-hot encode.
Wouldn’t "first"
be even better to avoid colinearity?
(Well I tried in the document: scores obtains with crossed-validation are hardly (almost not at all) different.)