if the sample ends up in the test set during splitting then the classifier would not have seen the category during training and will not be able to encode it.
Am I right to assume that based on the above sentence, the encoding of categorical features happens per fold during the cross-validation? If that’s the case, using the ‘ignore’ option in the categorical encoder means, “ignore the fold that contains the non encoded value?”. In this case, though, shall we increase the number of folds significantly to avoid skipping many folds?
Thanks a lot