Variable selection for Root Node

How does the decision tree pick the variable/feature for root node? In this example why did it select “Culmen Depth” instead of feature “Culmen Length”?

The Classification and Regression Trees (CART) algorithm constructs binary trees using the feature and threshold that yield the largest information gain at each node.

1 Like

Thank you very much for your reply.