I have tried to understand how this works, but i can’t understand it. In the exercise notebook you used density=True
to plot the frequency distribution so each histogram cluster sums to 1. We see on the y axis a range from 0 to 1. When we use the same argument in the test, it doesn’t seem to produce the same result in the plot, as we on the y axis we have a range from 0 to 800.
Can you also explain in the detail how the ‘stratified’ strategy of the DummyClassifier
works? Is it like for every observation we have 0.75 to predict the most frequent value? Thanks a lot