Hello,
In the “Working with numerical data” notebook, it is stated that:
“However, the column "education-num" is different.”
But I can’t see why if I only look at this column’s values.
We’re asked to execute:
data["education-num"].value_counts().sort_index()
to see its specificity, but
data["hours-per-week"].value_counts().sort_index()
gives quite similar output.
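For reference, here is a minimal sketch of the comparison I mean. The small DataFrame is made up for illustration (it is not the real dataset, and the variable names edu_counts and hours_counts are mine), but it shows that both columns yield the same kind of output: integer values with their frequencies.

```python
import pandas as pd

# Toy sample with invented values; NOT the real census data.
data = pd.DataFrame({
    "education-num": [9, 10, 13, 9, 10, 14, 13, 9],
    "hours-per-week": [40, 40, 50, 20, 40, 60, 45, 40],
})

# Same call on both columns: count each distinct value, ordered by value.
edu_counts = data["education-num"].value_counts().sort_index()
hours_counts = data["hours-per-week"].value_counts().sort_index()

print(edu_counts)
print(hours_counts)
```

In both cases the result is just a list of integers with counts, so nothing in the output itself seems to mark one column as a category code and the other as a true quantity.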
Aren’t we setting “education-num” apart because of our understanding of its meaning, rather than because of anything we found by studying its values?
I haven’t read the next notebook yet; maybe the answer lies in it.
Thank you.