First model with scikit-learn _error

daiana · 18 May 2021 20:44

I have this error when "ValueError: could not convert string to float: ‘State-gov’
"
when I run the commands:
from sklearn.neighbors import KNeighborsClassifier

model = KNeighborsClassifier()
model.fit(data, target)

daiana · 18 May 2021 21:03

Ok I found it…

I created another data frame whit only num values:

df_num=df.select_dtypes(include=[“int64”])
df_num
Thanks

RockL · 19 May 2021 17:51

Hello Daiana,

I am having a similar problem in that when I try to run this code:

from sklearn.neighbors import KNeighborsClassifier

model = KNeighborsClassifier()
model.fit(data, target)

I get a ValueError saying “could not convert string to float: ’ Private’”

Where did you put in your solution code to make it work for your problem? I would really appreciate your thoughts!

Thanks!

Mirzon · 19 May 2021 18:07

The select_dtypes method does not affect the data in-place. Instead, it returns a new dataframe. In your code, data is still the original dataframe, without filtering, that’s why it still contains strings.

To solve this, you could either create a new dataframe, like @daiana did:

data_numeric = data.select_dtypes(include=[“int64”])

You will then be able to call fit like so:

model.fit(data_numeric, target)

You could also replace your existing dataframe (and the other columns will be lost):

data = data.select_dtypes(include=[“int64”])

And then keep your call to fit as-is.

RockL · 19 May 2021 18:07

Nevermind; it’s my fault! I was accessing the wrong dataset. Thanks for the advice!

daiana · 19 May 2021 18:13

Hi , you can look for all data set in

So no need for this bypass