Exercise data set location?

Hello all,

Sorry for a dumb question but I am having difficulty finding the data set for the exercise in the Tabular data exploration module. Would it be possible to get some help finding it?

Thank you!

Hi, this is mentioned in cell #2 of the notebook:

Loading the adult census dataset

We will use data from the 1994 US census that we downloaded from OpenML.

You can look at the OpenML webpage to learn more about this dataset: http://www.openml.org/d/1590

Hello Thomas,

Thank you for your reply! I should have been more specific; I am having difficulty finding the “penguins_classification.csv” data set for exercise M1.01. Could I get some help locating that file?

Thank you again!

1 Like

Well, you can find all the data files in the GitHub Repo:

1 Like

Hello Thomas,

That was exactly what I needed! Thank you so much for your help!

please how do I call the dataset into my book for analysis? thanks

Hey,
So if you want to use the data set on your computer in Jupyter Notebook, you can do 2 things:

  1. Check if the data set is in the same directory in which you are presently working and call the pd.read_csv() function stating the file name.

  2. If the data set is saved in any other directory, then you will have to call the entire directory name where the data set is stored inside the read_csv function.