Recommendations for manipulating data/dataframes

Hello,

Each time I have to manually manipulate data in a dataframe, it is a pain … :frowning:

In the last section, it was sometimes useful to use NumPy (such as np.ravel).
In this section, you are using np.concatenate.

I tried to use pd.concat, but it does not behave the same way at all.

I finally used df['newcol'] = df['oldcol'] to create new features.

I guess this is not efficient at all :slight_smile:

Do you have any tricks, recommendations, or how-tos for data manipulation with pandas and NumPy? :slight_smile:
Thanks so much

Indeed, we did not want to have to explain index alignment in pandas, so we went directly with NumPy.

Concatenation in pandas requires paying attention to the column or index labels, because pandas aligns the data on those labels rather than on position.
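To illustrate the difference, here is a minimal sketch. The dataframes `left` and `right` and their column names are made up for this example and are not taken from the course material. It shows that np.concatenate stacks values purely by position, while pd.concat matches rows on the index labels and fills the gaps with NaN.

```python
import numpy as np
import pandas as pd

# Two hypothetical dataframes whose indices only partly overlap.
left = pd.DataFrame({"a": [1, 2, 3]}, index=[0, 1, 2])
right = pd.DataFrame({"b": [10, 20, 30]}, index=[2, 3, 4])

# NumPy ignores labels: rows are glued together by position.
stacked = np.concatenate([left.to_numpy(), right.to_numpy()], axis=1)
print(stacked.shape)

# pandas aligns on the index: the result covers the union of the labels,
# and any label missing on one side gets NaN.
aligned = pd.concat([left, right], axis=1)
print(aligned)

# To get the positional behaviour with pandas, drop the old index first.
positional = pd.concat(
    [left.reset_index(drop=True), right.reset_index(drop=True)],
    axis=1,
)
print(positional)
```

If the rows of both dataframes are known to be in the same order, resetting the index as in the last step is one way to get the NumPy-like result while keeping the column names.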

There are a couple of tutorials available via the pandas website: