how to use one hot encoding in python
# Basic syntax:
df_onehot = pd.get_dummies(df, columns=['col_name'], prefix=['one_hot'])
# Where:
# - get_dummies creates a one-hot encoding for each unique categorical
# value in the column named col_name
# - The prefix is added at the beginning of each categorical value
# to create new column names for the one-hot columns
# Example usage:
# Build example dataframe:
df = pd.DataFrame(['sunny', 'rainy', 'cloudy'], columns=['weather'])
print(df)
weather
0 sunny
1 rainy
2 cloudy
# Convert categorical weather variable to one-hot encoding:
df_onehot = pd.get_dummies(df, columns=['weather'], prefix=['one_hot'])
print(df_onehot)
one_hot_cloudy one_hot_rainy one_hot_sunny
0 0 0 1
1 0 1 0
2 1 0 0
4
10
from numpy import as np
from sklearn.preprocessing import LabelEncoder
from sklearn.preprocessing import OneHotEncoder
# define example
data = ['cold', 'cold', 'warm', 'cold', 'hot',
'hot', 'warm', 'cold', 'warm', 'hot']
values = np.array(data)
# first apply label encoding
label_encoder = LabelEncoder()
integer_encoded = label_encoder.fit_transform(values)
# now we can apply one hot encoding
onehot_encoder = OneHotEncoder(sparse=False)
integer_encoded = integer_encoded.reshape(len(integer_encoded), 1)
onehot_encoded = onehot_encoder.fit_transform(integer_encoded)
print(onehot_encoded)
Thank you!
10
0
Are there any code examples left?
New code examples in category Python
-
Python 2023-04-11 03:04:20
-
Python 2022-03-27 22:40:04 pycharm no module named
-
Python 2022-03-27 22:25:05 assign multiple variablesin one line
-
Python 2022-03-27 22:20:02 levenshtein distance
-
Python 2022-03-27 21:35:09 get text from url python last slash
-
Python 2022-03-27 21:30:30 df concatenate df
-
Python 2022-03-27 21:25:09 python odd or even
-
Python 2022-03-27 21:15:32 python include function from another file
-
Python 2022-03-27 21:10:01 color module python
-
Python 2022-03-27 21:00:27 python tkinter cursor types