【Posted】:2021-12-22 10:28:48
【Question】:
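I am building a one-hot encoding function for a pandas DataFrame, but I can't figure out how to get the encoded data back into the DataFrame. I get:
"IndexError: only integers, slices (`:`), ellipsis (`...`), numpy.newaxis (`None`) and integer or boolean arrays are valid indices"
How can I integrate the result back into the pandas DataFrame?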
def one_hot_encoder(features, df_to_encode):
    """One-hot encode the given columns.

    Parameters:
        features (list): features to encode
        df_to_encode (pandas dataframe): dataframe to encode

    Returns:
        dataframe: encoded dataframe
    """
    from sklearn.preprocessing import OneHotEncoder
    for column in features:
        # one hot encoder
        enc = OneHotEncoder(sparse=False)
        column_norm = column + "_encoded"
        df = enc.fit_transform(df_to_encode[[column]])
    return df

columns_to_one_hot_encode = ["type"]
df = one_hot_encoder(columns_to_one_hot_encode, df)
【Comments】:
Tags: pandas dataframe scikit-learn
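A likely cause of the error is that `fit_transform` returns a NumPy array (or a SciPy sparse matrix), not a DataFrame, so a later lookup by column name on the returned value fails. Below is a minimal sketch of one way to put the encoded columns back into a DataFrame. It assumes the original column should be dropped and the new columns named `<column>_<category>`. The sample data and the `value` column are hypothetical; only the `type` column name comes from the question.

```python
import pandas as pd
from sklearn.preprocessing import OneHotEncoder


def one_hot_encoder(features, df_to_encode):
    """Return a copy of df_to_encode with each column in features one-hot encoded."""
    result = df_to_encode.copy()
    for column in features:
        # The default encoder output is a SciPy sparse matrix; convert it to a dense array.
        enc = OneHotEncoder()
        encoded = enc.fit_transform(result[[column]]).toarray()
        # Name the new columns after the categories the encoder learned.
        names = [f"{column}_{category}" for category in enc.categories_[0]]
        # Reuse the original index so the new columns align row by row.
        encoded_df = pd.DataFrame(encoded, columns=names, index=result.index)
        # Replace the original column with its encoded columns.
        result = pd.concat([result.drop(columns=[column]), encoded_df], axis=1)
    return result


# Hypothetical sample data for illustration.
df = pd.DataFrame({"type": ["a", "b", "a"], "value": [1, 2, 3]})
df = one_hot_encoder(["type"], df)
print(df)
```

If the fitted encoder is not needed later (for example, to transform new data with the same categories), `pd.get_dummies(df, columns=["type"])` may be a simpler alternative.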