【发布时间】:2018-12-04 10:57:19
【问题描述】:
以下代码可用于将字符串转换为分类标签:
import pandas as pd
from sklearn.preprocessing import LabelEncoder
df = pd.DataFrame([['A','B','C','D','E','F','G','I','K','H'],
['A','E','H','F','G','I','K','','',''],
['A','C','I','F','H','G','','','','']],
columns=['A1', 'A2', 'A3','A4', 'A5', 'A6', 'A7', 'A8', 'A9', 'A10'])
pd.DataFrame(columns=df.columns, data=LabelEncoder().fit_transform(df.values.flatten()).reshape(df.shape))
A1 A2 A3 A4 A5 A6 A7 A8 A9 A10
0 1 2 3 4 5 6 7 9 10 8
1 1 5 8 6 7 9 10 0 0 0
2 1 3 9 6 8 7 0 0 0 0
问题:
如何查询映射(它们似乎按字母顺序排序)?
即像这样的列表:
A: 1
B: 2
C: 3
...
I: 9
K: 10
谢谢!
【问题讨论】:
标签: pandas encoding label multilabel-classification