【发布时间】:2017-08-20 06:04:25
【问题描述】:
给定一个多索引 DataFrame:
col_A col_B
level_0 level_1
A x 1.0 NaN
y NaN 1.0
x NaN 2.0
y 2.0 NaN
如何从 df 中删除 NaN 并从多索引中删除重复项以获取:
col_A col_B
level_0 level_1
A x 1.0 2.0
y 2.0 1.0
这是 MWE:
import pandas as pd
import numpy as np
index = pd.MultiIndex.from_product([['A', 'A'],
['x', 'y']],
names=['level_0',
'level_1'])
data =[
[1, np.NaN],
[np.NaN, 1],
[np.NaN,2],
[2, np.NaN],
]
df = pd.DataFrame(data=data, index=index, columns=['col_A', 'col_B'])
print df
【问题讨论】: