【发布时间】:2020-03-06 00:11:02
【问题描述】:
我有一个带有两个 ID、一个计数和一个平均值的 pandas 数据框。如何按两个 id 分组并获得加权平均值,以便得到以下数据集:
id1 id2 count average
Person A class 1 200 0.2
Person A class 1 400 0.4
Person B class 2 800 0.6
Person C class 2 200 0.4
Person B class 3 800 0.6
Person A class 4 400 0.2
Person B class 2 100 0.5
获得以下结果(以任何行顺序):
id1 id2 count average
Person A class 1 600 0.33
Person B class 2 900 0.59
Person C class 2 200 0.4
Person B class 3 800 0.6
Person A class 4 400 0.2
供参考:
pd.DataFrame({"id1" : ["Person A","Person A","Person B","Person C","Person B","Person A","Person B"],
"id2" : ["class 1","class 1","class 2","class 2","class 3","class 4","class 2"],
"count" : [200, 400, 800, 200, 800, 400, 100],
"average" : [0.2, 0.4, 0.6, 0.4, 0.6, 0.2, 0.5]})
【问题讨论】: