【Posted】: 2016-05-26 00:45:43
【Question】:
I have a DataFrame indexed by date:
transactions_ind
Out[25]:
                   Ticker     Transaction  Number_of_units      Price
Date
2012-10-11  ROG VX Equity             Buy            12000  182.00000
2012-10-16  ROG VX Equity            Sell            -5000  184.70000
2012-11-16  ROG VX Equity            Sell            -5000  175.51580
2012-12-07  ROG VX Equity             Buy             5000  184.90000
2012-12-11  ROG VX Equity            Sell            -3000  188.50000
2012-12-11  ROG VX Equity  Reversal: Sell             3000  188.50000
2012-12-11  ROG VX Equity            Sell            -3000  188.50000
2012-12-11  ROG VX Equity  Reversal: Sell             3000  188.50000
2012-12-11  ROG VX Equity            Sell            -3000  188.50000
2012-12-20  ROG VX Equity            Sell            -5000  185.80000
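I want to sum the rows that share a duplicated index value (2012-12-11), but only for the "Number_of_units" column, so that the result looks like this: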
transactions_ind
Out[25]:
                   Ticker  Transaction  Number_of_units      Price
Date
2012-10-11  ROG VX Equity          Buy            12000  182.00000
2012-10-16  ROG VX Equity         Sell            -5000  184.70000
2012-11-16  ROG VX Equity         Sell            -5000  175.51580
2012-12-07  ROG VX Equity          Buy             5000  184.90000
2012-12-11  ROG VX Equity         Sell            -3000  188.50000
2012-12-20  ROG VX Equity         Sell            -5000  185.80000
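Using
transactions_ind.groupby(transactions_ind.index).sum()
drops the "Ticker" and "Transaction" columns, because they hold non-numeric values. Also, when summing the "Number_of_units" column, I would like to know how to handle the different strings in the "Transaction" column. Hopefully there is a one-liner for this in pandas. Thanks for your help!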
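One possible way to get the desired output is sketched below. It is not necessarily the only or best approach: it passes a per-column dictionary to `groupby(...).agg(...)`, summing "Number_of_units" and keeping the first value of every other column. Keeping the first "Transaction" string and the first "Price" for each date is an assumption made for illustration; the question leaves open how the differing strings should be combined.

```python
import pandas as pd

# Rebuild the frame from the question so the example is self-contained.
dates = pd.DatetimeIndex(
    [
        "2012-10-11", "2012-10-16", "2012-11-16", "2012-12-07", "2012-12-11",
        "2012-12-11", "2012-12-11", "2012-12-11", "2012-12-11", "2012-12-20",
    ],
    name="Date",
)
transactions_ind = pd.DataFrame(
    {
        "Ticker": ["ROG VX Equity"] * 10,
        "Transaction": [
            "Buy", "Sell", "Sell", "Buy", "Sell",
            "Reversal: Sell", "Sell", "Reversal: Sell", "Sell", "Sell",
        ],
        "Number_of_units": [
            12000, -5000, -5000, 5000, -3000, 3000, -3000, 3000, -3000, -5000,
        ],
        "Price": [
            182.0, 184.7, 175.5158, 184.9, 188.5, 188.5, 188.5, 188.5, 188.5, 185.8,
        ],
    },
    index=dates,
)

# Group on the index and choose one aggregation per column:
# sum the units, keep the first value of everything else (an assumption).
result = transactions_ind.groupby(level=0).agg(
    {
        "Ticker": "first",
        "Transaction": "first",
        "Number_of_units": "sum",
        "Price": "first",
    }
)
print(result)
```

If the distinct "Transaction" labels should be preserved rather than reduced to the first one, the `"first"` entry for that column could be swapped for a function such as `lambda s: ", ".join(s.unique())`.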
【Comments】:
Tags: python pandas indexing duplicates