【Posted】: 2017-07-20 11:06:02
【Question】:
I want to apply cumsum to only one specific column, because the values in my other columns must stay unchanged.
This is my current script:
df.groupby(by=['name','day']).sum().groupby(level=[0]).cumsum()
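However, this script makes every column in my pandas df cumulative. The only column that should be cumulatively summed is data.
As requested, here is some sample data: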
import pandas as pd

df = pd.DataFrame({'ID': ["880022443344556677787", "880022443344556677782", "880022443344556677787",
                          "880022443344556677782", "880022443344556677787", "880022443344556677782",
                          "880022443344556677781"],
                   'Month': ["201701", "201701", "201702", "201702", "201703", "201703", "201703"],
                   'Usage': [20, 40, 100, 50, 30, 30, 2000],
                   'Sec': [10, 15, 20, 1, 5, 6, 30]})
                      ID   Month  Sec  Usage
0  880022443344556677787  201701   10     20
1  880022443344556677782  201701   15     40
2  880022443344556677787  201702   20    100
3  880022443344556677782  201702    1     50
4  880022443344556677787  201703    5     30
5  880022443344556677782  201703    6     30
6  880022443344556677781  201703   30   2000
Desired output:
                      ID   Month  Sec  Usage
0  880022443344556677787  201701   10     20
1  880022443344556677782  201701   15     40
2  880022443344556677787  201702   20    120
3  880022443344556677782  201702    1     90
4  880022443344556677787  201703    5    150
5  880022443344556677782  201703    6    120
6  880022443344556677781  201703   30   2000
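For illustration, one possible way to reach this output is to select only the Usage column from the grouped object before calling cumsum, then assign the result back. This is a minimal sketch, not necessarily the approach an answer would recommend. It assumes the running total should be kept per ID and that the rows are already in chronological order within each ID, as they are in the sample data.

```python
import pandas as pd

df = pd.DataFrame({
    'ID': ["880022443344556677787", "880022443344556677782", "880022443344556677787",
           "880022443344556677782", "880022443344556677787", "880022443344556677782",
           "880022443344556677781"],
    'Month': ["201701", "201701", "201702", "201702", "201703", "201703", "201703"],
    'Usage': [20, 40, 100, 50, 30, 30, 2000],
    'Sec': [10, 15, 20, 1, 5, 6, 30],
})

# Assumption: rows are already ordered by Month within each ID.
# Selecting ['Usage'] restricts the cumulative sum to that one column;
# the result keeps the original index, so it can be assigned straight back
# while ID, Month and Sec stay untouched.
df['Usage'] = df.groupby('ID')['Usage'].cumsum()

print(df)
```

If the rows were not already ordered, sorting first (for example with df.sort_values(['ID', 'Month'])) would presumably be needed before taking the cumulative sum.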
【Discussion】:
Tags: python pandas cumulative-sum