【发布时间】:2023-01-20 18:33:01
【问题描述】:
图片如下熊猫数据框:
+----+------+-------+
| ID | Name | Value |
+----+------+-------+
| 1 | John | 1 |
+----+------+-------+
| 1 | John | 4 |
+----+------+-------+
| 1 | John | 10 |
+----+------+-------+
| 1 | John | 50 |
+----+------+-------+
| 1 | Adam | 6 |
+----+------+-------+
| 1 | Adam | 3 |
+----+------+-------+
| 2 | Jen | 9 |
+----+------+-------+
| 2 | Jen | 6 |
+----+------+-------+
我想应用 groupby 函数并创建一个新列,它将 Value 值存储为从当前到最后一个 groupby 值的列表。
像那样:
+----+------+-------+----------------+
| ID | Name | Value | NewCol |
+----+------+-------+----------------+
| 1 | John | 1 | [1, 4, 10, 50] |
+----+------+-------+----------------+
| 1 | John | 4 | [4, 10, 50] |
+----+------+-------+----------------+
| 1 | John | 10 | [10, 50] |
+----+------+-------+----------------+
| 1 | John | 50 | [50] |
+----+------+-------+----------------+
| 1 | Adam | 6 | [6, 3] |
+----+------+-------+----------------+
| 1 | Adam | 3 | [3] |
+----+------+-------+----------------+
| 2 | Jen | 9 | [9, 6] |
+----+------+-------+----------------+
| 2 | Jen | 6 | [9] |
+----+------+-------+----------------+
这无论如何都可以使用 pandas groupby 函数吗?
【问题讨论】: