【发布时间】:2015-10-10 13:05:00
【问题描述】:
我希望通过 app_name 按版本获取计数差异。我的数据集如下所示:app_name, version_id, count, [difference]
这是数据集
data = structure(list(app_name = structure(c(1L, 1L, 1L, 2L, 2L, 2L,
2L, 3L, 3L), .Label = c("a", "b", "c"), class = "factor"), version_id = c(1,
1.1, 2.3, 2, 3.1, 3.3, 4, 1.1, 2.4), count = c(600L, 620L, 620L,
200L, 200L, 250L, 250L, 15L, 36L)), .Names = c("app_name", "version_id",
"count"), class = "data.frame", row.names = c(NA, -9L))
鉴于此 data.frame,我如何通过 app_name 和 version_id 获得计数的滞后差异?每个应用程序的初始(第一个)版本差异为零,因为没有区别。
以下是最终结果与最终“差异”列的示例
structure(list(app_name = structure(c(1L, 1L, 1L, 2L, 2L, 2L,
2L, 3L, 3L), .Label = c("a", "b", "c"), class = "factor"), version_id = c(1,
1.1, 2.3, 2, 3.1, 3.3, 4, 1.1, 2.4), count = c(600L, 620L, 620L,
200L, 200L, 250L, 250L, 15L, 36L), diff = c(0, 20, 0, 0, 0, 1.25,
0, 0, 2.4)), .Names = c("app_name", "version_id", "count", "diff"
), class = "data.frame", row.names = c(NA, -9L))
【问题讨论】:
-
到目前为止你尝试了什么?
-
@Pascal 我一直在尝试使用 mutate() 无济于事。关注这个帖子:stackoverflow.com/questions/31362397/…