【发布时间】:2020-06-06 15:22:52
【问题描述】:
我有以下数据,dfs_alltasks:
by_hour task
1 0 Apple Receiving
2 0 Apple Receiving
3 0 Orange Receiving
4 0 Banana Receiving
5 0 Banana Receiving
6 0 Orange Receiving
7 1 Orange Receiving
8 1 Banana Receiving
9 1 Banana Receiving
10 1 Banana Receiving
11 1 Banana Receiving
12 1 Banana Receiving
13 1 Orange Receiving
14 2 Banana Receiving
15 3 Banana Receiving
我喜欢按“by_hour”列分组,同时汇总并返回编号。小组中发生的任务,我应该得到这样的东西:
by_hour task count
1 0 Apple Receiving 2
2 0 Orange Receiving 2
3 0 Banana Receiving 2
4 1 Orange Receiving 2
5 1 Banana Receiving 5
6 2 Banana Receiving 1
7 3 Banana Receiving 1
我尝试过: dfs_alltasks %>% group_by(by_hour) %>% summarise_all(no_rows = length(task))
但我收到“list2(...) 中的错误:找不到对象‘任务’”的错误
【问题讨论】:
-
看起来你只是想要
dplyr::count(dfs_alltasks, by_hour, task)。 -
由于您想同时按 'by_hour' 和 'task' 进行分组,因此您需要将两者都包含在 group_by 参数中。也不需要
summarise_all。summarise将完成这项工作,而不是length(task),使用n()来计算每个段中的行数。
标签: r