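【Posted】:2019-05-06 06:41:29
【Question】:
I have a data frame with about 150K rows and 77 categorical variables, in the format shown below. How can I find the mean score and the count for each category of each variable?
There is one numeric variable and 77 grouping variables.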
students <- data.frame(ID = c("A", "B", "C", "D"), Gender = c("M", "F", "F", "F"),
                       Socioeconomic = c("Low", "Low", "Medium", "High"),
                       Subject = c("Maths", "Maths", "Science", "Science"), Scores = c(45, 98, 50, 38))
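In other words, I don't want to go through each categorical column separately 77 times; I want a tibble (or a list of tibbles) containing each of the following outputs: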
library(dplyr)  # provides %>%, group_by() and summarise()
students %>% group_by(Gender) %>% summarise(Mean.score = mean(Scores), Count = length(ID))
students %>% group_by(Socioeconomic) %>% summarise(Mean.score = mean(Scores), Count = length(ID))
students %>% group_by(Subject) %>% summarise(Mean.score = mean(Scores), Count = length(ID))
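【Comments】:
-
I'm not sure the linked question is a duplicate (although I think this question has probably been asked before). The linked question focuses on how to summarise multiple numeric columns grouped by one set of categorical columns. This question asks how to summarise a single numeric column grouped by each categorical column in turn.
-
Yes, that's correct. I want to apply two functions to a single numeric column, grouped by each of several categorical columns in turn.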
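A minimal sketch of the per-column summary described above, not a definitive answer. It assumes dplyr 1.0.0 or later (for `across()` inside `group_by()`) and that every column other than `ID` and `Scores` is a grouping variable. The names `group_cols` and `summaries` are made up for illustration.

```r
library(dplyr)

students <- data.frame(
  ID = c("A", "B", "C", "D"),
  Gender = c("M", "F", "F", "F"),
  Socioeconomic = c("Low", "Low", "Medium", "High"),
  Subject = c("Maths", "Maths", "Science", "Science"),
  Scores = c(45, 98, 50, 38)
)

# Assumption: everything except the ID and the numeric score is a grouping column.
group_cols <- setdiff(names(students), c("ID", "Scores"))

# Build one summary tibble per grouping column and collect them in a named list.
summaries <- lapply(
  setNames(group_cols, group_cols),
  function(col) {
    students %>%
      group_by(across(all_of(col))) %>%
      summarise(Mean.score = mean(Scores), Count = n())
  }
)
```

Each element is then available by column name, for example `summaries$Gender` or `summaries[["Subject"]]`, so the 77 columns are handled in one pass over `group_cols` rather than by 77 hand-written calls.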