【发布时间】:2021-12-11 06:06:59
【问题描述】:
我是新来的,对编程也很陌生,所以任何帮助都将不胜感激。
我有一个如下所示的数据框 df1:
| Picture | Emotion | Gender | Type | Trial | Attr_scores | Fear_scores | Appr_scores | Avoid_scores |
|---|---|---|---|---|---|---|---|---|
| 1 | happy | male | human | first | 11 | 3 | 21 | 21 |
| 2 | sad | male | human | first | 12 | 6 | 22 | 22 |
| 3 | neutral | male | human | first | 13 | 2 | 23 | 23 |
| 4 | happy | male | cartoon | first | 14 | 3 | 24 | 24 |
| 5 | sad | male | cartoon | first | 15 | 6 | 25 | 25 |
| 6 | neutral | male | cartoon | first | 16 | 2 | 26 | 26 |
| 7 | happy | male | animal | first | 17 | 3 | 27 | 27 |
| 8 | sad | male | animal | first | 18 | 6 | 28 | 28 |
| 9 | neutral | male | animal | first | 19 | 2 | 29 | 29 |
| 10 | happy | female | human | first | 20 | 3 | 21 | 30 |
| 11 | sad | female | human | first | 21 | 6 | 22 | 31 |
| 12 | neutral | female | human | first | 22 | 2 | 23 | 32 |
| 13 | happy | female | cartoon | first | 23 | 3 | 24 | 33 |
| 14 | sad | female | cartoon | first | 24 | 6 | 25 | 34 |
| 15 | neutral | female | cartoon | first | 25 | 2 | 26 | 35 |
| 16 | happy | female | animal | first | 26 | 3 | 27 | 36 |
| 17 | sad | female | animal | first | 27 | 6 | 28 | 37 |
| 18 | neutral | female | animal | first | 28 | 2 | 29 | 38 |
这是生成它的代码:
Picture <- c(1:18)
Emotion <- rep(c('happy','sad','neutral'),times=6)
Gender <- rep(c('male','female'),each=9)
Type <- rep(c('human','cartoon','animal','human','cartoon','animal'),each=3)
Trial <- rep(c('first'),times=18)
Attr_scores <- c(11:28)
Fear_scores <- rep(c(3,6,2),times=6)
Appr_scores <- rep(c(21:29),times=2)
Avoid_scores <- c(21:38)
df1<-data.frame(Picture,Emotion,Gender,Type,Trial,Attr_scores,Fear_scores,Appr_scores,Avoid_scores)
我需要获取几对变量(一个自变量 + 一个因变量,例如 Emotion + Attr_scores、Emotion + Fear_scores、Gender + Attr_scores、Gender + Avoid_scores),并对它们中的每一个:1)运行汇总统计(比较平均值和标准差),2)运行单向方差分析,3)创建散点图。
到目前为止,我已经为第一对变量(Gender + Attr_scores)创建了代码。代码如下:
# Summary Statistics
library(dplyr)
group_by(df1, Gender) %>%
summarise(
N = n(),
Mean = mean(Attr_scores, na.rm = TRUE),
Sd = sd(Attr_scores, na.rm = TRUE)
)
# ANOVA
res.aov <- aov(Attr_scores ~ Gender, data = df1)
summary(res.aov)
#Plot
gender_attr_plot <- ggplot(df1, aes(x=Gender, y=Attr_scores)) +
geom_jitter(position=position_jitter(0.2))+
stat_summary(fun.data=mean_sdl, fun.args = list(mult = 1),
geom="pointrange", color="red")
ggsave("gender_attr_plot.png", gender_attr_plot, width = 1600, height = 900, units = "px")
我可以为每对额外的变量复制粘贴代码并每次手动更改变量名称,但这似乎是一种非常低效的处理方式。此外,如果我需要对任何额外的变量对运行相同的分析,我将不得不再次复制整个代码来执行此操作。
我想要做的是创建一个带有变量对的表或嵌套列表(如果需要额外的变量对,以后可以轻松更新)并编写一个循环来遍历这些变量对并执行每个动作的所有 3 个动作(汇总统计、ANOVA 和绘图)。
我认为它应该看起来像这样(这与实际的工作代码相去甚远,只是给出一个大概的想法):
variables <- list(
c(Gender, Attr_scores),
c(Gender, Fear_scores),
c(Type, Appr_scores),
c(Emotion, Avoid_scores))
for(i in variables){
library(dplyr)
group_by(df1, variables,'[[',1) %>%
summarise(
N = n(),
Mean = mean(variables,'[[',2, na.rm = TRUE),
Sd = sd(variables,'[[',2, na.rm = TRUE)
)
res.aov <- aov(variables,'[[',2 ~ variables,'[[',1, data = df1)
summary(res.aov)
plot <- ggplot(df1, aes(x=variables,'[[',1, y=variables,'[[',2)) +
geom_jitter(position=position_jitter(0.2))+
stat_summary(fun.data=mean_sdl, fun.args = list(mult = 1),
geom="pointrange", color="red")
ggsave("??????.png", plot, width = 1600, height = 900, units = "px")
}
显然,这不起作用,我一直在互联网上搜索解决方案,但我对 R 的了解还不足以弄清楚如何使它起作用。任何帮助将不胜感激!
【问题讨论】: