【发布时间】:2021-01-05 01:36:57
【问题描述】:
当从 3 个训练模型绘制 3 个模型指标(RMSE、MAE、Rsquared)到测试集指标时,我试图证明神经网络模型是最好的
- 训练和测试指标之间的距离最小
- 它还具有足够低的 RSME/MAE 和高 Rsquared
现在从所附的情节中并不那么明显。此外,指标的尺度不同,因为 Rsquared 在 [0,1] 区间内。有没有办法更好地绘制它,最好在同一个图上?
> trn
model RMSE Rsquared MAE dataType
1 Linear Reg 9.17 0.51 6.03 train
2 SVM Radial 7.86 0.64 4.86 train
3 Neural Networks 8.55 0.57 5.59 train
> tst
model RMSE Rsquared MAE dataType
1 Linear Reg 9.40 0.53 5.95 test
2 SVM Radial 9.16 0.55 5.50 test
3 Neural Networks 8.66 0.60 5.48 test
>
可重现的代码:
trn <- structure(list(model = c("Linear Reg", "SVM Radial", "Neural Networks"),
RMSE = c(9.17, 7.86, 8.55), Rsquared = c(0.51, 0.64, 0.57),
MAE = c(6.03, 4.86, 5.59)),
row.names = c(NA, -3L), class = "data.frame")
tst <- structure(list(model = c("Linear Reg", "SVM Radial", "Neural Networks"),
RMSE = c(9.4, 9.16, 8.66), Rsquared = c(0.53, 0.55, 0.6),
MAE = c(5.95, 5.5, 5.48)),
row.names = c(NA, -3L), class = "data.frame")
trn['dataType'] = 'train'
tst['dataType'] = 'test'
long_tbl <- rbind(trn, tst) %>%
pivot_longer(cols =!c('model', 'dataType'), names_to = 'metric', values_to='value')
ggplot(long_tbl, aes(x=model, y=value, shape = dataType, colour = metric )) +
geom_point()
【问题讨论】:
标签: r ggplot2 cross-validation