【发布时间】:2017-07-30 14:25:07
【问题描述】:
如何获取 sparkR 分类中的性能矩阵,例如 F1 分数、Precision、Recall、Confusion Matrix
# Load training data
df <- read.df("data/mllib/sample_libsvm_data.txt", source = "libsvm")
training <- df
testing <- df
# Fit a random forest classification model with spark.randomForest
model <- spark.randomForest(training, label ~ features, "classification", numTrees = 10)
# Model summary
summary(model)
# Prediction
predictions <- predict(model, testing)
head(predictions)
# Performance evaluation
我试过caret::confusionMatrix(testing$label,testing$prediction)它显示错误:
Error in unique.default(x, nmax = nmax) : unique() applies only to vectors
【问题讨论】:
标签: apache-spark machine-learning apache-spark-sql spark-dataframe sparkr