【发布时间】:2020-06-27 23:52:17
【问题描述】:
我正在计算 evaluating the cluster performance 的调整后兰德指数分数。假设,真实的集群和预测的集群如下所示。 {i, "x"} 格式表明元素 "x" 位于 ith 簇中。
>>> labels_true = [{0,"a"}, {0,"b"}, {0,"c"}, {1,"d"}, {1,"e"}, {1,"f"}]
>>> labels_pred = [{0,"a"}, {0,"b"}, {1,"c"}, {1,"d"}, {2,"e"}, {2,"f"}]
>>> metrics.adjusted_rand_score(labels_true, labels_pred)
ARI 分数即将达到 1.0,但它似乎不应该是 1.0,因为预测的集群与真实的集群不同。
我想知道这是否是计算 ARI 分数的有效方法。
【问题讨论】:
标签: python scikit-learn cluster-analysis