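[Posted]: 2020-03-25 01:47:47
[Question]:
I have a basic scatter plot and want to show all outliers in a different color. I define an outlier as a point that is more than 2 standard deviations away from the mean. The code I came up with colors only one outlier, whereas I want every outlier drawn in a different color from the rest of the points: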
import pandas as pd
import matplotlib.pyplot as plt
import numpy as np
data = pd.read_csv('1fXr31hcEemkYxLyQ1aU1g_50fc36ee697c4b158fe26ade3ec3bc24_Banknote-authentication-dataset- (1).csv')
data = np.array(data)
mean = np.mean(data, 0)
min = np.min(data,0)
max = np.max(data,0)
normed = (data - min) / (max - min)
mean = np.mean(normed, 0)
std_dev = np.std (normed, 0)
fig, graph = plt.subplots()
graph.scatter(normed [:,0], normed [:,1])
graph.scatter(mean[0], mean[1])
outliers = normed[normed>2*std_dev]
graph.scatter(outliers [0], outliers [1], c='red')
plt.show()
[Discussion]:
Tags: pandas filter colors scatter-plot outliers
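A likely cause: indexing a 2-D array with a 2-D boolean mask, as in normed[normed>2*std_dev], returns a flattened 1-D array of values, so outliers[0] and outliers[1] are just two scalars and scatter draws them as a single point. The comparison also tests the raw values against 2*std_dev instead of testing their distance from the mean. Below is a minimal sketch of one way to build a per-row mask instead. The helper names outlier_mask and plot_outliers, the n_std parameter, and the synthetic demo data are assumptions made for illustration, as is the rule that a row counts as an outlier when any of its columns is too far from that column's mean.

```python
import numpy as np


def outlier_mask(points, n_std=2.0):
    """Return a boolean mask, one entry per row, marking outlier rows.

    Assumed rule: a row is an outlier when at least one of its columns lies
    more than n_std standard deviations away from that column's mean.
    """
    points = np.asarray(points, dtype=float)
    mean = points.mean(axis=0)
    std_dev = points.std(axis=0)
    # Compare the distance from the mean, not the raw value, to the threshold.
    return (np.abs(points - mean) > n_std * std_dev).any(axis=1)


def plot_outliers(points, mask):
    """Scatter the first two columns, drawing the masked rows in red."""
    # Imported here so that outlier_mask itself only needs NumPy.
    import matplotlib.pyplot as plt

    fig, graph = plt.subplots()
    graph.scatter(points[~mask, 0], points[~mask, 1], label="inliers")
    graph.scatter(points[mask, 0], points[mask, 1], c="red", label="outliers")
    graph.legend()
    plt.show()


# Synthetic stand-in for the normalized banknote data (an assumption,
# since the CSV from the question is not available here).
rng = np.random.default_rng(0)
demo = rng.normal(size=(200, 2))
demo_mask = outlier_mask(demo)
```

With the array from the question, something like mask = outlier_mask(normed[:, :2]) followed by plot_outliers(normed[:, :2], mask) would restrict the test to the two plotted columns. Indexing rows with the 1-D mask keeps each point's x and y together, which is what the flattened result in the original code loses.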