【发布时间】:2021-08-05 11:28:46
【问题描述】:
我想可视化波士顿数据集的数据点,并绘制线性回归平面。但是,我收到一个值错误。我正在使用 colab。以下是我运行的代码。
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.datasets import load_boston
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
data1 = load_boston()
df1 = pd.DataFrame(data1.data)
df1.columns = data1.feature_names
df1['price']=data1.target
X = df1.iloc[:,0:13]
Y = df1.iloc[:,13]
xtrain,xtest,ytrain,ytest = train_test_split(X,Y,test_size = 0.2,random_state =0)
lin_reg = LinearRegression()
lin_reg.fit(xtrain,ytrain)
ypredict = lin_reg.predict(xtest)
plt.scatter(xtrain, ytrain, color = 'red')
plt.plot(xtrain, lin_reg.predict(xtrain), color = 'blue')
最后两行出现错误。
ValueError Traceback (most recent call last)
<ipython-input-54-4f89e0554222> in <module>()
----> 1 plt.scatter(xtrain, ytrain, color = 'red')
2 plt.plot(xtrain, lin_reg.predict(xtrain), color = 'blue')
3 frames
/usr/local/lib/python3.7/dist-packages/matplotlib/axes/_axes.py in scatter(self, x, y, s, c, marker, cmap, norm, vmin, vmax, alpha, linewidths, verts, edgecolors, plotnonfinite, **kwargs)
4389 y = np.ma.ravel(y)
4390 if x.size != y.size:
-> 4391 raise ValueError("x and y must be the same size")
4392
4393 if s is None:
ValueError: x and y must be the same size
我知道 X 有 13 列,而 Y 有 1 列。这就是错误显示的原因。但我不知道如何纠正它。
有人可以帮忙吗?
【问题讨论】:
标签: python pandas matplotlib scikit-learn linear-regression