【发布时间】:2020-01-14 09:05:45
【问题描述】:
我有一个如下所示的数据框:
2015 2016 2017 2018 2019
Jan 10991273 15761628 20702648 28836397 32462658
Feb 11910590 18407640 21867727 29592821 34120748
Mar 14720817 22353179 28631090 37017871 41731600
Apr 14240928 22383817 28698805 36719286 40352427
May 13999937 22562954 30105567 38492879 42443240
Jun 14618367 23485559 31294015 40219835 42243535
Jul 16954514 22816656 31235203 39633588 41259052
Aug 16384802 22418011 30851292 39153646 38739977
Sep 17546431 22387746 30207602 37501028 37273751
Oct 19069836 24623265 34346016 42694615 0
Nov 15864435 21323086 30132954 34682955 0
Dec 14728914 18795856 26990426 33064396 0
基本上我是在尝试预测那些零值。
import pandas as pd
import numpy as np
from sklearn.linear_model import LinearRegression
lm = LinearRegression()
Z = df[['2015', '2016', '2017', '2018']]
lm.fit(Z , df['2019'])
y_predict = lm.predict(Z)
print(y_predict)
输出是这样的:
array([31915988.349427 , 42517368.34239061, 42188302.01643895,
40885793.12471981, 39423851.45561062, 43649973.20909937,
29415292.34136068, 28706985.92182099, 21561488.69575803,
24801192.93635751, -508990.40716472, 6069742.01418072])
该输出是否代表对这些值的预测?
【问题讨论】:
标签: python machine-learning linear-regression