【发布时间】:2018-07-31 20:22:00
【问题描述】:
我尝试通过 SGDClassifer.partial_fit 预测我新添加的数据的标签,如下所示:
from sklearn import neighbors, linear_model
import numpy as np
def train_predict():
X = [[1, 1], [2, 2.5], [2, 6.8], [4, 7]]
y = [1, 2, 3, 4]
sgd_clf = linear_model.SGDClassifier(loss="log")
sgd_clf.fit(X, y)
X1 = [[6,9]]
y1=[5]
f1 = sgd_clf.partial_fit(X1,y1)
f1.predict([[6,9]])
return f1
if __name__ == "__main__":
clf = train_predict()
fit 可以完美地预测标签。但是,部分拟合的预测会导致错误:
in compute_class_weight
raise ValueError("classes should include all valid labels that can be in y")
类似Sklearn SGDC partial_fit ValueError: classes should include all valid labels that can be in y,我看了partial_fit手册,http://scikit-learn.org/stable/modules/generated/sklearn.linear_model.SGDClassifier.html#sklearn.linear_model.SGDClassifier.partial_fit
但我仍然无法弄清楚如何设置 partial_fit 的参数,以便能够预测动态添加的数据。
任何参考或想法?
【问题讨论】:
标签: python machine-learning scikit-learn