【发布时间】:2014-05-11 02:39:56
【问题描述】:
我正在尝试使用一组样本权重来运行简单的 Sklearn Ridge 回归。 X_train 是一个 ~200k x 100 2D Numpy 数组。尝试使用 sample_weight 选项时出现内存错误。如果没有该选项,它就可以正常工作。为简单起见,我将功能减少到 2,但 sklearn 仍然给我一个内存错误。 有什么想法吗?
model=linear_model.Ridge()
model.fit(X_train, y_train,sample_weight=w_tr)
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/home/g/anaconda/lib/python2.7/site-packages/sklearn/linear_model/ridge.py", line 449, in fit
return super(Ridge, self).fit(X, y, sample_weight=sample_weight)
File "/home/g/anaconda/lib/python2.7/site-packages/sklearn/linear_model/ridge.py", line 338, in fit
solver=self.solver)
File "/home/g/anaconda/lib/python2.7/site-packages/sklearn/linear_model/ridge.py", line 286, in ridge_regression
K = safe_sparse_dot(X, X.T, dense_output=True)
File "/home/g/anaconda/lib/python2.7/site-packages/sklearn/utils/extmath.py", line 83, in safe_sparse_dot
return np.dot(a, b)
MemoryError
【问题讨论】:
标签: python scikit-learn regression