【发布时间】:2020-12-14 18:52:19
【问题描述】:
这是我的代码
print(len(image_dataset.data))
print(len(phylum_target))
X_train, X_test, y_train, y_test = train_test_split(image_dataset.data, phylum_target, test_size=0.2,random_state=109)
这是输出和错误
5000
5000
Traceback (most recent call last):
File "Image_SVM_run_only.py", line 298, in <module>
X_train_temp, X_test_temp, y_train_temp, y_test_temp = train_test_split(image_dataset.data, phylum_target, test_size=0.2,random_state=109)
File "/root/anaconda3/envs/IBC/lib/python3.7/site-packages/sklearn/model_selection/_split.py", line 2127, in train_test_split
arrays = indexable(*arrays)
File "/root/anaconda3/envs/IBC/lib/python3.7/site-packages/sklearn/utils/validation.py", line 293, in indexable
check_consistent_length(*result)
File "/root/anaconda3/envs/IBC/lib/python3.7/site-packages/sklearn/utils/validation.py", line 257, in check_consistent_length
" samples: %r" % [int(l) for l in lengths])
ValueError: Found input variables with inconsistent numbers of samples: [4999, 5000]
即使训练数据和测试数据的长度相同,我还是遇到了这个错误。 请帮帮我T.T
【问题讨论】:
-
不仅仅是长度,每个的长度/形状[0]是多少?
-
` print(type(image_dataset.data)) print(type(phylum_target)) print(image_dataset.data.shape) print(phylum_target.shape) `
<class 'numpy.ndarray'> <class 'numpy.ndarray'> (5000, 49152) (5000, 1) -
image_dataset.data[0].shape 是 (49152,),phylum_target[0].shape 是 (1,)
标签: python scikit-learn train-test-split