【发布时间】:2022-01-24 08:58:53
【问题描述】:
我已经尝试过 .append 方法。代码是正确的,但它没有做任何事情。 我的 .csv 太大而无法打开,我无法实际添加,所以如果有人可以解决我的问题,请回答:
Code:
import pandas as pd
ARP_MitM_dataset = pd.read_csv('/content/drive/MyDrive/ARP MitM_dataset-002.csv');
label = pd.read_csv('/content/drive/MyDrive/ARP MitM_labels.csv');
t = iter(range(1, 401))
ARP_MitM_dataset.columns = ['Column'+str(i).format(next(t)) if 1 <= i <= 499 else x for i, x in enumerate(ARP_MitM_dataset.columns, 1)]
dataArr = ARP_MitM_dataset
labelArr = label
dataArr.append({' ':2504267}, ignore_index = True) <------ Check
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(dataArr,labelArr, test_size = 0.40, random_state = 42) <--- Error
print(X_train.shape,y_train.shape)
print(X_test.shape,y_test.shape)
Error Showing:
ValueError:发现输入变量的数量不一致 样本:[2504266, 2504267]
【问题讨论】:
-
add = {'Column1': ['380.76885']} data2 = pd.DataFrame(add) dataArr = ARP_MitM_dataset labelArr = label dataArr.append(data2, ignore_index = True, sort = False) dataArr