【发布时间】:2020-12-01 02:58:08
【问题描述】:
我正在研究叶片病害检测,并尝试在SpMohanty's PlantVillage-dataset 上实施 CNN。
它有38 classes,每个类中的可变图像范围从 1500 到 3000 个图像/类。 Total images = 54303
这是我的Colab notebook。问题是在预测图像时,它总是抛出模型训练的第一类。我不明白为什么。
这是模型摘要:
Model: "sequential_1"
_________________________________________________________________
Layer (type) Output Shape Param #
=================================================================
conv2d_1 (Conv2D) (None, 64, 64, 32) 896
_________________________________________________________________
activation_1 (Activation) (None, 64, 64, 32) 0
_________________________________________________________________
batch_normalization_1 (Batch (None, 64, 64, 32) 128
_________________________________________________________________
max_pooling2d_1 (MaxPooling2 (None, 21, 21, 32) 0
_________________________________________________________________
dropout_1 (Dropout) (None, 21, 21, 32) 0
_________________________________________________________________
conv2d_2 (Conv2D) (None, 21, 21, 64) 18496
_________________________________________________________________
activation_2 (Activation) (None, 21, 21, 64) 0
_________________________________________________________________
batch_normalization_2 (Batch (None, 21, 21, 64) 256
_________________________________________________________________
conv2d_3 (Conv2D) (None, 21, 21, 64) 36928
_________________________________________________________________
activation_3 (Activation) (None, 21, 21, 64) 0
_________________________________________________________________
batch_normalization_3 (Batch (None, 21, 21, 64) 256
_________________________________________________________________
max_pooling2d_2 (MaxPooling2 (None, 10, 10, 64) 0
_________________________________________________________________
dropout_2 (Dropout) (None, 10, 10, 64) 0
_________________________________________________________________
conv2d_4 (Conv2D) (None, 10, 10, 128) 73856
_________________________________________________________________
activation_4 (Activation) (None, 10, 10, 128) 0
_________________________________________________________________
batch_normalization_4 (Batch (None, 10, 10, 128) 512
_________________________________________________________________
conv2d_5 (Conv2D) (None, 10, 10, 128) 147584
_________________________________________________________________
activation_5 (Activation) (None, 10, 10, 128) 0
_________________________________________________________________
batch_normalization_5 (Batch (None, 10, 10, 128) 512
_________________________________________________________________
max_pooling2d_3 (MaxPooling2 (None, 5, 5, 128) 0
_________________________________________________________________
dropout_3 (Dropout) (None, 5, 5, 128) 0
_________________________________________________________________
flatten_1 (Flatten) (None, 3200) 0
_________________________________________________________________
dense_1 (Dense) (None, 1024) 3277824
_________________________________________________________________
activation_6 (Activation) (None, 1024) 0
_________________________________________________________________
batch_normalization_6 (Batch (None, 1024) 4096
_________________________________________________________________
dropout_4 (Dropout) (None, 1024) 0
_________________________________________________________________
dense_2 (Dense) (None, 38) 38950
_________________________________________________________________
activation_7 (Activation) (None, 38) 0
=================================================================
Total params: 3,600,294
Trainable params: 3,597,414
Non-trainable params: 2,880
_________________________________________________________________
尽管该模型实现了 98.94% 的验证准确率和 99.7% 的训练准确率。
模型历史如下:
Epoch 1/10
678/678 [==============================] - 691s 1s/step - loss: 0.0527 - accuracy: 0.9836 - val_loss: 0.0521 - val_accuracy: 0.9840
Epoch 2/10
678/678 [==============================] - 705s 1s/step - loss: 0.0283 - accuracy: 0.9900 - val_loss: 0.0926 - val_accuracy: 0.9787
Epoch 3/10
678/678 [==============================] - 680s 1s/step - loss: 0.0205 - accuracy: 0.9925 - val_loss: 0.0228 - val_accuracy: 0.9924
Epoch 4/10
678/678 [==============================] - 692s 1s/step - loss: 0.0170 - accuracy: 0.9938 - val_loss: 0.0741 - val_accuracy: 0.9828
Epoch 5/10
678/678 [==============================] - 679s 1s/step - loss: 0.0148 - accuracy: 0.9946 - val_loss: 0.0503 - val_accuracy: 0.9860
Epoch 6/10
678/678 [==============================] - 682s 1s/step - loss: 0.0129 - accuracy: 0.9953 - val_loss: 0.0323 - val_accuracy: 0.9918
Epoch 7/10
678/678 [==============================] - 691s 1s/step - loss: 0.0110 - accuracy: 0.9960 - val_loss: 0.0393 - val_accuracy: 0.9890
Epoch 8/10
678/678 [==============================] - 701s 1s/step - loss: 0.0098 - accuracy: 0.9965 - val_loss: 0.0420 - val_accuracy: 0.9875
Epoch 9/10
678/678 [==============================] - 692s 1s/step - loss: 0.0090 - accuracy: 0.9967 - val_loss: 0.0687 - val_accuracy: 0.9855
Epoch 10/10
678/678 [==============================] - 690s 1s/step - loss: 0.0082 - accuracy: 0.9971 - val_loss: 0.0414 - val_accuracy: 0.9894
这看起来像是模型过度拟合的情况,但我为各种 train_test_splits 做了一个混淆矩阵,我发现 80% 的训练是最好的。所以我不认为这个模型是过拟合的。它也总是预测第一堂课,我尝试更改类的名称[例如:Apple_scab 到 Z_Apple_Scab],然后它开始打印输出为:Apple_Blackrot(Apple_scab 之后的下一个标签按字母顺序排列,现在在训练期间成为第一个标签。)
T.I.A.
[更新]:
这是我用来预测新图像的预测函数...
imAr = cvtim(impath)
savedclfmodel = pickle.load(open(model_file,'rb'))
pred = savedclfmodel.predict(imAr)
lb = pickle.load(open(lb_file,'rb'))
ret_data = lb.inverse_transform(pred)[0]
print("Predicted: ",ret_data)
其中 cvtim(image_path) 使用 keras.preprocessing.image.img_to_array 将图像转换为数组
而savedclfmodel使用pickle.load()打开之前保存的模型权重文件并存储在lb中,然后用于查找inverse_transform得到预测标签
【问题讨论】:
标签: tensorflow keras conv-neural-network multilabel-classification multiclass-classification