【发布时间】:2021-11-02 15:06:15
【问题描述】:
import tensorflow as tf
import tensorflow.keras
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Dropout, Activation, Flatten, Conv2D, MaxPooling2D
import pickle
import numpy as np
from keras.models import model_from_json
from keras.models import load_model
import matplotlib.pyplot as plt
# Opening the files about data
X = pickle.load(open("X.pickle", "rb"))
y = pickle.load(open("y.pickle", "rb"))
# normalizing data (a pixel goes from 0 to 255)
X = X/255.0
# Building the model
model = Sequential()
# 3 convolutional layers
model.add(Conv2D(32, (3, 3), input_shape = X.shape[1:]))
model.add(Activation("relu"))
model.add(MaxPooling2D(pool_size=(2,2)))
model.add(Conv2D(64, (3, 3)))
model.add(Activation("relu"))
model.add(MaxPooling2D(pool_size=(2,2)))
model.add(Conv2D(64, (3, 3)))
model.add(Activation("relu"))
model.add(MaxPooling2D(pool_size=(2,2)))
model.add(Dropout(0.9))
# 5 hidden layers
model.add(Flatten())
model.add(Dense(128))
model.add(Activation("relu"))
model.add(Dense(128))
model.add(Activation("relu"))
model.add(Dense(128))
model.add(Activation("relu"))
model.add(Dense(128))
model.add(Activation("relu"))
model.add(Dense(128))
model.add(Activation("relu"))
# The output layer with 7 neurons, for 7 classes
model.add(Dense(13))
model.add(Activation("softmax"))
# Compiling the model using some basic parameters
model.compile(loss="sparse_categorical_crossentropy",
optimizer="adam",
metrics=["accuracy"])
# Training the model, with 40 iterations
# validation_split corresponds to the percentage of images used for the validation phase compared to all the images
print("X = " + str(len(X)))
print("y = " + str(len(y)))
history = model.fit(X, y, batch_size=32, epochs=1000, validation_split=0.1)
# Saving the model
model_json = model.to_json()
with open("model.json", "w") as json_file :
json_file.write(model_json)
model.save_weights("model.h5")
print("Saved model to disk")
model.save('CNN.model')
# Printing a graph showing the accuracy changes during the training phase
print(history.history.keys())
plt.show()
plt.plot(history.history['accuracy'])
plt.plot(history.history['loss'])
plt.title('model accuracy')
plt.ylabel('accuracy')
plt.xlabel('epoch')
plt.legend(['train', 'validation'], loc='upper left')
plt.show()
问题在于,我的训练损失较低,但验证准确度很高。而且验证的准确性也极低。我该如何解决这个问题?我试图将下降值增加到 0.9,但损失仍然高得多。我也尝试使用线性函数进行激活,但没有用。
请帮忙。
【问题讨论】:
-
这个问题太笼统,不清楚,给你一个具体好的建议。例如,我们需要有关您的数据集的信息。学习曲线是怎样的?你能分享一下训练期间的训练和验证损失吗?
-
我同意@FelixKleineBösing 所说的话,我还要补充一点,这甚至可能是题外话。
-
@FelixKleineBösing 我正在使用各种裁剪图像的自定义数据集,每个文件夹中有 50 张图像。我关注的作物共有 7 类。
-
@ChinmaySendye 我们还需要一个损失图,而不仅仅是准确性。在某些情况下,特别是在多类分类中,损失可能会减少,而准确度也会降低。
-
@ChinmayShendye 每个班级有 50 张图片吗?总共350张图片?我认为这是减少数据以获得能够以良好准确性对验证/测试集进行分类的通用模型的方法。
标签: python tensorflow machine-learning