【问题标题】:Transfer Learning model gives 0 accuracy regardless of architecture无论架构如何,迁移学习模型都提供 0 准确度
【发布时间】:2021-06-02 22:39:21
【问题描述】:

我正在尝试使用 Keras 和迁移学习来开发模型。我使用的数据集可以在这里找到:https://github.com/faezetta/VMMRdb

我选取了样本最多的 10 类汽车品牌,并使用迁移学习训练了两个基于 VGG16 架构的模型,如下面的代码所示。

samples_counts = utils.read_dictionary(utils.TOP10_BRANDS_COUNTS_NAME)

train_dataset = image_dataset_from_directory(
    directory=utils.TRAIN_SET_LOCATION,
    labels='inferred',
    label_mode='categorical',
    class_names=list(samples_counts.keys()),
    color_mode='rgb',
    batch_size=32,
    image_size=(56, 56),
    shuffle=True,
    seed=utils.RANDOM_STATE,
    validation_split=0.2,
    subset='training',
    interpolation='bilinear'
)

validation_dataset = image_dataset_from_directory(
    directory=utils.TRAIN_SET_LOCATION,
    labels='inferred',
    label_mode='categorical',
    class_names=list(samples_counts.keys()),
    color_mode='rgb',
    batch_size=32,
    image_size=(56, 56),
    shuffle=True,
    seed=utils.RANDOM_STATE,
    validation_split=0.2,
    subset='validation',
    interpolation='bilinear'
)

test_dataset = image_dataset_from_directory(
    directory=utils.TEST_SET_LOCATION,
    labels='inferred',
    label_mode='categorical',
    class_names=list(samples_counts.keys()),
    color_mode='rgb',
    batch_size=32,
    image_size=(56, 56),
    shuffle=True,
    seed=utils.RANDOM_STATE,
    interpolation='bilinear'
)

image_shape = (utils.RESIZE_HEIGHT, utils.RESIZE_WIDTH, 3)
base_model = apps.VGG16(include_top=False, weights='imagenet', input_shape=image_shape)
base_model.trainable = False

preprocess_input = apps.vgg16.preprocess_input
flatten_layer = layers.Flatten(name='flatten')
specialisation_layer = layers.Dense(1024, activation='relu', name='specialisation_layer')
avg_pooling_layer = layers.GlobalAveragePooling2D(name='pooling_layer')
dropout_layer = layers.Dropout(0.2, name='dropout_layer')
classification_layer = layers.Dense(10, activation='softmax', name='classification_layer')

inputs = tf.keras.Input(shape=(utils.RESIZE_HEIGHT, utils.RESIZE_WIDTH, 3))
x = preprocess_input(inputs)
x = base_model(x, training=False)

# First model
# x = flatten_layer(x)
# x = specialisation_layer(x)

# Second model
x = avg_pooling_layer(x)
x = dropout_layer(x)
outputs = classification_layer(x)
model = tf.keras.Model(inputs, outputs)

model.summary()

steps_per_epoch = len(train_dataset)
validation_steps = len(validation_dataset)
base_learning_rate = 0.0001
optimizer = optimizers.Adam(learning_rate=base_learning_rate)
loss_function = losses.CategoricalCrossentropy()
train_metrics = [metrics.Accuracy(), metrics.AUC(), metrics.Precision(), metrics.Recall()]

model.compile(optimizer=optimizer,
              loss=loss_function,
              metrics=train_metrics)

initial_results = model.evaluate(validation_dataset,
                                 steps=validation_steps,
                                 return_dict=True)

training_history = model.fit(train_dataset, epochs=10, verbose=0,
                             validation_data=validation_dataset,
                             callbacks=[TqdmCallback(verbose=2)],
                             steps_per_epoch=steps_per_epoch,
                             validation_steps=validation_steps)

history = training_history.history
final_results = model.evaluate(test_dataset,
                              return_dict=True,
                              callbacks=[TqdmCallback(verbose=2)])

总的来说,我一直得到 0 的准确性和糟糕的指标。我尝试了Transfer learning bad accuracyMNIST and transfer learning with VGG16 in Keras- low validation accuracy中提到的解决方案,但没有成功。

第一个模型的总结和结果是:

Model: "functional_1"
input_2 (InputLayer)         [(None, 56, 56, 3)]       0
tf_op_layer_strided_slice (T [(None, 56, 56, 3)]       0
tf_op_layer_BiasAdd (TensorF [(None, 56, 56, 3)]       0
vgg16 (Functional)           (None, 1, 1, 512)         14714688
flatten (Flatten)            (None, 512)               0
specialisation_layer (Dense) (None, 1024)              525312
classification_layer (Dense) (None, 10)                10250

Total params: 15,250,250
Trainable params: 535,562
Non-trainable params: 14,714,688
Initial results: loss = 9.01, accuracy = 0.0, auc = 0.53, precision = 0.13, recall = 0.12
Final results: loss = 2.5, accuracy = 0.0, auc = 0.71, precision = 0.31, recall = 0.14

第二个模型的总结和结果是:

Model: "functional_1"
input_2 (InputLayer)         [(None, 56, 56, 3)]       0
tf_op_layer_strided_slice (T [(None, 56, 56, 3)]       0
tf_op_layer_BiasAdd (TensorF [(None, 56, 56, 3)]       0
vgg16 (Functional)           (None, 1, 1, 512)         14714688
pooling_layer (GlobalAverage (None, 512)               0
dropout_layer (Dropout)      (None, 512)               0
classification_layer (Dense) (None, 10)                5130

Total params: 14,719,818
Trainable params: 5,130
Non-trainable params: 14,714,688
Initial Results: loss = 21.6, accuracy = 0, auc = 0.48, precision = 0.07, recall = 0.07
Final Results: loss = 2.02, accuracy = 0, auc = 0.72, precision = 0.44, recall = 0.009

【问题讨论】:

  • 你试过设置 base_model.trainable=True 吗?
  • 不,因为我不想重新训练 VGG16 网络,我只想训练我放在上面的层(迁移学习)

标签: python tensorflow keras tensorflow2.0 transfer-learning


【解决方案1】:

在下面的代码中

# Second model
x = avg_pooling_layer(x)
x = dropout_layer(x)
outputs = classification_layer(x)
model = tf.keras.Model(inputs, outputs)

您需要在 avg_pooling_layer 之后添加一个 Flatten 层。或者改变 ave_pooling_lay 到 GlobalMaxPooling2D 层,这是我认为最好的。所以你的第二个模型是

x=tf.keras.layers.GlobalMaxPooling2D()(x)
x = dropout_layer(x)
outputs = classification_layer(x)
model = tf.keras.Model(inputs, outputs)

同样在 Vgg 中你可以设置参数 pooling='average 那么输出是一维张量所以你不需要展平它也不需要添加 全球平均池化。在您的 test_dataset 和 validation_dataset 中设置 shuffle=False 并设置 seed=None。您的 steps_per_epoch 和验证步骤的值不正确。它们通常设置为样本数//batch_size。您可以在 model.fit 中将这些值保留为 None ,它将在内部确定这些值,还设置 verbose=1 以便您可以查看每个时期的训练结果。离开 callbacks=None 我什至不知道 TqdmCallback(verbose=2) 是什么。未在我能找到的任何文档中列出。

【讨论】:

  • 谢谢,你的建议我已经试过了。设置 shuffle = False 和 seed = None 并使用最大池确实有帮助,但是 steps_per_epoch 和 validation_steps 是正确的,因为数据集的长度是作为可用批次数计算的。 TqdmCallback 是一个显示进度条的回调,因此它不会影响任何内容。我不再获得 0 准确度,但验证集结果与测试集结果有很大不同。我会针对这个问题发布另一个问题。
猜你喜欢
  • 2019-01-30
  • 2022-11-12
  • 1970-01-01
  • 1970-01-01
  • 1970-01-01
  • 1970-01-01
  • 1970-01-01
  • 2022-12-03
  • 2018-09-24
相关资源
最近更新 更多