【发布时间】:2017-07-22 15:51:26
【问题描述】:
我是 Tensorflow 的新手,我尝试训练我的 CNN 模型以在未来对人脸进行分类。我有一个包含 56 个人的图像数据集,他们裁剪的脸是 numpy 数组,形状为 [-1,224,224,3] 和 float32 类型。当我尝试将 feed_dict 输入 tensorflow 我只是附上我的 train_X 和 train_Y 的样子,以便输入 tensorflow
我收到典型的错误 ValueError: Cannot feed value of shape (224, 224, 3) for Tensor 'Placeholder_3:0',其形状为 '(?, 224, 224, 3)'。这似乎很容易理解,但我不知道如何修改我的代码以使其工作。
我的 TensorFlow 代码在这里
import tensorflow as tf
config = tf.ConfigProto()
config.gpu_options.allocator_type = 'BFC'
#config.gpu_options.allow_growth = True
config.gpu_options.per_process_gpu_memory_fraction = 0.6
n_classes = 56
batch_size = 1
hm_epochs = 100
#x = tf.placeholder('float', [None, 150528])
x = tf.placeholder('float', [None, 224,224,3])
y = tf.placeholder('float')
keep_rate = 0.8
keep_prob = tf.placeholder(tf.float32)
def conv2d(x, W):
return tf.nn.conv2d(x, W, strides=[1,1,1,1], padding='SAME')
def maxpool2d(x):
# size of window movement of window
return tf.nn.max_pool(x, ksize=[1,2,2,1], strides=[1,2,2,1], padding='SAME')
def convolutional_neural_network(x):
weights = {'W_conv1':tf.Variable(tf.random_normal([5,5,3,32])),
'W_conv2':tf.Variable(tf.random_normal([5,5,32,64])),
'W_fc':tf.Variable(tf.random_normal([224*224*3,1024])),
'out':tf.Variable(tf.random_normal([1024, n_classes]))}
biases = {'b_conv1':tf.Variable(tf.random_normal([32])),
'b_conv2':tf.Variable(tf.random_normal([64])),
'b_fc':tf.Variable(tf.random_normal([1024])),
'out':tf.Variable(tf.random_normal([n_classes]))}
x = tf.reshape(x, shape=[-1, 224, 224, 3])
#x = train_X
#creating the first layer of CNN
conv1 = tf.nn.relu(conv2d(x, weights['W_conv1']) + biases['b_conv1']) # activation function 1
conv1 = maxpool2d(conv1)
#creating the second layer of CNN
conv2 = tf.nn.relu(conv2d(conv1, weights['W_conv2']) + biases['b_conv2']) # activation function 2
conv2 = maxpool2d(conv2)
fc = tf.reshape(conv2,[-1, 224*224*3])
fc = tf.nn.relu(tf.matmul(fc, weights['W_fc'])+biases['b_fc'])
fc = tf.nn.dropout(fc, keep_rate)
output = tf.matmul(fc, weights['out'])+biases['out']
return output
def train_neural_network(x):
i = 0
prediction = convolutional_neural_network(x)
cost = tf.reduce_mean( tf.nn.softmax_cross_entropy_with_logits(logits=prediction,labels=y) )
optimizer = tf.train.AdamOptimizer().minimize(cost)
with tf.Session(config = config) as sess:
sess.run(tf.global_variables_initializer())
for epoch in range(hm_epochs):
epoch_loss = 0
for _ in range(int(len(train_X)/batch_size)):
_, c = sess.run([optimizer, cost], feed_dict={x: train_X[i:i+batch_size], y: train_y[i:i+batch_size]}) #HERE IS THE ERROR
epoch_loss += c
i += 100
print('Epoch', epoch, 'completed out of',hm_epochs,'loss:',epoch_loss)
train_neural_network(x)
如果有人能够帮助我解决所有问题,我会非常高兴。提前感谢您的所有帮助。 P.S 顺便说一句,我需要以我的 GPU 不会给我 OOM 的方式对我的数据进行批处理。因为我可以更改进料方式以排除批处理,并且除了 OOM 错误之外它工作正常。有趣的故事,当我重新启动内核并再次运行它几次。发生了另一个错误 - InvalidArgumentError(请参阅上面的回溯):reshape 的输入是具有 200704 值的张量,但请求的形状需要 150528 的倍数。200704 根本不能在这里,因为这是 224*224*4 当我只有 224*224*3
【问题讨论】:
-
将您的 train_X 重塑为 [-1,224,224,3] ,您输入的是单个输入,因此它应该是 [1, 224, 224, 3] 而不是 [224, 224, 3]跨度>
标签: python tensorflow convolution