Keras LSTM 一次预测 1 个时间步长答案

【问题标题】：Keras LSTM predict 1 timestep at a timeKeras LSTM 一次预测 1 个时间步长
【发布时间】：2018-03-09 15:33:27
【问题描述】：

编辑添加：我找到了我认为可行的解决方案：https://bleyddyn.github.io/posts/2017/10/keras-lstm/

我正在尝试使用 Conv/LSTM 网络来控制机器人。我想我已经设置好了所有东西，所以我可以开始用回放内存中的批量数据训练它，但我不知道如何实际使用它来控制机器人。简化的测试代码如下。

import numpy as np

from keras.models import Sequential
from keras.layers import Dense, Flatten, Input
from keras.layers import Convolution2D
from keras.layers.recurrent import LSTM
from keras.layers.wrappers import TimeDistributed
from keras.utils import to_categorical

def make_model(num_actions, timesteps, input_dim, l2_reg=0.005 ):
    input_shape=(timesteps,) + input_dim
    model = Sequential()
    model.add(TimeDistributed( Convolution2D(8, (3, 3), strides=(2,2), activation='relu' ), input_shape=input_shape) )
    model.add(TimeDistributed( Convolution2D(16, (3, 3), strides=(2,2), activation='relu', ) ))
    model.add(TimeDistributed( Convolution2D(32, (3, 3), strides=(2,2), activation='relu', ) ))
    model.add(TimeDistributed(Flatten()))
    model.add(LSTM(512, return_sequences=True, activation='relu', unroll=True))
    model.add(Dense(num_actions, activation='softmax', ))
    model.compile(loss='categorical_crossentropy', optimizer='adam' )
    return model

batch_size = 16
timesteps = 10
num_actions = 6
model = make_model( num_actions, timesteps, (84,84,3) )
model.summary()

# Fake training batch. Would be pulled from a replay memory
batch = np.random.uniform( low=0, high=255, size=(batch_size,timesteps,84,84,3) )
y = np.random.randint( 0, high=5, size=(160) )
y = to_categorical( y, num_classes=num_actions )
y = y.reshape( batch_size, timesteps, num_actions )
# stateful should be false here
pred = model.train_on_batch( batch, y )

# move trained network to robot

# This works, but it isn't practical to not get outputs (actions) until after 10 timesteps and I don't think the LSTM internal state would be correct if I tried a rolling queue of input images.
batch = np.random.uniform( low=0, high=255, size=(1,timesteps,84,84,3) )
pred = model.predict( batch, batch_size=1 )

# This is what I would need to do on my robot, with the LSTM keeping state between calls to predict
max_time = 10 # or 100000, or forever, etc.
for i in range(max_time) :
    image = np.random.uniform( low=0, high=255, size=(1,1,84,84,3) ) # pull one image from camera
    # stateful should be true here
    pred = model.predict( image, batch_size=1 )
    # take action based on pred

我在“model.predict(image...”行中得到的错误是：

ValueError: 检查时出错：预期 time_distributed_1_input 的形状为 (None, 10, 84, 84, 3) 但得到的数组的形状为 (1, 1, 84, 84, 3)

这是可以理解的，但我找不到解决办法。我不太了解 Keras，甚至不知道我是否正确使用了 TimeDistributed 层。

那么，这在 Keras 中是否可行？如果有，怎么做？

如果没有，在 TF 或 PyTorch 中是否可以？

感谢您的任何建议！

编辑添加运行代码，虽然它不一定正确。仍然需要在 OpenAI 健身房任务上进行测试。

import numpy as np

from keras.models import Sequential
from keras.layers import Dense, Flatten, Input
from keras.layers import Convolution2D
from keras.layers.recurrent import LSTM
from keras.layers.wrappers import TimeDistributed
from keras.utils import to_categorical

def make_model(num_actions, timesteps, input_dim, l2_reg=0.005 ):
    input_shape=(1,None) + input_dim
    model = Sequential()
    model.add(TimeDistributed( Convolution2D(8, (3, 3), strides=(2,2), activation='relu' ), batch_input_shape=input_shape) )
    model.add(TimeDistributed( Convolution2D(16, (3, 3), strides=(2,2), activation='relu', ) ))
    model.add(TimeDistributed( Convolution2D(32, (3, 3), strides=(2,2), activation='relu', ) ))
    model.add(TimeDistributed(Flatten()))
    model.add(LSTM(512, return_sequences=True, activation='relu', stateful=True))
    model.add(Dense(num_actions, activation='softmax', ))
    model.compile(loss='categorical_crossentropy', optimizer='adam' )
    return model

batch_size = 16
timesteps = 10
num_actions = 6
model = make_model( num_actions, 1, (84,84,3) )
model.summary()

# Fake training batch. Would be pulled from a replay memory
batch = np.random.uniform( low=0, high=255, size=(batch_size,timesteps,84,84,3) )
y = np.random.randint( 0, high=5, size=(160) )
y = to_categorical( y, num_classes=num_actions )
y = y.reshape( batch_size, timesteps, num_actions )

# Need to find a way to prevent the optimizer from updating every b, but accumulate updates over an entire batch (batch_size).
for b in range(batch_size):
    pred = model.train_on_batch( np.reshape(batch[b,:], (1,timesteps,84,84,3)), np.reshape(y[b,:], (1,timesteps,num_actions)) )
    #for t in range(timesteps):
    #    pred = model.train_on_batch( np.reshape(batch[b,t,:], (1,1,84,84,3)), np.reshape(y[b,t,:], (1,1,num_actions)) )
    model.reset_states() # Don't carry internal state between batches

# move trained network to robot

# This works, but it isn't practical to not get outputs (actions) until after 10 timesteps
#batch = np.random.uniform( low=0, high=255, size=(1,timesteps,84,84,3) )
#pred = model.predict( batch, batch_size=1 )

# This is what I would need to do on my robot, with the LSTM keeping state between calls to predict
max_time = 10 # or 100000, or forever, etc.
for i in range(max_time) :
    image = np.random.uniform( low=0, high=255, size=(1,1,84,84,3) ) # pull one image from camera
    # stateful should be true here
    pred = model.predict( image, batch_size=1 )
    # take action based on pred
    print( pred )

【问题讨论】：

您的机器人在处理图像吗？这就是 84 x 84 应该的样子吗？
是的，84x84x3（宽度、高度、颜色通道）。

标签： keras lstm

【解决方案1】：

您首先需要了解您的数据。

这 5 个维度有什么意义吗？

我会尝试猜测：

- 1 learning example
- 1 time step (this is added by TimeDistributed, normal 2D convolutions don't take this)
- 84 image side
- 84 another image side
- 3 channels (RGB)

TimeDistributed 的目的是添加额外的timesteps 维度，以便您可以在不应该使用序列的层中模拟序列。

您的错误消息告诉您：

您的 input_shape 参数是 (None, 10, 84, 84, 3)，其中 None 是批量大小（样本数/示例数）。
您的输入data，即代码中的batch 是(1, 1, 84, 84, 3)。

不匹配，您应该使用包含 10 个时间步长的批次（由您的 input_shape 定义）。 stateful=False 模型可以批量打包 10 张图像并使用它进行训练。

但稍后，在stateful=True 的情况下，您将需要input_shape 只是一个步骤。（您可以创建一个仅用于预测的新模型并将所有权重从训练模型复制到预测模型，或者您可以尝试在该时间步长维度中使用None，这意味着您可以使用不同数量的时间步长进行训练和预测)

现在，与卷积层不同的是，LSTM 层已经期待时间步长。所以你应该找到一种方法来压缩你的数据在更小的维度上。

LSTM 将期待(None, timeSteps, features)。时间步长与前面相同。 10 个用于训练，1 个用于预测，您可以尝试在那里使用None。

因此，您不应该在TimeDistributed 中使用Flatten()，而应该简单地重塑数据，压缩不是批量大小或步骤的维度：

model.add(Reshape((8,9*9*32))) #the batch size doesn't participate in this definition, and it will remain as it is.

9*9*32 是前面卷积及其 32 个过滤器的边。（我只是不确定边是 9，也许是 8，你可以在当前的 model.summary() 中看到）。

最后，对于stateful=True 的情况，您必须使用batch_shape 而不是input_shape 来定义模型。一个批次中的样本数量必须是一个固定的数字，因为模型会假设第二批次中的样本是属于前一批中的样本的新步骤。（所有批次的样品数量都需要相同）。

【讨论】：

感谢您的详细回复。我尝试将 None 用于 timestep 参数和 Reshape 而不是 Flatten 的组合，但没有运气。
实际上，现在我重新阅读了错误消息，我认为您在时间步长上使用 None 可能是正确的。 ValueError：如果时间维度未定义或等于 1，则无法展开 RNN。 - 如果使用顺序模型，请通过将 input_shape 或 batch_input_shape 参数传递给第一层来指定时间维度。如果您的第一层是嵌入，您还可以使用input_length 参数。 - 如果使用功能 API，请通过将 shape 或 batch_shape 参数传递给输入层来指定时间维度。至少我的比玩具更简单的代码可以运行。
啊...我没有看到unroll 参数...。您有明确的理由使用它吗？我从来没有用过……它说当你使用unroll=True时，你必须有固定的时间步长。（所以我想它不适用于您想要混合不同有状态案例的案例）。
我没有理由，我认为这是我第一次复制别人的代码时留下的:(我将您的答案标记为正确，因为它适用于我的示例代码。现在在DQN。谢谢！
说得太快了。当我重新打开有状态时，我收到一个关于必须指定批量大小的错误。 ValueError：如果一个 RNN 是有状态的，它需要知道它的批量大小。指定输入张量的批量大小： - 如果使用顺序模型，请通过将 batch_input_shape 参数传递给第一层来指定批量大小。 - 如果使用函数式 API，请通过将 batch_shape 参数传递给您的输入层来指定时间维度。