【发布时间】:2020-07-09 11:18:08
【问题描述】:
这个问题与this question有关,它提供了一个在Tensorflow 1.15 中有效,但在TF2 中不再有效的解决方案
我正在从该问题中提取部分代码并稍微调整它(删除了冻结模型的多个输入,并随之消除了对 nest 的需求)。
注意:我将代码分隔成块,但它们应该作为文件运行(即,我不会在每个块中重复不必要的导入)
首先,我们生成一个冻结图用作虚拟测试网络:
import numpy as np
import tensorflow.compat.v1 as tf
def dump_model():
with tf.Graph().as_default() as gf:
x = tf.placeholder(tf.float32, shape=(None, 123), name='x')
c = tf.constant(100, dtype=tf.float32, name='C')
y = tf.multiply(x, c, name='y')
z = tf.add(y, x, name='z')
with tf.gfile.GFile("tmp_net.pb", "wb") as f:
raw = gf.as_graph_def().SerializeToString()
print(type(raw), len(raw))
f.write(raw)
dump_model()
然后,我们加载冻结的模型并将其包装在 Keras 模型中:
persisted_sess = tf.Session()
with tf.Session().as_default() as session:
with tf.gfile.FastGFile("./tmp_net.pb",'rb') as f:
graph_def = tf.GraphDef()
graph_def.ParseFromString(f.read())
persisted_sess.graph.as_default()
tf.import_graph_def(graph_def, name='')
print(persisted_sess.graph.get_name_scope())
for i, op in enumerate(persisted_sess.graph.get_operations()):
tensor = persisted_sess.graph.get_tensor_by_name(op.name + ':0')
print(i, '\t', op.name, op.type, tensor)
x_tensor = persisted_sess.graph.get_tensor_by_name('x:0')
y_tensor = persisted_sess.graph.get_tensor_by_name('y:0')
z_tensor = persisted_sess.graph.get_tensor_by_name('z:0')
from tensorflow.compat.v1.keras.layers import Lambda, InputLayer
from tensorflow.compat.v1.keras import Model
from tensorflow.python.keras.utils import layer_utils
input_x = InputLayer(name='x', input_tensor=x_tensor)
input_x.is_placeholder = True
output_y = Lambda(lambda x: y_tensor, name='output_y')(input_x.output)
output_z = Lambda(lambda x_b: z_tensor, name='output_z')(input_x.output)
base_model_inputs = layer_utils.get_source_inputs(input_x.output)
base_model = Model(base_model_inputs, [output_y, output_z])
最后,我们在一些随机数据上运行模型,并验证它运行时没有错误:
y_out, z_out = base_model.predict(np.ones((3, 123), dtype=np.float32))
y_out.shape, z_out.shape
在 TensorFlow 1.15.3 中,上面的输出是((3, 123), (3, 123)),但是,如果我在 TensorFlow 2.1.0 中运行相同的代码,前两个块运行没有问题,但第三个块失败:
TypeError: An op outside of the function building code is being passed
a "Graph" tensor. It is possible to have Graph tensors
leak out of the function building context by including a
tf.init_scope in your function building code.
For example, the following function will fail:
@tf.function
def has_init_scope():
my_constant = tf.constant(1.)
with tf.init_scope():
added = my_constant * 2
The graph tensor has name: y:0
该错误似乎与Tensorflow的自动“编译”和功能优化有关,但我不知道如何解释,错误的根源是什么,或者如何解决。
在 Tensorflow 2 中包装冻结模型的正确方法是什么?
【问题讨论】:
标签: python tensorflow keras tensorflow2.0