【发布时间】:2021-06-04 15:53:37
【问题描述】:
我有一个在 sagemaker 中训练的模型(自定义训练作业),并由我的训练脚本使用 keras model.save() 方法保存,该方法生成一个带有权重和索引的 variables 目录和一个 .pb 文件。该模型是来自 huggingface 的 transformer 库的 TFBertForSequenceClassification,根据他们的文档,该模型是 keras 模型的子类。但是,当我尝试使用 keras.models.load_model() 加载模型时,出现以下错误:
Traceback (most recent call last):
File "<input>", line 1, in <module>
File "/home/tyarosevich/anaconda3/envs/fresh_env/lib/python3.7/site-packages/tensorflow/python/keras/saving/save.py", line 187, in load_model
return saved_model_load.load(filepath, compile, options)
File "/home/tyarosevich/anaconda3/envs/fresh_env/lib/python3.7/site-packages/tensorflow/python/keras/saving/saved_model/load.py", line 121, in load
path, options=options, loader_cls=KerasObjectLoader)
File "/home/tyarosevich/anaconda3/envs/fresh_env/lib/python3.7/site-packages/tensorflow/python/saved_model/load.py", line 633, in load_internal
ckpt_options)
File "/home/tyarosevich/anaconda3/envs/fresh_env/lib/python3.7/site-packages/tensorflow/python/keras/saving/saved_model/load.py", line 194, in __init__
super(KerasObjectLoader, self).__init__(*args, **kwargs)
File "/home/tyarosevich/anaconda3/envs/fresh_env/lib/python3.7/site-packages/tensorflow/python/saved_model/load.py", line 130, in __init__
self._load_all()
File "/home/tyarosevich/anaconda3/envs/fresh_env/lib/python3.7/site-packages/tensorflow/python/keras/saving/saved_model/load.py", line 215, in _load_all
self._layer_nodes = self._load_layers()
File "/home/tyarosevich/anaconda3/envs/fresh_env/lib/python3.7/site-packages/tensorflow/python/keras/saving/saved_model/load.py", line 315, in _load_layers
layers[node_id] = self._load_layer(proto.user_object, node_id)
File "/home/tyarosevich/anaconda3/envs/fresh_env/lib/python3.7/site-packages/tensorflow/python/keras/saving/saved_model/load.py", line 341, in _load_layer
obj, setter = self._revive_from_config(proto.identifier, metadata, node_id)
File "/home/tyarosevich/anaconda3/envs/fresh_env/lib/python3.7/site-packages/tensorflow/python/keras/saving/saved_model/load.py", line 368, in _revive_from_config
obj, self._proto.nodes[node_id], node_id)
File "/home/tyarosevich/anaconda3/envs/fresh_env/lib/python3.7/site-packages/tensorflow/python/keras/saving/saved_model/load.py", line 298, in _add_children_recreated_from_config
obj_child, child_proto, child_id)
File "/home/tyarosevich/anaconda3/envs/fresh_env/lib/python3.7/site-packages/tensorflow/python/keras/saving/saved_model/load.py", line 298, in _add_children_recreated_from_config
obj_child, child_proto, child_id)
File "/home/tyarosevich/anaconda3/envs/fresh_env/lib/python3.7/site-packages/tensorflow/python/keras/saving/saved_model/load.py", line 250, in _add_children_recreated_from_config
metadata = json_utils.decode(proto.user_object.metadata)
File "/home/tyarosevich/anaconda3/envs/fresh_env/lib/python3.7/site-packages/tensorflow/python/keras/saving/saved_model/json_utils.py", line 60, in decode
return json.loads(json_string, object_hook=_decode_helper)
File "/home/tyarosevich/anaconda3/envs/fresh_env/lib/python3.7/json/__init__.py", line 361, in loads
return cls(**kw).decode(s)
File "/home/tyarosevich/anaconda3/envs/fresh_env/lib/python3.7/json/decoder.py", line 337, in decode
obj, end = self.raw_decode(s, idx=_w(s, 0).end())
File "/home/tyarosevich/anaconda3/envs/fresh_env/lib/python3.7/json/decoder.py", line 355, in raw_decode
raise JSONDecodeError("Expecting value", s, err.value) from None
json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0)
我被难住了。转换器库自己的 save_pretrained() 方法将层信息保存在 .json 文件中,但我不明白为什么 keras 模型保存会知道/关心这一点(而且我认为这不是问题所在)。有什么帮助吗?
【问题讨论】:
-
不知道你是否解决了这个问题,但我陷入了同样的错误。我发现将其保存为 .h5 文件扩展名解决了这个问题。更多信息在这里 - tensorflow.org/guide/keras/save_and_serialize
-
使用
model.save()或tf.keras.models.save_model()保存Keras 模型并使用tf.keras.models.load_model()加载模型。谢谢!
标签: python tensorflow keras huggingface-transformers