【Posted】: 2018-11-22 10:47:59
【Question】:
I am trying to run a training job on Google Cloud ml-engine by following this tutorial. When I execute this command:
$ gcloud ml-engine jobs submit training `whoami`_object_detection_`date +%s` \
--job-dir=gs://${YOUR_GCS_BUCKET}/train \
--packages dist/object_detection-0.1.tar.gz,slim/dist/slim-0.1.tar.gz,/tmp/pycocotools/pycocotools-2.0.tar.gz \
--module-name object_detection.model_tpu_main \
--runtime-version 1.10 \
--scale-tier BASIC_TPU \
--region us-central1 \
-- \
--model_dir=gs://${YOUR_GCS_BUCKET}/train \
--tpu_zone us-central1 \
--pipeline_config_path=gs://${YOUR_GCS_BUCKET}/data/pipeline.config
the command fails with this response:
ERROR: (gcloud.ml-engine.jobs.submit.training) INVALID_ARGUMENT: Field: runtime_version Error: The specified runtime version '1.10' with the Python version '' is not supported for TPU training. Please specify a different runtime version. See https://cloud.google.com/ml/docs/concepts/runtime-version-list for a list of supported versions
- '@type': type.googleapis.com/google.rpc.BadRequest
fieldViolations:
- description: The specified runtime version '1.10' with the Python version '' is
not supported for TPU training. Please specify a different runtime version.
See https://cloud.google.com/ml/docs/concepts/runtime-version-list for a list
of supported versions
field: runtime_version
Because no Python version was detected (the error shows an empty Python version), I added the --config=config.yaml argument to the command line:
config.yaml:
trainingInput:
  pythonVersion: "3.5"
But the error is almost unchanged:
...
- description: The specified runtime version '1.10' with the Python version '3.5'
...
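The runtime version list states that runtime version 1.10 is compatible with Python 3.5. I also tried other runtime/Python version combinations that should work, but my command keeps failing.
【Discussion】: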
- It looks like you are trying to use Cloud TPU. If you look at Support for Cloud TPU (Beta), it says that version 1.9 is supported.
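Following that suggestion, a minimal sketch of the adjusted command could look like the one below. It only changes --runtime-version to 1.9 and prints the resulting command as a dry run instead of submitting anything. The bucket default `my-bucket`, the `RUNTIME_VERSION` variable, and the helper name `build_submit_cmd` are placeholders introduced for illustration. Whether 1.9 is still accepted for TPU training is an assumption based on the comment above, so check the current runtime version list first.

```shell
#!/bin/sh
# Dry-run sketch: prints the gcloud command rather than executing it.
# Assumption: runtime version 1.9 is accepted for TPU training, as the
# comment above suggests; this may have changed since then.
YOUR_GCS_BUCKET="${YOUR_GCS_BUCKET:-my-bucket}"   # placeholder bucket name
RUNTIME_VERSION="${RUNTIME_VERSION:-1.9}"         # hypothetical override variable

# Hypothetical helper: prints the full command on a single line.
build_submit_cmd() {
  job_name="$(whoami)_object_detection_$(date +%s)"
  printf '%s ' \
    gcloud ml-engine jobs submit training "$job_name" \
    --job-dir="gs://${YOUR_GCS_BUCKET}/train" \
    --packages dist/object_detection-0.1.tar.gz,slim/dist/slim-0.1.tar.gz,/tmp/pycocotools/pycocotools-2.0.tar.gz \
    --module-name object_detection.model_tpu_main \
    --runtime-version "$RUNTIME_VERSION" \
    --scale-tier BASIC_TPU \
    --region us-central1 \
    -- \
    --model_dir="gs://${YOUR_GCS_BUCKET}/train" \
    --tpu_zone us-central1 \
    --pipeline_config_path="gs://${YOUR_GCS_BUCKET}/data/pipeline.config"
  printf '\n'
}

build_submit_cmd
```

Once the printed command looks right, it can be copied into the terminal and run from the same directory as in the tutorial, so that the relative --packages paths resolve.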
Tags: python bash tensorflow google-cloud-ml