【问题标题】:Docker process not running, and any interaction with docker failsDocker 进程未运行,与 docker 的任何交互都失败
【发布时间】:2021-02-10 16:23:23
【问题描述】:

首先,我在 virtualbox 上的 Ubuntu 20.04 VM 中运行 docker。

我创建了一个简单的 shell 脚本来杀死在端口 9042 上运行的任何进程,然后启动我的 docker-compose 文件。这是有问题的脚本:

#!/bin/bash

# Check for and kill any processes running on port 9042
sudo kill -9 $(sudo lsof -t -i:9042)

# start docker-compose
docker-compose -f ./docker/docker-compose.yml up

然而,自从运行它以来,它使我的 docker 安装对任何类型的交互完全没有响应。任何 docker 命令都将无限期挂起,直到使用 Ctrl+C 取消,任何其他使用 docker 的系统命令(例如 sudo service docker start)也将无限期挂起。

如果我尝试运行 dockerd,它会失败并显示消息 failed to start daemon: pid file found, ensure docker is not running or delete /var/run/docker.pid。当我的系统报告说 docker 没有运行时,我继续删除var/run/docker.pid。如果我再次尝试运行 dockerd,我会收到不同的错误消息:failed to start daemon: error while opening volume store metadata database: timeout

在这个阶段,一些 docker 命令重新开始工作。 docker versiondocker help 都可以工作,但是仍然报告说 docker daemon 没有运行。尝试在 docker-compose 文件上运行 docker-compose up 会产生以下输出:

Traceback (most recent call last):
  File "/usr/lib/python3/dist-packages/urllib3/connectionpool.py", line 665, in urlopen
    httplib_response = self._make_request(
  File "/usr/lib/python3/dist-packages/urllib3/connectionpool.py", line 387, in _make_request
    conn.request(method, url, **httplib_request_kw)
  File "/usr/lib/python3.8/http/client.py", line 1255, in request
    self._send_request(method, url, body, headers, encode_chunked)
  File "/usr/lib/python3.8/http/client.py", line 1301, in _send_request
    self.endheaders(body, encode_chunked=encode_chunked)
  File "/usr/lib/python3.8/http/client.py", line 1250, in endheaders
    self._send_output(message_body, encode_chunked=encode_chunked)
  File "/usr/lib/python3.8/http/client.py", line 1010, in _send_output
    self.send(msg)
  File "/usr/lib/python3.8/http/client.py", line 950, in send
    self.connect()
  File "/home/david/.local/lib/python3.8/site-packages/docker/transport/unixconn.py", line 43, in connect
    sock.connect(self.unix_socket)
ConnectionRefusedError: [Errno 111] Connection refused

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/lib/python3/dist-packages/requests/adapters.py", line 439, in send
    resp = conn.urlopen(
  File "/usr/lib/python3/dist-packages/urllib3/connectionpool.py", line 719, in urlopen
    retries = retries.increment(
  File "/usr/lib/python3/dist-packages/urllib3/util/retry.py", line 400, in increment
    raise six.reraise(type(error), error, _stacktrace)
  File "/usr/lib/python3/dist-packages/six.py", line 702, in reraise
    raise value.with_traceback(tb)
  File "/usr/lib/python3/dist-packages/urllib3/connectionpool.py", line 665, in urlopen
    httplib_response = self._make_request(
  File "/usr/lib/python3/dist-packages/urllib3/connectionpool.py", line 387, in _make_request
    conn.request(method, url, **httplib_request_kw)
  File "/usr/lib/python3.8/http/client.py", line 1255, in request
    self._send_request(method, url, body, headers, encode_chunked)
  File "/usr/lib/python3.8/http/client.py", line 1301, in _send_request
    self.endheaders(body, encode_chunked=encode_chunked)
  File "/usr/lib/python3.8/http/client.py", line 1250, in endheaders
    self._send_output(message_body, encode_chunked=encode_chunked)
  File "/usr/lib/python3.8/http/client.py", line 1010, in _send_output
    self.send(msg)
  File "/usr/lib/python3.8/http/client.py", line 950, in send
    self.connect()
  File "/home/david/.local/lib/python3.8/site-packages/docker/transport/unixconn.py", line 43, in connect
    sock.connect(self.unix_socket)
urllib3.exceptions.ProtocolError: ('Connection aborted.', ConnectionRefusedError(111, 'Connection refused'))

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/home/david/.local/lib/python3.8/site-packages/docker/api/client.py", line 205, in _retrieve_server_version
    return self.version(api_version=False)["ApiVersion"]
  File "/home/david/.local/lib/python3.8/site-packages/docker/api/daemon.py", line 181, in version
    return self._result(self._get(url), json=True)
  File "/home/david/.local/lib/python3.8/site-packages/docker/utils/decorators.py", line 46, in inner
    return f(self, *args, **kwargs)
  File "/home/david/.local/lib/python3.8/site-packages/docker/api/client.py", line 228, in _get
    return self.get(url, **self._set_request_timeout(kwargs))
  File "/usr/lib/python3/dist-packages/requests/sessions.py", line 546, in get
    return self.request('GET', url, **kwargs)
  File "/usr/lib/python3/dist-packages/requests/sessions.py", line 533, in request
    resp = self.send(prep, **send_kwargs)
  File "/usr/lib/python3/dist-packages/requests/sessions.py", line 646, in send
    r = adapter.send(request, **kwargs)
  File "/usr/lib/python3/dist-packages/requests/adapters.py", line 498, in send
    raise ConnectionError(err, request=request)
requests.exceptions.ConnectionError: ('Connection aborted.', ConnectionRefusedError(111, 'Connection refused'))

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/home/david/.local/bin/docker-compose", line 8, in <module>
    sys.exit(main())
  File "/home/david/.local/lib/python3.8/site-packages/compose/cli/main.py", line 67, in main
    command()
  File "/home/david/.local/lib/python3.8/site-packages/compose/cli/main.py", line 123, in perform_command
    project = project_from_options('.', options)
  File "/home/david/.local/lib/python3.8/site-packages/compose/cli/command.py", line 60, in project_from_options
    return get_project(
  File "/home/david/.local/lib/python3.8/site-packages/compose/cli/command.py", line 131, in get_project
    client = get_client(
  File "/home/david/.local/lib/python3.8/site-packages/compose/cli/docker_client.py", line 41, in get_client
    client = docker_client(
  File "/home/david/.local/lib/python3.8/site-packages/compose/cli/docker_client.py", line 170, in docker_client
    client = APIClient(**kwargs)
  File "/home/david/.local/lib/python3.8/site-packages/docker/api/client.py", line 188, in __init__
    self._version = self._retrieve_server_version()
  File "/home/david/.local/lib/python3.8/site-packages/docker/api/client.py", line 212, in _retrieve_server_version
    raise DockerException(
docker.errors.DockerException: Error while fetching server API version: ('Connection aborted.', ConnectionRefusedError(111, 'Connection refused'))

sudo service docker start 等其他系统命令仍然无限期挂起,直到被杀死。

我已经尝试了这个线程 (Cannot connect to the Docker daemon at unix:/var/run/docker.sock. Is the docker daemon running?) 和这个线程 (Docker commands do not respond anymore) 中的每一个解决方案,但它们都不起作用。

有谁知道这可能是什么问题?

编辑:还有几点 -

  • docker.pid 文件在我重新启动 VM 时再次出现
  • 重新启动我的虚拟机并不能解决问题
  • 以 root 用户身份执行命令同样不执行任何操作
  • 尝试使用 sudo apt-get install --reinstall docker-ce 重新安装 docker 也挂在阶段 Preparing to unpack .../docker-ce_5%3a20.10.0~1.1.beta1-0~ubuntu-focal_amd64.deb ...

【问题讨论】:

  • 9042 端口上正在运行什么?
  • @Stefano 来自我本地 cassandra 安装的 Java 实例。因为我的容器在同一个端口上运行其他东西,所以我需要释放它。
  • 我想 Cassandra 是作为服务运行的,为什么不简单地停止它而不是残忍地杀死它呢?此外,这种方法第一次可能有效,但第二次会杀死用于映射端口的docker-proxy 进程。我真的不能说接下来会发生什么
  • 这是一个公平的观点 - 我已经认识到这是一个问题,并从我的项目中删除了这个脚本。我不打算再使用这个脚本了。

标签: docker ubuntu docker-compose virtual-machine ubuntu-20.04


【解决方案1】:

我知道这已经很晚了,但我从我提出的另一个类似问题中找到了答案。

Docker 容器存储在 Linux 上的默认位置 /var/lib/docker/。我能够识别导致问题的容器并删除了实际的容器文件。然后我使用 CLI 删除了容器的所有其他痕迹,docker 能够开始正常运行。

显然这样做是有风险的,因此请确保先采取足够的措施来备份您的计算机。

【讨论】:

    猜你喜欢
    • 1970-01-01
    • 2019-01-04
    • 1970-01-01
    • 1970-01-01
    • 2014-06-07
    • 1970-01-01
    • 2019-04-22
    • 2017-07-11
    • 2016-06-13
    相关资源
    最近更新 更多