【发布时间】:2015-12-16 10:25:59
【问题描述】:
我有很多 PHP 实例(250 到 500 甚至更多),它们打开一个 DB-Connection 并执行 SELECT、UPDATE 和 INSERT。几秒钟后,我收到以下错误:
PHP Fatal error: Uncaught exception 'PDOException' with message 'SQLSTATE[HY000] [2003] Can't connect to MySQL server on 'xxx.xxx.xxx.xxx' (4)' in /var/www/xxx/vendor/propel/propel/src/Propel/Runtime/Connection/PdoConnection.php:49
我花了至少 30 个小时来寻找解决方案。在我看来,故障在于 MySQL 或 Debian 的错误配置。我找到了各种配置选项,但都不起作用。
其他信息
- 该错误仅发生在远程服务器上 - 当我直接在 DB-Server 上启动脚本时,我没有收到任何错误。
当前配置
sysctl.conf
net.ipv4.conf.default.rp_filter = 1
net.ipv4.conf.default.accept_source_route = 0
kernel.sysrq = 0
kernel.core_uses_pid = 1
# Disable netfilter on bridges.
#net.bridge.bridge-nf-call-ip6tables = 0
#net.bridge.bridge-nf-call-iptables = 0
#net.bridge.bridge-nf-call-arptables = 0
net.ipv4.ip_forward = 1
net.ipv4.neigh.default.gc_thresh1 = 4096
net.ipv4.neigh.default.gc_thresh2 = 8192
net.ipv4.neigh.default.gc_thresh3 = 16384
net.ipv4.neigh.default.gc_interval = 5
net.ipv4.neigh.default.gc_stale_time = 120
net.core.netdev_max_backlog = 262144
#net.core.rmem_default = 16777216
net.core.rmem_max = 108544
net.core.somaxconn = 32768
net.core.wmem_max = 108544
net.netfilter.nf_conntrack_max = 10000000
net.netfilter.nf_conntrack_tcp_timeout_established = 40
net.netfilter.nf_conntrack_tcp_timeout_close = 10
net.netfilter.nf_conntrack_tcp_timeout_close_wait = 10
net.netfilter.nf_conntrack_tcp_timeout_fin_wait = 10
net.netfilter.nf_conntrack_tcp_timeout_last_ack = 10
net.netfilter.nf_conntrack_tcp_timeout_syn_recv = 10
net.netfilter.nf_conntrack_tcp_timeout_syn_sent = 10
net.netfilter.nf_conntrack_tcp_timeout_time_wait = 10
net.ipv4.tcp_fin_timeout = 1
net.ipv4.tcp_max_orphans = 262144
net.ipv4.tcp_max_syn_backlog = 16384
net.ipv4.tcp_max_syn_backlog = 262144
net.ipv4.tcp_rmem = 4096 87380 16777216
net.ipv4.tcp_sack = 0
net.ipv4.tcp_syn_retries = 2
net.ipv4.tcp_synack_retries = 2
net.ipv4.tcp_syncookies = 0
net.ipv4.tcp_timestamps = 0
net.ipv4.tcp_tw_recycle = 1
net.ipv4.tcp_wmem = 4096 16384 16777216
etc/mysql/my.cf
[client]
port = 3306
socket = /var/run/mysqld/mysqld.sock
[mysqld_safe]
socket = /var/run/mysqld/mysqld.sock
nice = 0
[mysqld]
user = mysql
pid-file = /var/run/mysqld/mysqld.pid
socket = /var/run/mysqld/mysqld.sock
port = 3306
basedir = /usr
datadir = /var/lib/mysql
tmpdir = /tmp
lc-messages-dir = /usr/share/mysql
skip-external-locking
bind-address = {local server ip}
key_buffer = 16M
max_allowed_packet = 16M
thread_stack = 192K
thread_cache_size = 1100
myisam-recover = BACKUP
max_connections = 1000
#table_cache = 64
#thread_concurrency = 10
query_cache_limit = 2G
query_cache_size = 32M
#general_log_file = /var/log/mysql/mysql.log
#general_log = 1
log_error = /var/log/mysql/error.log
slow_query_log_file = /var/log/mysql/mysql-slow.log
slow_query_log = 1
long_query_time = 2
log_queries_not_using_indexes
#server-id = 1
#log_bin = /var/log/mysql/mysql-bin.log
expire_logs_days = 10
max_binlog_size = 100M
#binlog_do_db = include_database_name
#binlog_ignore_db = include_database_name
innodb_buffer_pool_size = 16G
innodb_buffer_pool_instances = 16
[mysqldump]
quick
quote-names
max_allowed_packet = 16M
[mysql]
#no-auto-rehash # faster start of mysql but no tab completition
[isamchk]
key_buffer = 16M
!includedir /etc/mysql/conf.d/
加载模块 - modules.conf
nf_conntrack
nf_conntrack_ipv4
脚本在启动时启动
echo "1" > /proc/sys/net/ipv4/tcp_tw_reuse
echo "1" > /proc/sys/net/ipv4/tcp_tw_recycle
echo "15" > /proc/sys/net/ipv4/tcp_fin_timeout
/etc/init.d/networking restart
我的系统
- Debian 8.2
- MySQL 5.5.44-0+deb8u1
- php5-mysql 5.6.13+dfsg-0+deb8u1
- 具有 64GB RAM、48 个 CPU(Intel Xeon E7540 at 2.0 GHz)的虚拟机
- 在 ESxi-Host v. 6.0.0 2494585 上运行
负载测试
开始 x PHP-instances / loadTester.php
<?php
require_once __DIR__ . "/autoloader.php";
$numberOfInstances = 100;
$index = 0;
$numberOfDataset = 500;
while($index < $numberOfInstances) {
@system( "php " . __DIR__ . "/loadTest.php $index $numberOfDataset > /dev/null &" );
$index++;
}
一个实例 SELECT / loadTest.php
<?php
require_once __DIR__ . "/autoloader.php";
$skip = $argv[ 1 ];
$numberOfDataset = $argv[ 2 ];
$loops = 100;
$index = 0;
$skip = $skip * $numberOfDataset;
while ( $index < $loops ) {
$adresses = AddressQuery::create()->offset( $skip )->limit( $numberOfDataset )->find();
}
网络图
【问题讨论】:
-
你能从运行的 DB "SHOW PROCESSLIST" 中查看进程列表,看看你的 mutch 连接在这一刻真的被使用了吗
-
@BerndBuffen:在启动“loadTester.php”和 100 个实例后,我有 57 个睡眠和 63 个查询(写入网络)连接。 34 PHP-Process stoped 已经导致异常。
-
我几乎可以肯定 my.cnf 中没有错误(仅用于优化)。 db 的唯一测试是 smal bash 脚本。对于 {1..1000} 中的 i;回声 $i; mysql -uroot -pxxx -e "选择睡眠(300)"&;完成——这是否正在运行,所以数据库正常并且错误必须在 VM 中
-
我直接在服务器上启动 980-loop-script - 没有错误。我再次启动它 - 1000 个打开的连接,其余的我收到错误“错误 1040(HY000):连接太多” - 这是正确的,在 MySQL-config 中将 max_connection 设置为 1000。但是我在远程服务器上启动脚本 - 并且“无法连接到 'xxx.xxx.xxx.xxx' 上的 MySQL 服务器” - 868 连接后发生错误。杀死进程并重新启动后,在 423 个连接后发生错误...可能是 TCP 连接设置?
-
这也是我的看法,但我帮不了你,对不起-祝你好运
标签: php mysql tcp debian propel2