Load Average - 爱码网

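Definition of Load Average

The load average is the average number of processes that are in the runnable or uninterruptible state per unit of time. It can also be thought of as the average number of active processes per unit of time.

Runnable state: processes that are using the CPU or waiting to use it. In `top`, these are the processes whose state column (S) shows R.

Uninterruptible state: processes that are in the middle of a critical kernel-mode code path, most commonly waiting for an I/O response from a hardware device (shown as state D in `top` and `ps`). The uninterruptible state is a mechanism the system uses to protect both processes and hardware devices.

For example, when a process reads from or writes to a disk, it must not be interrupted by other processes or by signals until the disk responds, so that the data stays consistent. During that time the process is in the uninterruptible state. If it were interrupted at that point, the data on disk and the data held by the process could easily become inconsistent.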

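Stack Overflow's explanation of an uninterruptible process:

An uninterruptible process is a process which happens to be in a system call (kernel function) that can't be interrupted by a signal.

In other words, a process in the uninterruptible state is blocked after making a system call and cannot be interrupted (killed) until that system call completes. These system calls normally complete almost instantly, so such processes are usually not visible with ps. If ps does show them, I/O may be in trouble, because a process stays in the uninterruptible state precisely when it is not getting a response to the I/O it is waiting on (disk I/O, network I/O, or I/O on another peripheral). To get the process out of the uninterruptible state, the I/O it is waiting on has to recover.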
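
The two states described above can be counted from a process listing. The following is a minimal sketch, assuming text in the shape printed by `ps -eo stat,pid,comm`; the sample listing, its PIDs, and its command names are made up for illustration, and `count_active_states` is a hypothetical helper name.

```python
from collections import Counter

# Hypothetical sample in the shape printed by `ps -eo stat,pid,comm`.
# The PIDs and command names are invented for illustration only.
SAMPLE_PS_OUTPUT = """\
STAT   PID COMMAND
Ss       1 systemd
R+    4021 stress
D     4022 dd
S     4100 sshd
R     4177 python3
"""


def count_active_states(ps_output):
    """Count processes whose main state is R (runnable) or D (uninterruptible)."""
    counts = Counter()
    for line in ps_output.splitlines()[1:]:  # skip the header row
        fields = line.split()
        if not fields:
            continue
        state = fields[0][0]  # the first character is the main state code
        if state in ("R", "D"):
            counts[state] += 1
    return counts


counts = count_active_states(SAMPLE_PS_OUTPUT)
print("runnable:", counts["R"], "uninterruptible:", counts["D"])
```

A snapshot like this is only an instantaneous count; the load average smooths such counts over time.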

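Viewing the Load Average

Use the `uptime` command.

`man uptime` describes what uptime does:

Gives a one line display of the following information. The current time, how long the system has been running, how many users are currently logged on, and the system load averages for the past 1, 5, and 15 minutes.

That is, it shows the current time, how long the system has been running, the number of users currently logged on, and the system load averages over the past 1, 5, and 15 minutes.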

uptime

(Figure: output of the uptime command, showing the load averages)

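As the output shows, the load averages over the past 1, 5, and 15 minutes are 1.05, 1.27, and 1.31. Read together, these three values give a fuller, more rounded picture of the system's current load.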
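
The same three numbers are also exposed by the kernel in `/proc/loadavg`, whose first three fields are the 1-, 5-, and 15-minute load averages. Below is a minimal sketch of parsing that layout; `parse_loadavg` is a hypothetical helper, and the sample string reuses the three values quoted in this article with placeholder values for the remaining fields.

```python
def parse_loadavg(text):
    """Return the 1-, 5-, and 15-minute load averages as a tuple of floats."""
    fields = text.split()
    return tuple(float(value) for value in fields[:3])


# The first three numbers are the values quoted in this article. The last
# two fields are made-up placeholders in the /proc/loadavg layout
# (runnable/total scheduling entities and the most recent PID).
sample = "1.05 1.27 1.31 2/345 6789"

load1, load5, load15 = parse_loadavg(sample)
print(load1, load5, load15)
```

On a live Linux system the input could come from reading `/proc/loadavg` directly, or the parsing could be skipped altogether by calling `os.getloadavg()`.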

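Comparing the 1-, 5-, and 15-minute load averages:

If the three values are roughly the same, the system load is stable;
If the 1-minute value is much higher than the 15-minute value, the load has been rising over the last minute. This may be temporary and calls for continued observation. Once the 1-minute load average gets too high, the system is overloaded and needs to be analyzed and optimized;
If the 1-minute value is much lower than the 15-minute value, the system load is falling.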
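
The comparison above can be sketched as a small function. The article does not say how large a gap counts as "much higher" or "much lower", so the 30% tolerance below is purely an assumption for illustration, as is the helper name `classify_trend`.

```python
def classify_trend(load1, load15, tolerance=0.3):
    """Compare the 1-minute and 15-minute load averages.

    `tolerance` is an assumed relative band (30% by default), not a value
    taken from the article or from any tool.
    """
    if load15 == 0:
        return "stable" if load1 == 0 else "rising"
    ratio = load1 / load15
    if ratio > 1 + tolerance:
        return "rising"
    if ratio < 1 - tolerance:
        return "falling"
    return "stable"


# Values quoted in this article: 1-minute 1.05, 15-minute 1.31.
print(classify_trend(1.05, 1.31))
```

With the article's values the ratio is about 0.8, which falls inside the assumed band, matching the reading that the load is stable overall.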

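From the uptime output shown earlier, the system's load is stable overall. So what load average keeps the system running efficiently?

What Load Average Is Reasonable

In the ideal case, the load average equals the number of logical CPU cores in the system. So we first need to know how many logical cores the system has.

CPU information can be obtained by reading the /proc/cpuinfo file. Linux follows the "everything is a file" philosophy, so a great deal of system information and process state can be obtained by reading the relevant files.

The /proc directory on Linux is a file system in its own right, namely the proc file system. Unlike other common file systems, proc is a pseudo (virtual) file system: all of its files live in memory and take up no disk space. They reflect the current running state of the kernel, and users can read them to inspect the system hardware and the processes currently running. It can be seen as an interface, in file-system form, that Linux provides for communication between processes and the kernel.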

Viewing CPU Information

cat /proc/cpuinfo
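
To relate the load average to the number of logical cores, the `processor` entries in `/proc/cpuinfo` can be counted and the load divided by that count. The sketch below works on a shortened, made-up excerpt of the file; `count_logical_cpus` and `load_per_cpu` are hypothetical helper names, and the two-core machine is an assumption for illustration.

```python
# Shortened, made-up excerpt in the layout of /proc/cpuinfo; a real file
# has many more fields per processor.
SAMPLE_CPUINFO = """\
processor : 0
model name : Example CPU

processor : 1
model name : Example CPU
"""


def count_logical_cpus(cpuinfo_text):
    """Count logical CPUs: one 'processor' line per logical core."""
    return sum(
        1 for line in cpuinfo_text.splitlines() if line.startswith("processor")
    )


def load_per_cpu(load, n_cpus):
    """Load average divided by the number of logical CPUs."""
    return load / n_cpus


n_cpus = count_logical_cpus(SAMPLE_CPUINFO)
# 1.05 is the 1-minute load average quoted in this article.
print(n_cpus, round(load_per_cpu(1.05, n_cpus), 3))
```

A result close to 1.0 per core corresponds to the ideal case described above; on a live system `os.cpu_count()` gives the core count without parsing the file.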

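Load Average and CPU Utilization

Load average and CPU utilization do not correspond one-to-one. The load average is the number of processes in the runnable or uninterruptible state per unit of time, whereas CPU utilization measures how busy the CPU is per unit of time. Each situation therefore has to be analyzed on its own terms:

With many CPU-intensive processes: the load average rises, and because those processes consume a lot of CPU, CPU utilization rises too;
With many I/O-intensive processes: the load average rises, but most of the processes are waiting for I/O responses while the CPU sits idle, so CPU utilization may not rise;
With many processes waiting for the CPU: the load average rises, and because scheduling those processes also consumes CPU, CPU utilization rises too.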
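
The three cases above can be turned into a rough rule of thumb. The sketch below is only an illustration, assuming the load average is already known to be elevated: the thresholds (80% busy, 30% iowait) and the helper name `likely_cause` are assumptions, not values from the article or from any tool.

```python
def likely_cause(load1, n_cpus, busy_pct, iowait_pct,
                 busy_threshold=80.0, iowait_threshold=30.0):
    """Guess why the load average is elevated.

    busy_pct is the non-idle CPU percentage and iowait_pct the iowait
    percentage; both thresholds are assumed values for illustration.
    """
    if busy_pct >= busy_threshold:
        # CPUs are busy: either the work fits on the cores, or more
        # processes want the CPU than there are cores to run them.
        if load1 > n_cpus:
            return "cpu contention"
        return "cpu-bound"
    if iowait_pct >= iowait_threshold:
        # CPUs are mostly idle while processes wait for I/O.
        return "io-bound"
    return "inconclusive"


print(likely_cause(8.0, 2, 95.0, 0.0))
print(likely_cause(1.0, 2, 10.0, 60.0))
```

In practice a guess like this would only be a starting point for looking at per-process figures.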

Load Average Case Studies

Before the case studies, here are three commonly used tools.

Common Tools

stress

Impose certain types of compute stress on your system

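stress is a system stress-testing tool. Here it plays the role of a misbehaving process, simulating scenarios in which the load average rises.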

Option	Description
--timeout N	timeout after N seconds
--cpu N	spawn N workers spinning on sqrt()
--io N	spawn N workers spinning on sync()
--vm N	spawn N workers spinning on malloc()/free()
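
As an illustration of how these options combine, the sketch below assembles a stress command line from the four options in the table. `build_stress_command` is a hypothetical helper, and the worker count and timeout in the usage line are arbitrary example values; the code only builds the argument list and does not run stress.

```python
def build_stress_command(cpu=0, io=0, vm=0, timeout=None):
    """Build an argument list for stress from the options listed above."""
    argv = ["stress"]
    for flag, workers in (("--cpu", cpu), ("--io", io), ("--vm", vm)):
        if workers > 0:
            argv += [flag, str(workers)]
    if timeout is not None:
        argv += ["--timeout", str(timeout)]
    return argv


# Example values only: one CPU worker for 600 seconds.
argv = build_stress_command(cpu=1, timeout=600)
print(" ".join(argv))
```

If stress is installed, the resulting list could be passed to `subprocess.run(argv)` to start the load.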

mpstat

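mpstat is a commonly used multi-core CPU performance analysis tool. It shows, in real time, the performance metrics of each CPU as well as the average across all CPUs.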

Report processors related statistics.

Option	Description
-I	Report interrupts statistics
-P ALL	Report all processors statistics
-u	Report CPU utilization
N	Display reports at N-second intervals
M	Display M reports at N-second intervals; if M is 0 or omitted, report continuously

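Field descriptions

%usr: percentage of CPU time spent in user space
%nice: percentage of CPU time spent by user-space processes running with a nice priority
%sys: percentage of CPU time spent in the kernel
%iowait: percentage of time the CPU was idle while the system had outstanding I/O
%irq: percentage of CPU time spent handling hardware interrupts
%soft: percentage of CPU time spent handling software interrupts
%idle: percentage of time the CPU was idle with no outstanding I/O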
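
Percentages like these can be derived from the cumulative counters on the `cpu` line of `/proc/stat` by taking two samples and dividing each counter's increase by the total increase. The sketch below is a simplified illustration rather than how mpstat itself is implemented: it uses only the first seven counters (ignoring steal and guest time), and both sample lines are made-up values.

```python
# Names for the first seven counters of a /proc/stat "cpu" line, in order.
FIELDS = ("usr", "nice", "sys", "idle", "iowait", "irq", "soft")


def parse_cpu_line(line):
    """Map the first seven counters of a /proc/stat 'cpu' line to names."""
    values = [int(value) for value in line.split()[1:8]]
    return dict(zip(FIELDS, values))


def cpu_percentages(before, after):
    """Share of each counter's increase in the total increase, in percent."""
    deltas = {name: after[name] - before[name] for name in FIELDS}
    total = sum(deltas.values())
    if total == 0:
        return {name: 0.0 for name in FIELDS}
    return {name: 100.0 * delta / total for name, delta in deltas.items()}


# Made-up samples taken some interval apart (steal/guest fields ignored).
before = parse_cpu_line("cpu  1000 0 500 8000 400 50 50 0 0 0")
after = parse_cpu_line("cpu  1600 0 600 8100 600 50 50 0 0 0")

pct = cpu_percentages(before, after)
print(pct["usr"], pct["sys"], pct["iowait"], pct["idle"])
```

Because the counters only ever increase, a single reading says little; it is the difference between two readings that reflects how busy the CPU was during the interval.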

pidstat

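pidstat is a commonly used process performance analysis tool. It shows, in real time, per-process metrics such as CPU, memory, I/O, and context switches.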

Report statistics for Linux tasks.

Option	Description
-d	Report I/O statistics
-l	Display the process command name and all its arguments
-p pid	Select the process for which statistics are to be reported
-r	Report page faults and memory utilization
-s	Report stack utilization
-t	Display statistics for threads associated with selected tasks
-v	Display number of threads and file descriptors associated with current task
-w	Report task switching activity

Field descriptions

KB_rd/s: Number of kilobytes the task has caused to be read from disk per second.
KB_wr/s: Number of kilobytes the task has caused, or shall cause to be written to disk per second.
minflt/s: Total number of minor faults the task has made per second, those which have not required loading a memory page from disk.
majflt/s: Total number of major faults the task has made per second, those which have required loading a memory page from disk.
VSZ: Virtual Size
RSS: Resident Set Size
threads: Number of threads associated with current task.
fd-nr: Number of file descriptors associated with current task.
cswch/s: Total number of voluntary context switches the task made per second. A voluntary context switch occurs when a task blocks because it requires a resource that is unavailable.
nvcswch/s: Total number of non voluntary context switches the task made per second. An involuntary context switch takes place when a task executes for the duration of its time slice and then is forced to relinquish the processor.
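
Per-second context-switch rates of this kind can be approximated from the `voluntary_ctxt_switches` and `nonvoluntary_ctxt_switches` counters in `/proc/[pid]/status`, sampled twice. The sketch below is an illustration under that assumption, not a description of how pidstat is implemented; the helper names, the two status excerpts, and the 5-second interval are all made up.

```python
KEYS = ("voluntary_ctxt_switches", "nonvoluntary_ctxt_switches")


def parse_ctxt_switches(status_text):
    """Extract the two context-switch counters from /proc/[pid]/status text."""
    counters = {}
    for line in status_text.splitlines():
        key, _, value = line.partition(":")
        if key in KEYS:
            counters[key] = int(value.strip())
    return counters


def switch_rates(before, after, interval_seconds):
    """Return (voluntary, involuntary) context switches per second."""
    return tuple(
        (after[key] - before[key]) / interval_seconds for key in KEYS
    )


# Made-up excerpts of /proc/[pid]/status, sampled 5 seconds apart.
first = "Name:\tdemo\nvoluntary_ctxt_switches:\t100\nnonvoluntary_ctxt_switches:\t20\n"
second = "Name:\tdemo\nvoluntary_ctxt_switches:\t150\nnonvoluntary_ctxt_switches:\t30\n"

cswch, nvcswch = switch_rates(
    parse_ctxt_switches(first), parse_ctxt_switches(second), 5
)
print(cswch, nvcswch)
```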