【问题标题】:Shell, run four processes parallelShell,并行运行四个进程
【发布时间】:2013-11-02 22:31:17
【问题描述】:

目前,我正忙于高效地运行耗时的模拟。目的是并行运行 4 个模拟,因为它是一个单线程应用程序和一个四核系统。我必须更改 shell 脚本:

./sim -r 1 &
./sim -r 2 &
./sim -r 3 &
./sim -r 4 &
wait
./sim -r 5 &
./sim -r 6 &
./sim -r 7 &
./sim -r 8 &
wait
... (another 112 jobs) 

使用此代码会一次又一次地等待。我还尝试将任务分成四个脚本并运行每个脚本,结果是一个脚本完成了,而另一个脚本完成了大约 30% 的剩余工作。我无法预测模拟需要多长时间。

有什么建议可以随时运行 4 次模拟吗?

【问题讨论】:

  • 有趣的问题。 AFAIK,它不能在bash(或 Korn shell、Bourne shell 或 POSIX shell)中完成。您可以等待特定进程(bash)或所有进程(所有进程),但不能一次等待一组进程中的一个。我可能会使用 Perl(或者 Python,也许)来运行;它可以等待任何进程终止,然后启动另一个。

标签: shell unix ubuntu


【解决方案1】:

在 Ubuntu 中安装 moreutils 软件包,然后使用 parallel 实用程序:

parallel -j 4 ./sim -r -- 1 2 3 4 5 6 7 8 ...

【讨论】:

【解决方案2】:

如果您不想安装 parallel 实用程序(假设它按指示工作,它看起来很整洁),那么您可以调整这个 Perl 脚本(基本上,更改执行的命令),并可能减少监控:

#!/usr/bin/env perl
use strict;
use warnings;
use constant MAX_KIDS => 4;

$| = 1;

my %pids;
my $kids = 0;   # Number of kids

for my $i (1..20)
{
    my $pid;
    if (($pid = fork()) == 0)
    {
        my $tm = int(rand() * (10 - 2) + 2);
        print "sleep $tm\n";
        # Using exec in a block on its own is the documented way to
        # avoid the warning:
        # Statement unlikely to be reached at filename.pl line NN.
        #   (Maybe you meant system() when you said exec()?)
        # Yes, I know the print and exit statements should never be
        # reached, but, dammit, sometimes things go wrong!
        { exec "sleep", $tm; }
        print STDERR "Oops: couldn't sleep $tm!\n";
        exit 1;
    }
    $pids{$pid} = 1;
    $kids++;
    my $time = time;
    print "PID: $pid; Kids: $kids; Time: $time\n";
    if ($kids >= MAX_KIDS)
    {
        my $kid = waitpid(-1, 0);
        print "Kid: $kid ($?)\n";
        if ($kid != -1)
        {
            delete $pids{$kid};
            $kids--;
        }
    }
}

while ((my $kid = waitpid(-1, 0)) > 0)
{
    my $time = time;
    print "Kid: $kid (Status: $?); Time: $time\n";
    delete $pids{$kid};
    $kids--;
}

# This should not do anything - and doesn't (any more!).
foreach my $pid (keys %pids)
{
    printf "Undead: $pid\n";
}

示例输出:

PID: 20152; Kids: 1; Time: 1383436882
PID: 20153; Kids: 2; Time: 1383436882
sleep 5
PID: 20154; Kids: 3; Time: 1383436882
sleep 7
sleep 9
PID: 20155; Kids: 4; Time: 1383436882
sleep 4
Kid: 20155 (0)
PID: 20156; Kids: 4; Time: 1383436886
sleep 4
Kid: 20152 (0)
PID: 20157; Kids: 4; Time: 1383436887
sleep 2
Kid: 20153 (0)
PID: 20158; Kids: 4; Time: 1383436889
sleep 9
Kid: 20157 (0)
PID: 20159; Kids: 4; Time: 1383436889
sleep 6
Kid: 20156 (0)
PID: 20160; Kids: 4; Time: 1383436890
sleep 6
Kid: 20154 (0)
PID: 20161; Kids: 4; Time: 1383436891
sleep 9
Kid: 20159 (0)
PID: 20162; Kids: 4; Time: 1383436895
sleep 7
Kid: 20160 (0)
PID: 20163; Kids: 4; Time: 1383436896
sleep 9
Kid: 20158 (0)
PID: 20164; Kids: 4; Time: 1383436898
sleep 6
Kid: 20161 (0)
PID: 20165; Kids: 4; Time: 1383436900
sleep 9
Kid: 20162 (0)
PID: 20166; Kids: 4; Time: 1383436902
sleep 9
Kid: 20164 (0)
PID: 20167; Kids: 4; Time: 1383436904
sleep 2
Kid: 20163 (0)
PID: 20168; Kids: 4; Time: 1383436905
sleep 6
Kid: 20167 (0)
PID: 20169; Kids: 4; Time: 1383436906
sleep 9
Kid: 20165 (0)
PID: 20170; Kids: 4; Time: 1383436909
sleep 4
Kid: 20168 (0)
PID: 20171; Kids: 4; Time: 1383436911
Kid: 20166 (0)
sleep 9
Kid: 20170 (Status: 0); Time: 1383436913
Kid: 20169 (Status: 0); Time: 1383436915
Kid: 20171 (Status: 0); Time: 1383436920

【讨论】:

    【解决方案3】:
    NUMJOBS=30
    NUMPOOLS=4
    
    seq 1 "$NUMJOBS" | for p in $(seq 1 $NUMPOOLS); do
        while read x; do ./sim -r "$x"; done &
    done
    

    for 循环创建一个后台进程池,从共享标准输入中读取数据以启动模拟。每个后台进程在其模拟运行时“阻塞”,然后从seq 命令读取下一个作业号。

    如果没有 for 循环,可能会更容易理解:

    seq 1 "$NUMJOBS" | {
        while read x; do ./sim -r "$x"; done &
        while read x; do ./sim -r "$x"; done &
        while read x; do ./sim -r "$x"; done &
        while read x; do ./sim -r "$x"; done &
    }
    

    假设sim 需要大量时间来运行,第一个while 将从其标准输入中读取1,第二个2 等。无论sim 哪个先完成,while 循环将read 5 来自标准输入;下一个完成将读取 6,依此类推。一旦最后一次模拟开始,每个read都会失败,导致循环退出。

    【讨论】:

      猜你喜欢
      • 2016-08-09
      • 2010-10-14
      • 1970-01-01
      • 2013-10-10
      • 1970-01-01
      • 1970-01-01
      • 1970-01-01
      • 1970-01-01
      相关资源
      最近更新 更多