【发布时间】:2017-07-25 08:52:54
【问题描述】:
我有 pig 脚本,当我从 pig(map reduce 模式)运行时会给出正确的结果,但是当我从 crontab 调度时,不会按照脚本存储输出。
猪脚本是,
a1 = load '/user/training/abhijit_hdfs/id' using PigStorage('\t') as (id:int,name:chararray,desig:chararray);
a2 = load '/user/training/abhijit_hdfs/trips' using PigStorage('\t') as (id:int,place:chararray,no_trips:int);
j = join a1 by id,a2 by id;
g = group j by(a1::id,a1::name,a1::desig);`
`su = foreach g generate group,SUM(j.a2::no_trips) as tripsum;
ord = order su by tripsum desc;
f2 = foreach ord generate $0.$0,$0.$1,$0.$2,$1;
store f2 into '/user/training/abhijit_hdfs/results/trip_output' using PigStorage(' ');
crontab 是,
[training@localhost ~]$ crontab -l
40 3 * * * /home/training/Abhijit_Local/trip_crontab.pig
请指导。
【问题讨论】:
标签: hadoop cron apache-pig bigdata