【发布时间】:2020-02-06 13:55:56
【问题描述】:
我正在尝试在执行SQL 的MR 上使用Hive,但它中途失败并出现以下错误:
Application application_1570514228864_0001 failed 2 times due to AM Container for appattempt_1570514228864_0001_000002 exited with exitCode: -1000
Failing this attempt.Diagnostics: [2019-10-08 13:57:49.272]Failed to download resource { { s3a://tpcds/tmp/hadoop-yarn/staging/root/.staging/job_1570514228864_0001/libjars, 1570514262820, FILE, null },pending,[(container_1570514228864_0001_02_000001)],1132444167207544,DOWNLOADING} java.io.IOException: Resource s3a://tpcds/tmp/hadoop-yarn/staging/root/.staging/job_1570514228864_0001/libjars changed on src filesystem (expected 1570514262820, was 1570514269265
从我的角度来看,错误日志中的关键信息是libjars changed on src filesystem (expected 1570514262820, was 1570514269265。 SO上有几个关于这个问题的帖子,但还没有得到回答,比如thread1和thread2。
我从apache jira 和redhat bugzilla 中发现了一些有价值的东西。我通过所有相关节点通过NTP 同步时钟。但同样的问题仍然存在。
欢迎任何评论,谢谢。
【问题讨论】: