【问题标题】:Communication with JobManager failed error in wordcount example in flinkflink 中 wordcount 示例中与 JobManager 的通信失败错误
【发布时间】:2016-03-22 22:00:35
【问题描述】:

我尝试运行 wordcount 示例: ../bin/flink run WordCount.jar

并在执行几分钟后给我一个“与 JobManager 通信失败错误”。

来源: https://github.com/apache/flink/blob/master/flink-examples/flink-scala-examples

异常如下:

Executing WordCount example with built-in default data.
  Provide parameters to read input data from a file.
  Usage: WordCount <text path> <result path>
org.apache.flink.client.program.ProgramInvocationException: The program execution failed: Communication with JobManager failed: Lost connection to the JobManager.
  at org.apache.flink.client.program.Client.runBlocking(Client.java:370)
  at org.apache.flink.client.program.Client.runBlocking(Client.java:348)
  at org.apache.flink.client.program.Client.runBlocking(Client.java:315)
  at org.apache.flink.client.program.ContextEnvironment.execute(ContextEnvironment.java:70)
  at org.apache.flink.api.java.ExecutionEnvironment.execute(ExecutionEnvironment.java:804)
  at org.apache.flink.api.java.DataSet.collect(DataSet.java:410)
  at org.apache.flink.api.java.DataSet.print(DataSet.java:1495)
  at org.apache.flink.examples.java.wordcount.WordCount.main(WordCount.java:80)
  at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
  at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
  at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
  at java.lang.reflect.Method.invoke(Method.java:483)
  at org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:497)
  at org.apache.flink.client.program.PackagedProgram.invokeInteractiveModeForExecution(PackagedProgram.java:395)
  at org.apache.flink.client.program.Client.runBlocking(Client.java:252)
  at org.apache.flink.client.CliFrontend.executeProgramBlocking(CliFrontend.java:675)
  at org.apache.flink.client.CliFrontend.run(CliFrontend.java:326)
  at org.apache.flink.client.CliFrontend.parseParameters(CliFrontend.java:977)
  at org.apache.flink.client.CliFrontend.main(CliFrontend.java:1027)
Caused by: org.apache.flink.runtime.client.JobExecutionException: Communication with JobManager failed: Lost connection to the JobManager.
  at org.apache.flink.runtime.client.JobClient.submitJobAndWait(JobClient.java:141)
  at org.apache.flink.client.program.Client.runBlocking(Client.java:368)
... 18 more
Caused by: org.apache.flink.runtime.client.JobClientActorConnectionTimeoutException: Lost connection to the JobManager.
  at org.apache.flink.runtime.client.JobClientActor.handleMessage(JobClientActor.java:243)
  at org.apache.flink.runtime.akka.FlinkUntypedActor.handleLeaderSessionID(FlinkUntypedActor.java:88)
  at org.apache.flink.runtime.akka.FlinkUntypedActor.onReceive(FlinkUntypedActor.java:68)
  at akka.actor.UntypedActor$$anonfun$receive$1.applyOrElse(UntypedActor.scala:167)
  at akka.actor.Actor$class.aroundReceive(Actor.scala:465)
  at akka.actor.UntypedActor.aroundReceive(UntypedActor.scala:97)
  at akka.actor.ActorCell.receiveMessage(ActorCell.scala:516)
  at akka.actor.ActorCell.invoke(ActorCell.scala:487)
/src/main/scala/org/apache/flink/examples/scala/wordcount/WordCount.scala
  at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:254)
  at akka.dispatch.Mailbox.run(Mailbox.scala:221)
  at akka.dispatch.Mailbox.exec(Mailbox.scala:231)  
  at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)   
  at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253)
  at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346)
  at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
  at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)

在尝试运行您的命令时发生上述异常。

【问题讨论】:

  • 你能和我们分享JobManager的日志吗?它应该在FLINK_HOME/log/flink-XXX-jobmanager-XXX.log
  • 14:35:45,187 WARN Remoting - 尝试与无法访问的远程地址 [akka.tcp://flink@127.0.0.1:6123] 关联。地址现在被门控 5000 毫秒,所有到该地址的消息都将被传递到死信。原因:连接被拒绝:/127.0.0.1:6123 14:37:25,065 错误 org.apache.flink.client.CliFrontend - 运行命令时出错。 org.apache.flink.client.program.ProgramInvocationException:程序执行失败:与 JobManager 通信失败:与 JobManager 的连接丢失。 (...)

标签: scala word-count apache-flink


【解决方案1】:

我解决了分析@Till 建议的日志的问题。 服务器没有正常运行。重启所有服务器解决。

【讨论】:

  • 你说的是哪台服务器,我还是面临这个问题。
  • @PushpendraJaiswal 通过运行bin/start-local.sh启动服务器
猜你喜欢
  • 1970-01-01
  • 1970-01-01
  • 1970-01-01
  • 1970-01-01
  • 1970-01-01
  • 2018-01-29
  • 1970-01-01
  • 1970-01-01
  • 1970-01-01
相关资源
最近更新 更多