【问题标题】:spark-cassandra connector in local gives Spark cluster looks down本地的 spark-cassandra 连接器使 Spark 集群看起来向下
【发布时间】:2016-07-14 05:30:15
【问题描述】:

我对 spark 和 cassandra 很陌生。我正在尝试一个简单的 java 程序,我正在尝试使用 datastax 提供的 spark-cassandra-connector 向 cassandra 表添加新行。

我在笔记本电脑上运行 dse。使用 java,我试图通过 Spark 将数据保存到 cassandra DB。以下是代码:

Map<String, String> extra = new HashMap<String, String>();
        extra.put("city", "bangalore");
        extra.put("dept", "software");
        List<User> products = Arrays.asList(new User(1, "vamsi", extra));
        JavaRDD<User> productsRDD = sc.parallelize(products);
        javaFunctions(productsRDD, User.class).saveToCassandra("test", "users");

当我执行此代码时,我收到以下错误

16/03/26 20:57:31 INFO client.AppClient$ClientActor: Connecting to master spark://127.0.0.1:7077... 16/03/26 20:57:44 WARN scheduler.TaskSchedulerImpl: Initial job has not accepted any resources; check your cluster UI to ensure that workers are registered and have sufficient memory 16/03/26 20:57:51 INFO client.AppClient$ClientActor: Connecting to master spark://127.0.0.1:7077... 16/03/26 20:57:59 WARN scheduler.TaskSchedulerImpl: Initial job has not accepted any resources; check your cluster UI to ensure that workers are registered and have sufficient memory 16/03/26 20:58:11 ERROR client.AppClient$ClientActor: All masters are unresponsive! Giving up. 16/03/26 20:58:11 ERROR cluster.SparkDeploySchedulerBackend: Spark cluster looks dead, giving up. 16/03/26 20:58:11 INFO scheduler.TaskSchedulerImpl: Removed TaskSet 0.0, whose tasks have all completed, from pool 16/03/26 20:58:11 INFO scheduler.DAGScheduler: Failed to run runJob at RDDFunctions.scala:48 Exception in thread "main" org.apache.spark.SparkException: Job aborted: Spark cluster looks down at org.apache.spark.scheduler.DAGScheduler$$anonfun$org$apache$spark$scheduler$DAGScheduler$$abortStage$1.apply(DAGScheduler.scala:1020) at org.apache.spark.scheduler.DAGScheduler$$anonfun$org$apache$spark$scheduler$DAGScheduler$$abortStage$1.apply(DAGScheduler.scala:1018) at scala.collection.mutable.ResizableArray$class.foreach(ResizableArray.scala:59) at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:47) at org.apache.spark.scheduler.DAGScheduler.org$apache$spark$scheduler$DAGScheduler$$abortStage(DAGScheduler.scala:1018) at org.apache.spark.scheduler.DAGScheduler$$anonfun$processEvent$10.apply(DAGScheduler.scala:604) at org.apache.spark.scheduler.DAGScheduler$$anonfun$processEvent$10.apply(DAGScheduler.scala:604) at scala.Option.foreach(Option.scala:236) at org.apache.spark.scheduler.DAGScheduler.processEvent(DAGScheduler.scala:604) at org.apache.spark.scheduler.DAGScheduler$$anonfun$start$1$$anon$2$$anonfun$receive$1.applyOrElse(DAGScheduler.scala:190) at akka.actor.ActorCell.receiveMessage(ActorCell.scala:498) at akka.actor.ActorCell.invoke(ActorCell.scala:456) at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:237) at akka.dispatch.Mailbox.run(Mailbox.scala:219) at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:386) at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339) at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979) at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)

【问题讨论】:

    标签: apache-spark datastax datastax-enterprise spark-cassandra-connector


    【解决方案1】:

    看来您需要修复 Spark 配置...请参阅:

    http://www.datastax.com/dev/blog/common-spark-troubleshooting

    【讨论】:

      猜你喜欢
      • 2017-08-19
      • 2022-06-15
      • 2016-09-12
      • 2018-12-10
      • 2023-03-14
      • 1970-01-01
      • 2017-01-13
      • 2020-08-10
      • 1970-01-01
      相关资源
      最近更新 更多