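【Posted】:2017-01-04 22:16:04
【Question】:
I get an error when trying to run the word count example with Spark Streaming and Python.
I don't know how to proceed. Below are the command I am running and the error.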
/opt/spark/bin/spark-submit --jars spark-streaming_2.10-2.0.0.jar test_kafka.py broker.txt "localhost:2181:MyTopic"
Error:
Traceback (most recent call last):
  File "/home/ubuntu/kafka/libs/test_kafka.py", line 21, in <module>
    kvs = KafkaUtils.createDirectStream(ssc, [topic], {"metadata.broker.list": brokers})
  File "/opt/spark/python/lib/pyspark.zip/pyspark/streaming/kafka.py", line 122, in createDirectStream
  File "/opt/spark/python/lib/pyspark.zip/pyspark/streaming/kafka.py", line 195, in _get_helper
TypeError: 'JavaPackage' object is not callable
【Discussion】:
-
You may be missing some import statements. I ran into a similar problem with pyspark. stackoverflow.com/questions/37153866/…
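-
Another thing worth checking: as far as I can tell, `_get_helper` in `pyspark/streaming/kafka.py` instantiates a JVM-side Kafka helper class, and `'JavaPackage' object is not callable` is what py4j reports when that class is not on the classpath. The command in the question passes only `spark-streaming_2.10-2.0.0.jar`, which is the core streaming jar and not the Kafka integration. Below is a minimal sketch of a pre-flight check on the `--jars` value. It assumes the integration jar's file name starts with `spark-streaming-kafka`, and `find_kafka_jars` is a hypothetical helper, not part of pyspark.

```python
import os


def find_kafka_jars(jars_arg):
    """Return the entries of a --jars value that look like Kafka integration jars.

    Assumption: the integration jar's file name starts with
    "spark-streaming-kafka" (hypothetical check, not a pyspark API).
    """
    # --jars takes a comma-separated list of paths; keep only the file names.
    names = [os.path.basename(p.strip()) for p in jars_arg.split(",") if p.strip()]
    return [n for n in names if n.startswith("spark-streaming-kafka")]


# The --jars value from the command in the question.
submitted = "spark-streaming_2.10-2.0.0.jar"
print(find_kafka_jars(submitted))  # empty list: no Kafka integration jar was passed
```

If that turns out to be the cause, passing the integration package to spark-submit, for example `--packages org.apache.spark:spark-streaming-kafka-0-8_2.11:2.0.0`, is the usual remedy. The Scala suffix and version have to match the Spark build in use, so treat those coordinates as an example and not a drop-in value.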
Tags: python hadoop apache-spark pyspark spark-streaming