【发布时间】:2017-08-03 06:35:20
【问题描述】:
我正在处理来自 Spark(EMR,在 Yarn 模式下)的 HBase 表。实际上,PySpark - 我认为这并不重要。我从 HBase 集群外部通过单独的 Thrift 服务调用 HBase。
看起来我能够连接到 Thrift 服务器,但 ZooKeeper 有一些问题(因为错误将我指向 ZooKeeper 端口 2181)。
为什么会发生这种情况,我该如何解决?
17/08/02 20:21:31 INFO ZooKeeper: Client environment:java.io.tmpdir=/tmp
17/08/02 20:21:31 INFO ZooKeeper: Client environment:java.compiler=<NA>
17/08/02 20:21:31 INFO ZooKeeper: Client environment:os.name=Linux
17/08/02 20:21:31 INFO ZooKeeper: Client environment:os.arch=amd64
17/08/02 20:21:31 INFO ZooKeeper: Client environment:os.version=4.4.35-33.55.amzn1.x86_64
17/08/02 20:21:31 INFO ZooKeeper: Client environment:user.name=hadoop
17/08/02 20:21:31 INFO ZooKeeper: Client environment:user.home=/home/hadoop
17/08/02 20:21:31 INFO ZooKeeper: Client environment:user.dir=/home/hadoop/data
17/08/02 20:21:31 INFO ZooKeeper: Initiating client connection, connectString=thrift-internal.production.k8s.prod.node.io:2181 sessionTimeout=180000 watcher=org.apache.hadoop.hbase.zookeeper.PendingWatcher@2818bc0e
17/08/02 20:21:31 INFO ClientCnxn: Opening socket connection to server ip-172-23-115-152.us-west-2.compute.internal/172.23.115.152:2181. Will not attempt to authenticate using SASL (unknown error)
【问题讨论】:
标签: apache-spark pyspark hbase apache-zookeeper thrift