Prerequisite: a Hadoop multi-node cluster

1. Start the Hadoop cluster

   start-all.sh

2. Start the Spark Standalone cluster

   /usr/local/spark/sbin/start-all.sh

3. Run IPython Notebook to use Spark

   cd ~/pythonwork/ipynotebook

  PYSPARK_DRIVER_PYTHON=ipython PYSPARK_DRIVER_PYTHON_OPTS="notebook" MASTER=spark://master:7077 pyspark --num-executors 1 --total-executor-cores 2 --executor-memory 512m
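The launch line in step 3 packs three environment variables and three pyspark options into a single command. As a rough sketch of how the pieces fit together, the Python below rebuilds that line from its parts. The helper names `build_pyspark_launch` and `as_shell_line` are hypothetical and exist only for this illustration; the values are the ones used in the command above. It only prints the command and does not start anything.

```python
import shlex


def build_pyspark_launch(master="spark://master:7077",
                         num_executors=1,
                         total_executor_cores=2,
                         executor_memory="512m"):
    """Return (env, argv) for the step 3 launch command (hypothetical helper)."""
    # Environment variables: run the driver under IPython, in notebook mode,
    # and point it at the standalone master.
    env = {
        "PYSPARK_DRIVER_PYTHON": "ipython",
        "PYSPARK_DRIVER_PYTHON_OPTS": "notebook",
        "MASTER": master,
    }
    # Command-line options, in the same order as the original command.
    argv = [
        "pyspark",
        "--num-executors", str(num_executors),
        "--total-executor-cores", str(total_executor_cores),
        "--executor-memory", executor_memory,
    ]
    return env, argv


def as_shell_line(env, argv):
    """Join env assignments and arguments into one shell command line."""
    assignments = [f"{key}={shlex.quote(value)}" for key, value in env.items()]
    return " ".join(assignments + [shlex.join(argv)])


env, argv = build_pyspark_launch()
print(as_shell_line(env, argv))
```

Changing `total_executor_cores` or `executor_memory` in the call gives a variant of the same command for a larger or smaller allocation.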

14. Running IPython Notebook in Spark Standalone mode

 
