【发布时间】:2016-12-24 03:18:46
【问题描述】:
我正在研究 spark 1.3.0 。我的 build.sbt 如下所示:
libraryDependencies ++= Seq(
"org.apache.spark" %% "spark-core" % "1.3.0" % "provided",
"org.apache.spark" %% "spark-sql" % "1.3.0" % "provided",
"org.apache.spark" %% "spark-streaming" % "1.3.0" % "provided",
"org.apache.spark" %% "spark-mllib" % "1.3.0" % "provided",
"org.springframework.security" % "spring-security-web" % "3.0.7.RELEASE",
"com.databricks" % "spark-csv_2.10" % "1.4.0"
)
// META-INF discarding
mergeStrategy in assembly <<= (mergeStrategy in assembly) { (old) =>
{
case PathList("META-INF", xs @ _*) => MergeStrategy.discard
case x => MergeStrategy.first
}
}
在编译期间使用了这个 sbt 文件 hadoop 2.2.0。但是我的运行环境包含 hadoop 2.6.0 。谁能帮助我如何从 spark 库中排除 hadoop 依赖项并在 sbt 文件中提及 hadoop 2.6.0?
谢谢
【问题讨论】:
标签: hadoop apache-spark sbt spark-streaming hadoop2