【发布时间】:2014-12-14 02:02:27
【问题描述】:
我正在 EMR 上运行 Spark,如 Run Spark and Spark SQL on Amazon Elastic MapReduce 中所述:
本教程将引导您快速安装和操作 Spark 在 Amazon EMR 上进行大规模数据处理的通用引擎 簇。您还将使用在 Amazon S3 中创建和查询数据集 Spark SQL,并了解如何在 Amazon EMR 集群上监控 Spark 使用 Amazon CloudWatch。
我试图通过编辑$HOME/spark/conf/log4j.properties 来抑制INFO 日志,但无济于事。
输出如下:
$ ./spark/bin/spark-sql
Spark assembly has been built with Hive, including Datanucleus jars on classpath
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/home/hadoop/.versions/2.4.0/share/hadoop/common/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/home/hadoop/.versions/spark-1.1.1.e/lib/spark-assembly-1.1.1-hadoop2.4.0.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
2014-12-14 20:59:01,819 INFO [main] Configuration.deprecation (Configuration.java:warnOnceIfDeprecated(1009)) - mapred.input.dir.recursive is deprecated. Instead, use mapreduce.input.fileinputformat.input.dir.recursive
2014-12-14 20:59:01,825 INFO [main] Configuration.deprecation (Configuration.java:warnOnceIfDeprecated(1009)) - mapred.max.split.size is deprecated. Instead, use mapreduce.input.fileinputformat.split.maxsize
2014-12-14 20:59:01,825 INFO [main] Configuration.deprecation (Configuration.java:warnOnceIfDeprecated(1009)) - mapred.min.split.size is deprecated. Instead, use mapreduce.input.fileinputformat.split.minsize
2014-12-14 20:59:01,825 INFO [main] Configuration.deprecation (Configuration.java:warnOnceIfDeprecated(1009)) - mapred.min.split.size.per.rack is deprecated. Instead, use mapreduce.input.fileinputformat.split.minsize.per.rack
如何抑制上面的 INFO 消息?
【问题讨论】:
-
Spark 的 java 包是什么?
-
一些进展:做一个
./spark/bin/spark-shell --driver-java-options "-Dlog4j.debug"允许我传递一个JVM 选项并看到它是一个jar 中某处的log4j.properties 文件。做./spark/bin/spark-shell --driver-java-options "-Dlog4j.configuration=file:///home/hadoop/spark/conf/log4j.properties"似乎工作,虽然这看起来非常笨重
标签: log4j apache-spark emr