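【Posted】: 2017-09-11 00:00:36
【Question】:
The following snippet processes the filters in parallel and writes a separate file for each one into the output directory. Is there a way to get one large output file instead?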
Array(
  (filter1, outputPathBase + fileName),
  (filter2, outputPathBase + fileName),
  (filter3, outputPathBase + fileName)
).par.foreach {
  case (extract, path) => extract.coalesce(1).write.mode("append").csv(path)
}
Thanks.
【Discussion】:
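One possible approach, offered as a sketch rather than a definitive answer: since each filter is written with its own `coalesce(1)` in append mode, the directory ends up with one part file per filter. If the three filters are DataFrames with the same schema (an assumption, the question does not say), they could be combined with `union` first and written once. In the sketch below, the sample data, the filter conditions, the object name `SingleCsvOutput`, and the path `/tmp/single-csv-output` are all hypothetical stand-ins for the `filter1`..`filter3` and `outputPathBase + fileName` in the question.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

object SingleCsvOutput {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("single-csv-output")
      .master("local[*]") // assumption: local run, for illustration only
      .getOrCreate()
    import spark.implicits._

    // Hypothetical stand-ins for filter1..filter3 from the question.
    val source  = Seq((1, "a"), (2, "b"), (3, "c"), (4, "d")).toDF("id", "value")
    val filter1 = source.filter($"id" === 1)
    val filter2 = source.filter($"id" === 2)
    val filter3 = source.filter($"id" >= 3)

    // Hypothetical output location (outputPathBase + fileName in the question).
    val outputPath = "/tmp/single-csv-output"

    // Combine the filtered DataFrames; union requires matching schemas.
    val combined: DataFrame = Seq(filter1, filter2, filter3).reduce(_ union _)

    // A single partition yields a single part file inside the output directory.
    // "overwrite" is used here so that reruns of the sketch do not accumulate files;
    // the question uses "append".
    combined.coalesce(1).write.mode("overwrite").csv(outputPath)

    spark.stop()
  }
}
```

Note that Spark still writes a directory at `outputPath`; the data lands in one `part-...csv` file inside it. `coalesce(1)` also funnels the whole write through a single task, so this is only reasonable when the combined result is small enough for that.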
Tags: scala apache-spark scala-collections