【发布时间】:2015-09-23 06:19:23
【问题描述】:
连接两个数据框时如何提供更多列条件。例如我想运行以下内容:
val Lead_all = Leads.join(Utm_Master,
Leaddetails.columns("LeadSource","Utm_Source","Utm_Medium","Utm_Campaign") ==
Utm_Master.columns("LeadSource","Utm_Source","Utm_Medium","Utm_Campaign"),
"left")
我只想在这些列匹配时加入。但上述语法无效,因为 cols 只接受一个字符串。那么如何才能得到我想要的呢。
【问题讨论】:
标签: apache-spark apache-spark-sql rdd