【发布时间】:2018-08-10 07:39:49
【问题描述】:
我不确定用什么方法来表达这个问题,但举个例子会有所帮助。这是我的列数据框:名称、类型和计数:
+------+------+-------+
| Name | Type | Count |
+------+------+-------+
| a | 0 | 5 |
| a | 1 | 4 |
| a | 5 | 5 |
| a | 4 | 5 |
| a | 2 | 1 |
| b | 0 | 2 |
| b | 1 | 4 |
| b | 3 | 5 |
| b | 4 | 5 |
| b | 2 | 1 |
| c | 0 | 5 |
| c | ... | ... |
+------+------+-------+
我想获得一个像这样结构的新数据框,其中 Type 列值已成为新列:
+------+---+-----+---+---+---+---+
| Name | 0 | 1 | 2 | 3 | 4 | 5 | <- Number columns are types from input
+------+---+-----+---+---+---+---+
| a | 5 | 4 | 1 | 0 | 5 | 5 |
| b | 2 | 4 | 1 | 5 | 5 | 0 |
| c | 5 | ... | | | | |
+------+---+-----+---+---+---+---+
这里的列是 [Name,0,1,2,3,4,5]。
【问题讨论】:
标签: scala apache-spark apache-spark-sql