【发布时间】:2022-01-18 02:39:11
【问题描述】:
我有这个数据框:
+----+--------------------------------+
|name|dates |
+----+--------------------------------+
|A |[[1994, 12, 11], [,,]] |
|B |[[1994, 12, 11], [1994, 12, 15]]|
+----+--------------------------------+
使用此架构:
root
|-- name: string (nullable = true)
|-- dates: struct (nullable = true)
| |-- start_date: struct (nullable = true)
| | |-- year: integer (nullable = true)
| | |-- month: integer (nullable = true)
| | |-- day: integer (nullable = true)
| |-- end_date: struct (nullable = true)
| | |-- year: integer (nullable = true)
| | |-- month: integer (nullable = true)
| | |-- day: integer (nullable = true)
我想把它作为输出
当end_date内的所有字段为null时,设置结束日期为null
+----+--------------------------------+
|name|dates |
+----+--------------------------------+
|A |[[1994, 12, 11],] |
|B |[[1994, 12, 11], [1994, 12, 15]]|
+----+--------------------------------+
【问题讨论】:
标签: scala apache-spark apache-spark-sql