【发布时间】:2019-11-26 11:50:57
【问题描述】:
用火花,
import spark.implicits._
val data = Seq(
(1, ("value11", "value12")),
(2, ("value21", "value22")),
(3, ("value31", "value32"))
)
val df = data.toDF("id", "v1")
df.printSchema()
结果如下:
root
|-- id: integer (nullable = false)
|-- v1: struct (nullable = true)
| |-- _1: string (nullable = true)
| |-- _2: string (nullable = true)
现在如果我想自己创建架构,我应该如何处理?
val schema = StructType(Array(
StructField("id", IntegerType),
StructField("nested", ???)
))
谢谢。
【问题讨论】:
标签: apache-spark dataframe apache-spark-sql schema