【发布时间】:2019-03-16 10:33:00
【问题描述】:
数据集如下所示。卡在将HIRE_DATE 格式更改为日期格式字段
EMPLOYEE_ID,FIRST_NAME,LAST_NAME,EMAIL,PHONE_NUMBER,HIRE_DATE,JOB_ID,SALARY,COMMISSION_PCT,MANAGER_ID,DEPARTMENT_ID
100,Steven,King,SKING,515.123.4567,17-JUN-03,AD_PRES,24000, - , - ,90
101,Neena,Kochhar,NKOCHHAR,515.123.4568,21-SEP-05,AD_VP,17000, - ,100,90
还有代码sn-p
val empData = sparkSession.read.option("header", "true").option("inferSchema", "true").
csv(filePath)empData.printSchema()
printSchema 输出为HIRE_DATE 字段提供字符串。但我期待Dateformat 字段。我该如何改变?
【问题讨论】:
标签: scala apache-spark apache-spark-sql