【Posted】: 2018-08-27 10:27:21
【Question】:
I'm trying to learn GraphX from this code on GitHub (click here).
In spark-shell, when I run this:
def parseFlight(str: String): Flight = {
val line = str.split(",")
Flight(line(0), line(1), line(2), line(3), line(4).toInt, line(5).toLong, line(6), line(7).toLong, line(8), line(9).toDouble, line(10).toDouble, line(11).toDouble, line(12).toDouble, line(13).toDouble, line(14).toDouble, line(15).toDouble, line(16).toInt)
}
val textRDD = sc.textFile("/user/user01/data/rita2014jan.csv")
val flightsRDD = textRDD.map(parseFlight).cache()
val airports = flightsRDD.map(flight => (flight.org_id, flight.origin)).distinct
airports.take(1)
I get this exception, pointing at airports.take(1):
java.lang.NumberFormatException: empty String
Can anyone tell me what I'm missing?
【Discussion】:
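A likely cause, assuming the CSV has blank values in some of the numeric columns: "empty String" is the message java.lang.Double.parseDouble gives for "", so one of the .toDouble calls in parseFlight is probably hitting an empty field. It only shows up at airports.take(1) because Spark transformations are lazy and the map does not run until an action is called. Below is a minimal sketch of a more tolerant parse. The helpers toDoubleOr and toIntOr and the sample line are hypothetical, not from the original code, and falling back to 0 is an assumption; dropping bad rows may suit the data better.

```scala
import scala.util.Try

// Hypothetical helpers (not in the original code): return a default
// instead of throwing when a field is empty or not a number.
def toDoubleOr(s: String, default: Double = 0.0): Double =
  Try(s.trim.toDouble).getOrElse(default)

def toIntOr(s: String, default: Int = 0): Int =
  Try(s.trim.toInt).getOrElse(default)

// Made-up sample row with blank delay/time columns, for illustration only.
val sample = "1,3,AA,N338AA,1,12478,JFK,12892,LAX,900,,,1225,,,385,2475"

// split(",", -1) keeps trailing empty fields; plain split(",") drops them,
// which can shorten the array when the last columns are blank.
val fields = sample.split(",", -1)

val crsdeptime = toDoubleOr(fields(9))   // present, parses normally
val deptime    = toDoubleOr(fields(10))  // blank, falls back to the default
val dist       = toIntOr(fields(16))
```

Inside parseFlight, the same idea would mean replacing line(9).toDouble with toDoubleOr(line(9)) and so on for the other numeric fields. To find the offending rows first, something like textRDD.filter(l => Try(parseFlight(l)).isFailure).take(5) should list a few of them.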
Tags: scala apache-spark bigdata spark-graphx databricks