Spark Streaming个人的定义:
将不同的数据源的数据经过Spark St reaming处理之后将结果输出到外部文件系统
特点:
低延时
能从错误中高效的恢复: fault-tolerant
能够运行在成百上千的节点
能够将批处理、机器学习、图计算等子框架和Spark St reaming综合起来使用
Spark Streaming receives live input data streams and divides the data into batches, which are then processed by the Spark engine to generate the final stream of results in batches.