Recommended Resources
Official Resources
Spark Blog
Spark In-Depth Studies
Spark Papers
Spark: Cluster Computing with Working Sets (Matei Zaharia)
Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing (Matei Zaharia)
Discretized Streams: An Efficient and Fault-Tolerant Model for Stream Processing on Large Clusters (Matei Zaharia)
Shark: SQL and Rich Analytics at Scale (Reynold Shi Xin, Matei Zaharia)