Spark Documentation

Setup instructions, programming guides, and other documentation are available for each version of Spark below:

Read these documents to get started with Spark. This page also lists several external resources for learning Spark.

Video Tutorials

Hands-On Exercises

  • Hands-on exercises are available online. These exercises let you launch a small EC2 cluster, load a dataset, and query it with Spark, Shark, Spark Streaming, and MLlib.

AMP Camp Slides and Videos

  • The AMPLab regularly hosts two-day training camps on Spark and related "big data" components. Slides and videos from each camp are posted online:
    AMP Camp Three Big Data Bootcamp Berkeley (August 2013)
    AMP Camp Two Big Data Bootcamp Strata (February 2013)
    AMP Camp One Big Data Bootcamp Berkeley (August 2012)

Presentations

External Tutorials, Development Blogs, and Talks

Research Papers