aboutsummaryrefslogtreecommitdiff
path: root/docs/spark-simple-tutorial.md
diff options
context:
space:
mode:
Diffstat (limited to 'docs/spark-simple-tutorial.md')
-rw-r--r--docs/spark-simple-tutorial.md41
1 files changed, 41 insertions, 0 deletions
diff --git a/docs/spark-simple-tutorial.md b/docs/spark-simple-tutorial.md
new file mode 100644
index 0000000000..9875de62bd
--- /dev/null
+++ b/docs/spark-simple-tutorial.md
@@ -0,0 +1,41 @@
+---
+layout: global
+title: Tutorial - Running a Simple Spark Application
+---
+
+1. Create directory for spark demo:
+
+ ~$ mkdir SparkTest
+
+2. Copy the sbt files in ~/spark/sbt directory:
+
+ ~/SparkTest$ cp -r ../spark/sbt .
+
+3. Edit the ~/SparkTest/sbt/sbt file to look like this:
+
+ #!/bin/bash
+ java -Xmx800M -XX:MaxPermSize=150m -jar $(dirname $0)/sbt-launch-*.jar "$@"
+
+4. To build a Spark application, you need Spark and its dependencies in a single Java archive (JAR) file. Create this JAR in Spark's main directory with sbt as:
+
+ ~/spark$ sbt/sbt assembly
+
+5. create a source file in ~/SparkTest/src/main/scala directory:
+
+ ~/SparkTest/src/main/scala$ vi Test1.scala
+
+6. Make the contain of the Test1.scala file like this:
+
+ import spark.SparkContext
+ import spark.SparkContext._
+ object Test1 {
+ def main(args: Array[String]) {
+ val sc = new SparkContext("local", "SparkTest")
+ println(sc.parallelize(1 to 10).reduce(_ + _))
+ System.exit(0)
+ }
+ }
+
+7. Run the Test1.scala file:
+
+ ~/SparkTest$ sbt/sbt run