diff options
author | Matei Zaharia <matei@eecs.berkeley.edu> | 2013-03-13 06:49:16 -0700 |
---|---|---|
committer | Matei Zaharia <matei@eecs.berkeley.edu> | 2013-03-13 06:49:16 -0700 |
commit | 2d477fd36f218f127552a0f5b711df80ec6ba303 (patch) | |
tree | 3cce49199738e5c0192695f514a61efcfe774350 | |
parent | 00c4d238ddb368a9ef4c6251e232bd5808ba80f4 (diff) | |
parent | cf73fbd3054737d9f82fc0af9dc7f2667b37a4a0 (diff) | |
download | spark-2d477fd36f218f127552a0f5b711df80ec6ba303.tar.gz spark-2d477fd36f218f127552a0f5b711df80ec6ba303.tar.bz2 spark-2d477fd36f218f127552a0f5b711df80ec6ba303.zip |
Merge pull request #523 from andyk/master
Fix broken link in Quick Start
-rw-r--r-- | docs/quick-start.md | 4 |
1 files changed, 2 insertions, 2 deletions
diff --git a/docs/quick-start.md b/docs/quick-start.md index 36d024f13a..216f7c9cc5 100644 --- a/docs/quick-start.md +++ b/docs/quick-start.md @@ -189,7 +189,7 @@ public class SimpleJob { } {% endhighlight %} -This job simply counts the number of lines containing 'a' and the number containing 'b' in a system log file. Note that like in the Scala example, we initialize a SparkContext, though we use the special `JavaSparkContext` class to get a Java-friendly one. We also create RDDs (represented by `JavaRDD`) and run transformations on them. Finally, we pass functions to Spark by creating classes that extend `spark.api.java.function.Function`. The [Java programming guide]("java-programming-guide") describes these differences in more detail. +This job simply counts the number of lines containing 'a' and the number containing 'b' in a system log file. Note that like in the Scala example, we initialize a SparkContext, though we use the special `JavaSparkContext` class to get a Java-friendly one. We also create RDDs (represented by `JavaRDD`) and run transformations on them. Finally, we pass functions to Spark by creating classes that extend `spark.api.java.function.Function`. The [Java programming guide](java-programming-guide.html) describes these differences in more detail. To build the job, we also write a Maven `pom.xml` file that lists Spark as a dependency. Note that Spark artifacts are tagged with a Scala version. @@ -265,7 +265,7 @@ print "Lines with a: %i, lines with b: %i" % (numAs, numBs) This job simply counts the number of lines containing 'a' and the number containing 'b' in a system log file. Like in the Scala and Java examples, we use a SparkContext to create RDDs. We can pass Python functions to Spark, which are automatically serialized along with any variables that they reference. -For jobs that use custom classes or third-party libraries, we can add those code dependencies to SparkContext to ensure that they will be available on remote machines; this is described in more detail in the [Python programming guide](python-programming-guide). +For jobs that use custom classes or third-party libraries, we can add those code dependencies to SparkContext to ensure that they will be available on remote machines; this is described in more detail in the [Python programming guide](python-programming-guide.html). `SimpleJob` is simple enough that we do not need to specify any code dependencies. We can run this job using the `pyspark` script: |