author     Mike Dusenberry <dusenberrymw@gmail.com>  2015-05-19 08:59:45 +0100
committer  Sean Owen <sowen@cloudera.com>  2015-05-19 09:00:19 +0100
commit     0748263a2e36e9aef172808e3b6208a1f4d4fdb8 (patch)
tree       87fa1276333e1924f784d8c37e02987e92a66d4c /docs
parent     a1d896b85bd3fb88284f8b6758d7e5f0a1bb9eb3 (diff)
download   spark-0748263a2e36e9aef172808e3b6208a1f4d4fdb8.tar.gz
           spark-0748263a2e36e9aef172808e3b6208a1f4d4fdb8.tar.bz2
           spark-0748263a2e36e9aef172808e3b6208a1f4d4fdb8.zip
Fixing a few basic typos in the Programming Guide.
Just a few minor fixes in the guide, so a new JIRA issue was not created per the guidelines.

Author: Mike Dusenberry <dusenberrymw@gmail.com>

Closes #6240 from dusenberrymw/Fix_Programming_Guide_Typos and squashes the following commits:

ffa76eb [Mike Dusenberry] Fixing a few basic typos in the Programming Guide.

(cherry picked from commit 61f164d3fdd1c8dcdba8c9d66df05ff4069aa6e6)
Signed-off-by: Sean Owen <sowen@cloudera.com>
Diffstat (limited to 'docs')
-rw-r--r--  docs/programming-guide.md | 6
1 file changed, 3 insertions, 3 deletions
diff --git a/docs/programming-guide.md b/docs/programming-guide.md
index 27816515c5..0c273769bb 100644
--- a/docs/programming-guide.md
+++ b/docs/programming-guide.md
@@ -1071,7 +1071,7 @@ for details.
</tr>
<tr>
<td> <b>saveAsSequenceFile</b>(<i>path</i>) <br /> (Java and Scala) </td>
- <td> Write the elements of the dataset as a Hadoop SequenceFile in a given path in the local filesystem, HDFS or any other Hadoop-supported file system. This is available on RDDs of key-value pairs that either implement Hadoop's Writable interface. In Scala, it is also
+ <td> Write the elements of the dataset as a Hadoop SequenceFile in a given path in the local filesystem, HDFS or any other Hadoop-supported file system. This is available on RDDs of key-value pairs that implement Hadoop's Writable interface. In Scala, it is also
available on types that are implicitly convertible to Writable (Spark includes conversions for basic types like Int, Double, String, etc). </td>
</tr>
<tr>
@@ -1122,7 +1122,7 @@ ordered data following shuffle then it's possible to use:
* `sortBy` to make a globally ordered RDD
Operations which can cause a shuffle include **repartition** operations like
-[`repartition`](#RepartitionLink), and [`coalesce`](#CoalesceLink), **'ByKey** operations
+[`repartition`](#RepartitionLink) and [`coalesce`](#CoalesceLink), **'ByKey** operations
(except for counting) like [`groupByKey`](#GroupByLink) and [`reduceByKey`](#ReduceByLink), and
**join** operations like [`cogroup`](#CogroupLink) and [`join`](#JoinLink).
@@ -1138,7 +1138,7 @@ read the relevant sorted blocks.
Certain shuffle operations can consume significant amounts of heap memory since they employ
in-memory data structures to organize records before or after transferring them. Specifically,
-`reduceByKey` and `aggregateByKey` create these structures on the map side and `'ByKey` operations
+`reduceByKey` and `aggregateByKey` create these structures on the map side, and `'ByKey` operations
generate these on the reduce side. When data does not fit in memory Spark will spill these tables
to disk, incurring the additional overhead of disk I/O and increased garbage collection.
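The last hunk's prose explains why `reduceByKey` and `aggregateByKey` consume heap on the map side: each map task folds its records into an in-memory table before the shuffle, so only one partially aggregated value per key is written out. A minimal sketch of that map-side combining in plain Python (an illustration of the idea only, not Spark's actual implementation):

```python
def map_side_combine(records, merge):
    """Fold (key, value) records into an in-memory table, roughly as a
    map task does for reduceByKey: one partially aggregated value per
    distinct key survives to be shuffled, instead of every raw record."""
    table = {}
    for key, value in records:
        if key in table:
            # Merge into the existing partial result for this key.
            table[key] = merge(table[key], value)
        else:
            table[key] = value
    return table

# One (key, partial_sum) pair per distinct key would be emitted to the shuffle.
partials = map_side_combine([("a", 1), ("b", 2), ("a", 3)], lambda x, y: x + y)
```

It is exactly this `table` that can outgrow memory; as the paragraph above notes, Spark then spills such structures to disk at the cost of extra I/O and garbage collection.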