Merge remote-tracking branch 'upstream/master' into sparsesvd

author: Reza Zadeh <rizlar@gmail.com> 2014-01-13 23:52:34 -0800
committer: Reza Zadeh <rizlar@gmail.com> 2014-01-13 23:52:34 -0800
commit: 845e568fada0550e632e7381748c5a9ebbe53e16 (patch)
tree: 3a4fa34894df649b5ef429cd794b73cf4b3e99b1 /docs/configuration.md
parent: f324d5355514b1c7ae85019b476046bb64b5593e (diff)
parent: fdaabdc67387524ffb84354f87985f48bd31cf60 (diff)
download: spark-845e568fada0550e632e7381748c5a9ebbe53e16.tar.gz
spark-845e568fada0550e632e7381748c5a9ebbe53e16.tar.bz2
spark-845e568fada0550e632e7381748c5a9ebbe53e16.zip
1 files changed, 10 insertions, 3 deletions
diff --git a/docs/configuration.md b/docs/configuration.md
index ad75e06fc7..be06bd19be 100644
--- a/docs/configuration.md
+++ b/docs/configuration.md
@@ -116,7 +116,7 @@ Apart from these, the following properties are also available, and may be useful
   <td>0.3</td>
   <td>
     Fraction of Java heap to use for aggregation and cogroups during shuffles, if
-    <code>spark.shuffle.externalSorting</code> is enabled. At any given time, the collective size of
+    <code>spark.shuffle.spill</code> is true. At any given time, the collective size of
     all in-memory maps used for shuffles is bounded by this limit, beyond which the contents will
     begin to spill to disk. If spills are often, consider increasing this value at the expense of
     <code>spark.storage.memoryFraction</code>.
@@ -155,6 +155,13 @@ Apart from these, the following properties are also available, and may be useful
   </td>
 </tr>
 <tr>
+  <td>spark.shuffle.spill.compress</td>
+  <td>false</td>
+  <td>
+    Whether to compress data spilled during shuffles.
+  </td>
+</tr>
+<tr>
   <td>spark.broadcast.compress</td>
   <td>true</td>
   <td>
@@ -382,13 +389,13 @@ Apart from these, the following properties are also available, and may be useful
 
 <tr>
   <td>spark.shuffle.consolidateFiles</td>
-  <td>true</td>
+  <td>false</td>
   <td>
     If set to "true", consolidates intermediate files created during a shuffle. Creating fewer files can improve filesystem performance for shuffles with large numbers of reduce tasks. It is recommended to set this to "true" when using ext4 or xfs filesystems. On ext3, this option might degrade performance on machines with many (>8) cores due to filesystem limitations.
   </td>
 </tr>
 <tr>
-  <td>spark.shuffle.externalSorting</td>
+  <td>spark.shuffle.spill</td>
   <td>true</td>
   <td>
     If set to "true", limits the amount of memory used during reduces by spilling data out to disk. This spilling
author	Reza Zadeh <rizlar@gmail.com>	2014-01-13 23:52:34 -0800
committer	Reza Zadeh <rizlar@gmail.com>	2014-01-13 23:52:34 -0800
commit	845e568fada0550e632e7381748c5a9ebbe53e16 (patch)
tree	3a4fa34894df649b5ef429cd794b73cf4b3e99b1 /docs/configuration.md
parent	f324d5355514b1c7ae85019b476046bb64b5593e (diff)
parent	fdaabdc67387524ffb84354f87985f48bd31cf60 (diff)
download	spark-845e568fada0550e632e7381748c5a9ebbe53e16.tar.gz spark-845e568fada0550e632e7381748c5a9ebbe53e16.tar.bz2 spark-845e568fada0550e632e7381748c5a9ebbe53e16.zip