aboutsummaryrefslogtreecommitdiff
path: root/docs/configuration.md
diff options
context:
space:
mode:
authorDavies Liu <davies@databricks.com>2015-12-21 14:21:43 -0800
committerDavies Liu <davies.liu@gmail.com>2015-12-21 14:21:43 -0800
commit29cecd4a42f6969613e5b2a40f2724f99e7eec01 (patch)
tree261b40272c991649ce584b7cec2056dcd2b6cf1c /docs/configuration.md
parentd655d37ddf59d7fb6db529324ac8044d53b2622a (diff)
downloadspark-29cecd4a42f6969613e5b2a40f2724f99e7eec01.tar.gz
spark-29cecd4a42f6969613e5b2a40f2724f99e7eec01.tar.bz2
spark-29cecd4a42f6969613e5b2a40f2724f99e7eec01.zip
[SPARK-12388] change default compression to lz4
According the benchmark [1], LZ4-java could be 80% (or 30%) faster than Snappy. After changing the compressor to LZ4, I saw 20% improvement on end-to-end time for a TPCDS query (Q4). [1] https://github.com/ning/jvm-compressor-benchmark/wiki cc rxin Author: Davies Liu <davies@databricks.com> Closes #10342 from davies/lz4.
Diffstat (limited to 'docs/configuration.md')
-rw-r--r--docs/configuration.md2
1 files changed, 1 insertions, 1 deletions
diff --git a/docs/configuration.md b/docs/configuration.md
index 85e7d1202d..a9ef37a9b1 100644
--- a/docs/configuration.md
+++ b/docs/configuration.md
@@ -595,7 +595,7 @@ Apart from these, the following properties are also available, and may be useful
</tr>
<tr>
<td><code>spark.io.compression.codec</code></td>
- <td>snappy</td>
+ <td>lz4</td>
<td>
The codec used to compress internal data such as RDD partitions, broadcast variables and
shuffle outputs. By default, Spark provides three codecs: <code>lz4</code>, <code>lzf</code>,