aboutsummaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorMatei Zaharia <matei@eecs.berkeley.edu>2013-11-25 19:09:31 -0800
committerMatei Zaharia <matei@eecs.berkeley.edu>2013-11-25 19:09:31 -0800
commitc46067f096f03b6d07ea1ba65c185b851f8a8d8e (patch)
tree4c0df4c207142f6a82c305435d787a661fc75e0e
parent14bb465bb3d65f5b1034ada85cfcad7460034073 (diff)
parent08afef37a07c501b1ba14e3d6da445712852ca1e (diff)
downloadspark-c46067f096f03b6d07ea1ba65c185b851f8a8d8e.tar.gz
spark-c46067f096f03b6d07ea1ba65c185b851f8a8d8e.tar.bz2
spark-c46067f096f03b6d07ea1ba65c185b851f8a8d8e.zip
Merge pull request #206 from ash211/patch-2
Update tuning.md Clarify when serializer is used based on recent user@ mailing list discussion.
-rw-r--r--docs/tuning.md3
1 files changed, 2 insertions, 1 deletions
diff --git a/docs/tuning.md b/docs/tuning.md
index f33fda37eb..a4be188169 100644
--- a/docs/tuning.md
+++ b/docs/tuning.md
@@ -39,7 +39,8 @@ in your operations) and performance. It provides two serialization libraries:
for best performance.
You can switch to using Kryo by calling `System.setProperty("spark.serializer", "org.apache.spark.serializer.KryoSerializer")`
-*before* creating your SparkContext. The only reason it is not the default is because of the custom
+*before* creating your SparkContext. This setting configures the serializer used for not only shuffling data between worker
+nodes but also when serializing RDDs to disk. The only reason Kryo is not the default is because of the custom
registration requirement, but we recommend trying it in any network-intensive application.
Finally, to register your classes with Kryo, create a public class that extends