author: yangping.wu <wyphao.2007@163.com>, 2015-11-17 14:11:34 +0000
committer: Sean Owen <sowen@cloudera.com>, 2015-11-17 14:11:34 +0000
commit: 7276fa9aa9d2eccb6aebd5c690ac334699142f1e
tree: 8c42c85ee2053c7901992506fbcf7b44fa914703 /docs/streaming-programming-guide.md
parent: fa13301ae440c4c9594280f236bcca11b62fdd29
[SPARK-11751] Doc describe error in the "Spark Streaming Programming Guide" page
In the **[Task Launching Overheads](http://spark.apache.org/docs/latest/streaming-programming-guide.html#task-launching-overheads)** section:

> Task Serialization: Using Kryo serialization for serializing tasks can reduce the task sizes, and therefore reduce the time taken to send them to the slaves.

As we know, **Task Serialization** is configured by the **spark.closure.serializer** parameter, but currently only the Java serializer is supported. If we set **spark.closure.serializer** to **org.apache.spark.serializer.KryoSerializer**, an exception is thrown.

Author: yangping.wu <wyphao.2007@163.com>

Closes #9734 from 397090770/397090770-patch-1.
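
For illustration only, the configuration difference described above might look like the following `spark-defaults.conf` sketch. It assumes a Spark 1.x release from the time of this commit. The `spark.serializer` line is not part of this change; it appears here as an assumption, to contrast data serialization (where Kryo can be enabled) with closure serialization (where, per the message above, it cannot).

```
# Data serialization: Kryo can be enabled here (assumption, not part of this commit).
spark.serializer           org.apache.spark.serializer.KryoSerializer

# Closure (task) serialization: only the Java serializer is supported,
# so leave this at its default value.
spark.closure.serializer   org.apache.spark.serializer.JavaSerializer

# Per the commit message, the following setting throws an exception:
# spark.closure.serializer   org.apache.spark.serializer.KryoSerializer
```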
Diffstat (limited to 'docs/streaming-programming-guide.md')
-rw-r--r-- docs/streaming-programming-guide.md | 3
1 files changed, 1 insertions, 2 deletions
diff --git a/docs/streaming-programming-guide.md b/docs/streaming-programming-guide.md
index e9a27f446a..96b36b7a73 100644
--- a/docs/streaming-programming-guide.md
+++ b/docs/streaming-programming-guide.md
@@ -2001,8 +2001,7 @@ If the number of tasks launched per second is high (say, 50 or more per second),
of sending out tasks to the slaves may be significant and will make it hard to achieve sub-second
latencies. The overhead can be reduced by the following changes:
-* **Task Serialization**: Using Kryo serialization for serializing tasks can reduce the task
- sizes, and therefore reduce the time taken to send them to the slaves.
+* **Task Serialization**: Using Kryo serialization for serializing tasks can reduce the task sizes, and therefore reduce the time taken to send them to the slaves. This is controlled by the ```spark.closure.serializer``` property. However, at this time, Kryo serialization cannot be enabled for closure serialization. This may be resolved in a future release.
* **Execution mode**: Running Spark in Standalone mode or coarse-grained Mesos mode leads to
better task launch times than the fine-grained Mesos mode. Please refer to the