[SPARK-12487][STREAMING][DOCUMENT] Add docs for Kafka message handler

Author: Shixiong Zhu <shixiong@databricks.com> Closes #10439 from zsxwing/kafka-message-handler-doc.
author: Shixiong Zhu <shixiong@databricks.com> 2015-12-22 15:33:30 -0800
committer: Tathagata Das <tathagata.das1565@gmail.com> 2015-12-22 15:33:30 -0800
commit: 93db50d1c2ff97e6eb9200a995e4601f752968ae (patch)
tree: 2256e654d4be6f23daf95121d088215c79ffda9a /docs/streaming-kafka-integration.md
parent: b374a25831af031f461716c52b615665aa5392c2 (diff)
download: spark-93db50d1c2ff97e6eb9200a995e4601f752968ae.tar.gz
spark-93db50d1c2ff97e6eb9200a995e4601f752968ae.tar.bz2
spark-93db50d1c2ff97e6eb9200a995e4601f752968ae.zip
1 files changed, 3 insertions, 0 deletions
diff --git a/docs/streaming-kafka-integration.md b/docs/streaming-kafka-integration.md
index 5be73c4256..9454714eeb 100644
--- a/docs/streaming-kafka-integration.md
+++ b/docs/streaming-kafka-integration.md
@@ -104,6 +104,7 @@ Next, we discuss how to use this approach in your streaming application.
 			[key class], [value class], [key decoder class], [value decoder class] ](
 			streamingContext, [map of Kafka parameters], [set of topics to consume])
 
+	You can also pass a `messageHandler` to `createDirectStream` to access `MessageAndMetadata` that contains metadata about the current message and transform it to any desired type.
 	See the [API docs](api/scala/index.html#org.apache.spark.streaming.kafka.KafkaUtils$)
 	and the [example]({{site.SPARK_GITHUB_URL}}/blob/master/examples/src/main/scala/org/apache/spark/examples/streaming/DirectKafkaWordCount.scala).
 	</div>
@@ -115,6 +116,7 @@ Next, we discuss how to use this approach in your streaming application.
 				[key class], [value class], [key decoder class], [value decoder class],
 				[map of Kafka parameters], [set of topics to consume]);
 
+	You can also pass a `messageHandler` to `createDirectStream` to access `MessageAndMetadata` that contains metadata about the current message and transform it to any desired type.
 	See the [API docs](api/java/index.html?org/apache/spark/streaming/kafka/KafkaUtils.html)
 	and the [example]({{site.SPARK_GITHUB_URL}}/blob/master/examples/src/main/java/org/apache/spark/examples/streaming/JavaDirectKafkaWordCount.java).
 
@@ -123,6 +125,7 @@ Next, we discuss how to use this approach in your streaming application.
 		from pyspark.streaming.kafka import KafkaUtils
 		directKafkaStream = KafkaUtils.createDirectStream(ssc, [topic], {"metadata.broker.list": brokers})
 
+	You can also pass a `messageHandler` to `createDirectStream` to access `KafkaMessageAndMetadata` that contains metadata about the current message and transform it to any desired type.
 	By default, the Python API will decode Kafka data as UTF8 encoded strings. You can specify your custom decoding function to decode the byte arrays in Kafka records to any arbitrary data type. See the [API docs](api/python/pyspark.streaming.html#pyspark.streaming.kafka.KafkaUtils)
 	and the [example]({{site.SPARK_GITHUB_URL}}/blob/master/examples/src/main/python/streaming/direct_kafka_wordcount.py).
 	</div>
author	Shixiong Zhu <shixiong@databricks.com>	2015-12-22 15:33:30 -0800
committer	Tathagata Das <tathagata.das1565@gmail.com>	2015-12-22 15:33:30 -0800
commit	93db50d1c2ff97e6eb9200a995e4601f752968ae (patch)
tree	2256e654d4be6f23daf95121d088215c79ffda9a /docs/streaming-kafka-integration.md
parent	b374a25831af031f461716c52b615665aa5392c2 (diff)
download	spark-93db50d1c2ff97e6eb9200a995e4601f752968ae.tar.gz spark-93db50d1c2ff97e6eb9200a995e4601f752968ae.tar.bz2 spark-93db50d1c2ff97e6eb9200a995e4601f752968ae.zip