aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark/sql/streaming.py
diff options
context:
space:
mode:
authorShixiong Zhu <shixiong@databricks.com>2016-10-13 13:31:50 -0700
committerShixiong Zhu <shixiong@databricks.com>2016-10-13 13:31:50 -0700
commit08eac356095c7faa2b19d52f2fb0cbc47eb7d1d1 (patch)
tree866b9bfaedaa30b1be16ca69f67ca17db80c28ec /python/pyspark/sql/streaming.py
parent84f149e414475c2e60863898992001c21cfc13b2 (diff)
downloadspark-08eac356095c7faa2b19d52f2fb0cbc47eb7d1d1.tar.gz
spark-08eac356095c7faa2b19d52f2fb0cbc47eb7d1d1.tar.bz2
spark-08eac356095c7faa2b19d52f2fb0cbc47eb7d1d1.zip
[SPARK-17834][SQL] Fetch the earliest offsets manually in KafkaSource instead of counting on KafkaConsumer
## What changes were proposed in this pull request? Because `KafkaConsumer.poll(0)` may update the partition offsets, this PR just calls `seekToBeginning` to manually set the earliest offsets for the KafkaSource initial offsets. ## How was this patch tested? Existing tests. Author: Shixiong Zhu <shixiong@databricks.com> Closes #15397 from zsxwing/SPARK-17834.
Diffstat (limited to 'python/pyspark/sql/streaming.py')
0 files changed, 0 insertions, 0 deletions