aboutsummaryrefslogtreecommitdiff
path: root/sql/core/src/main/scala
diff options
context:
space:
mode:
authorGaurav <gaurav@techtinium.com>2017-03-06 10:41:49 -0800
committerBurak Yavuz <brkyvz@gmail.com>2017-03-06 10:41:49 -0800
commit46a64d1e0ae12c31e848f377a84fb28e3efb3699 (patch)
treef070e6a3646450030a33e66282e85aa1efce6bdb /sql/core/src/main/scala
parent339b53a1311e08521d84a83c94201fcf3c766fb2 (diff)
downloadspark-46a64d1e0ae12c31e848f377a84fb28e3efb3699.tar.gz
spark-46a64d1e0ae12c31e848f377a84fb28e3efb3699.tar.bz2
spark-46a64d1e0ae12c31e848f377a84fb28e3efb3699.zip
[SPARK-19304][STREAMING][KINESIS] fix kinesis slow checkpoint recovery
## What changes were proposed in this pull request? added a limit to getRecords api call call in KinesisBackedBlockRdd. This helps reduce the amount of data returned by kinesis api call making the recovery considerably faster As we are storing the `fromSeqNum` & `toSeqNum` in checkpoint metadata, we can also store the number of records. Which can later be used for api call. ## How was this patch tested? The patch was manually tested Apologies for any silly mistakes, opening first pull request Author: Gaurav <gaurav@techtinium.com> Closes #16842 from Gauravshah/kinesis_checkpoint_recovery_fix_2_1_0.
Diffstat (limited to 'sql/core/src/main/scala')
0 files changed, 0 insertions, 0 deletions