diff options
author | Josh Rosen <joshrosen@databricks.com> | 2016-09-13 12:54:03 +0200 |
---|---|---|
committer | Herman van Hovell <hvanhovell@databricks.com> | 2016-09-13 12:54:03 +0200 |
commit | 3f6a2bb3f7beac4ce928eb660ee36258b5b9e8c8 (patch) | |
tree | e397fdba23c4cc536f452ba5fdf88934fbe74b07 /sbin/stop-all.sh | |
parent | 46f5c201e70053635bdeab4984ba1b649478bd12 (diff) | |
download | spark-3f6a2bb3f7beac4ce928eb660ee36258b5b9e8c8.tar.gz spark-3f6a2bb3f7beac4ce928eb660ee36258b5b9e8c8.tar.bz2 spark-3f6a2bb3f7beac4ce928eb660ee36258b5b9e8c8.zip |
[SPARK-17515] CollectLimit.execute() should perform per-partition limits
## What changes were proposed in this pull request?
CollectLimit.execute() incorrectly omits per-partition limits, leading to performance regressions in case this case is hit (which should not happen in normal operation, but can occur in some cases (see #15068 for one example).
## How was this patch tested?
Regression test in SQLQuerySuite that asserts the number of records scanned from the input RDD.
Author: Josh Rosen <joshrosen@databricks.com>
Closes #15070 from JoshRosen/SPARK-17515.
Diffstat (limited to 'sbin/stop-all.sh')
0 files changed, 0 insertions, 0 deletions