diff options
author | Eric Liang <ekl@databricks.com> | 2016-07-30 22:48:09 -0700 |
---|---|---|
committer | Reynold Xin <rxin@databricks.com> | 2016-07-30 22:48:09 -0700 |
commit | 957a8ab3743521850fb1c0106c37c5d3997b9e56 (patch) | |
tree | abbcd0676d67326624d37e47d93abc9b9bfe7942 /repl/scala-2.10/src/main/scala/org/apache | |
parent | a6290e51e402e8434d6207d553db1f551e714fde (diff) | |
download | spark-957a8ab3743521850fb1c0106c37c5d3997b9e56.tar.gz spark-957a8ab3743521850fb1c0106c37c5d3997b9e56.tar.bz2 spark-957a8ab3743521850fb1c0106c37c5d3997b9e56.zip |
[SPARK-16818] Exchange reuse incorrectly reuses scans over different sets of partitions
## What changes were proposed in this pull request?
This fixes a bug wherethe file scan operator does not take into account partition pruning in its implementation of `sameResult()`. As a result, executions may be incorrect on self-joins over the same base file relation.
The patch here is minimal, but we should reconsider relying on `metadata` for implementing sameResult() in the future, as string representations may not be uniquely identifying.
cc rxin
## How was this patch tested?
Unit tests.
Author: Eric Liang <ekl@databricks.com>
Closes #14425 from ericl/spark-16818.
Diffstat (limited to 'repl/scala-2.10/src/main/scala/org/apache')
0 files changed, 0 insertions, 0 deletions