diff options
author | Davies Liu <davies@databricks.com> | 2016-01-18 17:29:54 -0800 |
---|---|---|
committer | Davies Liu <davies.liu@gmail.com> | 2016-01-18 17:29:54 -0800 |
commit | 323d51f1dadf733e413203d678cb3f76e4d68981 (patch) | |
tree | f15f31a8c2bdf7ae73c0a7ff5c4a2a0ec175fa93 /.gitignore | |
parent | 39ac56fc60734d0e095314fc38a7b36fbb4c80f7 (diff) | |
download | spark-323d51f1dadf733e413203d678cb3f76e4d68981.tar.gz spark-323d51f1dadf733e413203d678cb3f76e4d68981.tar.bz2 spark-323d51f1dadf733e413203d678cb3f76e4d68981.zip |
[SPARK-12700] [SQL] embed condition into SMJ and BroadcastHashJoin
Currently SortMergeJoin and BroadcastHashJoin do not support condition, the need a followed Filter for that, the result projection to generate UnsafeRow could be very expensive if they generate lots of rows and could be filtered mostly by condition.
This PR brings the support of condition for SortMergeJoin and BroadcastHashJoin, just like other outer joins do.
This could improve the performance of Q72 by 7x (from 120s to 16.5s).
Author: Davies Liu <davies@databricks.com>
Closes #10653 from davies/filter_join.
Diffstat (limited to '.gitignore')
0 files changed, 0 insertions, 0 deletions