author | Davies Liu <davies@databricks.com> | 2015-11-04 21:30:21 -0800 |
---|---|---|
committer | Davies Liu <davies.liu@gmail.com> | 2015-11-04 21:30:21 -0800 |
commit | 81498dd5c86ca51d2fb351c8ef52cbb28e6844f4 (patch) | |
tree | 609fd1a3df2aeb64592dee8930ccbcf6efc0ec2e /python | |
parent | d0b56339625727744e2c30fc2167bc6a457d37f7 (diff) | |
[SPARK-11425] [SPARK-11486] Improve hybrid aggregation
After aggregation, the dataset can be smaller than the inputs, so it is better to run hash-based aggregation over all inputs and then use sort-based aggregation to merge the partial results.
Author: Davies Liu <davies@databricks.com>
Closes #9383 from davies/fix_switch.
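The strategy in the commit message can be illustrated with a minimal Python sketch. This is not the code changed by this commit. The function name `hybrid_aggregate`, the `max_keys_in_memory` parameter, and the choice of summation as the aggregate are all hypothetical, and a list in memory stands in for a spilled run. The point it shows: every input row goes through a hash map, the map is flushed as a key-sorted run whenever it fills up, and a sort-based merge combines the runs at the end.

```python
import heapq
from itertools import groupby
from operator import itemgetter


def hybrid_aggregate(rows, max_keys_in_memory=2):
    """Sum values per key. Hypothetical sketch, not Spark's implementation."""
    sorted_runs = []  # each run is a list of (key, partial_sum) sorted by key
    hash_map = {}     # in-memory hash aggregation buffer

    for key, value in rows:
        if key not in hash_map and len(hash_map) >= max_keys_in_memory:
            # Buffer is full: flush the partial aggregates as a sorted run,
            # then keep hash-aggregating the remaining input in a fresh map.
            sorted_runs.append(sorted(hash_map.items()))
            hash_map = {}
        hash_map[key] = hash_map.get(key, 0) + value

    if not sorted_runs:
        # Everything fit in memory, so hash aggregation alone is enough.
        return dict(hash_map)

    sorted_runs.append(sorted(hash_map.items()))

    # Sort-based merge: stream all runs in key order and combine equal keys.
    merged = heapq.merge(*sorted_runs, key=itemgetter(0))
    return {
        key: sum(partial for _, partial in group)
        for key, group in groupby(merged, key=itemgetter(0))
    }
```

Because each run already holds partially aggregated rows, the final merge processes at most as many rows as the input, and often far fewer when keys repeat.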
Diffstat (limited to 'python')
0 files changed, 0 insertions, 0 deletions