author: Davies Liu <davies@databricks.com> 2015-11-04 21:30:21 -0800
committer: Davies Liu <davies.liu@gmail.com> 2015-11-04 21:30:21 -0800
commit: 81498dd5c86ca51d2fb351c8ef52cbb28e6844f4
tree: 609fd1a3df2aeb64592dee8930ccbcf6efc0ec2e /python
parent: d0b56339625727744e2c30fc2167bc6a457d37f7
[SPARK-11425] [SPARK-11486] Improve hybrid aggregation
After aggregation, the dataset may be smaller than the input, so it is better to run hash-based aggregation over all inputs and then use sort-based aggregation to merge the partial results.

Author: Davies Liu <davies@databricks.com>

Closes #9383 from davies/fix_switch.
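
The idea in the commit message can be illustrated with a minimal Python sketch. This is not the code changed by this commit; the function name `hybrid_aggregate`, the `max_keys` limit, and the in-memory lists standing in for spill files are all assumptions made for illustration. Every input record goes through a bounded hash map; when the map fills up it is spilled as a sorted run, and hashing continues with a fresh map. Only the (usually much smaller) partial results are merged with a sort-based pass.

```python
import heapq
from functools import reduce
from itertools import groupby
from operator import itemgetter


def hybrid_aggregate(records, combine, max_keys):
    """Aggregate (key, value) pairs using a bounded hash map plus a sorted merge.

    records  -- iterable of (key, value) pairs; keys must be hashable and orderable
    combine  -- associative function that merges two values of the same key
    max_keys -- hypothetical memory limit: distinct keys held in the map at once
    """
    runs = []    # sorted partial results; a stand-in for spill files on disk
    table = {}   # hash-based aggregation buffer

    for key, value in records:
        if key in table:
            table[key] = combine(table[key], value)
            continue
        if len(table) >= max_keys:
            # Buffer is full: spill it as a sorted run, then keep hashing
            # the remaining input instead of switching strategies.
            runs.append(sorted(table.items(), key=itemgetter(0)))
            table = {}
        table[key] = value

    if not runs:
        # Everything fit in memory, so no merge step is needed.
        return sorted(table.items(), key=itemgetter(0))

    if table:
        runs.append(sorted(table.items(), key=itemgetter(0)))

    # Sort-based aggregation over the partial results: equal keys from
    # different runs become adjacent after the merge and are combined.
    merged = heapq.merge(*runs, key=itemgetter(0))
    return [
        (key, reduce(combine, (value for _, value in group)))
        for key, group in groupby(merged, key=itemgetter(0))
    ]
```

For example, `hybrid_aggregate(pairs, lambda a, b: a + b, max_keys=1000)` sums values per key while never holding more than 1000 distinct keys in the hash map at a time. The sketch assumes `combine` is associative, since partial results for one key may be merged in any grouping.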
Diffstat (limited to 'python')
0 files changed, 0 insertions, 0 deletions