author | Cheng Lian <lian@databricks.com> | 2016-04-01 22:00:24 -0700 |
---|---|---|
committer | Yin Huai <yhuai@databricks.com> | 2016-04-01 22:00:24 -0700 |
commit | 27e71a2cd930ae28c82c9c3ee6476a12ea165fdf | |
tree | 8170697ae9631a2f9b307a785ef221bd3fd7b60f /R/README.md | |
parent | 4fc35e6f5c590feb47cbcb5b1136f2e985677b3f | |
[SPARK-14244][SQL] Don't use SizeBasedWindowFunction.n created on executor side when evaluating window functions
## What changes were proposed in this pull request?
`SizeBasedWindowFunction.n` is a global singleton attribute created for evaluating size-based aggregate window functions such as `CUME_DIST`. However, because the attribute is created separately on the driver side and on the executor side, it gets a different expression ID on each side. This PR adds a `withPartitionSize` method to `SizeBasedWindowFunction` so that `SizeBasedWindowFunction.n` can easily be rewritten on the executor side.
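The mismatch can be illustrated with a small toy model. The sketch below is not Spark code: `Process`, `Attribute`, the Python `CumeDist` class, `with_partition_size`, and the `bindings` dictionary are all hypothetical stand-ins, and it assumes that each process allocates expression IDs from its own counter and that values are looked up by expression ID. It only shows why a reference carrying the driver's ID fails against values keyed by the executor's ID, and how rewriting the reference resolves that.

```python
import itertools
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Attribute:
    """Toy attribute reference, identified by its expression ID (hypothetical)."""
    name: str
    expr_id: int


class Process:
    """Toy stand-in for one process (driver or executor) with its own ID counter."""

    def __init__(self, first_id):
        self._ids = itertools.count(first_id)

    def new_attribute(self, name):
        return Attribute(name, next(self._ids))


@dataclass(frozen=True)
class CumeDist:
    """Toy size-based window function: row_number / partition size."""
    n: Attribute

    def with_partition_size(self, n):
        # Return a copy that references the given partition-size attribute.
        return replace(self, n=n)

    def evaluate(self, row_number, bindings):
        # Values are looked up by expression ID, not by attribute name.
        return row_number / bindings[self.n.expr_id]


driver = Process(first_id=0)
executor = Process(first_id=1000)

# The "singleton" attribute is created once per process, so the IDs differ.
driver_n = driver.new_attribute("partition_size")
executor_n = executor.new_attribute("partition_size")

# The function is planned on the driver and shipped with the driver's attribute.
shipped = CumeDist(n=driver_n)

# The executor binds the partition size (4 rows) under its own attribute's ID.
bindings = {executor_n.expr_id: 4}

try:
    shipped.evaluate(2, bindings)
    lookup_failed = False
except KeyError:
    lookup_failed = True  # the driver's ID is unknown on the executor

# Rewriting the reference on the executor side makes the lookup succeed.
rewritten = shipped.with_partition_size(executor_n)
result = rewritten.evaluate(2, bindings)
```

In this model the two attributes share a name but not an ID, so only the rewritten function can find the partition size. The direction of the rewrite here (replacing the shipped reference with the executor-side attribute) is an assumption of the sketch, not a description of the actual patch.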
## How was this patch tested?
A test case is added in `HiveSparkSubmitSuite`, which supports launching multi-process clusters.
Author: Cheng Lian <lian@databricks.com>
Closes #12040 from liancheng/spark-14244-fix-sized-window-function.