author Wenchen Fan <wenchen@databricks.com> 2016-12-05 11:37:13 -0800
committer Herman van Hovell <hvanhovell@databricks.com> 2016-12-05 11:37:13 -0800
commit 01a7d33d0851d82fd1bb477a58d9925fe8d727d8
tree c8145bdf2efb7e300400be96963391655ac1c662 /docs/running-on-yarn.md
parent 246012859f0ed5248809a2e00e8355fbdaa8beb5
[SPARK-18711][SQL] should disable subexpression elimination for LambdaVariable
## What changes were proposed in this pull request?

This is a long-standing bug that stayed hidden until https://github.com/apache/spark/pull/15780, which may add `AssertNotNull` on top of `LambdaVariable` and thus enable subexpression elimination. However, subexpression elimination evaluates the common expressions up front, which is invalid for `LambdaVariable`: a `LambdaVariable` usually represents a loop variable, which can't be evaluated ahead of the loop.

This PR skips expressions containing `LambdaVariable` when doing subexpression elimination.

## How was this patch tested?

Updated the test in `DatasetAggregatorSuite`.

Author: Wenchen Fan <wenchen@databricks.com>

Closes #16143 from cloud-fan/aggregator.
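
To illustrate why hoisting is invalid here, the following is a minimal toy sketch in Python, not Spark's Catalyst code. The classes `Literal`, `LambdaVariable`, and `Add`, the helper `map_elements`, and the placeholder value `0` for the not-yet-bound loop variable are all assumptions made for the example. It models subexpression elimination as "evaluate every repeated subexpression once, before the loop", and shows that skipping expressions that contain a loop variable restores the correct result.

```python
from dataclasses import dataclass

# Toy expression nodes (hypothetical stand-ins, not Spark classes).
@dataclass(frozen=True)
class Literal:
    value: int

@dataclass(frozen=True)
class LambdaVariable:
    name: str  # loop variable, only bound inside the loop

@dataclass(frozen=True)
class Add:
    left: object
    right: object

def evaluate(expr, env, cache):
    """Evaluate expr, reusing any pre-computed common subexpression."""
    if expr in cache:
        return cache[expr]
    if isinstance(expr, Literal):
        return expr.value
    if isinstance(expr, LambdaVariable):
        return env[expr.name]
    return evaluate(expr.left, env, cache) + evaluate(expr.right, env, cache)

def contains_lambda_variable(expr):
    if isinstance(expr, LambdaVariable):
        return True
    if isinstance(expr, Add):
        return (contains_lambda_variable(expr.left)
                or contains_lambda_variable(expr.right))
    return False

def common_subexpressions(expr, skip_lambda):
    """Return the Add nodes that occur more than once in expr."""
    counts = {}
    def visit(e):
        if isinstance(e, Add):
            if not (skip_lambda and contains_lambda_variable(e)):
                counts[e] = counts.get(e, 0) + 1
            visit(e.left)
            visit(e.right)
    visit(expr)
    return [e for e, n in counts.items() if n > 1]

def map_elements(elements, body, skip_lambda):
    # Common subexpressions are evaluated once, BEFORE the loop runs.
    # The loop variable has no meaningful value yet; 0 is a placeholder.
    pre_env = {"x": 0}
    cache = {e: evaluate(e, pre_env, {})
             for e in common_subexpressions(body, skip_lambda)}
    return [evaluate(body, {"x": element}, cache) for element in elements]

x = LambdaVariable("x")
body = Add(Add(x, Literal(1)), Add(x, Literal(1)))  # (x + 1) + (x + 1)

print(map_elements([1, 2, 3], body, skip_lambda=False))  # [2, 2, 2] (stale)
print(map_elements([1, 2, 3], body, skip_lambda=True))   # [4, 6, 8]
```

With `skip_lambda=False`, the repeated `x + 1` is computed once from the placeholder and reused for every element, so the loop variable's real value is never read. With `skip_lambda=True`, nothing that depends on `x` is hoisted and each element is evaluated on its own.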
Diffstat (limited to 'docs/running-on-yarn.md')
0 files changed, 0 insertions, 0 deletions