[SPARK-18711][SQL] should disable subexpression elimination for LambdaVariable - spark

diff options

author	Wenchen Fan <wenchen@databricks.com>	2016-12-05 11:37:13 -0800
committer	Herman van Hovell <hvanhovell@databricks.com>	2016-12-05 11:37:13 -0800
commit	01a7d33d0851d82fd1bb477a58d9925fe8d727d8 (patch)
tree	c8145bdf2efb7e300400be96963391655ac1c662 /docs/running-on-yarn.md
parent	246012859f0ed5248809a2e00e8355fbdaa8beb5 (diff)
download	spark-01a7d33d0851d82fd1bb477a58d9925fe8d727d8.tar.gz spark-01a7d33d0851d82fd1bb477a58d9925fe8d727d8.tar.bz2 spark-01a7d33d0851d82fd1bb477a58d9925fe8d727d8.zip

[SPARK-18711][SQL] should disable subexpression elimination for LambdaVariable

## What changes were proposed in this pull request? This is kind of a long-standing bug, it's hidden until https://github.com/apache/spark/pull/15780 , which may add `AssertNotNull` on top of `LambdaVariable` and thus enables subexpression elimination. However, subexpression elimination will evaluate the common expressions at the beginning, which is invalid for `LambdaVariable`. `LambdaVariable` usually represents loop variable, which can't be evaluated ahead of the loop. This PR skips expressions containing `LambdaVariable` when doing subexpression elimination. ## How was this patch tested? updated test in `DatasetAggregatorSuite` Author: Wenchen Fan <wenchen@databricks.com> Closes #16143 from cloud-fan/aggregator.

Diffstat (limited to 'docs/running-on-yarn.md')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: