[SPARK-12727][SQL] support SQL generation for aggregate with multi-distinct - spark

author	Wenchen Fan <wenchen@databricks.com>	2016-03-08 11:45:08 -0800
committer	Reynold Xin <rxin@databricks.com>	2016-03-08 11:45:08 -0800
commit	46881b4ea229aecbb481626d8b9fbca24c0df075 (patch)
tree	b87768d6d17123041029896afa0ab7b1d6ef06b7 /mllib
parent	ad3c9a9730535faa6d270ee83412f79ec3db8333 (diff)
download	spark-46881b4ea229aecbb481626d8b9fbca24c0df075.tar.gz spark-46881b4ea229aecbb481626d8b9fbca24c0df075.tar.bz2 spark-46881b4ea229aecbb481626d8b9fbca24c0df075.zip

[SPARK-12727][SQL] support SQL generation for aggregate with multi-distinct

## What changes were proposed in this pull request?

This PR adds SQL generation support for aggregates with multi-distinct by simply moving the `DistinctAggregationRewriter` rule to the optimizer.

More discussion is needed, as this breaks an important contract: an analyzed plan should be able to run without optimization. However, the `ComputeCurrentTime` rule has already broken it to some extent, and I think we should perhaps add a new phase for this kind of rule, because strictly speaking such rules don't belong to analysis and are coupled with the physical plan implementation.

## How was this patch tested?

Existing tests.

Author: Wenchen Fan <wenchen@databricks.com>

Closes #11579 from cloud-fan/distinct.

Diffstat (limited to 'mllib')

0 files changed, 0 insertions, 0 deletions

