author: Wenchen Fan <wenchen@databricks.com> 2016-03-08 11:45:08 -0800
committer: Reynold Xin <rxin@databricks.com> 2016-03-08 11:45:08 -0800
commit: 46881b4ea229aecbb481626d8b9fbca24c0df075
tree: b87768d6d17123041029896afa0ab7b1d6ef06b7 /mllib
parent: ad3c9a9730535faa6d270ee83412f79ec3db8333
[SPARK-12727][SQL] support SQL generation for aggregate with multi-distinct
## What changes were proposed in this pull request?

This PR adds SQL generation support for aggregates with multi-distinct by simply moving the `DistinctAggregationRewriter` rule to the optimizer.

More discussion is needed, as this breaks an important contract: an analyzed plan should be able to run without optimization. However, the `ComputeCurrentTime` rule has more or less broken it already, and I think maybe we should add a new phase for this kind of rule, because strictly speaking such rules don't belong to analysis and are coupled with the physical plan implementation.

## How was this patch tested?

Existing tests.

Author: Wenchen Fan <wenchen@databricks.com>

Closes #11579 from cloud-fan/distinct.
Diffstat (limited to 'mllib')
0 files changed, 0 insertions, 0 deletions