author | Andrew Ray <ray.andrew@gmail.com> | 2015-11-11 16:23:24 -0800 |
---|---|---|
committer | Yin Huai <yhuai@databricks.com> | 2015-11-11 16:23:24 -0800 |
commit | b8ff6888e76b437287d7d6bf2d4b9c759710a195 (patch) | |
tree | 3f6a821b341a99dd3534603c88dfa6845d8982a8 /bagel | |
parent | 1a21be15f655b9696ddac80aac629445a465f621 (diff) | |
[SPARK-8992][SQL] Add pivot to dataframe api
This adds a pivot method to the DataFrame API.
Following the lead of cube and rollup, this adds a Pivot operator that the analyzer translates into an Aggregate.
Currently the syntax is like:
~~courseSales.pivot(Seq($"year"), $"course", Seq("dotNET", "Java"), sum($"earnings"))~~
~~Would we be interested in the following syntax also/alternatively? and~~
courseSales.groupBy($"year").pivot($"course", "dotNET", "Java").agg(sum($"earnings"))
//or
courseSales.groupBy($"year").pivot($"course").agg(sum($"earnings"))
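As a rough illustration of what the calls above compute, here is a plain Scala collections sketch. It is not Spark's implementation. It only mirrors the idea that a pivot becomes a grouped aggregate with one conditional sum per pivot value. The `Sale` case class, the sample rows, and the `pivotSum` and `distinctCourses` helpers are hypothetical names invented for this example. The column names are assumed from the snippet above.

```scala
// Hypothetical row type; fields are assumed from the courseSales example above.
final case class Sale(course: String, year: Int, earnings: Double)

object PivotSketch {
  // Made-up sample data, for illustration only.
  val courseSales: Seq[Sale] = Seq(
    Sale("dotNET", 2012, 10000.0),
    Sale("Java", 2012, 20000.0),
    Sale("dotNET", 2012, 5000.0),
    Sale("dotNET", 2013, 48000.0),
    Sale("Java", 2013, 30000.0)
  )

  // Distinct pivot values found in the data, used when the caller lists none
  // (loosely analogous to pivot($"course") without explicit values).
  def distinctCourses(rows: Seq[Sale]): Seq[String] =
    rows.map(_.course).distinct.sorted

  // Group by year, then compute one conditional sum per pivot value.
  // Each pivot value becomes its own output "column" keyed by name.
  def pivotSum(rows: Seq[Sale], pivotValues: Seq[String]): Map[Int, Map[String, Double]] =
    rows.groupBy(_.year).map { case (year, group) =>
      year -> pivotValues.map { value =>
        value -> group.filter(_.course == value).map(_.earnings).sum
      }.toMap
    }

  def main(args: Array[String]): Unit = {
    val explicit = pivotSum(courseSales, Seq("dotNET", "Java"))
    val inferred = pivotSum(courseSales, distinctCourses(courseSales))
    // Print one line per year with its per-course totals.
    explicit.toSeq.sortBy(_._1).foreach { case (year, cols) =>
      println(s"$year dotNET=${cols("dotNET")} Java=${cols("Java")}")
    }
    println(explicit == inferred)
  }
}
```

With this sample data, the 2012 group sums to 15000.0 for dotNET and 20000.0 for Java. Listing the pivot values explicitly avoids the extra pass over the data that inferring them requires.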
Later we can add it to `SQLParser`, but since Hive doesn't support it, we can't add it there, right?
~~Also what would be the suggested Java friendly method signature for this?~~
Author: Andrew Ray <ray.andrew@gmail.com>
Closes #7841 from aray/sql-pivot.
Diffstat (limited to 'bagel')
0 files changed, 0 insertions, 0 deletions