[SPARK-8992][SQL] Add pivot to dataframe api - spark

author	Andrew Ray <ray.andrew@gmail.com>	2015-11-11 16:23:24 -0800
committer	Yin Huai <yhuai@databricks.com>	2015-11-11 16:23:24 -0800
commit	b8ff6888e76b437287d7d6bf2d4b9c759710a195 (patch)
tree	3f6a821b341a99dd3534603c88dfa6845d8982a8 /bagel
parent	1a21be15f655b9696ddac80aac629445a465f621 (diff)

[SPARK-8992][SQL] Add pivot to dataframe api

This adds a pivot method to the dataframe api.

Following the lead of cube and rollup this adds a Pivot operator that is translated into an Aggregate by the analyzer.

Currently the syntax is like:
~~courseSales.pivot(Seq($"year"), $"course", Seq("dotNET", "Java"), sum($"earnings"))~~

~~Would we be interested in the following syntax also/alternatively? and~~

    courseSales.groupBy($"year").pivot($"course", "dotNET", "Java").agg(sum($"earnings"))
    //or
    courseSales.groupBy($"year").pivot($"course").agg(sum($"earnings"))

Later we can add it to `SQLParser`, but as Hive doesn't support it we can't add it there, right?

~~Also what would be the suggested Java friendly method signature for this?~~

Author: Andrew Ray <ray.andrew@gmail.com>

Closes #7841 from aray/sql-pivot.
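
To make the "pivot translated into an Aggregate" idea concrete, here is a minimal sketch in plain Scala collections of what grouping by year and pivoting on course with a sum could compute. It is not Spark's implementation and does not use the Spark API. The `Sale` case class, the `pivotSum` helper, and the sample figures are all hypothetical names and data chosen for illustration.

```scala
// Hypothetical row type, for illustration only (not part of Spark).
final case class Sale(year: Int, course: String, earnings: Double)

object PivotSketch {
  // Group rows by year, then produce one conditional sum per requested pivot value.
  // This mirrors the idea of rewriting a pivot as an aggregate with one
  // output column per pivot value. In this sketch an unmatched value sums to 0.0.
  def pivotSum(rows: Seq[Sale], pivotValues: Seq[String]): Map[Int, Seq[Double]] =
    rows.groupBy(_.year).map { case (year, group) =>
      year -> pivotValues.map(v => group.filter(_.course == v).map(_.earnings).sum)
    }

  def main(args: Array[String]): Unit = {
    // Made-up sample data.
    val courseSales = Seq(
      Sale(2012, "dotNET", 10000.0),
      Sale(2012, "Java", 20000.0),
      Sale(2012, "dotNET", 5000.0),
      Sale(2013, "dotNET", 48000.0),
      Sale(2013, "Java", 30000.0)
    )
    val result = pivotSum(courseSales, Seq("dotNET", "Java"))
    // Print one line per year, columns in the order of the pivot values.
    result.toSeq.sortBy(_._1).foreach { case (year, sums) =>
      println(s"$year: ${sums.mkString(", ")}")
    }
  }
}
```

Passing the pivot values explicitly, as in the first `pivot($"course", "dotNET", "Java")` form above, fixes the output columns up front. The second form, which omits them, would first have to determine the distinct values of the pivot column.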

Diffstat (limited to 'bagel')

0 files changed, 0 insertions, 0 deletions

