[SPARK-9301][SQL] Add collect_set and collect_list aggregate functions - spark

diff options

author	Nick Buroojy <nick.buroojy@civitaslearning.com>	2015-11-09 14:30:37 -0800
committer	Michael Armbrust <michael@databricks.com>	2015-11-09 14:30:52 -0800
commit	f138cb873335654476d1cd1070900b552dd8b21a (patch)
tree	91bcc549fe561c4f100197f42bd8ce0ad03062be /sql/hive-thriftserver/src
parent	b7720fa45525cff6e812fa448d0841cb41f6c8a5 (diff)
download	spark-f138cb873335654476d1cd1070900b552dd8b21a.tar.gz spark-f138cb873335654476d1cd1070900b552dd8b21a.tar.bz2 spark-f138cb873335654476d1cd1070900b552dd8b21a.zip

[SPARK-9301][SQL] Add collect_set and collect_list aggregate functions

For now they are thin wrappers around the corresponding Hive UDAFs. One limitation with these in Hive 0.13.0 is they only support aggregating primitive types. I chose snake_case here instead of camelCase because it seems to be used in the majority of the multi-word fns. Do we also want to add these to `functions.py`? This approach was recommended here: https://github.com/apache/spark/pull/8592#issuecomment-154247089 marmbrus rxin Author: Nick Buroojy <nick.buroojy@civitaslearning.com> Closes #9526 from nburoojy/nick/udaf-alias. (cherry picked from commit a6ee4f989d020420dd08b97abb24802200ff23b2) Signed-off-by: Michael Armbrust <michael@databricks.com>

Diffstat (limited to 'sql/hive-thriftserver/src')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: